(Be sure to join our community to get our bi-weekly newsletter with exclusive content, demo videos, and other SQL Server goodies! Also check out our online training courses.)
This is a follow-on from two posts:
- Corruption: Last resorts that people try first… where I discussed the two worst things you can do (in my opinion) to a database – rebuilding the transaction log and running REPAIR_ALLOW_DATA_LOSS.
- Using EMERGENCY mode to access a RECOVERY PENDING or SUSPECT database where I introduced EMERGENCY mode and walked through an example script showing its use.
People get themselves into situations where they have no backups (or damaged backups) and the data or log files are damaged such that the only way to access the database is with EMERGENCY mode. In these situations, prior to SQL Server 2005, there was no documented or supported way to fix a database while in EMERGENCY mode – the only guidance could be found on the Internet or from calling Product Support and paying for help. The sequence of events was:
- Hack the system tables to get the database into EMERGENCY mode.
- Use the undocumented and unsupported DBCC REBUILD_LOG command to build a new transaction log.
- Run DBCC CHECKDB with the REPAIR_ALLOW_DATA_LOSS option to fix up corruptions in the data files – both those that may have caused the issue, and those caused by rebuilding the transaction log (e.g. because an active transaction altering the database structure was lost).
- Figure out what data was lost or is transactionally inconsistent (e.g. because a transaction altering multiple tables was lost) as far as your business logic is concerned
- Take the database out of EMERGENCY mode
- And then all the other stuff like root-cause analysis and getting a better backup strategy
How Does EMERGENCY-Mode Repair Work?
I decided to add a new feature to SQL Server 2005 called EMERGENCY-mode repair that will do steps 2 and 3 as an atomic operation. The reasons for this were:
- Much of the advice of how to do this on the Internet missed steps out (particularly missing step 3!)
- The DBCC REBUILD_LOG command was unsupported and undocumented and we didn’t like advising customers to use it
- Adding a documented, last-resort method of recovering from this situation would reduce calls to Product Support – saving time and money for customers and Microsoft.
So, when in EMERGENCY mode, you can use DBCC CHECKDB to bring the database back online again. The only repair option allowed in EMERGENCY mode is REPAIR_ALLOW_DATA_LOSS and it does a lot more than usual:
- Forces recovery to run on the transaction log (if it exists). You can think of this as ‘recovery with CONTINUE_AFTER_ERROR‘ – see this post for more details on the real CONTINUE_AFTER_ERROR options for BACKUP and RESTORE. The idea behind this is that the database is already inconsistent because either the transaction log is corrupt or something in the database is corrupt in such a way that recovery cannot complete. So, given that the database is inconsistent and we’re about to rebuild the transaction log, it makes sense to salvage as much transactional information as possible from the log before we throw it away and build a new one.
- Rebuild the transaction log – but only if the transaction log is corrupt.
- Run DBCC CHECKDB with the REPAIR_ALLOW_DATA_LOSS option.
- Set the database state to ONLINE.
It’s a one-way operation and can’t be rolled back. I always advise taking a copy of the database files before doing this in case something goes wrong or there are unrepairable errors. And if it does? Probably time to update your resume for not having a water-tight backup and disaster-recovery strategies in place. I have seen it fail several times in real customer situations. It’s not pretty when it does.
EMERGENCY-Mode Repair Example
Let’s walk-through an example of using it. I’m assuming there’s a database called EmergencyDemo that’s in the same state as at the end of the second post I link to at the beginning of this post: the database has no log file, is in EMERGENCY mode and the salaries table is corrupt.
First off I’ll try bringing the database online, just to see what happens:
ALTER DATABASE [EmergencyDemo] SET ONLINE; GO
File activation failure. The physical file name "C:\Program Files\Microsoft SQL Server\MSSQL.1\MSSQL\DATA\EmergencyDemo_log.LDF" may be incorrect. The log cannot be rebuilt because the database was not cleanly shut down. Msg 945, Level 14, State 2, Line 1 Database 'EmergencyDemo' cannot be opened due to inaccessible files or insufficient memory or disk space. See the SQL Server errorlog for details. Failed to restart the current database. The current database is switched to master. Msg 5069, Level 16, State 1, Line 1 ALTER DATABASE statement failed.
The first message makes sense – the database knows it needs to be recovered because it wasn’t cleanly shut down, but the log file simply isn’t there. The second message is from the new feature in 2005 that will automatically create a log file if one is missing on startup or attach – as long as the database was cleanly shut down. The 945 and 5069 errors are self-explanatory but notice that the database gets switched to master underneath us. This has bitten me several times in the past.
Well, I expected that not to work. Let’s run EMERGENCY-mode repair:
DBCC CHECKDB (N'EmergencyDemo', REPAIR_ALLOW_DATA_LOSS) WITH ALL_ERROMSGS, NO_INFOMSGS; GO
Msg 945, Level 14, State 2, Line 1 Database 'EmergencyDemo' cannot be opened due to inaccessible files or insufficient memory or disk space. See the SQL Server errorlog for details.
Hmm – looks like the failed ALTER DATABASE statement did change the state – but what to?
SELECT
[state_desc]
FROM
sys.databases
WHERE
[name] = N'EmergencyDemo';
GO
state_desc ----------------- RECOVERY_PENDING
That makes sense because the log file does not exist. The recovery system knows that recovery has to run on the database but it is unable to start without the log file.
Back to EMERGENCY mode and runing repair:
ALTER DATABASE [EmergencyDemo] SET EMERGENCY; GO DBCC CHECKDB (N'EmergencyDemo', REPAIR_ALLOW_DATA_LOSS) WITH ALL_ERRORMSGS, NO_INFOMSGS; GO
Msg 7919, Level 16, State 3, Line 1 Repair statement not processed. Database needs to be in single user mode.
EMERGENCY mode is not SINGLE_USER mode. A database needs to be in SINGLE_USER mode for repair to run and EMERGENCY mode allows multiple connections from members of the sysadmin role. You can set SINGLE_USER mode as well as EMERGENCY mode – however, the sys.databases field state_desc will still just say EMERGENCY.
ALTER DATABASE [EmergencyDemo] SET SINGLE_USER; GO DBCC CHECKDB (N'EmergencyDemo', REPAIR_ALLOW_DATA_LOSS) WITH ALL_ERRORMSGS, NO_INFOMSGS; GO
File activation failure. The physical file name "C:\Program Files\Microsoft SQL Server\MSSQL.1\MSSQL\DATA\EmergencyDemo_log.LDF" may be incorrect. The log cannot be rebuilt because the database was not cleanly shut down. Warning: The log for database 'EmergencyDemo' has been rebuilt. Transactional consistency has been lost. The RESTORE chain was broken, and the server no longer has context on the previous log files, so you will need to know what they were. You should run DBCC CHECKDB to validate physical consistency. The database has been put in dbo-only mode. When you are ready to make the database available for use, you will need to reset database options and delete any extra log files.
This time it worked. First of all we get the same error as if we tried to bring the database online – that’s from the code that’s trying to run ‘recovery with CONTINUE_AFTER_ERROR‘ on the transaction log. Next we get a nice long warning that the transaction log has been rebuilt and the consequences of doing that (basically that you need to start a new log backup chain by taking a full or differential backup). If there had been any corruptions we’d have seen the usual output from DBCC CHECKDB about what errors it found and fixed. There’s also a bunch of stuff in the error log:
2013-01-20 15:45:45.550 spid51 Starting up database 'EmergencyDemo'. 2013-01-20 15:45:45.560 spid51 Error: 5105, Severity: 16, State: 1. 2013-01-20 15:45:45.560 spid51 A file activation error occurred. The physical file name 'C:\Program Files\Microsoft SQL Server\MSSQL.1\MSSQL\DATA\EmergencyDemo_log.LDF' may be incorrect. Diagnose and correct additional errors, and retry the operation. 2013-01-20 15:45:45.560 spid51 Starting up database 'EmergencyDemo'. 2013-01-20 15:45:45.610 spid51 Starting up database 'EmergencyDemo'. 2013-01-20 15:45:45.630 spid51 Warning: The log for database 'EmergencyDemo' has been rebuilt. Transactional consistency has been lost. The RESTORE chain was broken, and the server no longer has context on the previous log files, so you will need to know what they were. You should run DBCC CHECKDB to validate physical consistency. The database has been put in dbo-only mode. When you are ready to make the database available for use, you will need to reset database options and delete any extra log files. 2013-01-20 15:45:45.730 spid51 EMERGENCY MODE DBCC CHECKDB (EmergencyDemo, repair_allow_data_loss) WITH all_errormsgs, no_infomsgs executed by APPLECROSS\paul found 0 errors and repaired 0 errors. Elapsed time: 0 hours 0 minutes 0 seconds.
Note that the usual error log entry from running DBCC CHECKDB is preceded by ‘EMERGENCY MODE‘ this time.
Checking the database state:
SELECT
[state_desc]
FROM
sys.databases
WHERE
[name] = N'EmergencyDemo';
GO
state_desc ----------- ONLINE
We find that it’s been brought back online again because everything worked. Let’s see what happened to our table:
USE [EmergencyDemo]; GO SELECT * FROM [salaries]; GO
FirstName LastName Salary ---------- ----------- ------- John Williamson 10000 Stephen Brown 0 Jack Bauer 10000 (3 row(s) affected)
And of course its still corrupt – because even though the transaction log was rebuilt and repaired, the original transaction that changed the salary to 0 never got a chance to rollback because I deleted the transaction log (in the previous post).
Now remember, you should only use this as a last resort, but if you do get yourself into trouble, you know there’s a command that should be able to help you.
28 Responses to EMERGENCY-mode repair: the very, very last resort
Paul:
Your article called "CHECKDB From Every Angle: EMERGENCY mode repair – the very, very last resort" is awesome. It just saved my butt. You are the only one on the web the steps you through the process to recover from a suspect database, step by step. Thanks. You deserve a huge pay raise, where ever you work at now. Best of future to you in your future endeavors. Godspeed my friend.
JMB
Paul,
Brilliant, you saved my day!
I accidentally deleted log file after setting database offline, then found out that it’s not the same as "detach". Bingo, I expected at least a couple of days just to reimport 80,000,000 records from the so called Death Master File. Thanks to your article, everything was fixed in 30 minutes!
Best Wishes in New Year!
Vladimir
When I do this that is when database is in recovery pending state and when i try to follow the above steps..it says Database ‘dbWiki’ is being recovered. Waiting until recovery is finished.
And the recovery goes on for a long time. Not sure if the DB is even being recovered. What is the ay out
Hi Ravi,
Is there any activity in the database (i.e. is recovery still running)? Do you get this message when you try to set the database into emergency mode? How did you get into this state? etc etc
Thanks
Super! You’re info has brought up my db up and running again!
well, i’m really glad, that you went through all those steps and shows therefore how to repair hopeless cases, running under sql-server 2005
unfortunately, most examples on the internet are written for sql 2000 and don’t get a glimpse of this new way unter 2005.
regards and great thanks
raimund
Paul,
I get the same message as ravi above. Is it because you can’t go into emergency with a bad boot page?
Can a db be set to emergency mode if the database boot page is torn as in the following error:
Msg 824, Level 24, State 2, Line 1
SQL Server detected a logical consistency-based I/O error: torn page
(expected signature: 0xaaaaaaaa; actual signature: 0x56aaaaaa). It occurred
during a read of page (1:9) in database ID 7 at offset 0×00000000012000 in
file ‘D:\Program Files\Microsoft SQL Server\MSSQL.1\MSSQL\Data\ICS.mdf’.
Additional messages in the SQL Server error log or system event log may
provide more detail. This is a severe error condition that threatens
database integrity and must be corrected immediately. Complete a full
database consistency check (DBCC CHECKDB). This error can be caused by many
factors; for more information, see SQL Server Books Online.
Thanks,
Josh
Tried your steps on a database that had 3 files (Data, Index and Log) where we only have Data left (Index, Log and Backups were all on RAID5 external array that blew up.
We have re-created the database, stopped SQL 2k5, replaced Data file and started SQL.
ALTER DATABASE db_name SET EMERGENCY
Command(s) completed successfully.
ALTER DATABASE db_name SET SINGLE_USER
Command(s) completed successfully.
SELECT state_desc FROM sys.databases WHERE name=’db_name’
RECOVERY_PENDING
No matter what we do, the state is always ‘RECOVERY_PENDING’
DBCC CHECKDB (db_name, REPAIR_ALLOW_DATA_LOSS) WITH NO_INFOMSGS;
Msg 922, Level 14, State 1, Line 1
Database ‘db_name’ is being recovered. Waiting until recovery is finished.
Server does not show any activity – how can I tell if anything is actually happening?
What can we try next?
Hmm – given that you’ve got a missing data file (the index one) this may be a lost cause. You could try setting that file offline, and setting all the indexes that were on it offline too. As long as they’re just nonclustered indexes, you *might* be able to get that to work – but this is a lot more complicated. Topic for another blog post…
Really brilliant you saved the day of many DBA and Administrators with this article,
Thanks a lot
if you get "Database ‘db_name’ is being recovered. Waiting until recovery is finished. "
Follow the following steps:
1 stop sql server
2 rename your db files or move them to another location
3 start sql server
4 remove the database from the list
5 create a new database with the same name and the same datafiles
6 set this database in emergency mode
7 stop sql server
8 copy your original db files back
9 start sql server.
Your original db is now in emergency mode.
Thanks a lot Paul, you’re the man. I was in the last devconnections in Orlando and I was in all your sessions. A couple of days later I had a very big corruption problem and your session and this were really, really, really helpfully. Once again thanks a lot.
regards
Your info was what I was missing to fix my SUSPECT database. It seemed to be quite difficult to find this info on the internets. Thanks very much Paul!
Paul,
I already used this command ALTER DATABASE db_name SET EMERGENCY after that database has come in emergency mode but when i used this command ALTER DATABASE db_name SET SINGLE_USER,
it has run sucessfully but database status not changed Emergency to Single User mode.
Dear Paul,
I have a database in restore state with corrupted log.
I did put it into emergency mode and performed a plain DBCC checkdb(‘KAE_Retail_4′) and had no consistency errors.
However when I try the DBCC CHECKDB (KAE_Retail_4, REPAIR_ALLOW_DATA_LOSS) I get the following messages.
I would really appreceiate ANY HELP or Suggestion.
—————————————————————————————————–
Msg 5120, Level 16, State 101, Line 1
Unable to open the physical file "F:\ERPDB\KAE_fromKAE.mdf". Operating system error 5: "5(error not found)".
File activation failure. The physical file name "F:\ERPDB\KAE_Retail_4_log.ldf" may be incorrect.
Msg 5120, Level 16, State 101, Line 1
Unable to open the physical file "F:\ERPDB\KAE_fromKAE.mdf". Operating system error 5: "5(error not found)".
File activation failure. The physical file name "F:\ERPDB\KAE_Retail_4_log.ldf" may be incorrect.
Msg 5028, Level 16, State 5, Line 1
The system could not activate enough of the database to rebuild the log.
Msg 7909, Level 20, State 1, Line 1
The emergency-mode repair failed.You must restore from backup.
———————————————————————————————————-
I am trying to recover a 2000 DB on a 2008 box, all the steps worked until I got to the DBCC. Now I get the error
Msg 946, Level 14, State 1, Line 3
Cannot open database ‘CompTecDataSQL1′ version 539. Upgrade the database to the latest version.
Is there a fix for this?
You saved my day Paul. Thanks for posting this wonderful article. As some of the mentioned, you surely deserve a pay raise or a one month paid vacation at Bahamas :-).
[...] Using EMERGENCY mode repair to repair a damaged transaction log (see this blog post) [...]
[...] inconsistent. I'm going to choose to repair the database using emergency-mode repair. See CHECKDB From Every Angle: EMERGENCY mode repair – the very, very last resort for a detailed description of this [...]
Hi. I hope you don’t mind I published a link to your site. great article, thanks!
http://thelonelydba.wordpress.com/2013/01/30/sql-database-in-emergency-mode/
[...] EMERGENCY-mode repair: the very, very last resort [...]
Thank you
You saved me! Thank you very much for this information. I thought we were done for.
Thank you very much for this article.
I used this procedure but now I cannot backup the database.
System.Data.SqlClient.SqlError: The backup of the file or filegroup “XYZ” is not permitted because it is not online. BACKUP can be performed by using the FILEGROUP or FILE clauses to restrict the selection to include only online data. (Microsoft.SqlServer.Smo)
I cannot backup using FILEGROUP or FILE.
The database is online.
I even created a FULLTEXT…but still.
Hoping to get a save here :).
Any suggestions?
Thanks
Do:
ALTER DATABASE XYZ SET OFFLINE
ALTER DATABASE XYZ SET ONLINE
If they succeed, nothing should prevent the backup succeeding.
Thank you thank you thank you!!! :D yeah!!!
Thank you for the post. I was able to follow things and it seems like it is recovering the database. The question I have is how long should it take roughly for the recovery to be completed. My database size is around 1.2GB with around 5 million records and around 50 or so tables. It is now 40 minutes and it is still saying “Exceuting Query”. Does this seem to be normal?
It’s impossible for me to say – totally depends on what transactions were in flight when the crash occurred.