Performance bug: NOLOCK scans involving off-row LOB data

Here's an interesting bug that was just recently fixed, and someone actually hit it today, so it's time to blog it.

Bottom line is that if you're doing NOLOCK scans of tables involving LOB data, the perfmance might suck. 2005 SP3 CU3 has the bug fix, but unfortunately the KB article was written by someone who didn't seem to know what they were talking about, so I suggested it was rewritten, which it has been (but not republished yet). The original KB article is at http://support.microsoft.com/kb/961049/. It's also fixed in 2008, but I don't know which build (I believe CU4 at least).

Here's my explanation of the problem.

LOB data can be stored in-row or off-row (my previous LOB post Importance of choosing the right LOB storage technique has more details). When it's stored off-row, it must be accessed by first reading the pointer to the LOB data from the data/index record, and then following the pointer (remember that 'record' is synonymous with 'row'). When a LOB data value is updated, the off-row value is updated first, and then if the off-row link changed, the data/index record is updated with the new link. There's obvioulsy a window here, where someone reading the data/index record might see the wrong/non-existent off-row pointer. This is exactly what can happen when a NOLOCK scan occurs.

To mitigate the possibility of a NOLOCK scan trying to follow a bad off-row link, the old behavior was to scan all the IAM pages (see Inside the Storage Engine: IAM pages, IAM chains, and allocation units) for the table/index to make sure that the off-row link actually pointed to a page allocated to the table/index. If there are lots of IAM pages, this means lots of logical IOs, and poorly performing SELECT queries. And it does the scan once for *every row*. The person that hit it today had a 500 row select of ~20KB per row taking 20 seconds – 10MB of physical IOs and 30MB of logical IOs!

The fix is to make further use of an already existing in-memory cache of IAM pages to do a quick lookup of the right IAM page covering the GAM interval of the LOB page being read, without having to scan the whole IAM chain.

Hope this helps explain things, and track down perf problems for some of you.

10 thoughts on “Performance bug: NOLOCK scans involving off-row LOB data”

Jerry says:

May 1, 2009 at 6:04 am

Paul – when you say "NOLOCK", am I correct to assume this behavior will also exist if you have set your transaction isolation level to READ UNCOMMITTED?

Thanks.

Jerry

Reply
Paul Randal says:

May 1, 2009 at 6:12 am

@Jerry – Absolutely.

Reply
Adam Machanic says:

May 1, 2009 at 6:52 am

Great post, thanks. I’ve seen numerous cases where adding NOLOCK slowed down a query by 20% or more, and this probably explains a lot of them.

Reply
Rob Boek says:

May 1, 2009 at 8:42 am

Thanks for your help Paul! I’m the one who ran into the problem yesterday. I am on SQL Server 2008 CU4 so the fix had already been applied. I tested returning 500 rows with NOLOCK vs without this morning.

With NOLOCK:
Scan count 1, logical reads 430, physical reads 0, read-ahead reads 0, lob logical reads 1592, lob physical reads 0, lob read-ahead reads 0.
CPU time = 4734 ms, elapsed time = 10105 ms.

Without NOLOCK:
Scan count 1, logical reads 430, physical reads 0, read-ahead reads 0, lob logical reads 1596, lob physical reads 0, lob read-ahead reads 0.
CPU time = 16 ms, elapsed time = 591 ms.

Note that this is with SQL 2008 CU4 installed. The logical reads are the same but NOLOCK takes significantly more CPU time.

-Rob

Reply
Rob Boek says:

May 1, 2009 at 3:07 pm

It sounds like this was not fixed in SQL Server 2008 CU4, but will be in CU5 (not available yet). MS Support is supposed to be getting me details.

Reply
Adam Machanic says:

May 2, 2009 at 12:12 pm

So why are the additional LOB reads not showing up in the STATISTICS IO output? Clearly something is causing that huge spike in CPU time…

Reply
Rob Boek says:

May 5, 2009 at 11:13 am

Apparently, you can see the extra reads in profiler. Query: 77MB of IO. Query with NOLOCK: 110 GB of IO.

Reply
Rob Boek says:

June 8, 2009 at 1:50 pm

After applying SQL Server 2008 CU5, the same query that did used 110GB of IO using nolock, went down to 388MB using nolock. It’s still better if you can avoid using nolock on LOB data, but not nearly as painful as before the fix.

Reply
Michael Pearson says:

June 30, 2009 at 3:54 pm

Paul,
Does this apply the the VARCHAR(MAX) and NVARCHAR(MAX) data types as well?

Reply
paul says:

July 1, 2009 at 9:56 am

Yes – these are LOB data types.

Reply

Imagine feeling confident enough to handle whatever your database throws at you.

With training and consulting from SQLskills, you’ll be able to solve big problems, elevate your team’s capacity, and take control of your data career.

Performance bug: NOLOCK scans involving off-row LOB data

10 thoughts on “Performance bug: NOLOCK scans involving off-row LOB data”

Leave a Reply Cancel reply

Other articles

2024: the year in books

2023 (not a typo!): the year in books

Black Friday Super Sale!

Summer School Savings SALE is live!

The Curious Case of… missing log information

The Curious Case of… missing 832 errors during memory corruption

Imagine feeling confident enough to handle whatever your database throws at you.