{"id":465,"date":"2012-12-08T00:35:39","date_gmt":"2012-12-08T00:35:39","guid":{"rendered":"\/blogs\/joe\/post\/Exploring-Column-Correlation-and-Cardinality-Estimates.aspx"},"modified":"2013-12-29T19:15:45","modified_gmt":"2013-12-30T03:15:45","slug":"exploring-column-correlation-and-cardinality-estimates","status":"publish","type":"post","link":"https:\/\/www.sqlskills.com\/blogs\/joe\/exploring-column-correlation-and-cardinality-estimates\/","title":{"rendered":"Exploring Column Correlation and Cardinality Estimates"},"content":{"rendered":"<p>Last Thursday I presented a session at the <a href=\"http:\/\/performance.sqlpass.org\/2012Palooza.aspx\" target=\"_blank\" class=\"broken_link\">PASS Winter 2012 Performance Palooza<\/a>.\u00a0 It was a great experience and I appreciated the opportunity.\u00a0 The topic was \u201cTroubleshooting Query Plan Quality Issues\u201d and I received a few email questions after the presentation, so I thought I would walk through the full scenario, weaving in a few additional points that were motivated by the questions I received.<\/p>\n<p>First, let me set up the scenario.\u00a0 I used the Credit database, which you can download <a href=\"https:\/\/www.sqlskills.com\/resources\/conferences\/creditbackup100.zip\" target=\"_blank\">here<\/a>.<\/p>\n<p>The first T-SQL I executed updated the member table with ten different city and state_prov combinations:<\/p>\n<blockquote><p>USE Credit;<br \/>\nGO<\/p>\n<p>UPDATE\u00a0 [dbo].[member]<br \/>\nSET\u00a0\u00a0\u00a0\u00a0 [city] = &#8216;Minneapolis&#8217;,<br \/>\n[state_prov] = &#8216;MN&#8217;<br \/>\nWHERE\u00a0\u00a0 [member_no] % 10 = 0;<\/p>\n<p>UPDATE\u00a0 [dbo].[member]<br \/>\nSET\u00a0\u00a0\u00a0\u00a0 [city] = &#8216;New York&#8217;,<br \/>\n[state_prov] = &#8216;NY&#8217;<br \/>\nWHERE\u00a0\u00a0 [member_no] % 10 = 1;<\/p>\n<p>UPDATE\u00a0 [dbo].[member]<br \/>\nSET\u00a0\u00a0\u00a0\u00a0 [city] = &#8216;Chicago&#8217;,<br \/>\n[state_prov] = &#8216;IL&#8217;<br 
\/>\nWHERE\u00a0\u00a0 [member_no] % 10 = 2;<\/p>\n<p>UPDATE\u00a0 [dbo].[member]<br \/>\nSET\u00a0\u00a0\u00a0\u00a0 [city] = &#8216;Houston&#8217;,<br \/>\n[state_prov] = &#8216;TX&#8217;<br \/>\nWHERE\u00a0\u00a0 [member_no] % 10 = 3;<\/p>\n<p>UPDATE\u00a0 [dbo].[member]<br \/>\nSET\u00a0\u00a0\u00a0\u00a0 [city] = &#8216;Philadelphia&#8217;,<br \/>\n[state_prov] = &#8216;PA&#8217;<br \/>\nWHERE\u00a0\u00a0 [member_no] % 10 = 4;<\/p>\n<p>UPDATE\u00a0 [dbo].[member]<br \/>\nSET\u00a0\u00a0\u00a0\u00a0 [city] = &#8216;Phoenix&#8217;,<br \/>\n[state_prov] = &#8216;AZ&#8217;<br \/>\nWHERE\u00a0\u00a0 [member_no] % 10 = 5;<\/p>\n<p>UPDATE\u00a0 [dbo].[member]<br \/>\nSET\u00a0\u00a0\u00a0\u00a0 [city] = &#8216;San Antonio&#8217;,<br \/>\n[state_prov] = &#8216;TX&#8217;<br \/>\nWHERE\u00a0\u00a0 [member_no] % 10 = 6;<\/p>\n<p>UPDATE\u00a0 [dbo].[member]<br \/>\nSET\u00a0\u00a0\u00a0\u00a0 [city] = &#8216;San Diego&#8217;,<br \/>\n[state_prov] = &#8216;CA&#8217;<br \/>\nWHERE\u00a0\u00a0 [member_no] % 10 = 7;<\/p>\n<p>UPDATE\u00a0 [dbo].[member]<br \/>\nSET\u00a0\u00a0\u00a0\u00a0 [city] = &#8216;Dallas&#8217;,<br \/>\n[state_prov] = &#8216;TX&#8217;<br \/>\nWHERE\u00a0\u00a0 [member_no] % 10 = 8;<br \/>\nGO<\/p><\/blockquote>\n<p>Next, with \u201cInclude Actual Execution Plan\u201d enabled, I executed the following query:<\/p>\n<blockquote><p>SELECT\u00a0 [lastname],<br \/>\n[firstname]<br \/>\nFROM\u00a0\u00a0\u00a0 [dbo].[member]<br \/>\nWHERE\u00a0\u00a0 [city] = &#8216;Minneapolis&#8217;;<br \/>\nGO<\/p><\/blockquote>\n<p>Looking at the estimated rows versus actual (using <a href=\"http:\/\/www.sqlsentry.net\/plan-explorer\/sql-server-query-view.asp\" target=\"_blank\">SQL Sentry Plan Explorer<\/a>), I see that the estimate is spot-on with 1,000 rows estimated and 1,000 rows actual:<\/p>\n<p><a href=\"https:\/\/www.sqlskills.com\/blogs\/joe\/wp-content\/uploads\/windows-live-writer\/642f0e2e97ed\/7a013434\/\\Limage.png\"><img fetchpriority=\"high\" 
decoding=\"async\" style=\"background-image: none; padding-top: 0px; padding-left: 0px; display: inline; padding-right: 0px; border: 0px;\" title=\"image\" alt=\"image\" src=\"https:\/\/www.sqlskills.com\/blogs\/joe\/wp-content\/uploads\/windows-live-writer\/642f0e2e97ed\/4ba7ae87\/\\Limage_thumb.png\" width=\"815\" height=\"66\" border=\"0\" \/><\/a><\/p>\n<p>Now my database has automatic statistics creation (AUTO_CREATE_STATISTICS) enabled, so even though I don\u2019t have a supporting index on the city column, I <em>do<\/em> have supporting statistics (which were created automatically when my query was compiled):<\/p>\n<blockquote><p>EXEC dbo.sp_helpstats &#8216;member&#8217;;<\/p><\/blockquote>\n<p><a href=\"https:\/\/www.sqlskills.com\/blogs\/joe\/wp-content\/uploads\/windows-live-writer\/642f0e2e97ed\/18d7a813\/\\Limage.png\"><img decoding=\"async\" style=\"background-image: none; padding-top: 0px; padding-left: 0px; display: inline; padding-right: 0px; border: 0px;\" title=\"image\" alt=\"image\" src=\"https:\/\/www.sqlskills.com\/blogs\/joe\/wp-content\/uploads\/windows-live-writer\/642f0e2e97ed\/2ab43be0\/\\Limage_thumb.png\" width=\"432\" height=\"56\" border=\"0\" \/><\/a><\/p>\n<p>Looking at the statistics information via DBCC SHOW_STATISTICS, I see the following STAT_HEADER, DENSITY_VECTOR, and HISTOGRAM information (highlighting the Minneapolis histogram step):<\/p>\n<blockquote><p>DBCC SHOW_STATISTICS(&#8216;member&#8217;, &#8216;_WA_Sys_00000006_0CBAE877&#8217;);<\/p><\/blockquote>\n<p><a href=\"https:\/\/www.sqlskills.com\/blogs\/joe\/wp-content\/uploads\/windows-live-writer\/642f0e2e97ed\/2a4808eb\/\\Limage.png\"><img decoding=\"async\" style=\"background-image: none; padding-top: 0px; padding-left: 0px; display: inline; padding-right: 0px; border: 0px;\" title=\"image\" alt=\"image\" src=\"https:\/\/www.sqlskills.com\/blogs\/joe\/wp-content\/uploads\/windows-live-writer\/642f0e2e97ed\/3df4f27f\/\\Limage_thumb.png\" width=\"871\" height=\"311\" border=\"0\" \/><\/a><\/p>\n<p>So 
we see the DENSITY_VECTOR shows an \u201call density\u201d value of 0.1 and we also see a Minneapolis RANGE_HI_KEY histogram step with an EQ_ROWS value of 1000.<\/p>\n<p>Next, I executed the following query, looking at city AND state_prov:<\/p>\n<blockquote><p>SELECT\u00a0 [lastname],<br \/>\n[firstname]<br \/>\nFROM\u00a0\u00a0\u00a0 [dbo].[member]<br \/>\nWHERE\u00a0\u00a0 [city] = &#8216;Minneapolis&#8217; AND<br \/>\n[state_prov] = &#8216;MN&#8217;<br \/>\nOPTION (RECOMPILE);<br \/>\nGO<\/p><\/blockquote>\n<p>Now I personally know that these two columns are correlated, but the query optimizer does <em>not<\/em>.\u00a0 The query optimizer assumes that these two columns are <em>independent<\/em>.\u00a0 Here is the estimated versus actual for this query:<\/p>\n<p><a href=\"https:\/\/www.sqlskills.com\/blogs\/joe\/wp-content\/uploads\/windows-live-writer\/642f0e2e97ed\/35fd501d\/\\Limage.png\"><img loading=\"lazy\" decoding=\"async\" style=\"background-image: none; padding-top: 0px; padding-left: 0px; display: inline; padding-right: 0px; border: 0px;\" title=\"image\" alt=\"image\" src=\"https:\/\/www.sqlskills.com\/blogs\/joe\/wp-content\/uploads\/windows-live-writer\/642f0e2e97ed\/03997c9e\/\\Limage_thumb.png\" width=\"822\" height=\"76\" border=\"0\" \/><\/a><\/p>\n<p>We see an estimate of 100 rows, versus the actual 1,000 rows.<\/p>\n<p>We also have new statistics generated for the state_prov column:<\/p>\n<blockquote><p>EXEC dbo.sp_helpstats &#8216;member&#8217;;<\/p><\/blockquote>\n<p><a href=\"https:\/\/www.sqlskills.com\/blogs\/joe\/wp-content\/uploads\/windows-live-writer\/642f0e2e97ed\/42f7302e\/\\Limage.png\"><img loading=\"lazy\" decoding=\"async\" style=\"background-image: none; padding-top: 0px; padding-left: 0px; display: inline; padding-right: 0px; border: 0px;\" title=\"image\" alt=\"image\" src=\"https:\/\/www.sqlskills.com\/blogs\/joe\/wp-content\/uploads\/windows-live-writer\/642f0e2e97ed\/6dcf9440\/\\Limage_thumb.png\" width=\"391\" 
height=\"72\" border=\"0\" \/><\/a><\/p>\n<p>And notice that the statistics_keys are for single-column statistics.\u00a0 SQL Server does not automatically generate multi-column statistics.<\/p>\n<p>Looking at the statistics information via DBCC SHOW_STATISTICS, I see the following STAT_HEADER, DENSITY_VECTOR, and HISTOGRAM information (highlighting the MN histogram step):<\/p>\n<blockquote><p>DBCC SHOW_STATISTICS(&#8216;member&#8217;, &#8216;_WA_Sys_00000007_0CBAE877&#8217;);<\/p><\/blockquote>\n<p><a href=\"https:\/\/www.sqlskills.com\/blogs\/joe\/wp-content\/uploads\/windows-live-writer\/642f0e2e97ed\/065f3191\/\\Limage.png\"><img loading=\"lazy\" decoding=\"async\" style=\"background-image: none; padding-top: 0px; padding-left: 0px; display: inline; padding-right: 0px; border: 0px;\" title=\"image\" alt=\"image\" src=\"https:\/\/www.sqlskills.com\/blogs\/joe\/wp-content\/uploads\/windows-live-writer\/642f0e2e97ed\/2c54e1e7\/\\Limage_thumb.png\" width=\"959\" height=\"299\" border=\"0\" \/><\/a><\/p>\n<p>The \u201call density\u201d value for state_prov is 0.125 (and notice that, unlike with city, one state, TX, has an EQ_ROWS value of 3,000).<\/p>\n<p>So the MN step shows 1,000 rows out of 10,000 rows (10%).\u00a0 And the Minneapolis step shows 1,000 rows out of 10,000 rows (10%).\u00a0 But SQL Server does not assume that these two columns are correlated, and so we end up with an estimate of 1% of the rows (0.10 * 0.10 = 0.01, or 100 of the 10,000 rows).\u00a0 And while this is a small-scale example, imagine this for much larger skews.\u00a0 What kind of impact could this have on the query execution plan?\u00a0 And how many times do you have predicates referencing correlated columns in your query?<\/p>\n<p>Now, to help out the query optimizer, I can manually create multi-column statistics on city and state_prov:<\/p>\n<blockquote><p>CREATE STATISTICS [member_city_state_prov]<br \/>\nON [dbo].[member]([city],[state_prov]);<br \/>\nGO<\/p><\/blockquote>\n<p>If I re-execute my original 
query with city and state_prov predicates, I see that my estimates are now exactly correct:<\/p>\n<p><a href=\"https:\/\/www.sqlskills.com\/blogs\/joe\/wp-content\/uploads\/windows-live-writer\/642f0e2e97ed\/394ec1f8\/\\Limage.png\"><img loading=\"lazy\" decoding=\"async\" style=\"background-image: none; padding-top: 0px; padding-left: 0px; display: inline; padding-right: 0px; border: 0px;\" title=\"image\" alt=\"image\" src=\"https:\/\/www.sqlskills.com\/blogs\/joe\/wp-content\/uploads\/windows-live-writer\/642f0e2e97ed\/71f96c05\/\\Limage_thumb.png\" width=\"818\" height=\"68\" border=\"0\" \/><\/a><\/p>\n<p>But this isn\u2019t the end of the story, because if you look at the STAT_HEADER, DENSITY_VECTOR, and HISTOGRAM of the manually created statistics, you\u2019ll see the following:<\/p>\n<p><a href=\"https:\/\/www.sqlskills.com\/blogs\/joe\/wp-content\/uploads\/windows-live-writer\/642f0e2e97ed\/78404293\/\\Limage.png\"><img loading=\"lazy\" decoding=\"async\" style=\"background-image: none; padding-top: 0px; padding-left: 0px; display: inline; padding-right: 0px; border: 0px;\" title=\"image\" alt=\"image\" src=\"https:\/\/www.sqlskills.com\/blogs\/joe\/wp-content\/uploads\/windows-live-writer\/642f0e2e97ed\/490e56fc\/\\Limage_thumb.png\" width=\"909\" height=\"302\" border=\"0\" \/><\/a><\/p>\n<p>Notice that the DENSITY_VECTOR shows two rows \u2013 one with city and one with city, state_prov.\u00a0 Both show an \u201call density\u201d of 0.1 \u2013 which reflects our correlation between the two columns.<\/p>\n<p>But also notice that the HISTOGRAM does NOT show multi-column steps.\u00a0 It just shows the leading statistics key column, city \u2013 with steps equal to the various city values.\u00a0 So in the case of Minneapolis, MN \u2013 the \u201call density\u201d value was correct.<\/p>\n<p>What about a scenario where I pick a mismatched city and state_prov combination (Minneapolis and Texas)?<\/p>\n<blockquote><p>SELECT\u00a0 [lastname],<br 
\/>\n[firstname]<br \/>\nFROM\u00a0\u00a0\u00a0 [dbo].[member]<br \/>\nWHERE\u00a0\u00a0 [city] = &#8216;Minneapolis&#8217; AND<br \/>\n[state_prov] = &#8216;TX&#8217;<br \/>\nOPTION (RECOMPILE);<br \/>\nGO<\/p><\/blockquote>\n<p>This time we get the following cardinality estimate skew:<\/p>\n<p><a href=\"https:\/\/www.sqlskills.com\/blogs\/joe\/wp-content\/uploads\/windows-live-writer\/642f0e2e97ed\/3d0c66c8\/\\Limage.png\"><img loading=\"lazy\" decoding=\"async\" style=\"background-image: none; padding-top: 0px; padding-left: 0px; display: inline; padding-right: 0px; border: 0px;\" title=\"image\" alt=\"image\" src=\"https:\/\/www.sqlskills.com\/blogs\/joe\/wp-content\/uploads\/windows-live-writer\/642f0e2e97ed\/5cbb4090\/\\Limage_thumb.png\" width=\"913\" height=\"76\" border=\"0\" \/><\/a><\/p>\n<p>We estimated the rows based on DENSITY_VECTOR, but without a multi-column HISTOGRAM, the query optimizer doesn\u2019t know that there are no Minneapolis city and Texas state_prov rows.\u00a0 So while multi-column statistics can be helpful, there are limits.<\/p>\n<p>Now what if I drop my statistics and add in a multi-column index instead?<\/p>\n<blockquote><p>DROP STATISTICS\u00a0 [dbo].[member].[member_city_state_prov];<br \/>\nGO<\/p>\n<p>CREATE INDEX [member_city_state_prov]<br \/>\nON [dbo].[member]([city],[state_prov]);<br \/>\nGO<\/p><\/blockquote>\n<p>The multi-column index will provide me with the same results for the Minneapolis \/ MN combination \u2013 as well as the same skew for the Minneapolis \/ TX combo.<\/p>\n<p>One question I received was around the index choice after I created the index on city and state_prov.\u00a0 Why didn\u2019t that index get used via an index seek?<\/p>\n<p><a href=\"https:\/\/www.sqlskills.com\/blogs\/joe\/wp-content\/uploads\/windows-live-writer\/642f0e2e97ed\/62218936\/\\Limage.png\"><img loading=\"lazy\" decoding=\"async\" style=\"background-image: none; padding-top: 0px; padding-left: 0px; display: inline; padding-right: 
0px; border: 0px;\" title=\"image\" alt=\"image\" src=\"https:\/\/www.sqlskills.com\/blogs\/joe\/wp-content\/uploads\/windows-live-writer\/642f0e2e97ed\/1acc3344\/\\Limage_thumb.png\" width=\"272\" height=\"108\" border=\"0\" \/><\/a><\/p>\n<p>Well, the new index did get used from a cardinality estimate perspective, but the final plan choice involved a clustered index scan.\u00a0 The warning indicator you see on the SELECT was for the following missing index:<\/p>\n<blockquote><p>CREATE NONCLUSTERED INDEX [&lt;Name of Missing Index, sysname,&gt;]<br \/>\nON [dbo].[member] ([city],[state_prov])<br \/>\nINCLUDE ([lastname],[firstname])<br \/>\nGO<\/p><\/blockquote>\n<p>This is where we should explore the cost alternatives.\u00a0 Below shows the side-by-side costs of a Clustered Index Scan, a (forced) bookmark lookup using the index I created earlier just on city and state, and then finally the missing index suggestion with the additional INCLUDE columns (using INDEX hints to force the three different options):<\/p>\n<blockquote><p>&#8212; Clustered index scan<br \/>\nSELECT\u00a0 [lastname],<br \/>\n[firstname]<br \/>\nFROM\u00a0\u00a0\u00a0 [dbo].[member]<br \/>\nWITH (INDEX = [member_ident])<br \/>\nWHERE\u00a0\u00a0 [city] = &#8216;Minneapolis&#8217; AND<br \/>\n[state_prov] = &#8216;MN&#8217;<br \/>\nOPTION (RECOMPILE);<br \/>\nGO<\/p>\n<p><a href=\"https:\/\/www.sqlskills.com\/blogs\/joe\/wp-content\/uploads\/windows-live-writer\/642f0e2e97ed\/7b258179\/\\Limage.png\"><img loading=\"lazy\" decoding=\"async\" style=\"background-image: none; padding-top: 0px; padding-left: 0px; display: inline; padding-right: 0px; border: 0px;\" title=\"image\" alt=\"image\" src=\"https:\/\/www.sqlskills.com\/blogs\/joe\/wp-content\/uploads\/windows-live-writer\/642f0e2e97ed\/3ef9b5d1\/\\Limage_thumb.png\" width=\"667\" height=\"73\" border=\"0\" \/><\/a><\/p>\n<p>&#8212; Non-covering NCI<br \/>\nSELECT\u00a0 [lastname],<br \/>\n[firstname]<br \/>\nFROM\u00a0\u00a0\u00a0 
[dbo].[member]<br \/>\nWITH (INDEX = [member_city_state_prov])<br \/>\nWHERE\u00a0\u00a0 [city] = &#8216;Minneapolis&#8217; AND<br \/>\n[state_prov] = &#8216;MN&#8217;<br \/>\nOPTION (RECOMPILE);<br \/>\nGO<\/p>\n<p><a href=\"https:\/\/www.sqlskills.com\/blogs\/joe\/wp-content\/uploads\/windows-live-writer\/642f0e2e97ed\/5a320ed2\/\\Limage.png\"><img loading=\"lazy\" decoding=\"async\" style=\"background-image: none; padding-top: 0px; padding-left: 0px; display: inline; padding-right: 0px; border: 0px;\" title=\"image\" alt=\"image\" src=\"https:\/\/www.sqlskills.com\/blogs\/joe\/wp-content\/uploads\/windows-live-writer\/642f0e2e97ed\/79e0e89a\/\\Limage_thumb.png\" width=\"668\" height=\"101\" border=\"0\" \/><\/a><\/p>\n<p>&#8212; Covering NCI<br \/>\nSELECT\u00a0 [lastname],<br \/>\n[firstname]<br \/>\nFROM\u00a0\u00a0\u00a0 [dbo].[member]<br \/>\nWITH (INDEX = [member_city_state_prov_2])<br \/>\nWHERE\u00a0\u00a0 [city] = &#8216;Minneapolis&#8217; AND<br \/>\n[state_prov] = &#8216;MN&#8217;<br \/>\nOPTION (RECOMPILE);<br \/>\nGO<\/p>\n<p><a href=\"https:\/\/www.sqlskills.com\/blogs\/joe\/wp-content\/uploads\/windows-live-writer\/642f0e2e97ed\/393e9c2b\/\\Limage.png\"><img loading=\"lazy\" decoding=\"async\" style=\"background-image: none; padding-top: 0px; padding-left: 0px; display: inline; padding-right: 0px; border: 0px;\" title=\"image\" alt=\"image\" src=\"https:\/\/www.sqlskills.com\/blogs\/joe\/wp-content\/uploads\/windows-live-writer\/642f0e2e97ed\/120452f6\/\\Limage_thumb.png\" width=\"678\" height=\"71\" border=\"0\" \/><\/a><\/p><\/blockquote>\n<p>So while the query optimizer used the index statistics for my cardinality estimate, the estimated cost of a Clustered Index Scan was 0.011 versus using the non-covering nonclustered index and key lookup cost of 0.158.\u00a0 And of course, the fully covering index was the cheapest estimated cost out of the three \u2013 at 0.001, although whether it makes sense holistically to accommodate that one query with a 
covering index is another topic altogether.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Last Thursday I presented a session at the PASS Winter 2012 Performance Palooza.\u00a0 It was a great experience and I appreciated the opportunity.\u00a0 The topic was \u201cTroubleshooting Query Plan Quality Issues\u201d and I received a few email questions after the presentation, so I thought I would walk through the full scenario, weaving in a few [&hellip;]<\/p>\n","protected":false},"author":7,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[42,28],"tags":[],"class_list":["post-465","post","type-post","status-publish","format-standard","hentry","category-cardinality-estimation","category-performance"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v21.9.1 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Exploring Column Correlation and Cardinality Estimates - Joe Sack<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.sqlskills.com\/blogs\/joe\/exploring-column-correlation-and-cardinality-estimates\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Exploring Column Correlation and Cardinality Estimates - Joe Sack\" \/>\n<meta property=\"og:description\" content=\"Last Thursday I presented a session at the PASS Winter 2012 Performance Palooza.\u00a0 It was a great experience and I appreciated the opportunity.\u00a0 The topic was \u201cTroubleshooting Query Plan Quality Issues\u201d and I received a few email questions after the presentation, so I thought I would walk through the full scenario, weaving in a few [&hellip;]\" \/>\n<meta property=\"og:url\" 
content=\"https:\/\/www.sqlskills.com\/blogs\/joe\/exploring-column-correlation-and-cardinality-estimates\/\" \/>\n<meta property=\"og:site_name\" content=\"Joe Sack\" \/>\n<meta property=\"article:published_time\" content=\"2012-12-08T00:35:39+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2013-12-30T03:15:45+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.sqlskills.com\/blogs\/joe\/wp-content\/uploads\/windows-live-writer\/642f0e2e97ed\/4ba7ae87\/Limage_thumb.png\" \/>\n<meta name=\"author\" content=\"Joseph Sack\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Joseph Sack\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"6 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.sqlskills.com\/blogs\/joe\/exploring-column-correlation-and-cardinality-estimates\/\",\"url\":\"https:\/\/www.sqlskills.com\/blogs\/joe\/exploring-column-correlation-and-cardinality-estimates\/\",\"name\":\"Exploring Column Correlation and Cardinality Estimates - Joe 
Sack\",\"isPartOf\":{\"@id\":\"https:\/\/www.sqlskills.com\/blogs\/joe\/#website\"},\"datePublished\":\"2012-12-08T00:35:39+00:00\",\"dateModified\":\"2013-12-30T03:15:45+00:00\",\"author\":{\"@id\":\"https:\/\/www.sqlskills.com\/blogs\/joe\/#\/schema\/person\/533eb0113a15fb5a6e8067a49e4ae648\"},\"breadcrumb\":{\"@id\":\"https:\/\/www.sqlskills.com\/blogs\/joe\/exploring-column-correlation-and-cardinality-estimates\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.sqlskills.com\/blogs\/joe\/exploring-column-correlation-and-cardinality-estimates\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.sqlskills.com\/blogs\/joe\/exploring-column-correlation-and-cardinality-estimates\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.sqlskills.com\/blogs\/joe\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Cardinality Estimation\",\"item\":\"https:\/\/www.sqlskills.com\/blogs\/joe\/category\/cardinality-estimation\/\"},{\"@type\":\"ListItem\",\"position\":3,\"name\":\"Exploring Column Correlation and Cardinality Estimates\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.sqlskills.com\/blogs\/joe\/#website\",\"url\":\"https:\/\/www.sqlskills.com\/blogs\/joe\/\",\"name\":\"Joe Sack\",\"description\":\"SQL Server Performance Tuning, High Availability and Disaster Recovery Blog\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.sqlskills.com\/blogs\/joe\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.sqlskills.com\/blogs\/joe\/#\/schema\/person\/533eb0113a15fb5a6e8067a49e4ae648\",\"name\":\"Joseph 
Sack\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.sqlskills.com\/blogs\/joe\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/a4b39a7719a6bfff1add3ec00527810734579ee114d6d983e8e68f937b77be96?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/a4b39a7719a6bfff1add3ec00527810734579ee114d6d983e8e68f937b77be96?s=96&d=mm&r=g\",\"caption\":\"Joseph Sack\"},\"description\":\"Joe Sack is a Principal Consultant with SQLskills. He has worked as a SQL Server professional since 1997 and has supported and developed for SQL Server environments in financial services, IT consulting, manufacturing, retail and the real estate industry. Prior to joining SQLskills he worked at Microsoft as a Premier Field Engineer supporting very large enterprise customer environments. He was responsible for providing deep SQL Server advisory services, training, troubleshooting and ongoing solutions guidance. His areas of expertise include performance tuning, scalability, T-SQL development and high-availability. In 2006 Joe earned the \u201cMicrosoft Certified Master: SQL Server 2005\u201d certification and in 2008 he earned the \u201cMicrosoft Certified Master: SQL Server 2008\u201d certification. In 2009 he took over responsibility for the entire SQL Server Microsoft Certified Master program and held that post until 2011. He was given the SQL Server MVP award in 2013.\",\"sameAs\":[\"http:\/\/3.209.169.194\/blogs\/joe\",\"https:\/\/twitter.com\/https:\/\/twitter.com\/josephsack\"],\"url\":\"https:\/\/www.sqlskills.com\/blogs\/joe\/author\/joe\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. 
-->","yoast_head_json":{"title":"Exploring Column Correlation and Cardinality Estimates - Joe Sack","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.sqlskills.com\/blogs\/joe\/exploring-column-correlation-and-cardinality-estimates\/","og_locale":"en_US","og_type":"article","og_title":"Exploring Column Correlation and Cardinality Estimates - Joe Sack","og_description":"Last Thursday I presented a session at the PASS Winter 2012 Performance Palooza.\u00a0 It was a great experience and I appreciated the opportunity.\u00a0 The topic was \u201cTroubleshooting Query Plan Quality Issues\u201d and I received a few email questions after the presentation, so I thought I would walk through the full scenario, weaving in a few [&hellip;]","og_url":"https:\/\/www.sqlskills.com\/blogs\/joe\/exploring-column-correlation-and-cardinality-estimates\/","og_site_name":"Joe Sack","article_published_time":"2012-12-08T00:35:39+00:00","article_modified_time":"2013-12-30T03:15:45+00:00","og_image":[{"url":"https:\/\/www.sqlskills.com\/blogs\/joe\/wp-content\/uploads\/windows-live-writer\/642f0e2e97ed\/4ba7ae87\/\\Limage_thumb.png"}],"author":"Joseph Sack","twitter_misc":{"Written by":"Joseph Sack","Est. 
reading time":"6 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/www.sqlskills.com\/blogs\/joe\/exploring-column-correlation-and-cardinality-estimates\/","url":"https:\/\/www.sqlskills.com\/blogs\/joe\/exploring-column-correlation-and-cardinality-estimates\/","name":"Exploring Column Correlation and Cardinality Estimates - Joe Sack","isPartOf":{"@id":"https:\/\/www.sqlskills.com\/blogs\/joe\/#website"},"datePublished":"2012-12-08T00:35:39+00:00","dateModified":"2013-12-30T03:15:45+00:00","author":{"@id":"https:\/\/www.sqlskills.com\/blogs\/joe\/#\/schema\/person\/533eb0113a15fb5a6e8067a49e4ae648"},"breadcrumb":{"@id":"https:\/\/www.sqlskills.com\/blogs\/joe\/exploring-column-correlation-and-cardinality-estimates\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.sqlskills.com\/blogs\/joe\/exploring-column-correlation-and-cardinality-estimates\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/www.sqlskills.com\/blogs\/joe\/exploring-column-correlation-and-cardinality-estimates\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.sqlskills.com\/blogs\/joe\/"},{"@type":"ListItem","position":2,"name":"Cardinality Estimation","item":"https:\/\/www.sqlskills.com\/blogs\/joe\/category\/cardinality-estimation\/"},{"@type":"ListItem","position":3,"name":"Exploring Column Correlation and Cardinality Estimates"}]},{"@type":"WebSite","@id":"https:\/\/www.sqlskills.com\/blogs\/joe\/#website","url":"https:\/\/www.sqlskills.com\/blogs\/joe\/","name":"Joe Sack","description":"SQL Server Performance Tuning, High Availability and Disaster Recovery Blog","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.sqlskills.com\/blogs\/joe\/?s={search_term_string}"},"query-input":"required 
name=search_term_string"}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/www.sqlskills.com\/blogs\/joe\/#\/schema\/person\/533eb0113a15fb5a6e8067a49e4ae648","name":"Joseph Sack","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.sqlskills.com\/blogs\/joe\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/a4b39a7719a6bfff1add3ec00527810734579ee114d6d983e8e68f937b77be96?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/a4b39a7719a6bfff1add3ec00527810734579ee114d6d983e8e68f937b77be96?s=96&d=mm&r=g","caption":"Joseph Sack"},"description":"Joe Sack is a Principal Consultant with SQLskills. He has worked as a SQL Server professional since 1997 and has supported and developed for SQL Server environments in financial services, IT consulting, manufacturing, retail and the real estate industry. Prior to joining SQLskills he worked at Microsoft as a Premier Field Engineer supporting very large enterprise customer environments. He was responsible for providing deep SQL Server advisory services, training, troubleshooting and ongoing solutions guidance. His areas of expertise include performance tuning, scalability, T-SQL development and high-availability. In 2006 Joe earned the \u201cMicrosoft Certified Master: SQL Server 2005\u201d certification and in 2008 he earned the \u201cMicrosoft Certified Master: SQL Server 2008\u201d certification. In 2009 he took over responsibility for the entire SQL Server Microsoft Certified Master program and held that post until 2011. 
He was given the SQL Server MVP award in 2013.","sameAs":["http:\/\/3.209.169.194\/blogs\/joe","https:\/\/twitter.com\/https:\/\/twitter.com\/josephsack"],"url":"https:\/\/www.sqlskills.com\/blogs\/joe\/author\/joe\/"}]}},"_links":{"self":[{"href":"https:\/\/www.sqlskills.com\/blogs\/joe\/wp-json\/wp\/v2\/posts\/465","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.sqlskills.com\/blogs\/joe\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.sqlskills.com\/blogs\/joe\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.sqlskills.com\/blogs\/joe\/wp-json\/wp\/v2\/users\/7"}],"replies":[{"embeddable":true,"href":"https:\/\/www.sqlskills.com\/blogs\/joe\/wp-json\/wp\/v2\/comments?post=465"}],"version-history":[{"count":0,"href":"https:\/\/www.sqlskills.com\/blogs\/joe\/wp-json\/wp\/v2\/posts\/465\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.sqlskills.com\/blogs\/joe\/wp-json\/wp\/v2\/media?parent=465"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.sqlskills.com\/blogs\/joe\/wp-json\/wp\/v2\/categories?post=465"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.sqlskills.com\/blogs\/joe\/wp-json\/wp\/v2\/tags?post=465"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}