A Join A Day – Except

2012-12-17 - A Join A Day, Fundamentals, General, Series, T-SQL Statements

Introduction

This is the seventeenth post in my A Join A Day series about SQL Server Joins. Make sure to let me know how I am doing or ask your burning join related questions by leaving a comment below.

Like the INTERSECT key word, the EXCEPT key word was added to the T-SQL language with the SQL Server 2005 release. It does not look like a join at first, but gets handled by SQL Server like one. Also – like INTERSECTEXCEPT comes directly from the mathematical set theory (pdf).

EXCEPT Example

Let's look at the same example sets again that we used during the INTERSEC post yesterday: : {A, B, M, T, Y, Z} and {G, K, L, M, O, Y}

two sets

The EXCEPT key word represents the difference in set theory. The difference of two sets is defined as the set of the elements that are in the first set but not in the second set. It is denoted with the \ sign:

difference of the two sets

{A, B, M, T, Y, Z} \ {G, K, L, M, O, Y} = {A, B, T, Z}

The EXCEPT keyword therefore returns all rows from the first query expression that are not part of the second query expression. Let's look at the TblA – TblB example again. The tables look like this:

setup for all-cases join example

Let's run the following query:

[sql] SELECT TblA_Val FROM dbo.TblA
EXCEPT
SELECT TblB_Val FROM dbo.TblB
[/sql]

This query results in only a single row:

except example

The value 1 is the only one that is contained in TblA but not contained in TblB.

This looks exactly like the result we got back from the NOT EXISTS() query in the Left Anti Semi Join post. But again, there is one important difference. To show that difference we need to add two more rows to TblA:

[sql] INSERT INTO dbo.TblA(TblA_Val, TblA_Desc)
VALUES (9,'A-9 (1st'), (9, 'A-9 (2nd)');
[/sql]

Now the result of above query looks like this:

except example with additional rows

Now let's recall the NOT EXISTS() query, modified to also only return the TblA_Val column:

[sql] SELECT TblA_Val
FROM dbo.TblA AS a
WHERE NOT EXISTS ( SELECT 1
FROM dbo.TblB AS b
WHERE a.TblA_Val = b.TblB_Val );
[/sql]

The result of this query looks like this:

not exists example with additional rows

The difference between EXCEPT and NOT EXISTS() is the same as the difference between INTERSECT and EXISTS(): The EXCEPT keyword causes duplicate rows to be eliminated from the final result. This is intentional as the action is borrowed from set theory and a set cannot contain the same element more than once.

If the column list contains the primary key (or another unique column) from TblA, the result of an EXCEPT query and a NOT EXISTS() query are the same.

If you are following along with the examples use the following statement to clean up the additional rows.

[sql] DELETE dbo.TblA WHERE TblA_Val = 9;
[/sql]

EXECPT Syntax

Like INTERSECT, the use of the EXCEPT key word tends to lead to cleaner and easier to maintain queries.

The syntax is similar to the UNION and the INTERSECT statements. You can connect any two queries with the EXCEPT statement. The only restriction is here too that the column count and data types must match. Again, implicit conversions are allowed.

The full EXCEPT syntax you can find here: http://msdn.microsoft.com/en-us/library/ms188055(v=sql.105).aspx

EXCEPT Operator

As you might have expected by now, the operator of choice for EXCEPT queries is the Left Anti Semi Join:

except execution plan

A Stream Aggregate operator is added here too, to remove the duplicate rows.

Summary

EXCEPT is a T-SQL statement that was introduced in SQL Server 2005. It returns the set-theoretical difference of two query results. Rows from the left are returned as long as they are not part of the right also. Because the result is meant to be a set of rows, duplicate rows are automatically removed.

A Join A Day

This post is part of my December 2012 "A Join A Day" blog post series. You can find the table of contents with all posts published so far in the introductory post: A Join A Day – Introduction. Check back there frequently throughout the month.

Categories: A Join A Day, Fundamentals, General, Series, T-SQL Statements

Leave a Reply