This article includes a list ofgeneral references, butit lacks sufficient correspondinginline citations. Please help toimprove this article byintroducing more precise citations.(July 2022) (Learn how and when to remove this message) |
Set operations inSQL is a type of operations which allow the results of multiple queries to be combined into a singleresult set.[1]
Set operators in SQL includeUNION,INTERSECT, andEXCEPT, whichmathematically correspond to the concepts ofunion,intersection andset difference.
InSQL theUNION clause combines the results of two SQL queries into a singletable of all matchingrows. The two queries must result in the same number ofcolumns and compatibledata types in order to unite. Any duplicate records are automatically removed unlessUNION ALL is used.
UNION can be useful indata warehouse applications where tables are not perfectlynormalized.[2] A simple example would be a database having tablessales2005 andsales2006 that have identical structures but are separated because of performance considerations. AUNION query could combine results from both tables.
Note thatUNION ALL does not guarantee the order of rows. Rows from the second operand may appear before, after, or mixed with rows from the first operand. In situations where a specific order is desired,ORDER BY must be used.
Note thatUNION ALL may be much faster than plainUNION.
Given these two tables:
| person | amount |
|---|---|
| Joe | 1000 |
| Alex | 2000 |
| Bob | 5000 |
| person | amount |
|---|---|
| Joe | 2000 |
| Alex | 2000 |
| Zach | 35000 |
Executing this statement:
SELECT*FROMsales2005UNIONSELECT*FROMsales2006;
yields this result set, though the order of the rows can vary because noORDER BY clause was supplied:
| person | amount |
|---|---|
| Joe | 1000 |
| Alex | 2000 |
| Bob | 5000 |
| Joe | 2000 |
| Zach | 35000 |
Note that there are two rows for Joe because those rows are distinct across their columns. There is only one row for Alex because those rows are not distinct for both columns.
UNION ALL gives different results, because it will not eliminate duplicates. Executing this statement:
SELECT*FROMsales2005UNIONALLSELECT*FROMsales2006;
would give these results, again allowing variance for the lack of anORDER BY statement:
| person | amount |
|---|---|
| Joe | 1000 |
| Joe | 2000 |
| Alex | 2000 |
| Alex | 2000 |
| Bob | 5000 |
| Zach | 35000 |
The discussion offull outer joins also has an example that usesUNION.
The SQLINTERSECT operator takes the results of two queries and returns only rows that appear in both result sets. For purposes of duplicate removal theINTERSECT operator does not distinguish betweenNULLs. TheINTERSECT operator removes duplicate rows from the final result set. TheINTERSECT ALL operator does not remove duplicate rows from the final result set, but if a row appears X times in the first query and Y times in the second, it will appear times in the result set.
The following exampleINTERSECT query returns all rows from the Orders table where Quantity is between 50 and 100.
SELECT*FROMOrdersWHEREQuantityBETWEEN1AND100INTERSECTSELECT*FROMOrdersWHEREQuantityBETWEEN50AND200;
The SQLEXCEPT operator takes the distinct rows of one query and returns the rows that do not appear in a second result set. For purposes of row elimination and duplicate removal, theEXCEPT operator does not distinguish betweenNULLs. TheEXCEPT ALL operator does not remove duplicates, but if a row appears X times in the first query and Y times in the second, it will appear times in the result set.
Notably, the Oracle platform provides aMINUS operator which is functionally equivalent to theSQL standardEXCEPT DISTINCT operator.[3]
The following exampleEXCEPT query returns all rows from the Orders table where Quantity is between 1 and 49, and those with a Quantity between 76 and 100.
Worded another way; the query returns all rows where the Quantity is between 1 and 100, apart from rows where the quantity is between 50 and 75.
SELECT*FROMOrdersWHEREQuantityBETWEEN1AND100EXCEPTSELECT*FROMOrdersWHEREQuantityBETWEEN50AND75;
The following example is equivalent to the above example but without using theEXCEPT operator.
SELECTo1.*FROM(SELECT*FROMOrdersWHEREQuantityBETWEEN1AND100)o1LEFTJOIN(SELECT*FROMOrdersWHEREQuantityBETWEEN50AND75)o2ONo1.id=o2.idWHEREo2.idISNULL
UNION ALL views technique for managing maintenance and performance in your large data warehouse environment ... ThisUNION ALL technique has saved many of my clients with issues related to time-sensitive database designs. These databases usually have an extremely volatile current timeframe, month, or day portion and the older data is rarely updated. Using different container DASD allocations, tablespaces, tables, and index definitions, the settings can be tuned for the specific performance considerations for these different volatility levels and update frequency situations."Terabyte Data Warehouse Table Design Choices - Part 2 (accessed on July 25, 2006)EXCEPT DISTINCT table operator: UseMINUS instead ofEXCEPT DISTINCT""Oracle Compliance To Core SQL:2003".Docs.oracle.com. Retrieved7 July 2022.