I have a table (EMAIL) of email addresses:
EmailAddress ------------ jack@aol.com jill@aol.com tom@aol.com bill@aol.lcom
and a table (BLACKLIST) of blacklisted email addresses:
EmailAddress ------------ jack@aol.com jill@aol.com
and I want to select those email addresses that are in the EMAIL table but NOT in the BLACKLIST table. I’m doing:
SELECT EmailAddress
FROM EMAIL
WHERE EmailAddress NOT IN
(
SELECT EmailAddress
FROM BLACKLIST
)
but when the row counts get very high the performance is terrible.
How can I better do this? (Assume generic SQL if possible. If not, assume T-SQL.)
Advertisement
Answer
You can use a left outer join, or a not exists clause.
Left outer join:
select E.EmailAddress from EMAIL E left outer join BLACKLIST B on (E.EmailAddress = B.EmailAddress) where B.EmailAddress is null;
Not Exists:
select E.EmailAddress
from EMAIL E where not exists
(select EmailAddress from BLACKLIST B where B.EmailAddress = E.EmailAddress)
Both are quite generic SQL solutions (don’t depend on a specific DB engine). I would say that the latter is a little bit more performant (not by much though). But definitely more performant than the not in one.
As commenters stated, you can also try creating an index on BLACKLIST(EmailAddress), that should help speed up the execution of your query.