Skip to content
Advertisement

COUNT DISTINCT with CONDITIONS

I want to count the number of distinct items in a column subject to a certain condition, for example if the table is like this:

tag | entryID
----+---------
foo | 0
foo | 0
bar | 3

If I want to count the number of distinct tags as “tag count” and count the number of distinct tags with entry id > 0 as “positive tag count” in the same table, what should I do?

I’m now counting from two different tables where in the second table I’ve only selected those rows with entryID larger than zero. I think there should be a more compact way to solve this problem.

Advertisement

Answer

You can try this:

select
  count(distinct tag) as tag_count,
  count(distinct (case when entryId > 0 then tag end)) as positive_tag_count
from
  your_table_name;

The first count(distinct...) is easy. The second one, looks somewhat complex, is actually the same as the first one, except that you use case...when clause. In the case...when clause, you filter only positive values. Zeros or negative values would be evaluated as null and won’t be included in count.

One thing to note here is that this can be done by reading the table once. When it seems that you have to read the same table twice or more, it can actually be done by reading once, in most of the time. As a result, it will finish the task a lot faster with less I/O.

User contributions licensed under: CC BY-SA
5 People found this is helpful
Advertisement