BigQuery – How to find count of unique overlapping values in 1 or more categories (count of categorical values)?

Question

I am very new to BigQuery and standard SQL, and I have not been able to figure out the correct approach to this problem. Please help me change this code to get the desired output. I have a Color column and an ID column. Example shown below:

Color | ID
Blue  | id_1
Blue  | id_5

Accepted Answer

I would do this on separate rows rather than columns:

    select cnt, count(*) as num_colors
    from (select id, count(*) as cnt
          from t
          group by id
         ) i
    group by cnt
    order by cnt

If you want this by columns, you can use conditional aggregation:

    select countif(cnt = 1),
           countif(cnt = 2),
           countif(cnt = 3),
           countif(cnt = 4)
    from (select id, count(*) as cnt
          from t
          group by id
         ) i;

Note: This assumes that the id/color rows are unique in the original data. Otherwise, use count(distinct color) as cnt.
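To see how the row-wise query behaves when the data does contain duplicate id/color rows, here is a minimal sketch with inline sample data. Only the first two rows come from the question; the rest are made up for illustration, and the alias num_ids is my own choice (the outer count is the number of ids that have cnt distinct colors). It uses count(distinct color), as suggested in the note above.

```sql
-- Hypothetical sample data: only the first two rows appear in the question.
with t as (
  select 'Blue' as color, 'id_1' as id union all
  select 'Blue',  'id_5' union all
  select 'Red',   'id_1' union all
  select 'Red',   'id_5' union all
  select 'Green', 'id_5' union all
  select 'Green', 'id_2' union all
  select 'Blue',  'id_1'   -- deliberate duplicate of the first row
)
select cnt, count(*) as num_ids
from (
  -- count(distinct color) ignores the duplicated Blue/id_1 row
  select id, count(distinct color) as cnt
  from t
  group by id
) i
group by cnt
order by cnt;
-- Result for this sample: (1, 1), (2, 1), (3, 1)
--   id_2 has 1 color, id_1 has 2 colors, id_5 has 3 colors
```

With plain count(*) in the inner query, the duplicate row would push id_1 to cnt = 3, and the result would be (1, 1), (3, 2) instead.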

