Similar to groupByKey() in Spark, but using SQL queries

Question

I am trying to do this using only SQL queries. It is similar to using groupByKey() in PySpark. Is there a way to do this?

Accepted Answer

Just use conditional aggregation. One method is:

    select id,
           max(case when category = 'X' then value end) as x_value,
           max(case when category = 'Y' then value end) as y_value
    from t
    group by id;

In Postgres, this would be phrased using the standard filter clause:

    select id,
           max(value) filter (where category = 'X'),
           max(value) filter (where category = 'Y')
    from t
    group by id;
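To try the conditional aggregation query end to end, here is a minimal sketch using Python's built-in sqlite3 module. The table name t and the columns id, category and value come from the query above; the sample rows are made up for illustration, and the order by clause is added only to make the output order predictable.

```python
import sqlite3

# Hypothetical sample data: the table and column names follow the answer's
# query, but these rows are invented purely for illustration.
rows = [
    (1, "X", 10),
    (1, "Y", 20),
    (2, "X", 30),
    (2, "X", 5),
    (3, "Y", 40),
]

conn = sqlite3.connect(":memory:")
conn.execute("create table t (id integer, category text, value integer)")
conn.executemany("insert into t values (?, ?, ?)", rows)

# Conditional aggregation: each CASE yields the value only for its category
# and NULL otherwise; max() ignores NULLs, so each output column holds the
# largest value seen for that category within the id group.
query = """
select id,
       max(case when category = 'X' then value end) as x_value,
       max(case when category = 'Y' then value end) as y_value
from t
group by id
order by id
"""
result = conn.execute(query).fetchall()
conn.close()

for row in result:
    print(row)
```

An id with no rows for a category gets NULL (None in Python) in that column. If an id has several rows for the same category, max() keeps only the largest value, so a different aggregate would be needed to keep all of them.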
