De-duplicate rows in GCP BigQuery (SQL) based on two columns [closed]

Question

I'm trying to output all columns while having certain rows de-duplicated. Everything I've

Accepted Answer

Consider using ARRAY_AGG:

    with TestData as (
      select 'Tom' as Name, '1' as Phone, timestamp('2020-01-01 00:00:00') as LastUpdateDate
      union all
      select 'Tom' as Name, '2' as Phone, timestamp('2020-01-02 00:00:00') as LastUpdateDate
      union all
      select 'Eva' as Name, '3' as Phone, timestamp('2020-01-03 00:00:00') as LastUpdateDate
      union all
      select 'Eva' as Name, '4' as Phone, timestamp('2020-01-04 00:00:00') as LastUpdateDate
    )
    SELECT deduplicated.*
    FROM (
      SELECT ARRAY_AGG(t ORDER BY t.LastUpdateDate DESC LIMIT 1)[OFFSET(0)] as deduplicated
      FROM TestData as t
      GROUP BY Name
    )
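The query above packs each row into a struct, keeps only the newest struct per Name, and expands it back into columns. For comparison, here is a minimal sketch of the same "keep the latest row per key" idea written with ROW_NUMBER instead of ARRAY_AGG. This is a swapped-in alternative, not part of the answer itself. It reuses the TestData rows from the answer; the helper column name rn is an assumption introduced for illustration.

```sql
-- Same sample rows as in the answer above.
WITH TestData AS (
  SELECT 'Tom' AS Name, '1' AS Phone, TIMESTAMP('2020-01-01 00:00:00') AS LastUpdateDate
  UNION ALL
  SELECT 'Tom', '2', TIMESTAMP('2020-01-02 00:00:00')
  UNION ALL
  SELECT 'Eva', '3', TIMESTAMP('2020-01-03 00:00:00')
  UNION ALL
  SELECT 'Eva', '4', TIMESTAMP('2020-01-04 00:00:00')
)
SELECT * EXCEPT (rn)  -- drop the helper column (rn is a hypothetical name)
FROM (
  SELECT
    t.*,
    -- Number the rows within each Name, newest first.
    -- To de-duplicate on a two-column key, list both key columns
    -- in PARTITION BY (the second column is not shown in the question).
    ROW_NUMBER() OVER (PARTITION BY Name ORDER BY LastUpdateDate DESC) AS rn
  FROM TestData AS t
)
WHERE rn = 1  -- keep only the newest row per key
```

With this sample data, both queries should return one row per Name: Tom with Phone '2' and Eva with Phone '4'. Row order is not guaranteed unless an ORDER BY is added to the outer query.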

