Generate a random number for each group and assign it to all rows in the group

Question

I have a table in the form The goal is to group the table by ID and for each group, select a random number from the number of groups (in this case, select a random number from [1, 3]) and assign all rows of each group one number. One possible configuration is I was thinking of using ROW_NUMBER() and PARTITION…

Accepted Answer

If the random number can be sequential, you can use dense_rank():select t.*, dense_rank() over (order by id) as group_numfrom t;Or for a bit more randomness:select t.*,       dense_rank() over (order by farm_fingerprint(cast(id as string)), id) as group_numfrom t;Alternatively, a separate calculation by id might be simplest:select *from t join     (select id,             dense_rank() over (order by rand()) as group_num      from t      group by id     ) tt     using (id)

Advertisement

Answer