Count string occurrences within a list column – Snowflake/SQL

Question

I have a table with a column that contains a list of strings like below: EXAMPLE: STRING User_ID [...] "[""null"&...

Accepted Answer

Based on your description, here is my sample table:

    create table u (user_id number, string varchar);

    insert into u values
    (2122213, '"[""null"",""personal"",""Other""]"'),
    (2132214, '"[""Other"",""to_dos_and_thing""]"'),
    (2132215, '"[""getting_things_done"",""TO_dos_and_thing"",""Work!!!!!""]"');

I used SPLIT_TO_TABLE to split each string as a row, and then REGEXP_SUBSTR to clean the data. So here's the query and output:

    select REGEXP_SUBSTR( s.VALUE, '""(.*)""', 1, 1, 'i', 1 ) extracted, count(*)
    from u,
    lateral SPLIT_TO_TABLE( string, ',' ) s
    GROUP BY extracted
    order by count(*) DESC;

    +---------------------+----------+
    |      EXTRACTED      | COUNT(*) |
    +---------------------+----------+
    | Other               |        2 |
    | null                |        1 |
    | personal            |        1 |
    | to_dos_and_thing    |        1 |
    | getting_things_done |        1 |
    | TO_dos_and_thing    |        1 |
    | Work!!!!!           |        1 |
    +---------------------+----------+

SPLIT_TO_TABLE: https://docs.snowflake.com/en/sql-reference/functions/split_to_table.html
REGEXP_SUBSTR: https://docs.snowflake.com/en/sql-reference/functions/regexp_substr.html
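Note that the 'i' parameter in the query above only makes the pattern match case-insensitively. It does not change the extracted text, so to_dos_and_thing and TO_dos_and_thing are counted as separate values. If values that differ only by case should be counted together (an assumption, since the question does not say so), one possible variation is to lowercase the extracted value before grouping. This is a minimal sketch against the same sample table u; the alias occurrences is just an illustrative name:

```sql
-- Sketch: same split-and-extract approach as above, but folding case
-- with LOWER() so that values differing only by case share one group.
-- Assumes the sample table u (user_id, string) from this answer.
select
    lower(REGEXP_SUBSTR( s.VALUE, '""(.*)""', 1, 1, 'i', 1 )) as extracted,
    count(*) as occurrences
from u,
    lateral SPLIT_TO_TABLE( string, ',' ) s
group by extracted
order by occurrences desc;
```

With the three sample rows, to_dos_and_thing and TO_dos_and_thing would fall into a single group with a count of 2, and Other would be reported as other.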


Answer