I have a table of employees similar to this:
Department   Data
A            [{"name":"John", "age":10, "job":"Manager"}, {"name":"Eli", "age":40, "job":"Worker"}, {"name":"Sam", "age":32, "job":"Manager"}]
B            [{"name":"Jack", "age":50, "job":"CEO"}, {"name":"Mike", "age":334, "job":"CTO"}, {"name":"Filip", "age":63, "job":"Worker"}]
I want to get the department, name, and age of all employees, something similar to this:
Department   Data
A            [{"name":"John", "age":10}, {"name":"Eli", "age":40}, {"name":"Sam", "age":32}]
B            [{"name":"Jack", "age":50}, {"name":"Mike", "age":334}, {"name":"Filip", "age":63}]
How can I achieve this with a SQL query?
Answer
I assume you are using Hive/Spark and that the data type of the column is an array of maps. You can use the explode, collect_list, and map functions:
select dept,
       collect_list(map("name", t.map_elem['name'], "age", t.map_elem['age'])) as res
from tbl
lateral view explode(data) t as map_elem
group by dept
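To reproduce this end to end, here is a minimal sketch that builds the sample rows as an array<map<string,string>> column and runs the query above. The table name tbl and the column names dept/data are assumptions taken from the query rather than your real schema, and the ages are stored as strings so that each map holds a single value type.

import org.apache.spark.sql.SparkSession

val spark = SparkSession.builder().appName("keep-name-age").master("local[*]").getOrCreate()

// Rebuild the sample data: one dept column and one array<map<string,string>> column.
spark.sql("""
  select 'A' as dept, array(
    map('name', 'John', 'age', '10', 'job', 'Manager'),
    map('name', 'Eli',  'age', '40', 'job', 'Worker'),
    map('name', 'Sam',  'age', '32', 'job', 'Manager')) as data
  union all
  select 'B', array(
    map('name', 'Jack',  'age', '50',  'job', 'CEO'),
    map('name', 'Mike',  'age', '334', 'job', 'CTO'),
    map('name', 'Filip', 'age', '63',  'job', 'Worker'))
""").createOrReplaceTempView("tbl")

// explode flattens each array element into its own row, map rebuilds the element with
// only the wanted keys, and collect_list groups the rows back into one array per dept.
spark.sql("""
  select dept,
         collect_list(map('name', map_elem['name'], 'age', map_elem['age'])) as res
  from tbl
  lateral view explode(data) t as map_elem
  group by dept
""").show(false)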
Note that this would not be as performant as a Spark DataFrame solution or a UDF, with which you can access the required keys in an array of maps without a function like explode.
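To illustrate the UDF route mentioned above (this is a sketch of my own, not part of the original answer; keepNameAge is a made-up name, and it reuses the tbl view and dept/data columns from the earlier sketch), you can filter each map in the array in a single pass, with no explode or group by:

import org.apache.spark.sql.functions.{col, udf}

// Keep only the "name" and "age" entries of every map in the array.
val keepNameAge = udf { (data: Seq[Map[String, String]]) =>
  data.map(_.filter { case (k, _) => k == "name" || k == "age" })
}

spark.table("tbl")
  .select(col("dept"), keepNameAge(col("data")).as("res"))
  .show(false)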
One more way to do this is with the Spark SQL functions transform and map_filter (map_filter is only available starting with Spark 3.0.0).
spark.sql("select dept,transform(data, map_elem -> map_filter(map_elem, (k, v) -> k != "job")) as res from tbl")
Another option with Spark 2.4 and above is to use the element_at function together with transform and select only the required keys.
spark.sql("select dept," + "transform(data, map_elem -> map("name",element_at(map_elem,"name"),"age",element_at(map_elem,"age"))) as res " + "from tbl")