I have a bunch of large datasets.
DS_1 (contains all unique IDs and names):
ID  Name
1   Apple
2   Banana
3   Cherry
DS_2:
ID  Observation
1   Apple detail
1   Apple detail
1   Apple detail
2   Banana detail
2   Banana detail
3   Cherry detail
3   Cherry detail
3   Cherry detail
DS_3:
ID  Observation
2   Banana detail
2   Banana detail
3   Cherry detail
I’m looking to create a new dataset that shows frequency counts per ID across the datasets (and, lastly, calculates Total_Obs). The output would look something like this:
ID  Name    DS_2  DS_3  Total_Obs
1   Apple   3     0     3
2   Banana  2     2     4
3   Cherry  3     1     4
The datasets are fairly large. Is there a more efficient way to do this than concatenating the datasets and building a frequency table, or than creating a bunch of sorted frequency tables and then merging by ID across all the datasets?
Answer
You can do the following:
select t1.id                                         as id
     , t1.name                                       as name
     , coalesce(DS_2_obs, 0)                         as DS_2_obs
     , coalesce(DS_3_obs, 0)                         as DS_3_obs
     , coalesce(DS_2_obs, 0) + coalesce(DS_3_obs, 0) as Total_Obs
from DS_1 t1
/* count observations per id in each detail dataset */
left join (select id, count(1) as DS_2_obs from DS_2 group by id) t2
  on t1.id = t2.id
left join (select id, count(1) as DS_3_obs from DS_3 group by id) t3
  on t1.id = t3.id;
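The left joins from DS_1 guarantee that every ID appears in the output even when it has no rows in one of the detail datasets (e.g. Apple in DS_3), and coalesce turns the resulting nulls into 0 so Total_Obs adds up correctly.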
Also, you should always tag which database you are using, since the exact syntax can depend on the SQL dialect.
If the above SQL takes a long time, then instead of using t2 and t3 as inline queries you can consider creating aggregate observation tables that hold the frequency counts, with an index on id. That way, when you join the observation aggregates to the primary table, the join can use the indexes.
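For example, here is a minimal sketch along those lines (the aggregate table names DS_2_agg/DS_3_agg and the index names are hypothetical, and the exact CREATE TABLE ... AS / CREATE INDEX syntax varies by database):

-- Materialize the per-ID counts once, instead of recomputing them inline
create table DS_2_agg as
select id, count(1) as DS_2_obs from DS_2 group by id;

create table DS_3_agg as
select id, count(1) as DS_3_obs from DS_3 group by id;

-- Index the join key so lookups against DS_1 are fast
create index idx_ds_2_agg_id on DS_2_agg (id);
create index idx_ds_3_agg_id on DS_3_agg (id);

-- The final query then joins the small aggregate tables directly
select t1.id, t1.name,
       coalesce(t2.DS_2_obs, 0) as DS_2_obs,
       coalesce(t3.DS_3_obs, 0) as DS_3_obs,
       coalesce(t2.DS_2_obs, 0) + coalesce(t3.DS_3_obs, 0) as Total_Obs
from DS_1 t1
left join DS_2_agg t2 on t1.id = t2.id
left join DS_3_agg t3 on t1.id = t3.id;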