Skip to content
Advertisement

Break out nested data within SQL, criteria across multiple rows (similar to dcast in R)

I’m trying to write a simple query to take a data set that looks like this:

and return this:

Essentially, I have an ID column that can have either B, C, or D in Col2. I am trying to identify which IDs only have B and D. I have a query to find both, but not only that combination. Query:

Alternatively, I could use help in finding a way to filter that query on B and D and leave off B and C. I have seen perhaps a self join, but am not sure how to implement that.

Thanks!

EDIT: Most of the data set has, for a given ID, all three of B, C, and D. The goal here is to isolate the IDs that are missing one, namely missing C.

Advertisement

Answer

I am trying to identify which IDs only have B and D. I have a query to find both

If this is what you want, you don’t need multiple columns:

If you want only 'B' and 'D' and no others, then:

If there are only two columns, you can also easily pivot the values using aggregation:

User contributions licensed under: CC BY-SA
7 People found this is helpful
Advertisement