Why is my join filter being applied to my entire query in Redshift?

Question

I'm trying to build a tricky join across grains, with tableA being at a lower grain than tableB. In this example, I'm trying to accomplish the following results: tableA.id tableA.fieldA tableB.id ...

Accepted Answer

I wrote the following code per your description and SQL on my Redshift and I get the answer you are looking for.

CREATE TABLE table_a (
  ID int,
  A int
);

INSERT INTO table_a VALUES (123, 1);
INSERT INTO table_a VALUES (123, 2);
INSERT INTO table_a VALUES (123, 3);
INSERT INTO table_a VALUES (234, 1);
INSERT INTO table_a VALUES (234, 2);
INSERT INTO table_a VALUES (234, 3);

CREATE TABLE table_b (
  ID int
);

INSERT INTO table_b VALUES (123);
INSERT INTO table_b VALUES (234);

Select
    *
from
    table_A
    left join table_B
    on table_B.id = table_A.id
    and table_A.a = 1;

Results in:

id   a   id
123  1   123
123  2   NULL
123  3   NULL
234  1   234
234  2   NULL
234  3   NULL

I see a few possibilities at the moment: 1) your cluster is having issues / bug in the exact version you are running OR 2) your question write up doesn't represent the whole picture OR 3) my code doesn't represent your situation.

Does the code I provided above recreate the issue on your cluster? If not, can you provide the necessary ingredients (DDL, SQL) to recreate the issue?

tableA.id	tableA.fieldA	tableB.id
123	1	123
123	2	null
123	3	null
234	1	234
234	2	null
234	3	null

tableA.id	tableA.fieldA	tableB.id
123	1	123
234	1	234
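
The two result sets above can be reproduced without a cluster. The sketch below is only an illustration under stated assumptions: it uses Python's built-in sqlite3 module as a stand-in for Redshift, reuses the table_a / table_b data from the accepted answer, and guesses at one common way to end up with the two-row result, namely putting the filter in a WHERE clause instead of the ON clause. The original query in the question is truncated, so this may not be what actually happened there.

```python
import sqlite3

# Assumption: an in-memory SQLite database stands in for Redshift.
# LEFT JOIN / ON / WHERE semantics used here are standard SQL.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE table_a (id int, a int);
    CREATE TABLE table_b (id int);
    INSERT INTO table_a VALUES (123, 1), (123, 2), (123, 3),
                               (234, 1), (234, 2), (234, 3);
    INSERT INTO table_b VALUES (123), (234);
""")

# Filter in the ON clause: it only limits which table_b rows match,
# so every table_a row is kept and unmatched rows get NULL (None).
on_rows = conn.execute("""
    SELECT table_a.id, table_a.a, table_b.id
    FROM table_a
    LEFT JOIN table_b
        ON table_b.id = table_a.id
        AND table_a.a = 1
    ORDER BY table_a.id, table_a.a
""").fetchall()

# Hypothetical variant: the same filter in WHERE is applied after the
# join, so it removes rows from the whole result.
where_rows = conn.execute("""
    SELECT table_a.id, table_a.a, table_b.id
    FROM table_a
    LEFT JOIN table_b
        ON table_b.id = table_a.id
    WHERE table_a.a = 1
    ORDER BY table_a.id, table_a.a
""").fetchall()

print(on_rows)     # six rows, matching the first table above
print(where_rows)  # two rows, matching the second table above
```

If the real query has the filter in ON and still returns only two rows, it is worth checking for an additional WHERE condition on a table_B column, since that would also discard the NULL rows.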

Answer