Is it possible to remove duplicates from the result for the data set?

Question

I have two following tables, dim_customers and fact_daily_customer_shipments: dim_customers +-------------+-----------------------+---------------------+ | customer_id | membership_start_date | ...

Accepted Answer

The reason you are having issues with duplication is that you have two entries in the dim_customers table with the same customer_id value (but different membership dates). What this means is that you need to change the JOIN condition to include the membership_dates. By then changing to a LEFT JOIN, we can determine whether a customer was a member at the time by whether the customer_id value from the JOIN is NULL. So the query you should use is:select fc.ship_date,        case when dc.customer_id is null then 'Y' else 'N' end as is_member,        sum(fc.quantity)from fact_daily_customer_shipments fcleft join dim_customers dc on dc.customer_id = fc.customer_id and fc.ship_date between dc.membership_start_date and dc.membership_end_dategroup by fc.ship_date, is_memberOutput:ship_date   is_member   sum(fc.quantity)2015-02-13  N           22015-03-01  N           72015-03-01  Y           102015-06-01  Y           12015-10-01  Y           3SQLFiddle Demo

Advertisement

Answer