SQL consolidate overlapping dates based on criteria

Question

I'm trying to merge overlapping date ranges between the admit and discharge dates of patients. There are a few edge cases that my query doesn't cover. I used the logic that was here, but it doesn't cover the edge cases for IDs 2 and 3. The subquery is also slow when the data is large. Is

Accepted Answer

This is a type of gaps-and-islands problem. I would suggest using a cumulative max to determine when an "island" starts and then aggregate:

    select id, min(admit_dt), max(discharge_dt)
    from (select t.*,
                 sum(case when prev_discharge_dt >= admit_dt then 0 else 1 end)
                     over (partition by id order by admit_dt, discharge_dt) as grp
          from (select t.*,
                       max(discharge_dt) over (partition by id
                                               order by admit_dt, discharge_dt
                                               rows between unbounded preceding and 1 preceding) as prev_discharge_dt
                from t
               ) t
         ) t
    group by id, grp;

Here is a db<>fiddle.

The innermost subquery retrieves the maximum discharge date before each row. This allows you to check for an overlap. The middle subquery counts the number of times there is no overlap, which marks the beginning of a group. The outer query aggregates.
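To make the three steps concrete, here is a minimal sketch that runs the same query against an in-memory SQLite database from Python. The sample rows are hypothetical (the question's original input was not preserved), the helper name `merge_stays` is made up for illustration, and the sketch assumes dates stored as ISO `YYYY-MM-DD` text and an SQLite build with window-function support (3.25 or later).

```python
import sqlite3

# Hypothetical sample rows (id, admit_dt, discharge_dt); not the asker's data.
ROWS = [
    (1, "2020-01-01", "2020-01-05"),
    (1, "2020-01-04", "2020-01-10"),  # overlaps the previous stay
    (1, "2020-02-01", "2020-02-03"),  # separate stay, starts a new island
    (2, "2020-03-01", "2020-03-20"),  # long stay ...
    (2, "2020-03-05", "2020-03-07"),  # ... that fully contains this one
    (2, "2020-03-15", "2020-03-25"),  # overlaps only the long stay
]

# Same query as in the answer: running max -> island flag -> aggregate.
QUERY = """
select id, min(admit_dt), max(discharge_dt)
from (select t.*,
             sum(case when prev_discharge_dt >= admit_dt then 0 else 1 end)
                 over (partition by id order by admit_dt, discharge_dt) as grp
      from (select t.*,
                   max(discharge_dt) over (partition by id
                                           order by admit_dt, discharge_dt
                                           rows between unbounded preceding and 1 preceding) as prev_discharge_dt
            from t
           ) t
     ) t
group by id, grp
"""


def merge_stays(rows):
    """Load rows into an in-memory table t and return the merged ranges, sorted."""
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute("create table t (id integer, admit_dt text, discharge_dt text)")
        conn.executemany("insert into t values (?, ?, ?)", rows)
        # The query has no ORDER BY, so sort in Python for a stable result.
        return sorted(conn.execute(QUERY).fetchall())
    finally:
        conn.close()


merged = merge_stays(ROWS)
for row in merged:
    print(row)
```

With these rows, the third stay for ID 2 joins the first island only because the running max still remembers the 2020-03-20 discharge. Comparing against just the immediately preceding row (for example with `lag`) would see 2020-03-07 and wrongly start a new island.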


Answer