What is the best way in SQL to combine sequential events based on matching end time to start time?

Question

That database I work in records events based on a part ID and the times in which it is active. The issue I came across is these events are truncated to fit within a single day. If the active time for a part carries over to the next day, the event will be split by the number of days it

Accepted Answer

This is a gaps and island problem, where you want to group together adjacent rows.Here is one solution that uses window functions:select     min(date) date,    part_id,    min(active_start) active_start,    max(active_end) active_endfrom (    select        t.*,        sum(case when lag_active_end = active_start then 0 else 1 end)            over(partition by part_id order by active_start) grp    from (        select             t.*,             lag(active_end) over(partition by part_id order by active_start) lag_active_end        from mytable t    ) t) tgroup by part_id, grpThe most inner query retrieves the end date of the previous record that has the same part_id. The intermediate query does a window sum that increases by 1 every time the previous end date is not equal to the current start date: this defines the groups of adjacent rows. Finally, the outer query aggregates by group, and computes the start and end of the range.

Advertisement

Answer