Gap and Island problem – query not working for all periods

Question

I have to create a query to find the gaps and islands between dates. This seems to be a standard gaps and island problem. To show my issue I will use sample of data. The queries are executed in ...

Accepted Answer

You can express the gaps-and-islands logic like this:select min(startdate), max(enddate)from (select t.*,             sum(case when prev_enddate >= startdate then 0 else 1 end) over (order by startdate) as grp      from (select t.*,                   max(enddate) over (order by startdate rows between unbounded preceding and 1 preceding) as prev_enddate            from test t           ) t     ) tgroup by grporder by min(startdate);Here is a db<>fiddle.The idea is to look for the maximum enddate on all the &#8220;earlier&#8221; rows.  This value is used to check if there is an overlap.So, the innermost subquery calculates the previous enddate.  The middle subquery does a cumulative sum of the beginnings of groups to assign a group identifier.The outer query just aggregates by the group identifier.

Advertisement

Answer