Cumulative elapsed minutes on hourly basis in PostgreSQL

Question

I have a datetime column. I need to derive a column of total minutes elapsed from the first to the last value of every hour grouped by hour, but, in cases of overlapping event, the time should be ...

Accepted Answer

This is a gaps-and-islands problem with some twists. First, I would summarize by the “islands” defined by the gaps of 30 minutes:select min(moves_ts) as start_ts, max(moves_ts) as end_tsfrom (select o.*, count(prev_moves_ts) filter (where moves_ts > prev_moves_ts + interval '30 minute') over (order by moves_ts) as grp from (select o.*, lag(moves_ts) over (order by moves_ts) as prev_moves_ts from original o ) o ) ogroup by grp;Then you can use this with generate_series() to expand the data and calculate the overlaps with each hour:with islands as ( select min(moves_ts) as start_ts, max(moves_ts) as end_ts from (select o.*, count(prev_moves_ts) filter (where moves_ts > prev_moves_ts + interval '30 minute') over (order by moves_ts) as grp from (select o.*, lag(moves_ts) over (order by moves_ts) as prev_moves_ts from original o ) o ) o group by grp )select hh.hh, sum( least(hh.hh + interval '1 hour', i.end_ts) - greatest(hh.hh, i.start_ts) ) as duration from (select generate_series(date_trunc('hour', min(moves_ts)), date_trunc('hour', max(moves_ts)), interval '1 hour' ) hh from original o ) hh left join islands i on i.start_ts < hh.hh + interval '1 hour' and i.end_ts >= hh.hhgroup by hh.hhorder by hh.hh;Here is a db<>fiddle.

Advertisement

Answer