How to group data within a range of contigious timestamps

Question

I have a table made up of rows of data collected through an indeterministic polling process. Each row has a start and end timestamp denoting the time period in which the data was collected. In some ...

Accepted Answer

This is a simplified gaps-and-island problem. Assuming that your RDBMS support window functions, you can approach this with a window sum. When the Start_Timestamp of record is different than the End_Timestamp of the previous record, a new group starts:select    t.Row,    sum(case when Start_Timestamp = lag_End_Timestamp then 0 else 1 end)         over(order by End_Timestamp) series,    t.Start_Timestamp,    t.End_Timestamp,    t.Data_Itemfrom (    select        t.*,        lag(End_Timestamp) over (order by End_Timestamp) lag_End_Timestamp    from mytable t) tDemo on DB Fiddle:Row | series | Start_Timestamp     | End_Timestamp       | Data_Item--: | -----: | :------------------ | :------------------ | --------:  1 |      1 | 2019-08-12 22:07:53 | 2019-08-12 22:09:57 |       100  2 |      1 | 2019-08-12 22:09:57 | 2019-08-12 22:12:01 |       203  3 |      1 | 2019-08-12 22:12:01 | 2019-08-12 22:13:03 |       487  4 |      1 | 2019-08-12 22:13:03 | 2019-08-12 22:16:19 |       113  5 |      2 | 2019-08-12 22:24:34 | 2019-08-12 22:26:37 |       632  6 |      2 | 2019-08-12 22:26:37 | 2019-08-12 22:27:40 |       532  7 |      2 | 2019-08-12 22:27:40 | 2019-08-12 22:28:42 |       543  8 |      2 | 2019-08-12 22:28:42 | 2019-08-12 22:31:57 |       142  9 |      3 | 2019-08-13 19:56:06 | 2019-08-13 19:57:08 |       351 10 |      3 | 2019-08-13 19:57:08 | 2019-08-13 19:58:10 |       982

Advertisement

Answer