Using SQL to group consecutive items that share a common status (dummy data included)

Question

Given a table that has sometimes repeated statuses within a group (in this case &#8220;vehicles&#8221;), I want to consolidate those statuses into a single row and aggregate status_seconds. The data looks like this (I&#8217;ll include some TSQL below to select dummy data into a temp table to make it easy to w…

Accepted Answer

This is a typical gaps-and-islands problem, where you want to group together &#8220;adjacent&#8221; rows that share the same vehicle and status (the islands).You don&#8217;t need a recursive query for this: window functions can get this done. Here, the simplest approach probably is to use the difference between row numbers to identify the groups.select vehicle_name, vehicle_status,     min(status_end_time) as min_status_end_time,     max(status_end_time) as max_status_end_time,     sum(status_seconds)  as sum_status_seconds from (    select vs.*,         row_number() over(partition by vehicle_name order by status_end_time) rn1,        row_number() over(partition by vehicle_name, vehicle_status order by status_end_time) rn2    from ##vehiclesAndStates vs) tgroup by vehicle_name, vehicle_status, rn1 - rn2order by vehicle_name, min(status_end_time)You can run the subquery separately and look how the row numbers change to understand more.For your sample data, the query returns:vehicle_name | vehicle_status | min_status_end_time     | max_status_end_time     | sum_status_seconds:----------- | :------------- | :---------------------- | :---------------------- | -----------------:T101         | STOPPED        | 2020-12-04 09:43:18.000 | 2020-12-04 09:43:20.000 |                  4T101         | TURNING        | 2020-12-04 09:43:22.000 | 2020-12-04 09:43:22.000 |                  1T101         | TRAVELLING     | 2020-12-04 09:43:23.000 | 2020-12-04 09:43:33.000 |                 11T101         | TURNING        | 2020-12-04 09:43:34.000 | 2020-12-04 09:43:34.000 |                  1T101         | TRAVELLING     | 2020-12-04 09:43:35.000 | 2020-12-04 09:43:35.000 |                  1T101         | STOPPED        | 2020-12-04 09:43:35.000 | 2020-12-04 09:43:35.000 |                  3T102         | STOPPED        | 2020-12-04 09:43:23.000 | 2020-12-04 09:43:23.000 |                 10T102         | STOPPPED       | 2020-12-04 09:43:33.000 | 2020-12-04 09:43:35.000 |                 10T102         | PARKED         | 2020-12-04 09:43:35.000 | 2020-12-04 09:43:35.000 |                 10

Advertisement

Answer