
Postgres: how to check for records in time buckets for multiple values

I have a Postgres table with roughly the following shape:

Given:

  • Start time
  • End time
  • Bucket width
  • List of sensor ids

I’d like to write a query that slices the range [start_time, end_time] into sub-intervals (buckets) of width bucket_width and, for each bucket, checks whether each of the sensors in the list has a record with recorded_at falling in that bucket.

So for example, assuming inputs of

  • Start time = '2020-01-01 00:00:00'
  • End time = '2020-01-01 02:00:00'
  • Bucket width = '1 hour'
  • List of sensor ids = ['A', 'B', 'C']

the query should return something like

I don’t need the actual count, so I imagine LIMIT 1 will show up somewhere. I just included it in the example to help make it clear what I’m looking for.


Answer

One option uses generate_series() to generate the buckets, then a left join to bring in the table, and finally conditional aggregation to get the count per sensor:
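A minimal sketch of that approach follows. The table name readings and the column name sensor_id are assumptions, since the original table definition is not shown here; only recorded_at comes from the question. The inline VALUES rows are hypothetical sample data included so the query can run on its own.

```sql
-- Hypothetical stand-in for the real table: the names "readings" and
-- "sensor_id" are assumptions, and the rows are made-up sample data.
WITH readings (sensor_id, recorded_at) AS (
    VALUES
        ('A', timestamp '2020-01-01 00:15:00'),
        ('B', timestamp '2020-01-01 00:45:00'),
        ('A', timestamp '2020-01-01 01:30:00')
)
SELECT
    g.bucket_start,
    g.bucket_start + interval '1 hour' AS bucket_end,
    -- Conditional aggregation: one count per sensor of interest.
    -- Unmatched buckets yield NULL sensor_id, so the count is 0.
    count(*) FILTER (WHERE r.sensor_id = 'A') AS a,
    count(*) FILTER (WHERE r.sensor_id = 'B') AS b,
    count(*) FILTER (WHERE r.sensor_id = 'C') AS c
FROM generate_series(
         timestamp '2020-01-01 00:00:00',   -- start time
         timestamp '2020-01-01 02:00:00',   -- end time
         interval '1 hour'                  -- bucket width
     ) AS g(bucket_start)
-- Left join keeps buckets with no records; each bucket is half-open,
-- [bucket_start, bucket_start + width).
LEFT JOIN readings r
       ON r.recorded_at >= g.bucket_start
      AND r.recorded_at <  g.bucket_start + interval '1 hour'
GROUP BY g.bucket_start
ORDER BY g.bucket_start;
```

If only presence matters rather than the count, each aggregate can be compared with zero, for example count(*) FILTER (WHERE r.sensor_id = 'A') > 0, which returns a boolean per bucket.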

Note that this generates an extra bucket (from 02:00:00 to 03:00:00) compared to your desired results. If you want to avoid that, you can slightly modify the generate_series() arguments, like:
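One way to do this, sketched here with the example inputs from the question, is to subtract one bucket width from the end time so that the last generated bucket starts at 01:00:00. The alias g(bucket_start) is just an illustrative name.

```sql
-- Stop the series one bucket width before the end time, so no bucket
-- starts at the end of the requested range.
SELECT g.bucket_start
FROM generate_series(
         timestamp '2020-01-01 00:00:00',
         timestamp '2020-01-01 02:00:00' - interval '1 hour',
         interval '1 hour'
     ) AS g(bucket_start);
```

With these inputs the series contains two rows, 00:00:00 and 01:00:00, which cover the range [00:00:00, 02:00:00) when each bucket is treated as one hour wide.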

User contributions licensed under: CC BY-SA