How to use timebucket_gapfill when rows can have null values?

Question

I have a time series table where measurements are recorded into &#8220;wide&#8221; rows. Rows may contain all measurements or only some. The other columns are then set to NULL. I would like to use timebucket_gapfill() to &#8220;clean&#8221; this table and make sure that every row in the output has data in all…

Accepted Answer

Timescaledb does not consider NULL as missing values. I have to rewrite the query to avoid the rows with NULL values, that means doing multiple queries with timebucket_gapfill and joining the results together.This works and does what I wanted:SELECT    condh.ival, humidity, temperaturefrom(    select    time_bucket_gapfill('1000ms', time,      start => '2019-07-10 05:02:13',      finish => '2019-07-10 05:02:21'    ) as ival,    count(*) as samplesUsed,    interpolate(avg(humidity)) as humidity    FROM conditions    WHERE humidity is not NULL    GROUP BY ival) condh INNER JOIN (     SELECT    time_bucket_gapfill('1000ms', time,      start => '2019-07-10 05:02:13',      finish => '2019-07-10 05:02:21'    ) as ival,    count(*) as samplesUsed,    interpolate(avg(temperature)) as temperature    FROM conditions    WHERE temperature is not NULL    GROUP BY ival) condton (condt.ival = condh.ival)ORDER BY ival;Output:          ival          | humidity | temperature ------------------------+----------+------------- 2019-07-10 05:02:13-07 |          |             2019-07-10 05:02:14-07 |       50 |          70 2019-07-10 05:02:15-07 |       49 |          71 2019-07-10 05:02:16-07 |       48 |          72 2019-07-10 05:02:17-07 |       48 |      72.025 2019-07-10 05:02:18-07 |       48 |       72.05 2019-07-10 05:02:19-07 |       46 |      72.525 2019-07-10 05:02:20-07 |       45 |          73(8 rows)Got some help on the timescaledb slack &#8211; thanks gayathri.

Advertisement

Answer