Snowflake/SQL: create a time-series table such that every ID is visible, and if ID is null, it uses the previous value? (similar to shift)

Question

Suppose I have the following table: Day ID Value 2022-11-05 0 A 2022-11-06 1 B 2022-11-07 0 C Now given a time window of 1 day, I want to create a time-series table that: The Day column granular unit is 1 day Each Day row displays every ID in the table (like cross-join) Moreover, if for that day, the ID

Accepted Answer

Way One:first we start with some datathen we find the_days in the period we are interested inthen we find the data_start for each idthen we join those values together, and use LAG with the IGNORE NULLS OVER clause to find the &#8220;prior values&#8221; if the current values in not present via NVLwith data(Day, ID, Value) as (    select * from values        ('2022-11-05'::date, 0, 'A'),        ('2022-11-06'::date, 1, 'B'),        ('2022-11-07'::date, 0, 'C')), the_days as (    select         row_number() over (order by null)-1 as rn        ,dateadd('day', rn, from_day) as day    from (        select             min(day) as from_day            ,'2022-11-08' as to_day            ,datediff('days', from_day, to_day) as days        from data    ), table(generator(ROWCOUNT => 200))    qualify rn <= days), data_starts as (    select         id,         min(day) as start_day    from data    group by 1)select     td.day,    ds.id,    nvl(d.value, lag(d.value) ignore nulls over (partition by ds.id order by td.day)) as valuefrom data_starts as dsjoin the_days as td     on td.day >= ds.start_dayleft join data as d    on ds.id = d.id and d.day = td.dayorder by 1,2;gives:DAYIDVALUE2022-11-050A2022-11-060A2022-11-061B2022-11-070C2022-11-071B2022-11-080C2022-11-081BWay Two:with data(Day, ID, Value) as (    select * from values        ('2022-11-05'::date, 0, 'A'),        ('2022-11-06'::date, 1, 'B'),        ('2022-11-07'::date, 0, 'C')), the_days as (    select         dateadd('day', row_number() over (order by null)-1, '2022-11-05') as day    from table(generator(ROWCOUNT => 4)))select     td.day,    i.id,    nvl(d.value, lag(d.value) ignore nulls over (partition by i.id order by td.day)) as _valuefrom the_days as tdcross join (select distinct id from data) as ileft join data as d    on i.id = d.id and d.day = td.dayqualify _value is not nullorder by 1,2;this requires a unique name for the _values output so it can be referenced in the qualify without needing to duplicate the code.

Day	ID	Value
2022-11-05	0	A
2022-11-06	0	A
2022-11-06	1	B
2022-11-07	0	C
2022-11-07	1	B
2022-11-08	0	C
2022-11-08	1	B

Advertisement

Answer

Way One:

Way Two: