SQL query to interpolate between values

Question

I intend to interpolate (linear interpolation) between values in a column and insert that into a new column using a SQL query. Based on my search online, I suspect LEAD analytic function could be useful. I am new to writing SQL queries. So, any insights on how it can be achieved will be quite helpful. The sample data set is

Accepted Answer

This can probably be simplified a bit but gets the answer you wanted, I believe. The slightly tricky bit is getting both the number of days between not-null values (i.e. the size of the gap you&#8217;re filling) and then the position within that gap:-- CTE for sample datawith your_table (emp, test_date, value) as (            select 'A', date '2001-01-01', null from dual  union all select 'A', date '2001-01-02', 100 from dual  union all select 'A', date '2001-01-03', null from dual  union all select 'A', date '2001-01-04', 80 from dual  union all select 'A', date '2001-01-05', null from dual  union all select 'A', date '2001-01-06', null from dual  union all select 'A', date '2001-01-07', 75 from dual)-- actual queryselect emp, test_date, value,  coalesce(value,    (next_value - prev_value) -- v3-v1    / (count(*) over (partition by grp) + 1) -- d3-d1    * row_number() over (partition by grp order by test_date desc) -- d2-d1, indirectly    + prev_value -- v1  ) as interpolatedfrom (  select emp, test_date, value,    last_value(value ignore nulls)      over (partition by emp order by test_date) as prev_value,    first_value(value ignore nulls)      over (partition by emp order by test_date range between current row and unbounded following) as next_value,    row_number() over (partition by emp order by test_date) -      row_number() over (partition by emp order by case when value is null then 1 else 0 end, test_date) as grp  from your_table)order by test_date;E TEST_DATE       VALUE INTERPOLATED- ---------- ---------- ------------A 2001-01-01                        A 2001-01-02        100          100A 2001-01-03                      90A 2001-01-04         80           80A 2001-01-05              76.6666667A 2001-01-06              78.3333333A 2001-01-07         75           75I&#8217;ve used last_value and first_value instead of lead and lag, but either works. (Lead/lag might be faster on a large data set I suppose). The grp calculation is Tabibitosan.

Advertisement

Answer