Redshift SQL: add and reset a counter with date and group considered

Question

Suppose I have a table below. I'd like to have a counter to count the # of times when a Customer (there are many) is in Segment A. If the Customer jumps to a different Segment between 2 quarters, the ...

Accepted Answer

This is a type of gaps-and-islands problem. You can solve it with a difference of row numbers. The real difficulty is ordering the quarters, but string functions can handle that.

    select quarter, customer, segment,
           row_number() over (partition by customer, segment, seqnum - seqnum_cs
                              order by right(quarter, 4), left(quarter, 2)) as counter
    from (select t.*,
                 row_number() over (partition by customer
                                    order by right(quarter, 4), left(quarter, 2)) as seqnum,
                 row_number() over (partition by customer, segment
                                    order by right(quarter, 4), left(quarter, 2)) as seqnum_cs
          from t
         ) t
    order by customer, seqnum;

The key idea here is that the difference of row numbers identifies the adjacent rows for a customer with the same segment. It can be a bit hard to see why this is the case. However, if you look at the results of the subquery, you will see why this works.
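To see why the difference of row numbers stays constant within a run of the same segment, it can help to step through the computation by hand. The following is a minimal Python sketch that mimics the three row_number() calls; it is not Redshift code. The sample rows, the "Q1 2019" quarter format, and the names quarter_key and add_counters are all assumptions made for illustration, since the original table is not shown in full.

```python
from collections import defaultdict

# Hypothetical sample rows: (quarter, customer, segment).
# The "Q1 2019" format is an assumption inferred from the
# right(quarter, 4) / left(quarter, 2) calls in the query.
rows = [
    ("Q1 2019", "C1", "A"),
    ("Q2 2019", "C1", "A"),
    ("Q3 2019", "C1", "B"),
    ("Q4 2019", "C1", "A"),
    ("Q1 2020", "C1", "A"),
    ("Q2 2020", "C1", "A"),
]


def quarter_key(quarter):
    # Mirrors ORDER BY right(quarter, 4), left(quarter, 2):
    # sort by year first, then by quarter label.
    return (quarter[-4:], quarter[:2])


def add_counters(rows):
    # Process each customer's rows in chronological order.
    ordered = sorted(rows, key=lambda r: (r[1], quarter_key(r[0])))
    seqnum = defaultdict(int)     # row_number() per customer
    seqnum_cs = defaultdict(int)  # row_number() per (customer, segment)
    counter = defaultdict(int)    # row_number() per (customer, segment, diff)
    out = []
    for quarter, customer, segment in ordered:
        seqnum[customer] += 1
        seqnum_cs[(customer, segment)] += 1
        # Both numbers advance together while the segment is unchanged,
        # so their difference is constant within one run (island).
        diff = seqnum[customer] - seqnum_cs[(customer, segment)]
        counter[(customer, segment, diff)] += 1
        out.append((quarter, customer, segment,
                    seqnum[customer], seqnum_cs[(customer, segment)],
                    diff, counter[(customer, segment, diff)]))
    return out


if __name__ == "__main__":
    for row in add_counters(rows):
        print(row)
```

In this sample the customer leaves segment A for one quarter and then returns. The second run of A gets a different difference than the first, so it lands in its own partition and the counter restarts at 1.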


Answer