Find the longest streak of perfect scores per player

Question

I have a the following result from a SELECT query with ORDER BY player_id ASC, time ASC in PostgreSQL database: I'm trying to find each player's longest streak where points = 100, with the tiebreaker being whichever streak began most recently. I also need to determine the time at which that player's longest streak began. The expected result would be:

Accepted Answer

A gaps-and-islands problem indeed.Assuming:&#8220;Streaks&#8221; are not interrupted by rows from other players.All columns are defined NOT NULL. (Else you have to do more.)This should be simplest and fastest as it only needs two fast row_number() window functions:SELECT DISTINCT ON (player_id)       player_id, count(*) AS seq_len, min(ts) AS time_beganFROM  (   SELECT player_id, points, ts        , row_number() OVER (PARTITION BY player_id ORDER BY ts)         - row_number() OVER (PARTITION BY player_id, points ORDER BY ts) AS grp   FROM   tbl   ) subWHERE  points = 100GROUP  BY player_id, grp  -- omit "points" after WHERE points = 100ORDER  BY player_id, seq_len DESC, time_began DESC;db<>fiddle hereUsing the column name ts instead of time, which is a reserved word in standard SQL. It&#8217;s allowed in Postgres, but with limitations and it&#8217;s still a bad idea to use it as identifier.The &#8220;trick&#8221; is to subtract row numbers so that consecutive rows fall in the same group (grp) per (player_id, points). Then filter the ones with 100 points, aggregate per group and return only the longest, most recent result per player.Basic explanation for the technique:Select longest continuous sequenceWe can use GROUP BY and DISTINCT ON in the same SELECT, GROUP BY is applied before DISTINCT ON. Consider the sequence of events in a SELECT query:Best way to get result count before LIMIT was appliedAbout DISTINCT ON:Select first row in each GROUP BY group?

Advertisement

Answer