How can I select an ID from the row with the maximum value for each of several columns?

Question

I want to extract the values of an ID column from the rows containing the maximum values of each of several other columns, and then collate these in a new table which has the column headers in one ...

Accepted Answer

The following approach takes advantage of one of sqlite3's documented but non-standard behaviors when mixing aggregate and non-aggregate results in a query: when using max(), all non-aggregate values are taken from one of the rows with the maximum value (ties are broken arbitrarily):

    WITH maxes(column, name, maxval) AS
     (SELECT 'A', name, max(A) FROM mytable
      UNION ALL
      SELECT 'B', name, max(B) FROM mytable
      UNION ALL
      SELECT 'C', name, max(C) FROM mytable
      UNION ALL
      SELECT 'D', name, max(D) FROM mytable
      UNION ALL
      SELECT 'E', name, max(E) FROM mytable)
    SELECT column AS id, name AS top_scorer
    FROM maxes
    ORDER BY column;

which gives

    id          top_scorer
    ----------  ----------
    A           name1
    B           name1
    C           name4
    D           name2
    E           name4

However, a database design that uses a one-to-many relationship with a second table instead of one column per thing is going to be a better approach.

Consider this schema:

    CREATE TABLE names(id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE scores(name_id INTEGER REFERENCES names(id)
                      , score_id TEXT
                      , val INTEGER
                      , PRIMARY KEY(name_id, score_id)) WITHOUT ROWID;

populated with your test data:

    INSERT INTO names VALUES(101,'name1');
    INSERT INTO names VALUES(102,'name2');
    INSERT INTO names VALUES(103,'name3');
    INSERT INTO names VALUES(104,'name4');
    INSERT INTO scores VALUES(101,'A',4);
    INSERT INTO scores VALUES(101,'B',4);
    INSERT INTO scores VALUES(101,'C',1);
    INSERT INTO scores VALUES(101,'D',3);
    INSERT INTO scores VALUES(101,'E',3);
    INSERT INTO scores VALUES(102,'A',3);
    INSERT INTO scores VALUES(102,'B',1);
    INSERT INTO scores VALUES(102,'C',2);
    INSERT INTO scores VALUES(102,'D',4);
    INSERT INTO scores VALUES(102,'E',2);
    INSERT INTO scores VALUES(103,'A',2);
    INSERT INTO scores VALUES(103,'B',2);
    INSERT INTO scores VALUES(103,'C',3);
    INSERT INTO scores VALUES(103,'D',2);
    INSERT INTO scores VALUES(103,'E',1);
    INSERT INTO scores VALUES(104,'A',1);
    INSERT INTO scores VALUES(104,'B',3);
    INSERT INTO scores VALUES(104,'C',4);
    INSERT INTO scores VALUES(104,'D',1);
    INSERT INTO scores VALUES(104,'E',4);

you can use this query:

    WITH maxes AS
     (SELECT score_id, name, max(val)
      FROM names
      JOIN scores ON id = name_id
      GROUP BY score_id)
    SELECT score_id AS id, name AS top_scorer
    FROM maxes
    ORDER BY score_id;

which avoids having to hard-code each id being tracked, as the current one-per-column design you're using requires. Much, much cleaner and more flexible.
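If you want to try the normalized design without setting up a database file, here is a minimal sketch that loads the schema and test data from this answer into an in-memory database using Python's standard sqlite3 module and runs the GROUP BY query. The function name top_scorers, the TEST_DATA dictionary, and the use of an in-memory database are assumptions made for illustration; they are not part of the original answer. It relies on the same SQLite-specific bare-column behavior with max() described above, so it should not be expected to work unchanged on other database engines.

```python
import sqlite3

# Schema from the answer: one row per (name, score) pair.
SCHEMA = """
CREATE TABLE names(id INTEGER PRIMARY KEY, name TEXT);
CREATE TABLE scores(name_id INTEGER REFERENCES names(id)
                  , score_id TEXT
                  , val INTEGER
                  , PRIMARY KEY(name_id, score_id)) WITHOUT ROWID;
"""

# Same values as the INSERT statements in the answer, grouped per name.
# (TEST_DATA is a hypothetical helper structure for this sketch.)
TEST_DATA = {
    101: ("name1", {"A": 4, "B": 4, "C": 1, "D": 3, "E": 3}),
    102: ("name2", {"A": 3, "B": 1, "C": 2, "D": 4, "E": 2}),
    103: ("name3", {"A": 2, "B": 2, "C": 3, "D": 2, "E": 1}),
    104: ("name4", {"A": 1, "B": 3, "C": 4, "D": 1, "E": 4}),
}

# Query from the answer; `name` is a bare column that SQLite takes
# from the row holding max(val) within each score_id group.
QUERY = """
WITH maxes AS
 (SELECT score_id, name, max(val)
  FROM names
  JOIN scores ON id = name_id
  GROUP BY score_id)
SELECT score_id AS id, name AS top_scorer
FROM maxes
ORDER BY score_id;
"""


def top_scorers():
    """Return a list of (score_id, top_scorer) tuples for the test data."""
    conn = sqlite3.connect(":memory:")
    try:
        conn.executescript(SCHEMA)
        for name_id, (name, scores) in TEST_DATA.items():
            conn.execute("INSERT INTO names VALUES(?, ?)", (name_id, name))
            conn.executemany(
                "INSERT INTO scores VALUES(?, ?, ?)",
                [(name_id, score_id, val) for score_id, val in scores.items()],
            )
        return conn.execute(QUERY).fetchall()
    finally:
        conn.close()


if __name__ == "__main__":
    for score_id, name in top_scorers():
        print(score_id, name)
```

Because the test data has a single maximum per score, the result is the same on every run; with tied maxima, which of the tied names is returned is up to SQLite.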

Answer