Group rows by the same value in the field, while matching on partial value only

Question

I have a table with many rows (anywhere from a few thousand to a few million). I need my query to do the following: group results by the part of the field's value that they have in common, and order by the biggest group first. Most values in the table are only partially similar: part of the value is the same, while another part (e.g. the suffix) differs.

Accepted Answer

If you're using MySQL 5.x, you can strip the trailing _ and digits from the Uri value using this expression:

    LEFT(Uri, LENGTH(Uri) - LOCATE('_', REVERSE(Uri)))

Using a REGEXP test to see if the Uri ends in _ and some digits, we can then process the Uri according to that and then GROUP BY that value to get the counts:

    SELECT CASE WHEN Uri REGEXP '_[0-9]+$' THEN LEFT(Uri, LENGTH(Uri) - LOCATE('_', REVERSE(Uri)))
           ELSE Uri
           END AS Uri2,
           COUNT(*) AS Count
    FROM data
    GROUP BY Uri2

Output:

    Uri2        Count
    copy_all    2
    delete      3
    merge_all   1
    select      4

Demo on SQLFiddle
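To make the logic easier to check outside the database, here is a minimal Python sketch of the same idea: drop a trailing _ plus digits when present, count rows per remaining value, and list the biggest group first as the question asks. The function names and the sample rows are hypothetical, picked only for illustration; this is not the answer's own code and it is not meant to replace the SQL.

```python
import re
from collections import Counter

# Underscore followed by one or more digits at the end of the value,
# the same pattern the REGEXP test '_[0-9]+$' checks for.
TRAILING_ID = re.compile(r"_[0-9]+$")


def strip_numeric_suffix(uri: str) -> str:
    """Return uri without a trailing _<digits> suffix; other values are unchanged."""
    return TRAILING_ID.sub("", uri)


def group_counts(uris):
    """Count rows per stripped value, biggest group first (ties broken by name)."""
    counts = Counter(strip_numeric_suffix(u) for u in uris)
    return sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))


# Hypothetical sample rows (assumption: not taken from the original table).
sample = [
    "select_1", "select_2", "select_3", "select",
    "delete_10", "delete_11", "delete",
    "copy_all_7", "copy_all",
    "merge_all_42",
]

for name, count in group_counts(sample):
    print(name, count)
```

In the SQL query itself, the ordering step would most likely be an ORDER BY Count DESC clause added after the GROUP BY. The query as posted has no ORDER BY.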

Answer