MySQL use DELETE FROM to remove duplicates rows

Question

I&#8217;m learning MySQL and today I tried to solve an MySQL question on leetcode: https://leetcode.com/problems/delete-duplicate-emails/solution/ +&#8212;-+&#8212;&#8212;&#8212;&#8212;&#8212;&#8212;+ | Id | Email | +&#8212;-+&#8211;&#8230;

Accepted Answer

First, this is a very bad way of implementing this code.  But I guess you get what you pay for.Second, simply run the query as a select:SELECT p1.*, p2.*FROM Person p1 JOIN     Person p2      ON p1.Email = p2.Email AND p1.Id > p2.Id;(Note that I&#8217;ve rewritten the logic as a JOIN.  You should always use proper, explicit, standard, readable JOIN syntax, but the two methods are functionally equivalent.)On your second example, the results of this query are:table1 email     table1 id    table2 idjohn@example.com.    2            1john@example.com.    3            1john@example.com.    3            2What is notable is that id = 1 is never in the second column &#8212; and that is the column that determines which ids are deleted.  In other words, all but the smallest id for each email get deleted because there is a smaller id.This also hints at why this is a really bad solution.  MySQL has to deal with two rows for id = 3.  Perhaps it attempts to delete both.  Perhaps it has to just deal with extra data.  Either way, there is extra work.  And the more rows with the same email in the data the more extra duplicates are created.An alternative method, such as:delete p    from person p join         (select email, min(id) as min_id          from person p2          group by email         ) p2         on p.email = p2.email and p.id > p2.min_id;Does not have this problem and, in my opinion, the intent is clearer.

Advertisement

Answer