Filter duplicate rows in Postgres based on conditions between those rows

Question

Given a table CREATE TABLE data( irs_number VARCHAR (50), mop_up INTEGER, ou VARCHAR (50) ); How would I return all matching records that&#8230; have at least one identical value for irs_number in &#8230;

Accepted Answer

You should be able to do this with a simple exists clause:SELECT irs_number, mop_up, ouFROM data dWHERE EXISTS (SELECT 1 FROM data d2 WHERE d2.irs_number = d.irs_number AND d2.mop_up = 1 AND d2.ou <> d.ou );EDIT:The above misinterpreted the question. It assumed that a mop_up = 1 needed to be on a different ou. As I read the question, this is ambiguous but doesn’t appear to be what you want. So, two exists address this:SELECT irs_number, mop_up, ouFROM data dWHERE EXISTS (SELECT 1 FROM data d2 WHERE d2.irs_number = d.irs_number AND d2.mop_up = 1 ) AND EXISTS (SELECT 1 FROM data d2 WHERE d2.irs_number = d.irs_number AND d2.ou <> d.ou );Here is a db<>fiddle.Both these solutions will be able to take advantage of an index on (irs_number, mop_up, ou).

Advertisement

Answer