How to keep only one entry among several in PostgreSQL database

Question

I have a database that monitors a network (snapshots table, that contains a snapshot_date column). This production database was flooded by a faulty crontab, resulting in many snapshots for the same device every day. I don&#8217;t won&#8217;t to remove everything, but i want to keep only one snapshot per snaps…

Accepted Answer

One option uses distinct on:select distinct on (snapshot_date, device_id) *from mytable order by snapshot_date, device_id, snapshot_idThis retains the one row per snapshot_date and device_id that has the smalles snapshot_id. Note that this assumes that snapshot_id is unique (or, at least, is unique for each (snapshot_date, device_id) tuple).If you wanted a delete statement, then:delete from mytable tusing (    select snapshot_date, device_id, min(snapshot_id) snapshot_id    from mytable     group by snapshot_date, device_id) t1where     t.snapshot_date = t1.snapshot_date    and t.device_id = t1.device_id    and t.snapshot_id < t1.id

Advertisement

Answer