MS SQL identify duplicate rows based on log time

Question

I have a fairly simple database table that logs every time a tray passes over an RFID reader. What sometimes happens is the data is being sent twice, so I have been asked if I can find out how often this happens. Rather than spending the next few days going through every record in the log table, I presume the…

Accepted Answer

You can use lead() and lag().  For all such rows:select l.*from (select l.*,             lag(logged) over (partition by rfid order by logged) as prev_logged,             lead(logged) over (partition by rfid order by logged) as next_logged      from logs l     ) lwhere prev_logged > dateadd(second, -5, logged) or      next_logged < dateadd(second, 5, logged);Your sample code looks like SQL Server so this uses SQL Server syntax.If you just want the first record in a sequence of duplicates, you can use similar logic:select l.*from (select l.*,             lag(logged) over (partition by rfid order by logged) as prev_logged,             lead(logged) over (partition by rfid order by logged) as next_logged      from logs l     ) lwhere (prev_logged < dateadd(second, -5, logged) or prev_logged is null) and      next_logged < dateadd(second, 5, logged);Note that both of these only use the rfid.  I&#8217;m not sure if the rfid should really be used in conjunction with other columns, such as point, but your question explicitly mentioned only duplicate rfid.

ID	TYPE	POINT	RFID	LOGGED
1	101	4101	1234	2021-01-20 06:31:25:154
2	101	4101	4567	2021-01-20 06:32:24:165
3	101	4102	1234	2021-01-20 06:35:55:154
4	101	4102	1234	2021-01-20 06:35:55:516

Advertisement

Answer