Retrieve a row of data with the most recent date when another row ‘X’ in T-SQL

Question

I have a database of customers, each with a membership effective date and a membership end date stored in separate columns. The data is a bit dirty, however: a customer can have multiple rows, only one of which is their most recent membership record. A member is considered "active" if their end date is NULL.

Accepted Answer

You could do this with row_number() and a conditional sort:

    select name, id, membership_effective_date, membership_end_date
    from (
        select
            t.*,
            row_number() over(
                partition by id
                order by
                    case when membership_end_date is null then 0 else 1 end,
                    case when membership_end_date <> membership_effective_date then 0 else 1 end,
                    membership_end_date desc
            ) rn
        from mytable t
    ) t
    where rn = 1

The trick lies in the order by clause of row_number(): it gives priority to rows whose end date is null, then to rows whose end date is not equal to the start date, then to the greatest end date. You can run the subquery separately to see how the row number is assigned.

With this information at hand, all that is left to do is filter on the top-ranked record per group.

Demo on DB Fiddle:

name  | id | membership_effective_date | membership_end_date
:---- | -: | :------------------------ | :------------------
Bob   |  1 | 2020-01-01                | null
Kim   |  2 | 2019-01-01                | 2020-01-01
Susan |  3 | 2018-01-01                | 2018-12-31
Larry |  4 | 2020-01-01                | 2020-01-01
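To try the ranking end to end, here is a minimal sketch in T-SQL. The temp table `#mytable` and every sample row in it are assumptions made up for illustration (the original data is not shown here). They are chosen only so that each customer exercises one of the three sort keys.

```sql
-- Hypothetical sample data: the table name and rows are assumptions, not the asker's data.
create table #mytable (
    name                      varchar(50),
    id                        int,
    membership_effective_date date,
    membership_end_date       date
);

insert into #mytable (name, id, membership_effective_date, membership_end_date)
values
    ('Bob',   1, '2018-01-01', '2018-12-31'),
    ('Bob',   1, '2020-01-01', null),          -- active row: null end date ranks first
    ('Kim',   2, '2018-01-01', '2018-12-31'),
    ('Kim',   2, '2019-01-01', '2020-01-01'),  -- latest end date among "real" periods
    ('Kim',   2, '2020-01-01', '2020-01-01'),  -- end = start: ranked after real periods
    ('Susan', 3, '2017-01-01', '2017-12-31'),
    ('Susan', 3, '2018-01-01', '2018-12-31'),
    ('Larry', 4, '2020-01-01', '2020-01-01');  -- only row, so it is kept

-- Show the row number assigned to every row, to see how the sort keys interact.
select
    t.*,
    row_number() over(
        partition by id
        order by
            case when membership_end_date is null then 0 else 1 end,
            case when membership_end_date <> membership_effective_date then 0 else 1 end,
            membership_end_date desc
    ) rn
from #mytable t
order by id, rn;

drop table #mytable;
```

Filtering this result on rn = 1 keeps one row per id: Bob's open-ended row, Kim's 2019-01-01 to 2020-01-01 row, Susan's 2018 row, and Larry's single row. Note that if two rows for the same id tie on all three sort keys, row_number() picks between them arbitrarily; adding a further key such as membership_effective_date desc would make the choice deterministic.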

Answer