Skip to content
Advertisement

SQL – Find most recent value from other fields (per user, between occurrences)

I have actions and results mixed together in an “event” field. But each of these actions essentially “causes” the result (and the subsequent outcome) (let’s assume). I want to tie each future outcome back to the actions that caused it, for each user.

I want the value from the “outcome” field to be copied into a new field for each user’s most upcoming, future result (if a result has occurred after that action). And I want to record the time of that result/outcome. So for example, if user_1 does three events then has a good outcome, I want the word “good” in a new field for each of those three preceding events. If user_2 has 2 actions, then a bad outcome, then 3 actions: I want the 2 first actions to have “bad” and the last 3 actions to have “?” in the event_outcome field.

*Note: event_timestamp field does not always come chronologically (with relation to row number). I’ve shown it ordered this way to make it easier to understand.

What I have is this*

What I want is this

I can find the most recent outcome for each user (Ideally I would have “?” instead of null if no outcome is known). with this code:

Which yeilds this (NOT what I want)

But I don’t know where to go from here. Maybe another level of nesting “partition by”? Over the event field?…

We are using BigQuery, if that context helps. Thank you!

Advertisement

Answer

Here is how i would do it in regular SQL. I am assuming that Big Query supports the analytical functions that i have used.

DB Fiddle Link

https://dbfiddle.uk/?rdbms=sqlserver_2019&fiddle=e97ebf473ccf3c33d7c6fa62fd14e51b

User contributions licensed under: CC BY-SA
4 People found this is helpful
Advertisement