Finding shortest geo-spatial distance from one point to all other points in SQL

Question

There are two types of users who purchase the movie tickets from either town A, town B, town C or online. I have the following tables as: Locations: This table consists of locations of movie centers Users: This table contains the history of user's purchase i.e. either online or in towns. Also consists of user's latitude/longitude during the purchase. I

Accepted Answer

You CTE specific_location is missing a JOIN to USERS as locations itself does not have a user_id column.I would also make an enriched user, to add a sequence just so later the location match can be distinctly per user row, and then do the user/location join in a second CTE, and thus the select you do at the end is with pre-computed values:I also swapped you two value CASE statements for IFF&#8217;sWITH enriched_user AS (    SLECT         u.user_id,        u.latitude,        u.longitude,        u.town,        seq4() as seq,        IFF(towns IN ('Town_A','Town_B','Town_C'), 'Town', 'Town_Online') AS purchase_in    FROM user AS u), user_and_closest_location AS (    SELECT         u.user_id,        u.latitude,        u.longitude,        u.town,        u.purchase_in        l.town as closest_town        haversine(u.latitude, u.longitude, l.latitude, l.longitude)    FROM enriched_user AS u,        location AS l    QUALIFY row_number() OVER (PARTION BY u.seq ORDER BY haversine(u.latitude, u.longitude, l.latitude, l.longitude)) = 1)SELECT          u.user_id,    u.latitude,    u.longitude,    u.town,    IFF(u.purchase_in = 'Town', u.closest_town, u.purchase_in) AS nearest_townFROM user_and_closest_location AS uORDER BY 1,2,3; The logic all calculating the distance based join for all row, is that it will be faster, and if there are things you want to not do it for, it would be better to prune the input there, but then you will need to rejoin to input to captured the skipped values.WITH enriched_user AS (    SLECT         u.user_id,        u.latitude,        u.longitude,        u.town,        seq4() as seq,        IFF(towns IN ('Town_A','Town_B','Town_C'), 'Town', 'Town_Online') AS purchase_in    FROM user AS u), user_and_closest_location AS (    SELECT         u.user_id,        u.latitude,        u.longitude,        u.town,        u.purchase_in        l.town as closest_town        haversine(u.latitude, u.longitude, l.latitude, l.longitude)    FROM enriched_user AS u,        location AS l    WHERE u.purchase_in = 'Town'    QUALIFY row_number() OVER (PARTION BY u.seq ORDER BY haversine(u.latitude, u.longitude, l.latitude, l.longitude)) = 1)SELECT          u.user_id    u.latitude,    u.longitude,    u.town,    IFF(u.purchase_in = 'Town', ucl.closest_town, u.purchase_in) AS nearest_townFROM enriched_user user_and_closest_location AS uLEFT JOIN user_and_closest_location AS ucl     ON u.seq = ucl.seqORDER BY 1,2,3;also the in towns could be flipped to be not &#8216;online`IFF(towns IN ('Town_A','Town_B','Town_C'), 'Town', 'Town_Online') AS purchase_inbecoming:IFF(towns != 'online', 'Town', 'Town_Online')at which point the actual test can be moved to where it&#8217;s used later.

Advertisement

Answer