How to deal with ambiguous column reference in sql column name reference?

Question

I have some code: I then try but I get an error Error in SQL statement: AnalysisException: Reference &#8216;A.CDE_WR&#8217; is ambiguous, could be: A.CDE_WR, A.CDE_WR.; line 6 pos 4 in databricks. How can I deal with this? Answer This query: is using SELECT *. The * is shorthand for all columns from both tabl…

Accepted Answer

This query:SELECT * FROM cctv1_details c1d LEFT JOIN     cctv3_details c3d     ON c1d.CDE_WR = c3d.CDE_WR AND c1d.CDE_dist = c3d.CDE_distis using SELECT *.  The * is shorthand for all columns from both tables.Obviously the combined columns from the two tables have duplicate column names; at least, CDE_WR and CDE_dist &#8212; and there may be others.  The general solution is to list all the columns out:SELECT c1d.col1, c1d.col2, . . . c3d.colx, c3d.colyFROM cctv1_details c1d LEFT JOIN     cctv3_details c3d     ON c1d.CDE_WR = c3d.CDE_WR AND c1d.CDE_dist = c3d.CDE_dist;However, this is often shorted to:SELECT c1d.*, c3d.colx, c3d.colyFROM cctv1_details c1d LEFT JOIN     cctv3_details c3d     ON c1d.CDE_WR = c3d.CDE_WR AND c1d.CDE_dist = c3d.CDE_dist;Note that I changed the table aliases to be reasonable abbreviations for the table names, making the query much clearer and easier to maintain.

Advertisement

Answer