Tag: amazon-redshift

Distinct performance in Redshift

amazon-redshift amazon-redshift-spectrum aws-glue query-optimization sql

I am trying to populate multiple dimension tables from a single base table. Sample Base Table: Required Dimension tables: emp_name_dim with values – AAA, BBB, CCC; emp_surname_dim with values – ZZZ, XXX; country_dim with values – USA, IND, CAN; dept_dim with values – CE, IT, MECH; university_dim…

Multiple conditions in CASE WHEN statement

amazon-redshift case sql

I’m working in SQL (Amazon Redshift) with Recruitment data where each applicant has multiple sources, which I’ve split into different columns called source_1, source_2, and source_3, along with a number_of_sources column to record how many sources that applicant has. We want each applicant to only…

How to get the last/maximum date that is on/earlier than another baseline date by user?

amazon-redshift pandas sql

I have a df where I am trying to create the Last Login Date column, as shown in the image. I am not sure how to get the maximum login date that was on or before the email notification date for the current row. I added explanations of how I expect the data to look. Any help is appreciated in either sql…
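One way to read the question above is as a join on "login on or before the notification" followed by a MAX per notification row. The sketch below is only an illustration under assumptions: the table and column names (`logins`, `notifications`, `user_id`, `login_date`, `notification_date`) and the sample rows are hypothetical, and Python's built-in sqlite3 stands in for Redshift so the query can be run locally.

```python
import sqlite3

# SQLite is used as a local stand-in; names and data are hypothetical.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE logins (user_id INTEGER, login_date TEXT);
CREATE TABLE notifications (user_id INTEGER, notification_date TEXT);
INSERT INTO logins VALUES
    (1, '2022-01-01'), (1, '2022-01-10'), (1, '2022-02-01'), (2, '2022-03-05');
INSERT INTO notifications VALUES
    (1, '2022-01-15'), (1, '2021-12-01'), (2, '2022-03-05');
""")

# For each notification, keep only logins on or before it, then take the latest.
# The LEFT JOIN keeps notifications that have no earlier login (NULL result).
rows = conn.execute("""
    SELECT n.user_id,
           n.notification_date,
           MAX(l.login_date) AS last_login_date
    FROM notifications n
    LEFT JOIN logins l
           ON l.user_id = n.user_id
          AND l.login_date <= n.notification_date
    GROUP BY n.user_id, n.notification_date
    ORDER BY n.user_id, n.notification_date
""").fetchall()

for row in rows:
    print(row)
```

Dates are stored as ISO-8601 strings here so that string comparison matches date order; with real DATE columns the same join condition should apply.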

Combining Data Across Multiple Data Fields

amazon-redshift sql

I am trying to create a common data field based on the multiple columns below. When I try to use NVL, I am getting blank values in the common field and it's not getting picked up. The input and output tables are below: Input table model-manufacture model -CAM MODEL -schedule CX190 NM890 NM890 JK1231 LO121 LO121 VB1…

How to create a new column using existing column in SQL Redshift

amazon-redshift dbvisualizer logic sql syntax

I am trying to create a new column in SQL with certain conditions in DBVisualizer, connected to a Redshift data source. I used the following command to achieve the results but I am getting a syntax error. I would be really grateful for any help. Code: Error: Expected Output: Thanks. Answer As stated in the error messa…

partition by customer for distinct items

amazon-redshift sql

select customer_id, row_number() over (partition by customer_id order by date) as rn from table How to get the same rn when Item Id is the same? The query below did not work: #1 select customer_id, row_number() over (partition by customer_id, Item Id order by date) as rn from table Answer We can try to use DENSE_RANK instead o…
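The answer excerpt above is cut off, so the following is only a guess at the kind of query it points toward, not the answer's actual code. It assumes a hypothetical `purchases` table with `customer_id`, `item_id`, and `purchase_date` columns, and it assumes the rank is ordered by `item_id` so that rows sharing an item get the same number. Python's built-in sqlite3 (version 3.25 or later for window functions) is used as a local stand-in for Redshift, which also supports DENSE_RANK.

```python
import sqlite3

# Hypothetical sample data; SQLite stands in for Redshift.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE purchases (customer_id INTEGER, item_id TEXT, purchase_date TEXT)"
)
conn.executemany(
    "INSERT INTO purchases VALUES (?, ?, ?)",
    [
        (1, "A", "2022-01-01"),
        (1, "A", "2022-01-05"),
        (1, "B", "2022-01-03"),
        (2, "C", "2022-01-02"),
    ],
)

# DENSE_RANK gives tied rows the same number and leaves no gaps,
# so every row of the same item within a customer shares one rn.
rows = conn.execute("""
    SELECT customer_id,
           item_id,
           DENSE_RANK() OVER (PARTITION BY customer_id ORDER BY item_id) AS rn
    FROM purchases
    ORDER BY customer_id, item_id, purchase_date
""").fetchall()

for row in rows:
    print(row)
```

ROW_NUMBER never repeats a value inside a partition, which is why adding Item Id to the PARTITION BY clause only restarts the numbering instead of sharing it.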

DATEPART and DATEFIRST alternative for redshift

amazon-redshift datepart sql

I’m trying to run in Redshift with the start day being Saturday. I tried using different syntax for SET DATEFIRST but it seems like it’s not supported (or I just can’t find the correct syntax for Redshift). Are there any alternatives available? The documentation page only shows information f…

Distance between two integer values

amazon-redshift postgresql sql

Is there a simple method to calculate the distance between two values, e.g. 10 and 30, which should result in 20? Currently I do it as follows: Answer The absolute value function does the trick.
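As a minimal sketch of the absolute value approach mentioned in the answer, the snippet below runs `ABS(a - b)` through Python's built-in sqlite3, used here only as a local stand-in (Redshift and PostgreSQL also provide ABS). The helper name `distance` is hypothetical.

```python
import sqlite3

# SQLite stands in for Redshift; ABS() is available in both.
conn = sqlite3.connect(":memory:")


def distance(a, b):
    """Return |a - b| as computed by the database's ABS() function."""
    (result,) = conn.execute("SELECT ABS(? - ?)", (a, b)).fetchone()
    return result


print(distance(10, 30))  # prints 20
```

Because ABS drops the sign, the order of the two values does not matter.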

How can I make TO_DATE in Amazon Redshift work with multiple date formats in the same column?

amazon-redshift amazon-web-services datetime sql

I have a string column with date-time values that I need to convert into dates in order to calculate a dense rank. This column can have any of the below 3 formats: My SQL Statement to calculate the dense rank is based on this date-time column that I need to convert to date in order to rank descending: However…