Tag: amazon-redshift

How to add SUPER column to existing AWS Redshift table?

GOAL I would like to add a Redshift SUPER column to and existing redshift table. I need this to store JSON data there CODE This is how Normally I would add a new column. 1. Tried Error running query: ALTER TABLE ADD COLUMN defined as NOT NULL must have a non-null default expression Reviewed Solutions Alter column data type in

Group By based on consequtive flag in Redshift (Gaps and Islands problem)

amazon-redshift sql

I am trying to solve “gaps and islands” and group consecutive checks together. My data looks like this I want to group by consecutive location_id and consecutive reservation_id (both should be consecutive respectively) within same date and site_id, and sum revenue. so for the example above the output should be: Location_id and reservation_id are of no importance except for this

Why does strpos, position, charindex return 1 when empty substring is passed

amazon-redshift sql

Why do all three of these return a value of 1 in redshift? select strpos(‘X’, ”) select position(” in ‘X’) select charindex(”, ‘X’) Answer Because the empty string is the very first part of all strings.

How can I perform the same query on multiple tables in Redshift

amazon-redshift sql workbench

I’m working in SQL Workbench in Redshift. We have daily event tables for customer accounts, the same format each day just with updated info. There are currently 300+ tables. For a simple example, I would like to extract the top 10 rows from each table and place them in 1 table. Table name format is Events_001, Events_002, etc. Typical values

Redshift – How to use column in one table as pattern in SIMILAR TO

amazon-redshift regex sql

I have a problem where I have two tables. One table constains urls and their information and another groups of urls that should be grouped by a pattern. I have tried something like this bearing in mind that url_patterns will only have one row per group. The main problem here is that it seems that applying SIMILAR TO with a

Redshift sql identify records based on timestamp which came before a certain record

amazon-redshift amazon-web-services sql

I’ve a redshift table which has the following structure I’m trying to find 3 sets, all titleIds [preferably entire row, not just the list of titleIds] which have country IN after US, the other way around which have US after IN and all titles which just has an IN entry and nothing else. Now its possible that we have for

Effectively select latest row for each group in a very large table?

amazon-redshift greatest-n-per-group query-optimization sql

I have (for example’s sake) a table Users (user_id, status, timestamp, …). I also have another table SpecialUsers (user_id, …). I need to show each special user’s latest status. The problem is that the Users table is VERY, VERY LARGE (more than 50 Billion rows). Most of the solutions in for instance this question just hang or get “disk full”

Redshift SQL: Column Counting Matching Rows Given a Condition within another column

amazon-redshift count database sql sql-like

I have a table with two columns, user ids and strings. I want to add a third column that counts the number of strings within the second column that start with the entire string value in any given row. There is only one row per user. The goal is to get the following table structure: Here the count is equal

How to add a minutes column to date column sql

addition amazon-redshift date sql time

I may sound silly asking it – but I have the need to add a value derived from one column of database to the other which is the date. Following is the image of data I have – timeelapsed/60000 gives me the duration of task in minutes and I would require the same to be deducted from the lastupdatedtime to

How to add date to a file’s name using UNLOAD in Redshift

amazon-redshift date filenames sql unload

I found 2 solutions: Using AWS Data Pipeline to schedule the query (Unload) and use ‘s3://reporting-team-bucket/importfiles/test_123-#{format(@scheduledStartTime,’YYYY-MM-dd-HH’)}.csv’ writing an MV command to rename the file on the s3 bucket Is there a way to give a file’s the current date by only using Redshift, with no other services? Here is my code so far: Just need to get CurrentDate to be