Tag: amazon-redshift

Returning median values of time deltas across different groups

I am trying to calculate the range between different steps in my data table and return the median for each calculation using SQL code, which returns the following error message: [0A000][500310] Amazon Invalid operation: within group ORDER BY clauses for aggregate functions must be the same; java.lang.RuntimeException: com.amazon.support.exceptions.ErrorException: Amazon Invalid operation: within group ORDER BY clauses for aggregate functions must be the …

Extracting data from JSON field in Amazon Redshift

I am trying to extract some data from a JSON field in Redshift. Given below is a sample view of the data I am working with. I am able to extract data for the first level, namely the data corresponding to fileFormat and data. I am now trying to extract the information under data, such as name, age, and dateofbirth. Answer: You could use …

Track customer loss in Redshift

I have a database that looks like this. It has: acct_open_dt, or the day the account was opened. acct_close_dt, or the day the account was closed. dayssinceopendt, which is TODAY – acct_open_dt if …

Alternative to BigQuery for medium-sized data

This is a follow-up to the question "Why doesn’t BigQuery perform as well on small data sets". Let’s suppose I have a dataset of ~1M rows. In the database we currently use (MySQL), aggregation queries run quite slowly, perhaps taking ~10s or so on complex aggregations. On BigQuery, the initialization time required might make this query take …
