Tag: hadoop

Convert from lateral view to case statements in hive

I need to place the code below in a case statement: I have tried: But I am getting an error. Answer: For the provided SQL on table tab1, the actual logic is as follows: explode the field secondary.tertiary and alias it as lv, which results in a temporary result set (table) tab2; then a join-like operation concatenates tab2’s fields back to the rows in tab1, resulting in another intermediate …
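The explode-then-join logic described in the answer can be sketched in HiveQL as follows. This is only an illustration, not the original poster's query: the table tab1, the field secondary.tertiary, and the alias lv come from the excerpt, while the id column and the item column alias are hypothetical, and tertiary is assumed to be an array nested inside a struct named secondary.

```sql
-- Sketch only: id and item are hypothetical names used for illustration.
-- explode() turns each element of the array into its own row (the
-- temporary result set aliased lv), and LATERAL VIEW joins those rows
-- back to the tab1 row they came from.
SELECT t.id,
       lv.item
FROM tab1 t
LATERAL VIEW explode(t.secondary.tertiary) lv AS item;
```

With a plain LATERAL VIEW, a row whose array is empty or NULL produces no output rows; LATERAL VIEW OUTER keeps such rows with a NULL item.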

Hive: How do I INSERT data FROM a PARTITIONED table INTO a PARTITIONED table?

cloudera hadoop hive sql

This is an extension of a previous question I asked: Is it possible to change an existing column’s metadata on an EXTERNAL table that is defined by an AVRO schema file? Question: In Hive 2.1.1, how do I INSERT data FROM a PARTITIONED table INTO a PARTITIONED table? What is the correct syntax? I have seen material all over the

Array operation on hive collect_set

arrays hadoop hive set sql

I am working in Hive on a large dataset. I have a table with an array column, and the content of the column is as follows: [“20190302Prod4” “20190303Prod1” “20190303Prod4” “20190304Prod4” “20190305Prod3” “…

Two different table counts insert into Hive Table

hadoop hive hiveql sql

I need to insert data into a Hive table from the counts of two different tables. For example, assume I have a table sample with fields counter1 and counter2, and two other tables, test1 and test2. I …

Split column in hive

apache-spark-sql hadoop hive hiveql sql

I am new to Hive and the Hadoop framework. I am trying to write a Hive query to split a column delimited by the pipe character ‘|’. Then I want to group every 2 adjacent values and separate them into …

“Hive” max column value from multiple columns

hadoop hive hiveql sql

Hi: I have a situation where I need to find the max value of 3 calculated fields and store it in another field. Is it possible to do this in one SQL query? Below is the example. One way I tried is to calculate Personal_Income, Share_Income, Job_Income in the first pass, and in the second pass I used but this

How to find previous date in HIVE

date hadoop hive hiveql sql

I want to find the previous date in Hive. For example: I have a date, 20180821, in yyyyMMdd format, and I want to find the date before it. I tried the date_sub() function, but it seems to work only with the yyyy-MM-dd format. How can I solve this? Please help. Answer: In addition to the already provided solution with unix_timestamp conversion, please also consider this.
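The unix_timestamp conversion mentioned in the answer might look like the sketch below. It is not the answer's own code (that part of the excerpt is not shown here): it assumes the input is a string in yyyyMMdd form and that the result should be returned in the same format, and the column alias previous_date is made up for illustration.

```sql
-- Sketch only, assuming a yyyyMMdd string as input.
-- 1. unix_timestamp() parses the string with the given pattern.
-- 2. from_unixtime() renders it as yyyy-MM-dd, the form date_sub() accepts.
-- 3. date_sub() steps back one day.
-- 4. date_format() (available from Hive 1.2) converts back to yyyyMMdd.
SELECT date_format(
         date_sub(
           from_unixtime(unix_timestamp('20180821', 'yyyyMMdd'), 'yyyy-MM-dd'),
           1),
         'yyyyMMdd') AS previous_date;
```

On Hive versions without date_format(), the last step could instead strip the dashes with regexp_replace().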

To compare count between two hive table

hadoop hdfs hive sql

I am trying to do a count comparison between two tables. As the minus operator does not work in Hive, it is not working. Could you please suggest a simple way to compare counts between two tables? …

How to prevent sql without limit in Hive?

hadoop hive hive-configuration sql

Many users are used to running ‘select * from tables’ in Oracle/MySQL, but I should not allow them to query like that in Hive. Is there any way to prevent a full table scan in Hive, such as a trigger or something else? Thanks a lot! Answer: To prevent a full table scan, use these settings. Hive will generate an exception if the partition predicate is

How to convert a date format YYYY-MM-DD into integer YYYYMMDD in Presto/Hive?

date hadoop hive presto sql

How to CONVERT a date in format YYYY-MM-DD into integer YYYYMMDD in Presto/Hive? I am trying to convert the below list into YYYYMMDD integers WITH all_dates as (SELECT CAST(date_column AS …