How to count all rows in a raw data file using Hive?

Question

I am reading some raw input which looks something like this: Note the first two rows are "good" rows and the last two rows are "bad" rows since they are missing some data. Here is the snippet of my Hive query which is reading this raw data into a readonly external table: I need to get …

Accepted Answer

Hmmm . . . You can use:

    select sum(case when col1 is not null and col2 is not null and col3 is not null then 1 else 0 end) as num_good,
           sum(case when col1 is null or col2 is null or col3 is null then 1 else 0 end) as num_bad
    from readonly_s3;
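To check the logic without a Hive cluster, here is a minimal sketch that runs the same conditional aggregation against an in-memory SQLite table from Python. It is a stand-in, not Hive itself. The sample rows are made up for illustration, the table and column names (readonly_s3, col1 to col3) are taken from the query above, and the added count(*) column for the overall total is an assumption about what "all rows" should mean. It also assumes that missing fields are read as NULL; if the table's SerDe yields empty strings instead, the conditions would need to test for those as well.

```python
import sqlite3

# Hypothetical sample data: two complete rows and two rows with missing fields.
# None stands in for a field that the table reads as NULL.
ROWS = [
    ("a", "b", "c"),    # good
    ("d", "e", "f"),    # good
    ("g", None, "i"),   # bad: col2 is missing
    (None, None, "l"),  # bad: col1 and col2 are missing
]

# Same CASE logic as the answer, plus COUNT(*) for the overall total.
QUERY = """
SELECT COUNT(*) AS num_total,
       SUM(CASE WHEN col1 IS NOT NULL AND col2 IS NOT NULL AND col3 IS NOT NULL
                THEN 1 ELSE 0 END) AS num_good,
       SUM(CASE WHEN col1 IS NULL OR col2 IS NULL OR col3 IS NULL
                THEN 1 ELSE 0 END) AS num_bad
FROM readonly_s3
"""


def count_rows(rows):
    """Load rows into an in-memory table and return (total, good, bad)."""
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute("CREATE TABLE readonly_s3 (col1 TEXT, col2 TEXT, col3 TEXT)")
        conn.executemany("INSERT INTO readonly_s3 VALUES (?, ?, ?)", rows)
        return conn.execute(QUERY).fetchone()
    finally:
        conn.close()


if __name__ == "__main__":
    total, good, bad = count_rows(ROWS)
    print(total, good, bad)  # prints: 4 2 2
```

Because the two CASE conditions are exact complements, num_good + num_bad should always equal num_total, which makes a handy sanity check on the real table.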

Answer