Skip to content
Advertisement

Statistical functions on columns of arrays in BigQuery

If I have data that looks like the follows:

and I want to get the array averages of the val column, naively I’d want to just try something like

But that doesn’t work. I get an error like No matching signature for aggregate function AVG for argument types: ARRAY<INT64>. Supported signatures: AVG(INT64); AVG(UINT64); AVG(DOUBLE); AVG(NUMERIC)

I know that if I have just one column val I can do something like

but what if I have multiple columns (val, val2, etc.) and need multiple statistics? The above method just seems really cumbersome.

To be clear the result I’d want is:

Is there a simple way to do this? Or do I need to create some sort of temporary function to accomplish this? Or am I stuck doing something like what I see in https://stackoverflow.com/a/45560462/1902480

Advertisement

Answer

Consider below example

if applied to sample data in your question – output is

enter image description here

User contributions licensed under: CC BY-SA
9 People found this is helpful
Advertisement