I’m trying to optimise my query for when an internal customer only wants to return one result (and its associated nested dataset). My aim is to reduce the bytes processed by the query.
However, the bytes processed are identical whether I query 1 record (with a 48,000-element unnested array) or the whole dataset (10,000 records, 514,048,748 total array elements)!
So my table results for one record query:
SELECT test_id, value FROM <my_table_reference>, UNNEST(TimeSeries) AS timeseries WHERE test_id = "T0003" AND SignalName = "Distance";
looks like this:
test_id | value |
---|---|
T0003 | 1.0 |
T0003 | 2.0 |
T0003 | 3.0 |
T0003 | 4.0 |
(48000 rows)
This continues until value (Distance) = 48000 m, i.e. 48,000 rows for the single record matching test_id = 'T0003'.
Total process was 3.84GB
For whole table (~10,000 records):
SELECT test_id, value FROM <my_table_reference>, UNNEST(TimeSeries) AS timeseries WHERE SignalName = "Distance";
looks like this:
test_id | value |
---|---|
T0001 | 1.0 |
T0001 | 2.0 |
T0001 | 3.0 |
T0001 | 4.0 |
(514,048,748 rows)
Total process was 3.84GB
Why is the processed size the same for both queries, and how can I optimise this for single-row extractions?
Answer
This happens because a full table scan is still needed to find all the rows whose test ID equals the specified one.
It is not clear from your example which columns are part of the TimeSeries record. If test_id is not one of them, I would suggest clustering the table on the test_id column. With clustering, the data is automatically organised according to the contents of the test_id column, so a query that filters on that column no longer needs a full scan to find all matching values.
Read more about clustered tables in the BigQuery documentation.
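As a sketch of what that could look like (the dataset and table names below are placeholders standing in for `<my_table_reference>`, which you would replace with your own):

```sql
-- Create a clustered copy of the existing table. Rows are physically
-- organised into blocks by test_id, so filters on test_id can prune
-- blocks instead of scanning the whole table.
CREATE TABLE `my_dataset.my_table_clustered`
CLUSTER BY test_id
AS
SELECT *
FROM `my_dataset.my_table`;

-- The single-record query from the question, run against the clustered
-- table; BigQuery should now read only the blocks containing 'T0003'.
SELECT test_id, value
FROM `my_dataset.my_table_clustered`, UNNEST(TimeSeries) AS timeseries
WHERE test_id = 'T0003'
  AND SignalName = 'Distance';
```

Note that the reduction in bytes processed depends on how the rows fall into clustered blocks, so the savings for a single test_id are typically large but not exactly 1/10,000th of the full-table cost.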