Sampling a large number of rows from a table

Question

I want to extract a sample of roughly 5 million rows from a table that will contain between 10 million and 20 million rows. With that many rows, efficiency is key, so I am trying to avoid sorting the rows where possible. That is why I am avoiding the dbms_random.value solution that I have seen in similar questions.

Accepted Answer

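You could use PL/SQL with dynamic SQL like this:

    declare
      cnt integer;
    begin
      -- Count the rows so the sample percentage can be derived from it
      select count(*) into cnt from full_table;
      dbms_output.put_line(cnt);
      -- Percentage (rounded up) that yields roughly 5,000,000 rows
      execute immediate
        'insert into target_table'
        || ' select * from full_table sample (' || ceil(100 * 5000000/cnt) || ')';
    end;
    /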
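
To make the percentage arithmetic concrete, here is a small check of what the ceil expression evaluates to for a few row counts in the 10 to 20 million range. The row counts are hypothetical values picked for illustration; in the block above the real value comes from count(*) on full_table.

```sql
-- Hypothetical row counts, for illustration only
select cnt,
       ceil(100 * 5000000 / cnt) as sample_pct
from (
  select 10000000 as cnt from dual union all
  select 15000000 from dual union all
  select 20000000 from dual
);
-- 10,000,000 rows -> 50
-- 15,000,000 rows -> 34 (33.33 rounded up)
-- 20,000,000 rows -> 25
```

Since the sample clause picks rows by probability, the result is only approximately 5 million rows, and rounding up with ceil pushes the expected count slightly higher (about 5.1 million at 15 million rows). That should be fine for a "roughly 5 million" target.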

Answer