Avoiding write conflicts while re-sorting a table

Question

I have a large table that I need to re-sort periodically. I am partly basing this on a suggestion I was given to stay away from using cluster keys since I am inserting data ordered differently (by time) from how I need it clustered (by ID), and that can cause re-clustering to get a little out of control. Sinc…

Accepted Answer

There are some reasons to avoid automatic reclustering, but they're basically all the same reasons why you shouldn't set up a job to re-cluster frequently. You're making the database do all the same work, but without the built-in management of it.

If your table is big enough that you are seeing performance issues with the clustering by time, and you know that the ID column is the main way that this table is filtered (in JOINs and WHERE clauses), then this is probably a good candidate for automatic clustering.

So I would recommend at least testing out a cluster key on the ID and then monitoring/comparing performance.

To give a brief answer to the question about re-sorting without conflicts as written:

I might recommend using a time column to re-sort records older than a given time (probably in a separate table). While it's sorting, you may get some new records. But you will be able to use that time column to marry up those new records with the now-sorted older records.
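
To make the cutoff idea concrete, here is a rough sketch of how it might look. It is not tied to any particular warehouse: it uses Python's built-in sqlite3 module purely as a stand-in, and the table names (events, events_sorted), the column names (id, created_at, payload) and the helper functions are all hypothetical. It also assumes the time column is assigned at insert time and only ever increases, so no row with a timestamp older than the cutoff can show up after the sort has started.

```python
import sqlite3


def build_sorted_snapshot(conn, cutoff):
    """Copy rows older than `cutoff` into a separate table, ordered by id.

    Table and column names are hypothetical; SQLite stands in for the
    real warehouse.
    """
    with conn:  # commits on success, rolls back on error
        conn.execute("DROP TABLE IF EXISTS events_sorted")
        conn.execute(
            "CREATE TABLE events_sorted "
            "(id INTEGER, created_at INTEGER, payload TEXT)"
        )
        # Only rows strictly older than the cutoff are touched, so rows
        # inserted into `events` during the sort are not part of this step.
        conn.execute(
            "INSERT INTO events_sorted (id, created_at, payload) "
            "SELECT id, created_at, payload FROM events "
            "WHERE created_at < ? ORDER BY id, created_at",
            (cutoff,),
        )


def append_late_rows(conn, cutoff):
    """Marry up rows that arrived at or after `cutoff` with the sorted rows."""
    with conn:
        conn.execute(
            "INSERT INTO events_sorted (id, created_at, payload) "
            "SELECT id, created_at, payload FROM events "
            "WHERE created_at >= ? ORDER BY id, created_at",
            (cutoff,),
        )


def demo():
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE events (id INTEGER, created_at INTEGER, payload TEXT)"
    )
    # Rows arrive ordered by time, not by id.
    conn.executemany(
        "INSERT INTO events VALUES (?, ?, ?)",
        [(3, 100, "a"), (1, 101, "b"), (2, 102, "c")],
    )
    cutoff = 103
    build_sorted_snapshot(conn, cutoff)
    # A new row lands while (or after) the older rows are being sorted.
    conn.execute("INSERT INTO events VALUES (?, ?, ?)", (0, 104, "d"))
    append_late_rows(conn, cutoff)
    rows = conn.execute("SELECT id FROM events_sorted ORDER BY rowid").fetchall()
    return [r[0] for r in rows]


if __name__ == "__main__":
    print(demo())
```

In this sketch the late row is simply appended after the sorted block, so the older rows stay in id order and the newer ones get folded in on the next re-sort. Calling append_late_rows twice with the same cutoff would duplicate rows, so a real job would need to track which cutoff it last merged up to.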

Answer