
MySql Partitioning questions for very segmented data

There are a number of partitioning questions out there, and people often caution that partitioning does not help performance. I want to see if my use case is one of the few where it does, and also clarify something.

In my use case, every user is going to be assigned to a bucket. Every user in a bucket has no visibility to users in other buckets. Thus almost every single query the application runs will include WHERE bucket = X.

Therefore even if there are 1 billion rows in the table, any given query only cares about a small segment of this.

My questions are

  1. Is this truly a use case where partitioning is a good performance booster?

  2. When I add my partition column to every unique key, that index would look like (Column A, Partition Column). If I recall correctly, a compound index does not give you a free index on the second column by itself. So if I query every single user in partition 5, ordered by some non-unique Column C, does it matter that the partition column does not have its own index?

  3. There could be 100-200 of these partitions, depending on the volume of users in the system. I’m looking at a massive and nasty partition range expression that looks like the following

          ALTER TABLE XXX
          PARTITION BY RANGE(the_thing) (
          PARTITION p0 VALUES LESS THAN (1),
          PARTITION p1 VALUES LESS THAN (2),
          PARTITION p2 VALUES LESS THAN (3),
          PARTITION p3 VALUES LESS THAN (4),
          PARTITION p4 VALUES LESS THAN (5),
          PARTITION p5 VALUES LESS THAN (6),
          PARTITION p6 VALUES LESS THAN (7),
          PARTITION p7 VALUES LESS THAN (8),
          -- AND ON AND ON AND ON....
          PARTITION p199 VALUES LESS THAN (200),
          PARTITION pmax VALUES LESS THAN MAXVALUE);
    

That looks really awkward and ugly. Is this just how it is?


Answer

Yes, this sounds like a good case for partitioning, since the queries you want to optimize will all be searching for a specific partition. The usual problem is that many apps have a mix of queries, so only some of them benefit from partition pruning, while the others have a bit of extra overhead for doing N queries against partitions and unifying the results.

If you define unique indexes as compound indexes like (ColumnA, PartitionColumn) keep in mind that ColumnA will only be unique within a given partition. Other rows in other partitions can have the same value in ColumnA. For example, the following is allowed, even with the unique index:

INSERT INTO XXX (ColumnA, PartitionColumn) 
VALUES
  ('abc', 5),
  ('abc', 6),
  ('abc', 19);

This might not be what you want, but that’s how compound unique indexes work. As long as the combination of values across the indexed columns does not occur on any other row, the new row is allowed.
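
To make that concrete, here is a minimal schema sketch; the table and column names (users, email, bucket) are hypothetical stand-ins, not from the question:

```sql
-- Hypothetical sketch: table and column names are illustrative only.
CREATE TABLE users (
  id     BIGINT       NOT NULL AUTO_INCREMENT,
  email  VARCHAR(255) NOT NULL,
  bucket INT          NOT NULL,
  PRIMARY KEY (id, bucket),            -- partition column in every unique key
  UNIQUE KEY uk_email (email, bucket)  -- email is unique only within a bucket
)
PARTITION BY HASH(bucket) PARTITIONS 200;
```

With this definition, the same email can appear once per bucket, which matches the INSERT example above.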

As for using the index, yes, a query can use some other index that does not include the partition column, even after doing partition pruning.
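
For example, a sketch using the question’s naming (XXX, ColumnC, and PartitionColumn are placeholders; the index name is an assumption):

```sql
-- Hypothetical sketch: a non-unique index that omits the partition column.
ALTER TABLE XXX ADD INDEX idx_c (ColumnC);

-- Pruning narrows the query to one partition first (visible in EXPLAIN's
-- `partitions` column); within that partition, idx_c can serve the ORDER BY:
EXPLAIN SELECT * FROM XXX WHERE PartitionColumn = 5 ORDER BY ColumnC;
```
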

The bulky list of explicit partitions is required if you use RANGE partitioning as you show.

You could alternatively use HASH partitioning

ALTER TABLE XXX PARTITION BY HASH(PartitionColumn) PARTITIONS 200;

Then you don’t have to list every partition explicitly. Data will automatically be inserted into the respective partition according to the modulus of the partition expression. The partition expression must return an integer.

You can also use KEY partitioning, which is not restricted to a single column, nor integer columns. The value(s) are concatenated and hashed and that determines an integer used to find the right partition. The method of hashing depends on the storage engine of the table.
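
A hedged sketch of KEY partitioning (SomeCode is a hypothetical non-integer column, e.g. a VARCHAR):

```sql
-- Hypothetical sketch: KEY partitioning hashes the column values itself,
-- so non-integer columns are allowed, unlike HASH partitioning.
ALTER TABLE XXX PARTITION BY KEY(SomeCode) PARTITIONS 200;
```
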


Re comments:

I will have to create an additional unique constraint on the primary key alone to enforce cross-partition uniqueness

That doesn’t work. The partition column must be part of all unique keys.

This is the most common reason that most people who want to use partitioning decide they can’t.
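
If you want a surrogate key plus partitioning, the usual compromise is to append the partition column to the primary key rather than adding a standalone unique key. A sketch (column names are assumptions):

```sql
-- Hypothetical sketch: `id` and PartitionColumn stand in for your columns.
-- A separate UNIQUE KEY (id) alone would be rejected; this form is accepted:
ALTER TABLE XXX
  DROP PRIMARY KEY,
  ADD PRIMARY KEY (id, PartitionColumn);
```

The trade-off, as noted above, is that `id` is then only unique per partition unless the application guarantees otherwise.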

is it guaranteed that only 1 number will map to only 1 partition (assuming I don’t go over 200) and that once I assign somebody a value in this column, unless I change it, they would always be on the same partition.

Yes, HASH partitioning uses a deterministic operation: modulus. If you have 200 partitions and all the values are less than 200, then each partition will have only rows with one value. And rows with that value will not occupy more than one partition.
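
You can see the arithmetic directly; MySQL places a row in partition number MOD(partition_expression, number_of_partitions):

```sql
-- With 200 partitions, the placement is just MOD(bucket, 200):
SELECT MOD(5, 200);    -- bucket 5 always maps to the same partition
SELECT MOD(199, 200);  -- no two bucket values below 200 share a partition
```
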

User contributions licensed under: CC BY-SA