Get distinct sets of rows for an INSERT executed in concurrent transactions

Question

I am implementing a simple pessimistic locking mechanism using Postgres as a medium. The goal is that multiple instances of an application can simultaneously acquire locks on distinct sets of users. The app instances are not trying to lock specific users. Instead they will take any user locks they can get. Sa…

Accepted Answer

The locking clause SKIP LOCKED should be perfect for you. It was added with Postgres 9.5. The manual: "With SKIP LOCKED, any selected rows that cannot be immediately locked are skipped."

FOR NO KEY UPDATE should be strong enough for your purpose. (It still allows other, non-exclusive locks.) Ideally, you take the weakest lock that is strong enough.

Work with just locks

If you can do your work while a transaction locking the involved users stays open, then that's all you need:

    BEGIN;

    SELECT id FROM my_users
    LIMIT  3
    FOR    NO KEY UPDATE SKIP LOCKED;

    -- do some work on selected users here  !!!

    COMMIT;

Locks are gathered along the way and kept until the end of the current transaction. Since the order can be arbitrary, we don't even need ORDER BY. With SKIP LOCKED there is no waiting and no deadlock is possible. Each transaction scans over the table and locks the first 3 rows still up for grabs. Very cheap and fast.

Since the transaction might stay open for a while, don't put anything else into the same transaction, so as not to block more than necessary.

Work with lock table additionally

If you can't do your work while a transaction locking the involved users stays open, register users in that additional table my_locks.

Before work:

    INSERT INTO my_locks(user_id)
    SELECT id FROM my_users u
    WHERE  NOT EXISTS (
       SELECT FROM my_locks l
       WHERE  l.user_id = u.id
       )
    LIMIT  3
    FOR    NO KEY UPDATE SKIP LOCKED
    RETURNING *;

No explicit transaction wrapper needed.

Users in my_locks are excluded in addition to those currently locked exclusively. That works under concurrent load. While each transaction is open, its row locks are active. By the time those are released at the end of the transaction, the rows have already been written to the locks table and become visible to other transactions at the same time.

There's a theoretical race condition for concurrent statements not seeing newly committed rows in the locks table just yet, and grabbing the same users after locks have just been released. But that would fail trying to write to the locks table. A UNIQUE constraint is absolute and will not allow duplicate entries, disregarding visibility.

Users won't be eligible again until deleted from your locks table.

Further reading:

- Postgres UPDATE … LIMIT 1
- Select rows which are not present in other table

Aside:

"… multiple simultaneous requests would be processed in their entirety one after the other."

It doesn't work that way. To understand how it actually works, read about the Multiversion Concurrency Control (MVCC) of Postgres in the manual.
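
The lock-table variant above relies on my_locks having a UNIQUE (or PRIMARY KEY) constraint on user_id, and on rows being deleted once the work is done. A minimal sketch of both follows. The column types, the foreign key, the locked_at column, and the example ids are assumptions for illustration and do not come from the question:

```sql
-- Hypothetical lock table. The PRIMARY KEY supplies the uniqueness that
-- rejects a duplicate claim even in the race condition described above.
CREATE TABLE my_locks (
   user_id   int PRIMARY KEY
             REFERENCES my_users(id)             -- assumes my_users.id is an integer key
 , locked_at timestamptz NOT NULL DEFAULT now()  -- optional: helps to spot stale locks
);

-- After work: release the users claimed by this app instance.
-- The ids are placeholders for whatever INSERT ... RETURNING gave back.
DELETE FROM my_locks
WHERE  user_id IN (1, 2, 3);
```

If an app instance can crash between claiming and releasing, something has to clean up its rows eventually. A timestamp like the assumed locked_at column is one way to identify them.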

