Isolation Levels

Isolation Levels in YugabyteDB.

YSQL

Yugabyte supports three isolation levels in the transactional layer - Serializable, Snapshot and Read Committed. PostgreSQL (and the SQL standard) have four isolation levels - Serializable, Repeatable read, Read Committed and Read uncommitted. The mapping between the PostgreSQL isolation levels in YSQL, along with which transaction anomalies can occur at each isolation level are shown below.

PostgreSQL Isolation	YugabyteDB Equivalent	Dirty Read	Nonrepeatable Read	Phantom Read	Serialization Anomaly
Read uncommitted	Read Committed^$	Allowed, but not in YSQL	Possible	Possible	Possible
Read committed	Read Committed^$	Not possible	Possible	Possible	Possible
Repeatable read	Snapshot	Not possible	Not possible	Allowed, but not in YSQL	Possible
Serializable	Serializable	Not possible	Not possible	Not possible	Not possible

^$ Read Committed Isolation is supported only if the tserver gflag yb_enable_read_committed_isolation is set to true. By default this gflag is false and in this case the Read Committed isolation level of Yugabyte's transactional layer falls back to the stricter Snapshot Isolation (in which case READ COMMITTED and READ UNCOMMITTED of YSQL also in turn use Snapshot Isolation). Read Committed support is currently in Beta.

Note

The default isolation level for the YSQL API is essentially Snapshot (i.e., same as PostgreSQL's REPEATABLE READ) because READ COMMITTED, which is the YSQL API's (and also PostgreSQL's) syntactic default, maps to Snapshot Isolation (unless the tserver gflag yb_enable_read_committed_isolation is set to true).

To set the transaction isolation level of a transaction, use the command SET TRANSACTION.

As seen from the table above, the most strict isolation level is Serializable, which requires that any concurrent execution of a set of Serializable transactions is guaranteed to produce the same effect as running them in some serial (one transaction at a time) order. The other levels are defined by which anomalies must not occur as a result of interactions between concurrent transactions. Due to the definition of Serializable isolation, none of these anomalies are possible at that level. For reference, the various transaction anomalies are described briefly below:

Dirty read: A transaction reads data written by a concurrent uncommitted transaction.
Nonrepeatable read: A transaction re-reads data it has previously read and finds that data has been modified by another transaction (that committed since the initial read).
Phantom read: A transaction re-executes a query returning a set of rows that satisfy a search condition and finds that the set of rows satisfying the condition has changed due to another recently-committed transaction.
Serialization anomaly: The result of successfully committing a group of transactions is inconsistent with all possible orderings of running those transactions one at a time.

Let us now look at how Serializable, Snapshot and Read Committed isolation works in YSQL.

Serializable Isolation

The Serializable isolation level provides the strictest transaction isolation. This level emulates serial transaction execution for all committed transactions; as if transactions had been executed one after another, serially, rather than concurrently. Serializable isolation can detect read-write conflicts in addition to write-write conflicts. This is accomplished by writing provisional records for read operations as well.

Let's use a bank overdraft protection example to illustrate this case. The hypothetical case is that there is a bank which allows depositors to withdraw money up to the total of what they have in all accounts. The bank will later automatically transfer funds as needed to close the day with a positive balance in each account. Within a single transaction they check that the total of all accounts exceeds the amount requested.

Let's say someone tries to withdraw $900 from two of their accounts simultaneously, each with $500 balances. At the REPEATABLE READ transaction isolation level, that could work; but if the SERIALIZABLE transaction isolation level is used, a read/write conflict will be detected and one of the transactions will be rejected.

The example can be set up with these statements:

create table account
  (
    name text not null,
    type text not null,
    balance money not null default '0.00'::money,
    primary key (name, type)
  );
insert into account values
  ('kevin','saving', 500),
  ('kevin','checking', 500);

session #1	session #2
Begin a transaction in session #1 with the Serializable isolation level. The account total is $1000, so a $900 withdrawal is OK. `begin isolation level serializable; select type, balance from account where name = 'kevin'; type \| balance ----------+--------- saving \| $500.00 checking \| $500.00 (2 rows)`
	Begin a transaction in session #2 with the Serializable isolation level as well. Once again, the account total is $1000, so a $900 withdrawal is OK. `begin isolation level serializable; select type, balance from account where name = 'kevin'; type \| balance ----------+--------- saving \| $500.00 checking \| $500.00 (2 rows)`
Withdraw $900 from the savings account, given the total is $1000 this should be OK. `update account set balance = balance - 900::money where name = 'kevin' and type = 'saving';`
	Simultaneously, withdrawing $900 from the checking account is going to be a problem. This cannot co-exist with the other transaction's activity. This transaction would fail immediately. `update account set balance = balance - 900::money where name = 'kevin' and type = 'checking';` `ERROR: 40001: Operation failed. Try again.: Transaction aborted: XXXX`
This transaction can now be committed. `commit; select type, balance from account where name = 'kevin'; type \| balance ----------+---------- checking \| $500.00 saving \| -$400.00 (2 rows)`

Snapshot Isolation

The Snapshot isolation level only sees data committed before the transaction began (or in other words, it works on a "snapshot" of the table). Transactions running under Snapshot isolation do not see either uncommitted data or changes committed during transaction execution by other concurrently running transactions. Note that the query does see the effects of previous updates executed within its own transaction, even though they are not yet committed. This is a stronger guarantee than is required by the SQL standard for the REPEATABLE READ isolation level.

Snapshot isolation detects only write-write conflicts, it does not detect read-write conflicts. In other words:

INSERT, UPDATE, and DELETE commands behave the same as SELECT in terms of searching for target rows. They will only find target rows that were committed as of the transaction start time.
If such a target row might have already been updated (or deleted or locked) by another concurrent transaction by the time it is found. This scenario is called a transaction conflict, where the current transaction conflicts with the transaction that made (or is attempting to make) an update. In such cases, one of the two transactions get aborted, depending on priority.

Note

Applications using this level must be prepared to retry transactions due to serialization failures.

Let's run through the scenario below to understand how transactions behave under the snapshot isolation level (which PostgreSQL's Repeatable Read maps to).

First, create an example table with sample data.

CREATE TABLE IF NOT EXISTS example (k INT PRIMARY KEY);
TRUNCATE TABLE example;

Next, connect to the cluster using two independent ysqlsh instances called session #1 and session #2 below.

Note

You can connect the session #1 and session #2 ysqlsh instances to the same server, or to different servers.

session #1	session #2
Begin a transaction in session #1. This will be snapshot isolation by default, meaning it will work against a snapshot of the database as of this point. `BEGIN TRANSACTION;`
Insert a row, but let's not commit the transaction. This row should be visible only to this transaction. `INSERT INTO example VALUES (1); SELECT * FROM example; k --- 1 (1 row)`
	Insert a different row here. Verify that the row inserted in the transaction in session #1 is not visible in this session. `INSERT INTO example VALUES (2); SELECT * FROM example; k --- 2 (1 row)`
The row inserted in the other session would not be visible here, because we're working against an older snapshot of the database. Let's verify that. `SELECT * FROM example; k --- 1 (1 row)`
Now let's commit this transaction. As long as the row(s) we're writing as a part of this transaction are not modified during the lifetime of the transaction, there would be no conflicts. Let's verify we can see all rows after the commit. `COMMIT; SELECT * FROM example; k --- 1 2 (2 rows)`

Read Committed Isolation

This is same as Snapshot Isolation except that every statement in the transaction will see all data that has been committed before it is issued (note that this implicitly also means that the statement will see a consistent snapshot). In other words, each statement works on a new "snapshot" of the database that includes everything that is committed before the statement is issued. Conflict detection is the same as in Snapshot Isolation.

session #1	session #2
Create a sample table. `CREATE TABLE test (k int PRIMARY KEY, v int); INSERT INTO test VALUES (1, 2);`
By default, the tserver gflag yb_enable_read_committed_isolation=false. In this case, Read Committed maps to Snapshot Isolation at the transactional layer. So, READ COMMITTED of YSQL API in turn maps to Snapshot Isolation. `BEGIN TRANSACTION ISOLATION LEVEL READ COMMITTED; SELECT * FROM test; k \| v ---+--- 1 \| 2 (1 row)`
	Insert a new row. `INSERT INTO test VALUES (2, 3);`
Perform read again in the same transaction. Note that the recently inserted row (2, 3) isn't visible to the statement because Read Committed is disabled at the transactional layer and maps to Snapshot (in which the whole transaction sees a consistent snapshot of the database). `SELECT * FROM test; COMMIT; k \| v ---+--- 1 \| 2 (1 row)`
Set tserver gflag yb_enable_read_committed_isolation=true `BEGIN TRANSACTION ISOLATION LEVEL READ COMMITTED; SELECT * FROM test; k \| v ---+--- 1 \| 2 2 \| 3 (2 rows)`
	In another session, insert a new row. `INSERT INTO test VALUES (3, 4);`
Perform read again in the same transaction. This time, the statement will be able to see the row (3, 4) that was committed after this transaction was started but before the statement was issued. `SELECT * FROM test; k \| v ---+--- 1 \| 2 2 \| 3 3 \| 4 (3 rows)`