Read and Write Concern
- Status: Accepted
- Minimum Server Version: 2.4
Abstract
A driver must support configuring and sending read concern and write concerns to a server. This specification defines the API drivers must implement as well as how that API is translated into messages for communication with the server.
META
The keywords “MUST”, “MUST NOT”, “REQUIRED”, “SHALL”, “SHALL NOT”, “SHOULD”, “SHOULD NOT”, “RECOMMENDED”, “MAY”, and “OPTIONAL” in this document are to be interpreted as described in RFC 2119.
Terminology
MaxWireVersion The maxWireVersion value reported by the hello command.
Server Selection The process of selecting a server to read or write from. See the server selection specification.
Specification
This specification includes guidance for implementing Read Concern and Write Concern in a driver. It does not define how read and write concern behave or are implemented on the server.
Read Concern
For naming and deviation guidance, see the CRUD specification. Defined below are the constructs for drivers.
enum ReadConcernLevel {
/**
* This is rendered as "local" (lower-case) on the wire.
*/
local,
/**
* This is rendered as "majority" (lower-case) on the wire.
*/
majority,
/**
* This is rendered as "linearizable" (lower-case) on the wire.
*/
linearizable,
/**
* This is rendered as "available" (lower-case) on the wire.
*/
available,
/**
* This is rendered as "snapshot" (lower-case) on the wire.
*/
snapshot
}
class ReadConcern {
/**
* The level of the read concern.
*/
level: Optional<ReadConcernLevel | String>
}
The read concern option is available for the following operations:
aggregatecommandcountcommanddistinctcommandfindcommandmapReducecommand where theoutoption is{ inline: 1 }parallelCollectionScancommandgeoNearcommandgeoSearchcommand
Starting in MongoDB 4.2, an aggregate command with a write stage (e.g. $out, $merge) supports a readConcern;
however, it does not support the “linearizable” level (attempting to do so will result in a server error).
Server versions before 4.2 do not support a readConcern at all for aggregate commands with a write stage.
The mapReduce command where the out option is anything other than { inline: 1 } does not support a readConcern.
Unknown Levels and Additional Options for String Based ReadConcerns
For forward compatibility, a driver MUST NOT raise an error when a user provides an unknown level or additional
options. The driver relies on the server to validate levels and other contents of the read concern.
Server’s Default Read Concern
When a ReadConcern is created but no values are specified, it should be considered the server’s default ReadConcern.
readconcern>: { } is not the same as readconcern>: { level="local"}. The former is the server’s default
ReadConcern while the latter is the user explicitly specifying a ReadConcern with a level of “local”.
Snapshot Read Concern
When a ReadConcern level snapshot is used, atClusterTime may be specified to indicate the desired point in time
for reading. find, aggregate and distinct operations executed with ReadConcern snapshot but without
atClusterTime will return atClusterTime timestamp in the server response. The obtained atClusterTime timestamp can
be used for subsequent read operations. ReadConcern level snapshot with clusterTime is supported in find,
aggregate and distinct operations.
On the Wire
Read Commands
Read commands that support ReadConcern take a named parameter spelled (case-sensitively) readConcern. See command
documentation for further examples.
If the Client, Database, or Collection being operated on either has no ReadConcern set, or has the server
default ReadConcern - readconcern>: { }:
- If the
ReadConcernspecified for the command is the server defaultreadConcern: { }, the driver MUST omit it when sending the command. - If the
ReadConcernspecified for the command is anyReadConcernbesides the server default, including an explicitly specifiedReadConcernofreadConcern: { level: "local" }, the driver MUST include theReadConcernwhen sending the command.
If the Client, Database, or Collection being operated on has a non-default ReadConcern specified, then the
driver MUST include the command’s ReadConcern when sending the command. This includes if the command specifies the
server default ReadConcern, so that the command can override the Client, Database, or Collection’s ReadConcern
to use the server default instead.
Generic Command Method
If your driver offers a generic RunCommand method on your database object, ReadConcern MUST NOT be applied
automatically to any command. A user wishing to use a ReadConcern in a generic command must supply it manually.
Errors
ReadConcern errors from a server MUST NOT be handled by a driver. There is nothing a driver can do about them and any
such errors will get propagated to the user via normal error handling.
Location Specification
Via Code
ReadConcern SHOULD be specifiable at the Client, Database, and Collection levels. Unless specified, the value
MUST be inherited from its parent and SHOULD NOT be modifiable on an existing Client, Database or Collection. In
addition, a driver MAY allow it to be specified on a per-operation basis in accordance with the CRUD specification.
For example:
var client = new MongoClient({ readConcern: { level: "local" } });
// db1's readConcern level is "local".
var db1 = client.getDatabase("db1");
// col1's readConcern level is "local"
var col1 = db1.getCollection("col_name");
// db2's readConcern level is "majority".
var db2 = client.getDatabase("db_name", { readConcern: { level: "majority" } });
// col2's readConcern level is "majority"
var col2 = db2.getCollection("col_name");
// col3's readConcern level is the server's default read concern
var col3 = db2.getCollection("col_name", { readConcern: { } });
Via Connection String
Options
readConcernLevel- defines the level for the read concern.
For example:
mongodb://server:27017/db?readConcernLevel=majority
Errors
- MaxWireVersion < 4 Only the server’s default
ReadConcernis support by MaxWireVersion < 4. When using otherreadConcernLevelswith clients reportingMaxWireVersion< 4, the driver MUST raise an error. This check MUST happen after server selection has occurred in the case of mixed version clusters. It is up to users to appropriately define aReadPreferencesuch that intermittent errors do not occur.
Note
ReadConcernis only supported for commands.
Write Concern
When a driver sends a write concern document to the server, the structure of the write concern document MUST be as follows:
class WriteConcern {
/**
* If true, wait for the the write operation to get committed to the
* journal. When unspecified, a driver MUST NOT send "j".
*
* @see https://www.mongodb.com/docs/manual/core/write-concern/#journaled
*/
j: Optional<Boolean>,
/**
* When an integer, specifies the number of nodes that should acknowledge
* the write and MUST be greater than or equal to 0.
* When a string, indicates tags. "majority" is defined, but users
* could specify other custom error modes.
* When not specified, a driver MUST NOT send "w".
*/
w: Optional<Int32 | String>,
/**
* If provided, and the write concern is not satisfied within the
* specified timeout (in milliseconds), the server will return an error
* for the operation. When unspecified, a driver SHOULD NOT send "wtimeout".
*
* The value, if provided, MUST be greater than or equal to 0.
*
* @see https://www.mongodb.com/docs/manual/core/write-concern/#timeouts
*/
wtimeout: Optional<Int64>
}
When a driver provides a way for the application to specify the write concern, the following data structure SHOULD be used. For acceptable naming and deviation guidance, see the CRUD specification.
class WriteConcern {
/**
* Corresponds to the "j" field in the WriteConcern document sent to
* the server.
*/
journal: Optional<Boolean>,
/**
* Corresponds to the "w" field in the WriteConcern document sent to
* the server.
*/
w: Optional<Int32 | String>,
/**
* Corresponds to the "wtimeout" field in the WriteConcern document sent to
* the server.
*
* NOTE: This option is deprecated in favor of timeoutMS.
*/
wtimeoutMS: Optional<Int64>
}
FSync
FSync SHOULD be considered deprecated. Those drivers supporting the deprecated fsync option SHOULD treat fsync
identically to journal in terms of consistency with w and whether a WriteConcern that specifies fsync is
acknowledged or unacknowledged.
wtimeoutMS
wtimeoutMS MUST be considered deprecated in favor of
timeoutMS.
Server’s Default WriteConcern
When a WriteConcern is created but no values are specified, it should be considered the server’s default
WriteConcern.
The server has a settings field called getLastErrorDefaults which allows a user to customize the default behavior of a
WriteConcern. Because of this, writeConcern: { } is not the same as - writeconcern: {w: 1}. Sending {w:1}
overrides that default. As another example, writeConcern: { } is not the same as writeConcern: {journal: false}.
Inconsistent WriteConcern
Drivers MUST raise an error when an inconsistent WriteConcern is specified. The following is an exhaustive list of
inconsistent WriteConcerns:
writeConcern = { w: 0, journal: true };
Unacknowledged WriteConcern
An Unacknowledged WriteConcern is when (w equals 0) AND (journal is not set or is false).
These criteria indicates that the user does not care about errors from the server.
Examples:
writeConcern = { w: 0 }; // Unacknowledged
writeConcern = { w: 0, journal: false }; // Unacknowledged
writeConcern = { w: 0, wtimeoutMS: 100 }; // Unacknowledged
On the Wire
Write Commands
The insert, delete, and update commands take a named parameter, writeConcern. See the command documentation for
further examples.
When a user has not specified a WriteConcern or has specified the server’s default WriteConcern, drivers MUST omit
the writeConcern parameter from the command.
All other WriteConcerns, including the Unacknowledged WriteConcern, MUST be sent with the writeConcern parameter.
Note
Drivers MAY use
OP_INSERT,OP_UPDATE, andOP_DELETEwhen anUnacknowledged WriteConcernis used.
Generic Command Method
If your driver offers a generic RunCommand method on your database object, WriteConcern MUST NOT be applied
automatically to any command. A user wishing to use a WriteConcern in a generic command must manually include it in
the command document passed to the method.
The generic command method MUST NOT check the user’s command document for a WriteConcern nor check whether the server
is new enough to support a write concern for the command. The method simply sends the user’s command to the server
as-is.
Find And Modify
The findAndModify command takes a named parameter, writeConcern. See command documentation for further examples.
If writeConcern is specified for the Collection, writeConcern MUST be omitted when sending findAndModify with
MaxWireVersion < 4.
If the findAndModify helper accepts writeConcern as a parameter, the driver MUST raise an error with MaxWireVersion < 4.
Note
Driver documentation SHOULD include a warning in their server 3.2 compatible releases that an elevated
WriteConcernmay cause performance degradation when usingfindAndModify. This is becausefindAndModifywill now be honoring a potentially high latency setting where it did not before.
Other commands that write
Command helper methods for commands that write, other than those discussed above, MAY accept a write concern or write concern options in their parameter list. If the helper accepts a write concern, the driver MUST error if the selected server’s MaxWireVersion < 5 and a write concern has explicitly been specified.
Helper methods that apply the write concern inherited from the Collection or Database, SHOULD check whether the selected server’s MaxWireVersion >= 5 and if so, include the inherited write concern in the command on the wire. If the selected server’s MaxWireVersion < 5, these methods SHOULD silently omit the write concern from the command on the wire.
These commands that write are:
aggregatewith write stage (e.g.$out,$merge)copydbcreatecreateIndexesdropdropDatabasedropIndexesmapReducewhere theoutoption is not{ inline: 1 }clonecloneCollectioncloneCollectionAsCappedcollModconvertToCappedrenameCollectioncreateUserupdateUserdropUser
Errors
In general, server errors associated with WriteConcern return successful ("ok": 1) responses with a
writeConcernError field indicating the issue. For example,
rs0:PRIMARY> db.runCommand({insert: "foo", documents: [{x:1}], writeConcern: { w: "blah"}})
{
n: 1,
opTime: {
ts: Timestamp(1583026145, 1),
t: NumberLong(5)
},
electionId: ObjectId("7fffffff0000000000000005"),
ok: 1,
writeConcernError: {
code: 79,
codeName: "UnknownReplWriteConcern",
errmsg: "No write concern mode named 'blah' found in replica set configuration",
errInfo: {
writeConcern: {
w: "blah",
wtimeout: 0,
provenance: "clientSupplied"
}
}
},
$clusterTime: {
clusterTime: Timestamp(1583026145, 1),
signature: {
hash: BinData(0, "AAAAAAAAAAAAAAAAAAAAAAAAAAA="),
keyId: NumberLong(0)
}
},
operationTime: Timestamp(1583026145, 1)
}
Drivers SHOULD parse server replies for a “writeConcernError” field and report the error only in the command-specific helper methods for commands that write, from the list above. For example, helper methods for “findAndModify” or “aggregate” SHOULD parse the server reply for “writeConcernError”.
Drivers SHOULD report writeConcernErrors however they report other server errors: by raising an exception, returning
“false”, or another idiom that is consistent with other server errors. Drivers SHOULD report writeConcernErrors with a
WriteConcernError defined in the CRUD specification.
Drivers SHOULD NOT parse server replies for “writeConcernError” in generic command methods.
(Reporting of writeConcernErrors is more complex for bulk operations, see the Bulk API Spec.)
writeConcernError Examples
The set of possible writeConcernErrors is quite large because they can include errors caused by shutdown, stepdown, interruption, maxTimeMS, and wtimeout. This section attempts to list all known error codes that may appear within a writeConcernError but may not be exhaustive. Note that some errors have been abbreviated:
{ok:1, writeConcernError: {code: 91, codeName: "ShutdownInProgress"}}{ok:1, writeConcernError: {code: 189, codeName: "PrimarySteppedDown"}}{ok:1, writeConcernError: {code: 11600, codeName: "InterruptedAtShutdown"}}{ok:1, writeConcernError: {code: 11601, codeName: "Interrupted"}}{ok:1, writeConcernError: {code: 11602, codeName: "InterruptedDueToReplStateChange"}}{ok:1, writeConcernError: {code: 64, codeName: "WriteConcernTimeout", errmsg: "waiting for replication timed out", errInfo: {wtimeout: True}}}{ok:1, writeConcernError: {code: 64, codeName: "WriteConcernTimeout", errmsg: "multiple errors reported : {...} at shardName1 :: and :: {...} at shardName2"}}1{ok:1, writeConcernError: {code: 50, codeName: "MaxTimeMSExpired"}}{ok:1, writeConcernError: {code: 100, codeName: "UnsatisfiableWriteConcern", errmsg: "Not enough data-bearing nodes"}}{ok:1, writeConcernError: {code: 79, codeName: "UnknownReplWriteConcern"}}
Note also that it is possible for a writeConcernError to be attached to a command failure. For example:
{ok:0, code: 251, codeName: "NoSuchTransaction", writeConcernError: {code: 91, codeName: "ShutdownInProgress"}}2
Location Specification
Via Code
WriteConcern SHOULD be specifiable at the Client, Database, and Collection levels. Unless specified, the value
MUST be inherited from its parent and SHOULD NOT be modifiable on an existing Client, Database or Collection. In
addition, a driver MAY allow it to be specified on a per-operation basis in accordance with the CRUD specification.
For example:
var client = new MongoClient({ writeConcern: { w: 2 } });
// db1's writeConcern is {w: 2}.
var db1 = client.getDatabase("db1");
// col1's writeConcern is {w: 2}.
var col1 = db1.getCollection("col_name");
// db2's writeConcern is {journal: true}.
var db2 = client.getDatabase("db_name", { writeConcern: { journal: true } });
// col2's writeConcern {journal: true}.
var col2 = db2.getCollection("col_name");
// col3's writeConcern is the server's default write concern.
var col3 = db2.getCollection("col_name", { writeConcern: { } });
// Override col3's writeConcern.
col3.drop({ writeConcern: { w: 3 } });
Via Connection String
Options
w- corresponds towin the class definition.journal- corresponds tojournalin the class definition.wtimeoutMS- corresponds towtimeoutMSin the class definition.
For example:
mongodb://server:27017/db?w=3
mongodb://server:27017/db?journal=true
mongodb://server:27017/db?wtimeoutMS=1000
mongodb://server:27017/db?w=majority&wtimeoutMS=1000
Backwards Compatibility
There should be no backwards compatibility concerns. This specification merely deals with how to specify read and write concerns.
Test Plan
Yaml tests are located here
Below are English descriptions of other items that should be tested:
ReadConcern
- Commands supporting a read concern MUST raise an error when MaxWireVersion is less than 4 and a non-default, non-local read concern is specified.
- Commands supporting a read concern MUST NOT send the default read concern to the server.
- Commands supporting a read concern MUST send any non-default read concern to the server.
WriteConcern
- Commands supporting a write concern MUST NOT send the default write concern to the server.
- Commands supporting a write concern MUST send any non-default write concern to the server in the command.
Reference Implementation
These are currently under construction.
Q & A
Q: Why is specifying a non-default ReadConcern for servers < 3.2 an error while a non-default write concern gets
ignored in findAndModify? findAndModify is an existing command and since WriteConcern may be defined globally,
anyone using findAndModify in their applications with a non-default WriteConcern defined globally would have all
their findAndModify operations fail.
Q: Why does a driver send { readConcern: { level: "local" } } to the server when that is the server’s default? First,
to mirror how WriteConcern already works, ReadConcern() does not equal ReadConcern(level=local) in the same way that
WriteConcern() does not equal WriteConcern(w=1). This is true for WriteConcern because the server’s default could be
set differently. While this setting does not currently exist for ReadConcern, it is a possible eventuality and it
costs a driver nothing to be prepared for it. Second, it makes sense that if a user doesn’t specify a ReadConcern, we
don’t send one and if a user does specify a ReadConcern, we do send one. If the user specifies level=“local”, for
instance, we send it.
Changelog
-
2025-02-25: Rename WriteConcernFailed to WriteConcernTimeout
-
2015-10-16: ReadConcern of local is no longer allowed to be used when talking with MaxWireVersion < 4.
-
2016-05-20: Added note about helpers for commands that write accepting a writeConcern parameter.
-
2016-06-17: Added “linearizable” to ReadConcern levels.
-
2016-07-15: Command-specific helper methods for commands that write SHOULD check the server’s MaxWireVersion and decide whether to send writeConcern. Advise drivers to parse server replies for writeConcernError and raise an exception if found, only in command-specific helper methods that take a writeConcern parameter, not in generic command methods. Don’t mention obscure commands with no helpers.
-
2016-08-06: Further clarify that command-specific helper methods for commands that write take write concern options in their parameter lists, and relax from SHOULD to MAY.
-
2017-03-13: reIndex silently ignores writeConcern in MongoDB 3.4 and returns an error if writeConcern is included with MongoDB 3.5+. See SERVER-27891.
-
2017-11-17: Added list of commands that support readConcern
-
2017-12-18: Added “available” to Readconcern level.
-
2017-05-29: Added user management commands to list of commands that write
-
2019-01-29: Added section listing all known examples of writeConcernError.
-
2019-06-07: Clarify language for aggregate and mapReduce commands that write.
-
2019-10-31: Explicitly define write concern option mappings.
-
2020-02-13: Inconsistent write concern must be considered an error.
-
2021-04-07: Updated to use hello command.
-
2021-06-15: Added “snapshot” to Readconcern level
-
2021-07-12: Add missing commas after ReadConcernLevel enum values
-
2022-01-19: Deprecate wTimeoutMS in favor of timeoutMS.
-
2022-10-05: Remove spec front matter and reformat changelog.
-
2024-08-23: Migrated from reStructuredText to Markdown.
-
2024-10-30: Remove reference to getLastError
-
This is only possible in a sharded cluster. When a write is routed to multiple shards and more than one shard returns a writeConcernError, then mongos will construct a new writeConcernError with the “WriteConcernTimeout” error code and an errmsg field contains the stringified writeConcernError from each shard. Note that each shard may return a different writeConcernError. ↩