MongoDB
MongoDB engine is read-only table engine which allows to read data from a remote MongoDB collection.
Only MongoDB v3.6+ servers are supported.
Seed list(mongodb+srv) is not yet supported.
Creating a Table
Engine Parameters
- 
host:port— MongoDB server address.
- 
database— Remote database name.
- 
collection— Remote collection name.
- 
user— MongoDB user.
- 
password— User password.
- 
options— MongoDB connection string options (optional parameter).
- 
oid_columns- Comma-separated list of columns that should be treated asoidin the WHERE clause._idby default.
If you are using the MongoDB Atlas cloud offering connection url can be obtained from 'Atlas SQL' option.
Seed list(mongodb**+srv**) is not yet supported, but will be added in future releases.
Alternatively, you can pass a URI:
Engine Parameters
- 
uri— MongoDB server's connection URI.
- 
collection— Remote collection name.
- 
oid_columns- Comma-separated list of columns that should be treated asoidin the WHERE clause._idby default.
Types mappings
| MongoDB | ClickHouse | 
|---|---|
| bool, int32, int64 | any numeric type, String | 
| double | Float64, String | 
| date | Date, Date32, DateTime, DateTime64, String | 
| string | String | 
| document | String(as JSON) | 
| array | Array, String(as JSON) | 
| oid | String | 
| binary | String if in column, base64 encoded string if in an array or document | 
| uuid (binary subtype 4) | UUID | 
| any other | String | 
If key is not found in MongoDB document (for example, column name doesn't match), default value or NULL (if the column is nullable) will be inserted.
OID
If you want a String to be treated as oid in the WHERE clause, just put the column's name in the last argument of the table engine.
This may be necessary when querying a record by the _id column, which by default has oid type in MongoDB.
If the _id field in the table has other type, for example uuid, you need to specify empty oid_columns, otherwise the default value for this parameter _id is used.
By default, only _id is treated as oid column.
In this case the output will be 0, because ClickHouse doesn't know that another_oid_column has oid type, so let's fix it:
Supported clauses
Only queries with simple expressions are supported (for example, WHERE field = <constant> ORDER BY field2 LIMIT <constant>).
Such expressions are translated to MongoDB query language and executed on the server side.
You can disable all these restriction, using mongodb_throw_on_unsupported_query.
In that case ClickHouse tries to convert query on best effort basis, but it can lead to full table scan and processing on ClickHouse side.
It's always better to explicitly set type of literal because Mongo requires strict typed filters.
For example you want to filter by Date:
This will not work because Mongo will not cast string to Date, so you need to cast it manually:
This applied for Date, Date32, DateTime, Bool, UUID.
Usage Example
Assuming MongoDB has sample_mflix dataset loaded
Create a table in ClickHouse which allows to read data from MongoDB collection:
Query:
Troubleshooting
You can see the generated MongoDB query in DEBUG level logs.
Implementation details can be found in mongocxx and mongoc documentations.