Over a million developers have joined DZone.

Bulk Insert in MongoDB With a C# Driver [Code Snippet]

In this post we take a look at how to efficiently save a large collection of documents in Mongo. Read on for details.

· Database Zone

Build fast, scale big with MongoDB Atlas, a hosted service for the leading NoSQL database. Try it now! Brought to you in partnership with MongoDB.

There are situations where you need to save a lot of documents inside a collection in MongoDB. My scenario is a migration of documents from a collection to another database, with in-memory manipulation of the documents.

The most common error in these situations is to read the documents from the original collection, then execute a function that modify the document in-memory, and finally issuing an insert in destination collection. This is wrong because you have a roundtrip against MongoDB for each document you are saving.

Whenever you are calling Insert or Save function, you are paying the penality of a call to MongoDb process, network latency, etc, whenever possible you should reduce the number of calls to database engine.

In such a scenario, the MongoDB driver has a function called InsertBatch that allows you to insert documents in batches — and the fun part is that it simply accepts an IEnumerable. As an example, I have a function that manipulates a BsonDocument stored in a variable called Action. I have source and test database where I need to copy documents with manipulation and this is the code that does everything.

var sourceQueue = source.GetCollection(queue);
var destQueue = dest.GetCollection(queue);

if (sourceQueue.Count() == 0) return;

//migrate counterCollection
Console.WriteLine("Migrating Queue " + queue);
var allElement = sourceQueue
.FindAll()
.AsEnumerable()
.Select(document => {
    Action(document);
    return document;
});
destQueue.InsertBatch(allElement);

The name of the collection is contained in queue variable (actually I’m transforming a software that manages jobs), and as you can verify I can simply enumerate all source documents with FindAll (this code uses the old 1.10 driver), for each object I’m calling the Action function that manipulates the document, and finally I can simply use the InsertBatch to insert documents in batches.

This function runs faster than saving each document with a separate call, even if the MongoDB instance runs on the very same machine, so you do not pay the network latency.

If you use the latest version of the drivers, you have the InsertMany method that offers even more options and basically does the very same operation as InsertBatch.

Now it's easier than ever to get started with MongoDB, the database that allows startups and enterprises alike to rapidly build planet-scale apps. Introducing MongoDB Atlas, the official hosted service for the database on AWS. Try it now! Brought to you in partnership with MongoDB.

Topics:
database ,driver ,mongodb ,c# ,insert

Published at DZone with permission of Ricci Gian Maria, DZone MVB. See the original article here.

Opinions expressed by DZone contributors are their own.

The best of DZone straight to your inbox.

SEE AN EXAMPLE
Please provide a valid email address.

Thanks for subscribing!

Awesome! Check your inbox to verify your email so you can start receiving the latest in tech news and resources.
Subscribe

{{ parent.title || parent.header.title}}

{{ parent.tldr }}

{{ parent.urlSource.name }}