How to Merge DOCX Files and Replace Text Strings in Java
Combine multiple Office Word Documents (docx) into a single document; Instantly find and replace specific text string variables like names and dates.
Join the DZone community and get the full member experience.Join For Free
Microsoft Word is one of the most recognized and utilized word processing programs across all demographics and is often the default formatted-text software for business use. With the ability to easily format and edit text files of any type or length, a wide variety of document genres can be created with basic user knowledge.
When working with multiple documents simultaneously, however, it may be useful to combine two or more files to create a longer file containing all the necessary information for your needs. For example, if you are building a documentation report detailing your services or organizing contract packets for potential partners, different pieces may be stored in separate files. This means that you will then need to go through each file and manually add the content into a master document file. With this comes possible issues with formatting compatibility and accidental, unwanted editing or erasure of text.
Rather than attempting to copy each separate file to create your document, the following Convert APIs will allow you to merge DOCX files instantly while maintaining formatting. We will begin by looking at our process for combining two files into one. Then, we will go over how you can add more input files to create much longer and more complex documents for bigger projects. Then, we will show you how to replace text strings within the final Word document to aid in adding user input such as names, dates, and other variable information. This will allow you to create templates to be used recurrently, decreasing the time spent on organizing and parsing your files.
Our goal with this tutorial is to allow your business to share information and documentation more easily with your clients and partners. Following these steps will not only boost your time to deliverable but also improve your organization’s overall standardization and interfacing.
Our first move for all of these APIs will be to install our SDKs with Maven. To do this, we will add a Jitpack reference to the repository in pom.xml:
Then, we can add a reference to the dependency:
After this we can begin calling our functions, starting with a two-document merge. To do this, you’ll first need to add the imports to the top of the file.
Then, you can call the function, MergeDocumentDocx. For input, you will need to add in the two DOCX files you are wishing to combine.
The response to this function should be a single file that you can then save and use immediately. To ensure that this API works properly, you need to ensure certain requirements are met:
Your input files are valid and not encrypted or password protected
You have input your API Key. This can be retrieved at no cost on the Cloudmersive website, providing 800 monthly calls across our API library.
You may notice a note within this code block stating that more than two files can be supplied. This references our next function, which will allow you to add up to ten input files at once. However, for both functions, it is possible to perform multiple concurrent merges. For example, if you already have a merged document that originated as two or more separate files, you can then use it as one of your input files in the following merge.
To use our next API, you will once again install the SDK with Maven and add the imports to the top of the file as shown above. Then, you can call the function, MergeDocumentDocxMulti:
This will help you organize and simplify your documentation projects to increase your efficiency, improve document appearance, and decrease time-to-delivery to your clients.
Our final API, as shown below, will allow you to take your now merged documents and replace all instances of specific text strings with new input. This means that you can easily change the names, dates, and other variables instantly, without needing to manually parse the document. With this function, you can also specify whether text case should be matched with a default of unmatched case.
As with the two previous APIs, we will first install our SDK with Maven. However, the next step should differ slightly because the function is a Transformation API, not a Merge API. This will be reflected in the imports as shown here:
Once you have placed the imports, you can call our final function, TransformDocumentDocxReplace:
This, too, will allow you to decrease project time and allow you to quickly adjust documents to meet deliverable requirements.
If you have any questions about using these APIs or inquiries concerning other API solutions, you can visit the Cloudmersive website where our team is happy to help with anything you might need.
Opinions expressed by DZone contributors are their own.