Converting Word to Pdf in Java Example

Converting Word to Pdf in Java Example

I am updating this post to provide,

  • maven project with all the required dependencies
  • Resolved log4j warning at the top of the converted pdf.

 

Converting Word to Pdf in Java Example:

 

pom.xml:

To keep the post clean I kept only the required portion, please download the zip file for complete files.


<dependency>

<dependency>

<groupId>org.docx4j</groupId>

<artifactId>docx4j-ImportXHTML</artifactId>

<version>3.0.0</version>

<exclusions>

<exclusion>

<groupId>org.slf4j</groupId>

<artifactId>slf4j-log4j12</artifactId>

</exclusion>

<exclusion>

<groupId>log4j</groupId>

<artifactId>log4j</artifactId>

</exclusion>

</exclusions>

</dependency>

 

WordToPDFConvert.java:

Please create the folder named mirthbees under your c drive(if you want to create in other places or other names, accordingly update the path details in the below program inputWordPath & outputPDFPath varaibles) and have the input word document there, ensure it is not opened when you running this conversion program.

 


package in.javadomain;

import java.io.File;
import java.io.FileInputStream;
import java.io.FileOutputStream;
import java.io.InputStream;
import java.util.List;

import org.docx4j.Docx4J;
import org.docx4j.fonts.IdentityPlusMapper;
import org.docx4j.fonts.Mapper;
import org.docx4j.model.structure.SectionWrapper;
import org.docx4j.openpackaging.packages.WordprocessingMLPackage;

/**
* This converts Docx file to PDF
* Author: Naveen - Javadomain.in
*/
public class WordToPDFConvert {
public static void main(String[] args) {
String inputWordPath = "C:\\mirthbees\\javadomain.docx";
String outputPDFPath = "C:\\mirthbees\\javadomain.pdf";
try {
System.err.println("Word Document to PDF Convert Begins!");
InputStream is = new FileInputStream(new File(inputWordPath));
WordprocessingMLPackage wordMLPackage = WordprocessingMLPackage.load(is);
List sections = wordMLPackage.getDocumentModel().getSections();
for (int i = 0; i < sections.size(); i++) {
wordMLPackage.getDocumentModel().getSections().get(i).getPageDimensions().setHeaderExtent(3000);
}
Mapper fontMapper = new IdentityPlusMapper();
// For font specific, enable the below lines
// PhysicalFont font = PhysicalFonts.getPhysicalFonts().get("Comic
// Sans MS");
// fontMapper.getFontMappings().put("Algerian", font);
wordMLPackage.setFontMapper(fontMapper);
Docx4J.toPDF(wordMLPackage, new FileOutputStream(outputPDFPath));
System.err.println("Your Word Document Converted to PDF Successfully!");
} catch (Exception e) {
e.printStackTrace();
}
}
}

 

 

Resolving “To HIDE THESE MESSAGES, TURN OFF log4j debug level logging for org.docx4j.convert.out.pdf.viaXSLFO”

Create docx4j.properties in your classpath (src/main/resources) and paste the below file contents to resolve this “To HIDE THESE MESSAGES, TURN OFF log4j debug level logging for org.docx4j.convert.out.pdf.viaXSLFO”

Docx4j.properties Github link 

 
 
 

 
 
 

New Maven Project Input & Output Screenshots :

Input Word File:

Converting Word to Pdf in Java Example

 

Output PDF File:

Converting Word to Pdf in Java Example

 

 
 
 

[Below for your reference only, for better results, use the above maven projects only.]

 

Old Post: Just for reference only, check the above mentioned maven project only.

Converting Word to Pdf in Java:
Jars:

avalon-framework-4.1.5
commons-io-2.4
docx4j-2.7.1
log4j-1.2.15
serializer-2.7.1
xmlgraphics-commons-1.3
batik-util-1.6-1
commons-logging-1.1.3
fop-0.93
xalan-2.7.1

Java Program:

package in.javadomain;

import java.io.File;
import java.io.FileInputStream;
import java.io.FileOutputStream;
import java.io.InputStream;
import java.io.OutputStream;
import java.util.List;

import org.docx4j.convert.out.pdf.viaXSLFO.PdfSettings;
import org.docx4j.fonts.IdentityPlusMapper;
import org.docx4j.fonts.Mapper;
import org.docx4j.fonts.PhysicalFont;
import org.docx4j.fonts.PhysicalFonts;
import org.docx4j.openpackaging.packages.WordprocessingMLPackage;

public class Word2Pdf {
public static void main(String[] args) {
try {

long start = System.currentTimeMillis();

InputStream is = new FileInputStream(
new File("D:\\javadomain.docx"));
WordprocessingMLPackage wordMLPackage = WordprocessingMLPackage
.load(is);
List sections = wordMLPackage.getDocumentModel().getSections();
for (int i = 0; i < sections.size(); i++) {

System.out.println("sections Size" + sections.size());
wordMLPackage.getDocumentModel().getSections().get(i)
.getPageDimensions().setHeaderExtent(3000);
}
Mapper fontMapper = new IdentityPlusMapper();

PhysicalFont font = PhysicalFonts.getPhysicalFonts().get(
"Comic Sans MS");

fontMapper.getFontMappings().put("Algerian", font);

wordMLPackage.setFontMapper(fontMapper);
PdfSettings pdfSettings = new PdfSettings();
org.docx4j.convert.out.pdf.PdfConversion conversion = new org.docx4j.convert.out.pdf.viaXSLFO.Conversion(
wordMLPackage);

OutputStream out = new FileOutputStream(new File(
"D:\\javadomain.pdf"));
conversion.output(out, pdfSettings);
System.err.println("Time taken to Generate pdf "
+ (System.currentTimeMillis() - start) + "ms");

} catch (Exception e) {
e.printStackTrace();
}
}
}

Input [word file]

word input

Output [pdf file]

pdf output

Recommended Books:

7,416 total views, 1 views today

2 thoughts on “Converting Word to Pdf in Java Example

  1. If the code above does not work for all your Word documents, please consider our java library jWordConvert.

    It takes 2 lines to convert a Word document to PDF:

    // Load the Word document
    WordDocument wdoc = new WordDocument ("input.doc");

    // Save the document as a PDF file
    wdoc.saveAsPDF ("output.pdf");

  2. Hi Mirthbees ,

    Your tutorial is good. I have generated PDF file. But ‘ TO HIDE THESE MESSAGES, TURN OFF log4j debug level logging for org.docx4j.convert.out.pdf.viaXSLFO ‘ message come by default. So to hide this message , I have used ” org.docx4j.convert.out.pdf.viaXSLFO.Conversion.log.setLevel(Level.OFF); ” line. But problem is when generating war its making problem.

Leave a Reply

Your email address will not be published. Required fields are marked *