wiki:WikiStart

docx4j component

/projects/docx4j

News

User survey: if you are using docx4j, please take the survey, at  http://www.plutext.com/limesurvey/index.php?sid=64431&lang=en

docx4j v2.5.0 is out. Includes pptx4j.

Purpose

docx4j is our library for unzipping a docx "package", and parsing the WordprocessingML XML to create an in-memory representation in Java.

It is available under the Apache License (v2).

This library is used by the plutext collaboration server, and the docx4all editor.

The library is designed to round trip docx files with 100% fidelity, and supports all WordML.

Getting started

We have a Getting Started guide, in  HTML and  PDF.

The HTML version is also available in the following languages:

Download

A nightly build can be found  here (built for Java 1.5+).

Otherwise, you can set up your own development environment from scratch.

Requirements and Dependencies

You currently need to be using Java 1.5+. See  here regarding issues with 1.4, which retrotranslator can  reportedly solve.

docx4j relies on quite a few third party jar files. We are grateful these are available. Please see here or  better here for details.

Heritage

This library is based originally on  OpenXML4J, and a number of its source files retain that project's copyright notice. We gratefully acknowledge OpenXML4J's work, and encourage you to use it directly or via Apache POI, especially if you are working with spreadsheets. Here is a stab at some of the differences.

Keywords

docx java, Office Open XML, OpenXML, ooxml, WordML, WordprocessingML