InfoQ

Jakarta POI 3.0 - Java API To Access Microsoft Office Format Files

Posted by James Kao on May 28, 2007

Sections: Development, Architecture & Design
Topics: Java plus .NET Integration, Java
Tags: Microsoft Office

Version 3.0 of Jakarta POI, the venerable Java library for reading and writing certain Microsoft Office documents, has been released. This release adds support for MS Excel formulas, improves PowerPoint support, and adds image extraction for MS Word documents.

Users of POI deal with a number of sub-projects to handle various aspects of these formats:
POIFS: Access to the file structure of MS Office documents
HSSF: MS Excel data access
HWPF: MS Word data access
HPSF: Property attributes of Office documents
HSLF: PowerPoint data access
POI-Ruby: Ruby bindings to gcj-compiled binary libraries

The programming paradigm is relatively simple. A developer uses POIFS to create or open the document as a stream, then connects that stream to the appropriate data access API (HSSF, HWPF, or HSLF) to interact with the content. Angsuman Chakraborty has written a concise piece on using POI to read Excel files; it goes into more detail and serves as a good introduction and quick start guide.
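
As a rough illustration of that pattern, the sketch below writes a one-cell Excel workbook to memory with HSSF, then reopens the bytes through POIFS and hands the result to HSSF to read the cell back. It assumes the POI jar is on the classpath; the class name `PoiRoundTrip`, the helper names `writeSample` and `readFirstCell`, and the sample text are hypothetical and chosen only for this example. Column indexes are cast to `short` because that is what the 3.0-era `createCell` and `getCell` methods take; later releases accept an `int`.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;

import org.apache.poi.hssf.usermodel.HSSFCell;
import org.apache.poi.hssf.usermodel.HSSFRow;
import org.apache.poi.hssf.usermodel.HSSFSheet;
import org.apache.poi.hssf.usermodel.HSSFWorkbook;
import org.apache.poi.poifs.filesystem.POIFSFileSystem;

// Hypothetical class for illustration; not part of POI.
public class PoiRoundTrip {

    // Build a one-cell workbook in memory so the sketch needs no file on disk.
    static byte[] writeSample(String text) throws IOException {
        HSSFWorkbook workbook = new HSSFWorkbook();
        HSSFSheet sheet = workbook.createSheet("Sheet1");
        HSSFRow row = sheet.createRow(0);
        HSSFCell cell = row.createCell((short) 0);
        cell.setCellValue(text);

        ByteArrayOutputStream out = new ByteArrayOutputStream();
        workbook.write(out);
        return out.toByteArray();
    }

    // Step 1: POIFS opens the Office container from a stream.
    // Step 2: HSSF interprets that container as an Excel workbook.
    static String readFirstCell(byte[] xls) throws IOException {
        POIFSFileSystem fs = new POIFSFileSystem(new ByteArrayInputStream(xls));
        HSSFWorkbook workbook = new HSSFWorkbook(fs);
        HSSFSheet sheet = workbook.getSheetAt(0);
        HSSFRow row = sheet.getRow(0);
        HSSFCell cell = row.getCell((short) 0);
        return cell.getStringCellValue();
    }

    public static void main(String[] args) throws IOException {
        byte[] xls = writeSample("hello");
        System.out.println(readFirstCell(xls));
    }
}
```

Reading a file from disk follows the same two steps, with a `FileInputStream` in place of the in-memory stream. Word and PowerPoint documents would presumably be opened through POIFS in the same way and then passed to HWPF or HSLF instead of HSSF.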

This is the last release of POI as a Jakarta subproject before its promotion to a top-level Apache project.
