InfoQ

InfoQ

News

My Bookmarks

Login or Register to enable bookmarks for unlimited time.

The content has been bookmarked!

There was an error bookmarking this content! Please retry.

Manipulate Office Documents from the Command Line

Posted by Jonathan Allen on Jun 24, 2008

Sections
Development
Topics
.NET ,
Scripting
Tags
OpenXML ,
PowerShell

In 2006 .NET 3.0 was released with rudimentary support for Open XML-style ZIP files. While not very useful on its own, it serves as the basis for the recently released Open XML SDK. This SDK exposes strongly typed classes for manipulating Office documents.

Almost in tandem, PowerTools for Open XML was announced. This open source project adds a collection of PowerShell commands that allow manipulating Open XML from the command line. Since it relies on the Open XML SDK and .NET 3.0, users don't need to install MS Office or mess with COM automation. This is especially important in server-side applications where Office is inherently unstable.

Below is the list of commands available in the first release.

  • Accept-OpenXmlChange: Accepts all text change tracking elements in a document.
  • Add-OpenXmlContent: Insert custom markup inside a specific part in a Wordprocessing document
  • Add-OpenXmlDigitalSignature: Inserts a new digital signature inside a Wordprocessing document
  • Add-OpenXmlDocumentIndex: Generate the index of a Wordprocessing document
  • Add-OpenXmlDocumentTOA: Generate the TOA (Table of Authorities) of a Wordprocessing document
  • Add-OpenXmlDocumentTOC: Generate the TOC (Table of Contents) of a Wordprocessing document
  • Add-OpenXmlDocumentTOF: Generate the TOF (Table of Figures) of a Wordprocessing document
  • Add-OpenXmlPicture: Inserts a picture inside an OpenXML document
  • Export-OpenXmlSpreadsheet: Generates a spreadsheet document from piped objects.
  • Export-OpenXmlToHtml: Exports Wordprocessing documents to html documents
  • Export-OpenXmlWordprocessing: Create a new Wordprocessing document from text.
  • Get-OpenXmlBackground: Extracts background information from a Wordprocessing document
  • Get-OpenXmlComment: Extracts all the comments contained in a Wordprocessing document
  • Get-OpenXmlCustomXmlData: Gets a customXml part from a document.
  • Get-OpenXmlDigitalSignature: Gets information about digital signatures present in a Wordprocessing document
  • Get-OpenXmlDocument: Creates objects for OpenXML documents
  • Get-OpenXmlFooter: Retrieves footer information from Wordprocessing documents
  • Get-OpenXmlHeader: Retrieves header(s) information from Wordprocessing documents
  • Get-OpenXmlStyle: Retrieves style definitions from a Wordprocessing document.
  • Get-OpenXmlTheme: Gets the theme content from a Wordprocessing document.
  • Get-OpenXmlWatermark: Gets watermark text from a Wordprocessing document.
  • Lock-OpenXmlDocument: Locks one or more Wordprocessing documents.
  • Remove-OpenXmlComment: Removes comments from Wordprocessing documents.
  • Remove-OpenXmlDigitalSignature: Removes the digital signature of a Wordprocessing document
  • Set-OpenXmlBackground: Sets the background color or image of a Wordprocessing document.
  • Set-OpenXmlContentFormat: Sets the format of a paragraph or run on a Wordprocessing document.
  • Set-OpenXmlContentStyle: Sets the style for a paragraph or run on a Wordprocessing document.
  • Set-OpenXmlCustomXmlData: Sets the contents of a custom XML part in a Wordprocessing document.
  • Set-OpenXmlFooter: Sets footers in a Wordprocessing document.
  • Set-OpenXmlHeader: Sets headers on a Wordprocessing document.
  • Set-OpenXmlStyle: Sets the style library for a Wordprocessing document.
  • Set-OpenXmlTheme: Sets a Wordprocessing document theme.
  • Set-OpenXmlWatermark: Sets a watermark in a Wordprocessing document.

No comments

Watch Thread Reply

Educational Content

New-age Transactional Systems - Not Your Grandpa's OLTP

John Hugg discusses high volume transaction processing applications with high and low frequency profiles, and how VoltDB can be used for that purpose.

Cool Code

Kevlin Henney examines code samples to see what can be learned from them starting from the premise that one won’t write great code unless he knows how to read it.

Collaboration: At the Extremities of Extreme

Jason Ayers share the observations he made watching a team of developers collaborating in real time on the same code base, pushing XP, pair programming and continuous integration to their extremes.

Yesod Web Framework

Michael Snoyman presents Yesod, a web framework written in Haskell and containing a web server, templating, ORM, libraries (templating, gravatar, etc.).

Transactions without Transactions

Richard Kreuter and Kyle Banker on how to avoid classical RDBMS transactional systems by using compensation mechanisms, transactional messaging or transactional procedures.

Attila Szegedi on JVM and GC Performance Tuning at Twitter

Attila Szegedi talks about performance tuning Java and Scala programs at Twitter: how to approach GC problems, the importance of asynchronous I/O, when to use MySQL/Cassandra/Redis, and much more.

10 tips on how to prevent business value risk

One category of risk that project teams need to ensure they address is business value failure – delivering a product that fails to provide value for the business investor.

Interview: Software Systems Architecture: Working With Stakeholders Using Viewpoints and Perspectives

InfoQ spoke to the authors of Software Systems Architecture on a couple of new topics, the System Context viewpoint and Agile, which have been added to the second edition.