Google has open sourced Gumbo, an HTML parsing library written in C. Gumbo adheres to the HTML5 parsing algorithm, passing all html5lib-0.95 tests, and has been tested on 2.5 billion pages indexed by Google.
InfoQ interviewed Chris Mattman from Apache Tika, a text extraction and detection library, in the occasion of the 1.0 release and the publication of the "Tika in Action" book.
Irony is a framework created by Roman Ivantsov and used to write internal DSLs or entire new languages that run on .NET, the grammar being written in C#.
LESS and Sass are Ruby tools that allow to reduce redundancy in CSS files by introducing variables, mixins, and other time proven language features into CSS. We take a look at how the two tools work and what they offer.
A port of the popular code coverage tool rcov is now available for JRuby. Ola Bini started a Hibernate-based library for persisting Ruby objects named Ribs. And finally, JRuby trunk contains a new MBean for analysing parse times.