Google Open Sources Gumbo, An HTML5 Parsing Library

by Abel Avram on Aug 14, 2013

Google has open sourced Gumbo, an HTML parsing library written in C. Gumbo adheres to the HTML5 parsing algorithm, passing all html5lib-0.95 tests, and has been tested on 2.5 billion pages indexed by Google.

Apache Tika 1.0 Allows Easy Text Extraction for Java

by Fabian Lange on Dec 28, 2011

InfoQ interviewed Chris Mattmann from Apache Tika, a text extraction and detection library, on the occasion of the 1.0 release and the publication of the "Tika in Action" book.

Writing New .NET Languages with Irony

by Abel Avram on Nov 03, 2009

Irony is a framework created by Roman Ivantsov for writing internal DSLs or entirely new languages that run on .NET, with the grammar written in C#.

DRYer CSS with LESS or Sass

by Werner Schuster on Jul 24, 2009

LESS and Sass are Ruby tools that reduce redundancy in CSS files by introducing variables, mixins, and other time-proven language features into CSS. We take a look at how the two tools work and what they offer.

JRuby Roundup: RCov Port Available, Ribs For Hibernate Support, Parser Stats

by Werner Schuster on Sep 03, 2008

A port of the popular code coverage tool rcov is now available for JRuby. Ola Bini has started Ribs, a Hibernate-based library for persisting Ruby objects. Finally, JRuby trunk contains a new MBean for analysing parse times.

InfoQ.com and all content copyright © 2006-2013 C4Media Inc. InfoQ.com hosted at Contegix, the best ISP we've ever worked with.