Posted by Ian Roughley on Oct 01, 2007
New work on the enhancement LUCENE-724 (http://issues.apache.org/jira/browse/LUCENE-724) by Marcelo Ochoa allows better integration of Lucene as a domain index within the Oracle database. The updates, which were commissioned by Lending Club (www.lendingclub.com), allow greater flexibility than Oracle Text, and the resulting work is open source that anyone can use.
Why put Lucene in the database? Marcelo Ochoa answers this question with:
Oracle includes a full-featured enterprise engine named Oracle Text, written in C and fully integrated into the Oracle engine, but:
- you cannot control which functionality will be included in the next release
- and you cannot easily customize it for your needs
The OJVMDirectory, a Lucene integration running inside the Oracle JVM, will allow developers to:
The features of the new release include:
- Synchronized with the latest Lucene production release, 2.2.0
- Replaced the in-memory, Vector-based storage with direct BLOB I/O, reducing memory usage for large indexes
- Support for user data stores, which means you are no longer limited to one column per index (a limit imposed by the Data Cartridge API on 10g); you can now index multiple columns of the base table, plus columns of related tables joined to it
- User data stores can be customized: by writing a simple Java class, users can control which columns are indexed, the padding used, or any other processing that happens before a document is added
- A DefaultUserDataStore takes all columns of the query and builds a Lucene Document with a Field for each database column; these fields are automatically padded if they hold NUMBER data or rounded if they hold DATE data, for example
- The lcontains() SQL operator supports Lucene's full QueryParser syntax, giving access to all indexed columns
- Support for the DOMAIN_INDEX_SORT and FIRST_ROWS hints: if you want rows ordered by the lscore() operator (ascending or descending), the optimizer hint assumes that the Lucene Domain Index returns rowids in the proper order, avoiding an inline view to sort them
- Automatic index synchronization using AQ callbacks
- The Lucene Domain Index creates extra tables named IndexName$T and an Oracle AQ named IndexName$Q, with its storage table IndexName$QT, in the user's schema, so you can alter the storage preferences if you want
- The ojvm project is in SourceForge.net CVS, so anybody can get it and collaborate ;)
- Tested against 10gR2 and 11g databases
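To see why a data store would pad NUMBER values and round DATE values before indexing, recall that Lucene compares indexed terms as strings. The sketch below is only an illustration of that general idea, not code from the ojvm project: the class name FieldPadding, both method names, the fixed width, and the choice to truncate dates to the day are all assumptions made for this example.

```java
import java.time.LocalDateTime;
import java.time.format.DateTimeFormatter;
import java.time.temporal.ChronoUnit;

// Hypothetical helper, not part of the ojvm project or of Lucene.
public class FieldPadding {

    // Assumed day-level format; the real data store may use another resolution.
    private static final DateTimeFormatter DAY = DateTimeFormatter.ofPattern("yyyyMMdd");

    // Left-pad a non-negative number with zeros so that string order
    // matches numeric order ("0000000042" sorts before "0000000100").
    static String padNumber(long value, int width) {
        if (value < 0) {
            throw new IllegalArgumentException("negative values are not handled in this sketch");
        }
        String padded = String.format("%0" + width + "d", value);
        if (padded.length() > width) {
            throw new IllegalArgumentException("value does not fit in " + width + " digits");
        }
        return padded;
    }

    // Drop the time of day so that all timestamps on the same day
    // produce the same indexed term.
    static String roundDateToDay(LocalDateTime value) {
        return value.truncatedTo(ChronoUnit.DAYS).format(DAY);
    }

    public static void main(String[] args) {
        System.out.println(padNumber(42, 10));
        System.out.println(padNumber(100, 10));
        System.out.println(roundDateToDay(LocalDateTime.of(2007, 10, 1, 13, 45)));
    }
}
```

Without padding, the string "100" would sort before "42", so range queries and sorting on numeric columns would give surprising results. A custom user data store could presumably apply a different width or date resolution.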
Examples and further information can be found at the dbprism/ojvm project, where the code can also be downloaded.
More details on the integration can be found at Marcelo's blog.
Both Marcelo and I worked on this project and would like to clarify that its purpose is not to replace Oracle Text based on performance characteristics (nothing can beat a low level C implementation :-), but rather to provide a powerful and flexible open source alternative which brings a myriad of possibilities that will for sure advance the state-of-the-art in both search (information retrieval) and relational database fields.