Solr as an Analytics Engine - A Key Theme at this Week's Lucene Revolution
The de facto Lucene and Solr conference is underway today in Boston, Massachusetts. I believe it's the third annual US version of the conference. A European version of the conference kicked off for the first time last year in October and it was called Lucene EuroCon. The conferences have been very popular among the growing number of developers who are being asked to focus on building Lucene/Solr into their companies' software. If you're missing it right now, no need to worry, there ought to be some videos shared from the sessions soon enough.
Some of the best looking sessions include interesting info about:
- Walking through the usage of Wikipedia's data dump for testing via the wikipedia_Solr and wikipedia_elasticsearch repositories
How SolrCloud Changes the User Experience in a Sharded Environment
- An overview of finite-state technology and then describe how it's used today in Lucene: synonym filtering, fuzzy queries, respelling/suggesting, terms dictionary, in-memory postings format (MemoryPostingsFormat) and Japanese analysis (Kuromoji analyzer).
Integrating Lucene into a transactional XML database (xDB)
- Solr for Analytics: integrating text analytics components into Solr, adjustments to the Solr Schema, as well as UI-level changes that support the integration of structured and unstructured data from several sources.
Building Query Auto-Completion Systems with Lucene 4.0
- And tons of case studies of Solr/Lucene being used at innovative software companies
It's amazing all the things you can use Solr and Lucene for. I'm particularly interested in Solr as an Analytics Engine.