Michael Mccandless 05/16/12
692 views
0 replies
Lucene Revolution 2012 is now done, and the talk Robert and I gave went well! We showed how we are using automata (FSAs and FSTs) to make great improvements throughout Lucene.
Carlo Scarioni 05/15/12
1537 views
0 replies
As you know, one of the most important techniques when designing your database schema or your search-enabled systems is selecting and creating indexes to speed up your various queries. What you may not know is why and how indexes help.
Spyros Doulgeridis 05/14/12
1023 views
0 replies
The following example shows how to use Ant to search a target directory (including and excluding files/dirs) for each property key in label.properties. It produces a new file with the unused properties.
Mitch Pronschinske 05/12/12
3205 views
0 replies
Some more cool news has come out of this week's Lucene Revolution conference related to their theme of 'Solr's not just for search, it's for analytics.' In addition to their Solr-as-a-Service cloud search platform, which is basically LucidWorks Enterprise on the cloud, Lucid Imagination has now announced the beta release of LucidWorks Big Data.
Francesco De Vittori 05/11/12
1710 views
0 replies
In this tutorial I will show you how you can write some code that will search a database for an image similar to a given one –with some obvious limitations– in an extremely simple way just by looking at an image’s color distribution.
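As a rough illustration of comparing images by color distribution, here is a minimal sketch, not the tutorial's own code. It assumes images are already available as lists of (r, g, b) tuples; the function names, the 4-bins-per-channel quantization, and the histogram-intersection measure are all choices made for this example.

```python
from collections import Counter

def color_histogram(pixels, bins_per_channel=4):
    """Quantize each (r, g, b) pixel into a coarse bin; return normalized bin frequencies.

    Assumes `pixels` is a non-empty list of 0-255 RGB tuples (hypothetical input format).
    """
    step = 256 // bins_per_channel
    counts = Counter((r // step, g // step, b // step) for r, g, b in pixels)
    total = sum(counts.values())
    return {bin_: n / total for bin_, n in counts.items()}

def histogram_intersection(h1, h2):
    """Similarity in [0, 1]; higher means more similar color distributions."""
    return sum(min(h1.get(k, 0.0), h2.get(k, 0.0)) for k in set(h1) | set(h2))

# Tiny synthetic "images" standing in for rows in a database.
red_image = [(250, 10, 10)] * 90 + [(10, 10, 250)] * 10
reddish_image = [(240, 20, 5)] * 80 + [(10, 10, 250)] * 20
blue_image = [(5, 5, 240)] * 100

query = color_histogram(red_image)
for name, img in [("reddish", reddish_image), ("blue", blue_image)]:
    print(name, round(histogram_intersection(query, color_histogram(img)), 3))
```

The obvious limitation is that a histogram ignores where colors appear, so two very different pictures with similar palettes score as similar.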
Mitch Pronschinske 05/10/12
1955 views
0 replies
Scala users, check this out! If you've been looking for a better interface to Apache Solr in your Scala projects, there's a fairly new, simple Solr client for Scala being developed on GitHub by Naoki Takezoe.
Mitch Pronschinske 05/09/12
2491 views
0 replies
The de facto Lucene and Solr conference is underway today in Boston, Massachusetts. The conferences have been very popular among the growing number of developers who are being asked to focus on building Lucene/Solr into their companies' software.
Chris Hostetter 05/08/12
1130 views
0 replies
It’s not too late to submit your interesting Solr problem to try and stump me, even if you can’t make it to Boston for the Lucene Revolution conference. Check out the prizes the winners get...
Mitch Pronschinske 05/07/12
1500 views
0 replies
Looking for a Solr CMS plugin on par with the Drupal plugin? The FarCry Solr Pro plugin lets you use the power of Lucene (through Solr) to index and search your site. It goes above and beyond ColdFusion's native implementation of Solr.
Tony Russell-rose 05/06/12
2275 views
0 replies
I need to compare two text classifiers – one human, one machine. They are assigning multiple tags from an ontology. We have an initial corpus of ~700 records tagged by both classifiers. The goal is to measure the ‘value added’ by the human. However, we don’t yet have any ground truth data (i.e. agreed annotations).
Mitch Pronschinske 05/04/12
1754 views
0 replies
Continuing where the last screencast left off, this third tutorial shows how to install the Acquia Connector (Acquia is an enterprise Drupal SaaS), and then use that connector to implement an Acquia Search service in a Drupal site.
Mitch Pronschinske 05/03/12
2107 views
0 replies
Faceting is one of Solr's primary features, and in this screencast you'll see how to install the facet API and do some basic faceted searching.
Mitch Pronschinske 05/02/12
1708 views
0 replies
In this video you will see exactly how to install the Solr-Drupal module and start a Solr server. This video deals with installation on Drupal 7.
Michael Mccandless 05/01/12
2505 views
1 reply
In the examples of this article, each node is a position, and each arc is a token. The TokenStream enumerates a directed acyclic graph, one arc at a time. Bit by bit our token streams are producing graphs!
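To make the node/arc picture concrete, here is a small sketch in plain Python rather than Lucene's Java API. The `Token` fields are modeled on Lucene's PositionIncrementAttribute and PositionLengthAttribute; the helper names and the "dns" synonym example are hypothetical and chosen only for illustration.

```python
from typing import NamedTuple

class Token(NamedTuple):
    term: str
    position_increment: int   # how far the start node advances from the previous token
    position_length: int = 1  # how many positions the arc spans

def token_arcs(tokens):
    """Turn a token stream into (from_node, to_node, term) arcs, one arc per token."""
    arcs = []
    position = -1  # the first token's increment of 1 lands on node 0
    for tok in tokens:
        position += tok.position_increment
        arcs.append((position, position + tok.position_length, tok.term))
    return arcs

def paths(arcs, start, end):
    """Enumerate every token sequence leading from node `start` to node `end`.

    Terminates because each arc moves strictly forward (position_length >= 1),
    so the graph is acyclic.
    """
    if start == end:
        return [[]]
    found = []
    for src, dst, term in arcs:
        if src == start:
            for rest in paths(arcs, dst, end):
                found.append([term] + rest)
    return found

# "dns" spans the same three positions as "domain name service".
stream = [
    Token("dns", 1, 3),
    Token("domain", 0, 1),
    Token("name", 1, 1),
    Token("service", 1, 1),
]
arcs = token_arcs(stream)
print(arcs)
print(paths(arcs, 0, 3))
```

Each token contributes exactly one arc, which is the sense in which the stream enumerates the graph one arc at a time.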
Rafał Kuć 04/30/12
2329 views
0 replies
One of the new features that will be introduced in Solr 4.0 is a new SpellChecker implementation that doesn’t require its own index. I decided to take a quick look at it and share my thoughts.
What We Have Today
As of today (Solr 3.6) we can use...
Michael Mccandless 04/29/12
2252 views
0 replies
I'm happy to announce that two Lucene Google Summer of Code projects were accepted for this summer! Here are some of the problems they're going to help us solve.
Chris Hostetter 04/27/12
1695 views
0 replies
In spite of his best efforts to be really lazy, Chris Hostetter worked harder in his first Stump The Chump session than he had ever worked prepping and giving any other presentation in his entire life. It was one of the most stressful, nerve-racking, and mentally and physically taxing hours of his life, and it was also hands down one of the most enjoyable experiences he's ever had at a tech conference.
Mitch Pronschinske 04/25/12
2993 views
0 replies
In this talk, we'll start with a simple search example in Apache Lucene and build up to a vector space model and explain some of the underlying math needed to normalize the weights used to make fair comparisons among documents.
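The role of normalization in a vector space model can be sketched in a few lines. This is an illustrative example only: it uses raw term frequencies as weights and plain cosine similarity, which is an assumption made for brevity and not Lucene's actual scoring formula. All function names are hypothetical.

```python
import math
from collections import Counter

def tf_vector(text):
    """Represent a document as a sparse vector of raw term frequencies."""
    return Counter(text.lower().split())

def l2_normalize(vec):
    """Scale a vector to unit length so document length does not dominate the comparison."""
    norm = math.sqrt(sum(w * w for w in vec.values()))
    return {t: w / norm for t, w in vec.items()} if norm else {}

def cosine(a, b):
    """Cosine similarity: the dot product of the two unit-length vectors."""
    a, b = l2_normalize(a), l2_normalize(b)
    return sum(w * b.get(t, 0.0) for t, w in a.items())

short_doc = tf_vector("lucene search")
long_doc = tf_vector("lucene search " * 10)   # same content, ten times longer
other_doc = tf_vector("solr facets")

print(cosine(short_doc, long_doc))   # close to 1: length alone does not change the score
print(cosine(short_doc, other_doc))  # no shared terms
```

Without the normalization step, the longer document would score higher on a raw dot product simply because its term counts are larger.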
Mark Needham 04/24/12
2183 views
0 replies
I’ve been graphing connections between ThoughtWorks people using neo4j and wanted to build auto complete functionality so I can search for the names of people in the graph. The solution I came up with was to create a Lucene index with an entry for each node and a common property on each document in the index.
Jason Hull 04/23/12
2054 views
0 replies
The web manager and the marketing department should work together on the ranking of their site's search results, both for user satisfaction and for promotion of the products that marketing wants to offer. There are useful tools out there for measuring the profitability of your search results.
Mitch Pronschinske 04/21/12
2431 views
0 replies
The Solr/Lucene development platform, LucidWorks Enterprise, just had a new release with the addition of crawler scheduling, Drools support, dynamic fields permitting schema-free configuration, plus more...
Mark Needham 04/20/12
3470 views
0 replies
Mark Needham created a graph of all the people in ThoughtWorks and the connections between them based on working with each other. He wanted to include auto complete functionality when typing in a name so he needed to figure out how to partially query the index of people’s names. That's where Lucene comes in...
Tony Russell-rose 04/19/12
3010 views
0 replies
There are four primary dimensions that the author commonly uses to characterize search and discovery contexts. In this article he runs through these dimensions with Google Instant as an example feature.
Marek Rogoziński 04/18/12
2433 views
0 replies
We are proud to announce that we have deployed a new version of explain.solr.pl, software for debugging and analyzing Solr queries. This version contains the following changes...
Kelvin Tan 04/17/12
2379 views
0 replies
This post describes a method of augmenting the lucene-spatial contrib package to support multi-point searches. Read about this 6-step algorithm in detail here...