[Building Sakai] Sakai search - A solr implementation

Colin Hebert colin.hebert at oucs.ox.ac.uk
Thu May 24 08:25:31 PDT 2012


Hello,

I've worked the past few weeks on a different implementation of the Sakai
Search API using solr : https://github.com/ColinHebert/Sakai-Solr.
It is still a work in progress, but it's functional; I didn't have much
time to do a proper documentation, but it's pretty easy to set up.

The solr configuration used in unit tests (
https://github.com/ColinHebert/Sakai-Solr/tree/master/impl/src/test/resources/solr)
is the one I currently use when I run solr with sakai and works (so far)
fine.

To enable search with solr, the sakai.properties file has to be modified to
contain this:
solr.server=http://localhost:8983/solr/
search.enable=true

And of course, the project has to be deployed. With all that it should run.


I intend to add some user documentation later, but this early version has
some nice features:

- faceted search, searching for "tool:content myKeyword" will search
"myKeyword" only in documents indexed by the content module;
"title:presentation.pdf" will search every indexed element named
presentation.pdf, etc.
- the possibility to index and search metadata on documents indexed by
"content", (dublin core metadata or any property stored in
ResourceProperties), or simply hide it (from the solr configuration).
- all the goodness <http://wiki.apache.org/solr/ClusteringComponent>
of solr<http://lucene.apache.org/solr/>(it works with 3.5, 3.6 and I'm
working on making it compatible with 1.4)
and tika <http://tika.apache.org/> with solrCell
- an entity provider for entity broker (for some reasons, the current
implementation doesn't seem to work right now)


Some classes aren't meant to stay in this project but should be either in
their own project or could be integrated in already existing projects
(CitationContentProducer, SiteContentProducer,
BinaryContentHostingContentProducer) as they're not specific to my
implementation.


So far, it's more a developer preview than anything, but as I said, it's
working pretty well and only needs a few adjustments.

If you have some time to review the code, test it, give some feedback,
contribute or find some bugs, I'll be glad to hear of it.
Everything is on github, so feel free to fork the project.

---

Colin
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://collab.sakaiproject.org/pipermail/sakai-dev/attachments/20120524/b44bb9de/attachment.html 


More information about the sakai-dev mailing list