Have been playing around with PyLucene
which is a native-compiled Python version of Lucene
, a Java text indexing and searching package.
It's an impressive engineering feat - basically compile Lucene to native objects using GCJ and then use SWIG to produce Python bindings - given how simply it works.
Was quite quick to get up and running, and have started improving search in Pootle using it. The main slowdown was documentation - you have to read the Lucene Javadoc
, and the SWIG-generated Python classes don't have any documentation or info about their own parameters directly.
So I hacked together some code that parses the Java files, strips out the method declarations and javadoc comments, and then tries to insert them as docstrings into the Python files. More info in the 'Simple automated docs'
message I posted to the list. Andi, the original author, was planning to add docs before the 1.0 release, hopefully these will help people until then.
Anyway, now I can carry on using the indexing instead of hacking at it :-)
Share on Facebook