Recoll is a full-text search tool for Unix and Linux desktops.

Recoll finds keywords inside documents as well as file names.

The current Recoll version is 1.19.12 (Release notes, known bugs).

Recoll is based on the very strong Xapian search engine library, for which it provides a powerful text extraction layer and a complete, yet easy to use, Qt graphical interface.

Recoll will index an MS-Word document stored as an attachment to an e-mail message inside a Thunderbird folder archived in a Zip file (and more...). It will also help you search for it with a friendly and powerful interface, and let you open a copy of a PDF at the right page with two clicks. There is little that will remain hidden on your disk.

Recoll has extensive documentation. If you run into a problem, or want to propose improvements, you are welcome to use the mailing list or problem tracker.

Recoll user ? Maybe there are still a few useful search tricks that you don't know about. A quick look at the search tips might prove useful ! Also the Faqs and Howtos on, and some contributed result list formats.


I have separated the code for the Recoll Unity Scope from the main body of code, in hope that it may interest someone to work on it. It's Python and simple, mostly depending on the Unity API. The Ubuntu Unity API is apparently going to change *again* for the next version, and I think I've seen enough of it.
1.19.12 is out. It's mostly identical to 1.19.11 apart from a new parameter to change the max size of stored attributes. No need to update in general.
I hear from time to time about recollindex crashes. These appear to be quite rare, but they do happen, and I think that they are linked to a yet unfound bug in multithread indexing. If you experience such crashes or stalls, you can disable multithreading by adding the following to your recoll.conf:
thrQSizes = -1 -1 -1
While working on a Recoll-Mutt interface I discovered incidentally that the Recoll Webui Web interface works quite well with the links web browser inside a terminal window. This appears to be an interesting solution for people looking for a search interface usable in a non-GUI environment.
A new filter for PowerPoint files. The previous one was based on the ancient catppt from the catdoc utilities and usually extracted nothing from more recent PowerPoint files (this is about .ppt: .pptx is handled by a native Recoll filter).
Sometimes things just work...
Thanks to some of its users, Recoll now has filters to index and retrieve Lotus Notes messages (some implementation notes from an early user), and there is also now a Web browser interface for querying your Recoll indexes.
A problem with a simple workaround has caused several reported recollindex crashes recently (for 1.17). If you store and index Mozilla/Thunderbird email out of the standard location (~/.thunderbird), you should add the following at the end of your configuration file (e.g.: ~/.recoll/recoll.conf):

              mhmboxquirks = tbird
Adjust the path to your local value of course... Without this hint, recollindex has trouble finding the message delimiters inside the folder files, and will possibly use all the computer's memory and crash. Apart from crashes, which only occur for very big folders, this also causes incorrect mail indexing.
A new user-contributed script for those who use real-time indexing on laptops: stop or start indexing according to AC power status. See the details on the Wiki.
We now have a Chinese user manual: Recoll现在有中文手册咯: Recoll中文手册,HTML


Recoll borrows a lot of code from other packages, and welcomes code and ideas from contributors, see some of the Credits.

On the side

We rent a big country house in the Aude area, in the south of France (see map on the site). If you are looking for a wonderful country place with a pool to spend holidays with a big bunch of family and/or friends in a nice historical but very quiet area, this may be it.