Recoll is a full-text search tool for Unix and Linux desktops.
Recoll finds keywords inside documents as well as file names.
- It can search most document formats.
- It can reach any storage place: files, archive members, email attachments, transparently handling decompression.
- One click will open the document inside a native editor or display an even quicker text preview.
- The software is free, open source, and licensed under the GPL.
- Detailed features.
Recoll is based on the very strong Xapian search engine library, for which it provides a powerful text extraction layer and a complete, yet easy to use, Qt graphical interface.
Recoll will index an MS-Word document stored as an attachment to an e-mail message inside a Thunderbird folder archived in a Zip file (and more...). It will also help you search for it with a friendly and powerful interface, and let you open a copy of a PDF at the right page with two clicks. There is little that will remain hidden on your disk. More details …
Recoll user ? Maybe there are still a few useful search tricks that you don't know about. A quick look at the search tips might prove useful ! Also the Faqs and Howtos on bitbucket.org, and some contributed result list formats.
- Release 1.19.11 is out. This brings a Unity Scope for Ubuntu 13.10, python3 compatibility, and a new and much improved filter for PowerPoint (.ppt) files.
- A new filter for PowerPoint files. The previous one was based on the ancient catppt from the catdoc utilities and usually extracted nothing from more recent PowerPoint files (this is about .ppt: .pptx is handled by a native Recoll filter).
- 1.19.9 is released. This fixes a number of significant bugs (query date condition handling, possible GUI crashes...).
- The recoll-webui Recoll browser access project has been updated to be compatible with Recoll 1.19 (1.18 compatibility is maintained), and add new functions permitted by the improvements in the Python module (previewing and downloading in addition to file::// urls).
- Recoll 1.19.4 is out. It fixes a few relatively minor issues in 1.19.3. See the release notes for a description of the changes.
- Sometimes things just work...
- Thanks to some of its users, Recoll now has filters to index and retrieve Lotus Notes messages (some implementation notes from an early user), and there is also now a Web browser interface for querying your Recoll indexes.
- Release 1.18.2 fixes a few really minor bugs in 1.18.1. I will not package it.
- The next version of Recoll indexing will be multithreaded, and much faster on most current processors. The conversion has been an interesting project with multiple stages, and I've taken a few notes about the transformation.
- Recoll 1.18.1 is out. It has optional character case and diacritics sensitivity, direct access to the hit page when opening PDF files, complex search history, and a host of other smaller improvements. See the release notes. Those of you not using evince as a PDF/Postscript/DVI viewer may be surprised to see the recoll GUI try to use this viewer anyway.This is because some of the new functions in 1.18 work better with evince, but you can override the choice by going to the GUI preferences, and removing application/pdf, application/postscript and application/dvi from the Exceptions list.
- Recoll has a brand new Firefox extension for indexing visited Web pages. This is very similar to the old Beagle plugin, but slightly better integrated (no-fuss installation), and with a new lease on life. See Dave King's project page on sourceforge and the page about Recoll (trivial) configuration.
- A problem with a simple workaround has caused
several reported recollindex
crashes recently (for 1.17). If you store and index
Mozilla/Thunderbird email out of the standard location
(~/.thunderbird), you should add the following at the end of
your configuration file (e.g.:
[/path/to/my/mozilla/mail] mhmboxquirks = tbirdAdjust the path to your local value of course... Without this hint, recollindex has trouble finding the message delimiters inside the folder files, and will possibly use all the computer's memory and crash. Apart from crashes, which only occur for very big folders, this also causes incorrect mail indexing.
- An easy way to extend the "Web queue" Recoll web history indexing mechanism to other browsers than Firefox (Elinks in this case).
- A new user-contributed script for those who use real-time indexing on laptops: stop or start indexing according to AC power status. See the details on the Wiki.
- We now have a Chinese user manual: Recoll现在有中文手册咯： Recoll中文手册，HTML
- Recoll gets a Ubuntu Unity Lens. If you are running an Ubuntu release where this makes sense, you can install the recoll-lens package from the Recoll PPA. The Lens uses the Recoll GUI as a proxy to extract and display embedded documents, which native utilities can't reach directly. And of course you still need to run the GUI (or the command line recollindex) to get the indexing going !
Recoll borrows a lot of code from other packages, and welcomes code and ideas from contributors, see some of the Credits.