I use recoll extensively, but not for email. Sorry. My IMAP email server has fantastic search capability (added by Yahoo!), so all searching happens on the server, never the client. It is self-hosted, but more complicated than 95% of Linux people should attempt. Any email that gets copied locally can be wiped completely thanks to IMAP.
As for the size of recoll DBs,
Code:
$ du -shc recoll-index
2.1G recoll-index
2.1G total
So - 2.1G for
Code:
$ inxi -Dx
Drives: HDD Total Size: 32326.3GB (72.2% used)
ID-1: USB /dev/sda size: 8001.6GB temp: 0C
ID-2: USB /dev/sdb size: 8001.6GB temp: 0C
ID-3: /dev/sdc size: 4000.8GB temp: 34C
ID-4: /dev/sdd size: 320.1GB temp: 38C
ID-5: /dev/sde size: 4000.8GB temp: 36C
ID-6: /dev/sdf size: 4000.8GB temp: 37C
ID-7: /dev/sdg size: 4000.8GB temp: 39C
about 32TB.
inxi amazes me with some new summary all the time. Most of those files are media, so only the metadata gets indexed. I don't use the Recoll GUI to search. Rather, I use a little shell script to perform the search and display just the parts I want.
I'm curious. Just installed it on a desktop where I run Thunderbird for email. It uses Qt - yuck. It is doing the indexing for my HOME now. Shouldn't take too long, as there's only a few GB there. I'll add more if I see it.
Update1: It is still indexing. .... this seems like a very long time, considering there's only 6.4G in the index directory I specified.
Update2: It doesn't appear to find any thunderbird email. Haven't figured out why not yet.
Bookmarks