Unknown reporter writes

Recoll does not index Thunderbird Mails. It ends with {{{ recollindex -i /home/user/.thunderbird/profile.default/ImapMail/smtp/EMails.sbd/Partner }}} {{{

:4:../internfile/mh_mbox.cpp:249:MimeHandlerMbox::set_document_file(/home/user/.thunderbird/profile.default/ImapMail/smtp/EMails.sbd/Partner) :4:../internfile/internfile.cpp:284:FileInterner:: init ok text/x-mail [/home/user/.thunderbird/profile.default/ImapMail/smtp/EMails.sbd/Partner] :4:../internfile/internfile.cpp:742:FileInterner::internfile. ipath [] :5:../internfile/mh_mbox.cpp:367:MimeHandlerMbox::next_document: fn /home/user/.thunderbird/profile.default/ImapMail/smtp/EMails.sbd/Partner, msgnum 0 mtarg 0 :2:../internfile/internfile.cpp:737:FileInterner::internfile: next_document error [/home/user/.thunderbird/profile.default/ImapMail/smtp/EMails.sbd/Partner] text/x-mail :4:../internfile/internfile.cpp:835:FileInterner::internfile: conversion ended with no doc :4:../rcldb/rcldb.cpp:880:Db::add: udi [/home/user/.thunderbird/profile.default/ImapMail/smtp/EMails.sbd/Partner|] parent [] :5:../rcldb/rcldb.cpp:1091:Rcl::Db::add: new doc record: url=file:////home/user/.thunderbird/profile.default/ImapMail/smtp/EMails.sbd/Partner mtype=text/x-mail fmtime=01285667928 origcharset= fbytes=644603 sig=6446031285667928+ dbytes=0 caption=Partner }}}

medoc writes

Hello, and thank you for reporting this. I tried to reproduce the problem, and it seems that Thunderbird sometimes creates mbox files with a bad format (no empty line before the From separator). I’m not quite sure that this is your problem though. Would it be possible for you to reproduce it on some non-confidential data, and either forward the folder file to me at jfd@recoll.org or attach it to an update here ?

medoc writes

Hopefully fixed by <<changeset 2049 > >

Unknown User writes

Hello, now i got this message: {{{ #!log :4:../rcldb/rcldb.cpp:541:Db::open: m_isopen 0 m_iswritable 0 :4:../rcldb/rcldb.cpp:573:Db::open: lastdocid: 443 :4:../index/fsindexer.cpp:269:FsIndexer::localfieldsfromconf :4:../common/rclconfig.cpp:359:RclConfig::addLocalFields: keydir [/home/user/.thunderbird/profile.default/ImapMail/smtp/EMails.sbd/] :4:../rcldb/rcldb.cpp:1194:Db::needUpdate:yes: olsig [6446021292252393+] new [6446021292252393] :5:../index/fsindexer.cpp:357:processone: processing: [629 KB ] /home/user/.thunderbird/profile.default/ImapMail/smtp/EMails.sbd/Partner :4:../internfile/internfile.cpp:207:FileInterner:: [/home/user/.thunderbird/profile.default/ImapMail/smtp/EMails.sbd/Partner] mime [(null)] preview 0 :4:../internfile/mh_mbox.cpp:250:MimeHandlerMbox::set_document_file(/home/user/.thunderbird/profile.default/ImapMail/smtp/EMails.sbd/Partner) :4:../internfile/mh_mbox.cpp:276:MimeHandlerMbox: setting quirks TBIRD :4:../internfile/internfile.cpp:284:FileInterner:: init ok text/x-mail [/home/user/.thunderbird/profile.default/ImapMail/smtp/EMails.sbd/Partner] :4:../internfile/internfile.cpp:742:FileInterner::internfile. ipath [] :5:../internfile/mh_mbox.cpp:375:MimeHandlerMbox::next_document: fn /home/user/.thunderbird/profile.default/ImapMail/smtp/EMails.sbd/Partner, msgnum 0 mtarg 0 :2:../internfile/internfile.cpp:737:FileInterner::internfile: next_document error [/home/user/.thunderbird/profile.default/ImapMail/smtp/EMails.sbd/Partner] text/x-mail :4:../internfile/internfile.cpp:835:FileInterner::internfile: conversion ended with no doc :4:../rcldb/rcldb.cpp:881:Db::add: udi [/home/user/.thunderbird/profile.default/ImapMail/smtp/EMails.sbd/Partner|] parent [] :5:../rcldb/rcldb.cpp:1109:Rcl::Db::add: new doc record: url=file:///home/user/.thunderbird/profile.default/ImapMail/smtp/EMails.sbd/Partner mtype=text/x-mail fmtime=01292252393 origcharset= fbytes=644602 sig=6446021292252393+ dbytes=0 caption=Partner }}} I will create a new mbox-file and attached it here.

Unknown User writes

Hmm..it doesn’t work with my mbox files: {{{ #!mbox From Received: from mail.xxx.de ([xxxxxx]) by pdc.xxxx.de with Microsoft SMTPSVC(5.0.2195.6713); Tue, 11 Jan 2005 09:33:59 +0100 Received: from xxx.xxxx.DE (xxx.xxxx.de [xxxxx]) by mail.xxxx.de (8.12.1/8.12.1) with ESMTP id j0B92HW4014221 for <xxx@xxx.de >; Tue, 11 Jan 2005 10:02:17 +0100 (CET) Received: by xxxx.xxxx.de with Internet Mail Service (5.5.2653.19) id <Z3LYSMW5 >; Tue, 11 Jan 2005 09:33:43 +0100 From: =?iso-8859-1?Q?=22H=xxxx=2C_xxxxx=22?= <xxx@xxxx.de > To: xxx xxxx <IMCEAEX-_O=4TEK_OU=FIRST+20ADMINISTRATIVE+20GROUP_CN=RECIPIENTS_CN=xxx@xxxx.de > Date: Tue, 11 Jan 2005 09:33:42 +0100 Subject: AW: Nachricht von xxx xx Mitglieder-Formular Thread-Topic: Nachricht von xxxx Trust Mitglieder-Formular

}}}
but if i change the first line from
"From" to "From - Mon Dec 13 16:10:55 2010", recoll indexes the file.


== medoc writes

Hello,
This is weird, my thunderbird does not seem to create such 'From ' lines. Could you please check in your mboxes if all From lines are like this, or if there appears to be some kind of pattern ? Also, are they usually preceded by a blank line (when not the first message) ?


== Unknown User writes

Hello, i created a new profile and now the 'From ' lines are with Date and recoll is indexing!
strange...

== medoc writes

A quick google search (ie: thunderbird folder +mbox +From +line) will indicate  that thunderbird's handling of its own mbox format is a total mess, which is an incredible shame in a major application for such a *trivial* AND *critical* issue. Properly handling the mboxrd format which is supposedly used (but not) would fix this, and should not be that complicated.

Hopefuly, your problem is gone for good. If it isn't, try to record what makes it come back so that we can at least derive some maybe reusable knowledge...

Cheers,

jf


== Unknown User writes

Hi,

sadly it isn't working anymore. Thunderbird changed the "From-Line" back to the old format. Maybe after a compressing the inbox?

Any Ideas?

== medoc writes

Do you think you would be able to produce a sample folder with non-confidential data and send it to me ?


== Unknown User writes

With the tbird fix it is working again!
Thanks

== medoc writes

Finally? fixed by r2127