Unknown reporter writes
recoll not indexing base64-encoded email attachment (HTML encoded in base64)
file maildir/somedir/mailfile: smtp mail text
email header (cut):
MIME-Version: 1.0
Content-Type: multipart/alternative; boundary="----_=_NextPart_001_01CCA86D.508E767C"
email content:
This is a multi-part message in MIME format.
------_=_NextPart_001_01CCA86D.508E767C
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: base64
Q2lhbyBNYXNzaW1vLCANCg0KcHVvaSBwcm9jZWRlcmUgZSBmYXJjaSBzYXBlcmU/DQoNCkdyYXpp ZSwNCg0KRGVib3JhaA0KDQogDQoNCkRlYm9yYWggRGFuYQ0KDQpkZWJvcmFoLmRhbmFAa2V5LW9u …
------_=_NextPart_001_01CCA86D.508E767C
Content-Type: text/html; charset="UTF-8"
Content-Transfer-Encoding: base64
PGh0bWwgeG1sbnM6dj0idXJuOnNjaGVtYXMtbWljcm9zb2Z0LWNvbTp2bWwiIHhtbG5zOm89InVy bjpzY2hlbWFzLW1pY3Jvc29mdC1jb206b2ZmaWNlOm9mZmljZSIgeG1sbnM6dz0idXJuOnNjaGVt …
medoc writes
Hi,
From the snippet you sent, the HTML is not an attachment; it’s the HTML part of a multipart/alternative body (such as is sent when you check "send both text and html" in, e.g., Thunderbird). When processing this type of message, recoll indexes the text/plain alternative, not the text/html one, because the text content is supposedly the same, and the text/plain part requires less processing. Is this what you are seeing? Or is there no indexing at all?
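To illustrate the behaviour described above, here is a minimal Python sketch using the standard `email` module. It is not recoll's actual code, only an approximation of the idea: walk a multipart/alternative message, decode the base64 parts, and prefer text/plain over text/html. The function names `pick_indexable_part` and `build_sample`, the `BOUNDARY` string, and the sample message text are all hypothetical and chosen for the example.

```python
import base64
from email import message_from_string


def pick_indexable_part(msg):
    """Return (content_type, decoded_text) for the preferred alternative.

    Prefers text/plain and falls back to text/html, mirroring the
    behaviour described in the reply (an assumption, not recoll's code).
    """
    candidates = {}
    for part in msg.walk():
        if part.is_multipart():
            continue  # container parts carry no text of their own
        ctype = part.get_content_type()
        if ctype in ("text/plain", "text/html") and ctype not in candidates:
            # decode=True undoes the Content-Transfer-Encoding (base64 here)
            raw = part.get_payload(decode=True)
            charset = part.get_content_charset() or "utf-8"
            candidates[ctype] = raw.decode(charset, errors="replace")
    for ctype in ("text/plain", "text/html"):
        if ctype in candidates:
            return ctype, candidates[ctype]
    return None, ""


def build_sample():
    """Build a small multipart/alternative message with base64 bodies."""
    plain = base64.b64encode(b"Hello from the plain part").decode("ascii")
    html = base64.b64encode(
        b"<html><body>Hello from the HTML part</body></html>"
    ).decode("ascii")
    return (
        "MIME-Version: 1.0\n"
        'Content-Type: multipart/alternative; boundary="BOUNDARY"\n'
        "\n"
        "This is a multi-part message in MIME format.\n"
        "--BOUNDARY\n"
        'Content-Type: text/plain; charset="UTF-8"\n'
        "Content-Transfer-Encoding: base64\n"
        "\n"
        f"{plain}\n"
        "--BOUNDARY\n"
        'Content-Type: text/html; charset="UTF-8"\n'
        "Content-Transfer-Encoding: base64\n"
        "\n"
        f"{html}\n"
        "--BOUNDARY--\n"
    )


if __name__ == "__main__":
    ctype, text = pick_indexable_part(message_from_string(build_sample()))
    print(ctype)
    print(text)
```

With a selection rule like this, the text/html alternative is never read when a text/plain one is present, so words that appear only in the HTML part would not be found by a search.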
If the issue is something other than what is described above, a sample file would be very useful. You can send it directly to me (jfd at recoll org).
Cheers,
jf
medoc writes
Closing this, as no actual problem was described.