rdzidlic writes
I have some files like "xxx - yyy zzz.pdf.xml" in .kde/share/apps/okular/docdata/
recollindex will fail with messages like unable to parse xxx unable to load xxx …
Should not be indexed at all but that is another point.
medoc writes
I can’t reproduce this. As far as I can see the rclxml filter handles filenames with spaces just fine. Could you please verify that parseable xml files with spaces in the name are handled correctly ? They are for me.
The "should not be indexed at all" part is a question of local configuration, obviously.
rdzidlic writes
testcase, test with
$ recollindex -i ~/.kde/share/apps/okular/docdata/666661.Marc\ Stevens\ -\ On\ Collisions\ for\ MD5.pdf.xml
medoc writes
Thanks, I had forgotten that okular notes were processed by their own filter, not rclxml. The rclokulnote filter did have a filename whitespace bug. Just pushed a fixed version.
rdzidlic writes
Has this been pushed before 1.20.6 ??
I still see this:
:4:../internfile/internfile.cpp:161:FileInterner::init fn [/home/rz/.kde/share/apps/okular/docdata/599837.J. Biol. Chem.-1987-Enyeart-3154-9.pdf.xml] mime [(null)] preview 0
:4:../internfile/mimehandler.cpp:249:getMimeHandler: mtype [application/x-okular-notes] filtertypes 1
:4:../internfile/mimehandler.cpp:64:getMimeHandlerFromCache: 25cba80955cbb4531d56e14b65e1d026 cache size 16
:4:../internfile/mimehandler.cpp:77:getMimeHandlerFromCache: 25cba80955cbb4531d56e14b65e1d026 found size 15
:4:../internfile/internfile.cpp:248:FileInterner:: init ok application/x-okular-notes [/home/rz/.kde/share/apps/okular/docdata/599837.J. Biol. Chem.-1987-Enyeart-3154-9.pdf.xml]
:4:../internfile/internfile.cpp:737:FileInterner::internfile. ipath []
:4:../utils/execmd.cpp:330:ExecCmd::startExec: (0|1) /usr/share/recoll/filters/rclokulnote {/home/rz/.kde/share/apps/okular/docdata/599837.J. Biol. Chem.-1987-Enyeart-3154-9.pdf.xml}
:4:../utils/execmd.cpp:683:ExecCmd::wait: got status 0x600
:2:../internfile/mh_exec.cpp:117:MimeHandlerExec: command status 0x600 for /usr/share/recoll/filters/rclokulnote
:2:../internfile/internfile.cpp:732:FileInterner::internfile: next_document error [/home/rz/.kde/share/apps/okular/docdata/599837.J. Biol. Chem.-1987-Enyeart-3154-9.pdf.xml] application/x-okular-notes
:4:../internfile/mimehandler.cpp:98:returnMimeHandler: returning filter for application/x-okular-notes cache size 15
:4:../internfile/internfile.cpp:852:FileInterner::internfile: conversion ended with no doc
:4:../rcldb/rcldb.cpp:1249:Db::add: udi [/home/rz/.kde/share/apps/okular/docdata/599837.J. Biol. Chem.-1987-Enyeart-3154-9.pdf.xml|] parent []
:3:../rcldb/rcldb.cpp:604:Db::add: docid 286291 updated [/home/rz/.kde/share/apps/okular/docdata/599837.J. Biol. Chem.-1987-Enyeart-3154-9.pdf.xml|]
and in term:
warning: failed to load external entity "/home/rz/.kde/share/apps/okular/docdata/599837.J."
unable to parse /home/rz/.kde/share/apps/okular/docdata/599837.J.
warning: failed to load external entity "Biol."
unable to parse Biol.
warning: failed to load external entity "Chem.-1987-Enyeart-3154-9.pdf.xml"
unable to parse Chem.-1987-Enyeart-3154-9.pdf.xml
/home/rz/.kde/share/apps/okular/docdata/1011964.1021_159486_1.pdf.xml:1: parser error : Document is empty
^
/home/rz/.kde/share/apps/okular/docdata/1011964.1021_159486_1.pdf.xml:1: parser error : Start tag expected, '<' not found
medoc writes
Reopened to remember checking new data
medoc writes
No, sorry, this is not in 1.20.6, I forgot to merge it from the trunk. It will be in future 1.20 (and 1.21) releases). You can just replace rclokulnotes (and rcldoc) with copies from the source repository.