DescriptionWhen crawling a directory, the FS adaptor
will return the DocIds of hidden files and
folders within that directory. A subsequent
crawl request for a hidden file will return
a 404. A crawl request for a hidden directory
will also return a 404 - importantly its
contents are not fed.
The FS adaptor monitor works differently.
It pays no attention to hidden files or
folders and simply feeds all changed items.
This results in modified files contained
within a hidden directory being fed. My
first inclination was to have the Monitor
not feed any hidden items, but that is not
consistent with the existing lister/retriever
model.
This change has the monitor not feed any
docIds for any items with a hidden ancestor.
The item itself might be hidden and still be
fed. This is consistent with the existing
getDocContent() behavior.
Patch Set 1 #
Total comments: 2
Patch Set 2 : Miguel's Feedback. #MessagesTotal messages: 13
|