Corrupted Search Index

Added by Jan Haderka, last edited by Jan Haderka on Jan 25, 2008

Symptoms

The search index becomes corrupted when a broken PDF file is uploaded to the DMS and indexed.

Solution

Remove org.apache.jackrabbit.extractor.PdfTextExtractor from the textFilterClasses list in your workspace.xml as well as in your Jackrabbit configuration file.
A side effect of this fix is that PDF files will no longer be indexed.
Another option is to wrap this extractor in a class of your own that checks every PDF file before passing it on for indexing.
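
The wrapping approach could look roughly like the sketch below. It is an illustration under stated assumptions, not a tested Magnolia or Jackrabbit component: the class name GuardedPdfExtractor and the nested Extractor interface are hypothetical stand-ins, and the check itself (a "%PDF-" header plus a "%%EOF" marker near the end of the file) is only a heuristic that will not catch every broken PDF. A real wrapper would need to implement the text extractor interface expected by your Jackrabbit version and delegate to PdfTextExtractor.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.Reader;
import java.io.StringReader;
import java.nio.charset.StandardCharsets;

class GuardedPdfExtractor {

    /** Hypothetical stand-in for the extractor contract; adapt it to the real interface. */
    interface Extractor {
        Reader extractText(InputStream stream, String type, String encoding) throws IOException;
    }

    private final Extractor delegate;

    GuardedPdfExtractor(Extractor delegate) {
        this.delegate = delegate;
    }

    /** Heuristic: the data starts with "%PDF-" and has "%%EOF" in its last 1024 bytes. */
    static boolean looksLikePdf(byte[] data) {
        byte[] header = "%PDF-".getBytes(StandardCharsets.US_ASCII);
        if (data.length < header.length) {
            return false;
        }
        for (int i = 0; i < header.length; i++) {
            if (data[i] != header[i]) {
                return false;
            }
        }
        int tailStart = Math.max(0, data.length - 1024);
        String tail = new String(data, tailStart, data.length - tailStart,
                StandardCharsets.ISO_8859_1);
        return tail.contains("%%EOF");
    }

    /** Buffers the stream, checks it, and only then hands it to the wrapped extractor. */
    Reader extractText(InputStream stream, String type, String encoding) throws IOException {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        byte[] chunk = new byte[8192];
        int read;
        while ((read = stream.read(chunk)) != -1) {
            buffer.write(chunk, 0, read);
        }
        byte[] data = buffer.toByteArray();
        if (!looksLikePdf(data)) {
            // Skip suspicious files: return no text so nothing is indexed for them.
            return new StringReader("");
        }
        return delegate.extractText(new ByteArrayInputStream(data), type, encoding);
    }
}
```

With a wrapper along these lines, its class name would replace org.apache.jackrabbit.extractor.PdfTextExtractor in the textFilterClasses list, so that well-formed PDF files are still indexed. Note that the sketch buffers the whole file in memory, which may be unsuitable for very large documents.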
