#1 StepEnable the JiFile search plugin
#2 StepConfigure the path of the file (PDF for example) to indexing
: path to the folder where the documents are present. This can not be outside the root of Joomla! Must be crated manually before proceeding.
(You can edit more times this field if you have more folders in the system with documents to be indexed.)Index path
: path to the folder where the system will build the index structure. IMPORTANT
: The folder must be absolutely already present in the system and therefore should not be created. The system will automatically create the folder and some files useful for the indexing process.Ignore file
: List of file extensions separated by a comma (,). Allows you to not display files with the extensions specified in the Filesystem section.Encoding SO
: This field defines the type of encoding of your operating system. This must be configured only if some file names that have special characters are displayed incorrectly in the Filesystem section.
#3 StepConfiguration of JiFile
Incorrect configuration could give you problems in the indexing process is therefore recommended to carefully read through this part of guide.
We will only describe the fundamental fields for quick use
: Defines the type of "charset encoding" in which the document was written. If not valorized the system attempts to retrieve the type of encoding independently of each field. An incorrect configuration could result in incomplete indexing or return the error message "Empty body".Error Message “Empty body”
: To understand how to properly configure this field, it is important to remember that each document may have been written with a different encoding.Let's take an example
:A document "PDF", written in Italian, was written and saved in ISO-8859-1 (Windows 1252 - Latin1). Now the system to index the PDF must read the content.By default, the XPDF uses ISO-8859-1 (Latin-1 or Windows 1252) as encoding of reading, and therefore buffers the PDF text in ISO-8859-1.Since the (field Analyzer) is set in "UTF8_CaseInsensitive" (remember that this field should never be changed after you have defined and saved), the indexing system must convert the content buffered by XPDF (in ISO-8859-1) in UTF8 (final result of the index).To have a proper indexing of the content, without receiving the annoying message "Empty Body", you must define the encoding of the document in the "Encoding" in the configuration of JiFile in ISO-8859-1.Server Bit
: Defines the type of server it is installed on your Joomla. This configuration can be used for indexing of PDF documents as the system must be able to use the XPDF compiled for the OS 32 or 64 bit.XPDF
: you can define the path of the XPDF executable if those in JiFile were not compatible with your server (you can ask your hosting what is the path of pdftotext and pdfinfo).Analyzer
: The type of analyzer is used for the management of documents both during indexing that in the research phase. If you configure a type of analyzer to index a document and later change it, during the search the results may be different from those expected. In fact an analyzer should always be the same throughout the life of the index.
#4 StepIndex your PDF files, DOC, XLS, etc.
#Delete files indexed