Welcome,
Guest
|
|
Text of body not indexing. Check the type of encoding System Info: Latest stable Debian release JiFile Versions: pkg_com_jifile-1.0.9-j25 Example File: (unzip to test) This attachment is hidden for guests. Please log in or register to see it. Extra Info: - JiFile works on my local (Windows) development environment - After extension installation had to manually change permissions for libraries/ifile/adapter/helpers/binaries/linux/antiword from 644 to 755 libraries/ifile/adapter/helpers/binaries/linux/pdftotext from 644 to 755 |
Last Edit: 03 Sep 2013 12:13 by kutzooi.
The administrator has disabled public write access.
|
|
We verify your problem and let you know.
|
If you like, if it was useful, consider a donation, Thanks
Se vuoi, se ti siamo stati utili, considera una donazione, Grazie Help us by voting our extensions on Joomla.org: JiFile JoomPhoto Mobile Easy Language
The administrator has disabled public write access.
|
|
Any news?
|
The administrator has disabled public write access.
|
|
Hi,
today we send you a responce. |
If you like, if it was useful, consider a donation, Thanks
Se vuoi, se ti siamo stati utili, considera una donazione, Grazie Help us by voting our extensions on Joomla.org: JiFile JoomPhoto Mobile Easy Language
The administrator has disabled public write access.
|
|
Hi,
your document is correctly indexed (see images). The problem is the configuration of JiFile. You have the field : JiFile Configuration > Encoding: equal at blank Remember that each document could to be written with different encoding. When you indexed one document (or N° documents) you must define encoding type to read this document, and the type of encoding for convert this in "Analyzer Encoding". When you use "Blank" the sistem seeks to recognize the encoding, but this way isn't a very best. I'll explain: Your "DOC" document is saved in ISO-8859-1 (or Windows 1252). Now the "PHP" interface (you used PHP interface for read document, see JiFile Configuration > Microsoft Word To Text > Type) , reads the content of the DOC file, only in ISO-8859-1 (or Windows 1252 or Latin-1 - for any another type you must used Antiword), and buffer this in ISO-8859-1 encoding. Because the Analyzer is set in "UTF8_CaseInsensitive" (remember that this field should never be changed), JiFile must convert the buffered content (ISO-8859-1) in UTF8 (Analyzer). To do this, you must define encoding of the content document in field "Encoding" in JiFile configuration. If your "DOC" document is in another type of encoding, you must configure the: JiFile Configuration > Microsoft Word To Text > Type > ANTIWORD and define resourse, for read and buffer the content in this encoding. Let us know. Thanks This image is hidden for guests. Please log in or register to see it. This image is hidden for guests. Please log in or register to see it. This image is hidden for guests. Please log in or register to see it. This image is hidden for guests. Please log in or register to see it. |
If you like, if it was useful, consider a donation, Thanks
Se vuoi, se ti siamo stati utili, considera una donazione, Grazie Help us by voting our extensions on Joomla.org: JiFile JoomPhoto Mobile Easy Language
The administrator has disabled public write access.
The following user(s) said Thank You: kutzooi
|
If you not change browser settings, you agree to it. Learn more