Hi,
We're using Joomla 2.5 and the latest version of JiFile. We have found a major bug when searching with the "exact phrase" option on text that contains accents.

Here is an example. The tests were made with the following words:

    la lumière notamment de mesures transitoires

These words are found in the document test_document.pdf.

Results of the full-text search, by option:
"all words": 44 documents
"any words": 17 documents
"exact phrase": 0 results

When we open the documents in the result lists, the phrase <la lumière notamment de mesures transitoires> is not found as a whole. The separate words are there (lumière, mesures, transitoires), but nowhere does the entire sentence appear.

When we search again with the words <notamment de mesures transitoires>, that is, after deleting the words <la lumière> from the search text, "exact phrase" returns 1 result: the file test_document.pdf, which contains the exact phrase <notamment de mesures transitoires>.

So accents are not supported in searches with "exact phrase". Can you suggest a workaround, or is this something that could be looked at for a future version of JiFile?

Thank you.

JiFile config:

    <ifile>
      <root-application>../</root-application>
      <duplicate>0</duplicate>
      <encoding>UTF-8</encoding>
      <doctotxt type="PHP" encoding=""/>
      <zend-document>
        <fields>
          <field name="name" type="Keyword"/>
          <field name="extensionfile" type="UnIndexed"/>
          <field name="path" type="UnIndexed"/>
          <field name="filename" type="UnIndexed"/>
          <field name="introtext" type="UnIndexed"/>
          <field name="body" type="UnStored"/>
          <field name="title" type="Text"/>
          <field name="subject" type="Text"/>
          <field name="description" type="Text"/>
          <field name="creator" type="Text"/>
          <field name="keywords" type="Keyword"/>
          <field name="created" type="UnStored"/>
          <field name="modified" type="UnStored"/>
        </fields>
      </zend-document>
      <analyzer>
        <type>
          <default>Utf8_CaseInsensitive</default>
        </type>
      </analyzer>
    </ifile>
Last Edit: 22 Jul 2013 12:49 by fere.
The administrator has disabled public write access.
Hi,
could you attach test_document.pdf here, or send us the file by email (the address is hidden on this page)? Thanks
If you like, if it was useful, consider a donation, Thanks
Help us by voting for our extensions on Joomla.org: JiFile, JoomPhoto Mobile, Easy Language
Here is the file (attached). Thanks
Last Edit: 22 Jul 2013 14:19 by fere.
Perfect, we will test your document and let you know.
Hi,
the problem is in the IFile library. In the file libraries\ifile\IFile_Indexing_Lucene.php, at line 331, the library tokenizes the search phrase before looking it up in the index, but it does not know which encoding your site uses to send the string from the input field (the encoding Joomla! uses when it renders the HTML and submits the form).

For example, if your Joomla! uses UTF-8, you can change line 331 to:

    $tokens = Zend_Search_Lucene_Analysis_Analyzer::getDefault()->tokenize($term->getTerm(), 'UTF-8');

and the library works correctly.

Thanks for your use case. We will now think about how to solve this problem automatically, without having to change the file manually.
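To illustrate why the encoding argument matters, here is a minimal, self-contained sketch in plain PHP (7.0 or later, with the iconv extension). It is not the IFile or Zend code: `tokenizeAs` is a hypothetical helper, and the choice of ISO-8859-1 as the wrongly assumed encoding is only an assumption for the demonstration. It shows how a UTF-8 query that is read as a single-byte charset breaks an accented word into different tokens, which would explain why the exact phrase no longer matches the index while the accent-free phrase still does.

```php
<?php
// Hypothetical helper (not part of IFile or Zend): converts the query from
// the encoding we ASSUME it has to UTF-8, then splits it into letter tokens.
function tokenizeAs(string $query, string $assumedEncoding): array
{
    // Convert only when the assumed encoding is not already UTF-8.
    if (strcasecmp($assumedEncoding, 'UTF-8') !== 0) {
        $query = iconv($assumedEncoding, 'UTF-8', $query);
    }
    // Collect runs of Unicode letters; the /u flag makes PCRE read UTF-8.
    preg_match_all('/\p{L}+/u', $query, $matches);
    return $matches[0];
}

// The search phrase from this thread, as UTF-8 bytes (\u{00E8} is "è").
$phrase = "la lumi\u{00E8}re notamment de mesures transitoires";

// Correct assumption: the accented word stays a single token.
$right = tokenizeAs($phrase, 'UTF-8');

// Wrong assumption: the two bytes of "è" are read as two separate
// characters, and the second one is not a letter, so the word is split.
$wrong = tokenizeAs($phrase, 'ISO-8859-1');

echo implode(' | ', $right), PHP_EOL;
echo implode(' | ', $wrong), PHP_EOL;
```

With the correct encoding the phrase yields six tokens. With the wrong one, "lumière" is split in two, so the token sequence differs from what was indexed. A phrase without accents, such as <notamment de mesures transitoires>, tokenizes identically in both cases.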
Perfect. Thanks very much, it works fine.