TOPIC: Accent in full text search with exact phrase

Accent in full text search with exact phrase 22 Jul 2013 12:47 #1302

  • fere
Hi,

We're using Joomla 2.5 and the latest version of JiFile.
We have found a major bug when searching with the "exact phrase" option for text that contains accented characters.

Here is an example. We tested with the following words:

la lumière notamment de mesures transitoires

These words appear in the document test_document.pdf.

When we run the full-text search with each option, we get:
"all words": 44 documents.
"any words": 17 documents.
When we open a document from the results list, the phrase <la lumière notamment de mesures transitoires> is not found as a whole. The separate words are there (lumière, mesures, transitoires), but the complete sentence does not appear anywhere.
"exact phrase": 0 results. But when we remove <la lumière> and search again for <notamment de mesures transitoires>, it returns 1 result: the file test_document.pdf, which contains the exact phrase <notamment de mesures transitoires>.

So accented characters are not supported in "exact phrase" searches.

Can you suggest a workaround for this, or is this something that could be looked at for a future version of JiFile?

Thank you

JiFile configuration:
<ifile>
  <root-application>../</root-application>
  <duplicate>0</duplicate>
  <encoding>UTF-8</encoding>
  <doctotxt type="PHP" encoding=""/>
  <zend-document>
    <fields>
      <field name="name" type="Keyword"/>
      <field name="extensionfile" type="UnIndexed"/>
      <field name="path" type="UnIndexed"/>
      <field name="filename" type="UnIndexed"/>
      <field name="introtext" type="UnIndexed"/>
      <field name="body" type="UnStored"/>
      <field name="title" type="Text"/>
      <field name="subject" type="Text"/>
      <field name="description" type="Text"/>
      <field name="creator" type="Text"/>
      <field name="keywords" type="Keyword"/>
      <field name="created" type="UnStored"/>
      <field name="modified" type="UnStored"/>
    </fields>
  </zend-document>
  <analyzer>
    <type>
      <default>Utf8_CaseInsensitive</default>
    </type>
  </analyzer>
</ifile>
Last Edit: 22 Jul 2013 12:49 by fere.
The administrator has disabled public write access.

Accent in full text search with exact phrase 22 Jul 2013 12:54 #1303

  • Giampaolo
  • Administrator
Hi,
could you attach test_document.pdf here, or send the file to us by email?

Thanks
If you like, if it was useful, consider a donation, Thanks
Help us by voting our extensions on Joomla.org:
JiFile
JoomPhoto Mobile
Easy Language

Accent in full text search with exact phrase 22 Jul 2013 14:18 #1304

  • fere
Here is the file.

Thanks
Last Edit: 22 Jul 2013 14:19 by fere.

Accent in full text search with exact phrase 22 Jul 2013 15:11 #1305

  • Giampaolo
Perfect, we will test your document and let you know.

Accent in full text search with exact phrase 22 Jul 2013 17:56 #1310

  • Giampaolo
Hi,
the problem is in the IFile library.
In the file:

libraries\ifile\IFile_Indexing_Lucene.php

at line 331,

the library tokenizes the search phrase before looking it up in the index, but it does not know which encoding your site uses when it submits the search string from the form input (that is, the encoding Joomla! uses when it renders the HTML and sends the input value).

For example, if your Joomla! site uses UTF-8, you can change line 331 to:

$tokens = Zend_Search_Lucene_Analysis_Analyzer::getDefault()->tokenize($term->getTerm(), 'UTF-8');

and the library will work correctly.
Thanks for the use case. We are now thinking about how to solve this problem automatically, without having to edit the file by hand.
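
To see why passing the encoding matters, here is a minimal Python sketch of the effect. It is not IFile or Zend code: the `tokenize` helper is hypothetical, and Latin-1 is only a stand-in for whatever wrong encoding a tokenizer might assume when none is given. The bytes of a UTF-8 query are decoded once correctly and once incorrectly, then split into lowercase words.

```python
import re


def tokenize(raw: bytes, encoding: str) -> list[str]:
    """Decode raw bytes with the given encoding, lowercase, split into letter runs.

    Hypothetical helper for illustration only; it just mimics the idea of a
    case-insensitive word tokenizer.
    """
    text = raw.decode(encoding)
    return re.findall(r"[^\W\d_]+", text.lower())


phrase = "la lumière notamment de mesures transitoires"
raw = phrase.encode("utf-8")  # what a UTF-8 page submits from the search box

indexed = tokenize(raw, "utf-8")    # tokens as they would sit in a UTF-8 index
misread = tokenize(raw, "latin-1")  # same bytes read with a wrong encoding

print(indexed)
print(misread)
```

In this sketch the accented word comes out as different tokens when the bytes are misread, so a phrase containing it can no longer line up with the indexed tokens, while the unaccented tail of the phrase is identical either way. That matches the report above, where the search without <la lumière> still returned the document.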

Accent in full text search with exact phrase 23 Jul 2013 12:13 #1313

  • fere
Perfect, thanks very much, it works fine :)