Hi,
Thank you for your great iFile framework. Today I ran into a problem with XPDF and PDF indexing. I developed a small app with your framework on a local server, and everything works there. On the production server, however, no PDF files are indexed, and I get this error: Error in document: (myfile.pdf) - Empty body. I checked both servers with your ServerCheck and everything is reported as OK. I tried changing all the settings in the config XML, without success. Could you help me? Thanks in advance.
Last edited: 01/10/2015 16:24 by tesson.
The administrator has disabled public write access.
Hi,
are the local and production servers identical? The problem could be one of two types:
1. Operating system libraries: IFile uses XPDF to extract the content of documents, and XPDF depends on many operating system libraries. You can run the XPDF binaries shipped with IFile from the command line and verify that they work correctly. This link describes the command line: linux.about.com/library/cmd/blcmdl1_xpdf.htm
2. Server encoding: XPDF uses the operating system encoding, so you need to verify that the encoding is the same on both servers.
If you need more information or help, you can open a ticket from this link: www.isapp.it/en/support-ticket.html
Thanks
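Both checks can be sketched from a shell on each server. This is only a minimal sketch, assuming a POSIX shell: the helper names `binary_status` and `extract_check` and the paths in the usage comments are placeholders, not part of IFile. It relies on `pdftotext FILE -` writing the extracted text to standard output.

```shell
#!/bin/sh
# Hypothetical helper: classify a binary path.
# Prints one of: missing, not-executable, ok
binary_status() {
    if [ ! -e "$1" ]; then
        echo "missing"
    elif [ ! -x "$1" ]; then
        echo "not-executable"
    else
        echo "ok"
    fi
}

# Hypothetical helper: run a pdftotext binary on a PDF and report
# whether any non-whitespace text came out.
# $1 = path to pdftotext, $2 = PDF file
extract_check() {
    text=$("$1" "$2" - 2>/dev/null) || { echo "pdftotext failed"; return 1; }
    if [ -z "$(printf '%s' "$text" | tr -d '[:space:]')" ]; then
        echo "empty body"
        return 1
    fi
    echo "text extracted"
}

# Example use (all paths are placeholders):
#   binary_status /path/to/ifile/binaries/pdftotext
#   extract_check /path/to/ifile/binaries/pdftotext sample.pdf
#   ldd /path/to/ifile/binaries/pdftotext   # "not found" marks a missing library
#   locale                                  # compare the output on both servers
```

Running the same commands on the local and the production server and comparing the results should show whether the difference is in the libraries or in the encoding.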
If you like, if it was useful, consider a donation, Thanks
Help us by voting for our extensions on Joomla.org: JiFile JoomPhoto Mobile Easy Language
The following users said thanks: tesson
Thanks a lot for your quick answer.
For your information, my servers are not identical (Ubuntu 12.04 locally, Debian 3 in production), but I use the XPDF files that come with your framework. I have not set any XPDFRC file for your XPDF binaries (pdfinfo, pdftotext). Could that be the issue? The content of my IFileConfig.xml file is below (I only replaced the real path with PATH-TO-ROOT):
<?xml version="1.0" encoding="UTF-8"?>
<!--
/**
* Read the readme.txt file for a description of the tags in the IFileConfig.xml file
*/
-->
<ifile>
    <root-application>PATH-TO-ROOT/html/components/com_joboffer/libraries/ifile</root-application>
    <table-name collation="utf8_general_ci">indexjifile</table-name>
    <!--timelimit>180</timelimit-->
    <!--memorylimit></memorylimit-->
    <resultlimit>100</resultlimit>
    <!--default-search-field>body</default-search-field-->
    <!--duplicate>1</duplicate-->
    <server bit="64" />
    <!-- encoding></encoding-->
    <xpdf>
        <pdfinfo>
            <executable>PATH-TO-ROOT/html/components/com_joboffer/libraries/ifile/adapter/helpers/binaries/linux/pdfinfo</executable>
            <xpdfrc></xpdfrc>
        </pdfinfo>
        <pdftotext>
            <executable>PATH-TO-ROOT/html/components/com_joboffer/libraries/ifile/adapter/helpers/binaries/linux/pdftotext</executable>
            <xpdfrc></xpdfrc>
        </pdftotext>
        <opw></opw>
    </xpdf>
    <doctotxt encoding="8859-1" type="PHP" />
    <!-- doctotxt encoding="8859-1" type="ANTIWORD" /-->
    <!--doctotxt encoding="" type="COM" /-->
    <zend-document>
        <fields>
            <field name="name" type="Text" />
            <field name="path" type="Text" />
            <field name="filename" type="Text" />
        </fields>
    </zend-document>
    <!--
    <zend-document>
        <fields>
            <field name="body" type="UnStored" encoding="UTF-8"/>
            <field name="introtext" type="Keyword" encoding="UTF-8" />
            <field name="title" type="UnStored" encoding="UTF-8" />
            <field name="subject" type="UnStored" encoding="UTF-8" />
            <field name="description" type="UnStored" encoding="UTF-8" />
            <field name="creator" type="UnStored" encoding="UTF-8" />
            <field name="keywords" type="UnStored" encoding="UTF-8" />
            <field name="created" type="UnStored" encoding="UTF-8" />
            <field name="modified" type="UnStored" encoding="UTF-8" />
        </fields>
    </zend-document>
    -->
    <analyzer>
        <type>
            <!--default>Utf8_CaseInsensitive</default-->
            <default>Utf8Num_CaseInsensitive</default>
            <!--custom-default class="StandardAnalyzer_Analyzer_Standard_English">C:\xampp\htdocs\IFile\analyzer\StandardAnalyzer\Analyzer\Standard\English.php</custom-default-->
        </type>
        <filters>
            <!--stop-words>C:\xampp\htdocs\IFile\stopwords\stopwords.txt</stop-words-->
            <!--short-words>2</short-words-->
            <!--custom-filters>
                <filter class="StandardAnalyzer_Analysis_TokenFilter_EnglishPECLStemmer">C:\xampp\htdocs\IFile\sf_IFile-Allura\trunk\tokenfilter\stemming\english\EnglishPECLStemmer.php</filter>
                <filter class="StandardAnalyzer_Analysis_TokenFilter_EnglishPorterStemmer">C:\xampp\htdocs\IFile\sf_IFile-Allura\trunk\tokenfilter\stemming\english\EnglishPorterStemmer.php</filter>
                <filter class="StandardAnalyzer_Analysis_TokenFilter_ItalianPECLStemmer">C:\xampp\htdocs\IFile\sf_IFile-Allura\trunk\tokenfilter\stemming\italian\ItalianPECLStemmer.php</filter>
            </custom-filters-->
        </filters>
    </analyzer>
</ifile>
Have a nice day
Last edited: 05/10/2015 17:30 by tesson.
Hi,
I see that you have configured IFile to work with a 32-bit system. If your operating system (Debian 3) is 64-bit, you must configure IFile to use the 64-bit XPDF for Linux. Define [server bit="64"] and delete the XPDF configuration (IFile automatically picks the correct XPDF). Here is an example:
<?xml version="1.0" encoding="UTF-8"?>
<ifile>
    <root-application>PATH-TO-ROOT/html/components/com_joboffer/libraries/ifile</root-application>
    <table-name collation="utf8_general_ci">indexjifile</table-name>
    <resultlimit>100</resultlimit>
    <server bit="64" />
    <doctotxt encoding="8859-1" type="PHP" />
    <zend-document>
        <fields>
            <field name="name" type="Text" />
            <field name="path" type="Text" />
            <field name="filename" type="Text" />
        </fields>
    </zend-document>
    <analyzer>
        <type>
            <default>Utf8Num_CaseInsensitive</default>
        </type>
        <filters>
        </filters>
    </analyzer>
</ifile>
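One way to confirm which value belongs in the `bit` attribute is to ask the production machine itself. The following is a small sketch, assuming a Linux shell; `arch_bits` is a hypothetical helper, and its list of machine names is illustrative rather than exhaustive.

```shell
#!/bin/sh
# Hypothetical helper: map a machine name (as printed by `uname -m`)
# to a word size. Prints 64, 32, or unknown.
arch_bits() {
    case "$1" in
        x86_64|amd64|aarch64) echo 64 ;;
        i?86|armv6l|armv7l)   echo 32 ;;
        *)                    echo unknown ;;
    esac
}

# Example use on the server:
#   arch_bits "$(uname -m)"
#   getconf LONG_BIT      # prints the word size directly on most Linux systems
```

If the result is 64, the `<server bit="64" />` line in the example above matches the machine.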
Last edited: 06/10/2015 09:26 by Giampaolo.
Hi,
I tried your settings for the config file and compared them with mine; they are slightly different from my original settings. I had to change the permissions on the bin64 folder and the files inside it. That part is now OK... but I found my previous problem again: no PDF is indexed, and every file gives the "Empty body" error.
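For anyone hitting the same step, here is a sketch of that permission change. It assumes mode 755 is acceptable on your server (the thread does not say which mode was used) and that the binaries sit directly inside the folder. `fix_xpdf_perms` is a hypothetical helper; pass it the real path of the bin64 folder.

```shell
#!/bin/sh
# Hypothetical helper: make a folder and the regular files directly
# inside it readable and executable for every user (mode 755, assumed).
# $1 = folder that holds the XPDF binaries
fix_xpdf_perms() {
    dir=$1
    [ -d "$dir" ] || { echo "no such directory: $dir" >&2; return 1; }
    chmod 755 "$dir" || return 1
    for f in "$dir"/*; do
        # Skip sub-folders and the unexpanded pattern of an empty folder.
        [ -f "$f" ] && chmod 755 "$f"
    done
    return 0
}

# Example use (placeholder path):
#   fix_xpdf_perms /path/to/ifile/bin64
#   ls -l /path/to/ifile/bin64    # check the result
```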
Last edited: 06/10/2015 09:43 by tesson.
Hi,
I decided to install the XPDF library directly on my production server. I can test it now, and everything works when I run this from the command line: sudo pdfinfo MYPDF.pdf
Now I have to understand why it does not work with Apache...
Hi,
I think the problem is the permission to invoke XPDF from PHP. I see that you call XPDF with "sudo", but IFile calls XPDF as the Apache user.
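To reproduce what IFile does, the same command can be run as the Apache user instead of root. This is a sketch under assumptions: `pick_apache_user` is a hypothetical helper, and it assumes the Apache process is named `apache2` or `httpd`. On Debian and Ubuntu the Apache user is usually `www-data`.

```shell
#!/bin/sh
# Hypothetical helper: read "user command" lines on standard input and
# print the first non-root user that runs apache2 or httpd.
pick_apache_user() {
    awk '($2 == "apache2" || $2 == "httpd") && $1 != "root" { print $1; exit }'
}

# Example use on the server:
#   user=$(ps -eo user=,comm= | pick_apache_user)
#   sudo -u "$user" pdfinfo MYPDF.pdf
```

If the command works with plain `sudo` but fails with `sudo -u`, the Apache user most likely lacks permission to execute the binary or to read the PDF.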
Hi,
I have created a debug script to investigate the problem with XPDF.
1. Copy the file "Adapter_Search_Lucene_Document_PDFX.php" into: ifile/adapter/
2. Rename the extension of one file from "pdf" to "pdfx".
3. Try to index this file from IFile and check the output.
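Steps 1 and 2 can be done from a shell. The sketch below is only an illustration: `pdfx_copy` is a hypothetical helper, and it copies the PDF under the new extension instead of renaming it, so the original file stays in place. The paths in the usage comments are placeholders.

```shell
#!/bin/sh
# Hypothetical helper: copy FILE.pdf to FILE.pdfx and print the new name.
# A copy is used instead of a rename so the original PDF is kept.
pdfx_copy() {
    src=$1
    case "$src" in
        *.pdf) ;;
        *) echo "not a .pdf file: $src" >&2; return 1 ;;
    esac
    dest="${src%.pdf}.pdfx"
    cp -- "$src" "$dest" && echo "$dest"
}

# Example use (placeholder paths):
#   cp Adapter_Search_Lucene_Document_PDFX.php /path/to/ifile/adapter/
#   pdfx_copy /path/to/documents/myfile.pdf
```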
Attachments:
Last edited: 06/10/2015 15:40 by Giampaolo.