Hi
Thank you for the great IFile framework. Today I ran into a problem with XPDF when indexing PDF files. I developed a small app with your framework on a local server, where everything works. On the production server, however, no PDF files are indexed, and I get this error:
Error in document: (myfile.pdf) - Empty body
I checked both servers with your ServerCheck and everything passes. I tried changing every setting in the config XML, without success. Could you help me? Thanks in advance.
Last Edit: 01 Oct 2015 16:24 by tesson.
The administrator has disabled public write access.
|
|
Hi,
Are the local and production servers identical? The problem could be one of two types:
1. Operating system libraries. IFile uses XPDF to extract the content of documents, and XPDF depends on many operating system libraries. You can run the XPDF binaries shipped with IFile from the command line and verify that they work correctly. This page describes the command line: linux.about.com/library/cmd/blcmdl1_xpdf.htm
2. Server encoding. XPDF uses the operating system encoding, so you need to verify that both servers use the same encoding.
If you need more information or help, you can open a ticket from this link: www.isapp.it/en/support-ticket.html
Thanks
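A minimal sketch of both checks, assuming a POSIX shell on each server. The function names `pdf_text_bytes` and `server_charmap` are hypothetical helpers and not part of IFile or XPDF; `pdftotext FILE -` (text to standard output) and `locale charmap` are standard commands. If one server reports 0 bytes where the other reports text, that would be consistent with the "Empty body" error.

```shell
#!/bin/sh
# Print how many bytes of text a pdftotext binary extracts from a PDF.
# $1: path to the pdftotext executable, $2: path to the PDF file.
# A failing or missing binary yields 0.
pdf_text_bytes() {
    "$1" "$2" - 2>/dev/null | wc -c | tr -d ' '
}

# Print the character encoding of the current locale (for example UTF-8),
# so the value can be compared between the two servers.
server_charmap() {
    locale charmap
}
```

Run the same call on both servers, for example `pdf_text_bytes PATH-TO-ROOT/html/components/com_joboffer/libraries/ifile/adapter/helpers/binaries/linux/pdftotext myfile.pdf`, then compare the output of `server_charmap`.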
If you like, if it was useful, consider a donation, Thanks
Help us by voting for our extensions on Joomla.org: JiFile JoomPhoto Mobile Easy Language
The following user(s) said Thank You: tesson
|
|
Thanks a lot for your quick answer.
For your information, my servers are not identical (Ubuntu 12.04 locally, Debian 3 in production), but I use your XPDF files to process the PDFs. I have not set any XPDFRC file for your XPDF binaries (pdfinfo, pdftotext). Could that be the issue? Here is the content of my IFileConfig.xml file (I only replaced the root path with PATH-TO-ROOT):
<?xml version="1.0" encoding="UTF-8"?>
<!--
/**
* Read the readme.txt file for a description of the tags in the IFileConfig.xml file
*/
-->
<ifile>
<root-application>PATH-TO-ROOT/html/components/com_joboffer/libraries/ifile</root-application>
<table-name collation="utf8_general_ci">indexjifile</table-name>
<!--timelimit>180</timelimit-->
<!--memorylimit></memorylimit-->
<resultlimit>100</resultlimit>
<!--default-search-field>body</default-search-field-->
<!--duplicate>1</duplicate-->
<server bit="64" />
<!-- encoding></encoding-->
<xpdf>
<pdfinfo>
<executable>PATH-TO-ROOT/html/components/com_joboffer/libraries/ifile/adapter/helpers/binaries/linux/pdfinfo</executable>
<xpdfrc></xpdfrc>
</pdfinfo>
<pdftotext>
<executable>PATH-TO-ROOT/html/components/com_joboffer/libraries/ifile/adapter/helpers/binaries/linux/pdftotext</executable>
<xpdfrc></xpdfrc>
</pdftotext>
<opw></opw>
</xpdf>
<doctotxt encoding="8859-1" type="PHP" />
<!-- doctotxt encoding="8859-1" type="ANTIWORD" /-->
<!--doctotxt encoding="" type="COM" /-->
<zend-document>
<fields>
<field name="name" type="Text" />
<field name="path" type="Text" />
<field name="filename" type="Text" />
</fields>
</zend-document>
<!--
<zend-document>
<fields>
<field name="body" type="UnStored" encoding="UTF-8"/>
<field name="introtext" type="Keyword" encoding="UTF-8" />
<field name="title" type="UnStored" encoding="UTF-8" />
<field name="subject" type="UnStored" encoding="UTF-8" />
<field name="description" type="UnStored" encoding="UTF-8" />
<field name="creator" type="UnStored" encoding="UTF-8" />
<field name="keywords" type="UnStored" encoding="UTF-8" />
<field name="created" type="UnStored" encoding="UTF-8" />
<field name="modified" type="UnStored" encoding="UTF-8" />
</fields>
</zend-document>
-->
<analyzer>
<type>
<!--default>Utf8_CaseInsensitive</default-->
<default>Utf8Num_CaseInsensitive</default>
<!--custom-default class="StandardAnalyzer_Analyzer_Standard_English">C:\xampp\htdocs\IFile\analyzer\StandardAnalyzer\Analyzer\Standard\English.php</custom-default-->
</type>
<filters>
<!--stop-words>C:\xampp\htdocs\IFile\stopwords\stopwords.txt</stop-words-->
<!--short-words>2</short-words-->
<!--custom-filters>
<filter class="StandardAnalyzer_Analysis_TokenFilter_EnglishPECLStemmer">C:\xampp\htdocs\IFile\sf_IFile-Allura\trunk\tokenfilter\stemming\english\EnglishPECLStemmer.php</filter>
<filter class="StandardAnalyzer_Analysis_TokenFilter_EnglishPorterStemmer">C:\xampp\htdocs\IFile\sf_IFile-Allura\trunk\tokenfilter\stemming\english\EnglishPorterStemmer.php</filter>
<filter class="StandardAnalyzer_Analysis_TokenFilter_ItalianPECLStemmer">C:\xampp\htdocs\IFile\sf_IFile-Allura\trunk\tokenfilter\stemming\italian\ItalianPECLStemmer.php</filter>
</custom-filters-->
</filters>
</analyzer>
</ifile>
Have a nice day.
Last Edit: 05 Oct 2015 17:30 by tesson.
|
|
Hi,
I see that you have configured IFile to work with a 32-bit system. If your operating system (Debian 3) is 64-bit, you must configure IFile to use the 64-bit XPDF for Linux. Define [server bit="64"] and delete the XPDF configuration (IFile automatically picks the correct XPDF). Here is an example:
<?xml version="1.0" encoding="UTF-8"?>
<ifile>
<root-application>PATH-TO-ROOT/html/components/com_joboffer/libraries/ifile</root-application>
<table-name collation="utf8_general_ci">indexjifile</table-name>
<resultlimit>100</resultlimit>
<server bit="64" />
<doctotxt encoding="8859-1" type="PHP" />
<zend-document>
<fields>
<field name="name" type="Text" />
<field name="path" type="Text" />
<field name="filename" type="Text" />
</fields>
</zend-document>
<analyzer>
<type>
<default>Utf8Num_CaseInsensitive</default>
</type>
<filters>
</filters>
</analyzer>
</ifile>
Last Edit: 06 Oct 2015 09:26 by Giampaolo.
|
|
Hi
I tried your config file settings and compared them with my original settings; they differ slightly. I also had to change the permissions on the bin64 folder and the files inside it. Now it's OK... I found my previous problem: no PDF indexed, and an "Empty body" message for every file.
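The permission check on the binaries folder can be sketched as follows. This is only an illustration: `list_not_world_executable` is a hypothetical helper name, and it assumes the web server user is neither the owner nor in the group of the files, so only the "others" execute bit matters.

```shell
#!/bin/sh
# List regular files under a directory that lack the execute bit for "others".
# $1: directory holding the XPDF binaries (for example the bin64 folder).
# Assumption: the web server user only has "others" permissions on these files.
list_not_world_executable() {
    find "$1" -type f ! -perm -001
}
```

Any file printed by the function would need something like `chmod o+x FILE` (or a suitable owner or group change) before the web server user can run it.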
Last Edit: 06 Oct 2015 09:43 by tesson.
|
|
hi
I decided to install the XPDF library directly on my production server. I can test it now, and everything works when I run this command:
sudo pdfinfo MYPDF.pdf
Now I have to understand why it does not work with Apache...
|
|
Hi,
I think the problem is the permission to invoke XPDF from PHP. I see that you call XPDF with "sudo", but IFile calls XPDF as the Apache user.
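One way to reproduce this from the command line is to run the same command as the web server user instead of root. The sketch below assumes that user is `www-data` (the default on Debian and Ubuntu; set `APACHE_USER` if yours differs). `as_web_user` and the `DRY_RUN` switch are hypothetical names introduced only for this example.

```shell
#!/bin/sh
# Run a command as the web server user rather than as root.
# APACHE_USER: user to run as (assumed default: www-data).
# DRY_RUN: if non-empty, only print the command that would be executed.
as_web_user() {
    web_user="${APACHE_USER:-www-data}"
    if [ -n "${DRY_RUN:-}" ]; then
        echo "sudo -u $web_user $*"
    else
        sudo -u "$web_user" "$@"
    fi
}
```

For example, `as_web_user pdfinfo MYPDF.pdf` should fail with a permission error if the Apache user cannot execute the binary or read the file, even though `sudo pdfinfo MYPDF.pdf` succeeds.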
|
|
Hi,
I have created a debug script to investigate the problem with XPDF.
1. Copy the file "Adapter_Search_Lucene_Document_PDFX.php" into: ifile/adapter/
2. Change the extension of one PDF file from "pdf" to "pdfx".
3. Try to index this file from IFile and check the output.
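Step 2 can be sketched as a small shell helper. `make_pdfx_copy` is a hypothetical name, and it copies the file rather than renaming it, so the original PDF stays in place; that is a choice made for this example, not something the debug script requires.

```shell
#!/bin/sh
# Copy a PDF next to itself with the .pdfx extension and print the new path.
# $1: path to a file whose name ends in .pdf
make_pdfx_copy() {
    target="${1%.pdf}.pdfx"
    cp "$1" "$target" && printf '%s\n' "$target"
}
```

For example, `make_pdfx_copy myfile.pdf` creates `myfile.pdfx`, which can then be indexed from IFile to see the debug output.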
Attachments:
|
Last Edit: 06 Oct 2015 15:40 by Giampaolo.