TOPIC: File is not a DOC Error

File is not a DOC Error 24 Jul 2012 16:00 #489

  • Rene
I have 240,000 documents in PDF, DOC, RTF, and DOCX format. These documents have been collected since 1997.
I get the error "File is not a DOC" on most of the documents added between 1997 and 2005.

Please let me know if there is something I can do to sort this error out without having to convert all of these files to PDF.

My setup:
MySQL database version: 5.1.61
MySQL database collation: utf8_general_ci
PHP version: 5.3.3
Web server: Apache/2.2.15 (CentOS)
Web server to PHP interface: apache2handler
Joomla! version: Joomla! 2.5.6 Stable [ Ember ] 19-June-2012 14:00 GMT
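
With an archive this large, it may help to check what the failing files actually contain before converting anything. The sketch below is only an illustration in Python, not part of JiFile or IFile: it reads the first bytes of each *.doc file and reports those that do not start with the OLE2 compound-file signature used by binary Word documents. The names `sniff_format` and `mismatched_docs` are made up for this example, and whether a mislabeled file is the cause of "File is not a DOC" here is an assumption, since the thread does not show how IFile performs its check.

```python
from pathlib import Path

# Leading bytes ("magic numbers") of the formats mentioned in this thread.
OLE2_MAGIC = bytes.fromhex("D0CF11E0A1B11AE1")  # container of binary Word .doc files
SIGNATURES = [
    (OLE2_MAGIC, "ole2-doc"),
    (b"{\\rtf", "rtf"),           # RTF saved with a .doc extension
    (b"%PDF-", "pdf"),
    (b"PK\x03\x04", "zip-docx"),  # DOCX is a ZIP archive
]


def sniff_format(path):
    """Return a coarse format label based on the first bytes of the file."""
    with open(path, "rb") as handle:
        head = handle.read(8)
    for magic, label in SIGNATURES:
        if head.startswith(magic):
            return label
    return "unknown"


def mismatched_docs(folder):
    """Yield (path, label) for *.doc files that are not OLE2 containers."""
    for path in sorted(Path(folder).rglob("*.doc")):
        label = sniff_format(path)
        if label != "ole2-doc":
            yield path, label
```

Running `mismatched_docs` over the document folder would list, for example, RTF files that carry a .doc extension. The header check cannot tell which Word version wrote a file, so a file that passes may still be rejected by the indexer. The `*.doc` pattern is matched case-sensitively on Linux, so files named `.DOC` would need a second pass.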

Re: File is not a DOC Error 24 Jul 2012 17:14 #490

  • Giampaolo
  • Administrator
Hi,
JiFile currently supports only the MS Word 97 format. It does not work with Word 2000/XP/2003 and later.

If you want, you can attach a DOC file to this topic so we can test it.

Re: File is not a DOC Error 24 Jul 2012 17:24 #491

  • Rene
Thank you for the reply.

Please refer to sourceforge.net/projects/indexfile/
According to this, IFILE - Lucene PHP framework does support Microsoft Word 97-2000 (.doc) and Microsoft Word 2003-2007 (.docx).

Would it be possible for me to "fix" JiFile so that it can read those documents, or would I be wasting my time trying? I am short on time and would appreciate your opinion.

If this solution does not work, I will have to try Sphinx. I need a solution to this issue ASAP.

Re: File is not a DOC Error 25 Jul 2012 21:48 #493

  • Giampaolo
  • Administrator
Hi Rene,
IFile uses a PHP library to convert Word documents to text.
We are working on using the COM library, but that solution will not work if your server runs Linux.
Can you check whether your server has "antiword" installed?
If it does, it is possible to develop an adapter for DOC documents that uses "antiword" to parse and index your DOC files.
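
As a rough idea of what such an adapter would do, here is a minimal sketch in Python rather than PHP. It is not IFile code: the function names are invented for this example, and it assumes the antiword package and its `UTF-8.txt` character mapping file are installed on the server. It checks that antiword is on the PATH, runs it on one file, and returns the extracted text for indexing.

```python
import shutil
import subprocess


def antiword_available():
    """True if an antiword executable can be found on the PATH."""
    return shutil.which("antiword") is not None


def antiword_command(path):
    """Build the command line; -m selects antiword's UTF-8 character mapping."""
    return ["antiword", "-m", "UTF-8.txt", str(path)]


def doc_to_text(path, timeout=60):
    """Return the plain text of a .doc file, or None if antiword rejects it."""
    if not antiword_available():
        raise RuntimeError("antiword is not installed or not on the PATH")
    result = subprocess.run(
        antiword_command(path),
        capture_output=True,
        timeout=timeout,
    )
    if result.returncode != 0:
        # antiword could not parse the file; the caller decides what to do next.
        return None
    return result.stdout.decode("utf-8", errors="replace")
```

A real adapter would pass the returned text to the indexer and log the files for which `doc_to_text` returns None, so they can be handled another way.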

If you attach a DOC file that does not work with IFile, we can look for a solution.

As I recall, Sphinx is a search engine that uses MySQL rather than Lucene, but you still have to convert PDF or Word documents to text before it can index them.

Re: File is not a DOC Error 25 Jul 2012 21:51 #494

  • Rene
Thank you for your reply.

The Sphinx solution will not work for me - I checked.

We will be converting all our Word docs to PDF! This is the only feasible solution at this stage. Thank you for your trouble and thank you for a wonderful component. It really works well.
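
The thread does not say which tool is used for the conversion. As one possible approach, the sketch below assumes LibreOffice is installed on the server and drives its `soffice` binary in headless mode from Python. The function names are hypothetical and the timeout is an arbitrary choice.

```python
import shutil
import subprocess
from pathlib import Path


def conversion_command(doc_path, out_dir, binary="soffice"):
    """Build a headless LibreOffice command that writes a PDF into out_dir."""
    return [
        binary,
        "--headless",
        "--convert-to", "pdf",
        "--outdir", str(out_dir),
        str(doc_path),
    ]


def convert_to_pdf(doc_path, out_dir, timeout=120):
    """Convert one document; return the PDF path, or None if none was written."""
    if shutil.which("soffice") is None:
        raise RuntimeError("soffice (LibreOffice) is not on the PATH")
    subprocess.run(
        conversion_command(doc_path, out_dir),
        check=True,
        capture_output=True,
        timeout=timeout,
    )
    pdf_path = Path(out_dir) / (Path(doc_path).stem + ".pdf")
    return pdf_path if pdf_path.exists() else None
```

For a large archive, it would make sense to convert in batches and keep a list of files for which `convert_to_pdf` returns None or raises, so they can be checked by hand.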