Forum

JiFile for Joomla!

JIFile
JiFile is a component for Joomla! that allows you to index file contents (PDF, DOC, etc..) to perform searches in them.

Learn more...  Demo

JoomPhoto Mobile

JPhotoMobile
JoomPhoto Mobile is a component for Joomla! that allows you to share the photos from your Android device to your site Joomla.

Learn more...  Demo

iFile Framework

IFile
IFile is an open source framework written entirely in PHP, allows the indexing of textual content of a document (DOC, PDF, EXCEL, etc.) and a quick search within them.

Learn more...  Demo

Easy Language

EasyLanguage
Easy Language is a plugin for easy and immediate management of multilingual texts in every possible area of joomla, articles, components, modules, metadata, template, other components(example K2) etc.

Learn more...

Article Book Effect

Article Book Effect
View Joomla articles with the effect turns the page of a book. This plugin will display the contents of an article in Joomla as a real book or magazine, using all the benefits of HTML5

Learn more...  Demo

 

Passport photo

Passport photo
The most popular Android app that allows you to print photos cards for your documents with your Android smartphone, in a simple and intuitive way.

Learn more...

 

Crazy Shadow

Crazy Shadow
Crazy Shadow is the 3D fast-paced and fun puzzle Android game! Try to rotate and drag shapes in the position of their shadows without fail! Solve in succession all combinations of levels of the game.

Learn more...

 

Admin Countdown

Admin Countdown
Module for Joomla! 2.5 and 3.x displays in the administration part of the site, a timer with countdown of the time remaining in your session.

Learn more...  Demo

 
Welcome, Guest
Username: Password: Remember me

TOPIC: Scanned PDF don't getting indexed by jiFile

Scanned PDF don't getting indexed by jiFile 06 Nov 2012 16:17 #752

Hi @all,
I'm trying to index some scanned pdf files.
jiFile works fine with pdf printed files,
but not with scanned files. With manual indexing such a scanned file the body is empty, with automatic indexing the error says: Text of body not indexing. Check the type of encoding

I have an example file, but the content is sensitive, do you have an e-mail adress where the file could been sent to?

Thanks for your support.

Michael
Last Edit: 06 Nov 2012 16:18 by Giu Frazzetta.
The administrator has disabled public write access.

Re: Scanned PDF don't getting indexed by jiFile 06 Nov 2012 16:49 #753

  • Giampaolo
  • Giampaolo's Avatar
  • OFFLINE
  • Administrator
  • Posts: 465
  • Thank you received: 43
Hi,
you can write to

info[at]isapp.it
If you like, if it was useful, consider a donation, Thanks
Se vuoi, se ti siamo stati utili, considera una donazione, Grazie
Help us by voting our extensions on Joomla.org:
JiFile
JoomPhoto Mobile
Easy Language
The administrator has disabled public write access.

Re: Scanned PDF don't getting indexed by jiFile 06 Nov 2012 17:24 #754

Thank you Giampaolo,
sent the file.
The administrator has disabled public write access.

Re: Scanned PDF don't getting indexed by jiFile 06 Nov 2012 18:44 #755

  • Giampaolo
  • Giampaolo's Avatar
  • OFFLINE
  • Administrator
  • Posts: 465
  • Thank you received: 43
Hi,
JiFile use XPDF. This component not reads text from Image.
In fact if you try to select text from your PDF file, you can not copy text.

Your PDF is created how a image, you can scan your document using a OCR instrument, for create a PDF file in text form.
Warning!!!!, a PDF file, with OCR scan, could have problems when XPDF reads text (you could have loss of information).
If you like, if it was useful, consider a donation, Thanks
Se vuoi, se ti siamo stati utili, considera una donazione, Grazie
Help us by voting our extensions on Joomla.org:
JiFile
JoomPhoto Mobile
Easy Language
The administrator has disabled public write access.