Perlfect Solutions
 

[Perlfect-search] PDF Setup

Brian G. Hugh bhugh@onebox.com
Mon, 03 Dec 2001 20:37:16 -0600
I'm having trouble getting the search to work for PDF files.  When I run
the indexer, it goes through the directory that contains the PDFs, but it
only indexes the other file types defined in the following setting:

# Only files with these extensions should be indexed. [re-index]
@EXT = ("htm","pdf","shtml","txt");

I've also set up pdftotext, as follows:

###########################################################################
### advanced configuration
### You only need this if you want to adapt advanced features

# Program that converts PDF to ascii text. pdftotext is part of xpdf, available
# at http://www.foolabs.com/xpdf/download.html. You also have to add "pdf"
# to @EXT and your PDF files must have a ".pdf" suffix. You can use any program
# that will print ASCII to STDOUT if called this way: "program pdf_filename -".
# WARNING: The PDFs filenames may not include special characters for security
# reasons, still it is recommended to use this option only to index your own
# files, not other people's files which filenames you cannot control. [re-index]
#$PDFTOTEXT = '/usr/bin/pdftotext';
$PDFTOTEXT = 'c:\Program Files\xpdf\pdftotext';

I can run from the command line and convert a pdf to text manually.
I am running under Windows 2000.
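
For reference, here is a minimal sketch of how the same manual test could
be repeated from Perl, using the calling form the config comment describes
("program pdf_filename -").  The file name test.pdf and the helper
build_command are placeholders of mine, and quoting the program path (so
the space in "Program Files" does not split it) is only an assumption; I
don't know how the indexer itself builds its command line.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Path as set in the configuration above; the PDF name is a placeholder.
my $PDFTOTEXT = 'c:\Program Files\xpdf\pdftotext';
my $pdf_file  = 'test.pdf';

# Build the command in the documented form: "program pdf_filename -".
# Quoting both parts is an assumption, so that the space in
# "Program Files" does not split the program path into two arguments.
sub build_command {
    my ($program, $file) = @_;
    return qq{"$program" "$file" -};
}

my $command = build_command($PDFTOTEXT, $pdf_file);
print "Running: $command\n";

# Capture whatever the converter prints to STDOUT.
my $text = `$command`;
if ($? != 0 || !defined $text) {
    print "pdftotext failed or was not found (status $?)\n";
} else {
    printf "Got %d characters of text\n", length $text;
}
```

If this prints a character count while the indexer still skips the PDFs,
the converter itself is probably fine and the problem lies in how it gets
called.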

Thanks in advance for any help.
