Perlfect Solutions
 

[Perlfect-search] PDF Indexing Problems

Quimby, Marc perlfect-search@perlfect.com
Wed, 10 Mar 2004 13:34:33 -0700
How does PDF indexing work?  Does it just convert the PDF to a text file
using pdftotext and then index the text file?  Text files are being of the
PDF's are being created, but I am getting errors during the indexing.

While indexing, I am receiving these errors:
    67: c:/inetpub/wwwroot/v60/qrg/QRG_SmarTrack_Wklist Rules_Portrait.pdf
(611.86 KB)
Use of uninitialized value in substitution (s///) at
C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 275.
Use of uninitialized value in substitution (s///) at
C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 276.
Use of uninitialized value in pattern match (m//) at
C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 464.
Use of uninitialized value in substitution (s///) at
C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 390.
Use of uninitialized value in substitution (s///) at
C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 391.
Use of uninitialized value in substitution (s///) at
C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 392.
Use of uninitialized value in substitution (s///) at
C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 393.
Use of uninitialized value in transliteration (tr///) at
C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 394.
Use of uninitialized value in substitution (s///) at tools.pl line 245.
Use of uninitialized value in substitution (s///) at tools.pl line 247.
Use of uninitialized value in substitution (s///) at tools.pl line 249.
Use of uninitialized value in split at
C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 431.
Use of uninitialized value in pattern match (m//) at
C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 440.
Use of uninitialized value in pattern match (m//) at
C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 487.
    68: c:/inetpub/wwwroot/v60/qrg/QRG_SmarTrack_Wklist Rules_Portrait.txt
(12.18 KB)
Use of uninitialized value in pattern match (m//) at tools.pl line 187.
    69: c:/inetpub/wwwroot/v60/qrg/QRG_UR_Concurrent_Rev_Portrait.pdf
(621.03 KB)
Use of uninitialized value in substitution (s///) at
C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 275.
Use of uninitialized value in substitution (s///) at
C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 276.
Use of uninitialized value in pattern match (m//) at
C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 464.
Use of uninitialized value in substitution (s///) at
C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 390.
Use of uninitialized value in substitution (s///) at
C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 391.
Use of uninitialized value in substitution (s///) at
C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 392.
Use of uninitialized value in substitution (s///) at
C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 393.
Use of uninitialized value in transliteration (tr///) at
C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 394.
Use of uninitialized value in substitution (s///) at tools.pl line 245.
Use of uninitialized value in substitution (s///) at tools.pl line 247.
Use of uninitialized value in substitution (s///) at tools.pl line 249.
Use of uninitialized value in split at
C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 431.
Use of uninitialized value in pattern match (m//) at
C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 440.
Use of uninitialized value in pattern match (m//) at
C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 487.
    70: c:/inetpub/wwwroot/v60/qrg/QRG_UR_Concurrent_Rev_Portrait.txt (12.36
KB)

Crawler finished: indexed 70 files, 304460 terms (7185 different terms).
Ignored 1 files because of conf/no_index.txt
Calculating weight vectors:
0%  10%  20%  30%  40%  50%  60%  70%  80%  90%  100%
|----|----|----|----|----|----|----|----|----|----|
                                                  >

Thanks,
Marc Quimby