Perlfect Solutions
 

[Perlfect-search] Perlfect and PDF files

Steve Lawrence perlfect-search@perlfect.com
Fri, 22 Aug 2003 12:40:21 -0600
I can index pdf files, but when I get search results, the abstracts are
whacky. I can run pdftotext manually and dump the results to a text
file, and that�s good, but when using the search/results, it's bad. Here
is an example from the results page: 1 and 3 are screwed, but number
two, which is a plain pdf with only text is good. Help Please!

1. 5465.PDF 
%PDF-1.3 %���� 115 0 obj endobj xref 115 15 0000000016 00000 n
0000000651 00000 n 0000001503 00000 n 0000001661 00000 n 0000001861
00000 n 0000002038...
URL: http://ca.nexiaweb.com/pdf/5465.PDF   Score: 100%   Date:
2003-08-22   Size: 606 kB 


2. adobe.pdf 
This is an Adobe PDF document. If you search for the word Adobe, it
should come up in the search results.
URL: http://ca.nexiaweb.com/pdf/adobe.pdf   Score: 12%   Date:
2003-08-22   Size: 28 kB 


3. 5444.PDF 
%PDF-1.2 %���� 10 0 obj endobj xref 10 17 0000000016 00000 n 0000000686
00000 n 0000001076 00000 n 0000001229 00000 n 0000001436 00000 n
0000001616...
URL: http://ca.nexiaweb.com/pdf/5444.PDF   Score: 12%   Date: 2003-08-22
Size: 14 kB