From perlfect-search@perlfect.com Mon Mar 1 04:03:40 2004 From: perlfect-search@perlfect.com (Michael Borck) Date: Mon, 1 Mar 2004 12:03:40 +0800 Subject: [Perlfect-search] Dynamic Page (PHP pages) Indexing Problems In-Reply-To: <200402262219.45959@danielnaber.de> References: <0028CD68-67A8-11D8-B1BE-000A9587472A@computing.edu.au> <200402252024.52625@danielnaber.de> <47FFFA1C-6812-11D8-927F-000A9587472A@computing.edu.au> <200402262219.45959@danielnaber.de> Message-ID: <6CA333FC-6B35-11D8-8D7D-000A9587472A@computing.edu.au> Hi Daniel, >>  I have even >> tried chmod 777.  Still no luck.  I get the "Cannot open error". > > Michael, > > did you also set the directory's permissions to 777? Setting the files' > permissions will not be enough. Yes, no luck. This is a summary of what I have discovered so far: I can open/download the file in the browser, so file/path/permissions okay I index via CGI search works (but because of https, indexes nothing) I index via filesystem search fails (but index all files) I had perlfect search work, then it stopped The techos upgraded a few machines (but not the web server) I moved my site from .shtm to php pages (doubt related, but including anyway) Based on this, I suspect it is a problem with the sturucture of the index file, which might be being caused by different versions of DB_File installed. The configuration here is that I cannot directly log onto the web server, only have access to an shared area on the server. So I think when I index via the file system, the local version of DB_File is different to the version of DB_File on the web server. This fits with machines on our network being upgraded. I have asked the techos for more detailed information (version of DB_FILE on server etc), and will let you know of my progress. Thought I would keep you informed of my progress. Michael. -- Associate Lecturer E-mail: michael@cs.curtin.edu.au Curtin University Phone: +61 8 9266 7939 GPO Box U1987, PERTH Fax: +61 8 9266 2819 WA 6001, Australia CRICOS Provider Code: 00301J From perlfect-search@perlfect.com Mon Mar 1 09:25:04 2004 From: perlfect-search@perlfect.com (Michael Borck) Date: Mon, 1 Mar 2004 17:25:04 +0800 Subject: [Perlfect-search] Dynamic Page (PHP pages) Indexing Problems In-Reply-To: <6CA333FC-6B35-11D8-8D7D-000A9587472A@computing.edu.au> References: <0028CD68-67A8-11D8-B1BE-000A9587472A@computing.edu.au> <200402252024.52625@danielnaber.de> <47FFFA1C-6812-11D8-927F-000A9587472A@computing.edu.au> <200402262219.45959@danielnaber.de> <6CA333FC-6B35-11D8-8D7D-000A9587472A@computing.edu.au> Message-ID: <52AF561D-6B62-11D8-A0DE-000A9587472A@computing.edu.au> Hi All, > Based on this, I suspect it is a problem with the sturucture of the > index file, which might be being caused by different versions of > DB_File installed. The configuration here is that I cannot directly > log onto the web server, only have access to an shared area on the > server. So I think when I index via the file system, the local > version of DB_File is different to the version of DB_File on the web > server. Yep this was the problem. The techos ran the indexer on the server (not via a mounted filesystem on a different box) and the index files are okay and search is working again. Cheers, Michael. -- From perlfect-search@perlfect.com Mon Mar 1 09:32:18 2004 From: perlfect-search@perlfect.com (Michael Borck) Date: Mon, 1 Mar 2004 17:32:18 +0800 Subject: [Perlfect-search] Changing the width of the output In-Reply-To: <6CA333FC-6B35-11D8-8D7D-000A9587472A@computing.edu.au> References: <0028CD68-67A8-11D8-B1BE-000A9587472A@computing.edu.au> <200402252024.52625@danielnaber.de> <47FFFA1C-6812-11D8-927F-000A9587472A@computing.edu.au> <200402262219.45959@danielnaber.de> <6CA333FC-6B35-11D8-8D7D-000A9587472A@computing.edu.au> Message-ID: <5516BDC0-6B63-11D8-A0DE-000A9587472A@computing.edu.au> Hi All, I have managed to include the output of perlsearch to fit within our template so as to keep with the look and feel of the website. But I cannot see/find where to change modify the width. Often the results returned are wider than the browser window requiring to scroll across. Any idea where in search.cgi I could restrict/set this width? I have tried including the output in a table, and fixing the width of the table but it doesn't seem to be having an affect. Any help would be appreciated? Michael. -- From perlfect-search@perlfect.com Mon Mar 1 19:09:14 2004 From: perlfect-search@perlfect.com (Daniel Naber) Date: Mon, 1 Mar 2004 20:09:14 +0100 Subject: [Perlfect-search] Changing the width of the output In-Reply-To: <5516BDC0-6B63-11D8-A0DE-000A9587472A@computing.edu.au> References: <0028CD68-67A8-11D8-B1BE-000A9587472A@computing.edu.au> <6CA333FC-6B35-11D8-8D7D-000A9587472A@computing.edu.au> <5516BDC0-6B63-11D8-A0DE-000A9587472A@computing.edu.au> Message-ID: <200403012009.14996@danielnaber.de> On Monday 01 March 2004 10:32, Michael Borck wrote: > But I > cannot see/find where to change modify the width.   Either your template contains broken HTML or the browser doesn't add line breaks because there's no whitespace in the summary (or titles). That's not something which can be changed in Perlfect Search (unless you modify the code). Regards Daniel -- http://www.danielnaber.de From perlfect-search@perlfect.com Tue Mar 2 11:27:49 2004 From: perlfect-search@perlfect.com (Cyberpump!) Date: Tue, 2 Mar 2004 05:27:49 -0600 Subject: [Perlfect-search] Re: Indexing References: <20040229170002.31000.97837.Mailman@hottub.perlfect.com> Message-ID: <01da01c40049$66399980$0200a8c0@cyberpump2> I have a 2.4 Ghz, 512 RAM dedicated. I sit there on the command line and watch it take minutes to index one file. So, it takes me longer to index a 70k file that it does for you to index 2700 6kb. So, you are telling me it's this slow? Thanks. > Indexing and searching speed obviously depends on many factors. I can index > 2700 HTML files (average size 6kB) in 85 seconds (1,5 Ghz Pentium M, 512 > MB RAM, no other processes using CPU). > > Regards > Daniel > From perlfect-search@perlfect.com Tue Mar 2 18:11:17 2004 From: perlfect-search@perlfect.com (Daniel Naber) Date: Tue, 2 Mar 2004 19:11:17 +0100 Subject: [Perlfect-search] Re: Indexing In-Reply-To: <01da01c40049$66399980$0200a8c0@cyberpump2> References: <20040229170002.31000.97837.Mailman@hottub.perlfect.com> <01da01c40049$66399980$0200a8c0@cyberpump2> Message-ID: <200403021911.17974@danielnaber.de> On Tuesday 02 March 2004 12:27, Cyberpump! wrote: > I sit there on the command line and watch it take minutes to index one > file. Then something is wrong. Does it happen with any file or is that file special in a way (maybe broken HTML?)? Regards Daniel -- http://www.danielnaber.de From perlfect-search@perlfect.com Wed Mar 3 17:43:24 2004 From: perlfect-search@perlfect.com (Cyberpump) Date: Wed, 03 Mar 2004 12:43:24 -0500 Subject: [Perlfect-search] Re: Indexing Message-ID: The documents are just normal HTML files. Nothing special. That's what's puzzling why it takes so long. From perlfect-search@perlfect.com Sat Mar 6 22:14:59 2004 From: perlfect-search@perlfect.com (Cyberpump!) Date: Sat, 6 Mar 2004 16:14:59 -0600 Subject: [Perlfect-search] Error on Running Indexer Message-ID: <025601c403c8$782c7ae0$0200a8c0@cyberpump2> I ran the setup. All seemed fine. Get this now when running indexer.pl Using DB_File... Checking for old temp files... Cannot open /home/perlfect/perlfect.com/cgi-bin/search/data/inv_index_tmp: No such file or directory at indexer.pl line 132. From perlfect-search@perlfect.com Sat Mar 6 22:30:31 2004 From: perlfect-search@perlfect.com (Daniel Naber) Date: Sat, 6 Mar 2004 23:30:31 +0100 Subject: [Perlfect-search] Error on Running Indexer In-Reply-To: <025601c403c8$782c7ae0$0200a8c0@cyberpump2> References: <025601c403c8$782c7ae0$0200a8c0@cyberpump2> Message-ID: <200403062330.31934@danielnaber.de> On Saturday 06 March 2004 23:14, Cyberpump! wrote: > /home/perlfect/perlfect.com/cgi-bin/search/data/inv_index_tmp: No such For some reason there are still the example values in your configuration -- are you sure you're using the version installed by the setup and not the one you unzipped? If so, please manually modify the values in conf.pl (those in the "basic configuration" section). Regards Daniel -- http://www.danielnaber.de From perlfect-search@perlfect.com Sun Mar 7 17:06:59 2004 From: perlfect-search@perlfect.com (Cyberpump!) Date: Sun, 7 Mar 2004 11:06:59 -0600 Subject: [Perlfect-search] Re: Error on Running Indexer References: <20040307170001.25974.83635.Mailman@hottub.perlfect.com> Message-ID: <03fa01c40466$9bf18ad0$0200a8c0@cyberpump2> I did a re-install from scratch. And, this time let the indexer run fully. All is working now! Thanks!!! From perlfect-search@perlfect.com Wed Mar 10 09:54:40 2004 From: perlfect-search@perlfect.com (Yves Hanotiau) Date: Wed, 10 Mar 2004 10:54:40 +0100 Subject: [Perlfect-search] indexing of PDF metadata Message-ID: <3A990DB4C59FD411899B00D0B76961B601D1249C@mailoffice.senate.be> Hello, Are the metadata of a PDF file indexed by Perlfect ? By metadata, I think to : Title - Subject - Author - Keywords Thanks in advance. Yves HANOTIAU From perlfect-search@perlfect.com Wed Mar 10 18:46:42 2004 From: perlfect-search@perlfect.com (Daniel Naber) Date: Wed, 10 Mar 2004 19:46:42 +0100 Subject: [Perlfect-search] indexing of PDF metadata In-Reply-To: <3A990DB4C59FD411899B00D0B76961B601D1249C@mailoffice.senate.be> References: <3A990DB4C59FD411899B00D0B76961B601D1249C@mailoffice.senate.be> Message-ID: <200403101946.42797@danielnaber.de> On Wednesday 10 March 2004 10:54, Yves Hanotiau wrote: > Are the metadata of a PDF file indexed by Perlfect ? > By metadata, I think to : > Title - Subject - Author - Keywords Go to conf.pl and add -htmlmeta as an option to pdftotext. Then the meta data will be indexed (not sure if all the fields you mentioned are supported, and maybe you need a recent version of pdftotext). Regards Daniel -- http://www.danielnaber.de From perlfect-search@perlfect.com Wed Mar 10 20:34:33 2004 From: perlfect-search@perlfect.com (Quimby, Marc) Date: Wed, 10 Mar 2004 13:34:33 -0700 Subject: [Perlfect-search] PDF Indexing Problems Message-ID: <69B2A23FF27DBA43BEAE8CE30C87093A256E6D@tusexu01.midasnet.local> How does PDF indexing work? Does it just convert the PDF to a text file using pdftotext and then index the text file? Text files are being of the PDF's are being created, but I am getting errors during the indexing. While indexing, I am receiving these errors: 67: c:/inetpub/wwwroot/v60/qrg/QRG_SmarTrack_Wklist Rules_Portrait.pdf (611.86 KB) Use of uninitialized value in substitution (s///) at C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 275. Use of uninitialized value in substitution (s///) at C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 276. Use of uninitialized value in pattern match (m//) at C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 464. Use of uninitialized value in substitution (s///) at C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 390. Use of uninitialized value in substitution (s///) at C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 391. Use of uninitialized value in substitution (s///) at C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 392. Use of uninitialized value in substitution (s///) at C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 393. Use of uninitialized value in transliteration (tr///) at C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 394. Use of uninitialized value in substitution (s///) at tools.pl line 245. Use of uninitialized value in substitution (s///) at tools.pl line 247. Use of uninitialized value in substitution (s///) at tools.pl line 249. Use of uninitialized value in split at C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 431. Use of uninitialized value in pattern match (m//) at C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 440. Use of uninitialized value in pattern match (m//) at C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 487. 68: c:/inetpub/wwwroot/v60/qrg/QRG_SmarTrack_Wklist Rules_Portrait.txt (12.18 KB) Use of uninitialized value in pattern match (m//) at tools.pl line 187. 69: c:/inetpub/wwwroot/v60/qrg/QRG_UR_Concurrent_Rev_Portrait.pdf (621.03 KB) Use of uninitialized value in substitution (s///) at C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 275. Use of uninitialized value in substitution (s///) at C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 276. Use of uninitialized value in pattern match (m//) at C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 464. Use of uninitialized value in substitution (s///) at C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 390. Use of uninitialized value in substitution (s///) at C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 391. Use of uninitialized value in substitution (s///) at C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 392. Use of uninitialized value in substitution (s///) at C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 393. Use of uninitialized value in transliteration (tr///) at C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 394. Use of uninitialized value in substitution (s///) at tools.pl line 245. Use of uninitialized value in substitution (s///) at tools.pl line 247. Use of uninitialized value in substitution (s///) at tools.pl line 249. Use of uninitialized value in split at C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 431. Use of uninitialized value in pattern match (m//) at C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 440. Use of uninitialized value in pattern match (m//) at C:\Inetpub\wwwroot\cgi-bin\perlfect\search\indexer.pl line 487. 70: c:/inetpub/wwwroot/v60/qrg/QRG_UR_Concurrent_Rev_Portrait.txt (12.36 KB) Crawler finished: indexed 70 files, 304460 terms (7185 different terms). Ignored 1 files because of conf/no_index.txt Calculating weight vectors: 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% |----|----|----|----|----|----|----|----|----|----| > Thanks, Marc Quimby From perlfect-search@perlfect.com Wed Mar 10 21:09:07 2004 From: perlfect-search@perlfect.com (Daniel Naber) Date: Wed, 10 Mar 2004 22:09:07 +0100 Subject: [Perlfect-search] PDF Indexing Problems In-Reply-To: <69B2A23FF27DBA43BEAE8CE30C87093A256E6D@tusexu01.midasnet.local> References: <69B2A23FF27DBA43BEAE8CE30C87093A256E6D@tusexu01.midasnet.local> Message-ID: <200403102209.07137@danielnaber.de> On Wednesday 10 March 2004 21:34, Quimby, Marc wrote: > How does PDF indexing work?  Does it just convert the PDF to a text file > using pdftotext and then index the text file? Perlfect Search calls the external converter defined in %EXT_FILTER and uses its result (text or HTML). I guess your configuration is broken. Please try to call the converter (usually pdftotext) manually and see what it prints. Regards Daniel -- http://www.danielnaber.de From perlfect-search@perlfect.com Wed Mar 10 22:58:12 2004 From: perlfect-search@perlfect.com (Quimby, Marc) Date: Wed, 10 Mar 2004 15:58:12 -0700 Subject: [Perlfect-search] PDF Indexing Problems Message-ID: <69B2A23FF27DBA43BEAE8CE30C87093A256E6E@tusexu01.midasnet.local> Here is my execution of pdftotext from the command prompt: C:\Inetpub\wwwroot\Search_Demo\catalog\documentation\care_management\v50-51> pdftotext DataVision_UserManual_50-51.pdf It generated a text file called DataVision_UserManual_50-51.txt in that same directory. (C:\Inetpub\wwwroot\Search_Demo\catalog\documentation\care_management\v50-51 ) Then (I am assuming), Perlfect uses the text file to index the pdf? Thanks, Marc Quimby -----Original Message----- From: perlfect-search-admin@perlfect.com [mailto:perlfect-search-admin@perlfect.com]On Behalf Of Daniel Naber Sent: Wednesday, March 10, 2004 2:09 PM To: perlfect-search@perlfect.com Subject: Re: [Perlfect-search] PDF Indexing Problems On Wednesday 10 March 2004 21:34, Quimby, Marc wrote: > How does PDF indexing work?  Does it just convert the PDF to a text file > using pdftotext and then index the text file? Perlfect Search calls the external converter defined in %EXT_FILTER and uses its result (text or HTML). I guess your configuration is broken. Please try to call the converter (usually pdftotext) manually and see what it prints. Regards Daniel -- http://www.danielnaber.de _______________________________________________ perlfect-search mailing list perlfect-search@perlfect.com To unsubscribe, set other personal options or view the list archives please visit: http://perlfect.com/mailman/listinfo/perlfect-search  From perlfect-search@perlfect.com Wed Mar 10 23:38:07 2004 From: perlfect-search@perlfect.com (Daniel Naber) Date: Thu, 11 Mar 2004 00:38:07 +0100 Subject: [Perlfect-search] PDF Indexing Problems In-Reply-To: <69B2A23FF27DBA43BEAE8CE30C87093A256E6E@tusexu01.midasnet.local> References: <69B2A23FF27DBA43BEAE8CE30C87093A256E6E@tusexu01.midasnet.local> Message-ID: <200403110038.07291@danielnaber.de> On Wednesday 10 March 2004 23:58, Quimby, Marc wrote: > Then (I am assuming), Perlfect uses the text file to index the pdf? No, the converter needs to print its output, not write it to a file. That's why the default configuration looks like this: "/usr/bin/pdftotext FILENAME -" Note the space and the dash at the end. Regards Daniel -- http://www.danielnaber.de From perlfect-search@perlfect.com Thu Mar 11 14:21:06 2004 From: perlfect-search@perlfect.com (Quimby, Marc) Date: Thu, 11 Mar 2004 07:21:06 -0700 Subject: [Perlfect-search] PDF Indexing Problems Message-ID: <69B2A23FF27DBA43BEAE8CE30C87093A256E6F@tusexu01.midasnet.local> Thanks, that did it. Marc Quimby -----Original Message----- From: perlfect-search-admin@perlfect.com [mailto:perlfect-search-admin@perlfect.com]On Behalf Of Daniel Naber Sent: Wednesday, March 10, 2004 4:38 PM To: perlfect-search@perlfect.com Subject: Re: [Perlfect-search] PDF Indexing Problems On Wednesday 10 March 2004 23:58, Quimby, Marc wrote: > Then (I am assuming), Perlfect uses the text file to index the pdf? No, the converter needs to print its output, not write it to a file. That's why the default configuration looks like this: "/usr/bin/pdftotext FILENAME -" Note the space and the dash at the end. Regards Daniel -- http://www.danielnaber.de _______________________________________________ perlfect-search mailing list perlfect-search@perlfect.com To unsubscribe, set other personal options or view the list archives please visit: http://perlfect.com/mailman/listinfo/perlfect-search  From perlfect-search@perlfect.com Thu Mar 11 15:55:57 2004 From: perlfect-search@perlfect.com (Carl Johnson) Date: Thu, 11 Mar 2004 10:55:57 -0500 Subject: [Perlfect-search] Manual install headaches Message-ID: <005001c40781$5934e1b0$3201010a@ad.bellcoglass.com> This is a multi-part message in MIME format. ------=_NextPart_000_004D_01C40757.6ECE4EA0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable I have read the FAQs, searched the archives and am stuck. My ISP does not offer SSH or Telnet, port 23 is blocked! Bastids! So = that leaves me with FTP install. Not to pick, but the README leaves out = some vital steps. Okay, my path is home/(user)/public_html/cgi-bin/search/, and I suspect = that the fact that I did not use cgi-bin/Perfect/search/ as my path may = be causing me some headaches. In the search directory, I had to manaually create these sub = directories: conf, data, temp, templates. I chmod to 755 each directory. = This step is not clear in the README, I figured it out by dissecting = the setup.pl file. After creating these sub directories, indexing appears to have worked = without a hitch. When I try to run search.pl, I get error 500. All = permissions are correct. I uploaded in ASCII. I opened every Perl file = and converted Dos>Unix (remove carriage returns) before uploading. For troubleshooting, I uncommented this line in search.pl: use CGI::Carp qw(fatalsToBrowser); I get this error: Software error: Can't locate object method "new" via package "Perlfect::Template" = (perhaps you forgot to load "Perlfect::Template"?) at search.pl line = 387. In the setup.pl file I find these lines install_dir($instdir."Perlfect", "0755"); install_file("Perlfect/Template.pm", $instdir."Perlfect/Template.pm", = "0644"); Okay, so I create Perfect directory in search, chmod 755. I copy = Template.pm to that directory, chmod to 644, and then to 755. Still get = error. search.pl has these lines: require Perlfect::Template; my $template =3D new Perlfect::Template($file); If I comment them out, get other errors. Am I missing something here? = I am sure it is obvious, but after playing with this for 2 days, I am = going buggy! ------=_NextPart_000_004D_01C40757.6ECE4EA0 Content-Type: text/html; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable
I have read the FAQs, searched the = archives and am=20 stuck.
 
My ISP does not offer SSH or Telnet, = port 23 is=20 blocked!  Bastids!  So that leaves me with FTP install.  = Not to=20 pick, but the README leaves out some vital steps.
 
Okay, my path is=20 home/(user)/public_html/cgi-bin/search/, and I suspect that the fact = that I did=20 not use cgi-bin/Perfect/search/ as my path may be causing me some=20 headaches.
 
In the search directory, I had to = manaually create=20 these sub directories: conf, data, temp, templates. I chmod to 755 each=20 directory.  This step is not clear in the README, I figured it = out by=20 dissecting the setup.pl file.
 
After creating these sub directories, = indexing=20 appears to have worked without a hitch.  When I try to run = search.pl,=20 I get error 500.  All permissions are correct.  I uploaded in = ASCII. I=20 opened every Perl file and converted Dos>Unix (remove carriage = returns)=20 before uploading.
 
For troubleshooting, I uncommented this = line in=20 search.pl:

use CGI::Carp=20 qw(fatalsToBrowser);

I get this error:
Software error:
Can't locate=20 object method "new" via package "Perlfect::Template" (perhaps you forgot = to load=20 "Perlfect::Template"?) at search.pl line 387.

In the setup.pl file I find these = lines
install_dir($instdir."Perlfect",=20 "0755");
install_file("Perlfect/Template.pm", = $instdir."Perlfect/Template.pm",=20 "0644");
 
Okay, so I create = Perfect=20 directory in search, chmod 755. I copy Template.pm to that directory, = chmod to=20 644, and then to 755.  Still get error.
 
search.pl has = these=20 lines:
require=20 Perlfect::Template;
 

my = $template =3D new=20 Perlfect::Template($file);

If I comment them out, get other = errors.  Am I missing something here?  I am sure it is = obvious, but=20 after playing with this for 2 days, I am going=20 buggy!


------=_NextPart_000_004D_01C40757.6ECE4EA0-- From perlfect-search@perlfect.com Thu Mar 11 19:23:17 2004 From: perlfect-search@perlfect.com (Daniel Naber) Date: Thu, 11 Mar 2004 20:23:17 +0100 Subject: [Perlfect-search] Manual install headaches In-Reply-To: <005001c40781$5934e1b0$3201010a@ad.bellcoglass.com> References: <005001c40781$5934e1b0$3201010a@ad.bellcoglass.com> Message-ID: <200403112023.17720@danielnaber.de> On Thursday 11 March 2004 16:55, Carl Johnson wrote: > Okay, so I create Perfect directory in search, chmod 755. I copy > Template.pm to that directory, chmod to 644, and then to 755.  Still get > error. Carl, did you make sure that it's spelled Perlfect (capital P, l in the middle)? Also, the file inside must be Template.pm (capital T). The README is indeed incomplete when it comes to manual setup, that's because there are so many different potential problems that depend on your server's configuration... but most problems and their solutions are described in the FAQ. Regards Daniel -- http://www.danielnaber.de From perlfect-search@perlfect.com Thu Mar 11 21:41:20 2004 From: perlfect-search@perlfect.com (Carl Johnson) Date: Thu, 11 Mar 2004 16:41:20 -0500 Subject: [Perlfect-search] Manual install headaches References: <005001c40781$5934e1b0$3201010a@ad.bellcoglass.com> <200403112023.17720@danielnaber.de> Message-ID: <000501c407b1$993e9dc0$3201010a@ad.bellcoglass.com> Daniel Thanks for your reply! I finally got it, it was a typo on the Perlfect/ directory! Fixed that, chmod to 755 and bingo! From perlfect-search@perlfect.com Fri Mar 12 09:10:16 2004 From: perlfect-search@perlfect.com (Yves Hanotiau) Date: Fri, 12 Mar 2004 10:10:16 +0100 Subject: [Perlfect-search] indexing of PDF metadata Message-ID: <3A990DB4C59FD411899B00D0B76961B601D124C5@mailoffice.senate.be> > -----Original Message----- > From: Daniel Naber [mailto:daniel.naber@t-online.de] > Sent: woensdag 10 maart 2004 19:47 > On Wednesday 10 March 2004 10:54, Yves Hanotiau wrote: > > > Are the metadata of a PDF file indexed by Perlfect ? > > By metadata, I think to : > > Title - Subject - Author - Keywords > > Go to conf.pl and add -htmlmeta as an option to pdftotext. > Then the meta > data will be indexed (not sure if all the fields you mentioned are > supported, and maybe you need a recent version of pdftotext). Hi, It works fine ! Thanks you Fields indexed are : Title, Author and Keywords. The field "Subject" was not indexed. Regards Yves From perlfect-search@perlfect.com Tue Mar 16 14:32:02 2004 From: perlfect-search@perlfect.com (Robin Wenham) Date: Tue, 16 Mar 2004 14:32:02 GMT Subject: [Perlfect-search] Using the "Highlight matches" option Message-ID: <200431614322.581952@StudyXPPro> I am having problems using this option successfully. Can anyone help please? When I choose (highlight matches) in the search results, the page that appears has lost all the links to its content (CSS sheets, images, links). For example, an image that has the properties... http://www.mysite.com/image.jpg becomes... http:///image.jpg. When I simply choose the link to the page without highlights then it loads OK. Clearly I am missing something fundamental because I can find no other posts with this problem. -- Robin Wenham 07951 601 604 / 01928 739 750 From perlfect-search@perlfect.com Tue Mar 16 18:19:41 2004 From: perlfect-search@perlfect.com (Tim) Date: Tue, 16 Mar 2004 18:19:41 -0000 Subject: [Perlfect-search] Using the "Highlight matches" option References: <200431614322.581952@StudyXPPro> Message-ID: <001401c40b83$d6d6f710$ed1d9ed9@carrera6ital1d> This is a multi-part message in MIME format. ------=_NextPart_000_000F_01C40B83.3FC2AA40 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable Hello Robin, Use absolute URL's for all links including css and images. Tim. ----- Original Message -----=20 From: Robin Wenham=20 To: perlfect-search@perlfect.com=20 Sent: Tuesday, March 16, 2004 2:32 PM Subject: [Perlfect-search] Using the "Highlight matches" option I am having problems using this option successfully. Can anyone help=20 please? When I choose (highlight matches) in the search results, the page that=20 appears has lost all the links to its content (CSS sheets, images,=20 links). For example, an image that has the properties... http://www.mysite.com/image.jpg becomes... http:///image.jpg. When I simply choose the link to the page without highlights then it=20 loads OK. Clearly I am missing something fundamental because I can find no other=20 posts with this problem. --=20 Robin Wenham 07951 601 604 / 01928 739 750 _______________________________________________ perlfect-search mailing list perlfect-search@perlfect.com To unsubscribe, set other personal options or view the list archives = please visit: http://perlfect.com/mailman/listinfo/perlfect-search =1A ------=_NextPart_000_000F_01C40B83.3FC2AA40 Content-Type: text/html; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable
Hello Robin,
 
Use absolute URL's for all links = including css=20 and images.
 
Tim.
 
----- Original Message -----=20
From: Robin=20 Wenham
To: perlfect-search@perlfect.com= =20
Sent: Tuesday, March 16, 2004 2:32 PM
Subject: [Perlfect-search] Using the "Highlight matches"=20 option

I am having problems using this option = successfully.  Can=20 anyone help
please?

When I choose (highlight matches) in the = search=20 results, the page that
appears has lost all the links to its content = (CSS=20 sheets, images,
links).  For example, an image that has the=20 properties...

http://www.mysite.com/image.jpg<= /A>

becomes...

http:///image.jpg.

When I simply = choose the=20 link to the page without highlights then it
loads OK.

Clearly = I am=20 missing something fundamental because I can find no other
posts with = this=20 problem.

--
Robin Wenham
07951 601 604 / 01928 739=20 750



_______________________________________________
per= lfect-search=20 mailing list
perlfect-search@perlfect.com=
To=20 unsubscribe, set other personal options or view the list archives please = visit:
http://perl= fect.com/mailman/listinfo/perlfect-search
=1A

------=_NextPart_000_000F_01C40B83.3FC2AA40-- From perlfect-search@perlfect.com Tue Mar 16 18:50:44 2004 From: perlfect-search@perlfect.com (Daniel Naber) Date: Tue, 16 Mar 2004 19:50:44 +0100 Subject: [Perlfect-search] Using the "Highlight matches" option In-Reply-To: <200431614322.581952@StudyXPPro> References: <200431614322.581952@StudyXPPro> Message-ID: <200403161950.44993@danielnaber.de> On Tuesday 16 March 2004 15:32, Robin Wenham wrote: > When I choose (highlight matches) in the search results, the page that > appears has lost all the links to its content (CSS sheets, images, > links).  For example, an image that has the properties... Please try if the snapshot version solves that problem: http://www.danielnaber.de/perlfectsearch/cvs.php Regards Daniel -- http://www.danielnaber.de From perlfect-search@perlfect.com Tue Mar 23 21:16:51 2004 From: perlfect-search@perlfect.com (Darrell Ring) Date: Tue, 23 Mar 2004 15:16:51 -0600 Subject: [Perlfect-search] Searching just one page adn highlight results Message-ID: <265d01c4111c$29c3a2a0$6400a8c0@mydestinyxp> This is a multi-part message in MIME format. ------=_NextPart_000_265A_01C410E9.DE671B20 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable I cannot seem to be able to search just one page (exclude everything = else). Here is what I have for the form call:
=20 What I need to do is have the Perlfect search the page and return the = same page with highlighted results. Please help and thank you in advance. Darrell ------=_NextPart_000_265A_01C410E9.DE671B20 Content-Type: text/html; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable
I cannot seem to be able to search just = one page=20 (exclude everything else).
 
Here is what I have for the form = call:
 
<form method=3D"get"=20 action=3D"/cgibin/perlfect/search/search.pl">
    = <input=20 type=3D"hidden" name=3D"p" value=3D"1">
    = <input=20 type=3D"hidden" name=3D"lang" value=3D"en">
    = <input=20 type=3D"hidden" name=3D"include" = value=3D"mypage.html">
   =20 <input type=3D"hidden" name=3D"exclude" = value=3D"">
   =20 <input type=3D"hidden" name=3D"penalty" = value=3D"0">
   =20 <select name=3D"mode">
   <option = value=3D"all">Match ALL=20 words</option>
   <option value=3D"any">Match = ANY=20 word</option>
    = </select>
   =20 <input type=3D"text" name=3D"q"><input type=3D"submit"=20 value=3D"Search">
  = </form></div>
   =20
What I need to do is have the Perlfect = search the=20 page and return the same page with highlighted results.
 
Please help and thank you in = advance.
Darrell
------=_NextPart_000_265A_01C410E9.DE671B20-- From perlfect-search@perlfect.com Tue Mar 23 21:40:28 2004 From: perlfect-search@perlfect.com (Daniel Naber) Date: Tue, 23 Mar 2004 22:40:28 +0100 Subject: [Perlfect-search] Searching just one page adn highlight results In-Reply-To: <265d01c4111c$29c3a2a0$6400a8c0@mydestinyxp> References: <265d01c4111c$29c3a2a0$6400a8c0@mydestinyxp> Message-ID: <200403232240.28524@danielnaber.de> On Tuesday 23 March 2004 22:16, Darrell Ring wrote: >     This looks correct, unless you have more than one page called "mypage.html". In that case, try adding the path, like "/subdir/ mypage.html". Regards Daniel -- http://www.danielnaber.de From perlfect-search@perlfect.com Wed Mar 24 16:38:43 2004 From: perlfect-search@perlfect.com (Darrell Ring) Date: Wed, 24 Mar 2004 10:38:43 -0600 Subject: [Perlfect-search] Using the "Highlight matches" option Message-ID: <000a01c411be$795d69c0$6400a8c0@mydestinyxp> This is a multi-part message in MIME format. ------=_NextPart_000_0007_01C4118C.2DBA8A70 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable After closer examining (checked through the logfile of SSH Secure = Shell), I fount the file extention wasn't being indexed. The form now correctly searched the single page. Is there something I can put in the form call to have is display the = highlighted results right away ?
Thank you Daniel. Fantastic Script! ------=_NextPart_000_0007_01C4118C.2DBA8A70 Content-Type: text/html; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable
After closer examining (checked through = the logfile=20 of SSH Secure Shell), I fount the file extention
wasn't being indexed.
The form now correctly searched the = single=20 page.
 
Is there something I can put in the = form call to=20 have is display the highlighted results right away ?
 
 
<form method=3D3D"get"=20 action=3D3D"/cgibin/perlfect/search/search.pl">
    = <input=20 type=3D3D"hidden" name=3D3D"p" value=3D3D"1">
    = <input=20 type=3D3D"hidden" name=3D3D"lang" = value=3D3D"en">
    <input=20 type=3D3D"hidden" name=3D3D"include" = value=3D3D"mypage.html">
   =20 <input type=3D3D"hidden" name=3D3D"exclude" = value=3D3D"">
   =20 <input type=3D3D"hidden" name=3D3D"penalty" = value=3D3D"0">
   =20 <select name=3D3D"mode">
   <option = value=3D3D"all">Match=20 ALL words</option>
   <option = value=3D3D"any">Match ANY=20 word</option>
    = </select>
   =20 <input type=3D3D"text" name=3D3D"q"><input type=3D3D"submit"=20 =3D
value=3D3D"Search">
  = </form></div>
 
Thank you = Daniel.
Fantastic = Script!
 
 
------=_NextPart_000_0007_01C4118C.2DBA8A70-- From perlfect-search@perlfect.com Wed Mar 24 19:42:08 2004 From: perlfect-search@perlfect.com (Daniel Naber) Date: Wed, 24 Mar 2004 20:42:08 +0100 Subject: [Perlfect-search] Using the "Highlight matches" option In-Reply-To: <000a01c411be$795d69c0$6400a8c0@mydestinyxp> References: <000a01c411be$795d69c0$6400a8c0@mydestinyxp> Message-ID: <200403242042.08007@danielnaber.de> On Wednesday 24 March 2004 17:38, Darrell Ring wrote: > Is there something I can put in the form call to have is display the > highlighted results right away ? No, not without changing the source... Regards Daniel -- http://www.danielnaber.de From perlfect-search@perlfect.com Thu Mar 25 10:58:28 2004 From: perlfect-search@perlfect.com (Robin Wenham) Date: Thu, 25 Mar 2004 10:58:28 GMT Subject: [Perlfect-search] Re: Using the "Highlight matches" option Message-ID: <2004325105828.536016@StudyXPPro> Thanks Daniel - I downloaded v1.97 (I was using v1.95) of search.pl and everything works. PerlfectSearch is a classy piece of work! The Perlfect::Template also works a treat! >> When I choose (highlight matches) in the search results, the page >> that appears has lost all the links to its content (CSS sheets, >> images, links). For example, an image that has the properties... > Please try if the snapshot version solves that problem: > http://www.danielnaber.de/perlfectsearch/cvs.php > Regards > Daniel -- Robin Wenham