From perlfect-search@perlfect.com Mon Sep 1 19:29:38 2003
From: perlfect-search@perlfect.com (perlfect-search@perlfect.com)
Date: Mon, 1 Sep 2003 21:29:38 +0200
Subject: [Perlfect-search] to index local pages without spidering them on the "real" server
Message-ID: <000f01c370bf$62a43920$6b02a8c0@athlon1900>

Hello!

My situation:
1. I basically work with mirrored HTML content on a "production harddisk".
2. Once a day this content is copied automatically to the intranet server.
3. Now I don't want to spider the real intranet server but just my local
   harddisk to save resources.

But in conf.pl I have to fill in $BASE_URL. Afterwards I run indexer.pl, and
the script fetches from the HTTP server, which gives 404/403 errors etc.

I want to avoid this by just indexing my files locally and then uploading
all new files to the intranet server.

How can I manage this? How should I replace $BASE_URL?

Thanks in advance!
Thomas Jevenke

From perlfect-search@perlfect.com Mon Sep 1 21:00:45 2003
From: perlfect-search@perlfect.com (Daniel Naber)
Date: Mon, 1 Sep 2003 23:00:45 +0200
Subject: [Perlfect-search] to index local pages without spidering them on the "real" server
In-Reply-To: <000f01c370bf$62a43920$6b02a8c0@athlon1900>
References: <000f01c370bf$62a43920$6b02a8c0@athlon1900>
Message-ID: <200309012300.45329@danielnaber.de>

On Monday 01 September 2003 21:29, jevenke@gmx.de wrote:

> 3. Now I don't want to spider the real intranet server but just my local
> harddisk to save resources.

This will only work if your local machine has the same hardware and software
as the server. E.g. you cannot index on Windows and upload to Unix. You
should set up your local machine so that it uses the same path names as the
server.
If you want to index the files directly from disk you also need to set
$HTTP_START_URL = '';

Regards
 Daniel

-- 
http://www.danielnaber.de

From perlfect-search@perlfect.com Fri Sep 5 20:42:42 2003
From: perlfect-search@perlfect.com (Robert Boden)
Date: Fri, 5 Sep 2003 16:42:42 -0400
Subject: [Perlfect-search] Berkeley DB
Message-ID: <006001c373ee$40ba2ea0$272eae0a@PCSE792>

The documentation for DB_File states that you must have the following
installed on your system:
* Perl 5.004_05 or greater.
* Berkeley DB.

Does Perlfect Search handle the installation of Berkeley DB or does this
have to be installed separately?

Thank you,

Bob Boden

From perlfect-search@perlfect.com Fri Sep 5 20:47:28 2003
From: perlfect-search@perlfect.com (Steve Lawrence)
Date: Fri, 5 Sep 2003 14:47:28 -0600
Subject: [Perlfect-search] Berkeley DB
In-Reply-To: <006001c373ee$40ba2ea0$272eae0a@PCSE792>
Message-ID: <004c01c373ee$ebc34700$0501a8c0@DAD>

The docs say that you should try setting up and indexing your site. If it
works, great; if not, start installing what you need.

_______________________________________________
perlfect-search mailing list
perlfect-search@perlfect.com
To unsubscribe, set other personal options or view the list archives please visit:
http://perlfect.com/mailman/listinfo/perlfect-search

From perlfect-search@perlfect.com Fri Sep 5 20:51:07 2003
From: perlfect-search@perlfect.com (gape)
Date: Fri, 5 Sep 2003 22:51:07 +0200
Subject: [Perlfect-search] Berkeley DB
References: <006001c373ee$40ba2ea0$272eae0a@PCSE792>
Message-ID: <009801c373ef$6de359a0$0200a8c0@gapelt>

As I remember, you have to install it separately.
You'll need root permissions, as I remember.

With Love & Light
gape
www.gape.org

From perlfect-search@perlfect.com Sat Sep 6 08:22:53 2003
From: perlfect-search@perlfect.com (Marcus Landscheidt)
Date: Sat, 6 Sep 2003 10:22:53 +0200
Subject: [Perlfect-search] index local with http Proxy server
Message-ID: <1164479972.20030906102253@vkl.de>

Hello List :-)

I wish to index on a local system in an intranet using an HTTP proxy.
What do I need to change in the configuration or in indexer_web.pl?

Indexing on a Windows machine and copying the data file to a
Unix system works great.
-- 
Best regards,
 Marcus

Hello list,

because of my poor foreign-language skills, here is the whole thing in
German as well :-)

I would like to index an external project from within an intranet via an
HTTP proxy. What, if anything, has to be changed in the conf or perhaps in
the indexer_web file?

Indexing on a local Windows PC and later copying to a Unix server works
great.

Kind regards,
Marcus

From perlfect-search@perlfect.com Sun Sep 7 00:21:44 2003
From: perlfect-search@perlfect.com (Daniel Naber)
Date: Sun, 7 Sep 2003 02:21:44 +0200
Subject: [Perlfect-search] index local with http Proxy server
In-Reply-To: <1164479972.20030906102253@vkl.de>
References: <1164479972.20030906102253@vkl.de>
Message-ID: <200309070221.44247@danielnaber.de>

On Saturday 06 September 2003 10:22, Marcus Landscheidt wrote:

> I wish to index on a local system in an intranet using an HTTP proxy.
> What do I need to change in the configuration or in indexer_web.pl?

I never tested that, but it seems you need to call
$http_user_agent->env_proxy() and set HTTP_PROXY or CGI_HTTP_PROXY (see
"perldoc LWP::UserAgent" for more information).

> Indexing on a Windows machine and copying the data file to a
> Unix system works great.

Then you're lucky, it doesn't work for other people.

Regards
 Daniel

-- 
http://www.danielnaber.de

From perlfect-search@perlfect.com Fri Sep 12 17:06:08 2003
From: perlfect-search@perlfect.com (Linda Antonsson)
Date: Fri, 12 Sep 2003 19:06:08 +0200
Subject: [Perlfect-search] Searches are no longer logged.
Message-ID: <3F61FD00.60001@mbox321.tele2.se>

Hi,

This is possibly not a PerlFect problem, but to troubleshoot effectively
what the problem really is, I need to eliminate the possibility that the
error lies with PerlFect.

The situation is this:

Last week, my host suffered disk failure. The data was rescued, tarballed
and moved to another drive.
However, after a few days I noticed that although my search engine works
fine from the user end, the searches are no longer being logged. Prior to
the crash, this worked perfectly.

So, what could have caused this problem? I really think it may very well be
a server error, but I could use some help pinpointing what it might be, to
help my host troubleshoot.

I've checked the CHMOD settings and they seem to match what is indicated in
the README. That is, 755 on most everything except indexer.pl, which is 700.
The logfile itself was, however, set at 644 (I think it has been at 644 all
the time, but I could be wrong), and setting it to 755 didn't solve the
problem. The actual PerlFect directory is at 755 too, and the directory
above it (Scripts) is at 751.

Could permissions further up the chain matter as well? If so, this is the
setup:

/data (755)/users (755)/ (751)/public_html (751)/Scripts (751)/PerlFect (755)/

Any thoughts on what's up?

Linda

From perlfect-search@perlfect.com Fri Sep 12 18:40:48 2003
From: perlfect-search@perlfect.com (Daniel Naber)
Date: Fri, 12 Sep 2003 20:40:48 +0200
Subject: [Perlfect-search] Searches are no longer logged.
In-Reply-To: <3F61FD00.60001@mbox321.tele2.se>
References: <3F61FD00.60001@mbox321.tele2.se>
Message-ID: <200309122040.48480@danielnaber.de>

On Friday 12 September 2003 19:06, Linda Antonsson wrote:

> So, what could have caused this problem?

Does it help if you give permissions 666 to the log file (i.e. read and
write for everyone)?

Regards
 Daniel

-- 
http://www.danielnaber.de

From perlfect-search@perlfect.com Fri Sep 12 19:32:46 2003
From: perlfect-search@perlfect.com (Linda Antonsson)
Date: Fri, 12 Sep 2003 21:32:46 +0200
Subject: [Perlfect-search] Searches are no longer logged.
In-Reply-To: <200309122040.48480@danielnaber.de>
References: <3F61FD00.60001@mbox321.tele2.se> <200309122040.48480@danielnaber.de>
Message-ID: <3F621F5E.6070009@mbox321.tele2.se>

Daniel Naber wrote:
> On Friday 12 September 2003 19:06, Linda Antonsson wrote:
>
>>So, what could have caused this problem?
>
> Does it help if you give permissions 666 to the log file (i.e. read and
> write for everyone)?
>
> Regards
> Daniel

That does indeed seem to do the trick. Any thoughts on why that would be
necessary, though? If I can figure out why it all of a sudden needs 666
instead of whatever is the norm (is it 644 or 755?), I might be able to fix
my other ailing scripts too.

Linda

From perlfect-search@perlfect.com Fri Sep 12 19:46:01 2003
From: perlfect-search@perlfect.com (Daniel Naber)
Date: Fri, 12 Sep 2003 21:46:01 +0200
Subject: [Perlfect-search] Searches are no longer logged.
In-Reply-To: <3F621F5E.6070009@mbox321.tele2.se>
References: <3F61FD00.60001@mbox321.tele2.se> <200309122040.48480@danielnaber.de> <3F621F5E.6070009@mbox321.tele2.se>
Message-ID: <200309122146.01295@danielnaber.de>

On Friday 12 September 2003 21:32, Linda Antonsson wrote:

> That does indeed seem to do the trick. Any thoughts on why that would be
> necessary, though?

Because the log file is owned by your user, but the web server runs under a
different user id, so it cannot write to the file. Some servers are
configured so that they run under the id of the user; those servers won't
need write access for everyone.

Regards
 Daniel

-- 
http://www.danielnaber.de

From perlfect-search@perlfect.com Fri Sep 12 20:13:50 2003
From: perlfect-search@perlfect.com (Krishnakishore R Challa)
Date: Fri, 12 Sep 2003 15:13:50 -0500
Subject: [Perlfect-search] Inappropriate search results for multiple keywords...
Message-ID:

Hi,

Perlfect search is one of the fastest and most reliable searches. I really
enjoy it in my projects.

But I found that it does act a little weird when doing multiple keyword
searches.

For example, I searched for - latest version - in perlfect.com and I got the
following results. Notice that the same lines were repeated, even though the
term 'version' was next to latest.

Is there any way to get the latest and version both highlighted in the same
line?

1. Perlfect Search FAQ (Frequently Asked Questions) (highlight matches)
....many pages can it handle? The latest version of Perlfect Search (v3.30) is...
....development, but not yet in the latest release: Have a look at the...
....pages can it handle? The latest version of Perlfect Search (v3.30) is capable...
....sites. In general, the performance of version 3.30 is more than enough for...

Thanks!
Kishore.

From perlfect-search@perlfect.com Wed Sep 17 12:55:26 2003
From: perlfect-search@perlfect.com (Niina Teräslahti)
Date: Wed, 17 Sep 2003 15:55:26 +0300
Subject: [Perlfect-search] Indexing using a broswer: response code 500
Message-ID:

Hello,

I'm trying to get Perlfect Search 3.31b to work, and everything goes fine
until I try to run indexer.pl in a browser. I do not have access to the
server via telnet, so browser indexing is my only option.

The indexer.pl starts to run properly, but then it gets stuck on an error:
Couldn't get 'http://www.domain.com': response code 500. So it indexes 0
files and I'm back to where I started.

I've been trying to change the HTTP_START_URL in the configuration file, but
without success. Right now it looks like this:

$HTTP_START_URL = 'http://www.selmic.com';

and it doesn't work.

Here's what happens while running indexer.pl:

Using DB_File...
Checking for old temp files...
Building string of special characters...
Loading 'no index' regular expressions:
- /home/example/example.com/html/secret_directory/*
- */cgi-bin/*
- */stats/*
Loading stopwords...371 stopwords loaded.
Starting crawler...
Note: I will not visit more than $HTTP_MAX_PAGES=100 pages.
Error: Couldn't get 'http://www.selmic.com': response code 500
Crawler finished: indexed 0 files, 0 terms (0 different terms).
Ignored 0 files because of conf/no_index.txt
Ignored 0 files because of robots.txt
Calculating weight vectors:
0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100%
|----|----|----|----|----|----|----|----|----|----|
>
Removing unused db files:
/cgi-bin/data/tf...ok
/cgi-bin/data/df...ok
Renaming newly created db files...
/cgi-bin/data/terms_tmp to /cgi-bin/data/terms
/cgi-bin/data/docs_tmp to /cgi-bin/data/docs
/cgi-bin/data/urls_tmp to /cgi-bin/data/urls
/cgi-bin/data/sizes_tmp to /cgi-bin/data/sizes
/cgi-bin/data/titles_tmp to /cgi-bin/data/titles
/cgi-bin/data/dates_tmp to /cgi-bin/data/dates
/cgi-bin/data/content_tmp to /cgi-bin/data/content
/cgi-bin/data/desc_tmp to /cgi-bin/data/desc
/cgi-bin/data/inv_index_tmp to /cgi-bin/data/inv_index
Indexer finished.

Does anyone have any suggestions as to what I should do? I'm on a pretty
tight schedule, so a quick reply would be highly appreciated! :)

Niina

From perlfect-search@perlfect.com Wed Sep 17 14:26:42 2003
From: perlfect-search@perlfect.com (Steve Lawrence)
Date: Wed, 17 Sep 2003 08:26:42 -0600
Subject: [Perlfect-search] Indexing using a broswer: response code 500
In-Reply-To:
Message-ID: <005201c37d27$b6f2fc40$6401a8c0@DAD>

Try indexing via the file system or use http://www.selmic.com/index.html as
your start url.

From perlfect-search@perlfect.com Thu Sep 18 01:37:47 2003
From: perlfect-search@perlfect.com (pod)
Date: Thu, 18 Sep 2003 03:37:47 +0200
Subject: [Perlfect-search] Errr indexin: << HERE $/ at tools.pl line 87.
Message-ID: <3F690C6B.6000500@fisica.urbenalia.com>

Hi,

Running the indexer I get the following error (at the end), and then the
indexer stops.
I changed the link to get a shorter URI, actually only
http://fisica.urbenalia.com/biblio/portada.php?imagen=otros/integra.jpg,
so this URI is not linked anywhere, but I am still getting the same error.

For the moment I have ruled it out by adding *portada.php* to no_index.txt,
but I would prefer to know what happened.

The last lines were:

Fetched 'http://fisica.urbenalia.com/biblio/portada.php?imagen=otros/integra.jpg&titulo=Fernando%20Sierra%20Pleria%20y%20William%20Pando%20Morcillo%20(Tablas%20de%20Integrales.%20Vol.I:%20Integrales%20indefinidas)', 371 bytes
241: http://fisica.urbenalia.com/biblio/portada.php?imagen=otros/integra.jpg&titulo=Fernando%20Sierra%20Pleria%20y%20William%20Pando%20Morcillo%20(Tablas%20de%20Integrales.%20Vol.I:%20Integrales%20indefinidas) (0.36 KB)
Unmatched ) before HERE mark in regex m/^I:%20Integrales%20indefinidas) << HERE $/ at tools.pl line 87.

Thank you for your time,
Jaume López Carballo

From perlfect-search@perlfect.com Thu Sep 18 04:13:54 2003
From: perlfect-search@perlfect.com (Ritesh Raj Sarraf)
Date: Thu, 18 Sep 2003 09:43:54 +0530
Subject: [Perlfect-search] question on indexing
Message-ID: <200309180944.00444.rrs_rhwlist@softhome.net>

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Hello all,
thanks for this great util.
The only problem I see in it is that on re-indexing it starts from the
beginning (it took me 4 days to index all the docs on my HDD).
Couldn't there be a way for it to use the old database and just update it
according to the changes in the docs to be indexed?

rrs

- -- 
Ritesh Raj Sarraf.
RESEARCHUT.
PH: 91-11-35207042
www.researchut.com
===================================================================
OH! A FORTUNE.
filibuster, n.:
Throwing your wait around.
- -------------------------------------------------------------------
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.1 (GNU/Linux)

iD8DBQE/aTEH4Rhi6gTxMLwRAmL2AJ97RPQphW2XGVYhz60Zzwnxd3NJogCfVd5+
sIs2r2UZceqSGr+r4ut+Ml4=
=uAyu
-----END PGP SIGNATURE-----

From perlfect-search@perlfect.com Thu Sep 18 18:12:15 2003
From: perlfect-search@perlfect.com (Daniel Naber)
Date: Thu, 18 Sep 2003 20:12:15 +0200
Subject: [Perlfect-search] Errr indexin: << HERE $/ at tools.pl line 87.
In-Reply-To: <3F690C6B.6000500@fisica.urbenalia.com>
References: <3F690C6B.6000500@fisica.urbenalia.com>
Message-ID: <200309182012.15737@danielnaber.de>

On Thursday 18 September 2003 03:37, pod wrote:

> Unmatched ) before HERE mark in regex m/^I:%20Integrales%20indefinidas)
> << HERE $/ at tools.pl line 87.

There's a get_suffix() function in tools.pl. Please try if it helps to
replace that function with this code:

sub get_suffix {
    my $filename = shift;
    # Drop the query string so characters such as ")" cannot end up in the suffix.
    if( $filename =~ m/\?/ ) {
        $filename =~ s/\?.*$//;
    }
    # Capture whatever follows the last dot of what remains.
    my ($suffix) = ($filename =~ m/\.([^.]*)$/);
    return $suffix;
}

Regards
 Daniel

-- 
http://www.danielnaber.de

From perlfect-search@perlfect.com Fri Sep 19 00:28:24 2003
From: perlfect-search@perlfect.com (pod)
Date: Fri, 19 Sep 2003 02:28:24 +0200
Subject: [Perlfect-search] Errr indexin: << HERE $/ at tools.pl line 87.
In-Reply-To: <200309182012.15737@danielnaber.de>
References: <3F690C6B.6000500@fisica.urbenalia.com> <200309182012.15737@danielnaber.de>
Message-ID: <3F6A4DA8.1080807@fisica.urbenalia.com>

Daniel Naber wrote:

> There's a get_suffix() function in tools.pl. Please try if it helps to
> replace that function with this code:

Yes, it worked pretty well, thank you.
Congratulations on this wonderful search engine.
Bye,
Jaume

From perlfect-search@perlfect.com Fri Sep 19 09:18:46 2003
From: perlfect-search@perlfect.com (Ingo Kley)
Date: Fri, 19 Sep 2003 11:18:46 +0200
Subject: [Perlfect-search] searching for B 4.1.7
Message-ID: <3F6AC9F6.41CED349@lampebank.de>

Hi,

we try to search for something like "B 4.1.7" which may be a section of
a book.
Perlfect finds nothing.

As far as I can see, the first problem is that PS, in this example, is
looking for +B +4 +1 +7. The dots are not treated as part of the phrase
(searching for ingo.kley has the same result as ingo kley).

Are there any hints on how to index and search for such phrases?

Kind regards
Ingo Kley

-- 
Bankhaus Lampe KG
Organisation & IT
Electronic Services
Alter Markt 3
33602 Bielefeld

Phone  +49 (0)521/582-1243
Fax    +49 (0)521/582-1714
E-mail ingo.kley@lampebank.de

E-mails sent over the Internet can easily be falsified and/or created under
someone else's name. Unless expressly agreed in writing, the content of this
message therefore does not create any legal obligation for Bankhaus
Lampe KG. E-mail is sent solely for the purpose of exchanging information.
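
The "+B +4 +1 +7" behaviour described above can be reproduced outside the
search engine. What follows is only a minimal sketch, assuming the tokenizer
replaces every run of characters outside a fixed set with a single space by
means of tr///. The helper name split_terms and its $keep_dots flag are
hypothetical and are not part of Perlfect Search.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Hypothetical helper (not taken from Perlfect Search): replaces every run of
# characters outside the allowed set with one space, then splits on spaces.
sub split_terms {
    my ($text, $keep_dots) = @_;
    if ($keep_dots) {
        # The dot is in the allowed set, so "4.1.7" survives as one term.
        $text =~ tr/a-zA-Z0-9._/ /cs;
    } else {
        # The dot is outside the allowed set and is turned into a separator.
        $text =~ tr/a-zA-Z0-9_/ /cs;
    }
    # Discard empty pieces left by leading separators.
    return grep { length } split / /, $text;
}

print join('|', split_terms('B 4.1.7', 0)), "\n";
print join('|', split_terms('B 4.1.7', 1)), "\n";
print join('|', split_terms('ingo.kley', 0)), "\n";
```

With the dot outside the allowed set, "B 4.1.7" and "ingo.kley" fall apart
into separate terms; with the dot allowed, "4.1.7" stays in one piece. A
one-character term such as "B" could still be dropped afterwards by a
minimum-length or stopword filter, if one is configured.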

From perlfect-search@perlfect.com Fri Sep 19 11:55:22 2003
From: perlfect-search@perlfect.com (Daniel Naber)
Date: Fri, 19 Sep 2003 13:55:22 +0200
Subject: [Perlfect-search] searching for B 4.1.7
In-Reply-To: <3F6AC9F6.41CED349@lampebank.de>
References: <3F6AC9F6.41CED349@lampebank.de>
Message-ID: <200309191355.22370@danielnaber.de>

On Friday 19 September 2003 11:18, Ingo Kley wrote:

> we try to search for something like "B 4.1.7" which may be a section of
> a book.
> Perlfect finds nothing.

You will need to change the normalize() function both in search.pl and
indexer.pl to find this. The easiest way is this:

-Set $MINLENGTH = 1; and $INDEX_NUMBERS = 1; in conf.pl
-Remove the one-character "words" from conf/stopwords.txt
-Replace
   tr/a-zA-Z0-9_/ /cs;
 by
   tr/a-zA-Z0-9\._/ /cs;
 (note that an escaped dot was added) in indexer.pl and search.pl

Regards
 Daniel

-- 
http://www.danielnaber.de

From perlfect-search@perlfect.com Mon Sep 29 13:41:24 2003
From: perlfect-search@perlfect.com (Robert van Hall)
Date: Mon, 29 Sep 2003 15:41:24 +0200
Subject: [Perlfect-search] Searching gives incomplete results
Message-ID:

Dear people,

I have a problem when I search: though a term is in the terms file, it
doesn't show up when I search. That is, some words appear and others do not.
Only if I duplicate a word in a document does it appear when I search.
Has anybody had a similar problem and could tell me how it was fixed? I
looked in the archives and found similar questions but couldn't find the
solution; nevertheless, I got the impression that it had been solved in some
cases.

Kind regards,

Robert van Hall