Perlfect Solutions
 

[Perlfect-search] Indexing using a broswer: response code 500

Steve Lawrence perlfect-search@perlfect.com
Wed, 17 Sep 2003 08:26:42 -0600
Try indexing via the file system or us http://www.selmic.com/index.html
as your start url. 

-----Original Message-----
From: perlfect-search-admin@perlfect.com
[mailto:perlfect-search-admin@perlfect.com] On Behalf Of Niina
Ter�slahti
Sent: Wednesday, September 17, 2003 6:55 AM
To: perlfect-search@perlfect.com
Subject: [Perlfect-search] Indexing using a broswer: response code 500


Hello,

I'm trying to get Perflect Search 3.31b to work, and everything is going
fine until I'm trying to run indexer.pl in a browser. I do not have
access
to the server via telnet, so browser indexing is my only option.
The indexer.pl starts to run properly, but then it gets stuck to an
error:
Couldn't get 'http://www.domain.com': response code 500.
So it indexes 0 files and I'm back to where I started. I've been trying
to
change the HTTP_START_URL in the configuration file, but without
success.
Right now it looks like this: $HTTP_START_URL = 'http://www.selmic.com';
and
it doesn't work.

Here's what happens while running indexer.pl:

Using DB_File...
Checking for old temp files...
Building string of special characters...
Loading 'no index' regular expressions:
    - /home/example/example.com/html/secret_directory/*
    - */cgi-bin/*
    - */stats/*
Loading stopwords...371 stopwords loaded.
Starting crawler...
Note: I will not visit more than $HTTP_MAX_PAGES=100 pages.
Error: Couldn't get 'http://www.selmic.com': response code 500

Crawler finished: indexed 0 files, 0 terms (0 different terms).
Ignored 0 files because of conf/no_index.txt
Ignored 0 files because of robots.txt

Calculating weight vectors:
0%  10%  20%  30%  40%  50%  60%  70%  80%  90%  100%
|----|----|----|----|----|----|----|----|----|----|
>
Removing unused db files:
    /cgi-bin/data/tf...ok
    /cgi-bin/data/df...ok
Renaming newly created db files...
    /cgi-bin/data/terms_tmp to /cgi-bin/data/terms
    /cgi-bin/data/docs_tmp to /cgi-bin/data/docs
    /cgi-bin/data/urls_tmp to /cgi-bin/data/urls
    /cgi-bin/data/sizes_tmp to /cgi-bin/data/sizes
    /cgi-bin/data/titles_tmp to /cgi-bin/data/titles
    /cgi-bin/data/dates_tmp to /cgi-bin/data/dates
    /cgi-bin/data/content_tmp to /cgi-bin/data/content
    /cgi-bin/data/desc_tmp to /cgi-bin/data/desc
    /cgi-bin/data/inv_index_tmp to /cgi-bin/data/inv_index
Indexer finished.

Does anyone have any suggestions of what I should do? I'm having a
pretty
tight schedule, so a quick reply would be highly appreciated! :)

Niina

_______________________________________________
perlfect-search mailing list
perlfect-search@perlfect.com
To unsubscribe, set other personal options or view the list archives
please visit: