|
|
[Perlfect-search] Indexing using a broswer: response code 500
Steve Lawrence perlfect-search@perlfect.com
Wed, 17 Sep 2003 08:26:42 -0600
Try indexing via the file system or us http://www.selmic.com/index.html
as your start url.
-----Original Message-----
From: perlfect-search-admin@perlfect.com
[mailto:perlfect-search-admin@perlfect.com] On Behalf Of Niina
Ter�slahti
Sent: Wednesday, September 17, 2003 6:55 AM
To: perlfect-search@perlfect.com
Subject: [Perlfect-search] Indexing using a broswer: response code 500
Hello,
I'm trying to get Perflect Search 3.31b to work, and everything is going
fine until I'm trying to run indexer.pl in a browser. I do not have
access
to the server via telnet, so browser indexing is my only option.
The indexer.pl starts to run properly, but then it gets stuck to an
error:
Couldn't get 'http://www.domain.com': response code 500.
So it indexes 0 files and I'm back to where I started. I've been trying
to
change the HTTP_START_URL in the configuration file, but without
success.
Right now it looks like this: $HTTP_START_URL = 'http://www.selmic.com';
and
it doesn't work.
Here's what happens while running indexer.pl:
Using DB_File...
Checking for old temp files...
Building string of special characters...
Loading 'no index' regular expressions:
- /home/example/example.com/html/secret_directory/*
- */cgi-bin/*
- */stats/*
Loading stopwords...371 stopwords loaded.
Starting crawler...
Note: I will not visit more than $HTTP_MAX_PAGES=100 pages.
Error: Couldn't get 'http://www.selmic.com': response code 500
Crawler finished: indexed 0 files, 0 terms (0 different terms).
Ignored 0 files because of conf/no_index.txt
Ignored 0 files because of robots.txt
Calculating weight vectors:
0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100%
|----|----|----|----|----|----|----|----|----|----|
>
Removing unused db files:
/cgi-bin/data/tf...ok
/cgi-bin/data/df...ok
Renaming newly created db files...
/cgi-bin/data/terms_tmp to /cgi-bin/data/terms
/cgi-bin/data/docs_tmp to /cgi-bin/data/docs
/cgi-bin/data/urls_tmp to /cgi-bin/data/urls
/cgi-bin/data/sizes_tmp to /cgi-bin/data/sizes
/cgi-bin/data/titles_tmp to /cgi-bin/data/titles
/cgi-bin/data/dates_tmp to /cgi-bin/data/dates
/cgi-bin/data/content_tmp to /cgi-bin/data/content
/cgi-bin/data/desc_tmp to /cgi-bin/data/desc
/cgi-bin/data/inv_index_tmp to /cgi-bin/data/inv_index
Indexer finished.
Does anyone have any suggestions of what I should do? I'm having a
pretty
tight schedule, so a quick reply would be highly appreciated! :)
Niina
_______________________________________________
perlfect-search mailing list
perlfect-search@perlfect.com
To unsubscribe, set other personal options or view the list archives
please visit:
|
|