|
|
[Perlfect-search] configuring no_index and other assorted miseries (reply)
Nutsy Waterbelly dotwb2501@yahoo.com
Wed, 28 Mar 2001 09:41:42 -0800 (PST)
Hi Daniel,
Thanks for the quick response, but that really didn't answer my question.
Apparently, I'm having more difficulty than I originally thought. First, I
added the all (ok... like most) of the directories that I wanted no to be
indexed. (IMO I think it would be easier to specify7 the ones I DO want
indexed... there are FAR FEWER of them for me.) But, it still goes right ahead
and indexes (Sp??!?) them anyway. Here is a link to my no_index.txt file.
(http://www.angelfire.com/weird/nutsywb/work/no_index.txt)
Please note, I'm on NT and trying to specify the files for that system. I went
in and (unsuccessfully the first time. oopsie) edited the conf.pl (eek!) so
that it wouldn't do the \/ thing, as I thought that was the problem. I thought
this because the output from the indexer was specifying file names like this:
C:\dirhome\dirhome2\/dirIdon'twant\subdirIdon'twant\file.html
See my happy sys admin decided to install perl right in the root for the
intranet site. (??!? yeah... I know) Hey... it got the perl to work. so now
I'm indexing all the perl html help files.
Please help me!
Additionally (boy when it rains it pours), when I try to run the search I get a
403 error with
"http://semaghost/cgi-bin/perlfect/search?p=1&q=gastonia&mode=all" in the URL.
I realize I'm having permission difficulties, I just don't know what files to
set to what. Please help too!
Thanks for all your help!
Gina
--- Daniel Naber <daniel.naber@t-online.de> wrote:
> On Tuesday 27 March 2001 22:23, you wrote:
>
> > But I digress. I have installed the Perlfect Search on our NT hosting
> > system for use on my company's intranet system. But it seems to be
> > indexing subdirectories I don't want it to index. I specified the root
> > directories, but do I have to exaustively list EVERY SUB DIRECTORY I
> > don't want it to index? I mean... eek!
>
> Here's the new/extended FAQ item about this topic (Giorgos, can you please
> update the files on the server from CVS?)
>
> Regards
> Daniel
>
> How do I exclude directories/files from indexing?
>
> Inside the installation directory there is a directory called conf and
> inside it there's a file called no_index.txt containing the paths to all
> directories or files that are excluded from the indexing. Add the
> directories/files you want, and run the indexer again. You can use the *
> wildcard with the usual meaning it has in UNIX or DOS/WINDOWS, i.e. to
> exclude a complete directory use something like
> /usr/local/httpd/htdocs/private/*
>
> If you have several files with the same name that should be excluded no
> matter in which directory they are use something like
> /usr/local/httpd/htdocs/*private_file.html or
> /usr/local/httpd/htdocs/*private_directory/* (assuming $DOCUMENT_ROOT =
> '/usr/local/httpd/htdocs').
>
> If you are using $HTTP_START_URL, meta tags for robots in the HTML files
> will be checked, but a server-wide /robots.txt will be ignored. Remember
> that Perlfect Search should only be used to index your own server, not
> other people's server!
>
> --
> Daniel Naber, Paul-Gerhardt-Str. 2, 33332 Guetersloh, Germany
> Tel. 05241-59371, Mobil 0170-4819674
=====
Nutsy Waterbelly - dotwb2501@icqmail.com
WebMistress for GotSatan.com and www.underlandprivat.com
__________________________________________________
Do You Yahoo!?
Get email at your own domain with Yahoo! Mail.
http://personal.mail.yahoo.com/?.refer=text
|
|