Welcome! Log In Create A New Profile

Advanced

Some part of pages seem to not being indexed

Posted by holgi 
Some part of pages seem to not being indexed
April 19, 2007 02:33PM
Hi,

I do have the following problem:
Although words to appear in several pages, not all of them will be displayed when searching for them.

I took a deeper look into the (re-)index log and found out that there will be named following message:
Page contains less than 10 words
The strage thing is: the pages do contain more than 10 words. So I guess some parts of the pages will not be indexed.

Changing the minimum number of words to 1 causes an indexing of the pages but do not display a result for the word in it either.

Can anybody help? Pages is http://www.scoutnet.de/
As an example: word "Holgi" is as well part of page
http://www.scoutnet.de/scoutnet/wir.html
but will just be displayed in
http://www.scoutnet.de/kommunikation/mailinglists/pfadi-regeln.html
http://www.scoutnet.de/scoutnet/org.html
http://www.scoutnet.de/scoutnet/home.html

Another example is:
words above (and including) the line "4) Achte bei Deinen Mails auf richtiges Zitieren" will not be found in http://www.scoutnet.de/kommunikation/mailinglists/pfadi-regeln.html

Could it be that sphider do have problems with comments or xhtml-tags like <tag />

Thanks for your help in advance.

Holgi
Re: Some part of pages seem to not being indexed
April 23, 2007 11:39AM
I found a possible problem for that:
An ALT-Tag contained a "<" which could be the reason for the problems:

<img src="something.jpg" alt="<" />

Holgi
Re: Some part of pages seem to not being indexed
April 23, 2007 08:18PM
You should always check your pages with the [url=http://validator.w3.org/check?verbose=1&uri=http%3A%2F%2Fwww.scoutnet.de%2F]
w3.org validator[/url]
then you can find this type of errors.
wimb
Anonymous User
Re: Some part of pages seem to not being indexed
April 23, 2007 09:08PM
Try to index just 3 levels deep.
Sorry, only registered users may post in this forum.

Click here to login