Peter Vandenabeele
Google bot is _much_ more active than msn, Yahoo, Enabot and WebAlta
Sat, 01/03/2008 - 01:52 — peterv
What a difference ... Between 11:00 and 23:00 today, Googlebot visited all 923 individual job openings for Leuven (on http://allejobsinleuven.be) and all list pages, while the other spiders were very lame ...
Compare this:
~/logs$ grep -v javascript nginx-access.log | grep -i bot | grep -i google | wc -l
1188
~/logs$ grep -v javascript nginx-access.log | grep -i Yahoo | wc -l
23
~/logs$ grep -v javascript nginx-access.log | grep -i EnaBot | wc -l
8
~/logs$ grep -v javascript nginx-access.log | grep -i msn | wc -l
4
~/logs$ grep -v javascript nginx-access.log | grep -i WebAlta | wc -l
4
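The same counts can be collected in one pass with a small loop. This is only a sketch: the helper name count_hits and the LOG variable are made up for illustration, and it matches "googlebot" directly instead of chaining two greps, so the Google number may differ slightly from the one above.

```shell
#!/bin/sh
# Hypothetical helper (not part of the original commands): count the log
# lines that match a spider name, case-insensitively, after dropping
# every line that mentions javascript.
count_hits() {
    # $1 = spider pattern, $2 = access log file
    grep -v javascript "$2" | grep -i -c -e "$1"
}

# Assumed log location; pass another file as the first argument.
LOG="${1:-nginx-access.log}"

if [ -r "$LOG" ]; then
    for spider in googlebot Yahoo EnaBot msn WebAlta; do
        printf '%-10s %s\n' "$spider" "$(count_hits "$spider" "$LOG")"
    done
fi
```

Matching on a plain substring is as crude as the one-liners above: a pattern like msn also counts any request line or referrer that happens to contain those letters.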
Maybe the other spiders will come too (with a delay?).
Anonymous wrote:
Well, you can submit your site to MSN and Yahoo yourself; that might help (no guarantee though).
MSN: http://search.msn.com/docs/submit.aspx
Yahoo: https://siteexplorer.search.yahoo.com/submit
Also, creating a sitemap file won't hurt either (http://www.sitemaps.org)...
Bart H
peterv wrote:
Thanks for the hints, Bart.
MSN and Yahoo have found the site in the meantime (and it ranks #1 for the query "Jobs in Leuven"). But there are _very_ few incoming hits from MSN and Yahoo; the hits come mainly from Google.
I have also submitted an optimized sitemap and tuned robots.txt and the nofollow tags to focus on the pages that have actual differentiating content. That helped. The problem was that I had many automatically generated pages with little original content (except for the job title). Google doesn't seem to like that (with good reason ...).
Peter Vandenabeele - http://www.vandenabeele.com