fyi- still wrestling with search bots

Recent Classifieds

Forum statistics

Threads
182,975
Messages
2,536,202
Members
95,697
Latest member
JohnWiddick
Recent bookmarks
0

Sean

Admin
Admin
Joined
Aug 29, 2002
Messages
11,780
Location
New Zealand
Shooter
Multi Format
We got hit by googlebot again eventhough I had items in pace to turn it away. I'm looking into it and things seem to be stable.
 
OP
OP
Sean

Sean

Admin
Admin
Joined
Aug 29, 2002
Messages
11,780
Location
New Zealand
Shooter
Multi Format
well.. from what I can tell the methods I put in place to block these bots can take up to 30 days to kick in. Will just keep an eye on things until then..
 
Joined
Sep 7, 2002
Messages
516
Location
Sacramento
Shooter
Medium Format
Sean,

Enlighten me a little. What does the bot do that causes problems? Also, how will the data on APUG be searchable if you block the bots (or is that the price to pay for not permitting a bot to crawl the site)?


---Michael
 
OP
OP
Sean

Sean

Admin
Admin
Joined
Aug 29, 2002
Messages
11,780
Location
New Zealand
Shooter
Multi Format
well the bots are like leeches. They find your site and latch onto it, sometimes for days pulling all the data from your pages. I think what's happening is that these bots might hit something they think should exist but doesn't. For example maybe a gallery image or a forum post, it gets stuck on this, glitches, and the cpu usage goes to 100%. People on dedicated servers with a lot of grunt can usually ride these things out, but we don't have that luxury yet. Most of the site is already indexed by the major engines so we still get some good visibility for now. When I have more time I'll try to get to the bottom of this and get some indexing back online..
 

rbarker

Member
Joined
Oct 31, 2004
Messages
2,218
Location
Rio Rancho,
Shooter
Multi Format
FWIW, I just read an article in one of the IT mags that indicated some nefarious sites are using Google searches to probe web sites for security vulnerabilities. I wonder if this might be what's happening, Sean.
 
OP
OP
Sean

Sean

Admin
Admin
Joined
Aug 29, 2002
Messages
11,780
Location
New Zealand
Shooter
Multi Format
possible maybe but all indications of the ip address show that it is one of their "googlebots". seems the net is getting more and more like a battle field :sad:
 

mark

Member
Joined
Nov 13, 2003
Messages
5,690
John McCallum said:
Hey what'd we do??? :tongue: :tongue:

Sean unleashed the dreaded analoguebot to do battle with the evil invading googlebot. Analogue bot took out google. :cool:
 

garryl

Member
Joined
Jul 19, 2003
Messages
542
Location
Fort Worth,
Shooter
35mm
I think it's time to call in "Optimus Prime"! :D
Dead Link Removed
 

edz

Member
Joined
Dec 4, 2002
Messages
685
Location
Munich, Germ
Shooter
Multi Format
g said:
is that these bots might hit something they think should exist but doesn't. For example maybe a gallery image or a forum post, it gets stuck on this, glitches, and the cpu usage goes to 100%.
This is something very much apart of the world of a few of the PHP kits. They do the silly assumption that if a resource does not exist then deliver a default resource.. and something this gets recursive.. One solution, if you can't fix it, is to deliver the pages via a reverse proxy server..
 
Photrio.com contains affiliate links to products. We may receive a commission for purchases made through these links.
To read our full affiliate disclosure statement please click Here.

PHOTRIO PARTNERS EQUALLY FUNDING OUR COMMUNITY:



Ilford ADOX Freestyle Photographic Stearman Press Weldon Color Lab Blue Moon Camera & Machine
Top Bottom