Archive for April, 2009

Yanga WorldSearch Bot v1.1/beta – legit and misbehaved

Sunday, April 19th, 2009

I was looking in my hitlogs and I noticed that Yanga WorldSearch Bot v1.1/beta was fetching a whole lot of pages and (ab)using a truckload of bandwidth.

I searched around and found blogs with debates about it being “legit” or not.

It’s legit

The Yanga gang do actually have a search engine interface where you can search among the pages they crawl. A few searches on things like my own name did produce results. The evidence suggests that this is in fact a useful crawler which is used to provide a public service.

..and misbehaved

It must be mentioned that this crawler ate more than four thousand pages off one website during the last 24 hours. That really is a whole lot of pages. The logs further indicate that their crawler is very stupid and unpolite.

I’ll allow Yanga for now, and I recommend allowing it since it does appear to be useful – if your server can handle the immense load it puts on it when eating pages. Supporting alternatives to the heavily-censored Google search-engine is a good thing. I do, however, recommend that those who host sites with heavy PHP/MySQL usage on weak servers just -j DROP their IPs as it does strain serversĀ  to the point where users may notice a slowdown.


Sphere: Related Content

A quick look at BuddyPress

Thursday, April 2nd, 2009

I took a quick look at BuddyPress today. It’s quite impressive.

I like that they’ve got support for Groups. That’s nice.

livelyblog.com | Random blog | Login | Get your own blog | ^^^
oyvinds.livelyblog.com/Login