For crawling in PHP I have always used the fantastic cURL.
My curl single-threaded function:
[code lang="php"]
function singlethread_crawl($url)
{
$agent = "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)";
$ch = curl_init();
curl_setopt($ch, CURLOPT_NOSIGNAL, 1);
curl_setopt($ch,
(more...)
On ThreadWatch yesterday there was a thread about rank checkers, and I couldn't believe that some SEOs don't use them. We use our own heavy duty mega serp scraper to fully analyse any industry we are working in. Anyway, Graywolf mentioned how he would love a Google RSS or XML feed - I having been waiting for this for a long time, as their SERPs are so
(more...)
Tony has written the best christmas blog post this year!! He has done some investigating and found the Amazon wishlists for a number of people in the Search industry. Included are: Jim Boykin, Andy Beal, Aaron Wall, Jensense, Matt Cutts, Todd Malicoat, Chris Pirillo, Sergey Brin & Jeremy Zawodny.
Funniest items have to be Jason's "SEO for dummies" and Zawodny's "The Clitourist : A Guide to One of the
(more...)
Newsflash!! - Matt Cutts, everyone's favourite Google Engineer and PR supremo has been moonlighting as a Private Detective!!
Is this his new method of tracking down spammers? Or has he returned to his Government Intelligence roots?
Or maybe Earl Grey is just canonicalizing Matt & Newsweek for finding his spam network.
I can be quite slow sometimes :)
I only just realised that you could view the source code (html) of the Adsense javascript include in Firefox.
Just right-click on the ads, choose 'This Frame' then 'View Frame Source' (obivously I knew this for normal frames :)):
It has always bugged me that you can't change the ordering of the accounts in Thunderbird.
But today I came across something built for the job - Folderpane Tools