<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: PHP Crawler</title>
	<atom:link href="http://www.nott.org/blog/php-crawler.html/feed" rel="self" type="application/rss+xml" />
	<link>http://www.nott.org/blog/php-crawler.html</link>
	<description>SEO, Music, Photography &#38; Other Stuff</description>
	<lastBuildDate>Fri, 03 Dec 2010 06:50:41 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.4</generator>
	<item>
		<title>By: Air Force 1</title>
		<link>http://www.nott.org/blog/php-crawler.html/comment-page-1#comment-34857</link>
		<dc:creator>Air Force 1</dc:creator>
		<pubDate>Sun, 06 Sep 2009 11:06:29 +0000</pubDate>
		<guid isPermaLink="false">http://www.nott.org/?p=46#comment-34857</guid>
		<description>This is a wonderful script. But I used snoopy</description>
		<content:encoded><![CDATA[<p>This is a wonderful script. But I used snoopy</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Tuvok</title>
		<link>http://www.nott.org/blog/php-crawler.html/comment-page-1#comment-31067</link>
		<dc:creator>Tuvok</dc:creator>
		<pubDate>Wed, 10 Jun 2009 16:01:28 +0000</pubDate>
		<guid isPermaLink="false">http://www.nott.org/?p=46#comment-31067</guid>
		<description>Thanks. It&#039;s nice to see examples of very useful things.

Which option is best for a 100,000+ url crawl project, with hardware not being an obstacle ?</description>
		<content:encoded><![CDATA[<p>Thanks. It&#8217;s nice to see examples of very useful things.</p>
<p>Which option is best for a 100,000+ url crawl project, with hardware not being an obstacle ?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: David Arakelian</title>
		<link>http://www.nott.org/blog/php-crawler.html/comment-page-1#comment-20705</link>
		<dc:creator>David Arakelian</dc:creator>
		<pubDate>Sun, 17 Feb 2008 18:05:51 +0000</pubDate>
		<guid isPermaLink="false">http://www.nott.org/?p=46#comment-20705</guid>
		<description>Thanks for sharing your code on the use of the cURL multi functions. I run a lot of scripts that take a long time to execute (4 - 5 days in some cases) because I am using a single stream. Using your code and some proxies I can probably get this down to a few hours :)</description>
		<content:encoded><![CDATA[<p>Thanks for sharing your code on the use of the cURL multi functions. I run a lot of scripts that take a long time to execute (4 &#8211; 5 days in some cases) because I am using a single stream. Using your code and some proxies I can probably get this down to a few hours <img src='http://www.nott.org/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Rays</title>
		<link>http://www.nott.org/blog/php-crawler.html/comment-page-1#comment-12024</link>
		<dc:creator>Rays</dc:creator>
		<pubDate>Sun, 11 Nov 2007 07:40:16 +0000</pubDate>
		<guid isPermaLink="false">http://www.nott.org/?p=46#comment-12024</guid>
		<description>I&#039;m confuse..
Somebody help me please?
Maybe, i must create auto crawler engine.
Hemm.. i need team work for this time!</description>
		<content:encoded><![CDATA[<p>I&#8217;m confuse..<br />
Somebody help me please?<br />
Maybe, i must create auto crawler engine.<br />
Hemm.. i need team work for this time!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mike</title>
		<link>http://www.nott.org/blog/php-crawler.html/comment-page-1#comment-9900</link>
		<dc:creator>Mike</dc:creator>
		<pubDate>Mon, 15 Oct 2007 16:25:48 +0000</pubDate>
		<guid isPermaLink="false">http://www.nott.org/?p=46#comment-9900</guid>
		<description>Rishiraj - use the &#039;mycrawler_single&#039; function described in the post above. It should only need minimal tweaking to work in php 4.</description>
		<content:encoded><![CDATA[<p>Rishiraj &#8211; use the &#8216;mycrawler_single&#8217; function described in the post above. It should only need minimal tweaking to work in php 4.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Rishiraj</title>
		<link>http://www.nott.org/blog/php-crawler.html/comment-page-1#comment-9899</link>
		<dc:creator>Rishiraj</dc:creator>
		<pubDate>Mon, 15 Oct 2007 16:19:15 +0000</pubDate>
		<guid isPermaLink="false">http://www.nott.org/?p=46#comment-9899</guid>
		<description>I have freeernti hosting server with php 4.7.
Is there any alternative for me if i don&#039;t want to use curl?</description>
		<content:encoded><![CDATA[<p>I have freeernti hosting server with php 4.7.<br />
Is there any alternative for me if i don&#8217;t want to use curl?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mike</title>
		<link>http://www.nott.org/blog/php-crawler.html/comment-page-1#comment-786</link>
		<dc:creator>Mike</dc:creator>
		<pubDate>Wed, 13 Sep 2006 09:40:11 +0000</pubDate>
		<guid isPermaLink="false">http://www.nott.org/?p=46#comment-786</guid>
		<description>If you don&#039;t know what to do with the code, then it isn&#039;t for you :-)</description>
		<content:encoded><![CDATA[<p>If you don&#8217;t know what to do with the code, then it isn&#8217;t for you <img src='http://www.nott.org/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
	<item>
		<title>By: HM2K</title>
		<link>http://www.nott.org/blog/php-crawler.html/comment-page-1#comment-178</link>
		<dc:creator>HM2K</dc:creator>
		<pubDate>Thu, 01 Jun 2006 14:26:20 +0000</pubDate>
		<guid isPermaLink="false">http://www.nott.org/?p=46#comment-178</guid>
		<description>You never once actually said what this function does, &quot;PHP Crawler&quot; is VERY generic.

Also you have not shown any examples of output.</description>
		<content:encoded><![CDATA[<p>You never once actually said what this function does, &#8220;PHP Crawler&#8221; is VERY generic.</p>
<p>Also you have not shown any examples of output.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jeff Beck</title>
		<link>http://www.nott.org/blog/php-crawler.html/comment-page-1#comment-26</link>
		<dc:creator>Jeff Beck</dc:creator>
		<pubDate>Wed, 18 Jan 2006 19:24:06 +0000</pubDate>
		<guid isPermaLink="false">http://www.nott.org/?p=46#comment-26</guid>
		<description>I&#039;ve been trying to do much of the same thing on a site I have been working on.  The problem I have been running up against has been fSockOpen; it waits for a connection to the server before returning.  As such, when you loop through an array of URLs, it can take maybe twice as long to connect and retrieve data as with cURL multi.  The fastest solution I have seen involved fSockOpen connecting to the local machine (aka PHP multithread HACK), and then letting each PHPlet file do the crawling independantly.  This cuts even the cURL multi in half, but is not very pratical on the server load (1 page = 30+ httpd files running).  Speed wise, have you compared your PHP function with the cURL function?</description>
		<content:encoded><![CDATA[<p>I&#8217;ve been trying to do much of the same thing on a site I have been working on.  The problem I have been running up against has been fSockOpen; it waits for a connection to the server before returning.  As such, when you loop through an array of URLs, it can take maybe twice as long to connect and retrieve data as with cURL multi.  The fastest solution I have seen involved fSockOpen connecting to the local machine (aka PHP multithread HACK), and then letting each PHPlet file do the crawling independantly.  This cuts even the cURL multi in half, but is not very pratical on the server load (1 page = 30+ httpd files running).  Speed wise, have you compared your PHP function with the cURL function?</p>
]]></content:encoded>
	</item>
</channel>
</rss>

