<?xml version="1.0" encoding="UTF-8"?><rss version="2.0" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" > <channel><title>Comments for TECKpert</title> <atom:link href="http://www.teckpert.com/comments/feed/" rel="self" type="application/rss+xml" /><link>http://www.teckpert.com</link> <description>Your web and software development experts</description> <lastBuildDate>Wed, 16 Mar 2011 12:53:01 +0000</lastBuildDate> <sy:updatePeriod>hourly</sy:updatePeriod> <sy:updateFrequency>1</sy:updateFrequency> <generator>http://wordpress.org/?v=3.1.3</generator> <item><title>Comment on DOM manipulation with PHP, the ultimate page scraper? by Leo</title><link>http://www.teckpert.com/php/dom-manipulation-with-php-the-ultimate-page-scraper/comment-page-1/#comment-338</link> <dc:creator>Leo</dc:creator> <pubDate>Wed, 16 Mar 2011 12:53:01 +0000</pubDate> <guid isPermaLink="false">http://www.teckpert.com/?p=395#comment-338</guid> <description>@brian the article is about page scraping. Having just helped out on a project that manipulated the scrape with JavaSrcipt as you suggest, I found it extremely problematic. If the browser has JavaScript disabled, your script fails. Secondly, you have to wait for the page to load with all of its assets (images, stylesheets, flash ect.) before you can start manipulating, which means there is a delay before your scraped and repackaged page is served. If it is all handled with on the server side with PHP, the end user never has an issue then with a delay making it faster, or the page manipulation failing because of no JavaScript, or some other competing script on the page causing yours to fail.</description> <content:encoded><![CDATA[<p>@brian the article is about page scraping. Having just helped out on a project that manipulated the scrape with JavaSrcipt as you suggest, I found it extremely problematic. If the browser has JavaScript disabled, your script fails. Secondly, you have to wait for the page to load with all of its assets (images, stylesheets, flash ect.) before you can start manipulating, which means there is a delay before your scraped and repackaged page is served. If it is all handled with on the server side with PHP, the end user never has an issue then with a delay making it faster, or the page manipulation failing because of no JavaScript, or some other competing script on the page causing yours to fail.</p> ]]></content:encoded> </item> <item><title>Comment on DOM manipulation with PHP, the ultimate page scraper? by brian</title><link>http://www.teckpert.com/php/dom-manipulation-with-php-the-ultimate-page-scraper/comment-page-1/#comment-285</link> <dc:creator>brian</dc:creator> <pubDate>Tue, 18 Jan 2011 16:29:52 +0000</pubDate> <guid isPermaLink="false">http://www.teckpert.com/?p=395#comment-285</guid> <description>another way to do this is with native Javascript. instead of having to write $results = $xPath-&gt;query(&#039;//div[@class=&quot;why_us&quot;]&#039;);you could simply write $results = querySelectorAll(&#039;.div&#039;);i launched my first php screen scraper in 2003. for even better scraping, get yourself on a Ruby on Rails environment and check out scRUBY. happy coding!</description> <content:encoded><![CDATA[<p>another way to do this is with native Javascript. instead of having to write<br /> $results = $xPath-&gt;query(&#8216;//div[@class="why_us"]&#8216;);</p><p>you could simply write<br /> $results = querySelectorAll(&#8216;.div&#8217;);</p><p>i launched my first php screen scraper in 2003. for even better scraping, get yourself on a Ruby on Rails environment and check out scRUBY. happy coding!</p> ]]></content:encoded> </item> </channel> </rss>
<!-- Served from: www.teckpert.com @ 2012-02-06 01:57:09 by W3 Total Cache -->
