<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>
<channel>
	<title>Comments on: Learning Python</title>
	<atom:link href="http://yourhtmlsource.com/phdblog/2005/12/06/learning-python/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.yourhtmlsource.com/phdblog/2005/12/06/learning-python/</link>
	<description>Researchin' the day away...</description>
	<pubDate>Wed, 20 Aug 2008 18:11:33 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.5</generator>
		<item>
		<title>By: Aaron</title>
		<link>http://www.yourhtmlsource.com/phdblog/2005/12/06/learning-python/#comment-5</link>
		<dc:creator>Aaron</dc:creator>
		<pubDate>Fri, 09 Dec 2005 02:18:59 +0000</pubDate>
		<guid isPermaLink="false">http://yourhtmlsource.com/phdblog/?p=12#comment-5</guid>
		<description>I go with Joe... ;-) 

Try to thing about this assignment in terms of the software engineering involved. ie. an interface between components that can change where you would like to automatically detect this and perhaps loads up an alternate scrapper component as a result, perhaps a simpler one. ie. you have a tailored scrapper but you never know when or where the content publishers will mess with the interface so you ripple back to a more basic scrapper at each stage being able to test that what you scrap is content and not noise... 

Aaron.</description>
		<content:encoded><![CDATA[<p>I go with Joe&#8230; <img src='http://www.yourhtmlsource.com/phdblog/smilies/msn_wink.gif' alt='&#59;&#45;&#41;' class='wp-smiley' width='19' height='19' title='&#59;&#45;&#41;' /> </p>
<p>Try to thing about this assignment in terms of the software engineering involved. <acronym title="Internet Explorer">IE</acronym>. an interface between components that can change where you would like to automatically detect this and perhaps loads up an alternate scrapper component as a result, perhaps a simpler one. <acronym title="Internet Explorer">IE</acronym>. you have a tailored scrapper but you never know when or where the content publishers will mess with the interface so you ripple back to a more basic scrapper at each stage being able to test that what you scrap is content and not noise&#8230; </p>
<p>Aaron.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
