<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Uncle, Mercy, Whatever: Please, Just Kill the Dupes</title>
	<atom:link href="http://redmonk.com/sogrady/2005/04/27/uncle-mercy-whatever-please-just-kill-the-dupes/feed/" rel="self" type="application/rss+xml" />
	<link>http://redmonk.com/sogrady/2005/04/27/uncle-mercy-whatever-please-just-kill-the-dupes/</link>
	<description>because technology is just another ecosystem</description>
	<lastBuildDate>Sun, 13 May 2012 00:23:29 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
	<item>
		<title>By: Bob Wyman</title>
		<link>http://redmonk.com/sogrady/2005/04/27/uncle-mercy-whatever-please-just-kill-the-dupes/comment-page-1/#comment-659</link>
		<dc:creator>Bob Wyman</dc:creator>
		<pubDate>Thu, 28 Apr 2005 13:27:07 +0000</pubDate>
		<guid isPermaLink="false">http://redmonk.com/sogrady/wp/?p=416#comment-659</guid>
		<description>The problem seems to be the way that ads are being inserted into the entry you are getting multiple copies of. We&#039;re working to see what we can do to work around this example of really bad ad insertion practice. 
 
It appears that the Industry Standard is serving ads from Doubleclick and the method they are using is simply wrong. What is happening is that the ad links are changing on a regular basis and thus our &quot;duplicate detection&quot; code thinks that the entry has changed. This will impact any post from an Industry Standard feed with ads in it. The fact that you&#039;re using PubSub is not relevant here. Just about any aggregator will get confused by posts that have changing content like this. 
 
Because of weaknesses in the RSS specification that are being addressed in the new Atom format, aggregators are currently forced to do textual analysis in order to detect changes to RSS posts. We can&#039;t simply rely on things like the optional RSS GUID when checking for duplicates. In the future, Atom will give us required unique IDs for entries and that will make it easier to detect duplicates... But, for now, we&#039;re stuck with a lot of old-style RSS feeds... Given this, it is vitally important that when someone inserts ads in RSS items, the text of the ad links *must not change* once the ad is inserted. The only alternative would be for us to build special code that recognizes the ads being inserted by each of the many advertisers and handles them specially. Personally, I don&#039;t think that is reasonble, however, if we only have a few &quot;bad apples&quot; (like DoubleClick), it might be workable for the short term until people start moving to Atom. 
 
I&#039;ll write more on this on my blog later today. See: &lt;a href=&quot;http://bobwyman.pubsub.com/&quot;&gt;http://bobwyman.pubsub.com/&lt;/a&gt; 
 
bob wyman </description>
		<content:encoded><![CDATA[<p>The problem seems to be the way that ads are being inserted into the entry you are getting multiple copies of. We&#039;re working to see what we can do to work around this example of really bad ad insertion practice. </p>
<p>It appears that the Industry Standard is serving ads from Doubleclick and the method they are using is simply wrong. What is happening is that the ad links are changing on a regular basis and thus our &quot;duplicate detection&quot; code thinks that the entry has changed. This will impact any post from an Industry Standard feed with ads in it. The fact that you&#039;re using PubSub is not relevant here. Just about any aggregator will get confused by posts that have changing content like this. </p>
<p>Because of weaknesses in the RSS specification that are being addressed in the new Atom format, aggregators are currently forced to do textual analysis in order to detect changes to RSS posts. We can&#039;t simply rely on things like the optional RSS GUID when checking for duplicates. In the future, Atom will give us required unique IDs for entries and that will make it easier to detect duplicates&#8230; But, for now, we&#039;re stuck with a lot of old-style RSS feeds&#8230; Given this, it is vitally important that when someone inserts ads in RSS items, the text of the ad links *must not change* once the ad is inserted. The only alternative would be for us to build special code that recognizes the ads being inserted by each of the many advertisers and handles them specially. Personally, I don&#039;t think that is reasonble, however, if we only have a few &quot;bad apples&quot; (like DoubleClick), it might be workable for the short term until people start moving to Atom. </p>
<p>I&#039;ll write more on this on my blog later today. See: <a href="http://bobwyman.pubsub.com/">http://bobwyman.pubsub.com/</a> </p>
<p>bob wyman </p>
]]></content:encoded>
	</item>
	<item>
		<title>By: James Governor</title>
		<link>http://redmonk.com/sogrady/2005/04/27/uncle-mercy-whatever-please-just-kill-the-dupes/comment-page-1/#comment-658</link>
		<dc:creator>James Governor</dc:creator>
		<pubDate>Thu, 28 Apr 2005 10:07:58 +0000</pubDate>
		<guid isPermaLink="false">http://redmonk.com/sogrady/wp/?p=416#comment-658</guid>
		<description>might this be something to do with industry standards feed? normally technorati does this, not pubsub so much </description>
		<content:encoded><![CDATA[<p>might this be something to do with industry standards feed? normally technorati does this, not pubsub so much </p>
]]></content:encoded>
	</item>
	<item>
		<title>By: James Governor</title>
		<link>http://redmonk.com/sogrady/2005/04/27/uncle-mercy-whatever-please-just-kill-the-dupes/comment-page-1/#comment-657</link>
		<dc:creator>James Governor</dc:creator>
		<pubDate>Thu, 28 Apr 2005 06:08:57 +0000</pubDate>
		<guid isPermaLink="false">http://redmonk.com/sogrady/wp/?p=416#comment-657</guid>
		<description>the worst thing - i hate the quote in that article!!! i mean coldfusion is one thing i mentioned. now like ten times a day the story comes leaping out at me and i think why didn&#039;t i say something more insightful....  
 
and you begin to imagine that *everyone* keeps seeing the same story over and over, even though its just hitting me because of my vanity feed... </description>
		<content:encoded><![CDATA[<p>the worst thing &#8211; i hate the quote in that article!!! i mean coldfusion is one thing i mentioned. now like ten times a day the story comes leaping out at me and i think why didn&#039;t i say something more insightful&#8230;.  </p>
<p>and you begin to imagine that *everyone* keeps seeing the same story over and over, even though its just hitting me because of my vanity feed&#8230; </p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Edward O&#039;Connor</title>
		<link>http://redmonk.com/sogrady/2005/04/27/uncle-mercy-whatever-please-just-kill-the-dupes/comment-page-1/#comment-656</link>
		<dc:creator>Edward O&#039;Connor</dc:creator>
		<pubDate>Wed, 27 Apr 2005 20:31:48 +0000</pubDate>
		<guid isPermaLink="false">http://redmonk.com/sogrady/wp/?p=416#comment-656</guid>
		<description>Of course, a few hours after I left that comment I get four bajillion dupes in several feeds which I&#039;ve  checked with `ignore&#039;, so it&#039;s not the be-all end-all solution by any means. </description>
		<content:encoded><![CDATA[<p>Of course, a few hours after I left that comment I get four bajillion dupes in several feeds which I&#039;ve  checked with `ignore&#039;, so it&#039;s not the be-all end-all solution by any means. </p>
]]></content:encoded>
	</item>
	<item>
		<title>By: sogrady</title>
		<link>http://redmonk.com/sogrady/2005/04/27/uncle-mercy-whatever-please-just-kill-the-dupes/comment-page-1/#comment-655</link>
		<dc:creator>sogrady</dc:creator>
		<pubDate>Wed, 27 Apr 2005 16:12:06 +0000</pubDate>
		<guid isPermaLink="false">http://redmonk.com/sogrady/wp/?p=416#comment-655</guid>
		<description>i can&#039;t believe i never thought of that; i&#039;m an idiot. either way, bless you sir. </description>
		<content:encoded><![CDATA[<p>i can&#039;t believe i never thought of that; i&#039;m an idiot. either way, bless you sir. </p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Edward O&#039;Connor</title>
		<link>http://redmonk.com/sogrady/2005/04/27/uncle-mercy-whatever-please-just-kill-the-dupes/comment-page-1/#comment-654</link>
		<dc:creator>Edward O&#039;Connor</dc:creator>
		<pubDate>Wed, 27 Apr 2005 13:16:16 +0000</pubDate>
		<guid isPermaLink="false">http://redmonk.com/sogrady/wp/?p=416#comment-654</guid>
		<description>When I subscribe to a feed in Bloglines, I start with the &quot;Display updated entries&quot; option. If the feed passes some dupe threshold of annoyance, I change that to &quot;ignore updated entries.&quot; This seems to take care of 90% of this problem for me. </description>
		<content:encoded><![CDATA[<p>When I subscribe to a feed in Bloglines, I start with the &quot;Display updated entries&quot; option. If the feed passes some dupe threshold of annoyance, I change that to &quot;ignore updated entries.&quot; This seems to take care of 90% of this problem for me. </p>
]]></content:encoded>
	</item>
</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Page Caching using memcached
Object Caching 315/317 objects using xcache

Served from: redmonk.com @ 2012-05-26 02:02:15 -->
