Home. 
.

transparent

transparent

transparent

Altova Mailing List Archives


Re: Removing duplicate entries/stories from a RSS feed?

From: Paul Lutus <nospam@------.--->
To: NULL
Date: 12/6/2006 10:26:00 AM

gaikokujinkyofusho@g... wrote:

> Hi, I have been enjoying being able to subscribe to RSS
> (http://kinja.com/user/thedigestibleaggie) for awhile and have come up
> with a fairly nice list of feeds but I have run into an annoying
> (though not critical) problem, duplicate stories. Apparently there is
> overlap with some of the sites I subscribe to so I get duplicate
> stories. Does anyone know of some sort of filter (software or online
> service) that can remove duplicate stories? Any help or suggestions
> would really be appreciated!

Write a script in a language that supports associative arrays (as do Java,
Perl, Ruby, Python, and even JavaScript). Key the associative array to a
unique key created out of elements in the various RSS feed items. Fill the
associative array using the generated key.

Unfortunately, it is rare for two RSS feed items to be truly identical.
Often, they tell the same story with small differences in wording (to avoid
accusations of plagiarism) and of course the URL is normally different.

Without some complex coding to detect items that are almost the same, the
above method will remove only genuinely identical items from different RSS
feeds.

-- 
Paul Lutus
http://www.arachnoid.com


transparent
Print
Mail
Like It
Disclaimer
.

These Archives are provided for informational purposes only and have been generated directly from the Altova mailing list archive system and are comprised of the lists set forth on www.altova.com/list/index.html. Therefore, Altova does not warrant or guarantee the accuracy, reliability, completeness, usefulness, non-infringement of intellectual property rights, or quality of any content on the Altova Mailing List Archive(s), regardless of who originates that content. You expressly understand and agree that you bear all risks associated with using or relying on that content. Altova will not be liable or responsible in any way for any content posted including, but not limited to, any errors or omissions in content, or for any losses or damage of any kind incurred as a result of the use of or reliance on any content. This disclaimer and limitation on liability is in addition to the disclaimers and limitations contained in the Website Terms of Use and elsewhere on the site.

.
.

transparent

transparent