<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>SEO WebMonkey &#187; ethics</title>
	<atom:link href="http://seowebmonkey.com/stuff/ethics/feed/" rel="self" type="application/rss+xml" />
	<link>http://seowebmonkey.com</link>
	<description>Web design &#38; development with an ample sprinkle of SEO</description>
	<lastBuildDate>Tue, 01 Jun 2010 09:03:32 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<xhtml:meta xmlns:xhtml="http://www.w3.org/1999/xhtml" name="robots" content="noindex" />
		<item>
		<title>Jamming the scraper signals</title>
		<link>http://seowebmonkey.com/jamming-the-scraper-signals/</link>
		<comments>http://seowebmonkey.com/jamming-the-scraper-signals/#comments</comments>
		<pubDate>Fri, 12 Dec 2008 11:01:13 +0000</pubDate>
		<dc:creator>ndixon</dc:creator>
				<category><![CDATA[Everything else]]></category>
		<category><![CDATA[copyright]]></category>
		<category><![CDATA[ethics]]></category>
		<category><![CDATA[plugins]]></category>
		<category><![CDATA[Wordpress]]></category>

		<guid isPermaLink="false">http://seowebmonkey.com/jamming-the-scraper-signals/</guid>
		<description><![CDATA[<p>Many bloggers have experienced their content being legitimately syndicated onto other sites and permit it to happen as part of their promotion. But many have also experienced the scrapers: sites that illegally duplicate entire blog contents, replicating new and old posts in order to populate their sites with content. A Wordpres plugin helps us fight back.</p>
]]></description>
			<content:encoded><![CDATA[<p>I recently had this issue from my main, personal blog. A less than ethical site (I&#8217;m not giving it the benefit of a link) had scraped every word from my RSS feed, and replicated it in a vBulletin forum as individual posts. (Intriguingly, every link in those posts was replaced by a message to register for the forum in order to see the link.)</p>
<p><span style="font-size: 18px; font-weight: bold;">If you want content, I&#8217;ll give you content</span></p>
<p>This was the first instance of all out scraping I had experienced, so also was the first opportunity to try out a Wordpress plugin that has been collecting dust for some time. <a href="http://asymptomatic.net/2006/09/22/88/help-defeat-the-sploggers-with-antileech">Antileech</a> is a plugin by <a href="http://asymptomatic.net/">Owen Winkler</a> that replaces the content within your RSS feed &#8211; and posts &#8211; with a definable or generated message, but only for user-agents or IP addresses you specify.</p>
<p>The plugin adds a small, non-intrusive image to your RSS feed content that enables it to record the location and user-agent of anyone accessing the feed. These user-agents are then listed in the plugin&#8217;s settings page, each with a check-box for you to select which ones are to receive the alternative content. The default is to send normal content to all user-agents, so who sees what, is entirely under your control.</p>
<h2>The message that counts</h2>
<p>Some bloggers choose the default generated message &#8211; that encourages anyone reading the alternative content to visit the originating site &#8211; while others choose their own messages &#8211; some including profanity and obscene/illegal messages to increase the chances of the scraper site being shut down due to inappropriate content. I have chosen a simple message that clearly states the site is stealing content from elsewhere. Most scrapers are automated and once the site owner has set the RSS scraper to work, they rarely look at the incoming content again.</p>
<p>The plugin does not prevent your content from being scraped, of course, and some has to be scraped for you to discover the illegal site in the first place. But once discovered, it offers a very simple and immediate means of ensuring they get no further benefit from future posts from your site. Meanwhile, you can take measures with the site&#8217;s hosting company to formally make a complaint to have your stolen content removed.</p>
<h2>How to know when you&#8217;re being scraped</h2>
<p>Another plugin helps to detect scrapers. There are several that do similar jobs, but I choose to use <a href="http://www.maxpower.ca/wordpress-plugin-digital-fingerprint-detecting-content-theft/2006/09/25/">Digital Fingerprint</a> by <a href="http://www.maxpower.ca/">Kirk Montgomery</a>. This places a user defined string of text at the end of the first paragraph of each post in your RSS feed. Make this string unique and you can create a Google Alert for that particular string. The Alert will let you know whenever and wherever that string turns up. Most will be legitimate syndication, but now and then, you&#8217;ll probably discover someone is up to no good.</p>
<h2>A never ending battle</h2>
<p>Scraper sites (or &#8220;splogs&#8221;) are never going to be eradicated, and their numbers are growing. Tools like these offer the blogger a viable and effective means to retaliate without losing hours scouring the net or duplicates of their content and struggling to contact those responsible to have the content removed.</p>
]]></content:encoded>
			<wfw:commentRss>http://seowebmonkey.com/jamming-the-scraper-signals/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Comment spammers want to hear from you</title>
		<link>http://seowebmonkey.com/comment-spammers-want-to-hear-from-you/</link>
		<comments>http://seowebmonkey.com/comment-spammers-want-to-hear-from-you/#comments</comments>
		<pubDate>Mon, 17 Nov 2008 12:24:14 +0000</pubDate>
		<dc:creator>ndixon</dc:creator>
				<category><![CDATA[Everything else]]></category>
		<category><![CDATA[comments]]></category>
		<category><![CDATA[ethics]]></category>
		<category><![CDATA[spam]]></category>

		<guid isPermaLink="false">http://seowebmonkey.com/?p=37</guid>
		<description><![CDATA[I once wrote about my experience with "<a href="http://neildixon.com/spammers-concerned-about-their-karma/">ethical spammers</a>" back in 2005. <br /><br />In that instance, the wiki spammer added an almost apologetic message attached to the spam that they kept the original content intact (while adding a huge chunk of spam links, of course).]]></description>
			<content:encoded><![CDATA[<p>This morning, I discover this message pushed into the comments of one of my blogs&#8230;</p>
<blockquote><p>Author : SpoomiDibebot (IP: <a rel="nofollow" href="http://194.165.42.49/" target="_blank">194.165.42.49</a> , <a rel="nofollow" href="http://194.165.42.49/" target="_blank">194.165.42.49</a>)<br />
E-mail : <a rel="nofollow" href="mailto:babader@mymail-in.net">babader@mymail-in.net</a><br />
URL    :<br />
Whois  : <a rel="nofollow" href="http://ws.arin.net/cgi-bin/whois.pl?queryinput=194.165.42.49" target="_blank">http://ws.arin.net/cgi-bin/whois.pl?queryinput=194.165.42.49</a><br />
Comment:<br />
to: Admin &#8211; If You want to delete your site from my spam list, please sent url of your domain to my e-mail: <a rel="nofollow" href="mailto:stop.spam.today@gmail.com">stop.spam.today@gmail.com</a><br />
And I will remove your site from my base within 24 hours<br />
webmastegz</p></blockquote>
<p>The comment did not get through, of course, because I always have comment moderation applied to blogs. (I have no reason to believe this spam comment is related to the above mentioned ethical spammers.)</p>
<p>I give this spammer 10 bonus points for such an original means to harvest genuine email addresses from blog owners to add to his other (email) spam list. Nice work, Mr. SpoomiDibebot (I&#8217;m guessing that is not his real name).</p>
<p>I wonder if such emails will also open up the possibility of unmoderated posting on the relevant blogs, too. I guess most web users will have a primary contact email address, the same one they might have attached to a blog&#8217;s admin account. The spammer then has the admin&#8217;s email plus its associated blog address. Comment moderation on bogs tends to revolve around the email address of the commenter &#8211; bingo! the spammer has a pre-approved email address which might by-pass the moderation process.</p>
<p>Loathe them as we do, I have to admire this one for such a (rare) original idea.</p>
]]></content:encoded>
			<wfw:commentRss>http://seowebmonkey.com/comment-spammers-want-to-hear-from-you/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
	</channel>
</rss>
