<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Structure in the flow &#187; information overload</title>
	<atom:link href="http://www.fsavard.com/flow/tag/information-overload/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.fsavard.com/flow</link>
	<description>Programming, personal knowledge management. Topics unstable.</description>
	<lastBuildDate>Thu, 29 Jul 2010 16:05:26 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
		<item>
		<title>For the curious: the incentive to mass-produce information</title>
		<link>http://www.fsavard.com/flow/2008/10/for-the-curious-the-incentive-to-mass-produce-information/</link>
		<comments>http://www.fsavard.com/flow/2008/10/for-the-curious-the-incentive-to-mass-produce-information/#comments</comments>
		<pubDate>Mon, 06 Oct 2008 01:23:33 +0000</pubDate>
		<dc:creator>Francois</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[for the curious]]></category>
		<category><![CDATA[information overload]]></category>
		<category><![CDATA[pkm]]></category>

		<guid isPermaLink="false">http://www.fsavard.com/flow/?p=198</guid>
		<description><![CDATA[I always find it interesting to understand the phenomena that affect me and discover their <strong>root causes</strong>, be that in politics or <strong>information overload (IO)</strong>. About IO, one path to explore is the <strong>motivation behind the production of information</strong>.

As you may know, on the Web there are clear incentives to get good search engine rankings to drive traffic to your site, in some cases generating revenue from ads. I only recently realized that this sometimes involves generating massive amounts of content, and I found the details rather startling.

In the following I explain why some people <strong>generate whole websites very quickly without adding any real value</strong> to existing information to <strong>profit from advertising</strong>.]]></description>
			<content:encoded><![CDATA[<p>I always find it interesting to understand the phenomena that affect me and discover their root causes, be that in politics or information overload (IO). About IO, one path to explore is the <strong>motivation behind the production of information</strong>.</p>
<p>As you may know, on the Web there are clear incentives to get good search engine rankings to drive traffic to your site, in some cases generating revenue from ads. I only recently realized that this sometimes <strong>involves generating massive amounts of content</strong>, and I found the details rather startling.</p>
<h3>Some background on &#8220;Internet marketing&#8221;</h3>
<p><strong>Search Engine Optimization (SEO)</strong> is the work done to make a site stand out in search engine results, for obvious marketing purposes. Some people specialize in doing this. No, scratch that: there&#8217;s a <strong>whole industry</strong> centered around this.</p>
<p>Some <strong>SEO techniques</strong> are entirely <strong>ethical</strong> and simply make good content easier to find: they&#8217;re referred to as <strong>&#8220;white hat&#8221;</strong>. Other techniques, <strong>&#8220;black hat&#8221;</strong>, use devious ways to route people to less desirable content.</p>
<p>There are ways to benefit directly from search engine rankings <strong>without really creating any value</strong> in the process. One of those ways is the <strong>pathological MFA (Made For AdSense) site</strong> (AdSense and AdWords being Google&#8217;s ad programs).</p>
<p>These are the sites you end up on when you mistype a domain name and, bam, get a page which is essentially a <strong>fresco of Google ads</strong> with some teeny-weeny content lost in there somewhere. If someone manages to get a good spot in Google results for a rather popular keyword with his MFA page, then he&#8217;s won the game and <strong>reaps some profit when wandering visitors click his ads</strong>. At least that&#8217;s one scenario.</p>
<h3>The techniques of content generation</h3>
<p>I knew about this, vaguely, but the other day I stumbled upon this <a href="http://thenextweb.org/2008/09/27/ep4-companies-who-make-money-datapresser/" target="_blank">article over at TheNextWeb concerning DataPresser</a>. In a nutshell, this is a tool that allows you to <strong>generate content automatically, by following rules</strong>.</p>
<p>With <a href="http://www.datapresser.com" target="_blank">DataPresser</a>, for example, you can <strong>generate all sorts of variations</strong> on &#8220;Find cool wallpapers of _______&#8221;, where ______ is a <strong>blank filled from a database</strong>. Not only can you replace the blank, you can even <strong>change the way the sentence is worded</strong>, using synonyms or even alternative grammatical constructions. That&#8217;s to avoid being flagged as &#8220;duplicate content&#8221; by Google, which obviously tries to eliminate such sites from its results. As you can see, it&#8217;s a game of cat and mouse.</p>
<p>The general keyword for this activity is <strong>&#8220;Content Generation&#8221;</strong>. There are many techniques and tools. Some will simply generate new pages by <strong>republishing from RSS feeds</strong> found elsewhere, with ads slapped around. Others will accumulate tons of text and <strong>mix bits of sentences from here and there</strong> to create text that doesn&#8217;t make sense but appears correct to search engines. You can even buy <a href="http://www.seocracy.com/datasets/list" target="_blank">whole <strong>databases of content</strong>, say game cheat codes</a>, rather cheaply.</p>
<h3>Why mass-produce?</h3>
<p>Obviously, it&#8217;s profitable to mass-produce text to cover many topics, and therefore many keywords. But there&#8217;s another reason: search engines give more credit to sites with links pointing to them. That&#8217;s why some content generation <strong>involves the creation of many sites linking to each other</strong> (the whole thing is called a <a href="http://en.wikipedia.org/wiki/Link_farm" target="_blank"><strong>link farm</strong></a>). Another &#8220;link-building&#8221; technique concerns message boards and blog comments, with posts being made solely to create links that add to a site&#8217;s search-engine karma.</p>
<h3>Conclusion</h3>
<p>I think this phenomenon plays a big role in <strong>understanding the huge &#8220;size&#8221; of the Web</strong> (number of pages). A very simple technique to generate more revenue for a given site, for example, is to split an article into multiple pages so more ads can be displayed. But with these MFA sites, we&#8217;re talking about generating thousands of pages at the click of a button!</p>
<p>In the end, it all <strong>boils down to spam and background noise</strong>. Whatever the service, if it can generate buzz, chances are it&#8217;ll be exploited.</p>
<h3>References</h3>
<ul>
<li><a href="http://www.slightlyshadyseo.com/index.php/the-basics-of-content-generation-methods-coherrence-and-unique-content/" target="_blank">The Basics of Content Generation: Methods, Coherrence, and Unique Content</a></li>
<li><a href="http://www.seoshadow.com/2008/09/madlibbing-your-way-to-riches-with-datapresser/" target="_blank">Madlibbing your way to riches with Datapresser</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.fsavard.com/flow/2008/10/for-the-curious-the-incentive-to-mass-produce-information/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>How a popular blogger reads 600+ RSS feeds every day</title>
		<link>http://www.fsavard.com/flow/2008/09/how-a-popular-blogger-reads-600-rss-feeds-every-day/</link>
		<comments>http://www.fsavard.com/flow/2008/09/how-a-popular-blogger-reads-600-rss-feeds-every-day/#comments</comments>
		<pubDate>Tue, 30 Sep 2008 20:29:32 +0000</pubDate>
		<dc:creator>Francois</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[information overload]]></category>
		<category><![CDATA[pkm]]></category>
		<category><![CDATA[RSS]]></category>

		<guid isPermaLink="false">http://www.fsavard.com/flow/?p=186</guid>
		<description><![CDATA[This is about a year old, but very relevant here. Timothy Ferriss, author of "The 4-Hour Workweek", <strong>interviewed Robert Scoble and filmed his RSS reading process</strong> (he's subscribed to more than 600 feeds!).

Read on for the <strong>video</strong>.]]></description>
			<content:encoded><![CDATA[<p>This is about a year old, but very relevant here. Timothy Ferriss, author of &#8220;The 4-Hour Workweek&#8221;, <a href="http://www.fourhourworkweek.com/blog/2007/05/16/how-scoble-reads-622-rss-feeds-each-morning/" target="_blank">interviewed Robert Scoble and filmed his RSS reading process</a> (he&#8217;s subscribed to more than 600 feeds!).</p>
<p>In the end, perhaps unsurprisingly, the magic relies on being really quick at judging an article from its title, its overall look and other cues. There are a couple of technical tips, though, about using the Google Reader interface efficiently, like relying on keyboard shortcuts.</p>
<p>Here&#8217;s the video:</p>
<p style="text-align: center;"><object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="437" height="370" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="id" value="viddler" /><param name="allowScriptAccess" value="always" /><param name="allowFullScreen" value="true" /><param name="wmode" value="transparent" /><param name="src" value="http://www.viddler.com/player/6be21c4f/" /><embed id="viddler" type="application/x-shockwave-flash" width="437" height="370" src="http://www.viddler.com/player/6be21c4f/" wmode="transparent" allowfullscreen="true" allowscriptaccess="always"></embed></object></p>
]]></content:encoded>
			<wfw:commentRss>http://www.fsavard.com/flow/2008/09/how-a-popular-blogger-reads-600-rss-feeds-every-day/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Clay Shirky&#8217;s talk on information overload</title>
		<link>http://www.fsavard.com/flow/2008/09/clay-shirky-on-information-overload/</link>
		<comments>http://www.fsavard.com/flow/2008/09/clay-shirky-on-information-overload/#comments</comments>
		<pubDate>Mon, 22 Sep 2008 15:09:52 +0000</pubDate>
		<dc:creator>Francois</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[information overload]]></category>
		<category><![CDATA[pkm]]></category>
		<category><![CDATA[summary]]></category>

		<guid isPermaLink="false">http://www.fsavard.com/flow/?p=118</guid>
		<description><![CDATA[Via a LifeHacker story, I found this video of NYU New Media professor Clay Shirky&#8217;s opinion on the information overload problem. It&#8217;s very interesting, if a bit long, so I made a summary of some of his points: We always hear the same story: information being produced increasingly fast. That makes us feel good about [...]]]></description>
			<content:encoded><![CDATA[<p>Via a <a href="http://lifehacker.com/5052851/information-overload-is-filter-failure-says-shirky" target="_blank">LifeHacker story</a>, I found this <a href="http://web2expo.blip.tv/file/1277460/" target="_blank">video of NYU New Media professor Clay Shirky&#8217;s opinion on the information overload problem</a>. It&#8217;s very interesting, if a bit long, so I made a <strong>summary</strong> of some of his points:</p>
<ul>
<li>We <strong>always hear the same story: information being produced increasingly fast</strong>. That makes us feel good about ourselves: that&#8217;s why I can&#8217;t get anything done, see!</li>
</ul>
<p style="text-align: center;"><a href="http://www.flickr.com/photos/42182583@N00/2333232442/" target="_blank"><img class="alignnone size-full wp-image-122" title="IDC information overload chart" src="http://www.fsavard.com/flow/wp-content/uploads/2008/09/idc_chart_tn.jpg" alt="" width="250" height="210" /><br />
IDC information overload chart (mentioned in Shirky&#8217;s talk)</a></p>
<ul>
<li>In the past, editors had to filter for quality what came off the printing press, because of the financial risk involved if a book didn&#8217;t sell. But the Internet introduced &#8220;post-Gutenberg economics&#8221;, where the <strong>filter for quality is now &#8220;way downstream&#8221; from the source,</strong> since everyone may publish.</li>
<li>So we <strong>shouldn&#8217;t see the problem as one of information overproduction <em>at the source</em>, so much as one of personal filtering</strong>.</li>
<li>He takes email spam as an example: we set up filters, but after a while we notice more spam gets in anyway, and our filters need tweaking. It&#8217;s about <strong>old filters continuously breaking</strong> and needing to be fixed.</li>
<li>Social media and the Internet in general bring new systems that <strong>break old ways of exchanging information</strong>, which forces us to formalize and <strong>take responsibility for information flow issues</strong>: who our information might reach and how public it gets, as with the privacy of Facebook events.</li>
<li><strong>Conclusion</strong>: information overload is <strong>not just a superficial problem</strong>, something that can be solved by programming once and for all. Algorithms can help, yes, but we <strong>need to rethink social norms</strong> and, <strong>when we face overload, ask ourselves personally: which of my filters just broke?</strong></li>
</ul>
<p>As some people pointed out in the comments at LifeHacker, &#8220;solving&#8221; information overload is nothing new for another fundamental reason: it&#8217;s about choosing what we&#8217;re personally interested in. One cannot master every field there is, obviously. In the end, it&#8217;s about <strong>personal choice, not just about what&#8217;s universally &#8220;good&#8221; or &#8220;bad&#8221;</strong>. That&#8217;s one problem with social bookmarking: one story might be very interesting to you and not to the masses, not even to people in what you consider your own field.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.fsavard.com/flow/2008/09/clay-shirky-on-information-overload/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
