<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Missives from the Technocave &#187; Fun With SEO</title>
	<atom:link href="http://www.widgettwalls.com/category/fun-with-seo/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.widgettwalls.com</link>
	<description>Fresh from my brain pan to the breakfast tables of the world.</description>
	<lastBuildDate>Tue, 09 Nov 2010 09:15:14 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.1</generator>
		<item>
		<title>Redirecting .html to .htm via .htaccess</title>
		<link>http://www.widgettwalls.com/2007/12/14/redirecting-html-to-htm-via-htaccess/</link>
		<comments>http://www.widgettwalls.com/2007/12/14/redirecting-html-to-htm-via-htaccess/#comments</comments>
		<pubDate>Fri, 14 Dec 2007 06:12:17 +0000</pubDate>
		<dc:creator>Widge</dc:creator>
				<category><![CDATA[Fun With SEO]]></category>
		<category><![CDATA[archives]]></category>
		<category><![CDATA[axs]]></category>
		<category><![CDATA[cgi]]></category>
		<category><![CDATA[frames]]></category>
		<category><![CDATA[htaccess]]></category>
		<category><![CDATA[html]]></category>
		<category><![CDATA[images]]></category>
		<category><![CDATA[rewrites]]></category>
		<category><![CDATA[seo]]></category>

		<guid isPermaLink="false">http://www.widgettwalls.com/2007/12/14/redirecting-html-to-htm-via-htaccess/</guid>
		<description><![CDATA[Okay, so here&#039;s some fun. Make yourself comfortable. I went out to find an image for Needcoffee from the archives we&#039;ve already got on the site so I wouldn&#039;t have to grab it from elsewhere (remember, kids: recycle!) and realized that some of our archival pages weren&#039;t loading properly on the new server. Brief explanation: [...]]]></description>
			<content:encoded><![CDATA[<p>Okay, so here&#039;s some fun.  Make yourself comfortable.</p>
<p>I went out to find an image for Needcoffee from the archives we&#039;ve already got on the site so I wouldn&#039;t have to grab it from elsewhere (remember, kids: recycle!) and realized that some of our archival pages weren&#039;t loading properly on the new server.</p>
<p>Brief explanation: on Version 3 of the site (pre-Wordpress) we used frames.  We&#039;re not proud of it, but it made sense for 1998.  The content portion of the site was an .htm file.  The frame that went around it and contained the masthead and the menu/sidebar was an .html file.  So you could have, for example, a DVD review of <i>Drop Dead Fred</i> and the review itself was ddfred.htm while the frame that held it was ddfred.html.</p>
<p>Again, it was the best we could muster at the time.</p>
<p>Anyway, while checking out the new server for this old image I wanted, I discovered that the .html versions of pages were coughing up a lung.  &#034;An error occurred while processing this directive.&#034;  Lovely.  But the .htm versions were fine.  What to do?</p>
<p>Well, I tore open a .html file and saw that AXS, the web tracking bit we used to use on the old site (and still use in the archives), wasn&#039;t working because it was pointing to an old CGI directory and the CGI directory wasn&#039;t kosher any longer.</p>
<p>So the first thing I tried to do was get the old CGI directory to work, but I couldn&#039;t seem to figure out how to do that.  And I wanted to get the damn thing working.  So I decided, hey!  Screw it, I&#039;ll just do a rewrite via the .htaccess and when people go to the .html version, I&#039;ll just send them to the direct review instead.  That&#039;s actually better from an SEO perspective because back when we set up the whole framed system SEO wasn&#039;t even a twinkle in anybody&#039;s pants.</p>
<p>But somehow I botched it and the .htaccess blew up the site for a few minutes.  I had a Redirect that never redirected anywhere, and it didn&#039;t matter that it was in my /html/dvd/ directory: the server read down through .htaccess, couldn&#039;t go any further and barfed.</p>
<p>Why did this happen, children?  Because Uncle Widge fucked himself over.  He got in a hurry and forgot the cardinal rule of screwing around with your site:</p>
<h3>The Cardinal Rule of Screwing Around With Your Site:</h3>
<p><center><b>Don&#039;t be a dick.  Keep a backup and a fire extinguisher handy at all times.</b></center></p>
<p>That&#039;s right.  When you forget the Cardinal Rule, Yahweh himself will laugh at you and your site will explode.</p>
<p>So.  I finally got that corrected after a few minutes&#039; outage, and I was able to find some info on how to do this in .htaccess.</p>
<p>Not that I think anybody out there has the same setup as the old Needcoffee.com site, but still.  You can futz with this to match your own site.</p>
<p>This is a variation on what I found at SEOBook, which was the best and closest to what I was trying to do.</p>
<h3>Disclaimer:</h3>
<p><center><b>When it comes to .htaccess rewriting, I barely understand what I&#039;m doing.  I admit that up front, so if you want tips on this, I may or may not have any clue as to what you&#039;re asking.</b></center></p>
<p>So.</p>
<p>I basically have an .htaccess in the subdirectory where I want to make this change.</p>
<p>I added this to the top:</p>
<pre><code>Options +FollowSymlinks
RewriteEngine on
RewriteRule ^(.+)\.html http://www.needcoffee.com/html/$1.htm [r=301,nc]</code></pre>
<p>Here&#039;s what I can tell you.  The Options and RewriteEngine statements make the thing work to begin with.  The rule itself says: for every hit on a .html file, send the visitor to the matching .htm instead.  It also gives a 301 (permanent) redirect, which is good because search engines need to be able to find the content that&#039;s been &#034;moved,&#034; or in this case, simply bypassed.</p>
<p>The one thing I don&#039;t understand is, the way I read this code, it looks like it should only work for the /html/ directory, but it instead works in all subdirectories.  So that I&#039;m clueless about.  Just for the moment it works.</p>
<p>There?  Aren&#039;t you sorry you asked now?</p>
<p><i>Update:</i> Shade at That&#039;s My Stapler <a href="http://www.thatsmystapler.org/2007/12/14/rewritification/" target="_blank">tries to help my inept ass out</a>.  I think I almost understand what he&#039;s saying&#8230;I can do coding, me.</p>
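<p>For what it&#039;s worth, here&#039;s a tightened-up sketch of that rule, along with my best guess at the subdirectory mystery.  It assumes the .htaccess sits in /html/ (that&#039;s an assumption, so adjust for your own layout).  In a per-directory .htaccess, the pattern gets matched against the path <i>relative to that directory</i>, so a hit on /html/dvd/ddfred.html is seen as dvd/ddfred.html and $1 picks up the subdirectory along with the file name.  The trailing $ and the L flag are additions that weren&#039;t in the rule above.</p>
<pre><code>Options +FollowSymlinks
RewriteEngine on
# Assumption: this file lives in /html/, so /html/dvd/ddfred.html
# is matched here as "dvd/ddfred.html" and $1 becomes "dvd/ddfred".
# The $ anchors the match to names that actually end in .html.
# R=301 sends a permanent redirect, NC ignores case, L stops further rules.
RewriteRule ^(.+)\.html$ http://www.needcoffee.com/html/$1.htm [R=301,NC,L]</code></pre>
<p>Same Cardinal Rule applies: back up the working .htaccess before you paste anything like this in.</p>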
]]></content:encoded>
			<wfw:commentRss>http://www.widgettwalls.com/2007/12/14/redirecting-html-to-htm-via-htaccess/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Frame Breaker Breaks Google Images?</title>
		<link>http://www.widgettwalls.com/2007/11/08/frame-breaker-google-images/</link>
		<comments>http://www.widgettwalls.com/2007/11/08/frame-breaker-google-images/#comments</comments>
		<pubDate>Fri, 09 Nov 2007 03:10:45 +0000</pubDate>
		<dc:creator>Widge</dc:creator>
				<category><![CDATA[Fun With SEO]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[frames]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[google-images]]></category>
		<category><![CDATA[images]]></category>
		<category><![CDATA[seo]]></category>
		<category><![CDATA[traffic]]></category>

		<guid isPermaLink="false">http://www.widgettwalls.com/2007/11/08/frame-breaker-google-images/</guid>
		<description><![CDATA[This is interesting. So Headspace2, the badass plugin that I&#039;m using over on Needcoffee currently, has a Frame Breaker built into it. WTF is a Frame Breaker? Well, you know how when you search in Google Images, you get presented the site below a Google frame up top? The Frame Breaker breaks you out of [...]]]></description>
			<content:encoded><![CDATA[<p>This is interesting.  So <a href="http://urbangiraffe.com/plugins/headspace2/" target="_blank">Headspace2, the badass plugin</a> that I&#039;m using over on <a href="http://www.needcoffee.com" target="_blank">Needcoffee</a> currently, has a Frame Breaker built into it.</p>
<p>WTF is a Frame Breaker?</p>
<p>Well, you know how when you search in Google Images, you get presented the site below a Google frame up top?  The Frame Breaker breaks you out of that frame, natch, so you get served the site without the Google flavoring up top.</p>
<p>I always did wonder about that, so I flicked it on.  That was around the 22nd or 23rd of October.</p>
<p>I decided to come back and check to see how it had affected my traffic on Google Images.</p>
<p>Now, let me state this up front.  My understanding is that Google Images doesn&#039;t update very often.  And I also understand that for the majority of my images, I haven&#039;t SEO&#039;d them up worth a good goddamn because at the time I was putting them up, I had no idea why I should.  (Of course, people still seem to find me and <a href="http://imagepolicy.needcoffee.com/" target="_blank">force me to do shit like this</a>.)</p>
<p>So I don&#039;t have a great deal of traffic coming in anyway.  But let&#039;s look.</p>
<p>Google Images <i>spiked</i> the day I turned it on, then went to a reasonable trickle.</p>
<p>As for Google Images.ca?  I went from a trickle&#8230;to <em>nothing</em>.</p>
<p>Google Images.uk?  From an erratic mess to&#8230;<em>nothing</em>.</p>
<p>And pretty much on down the line.</p>
<p>Did it do anything to my regular Google results?  No.  No discernible change.</p>
<p>In fact, if I just do &#034;images&#034; and pull that chart up, I spike, and then crater.</p>
<p>Fascinating.  Now.  One of two things is happening.</p>
<p>1.  Either Google Images doesn&#039;t like the frame break and has something built in which makes me show up lower in the results because of it. Or&#8230; </p>
<p>2.  Google Images needs the frame to show up as Google Images in my Analytics.  </p>
<p>I don&#039;t know which.  If I had a huge amount of Google Images traffic, I would be able to see if my Google hits went up an equivalent amount, so maybe Images traffic was being counted as regular traffic.  Just a theory.</p>
<p>Or, if I had Analytics installed on my Version 3 archival part of the site, which has no Frame Breaker, I could see if it suffered, yes or no.</p>
<p>I&#039;ve heard rumors that breaking the frame caused Google Images to not speak to you anymore, but never really saw that substantiated.  And trying to Google terms like &#034;frame breaker google images penalty&#034; and the like didn&#039;t really get me anywhere.</p>
<p>Either way, I&#039;m taking off the Frame Breaker.  I&#039;ll see if it changes anything and if so, how quickly.  And hopefully, I&#039;ll get Google Images out here to reindex my site with the SEO&#039;d bits I <i>do</i> have.</p>
<p>If anybody has their own experience, I&#039;d love to hear it.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.widgettwalls.com/2007/11/08/frame-breaker-google-images/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Never Underestimate the Power of Stumbleupon</title>
		<link>http://www.widgettwalls.com/2007/10/17/never-underestimate-the-power-of-stumbleupon/</link>
		<comments>http://www.widgettwalls.com/2007/10/17/never-underestimate-the-power-of-stumbleupon/#comments</comments>
		<pubDate>Wed, 17 Oct 2007 16:30:34 +0000</pubDate>
		<dc:creator>Widge</dc:creator>
				<category><![CDATA[Fun With SEO]]></category>
		<category><![CDATA[stumbleupon]]></category>

		<guid isPermaLink="false">http://www.widgettwalls.com/2007/10/17/never-underestimate-the-power-of-stumbleupon/</guid>
		<description><![CDATA[I went and added Daily Kicksplode to my thumbs up bits, stating that it was one of mine.  Boom.  1200 visitors arrived.  Nice.]]></description>
			<content:encoded><![CDATA[<p>Have seen a few posts here and there about getting traffic via <a href="http://www.stumbleupon.com/" target="_blank">Stumbleupon</a> and thought, &#034;Oh, yeah, Stumbleupon.  I remember that.&#034;  I joined about two years ago and have given the thumbs up to over 400 bits.</p>
<p>So I went and added <a href="http://www.dailykicksplode.com/" target="_blank">Daily Kicksplode</a> to my thumbs up bits, stating up front that it was one of mine.</p>
<p>Boom.  1200 visitors arrived.  Nice.</p>
<p><span id="more-437"></span><br />
Now, before anybody gets all excited and starts bombarding Stumbleupon with their own sites, I would caution you as follows:</p>
<p>1.  The &#034;Don&#039;t Be a Dick&#034; rule is a no-brainer.  If you show up, register, and post your site and then leave, no one will ever pay attention to you, I wouldn&#039;t think.  We never allow that crap on <a href="http://gabfest.needcoffee.com" target="_blank">the Gabfest</a>, the hit-and-run promotions nonsense, and I&#039;m sure the Stumbleupon community won&#039;t put up with it either.</p>
<p>2.  If you&#039;re going to use the service, then use it. And use it to your benefit.  If you del.icio.us link something, Stumble it too.  The people who run the sites, if they&#039;re smart, will appreciate you doing so.</p>
<p>This has certainly got me active again on Stumbleupon because it gives results, obviously.  And I like what gives results.</p>
<p>I should have a report on <a href="http://www.projectwonderful.com/" target="_blank">Project Wonderful</a> here in a couple of days.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.widgettwalls.com/2007/10/17/never-underestimate-the-power-of-stumbleupon/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Robots.txt is Pretty Damn Important, Yes</title>
		<link>http://www.widgettwalls.com/2007/09/30/robotstxt-is-pretty-damn-important-yes/</link>
		<comments>http://www.widgettwalls.com/2007/09/30/robotstxt-is-pretty-damn-important-yes/#comments</comments>
		<pubDate>Sun, 30 Sep 2007 10:52:47 +0000</pubDate>
		<dc:creator>Widge</dc:creator>
				<category><![CDATA[Fun With SEO]]></category>
		<category><![CDATA[bots]]></category>
		<category><![CDATA[robots.txt]]></category>
		<category><![CDATA[search-engines]]></category>
		<category><![CDATA[seo]]></category>
		<category><![CDATA[serp]]></category>
		<category><![CDATA[Wordpress]]></category>

		<guid isPermaLink="false">http://www.widgettwalls.com/2007/09/30/robotstxt-is-pretty-damn-important-yes/</guid>
		<description><![CDATA[Just found this. Technically, it&#039;s correct: you don&#039;t need a robots.txt for good SEO. However, it&#039;s not that simple. Part of the problems I was having on my sites was that the search engine bots weren&#039;t just crawling my site, they were freaking pounding it into a fine powder. Oh sure, if you&#039;ve got a [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.seroundtable.com/archives/014675.html" target="_blank">Just found this.</a></p>
<p>Technically, it&#039;s correct: you don&#039;t need a robots.txt for good SEO.  </p>
<p>However, it&#039;s not that simple.  Part of the problem I was having on my sites was that the search engine bots weren&#039;t just crawling my site, they were freaking pounding it into a fine powder.  Oh sure, if you&#039;ve got a big enough server, you can afford to let them run all over you&#8211;but I&#039;m doing this crap on a budget.</p>
<p>If you&#039;re on a budget hosting service, or to put it another way, if you&#039;re using the cheapest hosting you feel you can get away with&#8211;you have to make sure you&#039;re not throwing away bandwidth or CPU cycles.</p>
<p>Look at your access logs.  Are you getting hammered every couple of seconds by Googlebot?  Or the Yahoo bot?  Or any bot for that matter?</p>
<p>If you have a robots.txt, are the bots reading it and heeding it?</p>
<p>It&#039;s one thing if you&#039;ve got flat HTML pages for your site, but even with wp-cache running, WordPress can bog down if a bot is allowed to run rampant.  And if your site is slow or can&#039;t be crawled properly because the bots have bogged it down, then yeah, that can affect your SEO.</p>
<p>Now you know.  And knowing is half the battle.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.widgettwalls.com/2007/09/30/robotstxt-is-pretty-damn-important-yes/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>WordPress Duplicate Content: You Control the Horizontal, You Control the Vertical</title>
		<link>http://www.widgettwalls.com/2007/08/29/wordpress-duplicate-content/</link>
		<comments>http://www.widgettwalls.com/2007/08/29/wordpress-duplicate-content/#comments</comments>
		<pubDate>Wed, 29 Aug 2007 14:09:39 +0000</pubDate>
		<dc:creator>Widge</dc:creator>
				<category><![CDATA[Fun With SEO]]></category>

		<guid isPermaLink="false">http://www.widgettwalls.com/2007/08/29/wordpress-duplicate-content/</guid>
		<description><![CDATA[Okay, so, all you hear is WordPress = easy to have duplicate content.  And it is.  It's quite true. ]]></description>
			<content:encoded><![CDATA[<p>Okay, so, all you hear is WordPress = easy to have duplicate content.  And it is.  It&#039;s quite true.  In fact, that may have bitten me on the arse on <a href="http://www.needcoffee.com/" target="_blank">Needcoffee</a> a few months back.  </p>
<p>An individual post can appear:</p>
<ul>
<li>As itself</li>
<li>In a category (as many as it has categories assigned to it)</li>
<li>In a tag page (as many as it has tags assigned to it)</li>
</ul>
<p>So if I&#039;ve got a review of a TV DVD set, it could appear in the categories: TV, DVD and Reviews.  Plus if I&#039;ve got five tags on it, well&#8230;you see the concern.  Without even meaning to, I&#039;ve got the same article showing up nine places.</p>
<p>What to do?  Well, initially I went into my robots.txt file and just told the search engines to stay the hell away from everything but my individual posts:</p>
<p><em>Disallow: /tag/<br />
Disallow: /category/</em></p>
<p>Then the pendulum swings back the other way&#8211;are all of my pages close enough to my front page that they get nabbed by Google and are considered important?  Basically, if nobody externally is linking to a post, and it&#039;s too many hops/links away from the front page, it can be abandoned and left for dead.  Or something equally dramatic.</p>
<p>So the happy medium is to have a post, split it up with the MORE tag, and then allow categories and individual posts in the robots.txt file, but then only have one category per post.</p>
<p>Ah, so: problem for me.  I can have a post, as stated above, that&#039;s for a TV DVD set.  Now I can see saying that DVD can be chucked because primarily it&#039;s a TV item, the media just happens to be DVD.  Fine.  But, well, it is a review.  And what if somebody wants to browse all our reviews?  Do I really want them to have to use the tag?  </p>
<p>Then it struck me like a slice of provolone from the blue: &#034;What a dumbass, just exclude the individual categories that you know are always going to be tied to something else.&#034;  For example, we will always have Reviews AND TV.  Or Reviews AND Movies.  Just exclude that category.  So I re-allowed /category/ and instead just did this:</p>
<p><em>Disallow: /category/reviews/<br />
Disallow: /category/press/</em></p>
<p>Done deal.  Now, granted, I have some posts with multiple categories that need to be cleaned up, but that can be done easily enough.</p>
<p>I wish there were a lot more SEO tools built in with WordPress, honestly.  And maybe this exists among the fifty gajillion plugins and I haven&#039;t seen it, but a way of looking at all my posts and being able to check/uncheck categories en masse would be nice.  Or even a plugin that went out, looked at how you&#039;ve got your posts, robots.txt and such set up, and graded you for duplicate content.  You know, you are at a 56% chance of being SOL because you&#039;ve got too little content in too many places.  Something like that.</p>
<p><em>Anyway, What Have We Learned?</em></p>
<p>1.  <strong>There are no absolute, hard and fast rules to SEO.</strong>  And even that, being a hard and fast rule, is subject to scrutiny.  Sure, you need to do stuff like use decent titles, decent URLs, and have your server, you know, actually online.  There are some no-brainers, but just because you find a post that says Your Robots.TXT Must Look Like This or You Are Doomed, well, have your grain of salt handy.  Make sure what you&#039;re doing works for your individual site, because as I find, anyway, most SEO posts are for sites that aren&#039;t, shall we say, dealing in the trade of pop culture.</p>
<p>2.  <strong>You can control a shitload of stuff about your site.</strong>  I&#039;m amazed at how many people actually <i>don&#039;t have</i> a robots.txt file.  Or an .htaccess file (that they know of).  I keep forgetting just how much power I have to shape what goes on on the site.  It&#039;s a good idea to take five minutes a week and step back from the grind of posting and just go, &#034;Right.  Do I have my hatches battened down?&#034;</p>
<p>3.  <strong>Google Webmaster Tools are your friend.</strong>  The robots.txt analyzer they provide has already saved me from fifteen really stupid things I could have done to cut my site off from the outside world.  I highly recommend you do not make any changes to your robots.txt without running it through theirs first.  And don&#039;t just check the Googlebot.  Check the image-bot and check the media-bot (if you&#039;re running AdSense).</p>
]]></content:encoded>
			<wfw:commentRss>http://www.widgettwalls.com/2007/08/29/wordpress-duplicate-content/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Exabot-Thumbnails?</title>
		<link>http://www.widgettwalls.com/2007/08/15/exabot-thumbnails/</link>
		<comments>http://www.widgettwalls.com/2007/08/15/exabot-thumbnails/#comments</comments>
		<pubDate>Thu, 16 Aug 2007 04:41:25 +0000</pubDate>
		<dc:creator>Widge</dc:creator>
				<category><![CDATA[Fun With SEO]]></category>

		<guid isPermaLink="false">http://www.widgettwalls.com/2007/08/15/exabot-thumbnails/</guid>
		<description><![CDATA[Received a 500 error earlier this morning trying to post on Needcoffee.  Checking the access logs it appears I have a new friend: Exabot-Thumbnails.]]></description>
			<content:encoded><![CDATA[<p>Received a 500 error earlier this morning trying to post on Needcoffee.  Checking the access logs it appears I have a new friend: Exabot-Thumbnails.  Here&#039;s a sample line:</p>
<blockquote><p>193.47.80.77 &#8211; - [14/Aug/2007:01:25:22 -0700] &#034;GET /updates/tag/androids HTTP/1.0&#034; 301 242 &#034;-&#034; &#034;Mozilla/5.0 (compatible; Konqueror/3.5; Linux) KHTML/3.5.5 (like Gecko) (Exabot-Thumbnails)&#034;</p></blockquote>
<p>Apparently there&#039;s been some other Exabots in the past, but nothing that&#039;s specifically &#034;Exabot-Thumbnails.&#034;  There&#039;s no info on a site to check for who the hell owns this bot, nor, from what I can tell, is it nabbing my robots.txt file.  So&#8230;evil bot = .htaccess smackdown by IP address.</p>
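<p>For the curious, the smackdown can be sketched like this.  It uses the Apache 2.2-style access directives and the IP address from the log line above.  The bad_bot variable name is just something made up for the example, and the user-agent match is an optional extra in case the bot shows up from a different address.</p>
<pre><code># Let everybody in, then slam the door on the one address from the log.
Order Allow,Deny
Allow from all
Deny from 193.47.80.77

# Optional: also refuse anything announcing itself as Exabot-Thumbnails.
# "bad_bot" is a made-up variable name.
SetEnvIfNoCase User-Agent "Exabot-Thumbnails" bad_bot
Deny from env=bad_bot</code></pre>
<p>Newer Apache (2.4) replaces Order/Allow/Deny with the Require directives, so check which version your host runs before pasting this in.</p>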
<p>If anybody knows anything more about this, let me know.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.widgettwalls.com/2007/08/15/exabot-thumbnails/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>I&#039;m in Supplemental Hell, I Just Don&#039;t Know It Now: All Better</title>
		<link>http://www.widgettwalls.com/2007/08/01/im-in-supplemental-hell-i-just-dont-know-it-now-all-better/</link>
		<comments>http://www.widgettwalls.com/2007/08/01/im-in-supplemental-hell-i-just-dont-know-it-now-all-better/#comments</comments>
		<pubDate>Wed, 01 Aug 2007 07:12:17 +0000</pubDate>
		<dc:creator>Widge</dc:creator>
				<category><![CDATA[Fun With SEO]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[seo]]></category>
		<category><![CDATA[serp]]></category>
		<category><![CDATA[supplemental-results]]></category>
		<category><![CDATA[WTF]]></category>

		<guid isPermaLink="false">http://www.widgettwalls.com/2007/08/01/im-in-supplemental-hell-i-just-dont-know-it-now-all-better/</guid>
		<description><![CDATA[For those who don't know (and probably don't care, if you're not a webmaster), there is a secondary set of search results you can get from Google.  It's called Supplemental Results.  It might as well be called "The Results That Aren't As Good As The Real Results."  Nobody but nobody wants to be in them.]]></description>
			<content:encoded><![CDATA[<p>Well, <a href="http://googlewebmastercentral.blogspot.com/2007/07/supplemental-goes-mainstream.html" target="_blank">this is frustrating</a>.</p>
<p>For those who don&#039;t know (and probably don&#039;t care, if you&#039;re not a webmaster), there is a secondary set of search results you can get from Google.  It&#039;s called Supplemental Results.  It might as well be called &#034;The Results That Aren&#039;t As Good As The Real Results.&#034;  Nobody but nobody wants to be in them.</p>
<p>A couple of months ago, I noticed that a goodly number of <a href="http://www.needcoffee.com" target="_blank">Needcoffee</a>&#039;s entries had wound up in the Supplemental Results.  At first, it appeared that this was because we had a lot of duplicate content: tag pages, category pages, date pages&#8211;all with the same posts.  All right, fair enough&#8211;I set up a robots.txt that kept the Googlebot from indexing pages that I didn&#039;t want, and kept single entries as indexable.</p>
<p>However, stuff continues to slide into Supplemental Results.  Right now I&#039;m toying with internal links to try and get things under control, but Google has effectively blinded me to how well I&#039;m doing.  The article should more properly have been called &#034;Supplemental Goes Stealth.&#034;</p>
<p>This doesn&#039;t fix anything.  In fact, it makes my job as a webmaster even more difficult.  </p>
<p>It would be one thing if there was a webmaster tool that said, &#034;Hey, Widge, here&#039;s what&#039;s wrong with your page and why it slid into Supplemental Hell.&#034;  Then I would go and fix it.  However, now I not only don&#039;t know why this is happening, I can&#039;t even see it happening any longer.  So the problem has just gotten a lot worse.  Google&#039;s solution to the problem is simply to make it impossible to see the problem.  But the problem hasn&#039;t gone away.</p>
<p>This, frankly, sucks.  And this is me, Google enthusiast and defender, talking here.  Why is Google doing this?  I run AdSense on Needcoffee.  Why would they make it harder for people to find pages on my site and thus harder to get at the ad revenue that I could potentially bring in?  And this is not just my site&#8211;AdSense is all over the place, and this affects everybody&#039;s sites.  It would be in Google&#039;s best interests, I would think, to provide us with the tools so we can make our sites work better with their search engine, so everybody wins.  Again, I&#039;m not one of these whiny assholes who thinks Google owes me this&#8211;they owe me jack crap.  It&#039;s just hard to understand why they would respond to a problem not by throwing their vaunted resources at it but by making it <i>look like</i> it&#039;s gone away and hoping nobody bitches.</p>
<p>Somebody help me understand how this is a good idea.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.widgettwalls.com/2007/08/01/im-in-supplemental-hell-i-just-dont-know-it-now-all-better/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Nofollow: Now They Freaking Tell Me</title>
		<link>http://www.widgettwalls.com/2007/05/08/nofollow-now-they-freaking-tell-me/</link>
		<comments>http://www.widgettwalls.com/2007/05/08/nofollow-now-they-freaking-tell-me/#comments</comments>
		<pubDate>Tue, 08 May 2007 19:56:41 +0000</pubDate>
		<dc:creator>Widge</dc:creator>
				<category><![CDATA[Fun With SEO]]></category>
		<category><![CDATA[nofollow]]></category>
		<category><![CDATA[seo]]></category>
		<category><![CDATA[WTF]]></category>

		<guid isPermaLink="false">http://www.widgettwalls.com/2007/05/08/nofollow-now-they-freaking-tell-me/</guid>
		<description><![CDATA[But anyway, so this post at Black Hat really took the wind out of me, honestly.  I've always considered myself a "black" hat SEO guy just because I only wear black, so I'm kind of one of them by default.  But still--"bad form"?]]></description>
			<content:encoded><![CDATA[<p>So there I am, trolling through my blogs, behind as always, when I come across this line <a href="http://seoblackhat.com/2007/05/08/nofollow-and-how-google-really-verifys-links/" target="_blank">from SEO Black Hat</a>:</p>
<p><i>If you have verified the link and you are putting it on your site, do not put a nofollow on it. It&#039;s bad form.</i></p>
<p>Well, this threw me for a loop.  And that&#039;s not a coding joke.  </p>
<p>Back before I even started seriously screwing around with SEO, I ran across a few blog posts (and I won&#039;t even begin to fathom where they were) that seemed to know what they were talking about and that explained the problem with bleeding PageRank.  Basically, if you link out to a page, you give them a skosh of your PageRank.  At the time, PageRank seemed like something you wanted to hang onto.  And in my case, since I was trying to build a site properly, hang onto for dear life.</p>
<p>The way to get out of this was to stick a rel=&#034;nofollow&#034; on your outbound links.  This would tell the search engines that mojo leakage was not in order, thank you very much.</p>
<p>Okay, fine.  Makes sense.</p>
<p>But here&#039;s where we learn that SEO is a lot like medical science.  I shall explain.</p>
<p>One of my problems with doctors is that you get a lot of facts with your diagnosis but also a ton of opinion and interpretation.  And also, they can&#039;t ever seem to agree on a lot of things.  That&#039;s why red meat and caffeine are like Schrodinger&#039;s Diet&#8211;they exist in a &#034;good for you/bad for you&#034; state simultaneously, because nobody can make up their mind whether or not one or the other or both will kill you.</p>
<p>Same thing with SEO.  Because it&#039;s a bunch of folks peering in through frosted glass at the inner workings of the search engines, they&#039;re guessing.  And at the time, it seemed like a lot of people were making honest guesses about nofollow and what it meant and what you should do about it.</p>
<p>But anyway, so this post at Black Hat really took the wind out of me, honestly.  I&#039;ve always considered myself a &#034;black&#034; hat SEO guy just because I only wear black, so I&#039;m kind of one of them by default.  But still&#8211;&#034;bad form&#034;?  Fuck, the last thing I want to be accused of is bad form.  I mean, if I&#039;m going to be accused of being an asshole, I&#039;d prefer it to be for something I meant to do.</p>
<p>So I started combing around.  Here&#039;s <a href="http://scobleizer.com/2007/02/14/i-was-wrong-about-nofollow/" target="_blank">this</a> from Scobleizer.  Which led me to <a href="http://www.searchenginejournal.com/13-reasons-why-nofollow-tags-suck/4410/" target="_blank">this from Search Engine Journal</a>:</p>
<p><i>Linking to someone with a NoFollow attribute is a sign of not trusting them. It&#039;s like reaching to shake someone&#039;s hand, but stopping to put on a pair of latex gloves.</i></p>
<p>Now, excluding for a moment that I might wear latex gloves when shaking somebody&#039;s hand because in my old age I&#039;m finding I&#039;m about three steps away from becoming Monk, still&#8230;<i>FUCK</i>.  Now they tell me.</p>
<p>And upon reflection, since I am, after all, a Machiavellian bastard who wants to come out on top of everything (but at least I tell you this up front), I decided this makes sense.  First of all, I don&#039;t want to look like an asshole unless I am an asshole.  And I have plenty of other opportunities to be assholish that actually make sense.  So.  Second, I can&#039;t honestly tell you what PageRank does for me when it comes to Search Engine standings.  I don&#039;t have the best PageRank in the world, and yet I seem to make out just fine.  So.  Fuck it.</p>
<p>I just went and did an uber-find-and-replace and all the nofollow shit should be gone on Needcoffee.  Effective immediately.  If you find any that&#039;s still there, let me know.  I&#039;ll fix it on here soon enough&#8230;I&#039;ve got other stuff broken on here since the move.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.widgettwalls.com/2007/05/08/nofollow-now-they-freaking-tell-me/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>SEO Tip: Quick and Dirty Access Log Fun: What Do I Need to Optimize First?</title>
		<link>http://www.widgettwalls.com/2007/04/27/seo-tip-quick-and-dirty-access-log-fun-what-do-i-need-to-optimize-first/</link>
		<comments>http://www.widgettwalls.com/2007/04/27/seo-tip-quick-and-dirty-access-log-fun-what-do-i-need-to-optimize-first/#comments</comments>
		<pubDate>Fri, 27 Apr 2007 05:02:54 +0000</pubDate>
		<dc:creator>Widge</dc:creator>
				<category><![CDATA[Fun With SEO]]></category>
		<category><![CDATA[access-logs]]></category>
		<category><![CDATA[image-optimization]]></category>
		<category><![CDATA[images]]></category>
		<category><![CDATA[optimization]]></category>
		<category><![CDATA[seo]]></category>
		<category><![CDATA[tips]]></category>
		<category><![CDATA[web-admin]]></category>
		<category><![CDATA[Wordpress]]></category>

		<guid isPermaLink="false">http://www.widgettwalls.com/2007/04/27/seo-tip-quick-and-dirty-access-log-fun-what-do-i-need-to-optimize-first/</guid>
		<description><![CDATA[Well, you haven't seen me a lot around here because I've been up to my elbows in code over on Needcoffee.  I've been trying to optimize the site while fighting with WP-Cache, which I can't live without due to my traffic, but which also kills me if I try to update the site while it's turned on.  I'm still trying to figure out that silly shit.]]></description>
			<content:encoded><![CDATA[<p>Well, you haven&#039;t seen me a lot around here because I&#039;ve been up to my elbows in code over on <a href="http://www.needcoffee.com">Needcoffee</a>.  I&#039;ve been trying to optimize the site while fighting with WP-Cache, which I can&#039;t live without due to my traffic, but which also kills me if I try to update the site while it&#039;s turned on.  I&#039;m still trying to figure out that silly shit.</p>
<p>Anyway, now that I&#039;ve finally gotten a bunch of superfluous ne&#039;er-do-wells gone from my access.log file, I can finally look at it and see what&#039;s happening moment by moment to try and address the problem.  One thing I wanted to see, though, is what is really taxing the memory of my server space.</p>
<p>Trouble is, the access.log I get from Dreamhost (which I can only assume is the same sort you get from where you are) looks like this:</p>
<p>x.x.x.x - - [26/Apr/2007:00:36:50 -0700] &#034;GET /wp-content/plugins/podpress/podpress_js.php HTTP/1.1&#034; 200 2311 &#034;http://www.needcoffee.com/2006/03/08/power-rangers-dino-thunder-vol-3-dvd-review/&#034; &#034;Mozilla/5.0 (Windows; U; Win98; en-US; rv:1.8.1.3) Gecko/20070309 Firefox/2.0.0.3&#034;</p>
<p><span id="more-387"></span><br />
Now, barring for a moment that someone is actually viewing a Power Rangers review and we must find them and stop them from breeding, imagine 10MB of that.  That&#039;s how much I&#039;ve got for a full day&#039;s access log, and that&#039;s after I&#039;ve been working for a few days to optimize my robots.txt file.  </p>
<p>Now if we want quick results to cut down on the really bad big files that we&#039;ve got out there, what can we do?  Especially when, like Needcoffee, we&#039;re looking at a site that&#039;s been around for ten years, and has scads and scads of pre-WordPress material that hasn&#039;t been converted yet.  That log is a mess.</p>
<p>Well, the obvious thing would be to sort the log file by size of the file being requested, and I&#039;ve seen some sites promising Perl scripts or whatever, but I thought there had to be an easier way.</p>
<p>And here it is.</p>
<p>1. Take your access.log and open it in a text editor.  Now, granted, if you&#039;re looking to do a 10MB access log, WordPad will cough up a lung so grab something like EditPad or the like, or just use a subset of the log.  </p>
<p>2. Do a find and replace.  You want to find a space, i.e. &#034; &#034; and replace it with a comma &#034;,&#034;.  Since we don&#039;t care about any data that would get screwed up by doing this, go for it.</p>
<p>3. Save the file with a .csv extension.</p>
<p>4. Open the file in Excel (or equivalent) as a text .csv file.</p>
<p>5.  This should put the info into a spreadsheet where you should have a column for size.  On my version, it&#039;s column H, since the quoted request string stays together in one cell.  Sort by H, largest first, and take a look.</p>
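<p>If you&#039;d rather skip the spreadsheet, the same idea can be sketched in a few lines of Python.  This is a rough sketch, not a polished tool: it assumes the log is in the same layout as the sample line above, and the names LOG_LINE and largest_requests are made up for illustration.  It keeps the biggest response seen for each path and lists the heaviest ones first.</p>

```python
import re

# Assumed layout (same as the sample line above):
# ip ident user [timestamp] "METHOD path protocol" status bytes "referer" "agent"
LOG_LINE = re.compile(r'^\S+ \S+ \S+ \[[^\]]+\] "([^"]*)" (\d{3}) (\d+|-)')


def largest_requests(lines, top=10):
    """Return (size, path) pairs for the biggest responses, one per path."""
    biggest = {}
    for line in lines:
        match = LOG_LINE.match(line)
        if match is None:
            continue  # skip anything that doesn't look like a log entry
        request, _status, size = match.groups()
        if size == "-":
            continue  # the server logged no byte count for this response
        parts = request.split()
        # A normal request is "METHOD path protocol"; otherwise keep it whole.
        path = parts[1] if len(parts) == 3 else request
        biggest[path] = max(biggest.get(path, 0), int(size))
    # Sort largest first, then trim to the requested number of rows.
    ranked = sorted(((size, path) for path, size in biggest.items()), reverse=True)
    return ranked[:top]

# Hypothetical usage, assuming the log sits next to the script:
#     with open("access.log", errors="replace") as log:
#         for size, path in largest_requests(log):
#             print(size, path)
```

<p>The podcasts will still float to the top, same as in the spreadsheet, so bump up the top value or skim past them to get to the stuff that shouldn&#039;t be that big.</p>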
<p>In my case, once I get past the podcasts and such that are supposed to be large I find&#8230;wow, holy crap: there&#039;s a JPG on here that&#039;s 73KB that flat out doesn&#039;t need to be.</p>
<p>Also, prototype.js, which WordPress uses for the admin panels, is about that size as well.  I wish somebody would create a stripped down, no FX, just want to get the shit done WordPress admin theme, for those of us who&#8230;well, just want to get the shit done.</p>
<p>Anyway, there you go.  Enjoy.</p>
<p>If this is helpful, I may post more stuff like this as I find it.  </p>
]]></content:encoded>
			<wfw:commentRss>http://www.widgettwalls.com/2007/04/27/seo-tip-quick-and-dirty-access-log-fun-what-do-i-need-to-optimize-first/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

