<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Robots.txt Secrets From Matt Cutts</title>
	<atom:link href="http://www.askapache.com/seo/robotstxt-mattcutts-noindex.html/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.askapache.com/seo/robotstxt-mattcutts-noindex.html</link>
	<description>Advanced Web Development</description>
	<lastBuildDate>Wed, 18 Nov 2009 23:28:48 -0500</lastBuildDate>
	
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: MetaT</title>
		<link>http://www.askapache.com/seo/robotstxt-mattcutts-noindex.html#comment-93677</link>
		<dc:creator>MetaT</dc:creator>
		<pubDate>Sat, 08 Aug 2009 14:45:17 +0000</pubDate>
		<guid isPermaLink="false">http://www.askapache.com/seo/robotstxt-mattcutts-noindex.html#comment-93677</guid>
		<description>Is there a way to block the flow of page rank by just using the robots.txt file without the meta tags? or do you have to use both?</description>
		<content:encoded><![CDATA[<p>Is there a way to block the flow of page rank by just using the robots.txt file without the meta tags? or do you have to use both?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Pragmites</title>
		<link>http://www.askapache.com/seo/robotstxt-mattcutts-noindex.html#comment-43425</link>
		<dc:creator>Pragmites</dc:creator>
		<pubDate>Wed, 23 Jul 2008 07:42:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.askapache.com/seo/robotstxt-mattcutts-noindex.html#comment-43425</guid>
		<description>This guy&#039;s a genius. He managed to explain all the terms with such simplicity. I had a doubt in my mind about the NoIndex pages and if their outgoing links could have page ranks.</description>
		<content:encoded><![CDATA[<p>This guy&#8217;s a genius. He managed to explain all the terms with such simplicity. I had a doubt in my mind about the NoIndex pages and if their outgoing links could have page ranks.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: John H. Gohde</title>
		<link>http://www.askapache.com/seo/robotstxt-mattcutts-noindex.html#comment-42957</link>
		<dc:creator>John H. Gohde</dc:creator>
		<pubDate>Wed, 09 Jul 2008 01:25:11 +0000</pubDate>
		<guid isPermaLink="false">http://www.askapache.com/seo/robotstxt-mattcutts-noindex.html#comment-42957</guid>
		<description>Upon a second reading, I am now able to comprehend Matt Cutts&#039;s quote from my last comment.  Matt was referring to the concept of &quot;Google Bombing&quot; where totally erroneous webpages can show up in the SERPs because of the off-page factor of having other websites pointing to them with anchor text.  Still, I find that a violation of trust committed by Google.  When I specify that I don&#039;t want a webpage indexed or crawled, I most certainly do NOT want that page to show up in Google&#039;s SERPs under any circumstances.  Seems simple enough to me.</description>
		<content:encoded><![CDATA[<p>Upon a second reading, I am now able to comprehend Matt Cutts&#8217;s quote from my last comment.  Matt was referring to the concept of &#8220;Google Bombing&#8221; where totally erroneous webpages can show up in the SERPs because of the off-page factor of having other websites pointing to them with anchor text.  Still, I find that a violation of trust committed by Google.  When I specify that I don&#8217;t want a webpage indexed or crawled, I most certainly do NOT want that page to show up in Google&#8217;s SERPs under any circumstances.  Seems simple enough to me.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: John H Gohde</title>
		<link>http://www.askapache.com/seo/robotstxt-mattcutts-noindex.html#comment-40624</link>
		<dc:creator>John H Gohde</dc:creator>
		<pubDate>Mon, 12 May 2008 10:01:41 +0000</pubDate>
		<guid isPermaLink="false">http://www.askapache.com/seo/robotstxt-mattcutts-noindex.html#comment-40624</guid>
		<description>Quoting Matt Cutts:
&lt;blockquote cite=&quot;http://www.&quot;&gt;&quot;Now, robots.txt says you are not allowed to crawl a page, and Google therefore does not crawl pages that are forbidden in robots.txt. However, they can accrue PageRank, and they can be returned in our search results.&quot;&lt;/blockquote&gt;
If that is not proof that Google does anything that it feels like doing, then what is?
Yeah, right, sure ... Google respects your robots.txt directives not to crawl a given webpages.  Google is going to return its content in the search results!!!
With double-talk like this, I am going to depend on using my own brain and not read more dribble from Matt Cutts.</description>
		<content:encoded><![CDATA[<p>Quoting Matt Cutts:</p>
<blockquote cite="http://www."><p>&#8220;Now, robots.txt says you are not allowed to crawl a page, and Google therefore does not crawl pages that are forbidden in robots.txt. However, they can accrue PageRank, and they can be returned in our search results.&#8221;</p></blockquote>
<p>If that is not proof that Google does anything that it feels like doing, then what is?<br />
Yeah, right, sure &#8230; Google respects your robots.txt directives not to crawl a given webpages.  Google is going to return its content in the search results!!!<br />
With double-talk like this, I am going to depend on using my own brain and not read more dribble from Matt Cutts.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: R. Richard Hobbs</title>
		<link>http://www.askapache.com/seo/robotstxt-mattcutts-noindex.html#comment-37572</link>
		<dc:creator>R. Richard Hobbs</dc:creator>
		<pubDate>Tue, 25 Mar 2008 17:08:41 +0000</pubDate>
		<guid isPermaLink="false">http://www.askapache.com/seo/robotstxt-mattcutts-noindex.html#comment-37572</guid>
		<description>This made for interesting reading and gave me some food for thought - not being a full time  webmaster but rather more like being a business owner and &quot;chief bottle washer and cook&quot; (read: publishing my own website(s) and at the bottom of it all just wanting tmy content to get seen by the right people, I have to be careful not to get in over my head... I agree with the @John&#039;s comment about not wanting to spend too much time as a &quot;Google Detective&quot;...

In an effort to reduce potential dupe content, applied a robots.txt recently based on one I copied from an article elsewhere in which the author was using disallow for his index.php, not sure what I was thinking, that was one of the lines that ended up in my robots.txt and unless there is some other unknown aspect involved, (no messages from Google...), for the time being I seem to be literally wiped off the face of Google. Hopefully this is just temporary the Google bot revisits and starts indexing from my home page again... My Home Page (index.php) in fact was my highest ranked page and was ranked very attractively in my desired search results - argggh

So- after reading around, including the interview you posted, I started thinking (and have little to lose for the moment) and this is probably the eternal conflict for anyone wishing to SEO their site on anything but the most basic of levels, &quot;how do I get ALL my content seen but avoid dupe content penalties? have decided to try the following - using noindex, follow tags on the content most likely to be flagged for due content (I use a excerpting plugin as it is for anything resembling an archive...) and only using disallow for for administrative or site code areas (i.e. the wp-* folders) and server areas i.e. cgi-bin as they aare surely duplicates of many others sites, its boring stuff that has no business showing up in content search results and just generally feeling like its best to keep as much of the administraive and code stuff private for security reasons.

So my thinking is, dont ask the googlebot to index all the flotsam and jetsam (noindex - archives, other likely &quot;dupy&quot; content...) but tell it to follow so any linky goodness is maximised - i.e outbound links contained in any of the content - this is all seems pretty wholesome?

Or am I just somehow missing the boat in my naivety? 

I just wondered to myself if the follow tag needs to be included, its probably default action for a searchbot?

Any way thanks askapache for making some interesting and thought provoking content available on your site.</description>
		<content:encoded><![CDATA[<p>This made for interesting reading and gave me some food for thought &#8211; not being a full time  webmaster but rather more like being a business owner and &#8220;chief bottle washer and cook&#8221; (read: publishing my own website(s) and at the bottom of it all just wanting tmy content to get seen by the right people, I have to be careful not to get in over my head&#8230; I agree with the @John&#8217;s comment about not wanting to spend too much time as a &#8220;Google Detective&#8221;&#8230;</p>
<p>In an effort to reduce potential dupe content, applied a robots.txt recently based on one I copied from an article elsewhere in which the author was using disallow for his index.php, not sure what I was thinking, that was one of the lines that ended up in my robots.txt and unless there is some other unknown aspect involved, (no messages from Google&#8230;), for the time being I seem to be literally wiped off the face of Google. Hopefully this is just temporary the Google bot revisits and starts indexing from my home page again&#8230; My Home Page (index.php) in fact was my highest ranked page and was ranked very attractively in my desired search results &#8211; argggh</p>
<p>So- after reading around, including the interview you posted, I started thinking (and have little to lose for the moment) and this is probably the eternal conflict for anyone wishing to SEO their site on anything but the most basic of levels, &#8220;how do I get ALL my content seen but avoid dupe content penalties? have decided to try the following &#8211; using noindex, follow tags on the content most likely to be flagged for due content (I use a excerpting plugin as it is for anything resembling an archive&#8230;) and only using disallow for for administrative or site code areas (i.e. the wp-* folders) and server areas i.e. cgi-bin as they aare surely duplicates of many others sites, its boring stuff that has no business showing up in content search results and just generally feeling like its best to keep as much of the administraive and code stuff private for security reasons.</p>
<p>So my thinking is, dont ask the googlebot to index all the flotsam and jetsam (noindex &#8211; archives, other likely &#8220;dupy&#8221; content&#8230;) but tell it to follow so any linky goodness is maximised &#8211; i.e outbound links contained in any of the content &#8211; this is all seems pretty wholesome?</p>
<p>Or am I just somehow missing the boat in my naivety? </p>
<p>I just wondered to myself if the follow tag needs to be included, its probably default action for a searchbot?</p>
<p>Any way thanks askapache for making some interesting and thought provoking content available on your site.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: AskApache</title>
		<link>http://www.askapache.com/seo/robotstxt-mattcutts-noindex.html#comment-26944</link>
		<dc:creator>AskApache</dc:creator>
		<pubDate>Fri, 02 Nov 2007 03:28:30 +0000</pubDate>
		<guid isPermaLink="false">http://www.askapache.com/seo/robotstxt-mattcutts-noindex.html#comment-26944</guid>
		<description>&lt;p&gt;&lt;strong&gt;@ JOHN&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Nice advice, thanks.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;Leak juice? I prefer not to humanize the SEO process. Your post once again brings up the importance of internal link structure. Use NoFollow on your &quot;about us page&quot;? An absolute waste of time, IMHO. Webmasters should use NoFollow to artificially fix an inherit weakness of the Google Page Rank system? No, I have better things to do with my time. Like making my &quot;about us page&quot; into something important and worth reading. And, personally I think webmasters have better things to do than waste their time on time consuming meaningless busy work.&lt;/p&gt;
&lt;/blockquote&gt;</description>
		<content:encoded><![CDATA[<p><strong>@ JOHN</strong></p>
<p>Nice advice, thanks.</p>
<blockquote>
<p>Leak juice? I prefer not to humanize the SEO process. Your post once again brings up the importance of internal link structure. Use NoFollow on your &#8220;about us page&#8221;? An absolute waste of time, IMHO. Webmasters should use NoFollow to artificially fix an inherit weakness of the Google Page Rank system? No, I have better things to do with my time. Like making my &#8220;about us page&#8221; into something important and worth reading. And, personally I think webmasters have better things to do than waste their time on time consuming meaningless busy work.</p>
</blockquote>
]]></content:encoded>
	</item>
	<item>
		<title>By: Steve Walker</title>
		<link>http://www.askapache.com/seo/robotstxt-mattcutts-noindex.html#comment-25496</link>
		<dc:creator>Steve Walker</dc:creator>
		<pubDate>Tue, 23 Oct 2007 14:18:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.askapache.com/seo/robotstxt-mattcutts-noindex.html#comment-25496</guid>
		<description>I have a page on my site that is simply designed to assist clients in funding their accounts but it seems to be drawing a lot of traffic from very odd and irrelevant terms.  Will the use of noindex tags alone work or do I need to block the URL in my robots text file as well? If so, should I then block every link pointing to the page too?  It is an important page so I link to it site-wide, I just don&#039;t want it indexed.

Also, I&#039;m ranked 12th or so for my key phrase and hoping to land on the home page soon.  Will this adversely affect my ranking in any way or could it help?  Its taken a while to get here so I hate to rock the boat unless it will likely help me.
Thanks</description>
		<content:encoded><![CDATA[<p>I have a page on my site that is simply designed to assist clients in funding their accounts but it seems to be drawing a lot of traffic from very odd and irrelevant terms.  Will the use of noindex tags alone work or do I need to block the URL in my robots text file as well? If so, should I then block every link pointing to the page too?  It is an important page so I link to it site-wide, I just don&#8217;t want it indexed.</p>
<p>Also, I&#8217;m ranked 12th or so for my key phrase and hoping to land on the home page soon.  Will this adversely affect my ranking in any way or could it help?  Its taken a while to get here so I hate to rock the boat unless it will likely help me.<br />
Thanks</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: John H. Gohde</title>
		<link>http://www.askapache.com/seo/robotstxt-mattcutts-noindex.html#comment-25284</link>
		<dc:creator>John H. Gohde</dc:creator>
		<pubDate>Mon, 22 Oct 2007 11:26:56 +0000</pubDate>
		<guid isPermaLink="false">http://www.askapache.com/seo/robotstxt-mattcutts-noindex.html#comment-25284</guid>
		<description>Leak juice?  I prefer not to humanize the SEO process.  Your post once again brings up the importance of internal link structure. Use NoFollow on your &quot;about us page&quot;?  An absolute waste of time, IMHO.  Webmasters should use NoFollow to artificially fix an inherit weakness of the Google Page Rank system?  No, I have better things to do with my time.  Like making my &quot;about us page&quot; into something important and worth reading.  And, personally I think webmasters have better things to do than waste their time on time consuming meaningless busy work.</description>
		<content:encoded><![CDATA[<p>Leak juice?  I prefer not to humanize the SEO process.  Your post once again brings up the importance of internal link structure. Use NoFollow on your &#8220;about us page&#8221;?  An absolute waste of time, IMHO.  Webmasters should use NoFollow to artificially fix an inherit weakness of the Google Page Rank system?  No, I have better things to do with my time.  Like making my &#8220;about us page&#8221; into something important and worth reading.  And, personally I think webmasters have better things to do than waste their time on time consuming meaningless busy work.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
