Why you should stop worrying about avoiding the duplicate content penalty為什麼你應該不用擔心,避免重複處罰內容
Posted on September 21, 2007 at 8:47 am發布於2007年9月21日在上午8點47分
Ok, so it seems like everyone and anyone starting a blog or "optimizing" their blog is concerned about duplicate content penalties from Google and so have devised a an entire slew of remedies from adding all kinds of disallow statements to their robots.txt files to installing SEO-optimized duplicate-content-curing plugins for WordPress, etc.好吧,我們就這樣好像每個人都開始對博客或者"優化"他們的博客是關心重複處罰的內容從Google等,設計了一個整一系列的補救措施,從加入各種不允許報表,以自己的robots.txt文件安裝西區優化重複內容固化插件,為的WordPress等。
And I’m no special person, I’ve got over 30 lines in my robots.txt file to block Google from my WP- folders, my archive pages, my tag pages, and lots more!和我沒有什麼特別的人,我還有超過30線在我的robots.txt文件,以阻止Google從我的可濕性粉劑文件夾,我的檔案頁,我的標籤頁,及其它更多! I also have the SEO WordPress plugin installed that helps prevent "supplemental results" by adding the NOINDEX meta tag to my category and archive pages.我也有徐Wordpress插入式安裝,從而可以有效防止"補充結果" ,加入了noindex meta標籤,以我的分類和歸檔頁。 Basically, the only pages that I allow Google to access are the actual permalinks URLs for my posts and my static pages.基本上,只有頁面我允許Google准入實際permalinks的URL為我的職位和我的靜態頁面。
That’s it!就是這麼簡單! Nothing else!什麼都沒有! If you perform a site:www.online-tech-tips.com search in Google, you’ll see it’s just my articles and nothing else.如果您執行網址: www.online科技tips.com搜索在Google ,你會看到它的只是我的文章,什麼都沒有了。
Now when I first implemented this, I thought that I was doing something that would help my rankings in Google considering it would be avoiding getting thrown into the supplemental results.現在,當我第一次執行這個,我還以為我做的東西,這將有助於我的排名在Google考慮,它會避免越來越扔進補充的效果。 However, over the last few months, I’ve been asking other bloggers like不過,在過去幾個月裡,我一直在問其他博客一樣 Lorelle lorelle and及 Amit amit經由 about what kinds of steps they have taken to prevent duplicate content and was shocked by the responses.關於什麼樣的步驟,有措施,以防止重複的內容和震驚的反應。
Here was Lorelle’s response to my question: 這裡是lorelle的回應我的問題:
Do I? 我? Or does WordPress.com? 還是wordpress.com ? This is a WordPress.com blog. 這是一個wordpress.com博客。 You’ll have to talk to them about their robots.txt. 你必須跟他們談他們的robots.txt 。
The duplicate content issue is one that bloggers have taken WAY out of control. 對重複內容的問題,是一個博客已採取的方式失去控制。 Duplicate content is natural on blogs. 重複的內容,是很自然的博客。 Don’t stress over it. 不應力超過它。 The issue is related specifically to evil doers who use duplicate content for their splogs, and stealing content from other blogs or copying content from their splogs across to their other splogs. 問題是具體有關邪惡者的人使用複製的內容,為他們的splogs ,並竊取內容,從其它博客或複製的內容,從他們的splogs跨越它們其他splogs 。 It’s to tackle the evil, not the normal blogger. 它的打擊邪惡 ,而不是正常的博客。
For some reason I was thinking that such big bloggers would have been all over these "issues".出於某些原因,我的想法是:這麼大的博客會被所有這些"問題"的報告。 So I decided to perform a site: search on a couple of big name blogs like ProBlogger.net, CopyBlogger.com, Lifehacker.com, and SEOMoz.com .因此,我決定將演出地點:搜索對一對夫婦的大名稱博客像problogger.net , copyblogger.com , lifehacker.com , seomoz.com 。 Well it was pretty interesting what I came across.好,它相當有趣,我所遇見的。 All of these sites get thousands of visitors a day from the search engines and yet just about everything is indexed by Google including archive pages, category pages, tag pages, and comments!所有這些土地得到了數以千計的參觀者,每天從搜尋引擎,但只是一切都收錄其中包括Google的檔案頁,分類頁,標籤頁,並評論!
So after doing this, I became even more curious as to whether my 30 line robots.txt is really necessary!所以這樣做之後,我變得更加好奇,至於是否我的30線的robots.txt ,是十分必要的! What kind of robots.txt file are these guys using?什麼樣的robots.txt文件,是這些傢伙用? So here’s what mine looks like as of right now:所以這裡的什麼礦看上去就像右現在:
User-agent: Googlebot 用戶代理: googlebot
Disallow: */feed* 批駁: * / *飼料
Disallow: */rss* 批駁: * / *的RSS
Disallow: */trackback* 批駁: * /跟踪*
Disallow: */wp-admin 批駁: * /可濕性粉劑-管理員
Disallow: */wp-content 批駁: * /可濕性粉劑含量
Disallow: */wp-includes 批駁: * /可濕性粉劑-包括
Disallow: *wp-login.php 批駁: *粉劑-l ogin.php
Disallow: */20* 批駁: * / 20 *
Disallow: */comments* 批駁: * /評論*
Allow: */category/*/page/* 允許: * /類別/ * /頁/ *
Disallow: /page* 批駁: /頁*
Disallow: */search* 批駁: * /搜索*
Disallow: */?s* 批駁: * /有何*
Disallow: */?p* 批駁: * / ? p *的
Disallow: */index.php?p* 批駁: * / index.php ? p *的
Disallow: /*.php$ 批駁: / *.的PHP元
Disallow: /*.js$ 批駁: / *. js元
Disallow: /*.inc$ 批駁: / *.公司元
Disallow: /*.css$ 批駁: / *.的CSS元
Disallow: /*.gz$ 批駁: / *.的GZ元
Disallow: /*.cgi$ 批駁: / *.的CGI元
Disallow: /*.wmv$ 批駁: / *.對WMV元
Disallow: /*.cgi$ 批駁: / *.的CGI元
Disallow: /*.xhtml$ 批駁: / *.的XHTML元
Disallow: /*.php* 批駁: / *. PHP中*
Disallow: */trackback* 批駁: * /跟踪*
Disallow: /*?* 批駁: / * ? *
Disallow: /z/ 批駁: / z為/
Disallow: /wp-* 批駁: /可濕性粉劑- *
Disallow: */tag/ 批駁: * /標籤/
Disallow: */stats* 批駁: * /統計*
Disallow: */cgi-bin* 批駁: * /的CGI斌*
Allow: /wp-content/uploads/ 允許: / wp-content/uploads /
User-agent: Googlebot-Image用戶代理: googlebot形象
Allow: /*允許: / *
Sitemap:網站: http://www.online-tech-tips.com/sitemap.xml
Now let’s take a look at a few from the big bloggers!現在,讓我們來看看幾個從大博客! So here’s what the robots.txt file looks like for the following sites:所以這裡的什麼robots.txt文件看起來像用於下列地點:
Problogger.net problogger.net
User-agent: * 用戶代理: *
Disallow: 批駁:
LifeHacker.com lifehacker.com
User-Agent: Googlebot 用戶代理: googlebot
Disallow: /index.xml$ 批駁: / index.xml元
Disallow: /excerpts.xml$ 批駁: / excerpts.xml元
Allow: /sitemap.xml$ 允許: / sitemap.xml元
Disallow: /*view=rss$ 批駁: / *查看=美元的RSS
Disallow: /*?view=rss$ 批駁: / * ?看法=美元的RSS
Disallow: /*format=rss$ 批駁: / * =格式的RSS元
Disallow: /*?format=rss$ 批駁: / * ? =格式的RSS元
Sitemap: 網站: http://lifehacker.com/sitemap.xml http://lifehacker.com/sitemap.xml
CopyBlogger.com copyblogger.com
User-agent: * 用戶代理: *
Disallow: /*/feed/ 批駁: / * /飼料/
Disallow: /*/trackback/ 批駁: / * /跟踪/
TechCrunch.com techcrunch.com
User-agent: * 用戶代理: *
Disallow: /*/feed/ 批駁: / * /飼料/
Disallow: /*/trackback/ 批駁: / * /跟踪/
Mashable.com mashable.com
User-agent: * 用戶代理: *
Disallow: /feed 批駁: /飼料
Disallow: /*.xml$ 批駁: / *.的XML元
Disallow: /*/feed/ 批駁: / * /飼料/
Disallow: /*/trackback/ 批駁: / * /跟踪/
Ok, so as you can see from the above list, EVERYONE’s list is a hell of a lot shorter than mine and my list was created by reading through all kinds of posts talking about how everything must be blocked or disallowed.好吧,我們就這樣,因為你可以看到,從上述名單中, 每個人的名單,是一個地獄的很多少於礦井和我的名單是由讀通過各種崗位談到如何都必須被阻塞或不獲批准。 Well, obviously if the top bloggers are not worrying about duplicate content than why should I be!那麼,顯然,如果高層博客並不擔心重複的內容比,我為什麼要! Actually, it seems like maybe it’s even helping them in some kind of way.實際上,它似乎想也許它的,甚至幫助他們在某些種方式。
So before you go installing lots of plugins that prevent Google from indexing your site completely, remember two things:所以之前你安裝了很多插件,即阻止Google索引你的站點,記住兩件事:
1. 1 。 Doesn’t seem like any of the really popular blogs are doing anything about it and似乎並不像任何一個真正受歡迎的博客做任何事,它與
2. 2 。 The supplemental results database no longer exists in Google anyway!補充成果數據庫不再存在,在Google無論如何!
My next step is to remove all of my the disallow statements from my robots.txt file and see what happens!我的下一個步驟是清除所有我的該機構禁止發言,從我的robots.txt文件,看看會發生什麼! Any one else try this yet?任何人都嘗試這個沒?
Also, another observation that may be obvious, but warrants a mention is the fact that all of these people write GREAT content and a LOT of it.此外,另一種看法認為,可能是顯而易見的,但值得一提的是一個事實,即所有這些人寫出偉大的內容,並提供不少 。 So you can do all the optimizing you want, but unless you have really good content that people will link to, bookmark, and visit again, it’s not really going to matter!所以你可以做的所有優化你想要做的,但除非你有真正好的內容,人們將連接,書籤,並參觀再次,它不是真的要的事情!
Tell me what you think in the comments!告訴我你的想法在評論! ![]()
If you enjoyed this post, make sure you 如果你喜歡這個職位,請務必 subscribe to my RSS feed 訂閱我的RSS饋送 ! !
» Filed Under »提起下 Blogging博客
Related Posts相關職位
- A complete list of search engine friendly (SEO) WordPress plugins for your Blog一個完整的清單,搜索引擎友好(西區) wordpress插件,為您的博客
- How to get your Blog to rank higher in Google’s search results如何讓您的博客,以職級高,在Google的搜索結果
- 8 Security Tips and Guidelines for your WordPress Blog 8條安全提示和指引,你的WordPress博客
- Windows Live Search Webmaster Center open to public的Windows Live搜索網管中心向公眾開放
- SEO’s please help me!?徐在應的,請救救我! ? Should I try this crazy shit with my blog!?我應該試試這個瘋狂shit的與我的博客!

























One question regarding duplicate content please ?一個問題,對於重複的內容好嗎?
I write for some more sites我寫的部分更多土地
especially techtoday one of my really good friend尤其是techtoday我的一個真正的好朋友
I need to ask that I directly copy and paste from my site to his我要問,我直接複製並粘貼,從我的網站,以他的
SO will it panelize me or him??????所以將它panelize我還是他??????
thx THX的
Well it depends.以及它有賴於此。 If you write the content on your site and immediately post it on his site, the site that will be penalized will be the one that Google indexes LAST.如果你寫的內容對你的網站,並立即郵寄回他的網站,該網站會受到懲罰將是一個Google的指標上。 So if the Google bot indexes your Page1.html, let’s say, first and then goes to his site and see the same content, his site will be penalized.所以,如果Google公司的BOT索引您page1.html ,讓我們說,首先,然後去他的網站上看到同樣的內容,他的網站將受到懲罰。 But if it’s the other way around, you will be penalized.但如果它的另一條路周圍,你將受到懲罰。
Basically, the content should only be on one person’s site because no matter how you do it, only one will be in the main index.基本上,內容應只對一個人的地盤,因為不管你如何做,只有一個,將在主要指數。
hmm哼
I immediately post in his site我立刻後,在他的網站上
So wht if I do a bit of change in that article and then post it?????? WHT的,所以如果我做一點改變,在這篇文章中,然後郵寄回??????
Your changes should be significant, minor changes won’t really help.您的修改具有重要意義,小的變化不會真正有幫助。 Actually, it would be much smarter to write the article and have it posted on ONE site and then have the other site link back to that article with good keywords in the link.實際上,它將會大大聰明,寫文章,並已張貼在其中一個地點,然後再有其他網站的鏈接回條具有良好關鍵詞,在聯繫匯率制度。 That way both sites will be getting high quality back links, which is one of the most important factors in Google’s ranking algorithm.這樣,這兩個網站將得到的高品質回的鏈接,這是一個最重要的因素, Google的排名算法。 Don’t worry about having the content on both sites.不要擔心過的內容就這兩個網站。