iSnare.com - Free Content Articles Directory

Why Search Engines Are Averse To Identical Content

 
Danny Wirken

Reasons for Replicating Data

According to a study by Krishna Bharat and Andrei Broder, there are several reasons why data are replicated or mirror sites are created: load balancing, high availability, multi-lingual replication, franchises or local versions, database sharing, virtual hosting, and maintaining pseudo identities.

In load balancing, data are replicated to decrease the load on each server. Instead of having one server handle all the traffic from web surfers interested in the data or content, the site is mirrored or the data replicated so that the traffic is split between two or more servers.
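As a rough illustration of how traffic can be split between mirrors, here is a minimal round-robin sketch in Python. The hostnames and the function name make_round_robin are made up for this example, and real load balancers use many other strategies; this is only one simple way to spread requests evenly.

```python
from itertools import cycle

# Hypothetical mirror hostnames; any list of servers holding the same content works.
MIRRORS = ["mirror1.example.com", "mirror2.example.com", "mirror3.example.com"]

def make_round_robin(servers):
    """Return a function that hands out servers in turn, one per request."""
    pool = cycle(servers)

    def next_server():
        return next(pool)

    return next_server

pick = make_round_robin(MIRRORS)
# Six incoming requests are spread evenly over the three mirrors.
assignments = [pick() for _ in range(6)]
print(assignments)
```

Each mirror ends up serving every third request, so no single server carries all the traffic.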

Data are also replicated to make them more highly available. An example is an organization that mirrors its data across geographical locations so that the data are easily available in each of them.

Multi-lingual replication of data is also very common. Data translated into different languages are very useful for reaching a wider audience who all need access to the same data. Good examples of multi-lingual replication are the many Canadian sites that are identical in everything except the language of the content, which is either English or French.

Data are also replicated for franchises or local versions. This happens when data or content is franchised to another company, which then offers the very same data or product under different branding.

Sometimes data are replicated unintentionally. This happens when two independent websites share a common database or file system. Sharing a database sometimes results in mirroring even though neither website intends it.

Virtual hosting also sometimes results in mirroring. This happens with services that have different websites and host names but use the same IP address and server. The path to one site is the valid one, while the same path on the other site simply returns an identical webpage.
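To make the virtual hosting case concrete, the following Python sketch groups host names that share an IP address and return byte-identical pages. The host names, IP addresses, page bodies, and the function name find_mirror_candidates are all invented for illustration; it is an assumption about how one might spot such accidental mirrors, not a description of how any search engine does it.

```python
import hashlib
from collections import defaultdict

def find_mirror_candidates(pages):
    """Group hostnames that share an IP address and serve identical content.

    `pages` maps hostname -> (ip_address, page_body); all values are made up.
    """
    groups = defaultdict(list)
    for host, (ip, body) in pages.items():
        digest = hashlib.sha256(body.encode("utf-8")).hexdigest()
        groups[(ip, digest)].append(host)
    # Only groups with more than one hostname are mirror candidates.
    return [sorted(hosts) for hosts in groups.values() if len(hosts) > 1]

sample = {
    "www.example.com": ("192.0.2.10", "<html>Welcome</html>"),
    "shop.example.net": ("192.0.2.10", "<html>Welcome</html>"),
    "blog.example.org": ("192.0.2.10", "<html>My blog</html>"),
}
mirrors = find_mirror_candidates(sample)
print(mirrors)
```

Here the first two host names are flagged as a likely mirror pair, while the third shares the server but serves its own content.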

The last reason, unlike the first six, is often not a valid reason for site mirroring. Mirroring to maintain pseudo identities is often done to spam search engines with different websites carrying the same content as a means of getting a higher page ranking. This is considered unacceptable and is one of the main reasons why search engines tend to be averse to identical content or replicated data.

Google’s Webmaster Guideline about Duplicate Content

Search engines are so openly against replicated data that Google even has a warning against it in its Webmaster Guidelines. Google’s Webmaster Guidelines are a list of Do’s and Don’ts that websites ought to follow to help the search engine find, index, and rank them. Following the Do’s will increase the chance that Google will list a specific website and rank it favorably. Doing any of the Don'ts, however, will detract from a website’s rank.

In the specific guidelines on website quality, it is stated clearly that websites should not create multiple pages, subdomains, or domains with substantially duplicate content. The term duplicate content is, however, vague, since it isn’t clear how much duplication it takes for search engines like Google to penalize a page. It might take ten words, an entire sentence or paragraph, or even an entire document or page for content to be considered duplicate. The key thing to remember is that the guideline says not to create pages with substantially duplicate content. To be on the safe side, it is better to always have fresh, original content. This is not always possible, however, especially when quoting articles, so it is your call to determine whether the duplicate content might penalize your website. If your conscience is clear that the duplicate content is there for the user’s benefit and not to raise your page ranking, then the crawlers will hopefully interpret it the same way and not penalize your site.
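Since "substantially duplicate" is a matter of degree, one way to get a feel for it is to measure how much two texts overlap. The Python sketch below compares the sets of three-word sequences ("shingles") in two texts and reports their Jaccard similarity, a number between 0 and 1. The function names, the shingle size, and the sample sentences are assumptions made for this example, and nothing here reflects the actual threshold or method Google applies.

```python
def shingles(text, k=3):
    """Return the set of k-word sequences (shingles) in the text."""
    words = text.lower().split()
    if len(words) < k:
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}

def resemblance(a, b, k=3):
    """Jaccard similarity of two texts' shingle sets, from 0.0 to 1.0."""
    sa, sb = shingles(a, k), shingles(b, k)
    if not sa and not sb:
        return 1.0  # two empty texts are treated as identical
    return len(sa & sb) / len(sa | sb)

original = "search engines try to show each piece of content only once"
near_copy = "search engines try to show each piece of content just once"
unrelated = "a recipe for lemon cake with fresh butter and sugar"

print(round(resemblance(original, near_copy), 2))
print(resemblance(original, unrelated))
```

Changing a single word still leaves most shingles shared, so the near copy scores well above zero, while the unrelated sentence shares none. Where to draw the line between "similar" and "substantially duplicate" remains a judgment call.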

Annoyed Surfers and Speedy Crawlers

Search engines exist to point surfers to websites containing information relevant to their search string. They do not exist to point surfers to different websites containing exactly or nearly the same information. When surfers click on different links, they expect different web pages, perhaps with the same or a different take on the same topic, but definitely with different content. However, there are many sites with partially duplicated content, and even sites whose content is simply replicated in full. Clicking on mirror sites irritates surfers, since waiting for the same thing to load twice or more is a waste of time. This is especially irritating if the site happens to be a spam site whose content is of poor quality.

Because of this problem, web crawlers now skip web pages or websites that a previous crawl has determined to be exact or near duplicates. The mirror sites that are not crawled will not make it into the search engine’s results listing, since only one of the duplicates is indexed. As a result, search engines will not have more than one of the mirror sites in their results listings, which avoids irritating web surfers.
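The idea of indexing only one copy of each duplicate can be sketched in a few lines of Python. In this simplified example, each page is reduced to a fingerprint (a hash of its text after lowercasing and collapsing whitespace), and a page is skipped if its fingerprint has already been seen. The URLs, page texts, and function names are hypothetical, and the sketch only catches exact duplicates after normalization; catching near duplicates would need a similarity measure instead of a plain hash.

```python
import hashlib

def fingerprint(text):
    """Hash the page text after lowercasing and collapsing whitespace."""
    normalized = " ".join(text.lower().split())
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()

def index_unique(pages):
    """Return URLs to index, keeping the first page seen for each fingerprint."""
    seen = set()
    indexed = []
    for url, text in pages:
        fp = fingerprint(text)
        if fp in seen:
            continue  # duplicate of an already indexed page: skip it
        seen.add(fp)
        indexed.append(url)
    return indexed

# Hypothetical crawl results: the first two pages differ only in case and spacing.
crawl = [
    ("http://a.example/page", "Identical   content here."),
    ("http://b.example/page", "identical content here."),
    ("http://c.example/page", "Something different."),
]
kept = index_unique(crawl)
print(kept)
```

Only the first copy of the duplicated page is kept, so the mirror never reaches the index.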

Satisfied surfers are not the only result of the new technique crawlers use. Search engines benefit as well, since not having to crawl mirrored pages lessens the load on the crawlers and thus speeds up crawling. Bandwidth is also saved, resulting in a faster, more efficient crawling operation in which the web crawler can cover and index more significant websites.

Valid Mirrored Sites

However, for valid mirror sites like those mentioned above (multi-lingual, franchise, etc.), there should be no worry, since search engines have provisions for such things and take into account the motive behind them. You can help your mirror site by making sure that you follow all the other guidelines to get noticed and ranked by Google. Following the guidelines will surely help your ranking not only with Google but with other search engines as well.

Article published on July 29, 2006 at Isnare.com
 