Revenue Source

Welcome to the Revenue Source affiliate marketing forums.

You are viewing our internet marketing and SEO forums as a guest which gives you limited access to most of our discussions.  By joining our free community, you will have access to post affiliate marketing topics, communicate privately with other members (PM), exchange SEO strategies, and access many other special features.  Registration is fast, simple and absolutely free so please, join our community today!

If you have any problems, please don't hesitate to contact us.

Go Back   Revenue Source > Affiliate Marketing Hangout > Internet Marketing Articles
Reload this Page Busted!!! Microsoft/MSN Search Stealing Google Results?
Tags: , , , , ,

Reply
 
LinkBack Thread Tools Search this Thread
Old
  (#1 (permalink))
RS Tom is Offline
Revenue Source Admin
RS Tom is almost famous!RS Tom is almost famous!RS Tom is almost famous!RS Tom is almost famous!
 
RS Tom's Avatar
 
Join Date: Sep 2004
Posts: 2,406
Tom A.
Jack of All Trades
Revenue Source, Inc.
Ft. Lauderdale, FL United States
   
Busted!!! Microsoft/MSN Search Stealing Google Results? - 11-11-2004

I was questioned today by a developer who was watching a particular IP address scan his site. The IP was 65.54.188.86 and is registered to Microsoft Corp. located at One Microsoft Way, Redmond, Washington 98052. This visitor was not sending the normal header information associated with a crawler to the web server such as an http robot name or identifying info or even a browser name.

The behavior it demonstrated made it look like a crawler, especially since it was spidering urls that were no longer in existence (search engine spiders crawl site segments at regular intervals and often come back when an initial crawl left urls uncrawled) and doing so at the rate of 1 page every 3 - 5 seconds. The visitor started their visit at 7:37 am and was still on the site at 12:00 pm.

Correction, the data was there after all, here's the crawler info... msnbot/0.3 (+http://search.msn.com/msnbot.htm)

Here's the kicker

So now you're saying, so what, big deal. But this really is a big deal. It's a big deal not only because the urls this visitor was making requests to don't exist any longer but because the only place these urls can be found is in Google's search results using site:www.sitename.com. A similar query on MSN Search doesn't show the urls at all, even on the beta version of their new Microsoft search engine. But then within just hours of the visitors exit from the site the new same search at Microsoft's new search engine shows all of the urls in question being fully indexed within its results.

My Theory On This Mysterious Microsoft Crawler

The old msn required a fee to be crawled by its spider. But a few months back MSN dropped the fee and said they were going to begin crawling the entire web and doing it without charge. However, that's no easy task. So I believe MSN is using the results from Google and possibly even Yahoo to get all of the pages they've indexed on sites that have a relatively low page count in the current msn search engine.

First off, that's the fastest way to get the relevant pages from a web site. Sure they could just go to the site directly and start crawling but in doing so they're going to get tons of duplicate urls and urls that seem different but point to the same content. Crawling Google's results will eliminate the bandwidth to some extent but will not completely take care of the duplicate content issue their spider will encounter.

Secondly, crawling Google's results can act as a qualitative measure for their new search engine. By creating a baseline number of pages per site when the new Microsoft Search is launched and running a comparison on a regular interval for the next 6 months, they'll be able to determine internally if their engine is finding and indexing the same links and as many links as Google. Call it competitive analysis or whatever you want.

So Microsoft's Screen Scraping?

Obviously my conclusion should be taken as a grain of salt but it's a definite possibility. Microsoft very well could be screen scraping Google (or maybe even using their API, LOL) and crawling the urls it finds. It makes sense from a business case but I wonder if there are any legal issues there. I doubt it. It's like putting garbage out to the curb. Once it's out there it's fair game but I bet Google's lawyers would have more to say than that on the case.

Has anyone out there seen similar behavior on their own sites? Please comment with your qualitative/objective data if so.

Jason's article first appeared on his blog MarketingShift.com.

View All Articles by Jason Dowdell
  
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
Old
  (#2 (permalink))
chrissie is Offline
Revenue Source Member
chrissie has a brilliant future here!
 
Join Date: Nov 2004
Posts: 11
Chrissie L.
Affiliate/Webmaster
Self Employed
Fort Lauderdale, FL United States
   
11-11-2004

Nearly every time I check my message boards via the admin panel the IP Address 65.54.188.54 is there.

I don't have any theories as I am not good what this stuff, I asked a few people and they didn't know what it was. I thought it was a Googlebot but now reading your take probably not. Interesting and yet confusing for a simpleton such as myself ;)
  
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
Revenue Sharing Ads
Reply


Thread Tools Search this Thread
Search this Thread:

Advanced Search

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

vB code is On
Smilies are On
[IMG] code is On
HTML code is On
Trackbacks are On
Pingbacks are On
Refbacks are On

Similar Threads for: Busted!!! Microsoft/MSN Search Stealing Google Results?
Thread Thread Starter Forum Replies Last Post
Good Google - Writing For The Most Powerful Robot In The World ValiantMarketer Search Engine Optimization / Marketing 0 12-07-2004 08:56 AM
Search Engine Spider, Index, And Ranking ValiantMarketer Search Engine Optimization / Marketing 0 11-30-2004 04:11 AM
Feedster’s New Blog Search Enhances RSS Search Service ValiantMarketer Search Engine Optimization / Marketing 0 11-16-2004 09:47 PM
Google Results vs. Yahoo Results RS Tom Search Engine Optimization / Marketing 7 11-12-2004 11:42 PM
Targeting Usage Demographics to Increase Paid Search Conversions. ValiantMarketer Search Engine Optimization / Marketing 0 11-11-2004 06:42 AM



© 2004-6 RevenueSource.com.  All rights reserved.  Do not duplicate or redistribute in any form.
This website and its logos/design are property of RevenueSource.com.  All rights reserved. vBSEO 3.2.0 RC7


1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34