Revenue Source

Posted by: SEO Blogs (Revenue Source Veteran)
Sitewide Duplicate Content Checker Tool Useless? - 11-15-2006

SEO Junkie just released a client-side application you can use to check for duplicate content on your site. I found a link to his app through Search Engine Watch yesterday while I was playing around with Google Co-op. The application is meant for small sites, and he warns that there’s a lot of functionality missing. When he said “small sites” I hoped he meant fewer than 10,000 pages, but in the end I walked away scratching my head.
First, even when checking just two URLs, it’s incredibly slow compared to something like the Similar Pages Checker. Second, it indiscriminately crawls every URL it finds: affiliate redirect links, links to images, videos, etc. It took me a few minutes to figure out why I was suddenly getting hit with a barrage of pop-ups. Third, it crapped out in the middle of crawling my site with an error message: Runtime Error: Method ‘~’ of object ‘~’ failed. I went over to his blog and saw similar error messages being reported. It could be due to hitting a memory cap or something, but why not just gracefully stop crawling links once you get above a certain threshold?
My biggest complaint, though, is that you can’t use this to check large sites. Those are exactly the kind of sites I’d want to check for page similarity. I mean, why would I want to run my 30-page blog through something like this?
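To make the "stop at a threshold" idea concrete, here is a rough Python sketch of a crawl loop with a hard page cap. It is not how SEO Junkie's tool works (I haven't seen its source); `crawl_with_cap`, `get_links`, and `max_pages` are names I made up for illustration, and `get_links` stands in for whatever fetches a page and extracts its links.

```python
from collections import deque


def crawl_with_cap(start_url, get_links, max_pages=500):
    """Breadth-first crawl that never tracks more than max_pages URLs.

    get_links is a hypothetical callable: URL -> iterable of linked URLs.
    Returns (visited_urls_in_order, truncated), where truncated is True
    if at least one newly discovered link was dropped because of the cap.
    """
    seen = {start_url}
    queue = deque([start_url])
    visited = []
    truncated = False

    while queue:
        url = queue.popleft()
        visited.append(url)
        for link in get_links(url):
            if link in seen:
                continue
            if len(seen) >= max_pages:
                # Cap reached: drop the link instead of growing without bound.
                truncated = True
                continue
            seen.add(link)
            queue.append(link)

    return visited, truncated


# Usage with a tiny in-memory link graph instead of real HTTP requests.
graph = {"a": ["b", "c"], "b": ["c", "d"], "c": ["a"], "d": []}
pages, truncated = crawl_with_cap("a", lambda u: graph.get(u, []), max_pages=3)
print(pages, truncated)
```

The point of returning the `truncated` flag is that the tool can tell the user "I stopped at N pages" rather than dying with a runtime error.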
Suggestions:
  • Reference robots.txt and/or check the robots meta tag, and ignore disallowed/noindex pages.
  • Do not follow redirect URLs that lead outside of a given domain.
  • Automatically limit the number of pages crawled to prevent the program from crashing.
  • Ignore image, audio, and video files (or does it already ignore them?).
  • Give users an option to compare URLs in a subdirectory instead of comparing each page to every other page in a domain.
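Most of the suggestions above boil down to a single "should I crawl this URL?" check. Here is a minimal Python sketch of one, using the standard library's `urllib.robotparser`. The function name `should_crawl`, the `MEDIA_EXTENSIONS` list, the `dup-checker` user agent, and the example.com URLs are all my own assumptions for illustration, not anything from the actual tool. The robots meta tag (noindex) check is left out because it requires fetching and parsing the page first.

```python
from urllib.parse import urlparse
from urllib.robotparser import RobotFileParser

# Illustrative, not exhaustive: file types a content checker can skip.
MEDIA_EXTENSIONS = (
    ".jpg", ".jpeg", ".png", ".gif",
    ".mp3", ".wav", ".avi", ".mov", ".wmv", ".swf",
)


def should_crawl(url, site_host, robots, path_prefix="/", user_agent="dup-checker"):
    """Return True only if url is worth fetching for a duplicate-content check."""
    parts = urlparse(url)
    if parts.scheme not in ("http", "https"):
        return False
    # Stay on the site being checked. For redirects, run the same check
    # on the redirect target before following it.
    if parts.hostname != site_host:
        return False
    path = parts.path or "/"
    # Skip image, audio, and video files.
    if path.lower().endswith(MEDIA_EXTENSIONS):
        return False
    # Optionally restrict the comparison to one subdirectory.
    if not path.startswith(path_prefix):
        return False
    # Respect robots.txt disallow rules.
    return robots.can_fetch(user_agent, url)


# Usage: feed robots.txt rules in directly so the example runs offline.
robots = RobotFileParser()
robots.parse(["User-agent: *", "Disallow: /private/"])

for candidate in (
    "http://example.com/blog/post-1",
    "http://example.com/private/page",
    "http://ads.example.net/redirect?id=1",
    "http://example.com/images/logo.PNG",
):
    print(candidate, should_crawl(candidate, "example.com", robots))
```

In a real crawler you would load the rules with `set_url()` and `read()` on the site's own robots.txt instead of calling `parse()` by hand.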
