Click to Play

Dealing with Load Balancing Issues
As most everyone knows, large and complex websites require a lot of hosting systems. Unfortunately, a problem occurs since many hosting systems create...

Recent Articles

SEOing Your Site During A Re-Launch
This article comes on the heels of a few website redesigns and relaunches that the SEO team and I have had to work through over here at People To...

How To Apply SEO Changes To Your Site
Implementing SEO changes can be one of the toughest parts of any search engine optimization project. Admittedly, it takes a lot of time and energy to...

You Could Be Sacrificing Visibility By Using Ranking...
In the past, some search engines allowed an API access key to be used for ranking report software, and it can still be utilized today. Without utilizing the API...

Finding Value In The Avalanche Of SEO Advice
There is obviously no shortage of information on SEO. But thanks for turning up here :) The sheer avalanche of SEO information can be overwhelming, for beginners...

How To Detect And Correct Over-Optimization
What is overoptimization? In the world of SEO, over-optimization refers to the idea that your site has been heavily manipulated for search engine rankings.


05.06.10

How To Build An Archive System For Large Websites

By Michael Gray

Today's post is an answer to a question I took a few weeks ago: How to organize archives/categories on Wordpress for news site/blog that publishes a lot of articles, around 30-50 daily.

The first thing I want to bring up is that generally, if you're publishing that volume of posts per day, your posts probably are date sensitive, so you don't mind Google attaching a date to your posts. You can include the date in your URL structure, but it's not necessary. However, you do want to have date archives available and close to the top. As an example, in the top of my masthead, you'll see a link to the archive pagewhich has a link to every month I have published a post. So if you wanted to reach a post I wrote, the path is 4 levels deep
Home> Archives> October 2005> Bacon Polenta

This is what people mean when they mention a flat site architecture or crawling path: you aren't more than 4 clicks from any other post. You also want to make sure you have your robots.txt and robots meta tags configured properly to allow the spiders to crawl that path. If you are publishing a very high volume of posts/pages, you're going to want to get as many links on the archive page as possible, without becoming excessive. Google recommends no more than 100 links per page. In reality that number is really affected by the trust and authority of your inbound links or your link equity.

If you aren't familiar with the term link equity you should read this article by Eric Ward on link equity. Basically, your site's link equity will determine your crawling depth and crawling frequency. The more links you have and the stronger those links are, the deeper the search engines will crawl and the more frequently they will re-crawl it. This is a difficult problem for new websites: they need to add content, but if they add too much too soon, it won't get crawled. So new sites need to balance content creation with link building.


Some other tools you can use to help flatten out a website and increase crawling depth are breadcrumbs. Joost De Valk makes an excellent breadcrumb pluginfor Wordpress websites. A related posts plugin that changes the related posts at the end of each post will also help. I like yet another related posts plugin. Also make sure you are generating an HTML sitemapI like the dagon sitemap generator.

So, to wrap things up, here's what you want to do:

• Provide one or more short crawling paths for spiders. Try to keep the path as short as possible because anything beyond 5 levels is a problem.

• Make sure your robots.txt and robots meta tags don't accidentally block the crawling path.

• Provide alternate crawling paths or access to the website content with breadcrumbs, related posts, and sitemaps.

• Try to minimize maintenanceand make sure all or as many of these solutions as possible update automatically.

Comments

About the Author:
Michael Gray is SEO specialist and publishes a Search Engine Industry blog at www.Wolf-Howl.com. He has over 10 years experience in website development and internet marketing, helping both small and large companies increase their search engine visibility, traffic, and sales. Michael is a current member of Internet Marketing of New York ( IM-NY.org) and a guest speaker on Webmaster Radio. He is also an editor for the popular search engine new website Threadwatch.org.
About DevWebProCanada
DevWebProCanada is for professional developers ... those who build and manage applications and sophisticated websites. DevWebProCanada delivers via news and expert advice New Strategies In Development.
iEntry





DevWebProCanada is brought to you by:

SecurityConfig.com NetworkingFiles.com
NetworkNewz.com WebProASP.com
DatabaseProNews.com SQLProNews.com
ITcertificationNews.com SysAdminNews.com
LinuxProNews.com WirelessProNews.com
CProgrammingTrends.com ITmanagementNews.com






-- DevWebProCA is an iEntry, Inc. publication --
iEntry, Inc. 2549 Richmond Rd. Lexington KY, 40509
2010 iEntry, Inc.  All Rights Reserved  Privacy Policy  Legal 

archives | advertising info | news headlines | free newsletters | comments/feedback | submit article