Recently I’ve been laying the groundwork for some future SEO projects doing some site migrations onto content management systems, showing search spiders friendlier content, and cleaning up some URL structures. I’ve been using Google Sitemaps to help keep track of how the projects are progressing.
Some of the problems I’ve been working to overcome are uncrawlable product/shopping cart URL’s like this
another is a hodgepodge of architecture techniques like
While it is completely possible to rank with a mix like that is does make it really hard to maintain and keep track of without loosing stuff on the way.
First a bit of a warning, IF YOUR SITE IS RANKING AND YOU ARE GETTING ORGANIC LISTINGS AND TRAFFIC DON’T MUCK WITH IT! If you do decide to muck with it do it very, very, slowly. I’d recommend logging in and creating a sitemaps account first before you actually make any changes. It gives you an idea of what’s going on and any problems that exist right off the bat. Note to the sitemaps team do you have to show the ‘submit a reinclusion request’ on the login screen if someone isn’t banned, it scares non-seo types who think it’s there for a reason.
Another thing I learned is many site owners really have no idea what’s going on on their sites. They know about a handful of pages, but by and large they really don’t have a full understanding of what pages are on their sites. I didn’t find any hidden spam but on the 6 sites I worked on all of them had old pages that were linked to from somewhere and had either duplicate, outdated or missing information. For example if the information was originally on example.com/page-1.htm and a significant update was made the information it was then put on example.com/page-2.htm . However what invariably happened was a few dangling links existed to page-1.htm and it stayed in the index. If it’s an update keep it on the same page if at all possible. If not delete the old page (keep a backup offline) and 301 the old page to the new page. If it’s for an annual event build a date into the URL structure like:
For heavily/frequently crawled sites you may get feedback sooner, but I’ve found it takes 2-4 weeks from when you start moving pages around for the new sitemaps data to start to show up. So go out and do of all of the changes at once, ping sitemaps, wait and then check back to see what’s changed, and make adjustments.
Now above I recommended not changing things dramatically if you are ranking and getting traffic. I went completely against that advice here on this blog recently. I did this for two reasons, first I want to see what can really go wrong when you do a large scale site migration (1000+ pages). Second I think it’s important to experiment and try new stuff from time to time, you can really learn a lot from the mistakes you make. Here’s a screen shot of my missing pages you can see I’ve got over 180 pages that are missing that I need to 301 to the new locations.
While the experiment is clearly still in progress I haven’t seen a dramatic drop
- Google Sitemaps – Have a conversation with Google about your site
- GOODROI Internet Marketing Â» Those Google Engineerâ€™s are getting better looking
- Vanessa Fox of Google Sitemaps Interview (mp3 format)
- GOODROI Internet Marketing Â» Feb. 23 – Google Sitemaps with Vanessa Fox, Google Engineer
- GOODROI Internet Marketing Â» Follow-up on Google Sitemaps with Vanessa Fox
- Interview with Vanessa Fox of Google Sitemaps – Sugarrae