Create robots.txt to Control Search Engine Spiders
How to set up a robots.txt file to implement the Standard for Robot Exclusion. The file is placed in the top-level (root) directory of a website and advises spiders and other robots which directories or files they should not access.
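As a rough illustration of how such a file is read, the sketch below feeds a small set of rules to Python's standard urllib.robotparser module and asks whether a given path may be fetched. The rules, the directory names, and the "ExampleBot" agent name are hypothetical and not taken from the linked article. Compliance is voluntary: the file only advises well-behaved robots.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt contents; paths are illustrative only.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
Disallow: /tmp/draft.html
"""


def is_allowed(path, agent="ExampleBot"):
    """Return True if the rules above permit `agent` to fetch `path`."""
    parser = RobotFileParser()
    # parse() takes the file's lines; no network request is made here.
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(agent, path)


if __name__ == "__main__":
    for path in ("/index.html", "/private/page.html", "/tmp/draft.html"):
        print(path, is_allowed(path))
```

Rules are matched by path prefix, so a single Disallow line for a directory covers everything beneath it.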
Textism's Word HTML Cleaner
Excellent online tool for cleaning the garbage out of Microsoft Word-generated HTML files.