app/soc/content/robots.txt
author Todd Larsen <tlarsen@google.com>
Wed, 01 Oct 2008 07:11:27 +0000
changeset 242 17984abf0c74
parent 73 211a3eeacf27
permissions -rw-r--r--
Some TODOs on access control that I didn't want to forget.

# Directions for web crawlers.
# See http://www.robotstxt.org/wc/norobots.html for the exclusion standard.

# Deny the crawlers named below access to the entire site.
User-agent: HTTrack
User-agent: puf
User-agent: MSIECrawler
User-agent: Nutch
Disallow: /
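
The record above can be sanity-checked offline. The following is a minimal sketch, assuming Python's standard urllib.robotparser module is an acceptable stand-in for how a compliant crawler reads the file. The ROBOTS_TXT string is a copy of the rules above, and "ExampleBot" and the helper name build_parser are hypothetical, chosen only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Copy of the rules in this file (assumption: kept in sync by hand).
ROBOTS_TXT = """\
# Directions for web crawlers.
# See http://www.robotstxt.org/wc/norobots.html.

User-agent: HTTrack
User-agent: puf
User-agent: MSIECrawler
User-agent: Nutch
Disallow: /
"""

# Agents the record names explicitly.
BLOCKED_AGENTS = ["HTTrack", "puf", "MSIECrawler", "Nutch"]


def build_parser(text):
    """Parse robots.txt text without fetching anything over the network."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# Each listed agent shares the single "Disallow: /" rule.
for agent in BLOCKED_AGENTS:
    print(agent, "may fetch /:", parser.can_fetch(agent, "/"))

# "ExampleBot" is a hypothetical agent that the file does not name.
print("ExampleBot may fetch /:", parser.can_fetch("ExampleBot", "/"))
```

Because the file has no "User-agent: *" record, a crawler that is not listed falls through to the default and is allowed everywhere.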