app/soc/content/robots.txt
author Sverre Rabbelier <srabbelier@gmail.com>
Sun, 19 Apr 2009 00:06:12 +0000
changeset 2229 b36ecf371aef
parent 73 211a3eeacf27
permissions -rw-r--r--
Store how many times a job has timed out and abort if needed.
Patch by: Sverre Rabbelier

# Directions for web crawlers.
# See http://www.robotstxt.org/wc/norobots.html.

User-agent: HTTrack
User-agent: puf
User-agent: MSIECrawler
User-agent: Nutch
Disallow: /