app/soc/content/robots.txt
author Lennard de Rijk <ljvderijk@gmail.com>
Sun, 04 Oct 2009 15:07:29 +0200
changeset 3004 8485ae85de54
parent 73 211a3eeacf27
permissions -rw-r--r--
Also pass along the args when running an update.

# Directions for web crawlers.
# See http://www.robotstxt.org/wc/norobots.html.

User-agent: HTTrack
User-agent: puf
User-agent: MSIECrawler
User-agent: Nutch
Disallow: /