app/soc/content/robots.txt
author Pawel Solyga <Pawel.Solyga@gmail.com>
Wed, 20 May 2009 12:32:36 +0200
changeset 2329 4e487ffd4102
parent 73 211a3eeacf27
permissions -rw-r--r--
Add comment to clean_html_content function and update __authors__.

# Directions for web crawlers.
# See http://www.robotstxt.org/wc/norobots.html.

User-agent: HTTrack
User-agent: puf
User-agent: MSIECrawler
User-agent: Nutch
Disallow: /
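
The rules above group four user agents under a single `Disallow: /`, so those crawlers are excluded from the whole site and any agent not listed is left unrestricted. One way to check that reading is a minimal sketch with Python's standard `urllib.robotparser`. The helper names `build_parser` and `blocked_agents` and the agent `ExampleBot` are hypothetical and used only for illustration; they are not part of this repository.

```python
from urllib.robotparser import RobotFileParser

# Copy of the robots.txt rules shown above.
ROBOTS_TXT = """\
# Directions for web crawlers.
# See http://www.robotstxt.org/wc/norobots.html.

User-agent: HTTrack
User-agent: puf
User-agent: MSIECrawler
User-agent: Nutch
Disallow: /
"""


def build_parser(text):
    """Parse robots.txt text held in memory (no network access)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def blocked_agents(parser, agents, path="/"):
    """Return the agents from `agents` that may not fetch `path`."""
    return [agent for agent in agents if not parser.can_fetch(agent, path)]


parser = build_parser(ROBOTS_TXT)

# "ExampleBot" is a made-up agent that the file does not mention.
candidates = ["HTTrack", "puf", "MSIECrawler", "Nutch", "ExampleBot"]
print(blocked_agents(parser, candidates))
```

Because the file has no `User-agent: *` record, the parser falls back to allowing any agent that matches none of the listed names.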