app/soc/content/robots.txt
author Sverre Rabbelier <srabbelier@gmail.com>
Sat, 06 Dec 2008 14:24:26 +0000
changeset 680 7f047b2a2d3a
parent 73 211a3eeacf27
permissions -rw-r--r--
Added a new create regexp in urls for just scope_path Now that scope_path is properly defined we can add a url matching just the scope path. This allows some other custom create regexps to be removed/rewritten. Note: It needs to be -after- the full key_name regexp, since for arbitrarily nested scopes it would always match as just scope_path, even if there are other fields needed after the scope. Patch by: Sverre Rabbelier
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
73
211a3eeacf27 Created robots.txtx and simple Melange favicon.
Pawel Solyga <Pawel.Solyga@gmail.com>
parents:
diff changeset
     1
# Directions for web crawlers.
211a3eeacf27 Created robots.txtx and simple Melange favicon.
Pawel Solyga <Pawel.Solyga@gmail.com>
parents:
diff changeset
     2
# See http://www.robotstxt.org/wc/norobots.html.
211a3eeacf27 Created robots.txtx and simple Melange favicon.
Pawel Solyga <Pawel.Solyga@gmail.com>
parents:
diff changeset
     3
211a3eeacf27 Created robots.txtx and simple Melange favicon.
Pawel Solyga <Pawel.Solyga@gmail.com>
parents:
diff changeset
     4
User-agent: HTTrack
211a3eeacf27 Created robots.txtx and simple Melange favicon.
Pawel Solyga <Pawel.Solyga@gmail.com>
parents:
diff changeset
     5
User-agent: puf
211a3eeacf27 Created robots.txtx and simple Melange favicon.
Pawel Solyga <Pawel.Solyga@gmail.com>
parents:
diff changeset
     6
User-agent: MSIECrawler
211a3eeacf27 Created robots.txtx and simple Melange favicon.
Pawel Solyga <Pawel.Solyga@gmail.com>
parents:
diff changeset
     7
User-agent: Nutch
211a3eeacf27 Created robots.txtx and simple Melange favicon.
Pawel Solyga <Pawel.Solyga@gmail.com>
parents:
diff changeset
     8
Disallow: /