Fossil SCM

Improve the recommented value for robot-exception.

drh 2025-10-03 17:27 trunk
Commit 3bbd18a284c8bd6a8c1d46a6faa7ee33142e5fec53bb64792aab53d3c84e57fd
1 file changed +3 -2
+3 -2
--- src/setup.c
+++ src/setup.c
@@ -501,17 +501,18 @@
501501
textarea_attribute("", 2, 80,
502502
"robot-restrict", "rbrestrict", robot_restrict_default(), 0);
503503
504504
@ <hr>
505505
@ <p><b>Exceptions to anti-robot restrictions</b><br>
506
- @ The entry below is a list of regular expressions, one per line.
506
+ @ The entry below is a list of
507
+ @ <a href="%R/re_rules">regular expressions</a>, one per line.
507508
@ If any of these regular expressions match the input URL, then the
508509
@ request is exempt from anti-robot defenses. Use this, for example,
509510
@ to allow scripts to download release tarballs using a pattern
510511
@ like:</p>
511512
@ <p>
512
- @ &emsp;&emsp;<tt>^/tarball\b.*\b(version-|release)\b</tt>
513
+ @ &emsp;&emsp;<tt>^/tarball/(version-[0-9.]+|release)/</tt>
513514
@ <p>The pattern should match against the REQUEST_URI with the
514515
@ SCRIPT_NAME prefix removed, and with QUERY_STRING appended following
515516
@ a "?" if QUERY_STRING exists. (Property: robot-exception)<br>
516517
textarea_attribute("", 3, 80,
517518
"robot-exception", "rbexcept", "", 0);
518519
--- src/setup.c
+++ src/setup.c
@@ -501,17 +501,18 @@
501 textarea_attribute("", 2, 80,
502 "robot-restrict", "rbrestrict", robot_restrict_default(), 0);
503
504 @ <hr>
505 @ <p><b>Exceptions to anti-robot restrictions</b><br>
506 @ The entry below is a list of regular expressions, one per line.
 
507 @ If any of these regular expressions match the input URL, then the
508 @ request is exempt from anti-robot defenses. Use this, for example,
509 @ to allow scripts to download release tarballs using a pattern
510 @ like:</p>
511 @ <p>
512 @ &emsp;&emsp;<tt>^/tarball\b.*\b(version-|release)\b</tt>
513 @ <p>The pattern should match against the REQUEST_URI with the
514 @ SCRIPT_NAME prefix removed, and with QUERY_STRING appended following
515 @ a "?" if QUERY_STRING exists. (Property: robot-exception)<br>
516 textarea_attribute("", 3, 80,
517 "robot-exception", "rbexcept", "", 0);
518
--- src/setup.c
+++ src/setup.c
@@ -501,17 +501,18 @@
501 textarea_attribute("", 2, 80,
502 "robot-restrict", "rbrestrict", robot_restrict_default(), 0);
503
504 @ <hr>
505 @ <p><b>Exceptions to anti-robot restrictions</b><br>
506 @ The entry below is a list of
507 @ <a href="%R/re_rules">regular expressions</a>, one per line.
508 @ If any of these regular expressions match the input URL, then the
509 @ request is exempt from anti-robot defenses. Use this, for example,
510 @ to allow scripts to download release tarballs using a pattern
511 @ like:</p>
512 @ <p>
513 @ &emsp;&emsp;<tt>^/tarball/(version-[0-9.]+|release)/</tt>
514 @ <p>The pattern should match against the REQUEST_URI with the
515 @ SCRIPT_NAME prefix removed, and with QUERY_STRING appended following
516 @ a "?" if QUERY_STRING exists. (Property: robot-exception)<br>
517 textarea_attribute("", 3, 80,
518 "robot-exception", "rbexcept", "", 0);
519

Keyboard Shortcuts

Open search /
Next entry (timeline) j
Previous entry (timeline) k
Open focused entry Enter
Show this help ?
Toggle theme Top nav button