| Packages that use MatchesRegExpDecideRule | |
|---|---|
| org.archive.crawler.deciderules | Provides classes for a simple decision rules framework. |
| Uses of MatchesRegExpDecideRule in org.archive.crawler.deciderules |
|---|
| Subclasses of MatchesRegExpDecideRule in org.archive.crawler.deciderules | |
|---|---|
| class | ClassKeyMatchesRegExpDecideRule: Rule applies configured decision to any CrawlURI class key -- i.e. |
| class | ContentTypeMatchesRegExpDecideRule: DecideRule whose decision is applied if the URI's content-type is present and matches the supplied regular expression. |
| class | ContentTypeNotMatchesRegExpDecideRule: DecideRule whose decision is applied if the URI's content-type is present and does not match the supplied regular expression. |
| class | MatchesFilePatternDecideRule: Compares suffix of a passed CrawlURI, UURI, or String against a regular expression pattern, applying its configured decision to all matches. |
| class | NotMatchesFilePatternDecideRule: Rule applies configured decision to any URIs which do *not* match the supplied (file-pattern) regexp. |
| class | NotMatchesRegExpDecideRule: Rule applies configured decision to any URIs which do *not* match the supplied regexp. |
| class | PathologicalPathDecideRule: Rule REJECTs any URI which contains an excessive number of identical, consecutive path-segments (eg http://example.com/a/a/a/boo.html == 3 '/a' segments) |
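
The path-segment check described for PathologicalPathDecideRule can be illustrated with a small standalone sketch. This is not the Heritrix implementation: the class name `PathologicalPathSketch`, the helper methods, and the `threshold` parameter are hypothetical, and the regular expression is only one plausible way to detect a run of identical, consecutive path segments using `java.util.regex`.

```java
import java.util.regex.Pattern;

// Illustrative sketch only; names and regex are assumptions, not Heritrix code.
public class PathologicalPathSketch {

    // Builds a pattern that finds one path segment followed by at least
    // (threshold - 1) further copies of itself, i.e. `threshold` or more
    // identical consecutive segments.
    static Pattern repeatedSegmentPattern(int threshold) {
        if (threshold < 2) {
            throw new IllegalArgumentException("threshold must be at least 2");
        }
        // "/([^/]+)"     captures one non-empty segment
        // "(?:/\\1){n,}" requires n more copies of that same segment
        // "(?=[/?#]|$)"  makes sure the last copy is a whole segment
        return Pattern.compile(
                "/([^/]+)(?:/\\1){" + (threshold - 1) + ",}(?=[/?#]|$)");
    }

    // Returns true if the URI string contains `threshold` or more identical,
    // consecutive path segments.
    static boolean isPathological(String uri, int threshold) {
        return repeatedSegmentPattern(threshold).matcher(uri).find();
    }

    public static void main(String[] args) {
        // Three consecutive "/a" segments, as in the example above.
        System.out.println(isPathological("http://example.com/a/a/a/boo.html", 3)); // true
        // The "a" segments are not consecutive here.
        System.out.println(isPathological("http://example.com/a/b/a/boo.html", 3)); // false
    }
}
```

In this sketch a rule would REJECT a URI when `isPathological` returns true. Matching against the whole URI string keeps the example short; a stricter variant might extract the path component first.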