Closed niels closed 8 years ago
Previously. a reference such as http://example.com/some.dir/file would have matched a dir/file extension. Furthermore, extensions were looked for in the entire URL. For example,
http://example.com/some.dir/file
dir/file
<referenceFilters> <filter class="${filterExtension}" onMatch="exclude" caseSensitive="false" >com</filter> </referenceFilters> […] <startUrls> <url>http://example.com</url> </startUrl>
Would have meant that the crawl immediately finishes as http://example.com would have matched the .com exclusion pattern.
http://example.com
.com
This patch makes three changes to fix the extension filtering behaviour:
example.subtype.xml
This fixes #2.
Previously. a reference such as
http://example.com/some.dir/file
would have matched adir/file
extension. Furthermore, extensions were looked for in the entire URL. For example,Would have meant that the crawl immediately finishes as
http://example.com
would have matched the.com
exclusion pattern.This patch makes three changes to fix the extension filtering behaviour:
example.subtype.xml
-style filenames.)This fixes #2.