aigents / aigents-java

Aigents Java Core Platform
MIT License
29 stars 12 forks source link

Fix support for robots.txt patterns #2

Open akolonin opened 4 years ago

akolonin commented 4 years ago

Example: https://www.joom.com/robots.txt ... Disallow: */q.* ... not matched for https://www.joom.com/ru/search/q.xiaomi in https://github.com/aigents/aigents-java/blob/master/src/main/java/net/webstructor/cat/HttpFileReader.java#L211 and file is tried to get read with error 400