Closed cooperaj closed 1 week ago
It'd probably be better to cache the results of a robots.txt check for any given run so that only a single call would be made.
Additionally, the user agent used to pull the robots.txt is not the same as the fedifetcher one.
"GET /robots.txt HTTP/1.1" 200 190 "-" "Python-urllib/3.11"
should both be addressed now.
It'd probably be better to cache the results of a robots.txt check for any given run so that only a single call would be made.
Additionally, the user agent used to pull the robots.txt is not the same as the fedifetcher one.
"GET /robots.txt HTTP/1.1" 200 190 "-" "Python-urllib/3.11"