Closed Rhynorater closed 6 years ago
The current CommonCrawl fetch url is this:
http://index.commoncrawl.org/CC-MAIN-2018-22-index?url=*.%s&output=json
I would suggest that it should be this:
http://index.commoncrawl.org/CC-MAIN-2018-22-index?url=*.%s/*&output=json
See the difference in results in the following: http://index.commoncrawl.org/CC-MAIN-2018-22-index?url=blog.innerht.ml/*&output=json as opposed to how you currently have it: http://index.commoncrawl.org/CC-MAIN-2018-22-index?url=blog.innerht.ml&output=json
Thanks, Justin
Ah! Good spot! Should be sorted as of 3279764 :)
The current CommonCrawl fetch url is this:
I would suggest that it should be this:
See the difference in results in the following: http://index.commoncrawl.org/CC-MAIN-2018-22-index?url=blog.innerht.ml/*&output=json as opposed to how you currently have it: http://index.commoncrawl.org/CC-MAIN-2018-22-index?url=blog.innerht.ml&output=json
Thanks, Justin