In this project we will be scanning unstructured online resources such as the common crawl data set
GNU General Public License v3.0
3
stars
1
forks
source link
Added correct content length and removed unnecessary block digest #210
Closed
felixschorer closed 7 years ago
Added missing new line after content
Content length is no accurate
Removed unnescessary
WARC-Block-Digest
header