InternetHealthReport / internet-yellow-pages

A knowledge graph for Internet resources
GNU General Public License v3.0

decompressed roas.csv.xz and changed the urls #118

Closed: MAVRICK-1 closed this 7 months ago

MAVRICK-1 commented 7 months ago

Solves issue #117.

Description

This change uses requests.get() to fetch the content of the .xz file, decompresses that content with lzma.open(), and reads the result as a CSV file. The code that processes the CSV data is unchanged.
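For reviewers, here is a minimal sketch of the fetch-and-decompress approach described above. It is not the code from this PR: the helper names `fetch_bytes` and `read_xz_csv`, the `timeout` value, and the UTF-8 encoding are assumptions made for illustration. Since lzma.open() expects a filename or file object, the sketch wraps the downloaded bytes in io.BytesIO.

```python
import csv
import io
import lzma


def fetch_bytes(url, timeout=60):
    """Download a URL and return the raw response body (hypothetical helper)."""
    # requests is third-party; it is imported here so that the parsing
    # helper below can be used without network access.
    import requests

    response = requests.get(url, timeout=timeout)
    response.raise_for_status()  # fail loudly on HTTP errors
    return response.content


def read_xz_csv(xz_bytes):
    """Decompress .xz bytes in memory and return the CSV rows as dicts."""
    # Wrap the bytes in BytesIO so lzma.open() can treat them as a file.
    # Text mode ("rt") lets csv.DictReader consume decoded lines directly.
    # UTF-8 is an assumption about the file's encoding.
    with lzma.open(io.BytesIO(xz_bytes), mode="rt", encoding="utf-8", newline="") as f:
        return list(csv.DictReader(f))
```

With a real URL in hand, usage would look like `rows = read_xz_csv(fetch_bytes(url))`. Each row is a dict keyed by the CSV header, so the existing processing code could stay unchanged if it already reads rows by column name.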


MAVRICK-1 commented 7 months ago

@m-appel and @romain-fontugne, can you review my PR? :-)

m-appel commented 7 months ago

Hey, your PR looks good to me. Please just fix the formatting (the easiest way is to set up pre-commit hooks as described here).

MAVRICK-1 commented 7 months ago

@m-appel I have modified the code structure, and it passes all the tests.