Open holzsec opened 3 months ago
The static files used in create_datasets.ipynb are not included in the repository, and their source is not listed.
##################################################################################################
# STATIC FILES
##################################################################################################
STATIC_PATH: Path = DEFAULT_DIR / "datasets/static_datasets/"

COUNTRIES_JSON_FILE: Path = STATIC_PATH / "countries.json"
COUNTRIES_TXT_FILE: Path = STATIC_PATH / "countries.csv"
COUNTRIES_CSV_FILE: Path = STATIC_PATH / "iso_code_2.csv"
POPULATION_CITY_FILE: Path = STATIC_PATH / "population.json"
CITIES_500_FILE: Path = STATIC_PATH / "cities500.txt"
POPULATION_DENSITY_FILE: Path = (
    STATIC_PATH / "gpw_v4_population_density_rev11_2020_30_sec.tif"
)

ADDRESS_FILE: Path = (
    STATIC_PATH / "internet_address_verfploeter_hitlist_it102w-20230125.fsdb"
)
GEOLITE_FILE: Path = STATIC_PATH / "GeoLite2-City-Blocks-IPv4_20230516.tree"
IP_INFO_GEO_FILE: Path = STATIC_PATH / "ip_info_geo_anchors.json"
MAXMIND_GEO_FILE: Path = STATIC_PATH / "maxmind_free_geo_anchors.json"

GEOPAPIFY_1_FILE: Path = STATIC_PATH / "geocoded_by_geoapify-10_05_2023_0_500.csv"
GEOPAPIFY_2_FILE: Path = STATIC_PATH / "geocoded_by_geoapify-10_05_2023_500_last.csv"

IP_TO_ASN_FILE: Path = STATIC_PATH / "2022-03-28.dat"
ANCHORS_SECOND_PAPER_FILE: Path = STATIC_PATH / "anchors_ip_list.json"
CACHED_WEBSITES_FILE: Path = STATIC_PATH / "websites.json"
BGP_PRIFIXES_FILE: Path = STATIC_PATH / "bgp_prefixes.json"
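For anyone who wants to see which of these files are absent from their own checkout, here is a minimal sketch. It is not part of the repository: the function name `find_missing_static_files` is hypothetical, the file names are copied from the listing above, and because `DEFAULT_DIR` is not shown here, the directory is passed in explicitly.

```python
from pathlib import Path

# File names copied from the settings listing above.
STATIC_FILE_NAMES = [
    "countries.json",
    "countries.csv",
    "iso_code_2.csv",
    "population.json",
    "cities500.txt",
    "gpw_v4_population_density_rev11_2020_30_sec.tif",
    "internet_address_verfploeter_hitlist_it102w-20230125.fsdb",
    "GeoLite2-City-Blocks-IPv4_20230516.tree",
    "ip_info_geo_anchors.json",
    "maxmind_free_geo_anchors.json",
    "geocoded_by_geoapify-10_05_2023_0_500.csv",
    "geocoded_by_geoapify-10_05_2023_500_last.csv",
    "2022-03-28.dat",
    "anchors_ip_list.json",
    "websites.json",
    "bgp_prefixes.json",
]


def find_missing_static_files(static_path: Path, names=STATIC_FILE_NAMES) -> list[str]:
    """Return the names that do not exist as regular files under static_path."""
    return [name for name in names if not (static_path / name).is_file()]


if __name__ == "__main__":
    # Assumption: the script is run from the repository root, so the relative
    # path below matches STATIC_PATH. Adjust it to wherever DEFAULT_DIR points.
    static_path = Path("datasets/static_datasets")
    for name in find_missing_static_files(static_path):
        print(f"missing: {name}")
```

Running it prints one line per file that still needs to be obtained.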
Hello, sorry for the late answer. I checked the FTP server, and these static files are indeed missing. I will upload them as soon as possible.