BlankerL / DXY-COVID-19-Crawler

2019新型冠状病毒疫情实时爬虫及API | COVID-19/2019-nCoV Realtime Infection Crawler and API
https://lab.isaaclin.cn/nCoV/
MIT License
1.99k stars 400 forks source link

The Diamond Cruise data redundacy #65

Closed jnasoy closed 4 years ago

jnasoy commented 4 years ago

It seems like there are multiple data entries for The Diamond Cruise

Screenshot_2020-02-18 https lab isaaclin cn

By the way, great job on the API, been using it in my own visualization. What do you think about adding coordinates? I can help with the map data for the coordinates based on your api.

BlankerL commented 4 years ago

Thank you for reporting this. I suppose Ding Xiang Yuan does not fix the name and punctuation marks in Chinese, so there are redundant data.

I suppose you could just use APIs offered by Google Maps or some open-source maps to meet your needs. This project only crawls and saves data for visualization and scientific research, I can offer English names of the cities or the zip codes because foreign people need them to distinguish city names. But this project is not supposed to wrap up everything and might not be a one-stop service for all the development requirements.

By the way, you can just use ECharts to visualize all the data simply with the city name, rather than using the coordinates.

BlankerL commented 4 years ago

I went through the database and found Ding Xiang Yuan has already fixed the name for Diamond Princess, so I fixed the problem in historical data.

I also add the English name "Diamond Princess Cruise Ship", which is provided by its official website, in the latest commit 3219f3e.