dr5hn / countries-states-cities-database

🌍 Discover our global repository of countries, states, and cities! 🏙️ Get comprehensive data in JSON, SQL, PSQL, XML, YAML, and CSV formats. Access ISO2, ISO3 codes, country code, capital, native language, timezones (for countries), and more. #countries #states #cities
https://dr5hn.github.io/countries-states-cities-database/
Open Data Commons Open Database License v1.0

Sub localities within cities are added as cities #520

Open HashirLabs opened 1 year ago

HashirLabs commented 1 year ago

First of all, highly appreciate the efforts to build this data set.

The cities database contains many sub-localities (areas within a city, district, or metropolitan region) that are listed as cities in their own right, which effectively puts cities within a city.

Here is an excerpt (some entries) of sub-localities of Mumbai that are listed as cities within Maharashtra state. I also noticed duplicates in the data. In my view, we should either move these entries to a separate file named sub-localities with a foreign-key relationship to the city, since they are sub-localities of a city, or simply remove them from cities.

Mumbai City

(133024,'Mumbai',4008,'MH',101,'IN',19.07283000,72.88261000,'2019-10-06 01:08:47','2019-10-06 01:08:47',1,'Q1156'),

Mumbai city sub localities marked as cities

(57978,'Bhayandar',4008,'MH',101,'IN',19.30157000,72.85107000,'2019-10-06 00:23:45','2019-10-06 00:23:45',1,'Q4901615'),
(58082,'Borivli',4008,'MH',101,'IN',19.23496000,72.85976000,'2019-10-06 00:33:48','2019-10-06 00:33:48',1,'Q4945504'),
(133025,'Mumbai Suburban',4008,'MH',101,'IN',19.12636000,72.84897000,'2019-10-06 01:08:47','2019-10-06 01:08:47',1,'Q2341660'),
(133484,'Powai',4008,'MH',101,'IN',19.11640000,72.90471000,'2019-10-06 01:08:53','2019-10-06 01:08:53',1,'Q13118508'),
(147680,'Andheri',4008,'MH',101,'IN',19.11916667,72.84694444,'2021-06-06 14:47:48','2021-06-06 14:47:48',1,'Q12413015'),
(147694,'Ballard Estate',4008,'MH',101,'IN',18.95000000,72.84000000,'2021-06-06 14:47:48','2021-06-06 14:47:48',1,'Q4851507'),
(147697,'Bandra',4008,'MH',101,'IN',19.05444444,72.84055556,'2021-06-06 14:47:48','2021-06-06 14:47:48',1,'Q257622'),
(147715,'Borivali',4008,'MH',101,'IN',19.23000000,72.86000000,'2021-06-06 14:47:48','2021-06-06 14:47:48',1,'Q4945504'),
(147717,'Breach Candy',4008,'MH',101,'IN',18.96700000,72.80500000,'2021-06-06 14:47:48','2021-06-06 14:47:48',1,'Q4959100'),
(147719,'Byculla',4008,'MH',101,'IN',18.98000000,72.83500000,'2021-06-06 14:47:48','2021-06-06 14:47:48',1,'Q5003979'),
(147723,'Chembur',4008,'MH',101,'IN',19.05871111,72.89969444,'2021-06-06 14:47:48','2021-06-06 14:47:48',1,'Q251170'),
(147727,'Chinchpokli',4008,'MH',101,'IN',18.98333333,72.83333333,'2021-06-06 14:47:48','2021-06-06 14:47:48',1,'Q5100316'),
(147728,'Colaba',4008,'MH',101,'IN',18.91000000,72.81000000,'2021-06-06 14:47:48','2021-06-06 14:47:48',1,'Q3632559'),
(147737,'Dharavi',4008,'MH',101,'IN',19.05000000,72.86667000,'2021-06-06 14:47:48','2021-06-06 14:47:48',1,'Q649632'),
(147742,'Dongri',4008,'MH',101,'IN',19.28333333,72.78333333,'2021-06-06 14:47:48','2021-06-06 14:47:48',1,'Q5296011'),
(147744,'Fort',4008,'MH',101,'IN',18.93500000,72.83590000,'2021-06-06 14:47:48','2021-06-06 14:47:48',1,'Q5470698'),
(147747,'Ghatkopar',4008,'MH',101,'IN',19.08000000,72.91000000,'2021-06-06 14:47:48','2021-06-06 14:47:48',1,'Q3309774'),
(147748,'Girgaon',4008,'MH',101,'IN',18.95300000,72.81300000,'2021-06-06 14:47:48','2021-06-06 14:47:48',1,'Q1931371'),
(147750,'Gorai',4008,'MH',101,'IN',19.25005700,72.78202100,'2021-06-06 14:47:48','2021-06-06 14:47:48',1,'Q4932603'),
(147767,'Jogeshwari',4008,'MH',101,'IN',19.12000000,72.85000000,'2021-06-06 14:47:48','2021-06-06 14:47:48',1,'Q5456464'),
(147768,'Juhu',4008,'MH',101,'IN',19.10000000,72.83000000,'2021-06-06 14:47:48','2021-06-06 14:47:48',1,'Q674362'),
(147806,'Mahim',4008,'MH',101,'IN',19.03500000,72.84000000,'2021-06-06 14:47:48','2021-06-06 14:47:48',1,'Q674501'),
(147807,'Malabar Hill',4008,'MH',101,'IN',18.95000000,72.79500000,'2021-06-06 14:47:48','2021-06-06 14:47:48',1,'Q6373912'),
(147808,'Malad',4008,'MH',101,'IN',19.18611111,72.84861111,'2021-06-06 14:47:48','2021-06-06 14:47:48',1,'Q6740929'),
(147815,'Mankhurd',4008,'MH',101,'IN',19.05000000,72.93000000,'2021-06-06 14:47:48','2021-06-06 14:47:48',1,'Q4856120'),
(147820,'Matunga',4008,'MH',101,'IN',19.01798056,72.84476389,'2021-06-06 14:47:48','2021-06-06 14:47:48',1,'Q5226775'),
(147821,'Mazagaon',4008,'MH',101,'IN',18.97000000,72.85000000,'2021-06-06 14:47:48','2021-06-06 14:47:48',1,'Q5121900'),
(147830,'Mulund',4008,'MH',101,'IN',19.17168056,72.95600000,'2021-06-06 14:47:48','2021-06-06 14:47:48',1,'Q6935217'),
(147835,'Nala Sopara',4008,'MH',101,'IN',19.41540000,72.86130000,'2021-06-06 14:47:48','2021-06-06 14:47:48',1,'Q12435629'),
(147842,'Nariman Point',4008,'MH',101,'IN',18.92000000,72.83000000,'2021-06-06 14:47:48','2021-06-06 14:47:48',1,'Q3632268'),
(147855,'Parel',4008,'MH',101,'IN',18.99000000,72.84000000,'2021-06-06 14:47:49','2021-06-06 14:47:49',1,'Q4856263'),
(147868,'Prabhadevi',4008,'MH',101,'IN',19.01690500,72.82864900,'2021-06-06 14:47:49','2021-06-06 14:47:49',1,'Q5456385'),
(147895,'Sewri',4008,'MH',101,'IN',19.00000000,72.86000000,'2021-06-06 14:47:49','2021-06-06 14:47:49',1,'Q4933909'),
(147904,'Sion Mumbai',4008,'MH',101,'IN',19.04000000,72.86000000,'2021-06-06 14:47:49','2022-08-29 12:30:58',1,'Q7525343'),
(147914,'Tardeo',4008,'MH',101,'IN',18.97000000,72.81000000,'2021-06-06 14:47:49','2021-06-06 14:47:49',1,'Q7685715'),
(147916,'Trombay',4008,'MH',101,'IN',19.00000000,72.90000000,'2021-06-06 14:47:49','2021-06-06 14:47:49',1,'Q3595610'),
(147928,'Vikhroli',4008,'MH',101,'IN',19.11000000,72.94000000,'2021-06-06 14:47:49','2021-06-06 14:47:49',1,'Q5383080'),
(147929,'Vile Parle',4008,'MH',101,'IN',19.10000000,72.83000000,'2021-06-06 14:47:49','2021-06-06 14:47:49',1,'Q3558400'),
(147932,'Wadala',4008,'MH',101,'IN',19.02163056,72.86458889,'2021-06-06 14:47:49','2021-06-06 14:47:49',1,'Q4856518'),
(147939,'Worli',4008,'MH',101,'IN',19.00000000,72.81500000,'2021-06-06 14:47:49','2021-06-06 14:47:49',1,'Q1934607'),
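
One way to surface the duplicates mentioned above is to group rows by their last field. In the excerpt, 'Borivli' (58082) and 'Borivali' (147715) both carry 'Q4945504'. The sketch below is only an illustration: it assumes the last tuple field is a Wikidata ID, uses a handful of rows copied from the excerpt, and `find_duplicates` is a hypothetical helper, not something that exists in the repository.

```python
from collections import defaultdict

# A few rows copied from the excerpt: (id, name, last field).
# Treating the last field as a Wikidata ID is an assumption.
rows = [
    (58082, "Borivli", "Q4945504"),
    (133484, "Powai", "Q13118508"),
    (147680, "Andheri", "Q12413015"),
    (147715, "Borivali", "Q4945504"),
]


def find_duplicates(rows):
    """Group rows by Wikidata ID and keep only IDs shared by several rows."""
    by_wikidata = defaultdict(list)
    for city_id, name, wikidata_id in rows:
        by_wikidata[wikidata_id].append((city_id, name))
    return {qid: entries for qid, entries in by_wikidata.items() if len(entries) > 1}


duplicates = find_duplicates(rows)
for qid, entries in duplicates.items():
    print(qid, entries)
```

This only catches rows that share an ID. Duplicates entered under different IDs, or with no ID at all, would need a different check, such as comparing names and coordinates.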

Please let me know whether we can fix this data, and how.

Thanks

dr5hn commented 1 year ago

@HashirLabs Thanks 🙏

Yes, I am aware of this issue.

We can introduce one more table for sub-localities. It would be great if you could contribute by separating out the sub-localities. I know it will be difficult to identify sub-localities for cities other than Mumbai. Please let me know if you can do so.
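
As a rough sketch of what such a table could look like, the snippet below uses Python's built-in `sqlite3` module with an in-memory database. The table name `sub_localities`, its `city_id` column, the reduced `cities` column set, and the helper `move_to_sub_localities` are all assumptions made for illustration. They are not the project's actual schema or tooling.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite enforces foreign keys only when enabled
conn.executescript("""
CREATE TABLE cities (
    id INTEGER PRIMARY KEY,
    name TEXT NOT NULL,
    state_id INTEGER NOT NULL,
    wikiDataId TEXT
);
-- Hypothetical table: name and columns are illustrative only.
CREATE TABLE sub_localities (
    id INTEGER PRIMARY KEY,
    name TEXT NOT NULL,
    city_id INTEGER NOT NULL REFERENCES cities(id),
    wikiDataId TEXT
);
""")

# Three rows taken from the issue's excerpt, reduced to the assumed columns.
conn.executemany("INSERT INTO cities VALUES (?, ?, ?, ?)", [
    (133024, "Mumbai", 4008, "Q1156"),
    (133484, "Powai", 4008, "Q13118508"),
    (147697, "Bandra", 4008, "Q257622"),
])


def move_to_sub_localities(conn, parent_city_id, locality_ids):
    """Move the given rows out of cities and attach them to a parent city."""
    with conn:  # single transaction: copy each row, then delete the original
        for loc_id in locality_ids:
            conn.execute(
                "INSERT INTO sub_localities (id, name, city_id, wikiDataId) "
                "SELECT id, name, ?, wikiDataId FROM cities WHERE id = ?",
                (parent_city_id, loc_id),
            )
            conn.execute("DELETE FROM cities WHERE id = ?", (loc_id,))


move_to_sub_localities(conn, 133024, [133484, 147697])
remaining = [r[0] for r in conn.execute("SELECT name FROM cities ORDER BY id")]
moved = [r[0] for r in conn.execute("SELECT name FROM sub_localities ORDER BY id")]
print(remaining, moved)
```

Keeping the original `id` values when moving rows is a design choice made here so that existing references stay traceable. Deciding which rows count as sub-localities of which city still has to be done by hand or from an external source.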

Otherwise, you can simply refer to https://github.com/dr5hn/countries-states-cities-database/blob/master/.github/CONTRIBUTING.md, and if you can remove the duplicate cities/localities, that would be appreciated.