thammegowda / mtdata

A tool that locates, downloads, and extracts machine translation corpora
https://pypi.org/project/mtdata/
Apache License 2.0
147 stars 22 forks source link

Add visualizations in search results #104

Open thammegowda opened 2 years ago

thammegowda commented 2 years ago

Task 1: add map view

  1. https://github.com/glottolog/glottolog this repo has language information. The one we are interested most is Geo coordinates (latitude,langitude) -- for example, https://github.com/glottolog/glottolog/blob/master/languoids/tree/drav1251/sout3133/sout3138/tami1291/bada1263/kann1259/kann1255/nucl1305/md.ini

  2. We need to create geo-coordinates for all languages in our list https://github.com/thammegowda/mtdata/blob/master/iso-langs/bcp/iso639_3.tsv

  3. Add map view into the search interface https://github.com/thammegowda/mtdata/blob/master/docs/search.html Related: D3Js and Plotly.js https://plotly.com/javascript/scattermapbox/

Task 2: histograms

Add histograms based on (a) languages, (2) groups (3) name


Edit: here is current version https://gowda.ai/mtdata/search.html