Closed dchud closed 12 years ago
I wasn't aware we were displaying the language at all. If we are, I assume you have a source for translating the codes. If not, I have one that could be used: languages.txt, attached.
On Fri, Jun 8, 2012 at 1:21 PM, Dan Chudnov <reply@reply.github.com> wrote:
Reply to this email directly or view it on GitHub: https://github.com/gwu-libraries/launchpad/issues/27
aar=Afar abk=Abkhaz ach=Acoli ada=Adangme ady=Adygei afa=Afroasiatic (Other) afh=Afrihili (Artificial language) afr=Afrikaans ajm=_Aljaia aka=Akan akk=Akkadian alb=Albanian ale=Aleut alg=Algonquian (Other) amh=Amharic ang=English, Old (ca. 450-1100) apa=Apache languages ara=Arabic arc=Aramaic arg=Aragonese Spanish arm=Armenian arn=Mapuche arp=Arapaho art=Artificial (Other) arw=Arawak asm=Assamese ast=Bable ath=Athapascan (Other) aus=Australian languages ava=Avaric ave=Avestan awa=Awadhi aze=Azerbaijani bad=Banda bai=Bamileke languages bak=Bashkir ban=Balinese baq=Basque bas=Basa bat=Baltic (Other) bej=Beja bem=Bemba ben=Bengali ber=Berber (Other) bho=Bhojpuri bih=Bihari bik=Bikol bin=Edo bis=Bislama bla=Siksika bnt=Bantu (Other) bos=Bosnian bra=Braj bre=Breton btk=Batak bua=Buriat bug=Bugis bul=Bulgarian bur=Burmese cad=Caddo cai=Central American Indian (Other) cam=_Khmer car=Carib cat=Catalan cau=Caucasian (Other) ceb=Cebuano cel=Celtic (Other) cha=Chamorro chb=Chibcha che=Chechen chg=Chagatai chi=Chinese chk=Truk chm=Mari chn=Chinook jargon cho=Choctaw chp=Chipewyan chr=Cherokee chu=Church Slavic chv=Chuvash chy=Cheyenne cmc=Chamic languages cop=Coptic cor=Cornish cos=Corsican cpe=Creoles and Pidgins, English-based (Other) cpf=Creoles and Pidgins, French-based (Other) cpp=Creoles and Pidgins, Portuguese-based (Other) cre=Cree crh=Crimean Tatar crp=Creoles and Pidgins (Other) cus=Cushitic (Other) cze=Czech dak=Dakota dan=Danish dar=Dargwa day=Dayak del=Delaware den=Slave dgr=Dogrib din=Dinka div=Divehi doi=Dogri dra=Dravidian (Other) dua=Duala dum=Dutch, Middle (ca. 1050-1350) dut=Dutch dyu=Dyula dzo=Dzongkha efi=Efik egy=Egyptian eka=Ekajuk elx=Elamite eng=English enm=English, Middle (1100-1500) esk=_Eskimo languages est=Estonian eth=_Ethiopic ewe=Ewe ewo=Ewondo fan=Fang fao=Faroese far=_Faroese fat=Fanti fij=Fijian fin=Finnish fiu=Finno-Ugrian (Other) fon=Fon fre=French fri=_Frisian frm=French, Middle (ca. 1400-1600) fro=French, Old (ca. 
842-1400) fry=Frisian ful=Fula fur=Friulian gaa=Gã gae=_Scottish Gaelic gag=_Galician gal=_Oromo gay=Gayo gba=Gbaya gem=Germanic (Other) geo=Georgian ger=German gez=Ethiopic gil=Gilbertese gla=Scottish Gaelic gle=Irish glg=Galician glv=Manx gmh=German, Middle High (ca. 1050-1500) goh=German, Old High (ca. 750-1050) gon=Gondi gor=Gorontalo got=Gothic grb=Grebo grc=Greek, Ancient (to 1453) gre=Greek, Modern (1453- ) gwi=Gwich'in hai=Haida hat=Haitian French Creole hau=Hausa haw=Hawaiian heb=Hebrew her=Herero hil=Hiligaynon him=Himachali hin=Hindi hit=Hittite hmn=Hmong hmo=Hiri Motu hun=Hungarian hup=Hupa iba=Iban ibo=Igbo ice=Icelandic ido=Ido iii=Sichuan Yi ijo=Ijo iku=Inuktitut ile=Interlingue ilo=Iloko ina=Interlingua (International Auxiliary Language Association) inc=Indic (Other) ind=Indonesian ine=Indo-European (Other) inh=Ingush int=_Interlingua (International Auxiliary Language Association) ipk=Inupiaq ira=Iranian (Other) iri=_Irish iro=Iroquoian (Other) ita=Italian jav=Javanese jpn=Japanese jpr=Judeo-Persian jrb=Judeo-Arabic kab=Kabyle kal=Kalatdlisut kam=Kamba kan=Kannada kar=Karen kas=Kashmiri kau=Kanuri kaw=Kawi kaz=Kazakh kbd=Kabardian kha=Khasi khi=Khoisan (Other) khm=Khmer kho=Khotanese kik=Kikuyu kin=Kinyarwanda kir=Kyrgyz kmb=Kimbundu kok=Konkani kom=Komi kon=Kongo kor=Korean kos=Kusaie kpe=Kpelle kro=Kru kru=Kurukh kua=Kuanyama kum=Kumyk kur=Kurdish kus=_Kusaie kut=Kutenai lad=Ladino lah=Lahnda lam=Lamba lan=_Occitan (post-1500) lao=Lao lap=_Sami lat=Latin lav=Latvian lez=Lezgian lim=Limburgish lin=Lingala lit=Lithuanian lol=Mongo-Nkundu loz=Lozi ltz=Letzeburgesch lua=Luba-Lulua lub=Luba-Katanga lug=Ganda lui=Luiseño lun=Lunda luo=Luo (Kenya and Tanzania) lus=Lushai mac=Macedonian mad=Madurese mag=Magahi mah=Marshallese mai=Maithili mak=Makasar mal=Malayalam man=Mandingo mao=Maori map=Austronesian (Other) mas=Masai max=_Manx may=Malay mdr=Mandar men=Mende mga=Irish, Middle (ca. 
1100-1550) mic=Micmac min=Minangkabau mis=Miscellaneous languages mkh=Mon-Khmer (Other) mla=_Malagasy mlg=Malagasy mlt=Maltese mnc=Manchu mni=Manipuri mno=Manobo languages moh=Mohawk mol=Moldavian mon=Mongolian mos=Mooré mul=Multiple languages mun=Munda (Other) mus=Creek mwr=Marwari myn=Mayan languages nah=Nahuatl nai=North American Indian (Other) nap=Neapolitan Italian nau=Nauru nav=Navajo nbl=Ndebele (South Africa) nde=Ndebele (Zimbabwe)
ndo=Ndonga nds=Low German nep=Nepali new=Newari nia=Nias nic=Niger-Kordofanian (Other) niu=Niuean nog=Nogai non=Old Norse nor=Norwegian nso=Northern Sotho nub=Nubian languages nya=Nyanja nym=Nyamwezi nyn=Nyankole nyo=Nyoro nzi=Nzima oci=Occitan (post-1500) oji=Ojibwa ori=Oriya orm=Oromo osa=Osage oss=Ossetic ota=Turkish, Ottoman oto=Otomian languages paa=Papuan (Other) pag=Pangasinan pal=Pahlavi pam=Pampanga pan=Panjabi pap=Papiamento pau=Palauan per=Persian phi=Philippine (Other) phn=Phoenician pli=Pali pol=Polish pon=Ponape por=Portuguese pra=Prakrit languages pus=Pushto que=Quechua raj=Rajasthani rap=Rapanui rar=Rarotongan roa=Romance (Other) roh=Raeto-Romance rom=Romani rum=Romanian run=Rundi rus=Russian sad=Sandawe sag=Sango (Ubangi Creole) sah=Yakut sai=South American Indian (Other) sal=Salishan languages sam=Samaritan Aramaic san=Sanskrit sao=_Samoan sas=Sasak sat=Santali scc=Serbian sco=Scots scr=Croatian sel=Selkup sem=Semitic (Other) sga=Irish, Old (to 1100) sgn=Sign languages shn=Shan sho=_Shona sid=Sidamo sin=Sinhalese sio=Siouan (Other) sit=Sino-Tibetan (Other) sla=Slavic (Other) slo=Slovak slv=Slovenian sma=Southern Sami sme=Northern Sami smi=Sami smj=Lule Sami smn=Inari Sami smo=Samoan sms=Skolt Sami sna=Shona snd=Sindhi snh=_Sinhalese snk=Soninke sog=Sogdian som=Somali son=Songhai sot=Sotho spa=Spanish srd=Sardinian srr=Serer sso=_Sotho ssw=Swazi suk=Sukuma sun=Sundanese sus=Susu sux=Sumerian swa=Swahili swe=Swedish swz=_Swazi syr=Syriac tag=_Tagalog tah=Tahitian tai=Tai (Other) taj=_Tajik tam=Tamil tar=_Tatar tat=Tatar tel=Telugu tem=Temne ter=Terena tet=Tetum tgk=Tajik tgl=Tagalog tha=Thai tib=Tibetan tig=Tigré tir=Tigrinya tiv=Tiv tkl=Tokelauan tli=Tlingit tmh=Tamashek tog=Tonga (Nyasa) ton=Tongan tpi=Tok Pisin tru=_Truk tsi=Tsimshian tsn=Tswana tso=Tsonga tsw=_Tswana tuk=Turkmen tum=Tumbuka tup=Tupi languages tur=Turkish tut=Altaic (Other) tvl=Tuvaluan twi=Twi tyv=Tuvinian udm=Udmurt uga=Ugaritic uig=Uighur ukr=Ukrainian umb=Umbundu 
und=Undetermined urd=Urdu uzb=Uzbek vai=Vai ven=Venda vie=Vietnamese vol=Volapük vot=Votic wak=Wakashan languages wal=Walamo was=Washo wel=Welsh wen=Sorbian languages wln=Walloon wol=Wolof xal=Kalmyk xho=Xhosa yao=Yao (Africa) yap=Yapese yid=Yiddish yor=Yoruba ypk=Yupik languages zap=Zapotec zen=Zenaga zha=Zhuang znd=Zande zul=Zulu zun=Zuni zxx=No linguistic content ___=No linguistic content provided
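For reference, a list in the format above could be loaded into a lookup table with a few lines of Python. This is only a sketch: the name parse_language_table is hypothetical, and it assumes that a leading underscore on a name (as in cam=_Khmer) marks a discontinued code, which nothing in this thread confirms.

```python
import re

# One "code=Name" entry: a three-character key, then a name that runs
# until the next "xxx=" key or the end of the text.
ENTRY = re.compile(r"(?:^|\s)([a-z_]{3})=(.*?)(?=\s+[a-z_]{3}=|\s*$)", re.S)


def parse_language_table(text):
    """Return {code: (name, discontinued)} from whitespace-separated code=Name pairs."""
    table = {}
    for code, name in ENTRY.findall(text):
        name = " ".join(name.split())  # rejoin names that wrap across lines
        # Assumption: a leading "_" flags a discontinued code.
        discontinued = name.startswith("_")
        table[code] = (name.lstrip("_"), discontinued)
    return table


# A few entries copied from the list above, including one wrapped name.
sample = """aar=Afar afa=Afroasiatic (Other) cam=_Khmer fro=French, Old (ca.
842-1400) fry=Frisian ___=No linguistic content provided"""

table = parse_language_table(sample)
print(table["fro"][0])
```

A table like this could serve as a fallback for codes that another lookup source does not recognize.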
The pycountry library has good support for ISO 639 codes, e.g.:
import pycountry
# Look up the language by the bibliographic (ISO 639-2/B) code in the bib record.
language = pycountry.languages.get(bibliographic=bib['LANGUAGE'])
bib['LANGUAGE_DISPLAY'] = language.name
...does the trick. The code was already in the bib data, so I thought it couldn't hurt to show it for now.
pycountry has support for this.