iLanguage / ilanguagelab

Automatically exported from code.google.com/p/ilanguagelab

Inuktitut Corpus #10

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
Purpose of this task:
acquire an Inuktitut corpus large enough to test multilingual open source 
text processing tools for Inuktitut apps

When reviewing this task, please focus on:
size of the corpus, consistency of spelling

After the review, please add a xxx to yyy wiki page.
add a reference to the corpus on our reference page;
add a reference data page containing the source, size, date of collection, and any 
automated procedures run on the corpus

After the review, the expected next step is to:
create a vocabulary and a word frequency list from the corpus
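
A minimal sketch of that next step, assuming the corpus text has already been extracted to plain Unicode text (the extraction itself is not covered here). The function name `word_frequencies` and the sample string are hypothetical, used only for illustration. Tokenizing on `\w+` is a simplification: it splits on whitespace and punctuation and does no morphological analysis, which may matter for a polysynthetic language like Inuktitut.

```python
import re
import unicodedata
from collections import Counter


def word_frequencies(text):
    """Hypothetical helper: count word forms in a plain-text string."""
    # NFC normalization so that canonically equivalent forms compare equal.
    normalized = unicodedata.normalize("NFC", text).lower()
    # In Python 3, \w matches Unicode letters and digits (and underscore),
    # so both romanized text and syllabics are tokenized.
    tokens = re.findall(r"\w+", normalized)
    return Counter(tokens)


# Placeholder text, not real corpus data.
sample = "inuk inuit inuk nuna"
freqs = word_frequencies(sample)

# The vocabulary is the set of distinct word forms.
vocabulary = sorted(freqs)

# The frequency list, most frequent first.
for word, count in freqs.most_common():
    print(word, count)
```

Spelling variants would show up in such a list as separate low-frequency entries, so the same output could help with the consistency-of-spelling check.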

Original issue reported on code.google.com by hisako...@gmail.com on 21 Oct 2011 at 10:31

GoogleCodeExporter commented 9 years ago

Original comment by a...@ilanguage.ca on 9 Nov 2011 at 10:17

GoogleCodeExporter commented 9 years ago

Original comment by a...@ilanguage.ca on 9 Nov 2011 at 10:25

GoogleCodeExporter commented 9 years ago
We found 24 PDFs (Inuktitut Magazine).

Original comment by hisako...@gmail.com on 9 Nov 2011 at 10:34

GoogleCodeExporter commented 9 years ago
The PDFs are in the CorpusForFieldLinguisticsNonPublic repository.

Original comment by gina.c.c...@gmail.com on 9 Nov 2011 at 11:08

GoogleCodeExporter commented 9 years ago

Original comment by a...@ilanguage.ca on 25 Nov 2011 at 10:23