iLanguage / ilanguagelab

Automatically exported from code.google.com/p/ilanguagelab
0 stars 0 forks source link

Test Recall and Precision #16

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
Purpose of addition of this task:
To investigate precision and recall in Inuktitut search using 3 tf-idf 
matrices: 

(1) no encoding, no grammar (baseline) 
(2) yes encoding, no grammar 
(3) yes encoding, yes grammar  

note: 
with/without encoding = with/without transcorder; due to non-standardized 
encoding, words with "&" would be cut in the middle of the word >low recall  

When reviewing task, please focus on:
We predict that recall increases in (2) compared to (1); precision will 
increase in (3)  
would these predictions be borne out? 

After the review, please 
add results (percentages of recall and precision for each matrices) 
to InuktituteSearch wiki page.

After the review, the expected next step is to:
combine scripts to tag words (based on roots) and find suffixes 
write script to find words and truncate suffixes 

Original issue reported on code.google.com by hisako...@gmail.com on 12 Nov 2011 at 1:21

GoogleCodeExporter commented 9 years ago

Original comment by a...@ilanguage.ca on 25 Nov 2011 at 4:26

GoogleCodeExporter commented 9 years ago

Original comment by a...@ilanguage.ca on 25 Nov 2011 at 4:35

GoogleCodeExporter commented 9 years ago

Original comment by a...@ilanguage.ca on 25 Nov 2011 at 10:26

GoogleCodeExporter commented 9 years ago
get perspective morphology-in-search (what morphology means to programmers) 

http://gina.openlanguage.ca/research/inuktitut/090416Morphological_parsing_inukt
itut-v3.pdf

Original comment by hisako...@gmail.com on 29 Nov 2011 at 12:24