qe-team / marmot

MARMOT - the open source framework for feature extraction and machine learning, designed to estimate the quality of Machine Translation output
ISC License
21 stars 7 forks source link

representation generators should return data, not filenames #12

Closed chrishokamp closed 9 years ago

chrishokamp commented 9 years ago

representation generators should not return filenames, because it's not clear to the user what happened. Right now, representation generators write files which are implicitly whitespace tokenized, which are then read by create_contexts. If the file is not whitespace tokenized, the user would get junk output without knowing why.

The way to fix:

varvara-l commented 9 years ago

Done