ghpaetzold / questplusplus

Pipelined quality estimation.
49 stars 14 forks source link

Missing resources on WMT shared tasks #43

Open deepakagr-007 opened 6 years ago

deepakagr-007 commented 6 years ago

Hi,

I am trying to get the Quest++ tool working for sentence level English to German locale. I couldn't find the giza table for english-german on WMT15 . The link provided on the webpage is incorrect and points to German-English table (which cannot be used for English-German).

Also I couldn't find the English and German corpora, LM of POS tags and truecase model for WMT16 and WMT17 tasks and is needed to be specified in config file for sentence level feature extraction. Can these be leveraged from WMT15 task?

Could you please help to provide the missing resources. Thanks!

deepakagr-007 commented 6 years ago

Can anyone help me on this one. @carolscarton @ghpaetzold ??

lspecia commented 6 years ago

Hi,

For the en-de 2015: https://www.quest.dcs.shef.ac.uk/quest_files/lex.en-de

For the 2017 datasets, we cannot provide the source and target corpora as these are proprietary resources. To be honest, you will not gain much from using these resources and the LM POS in addition to what we provided as baseline if what you are after is the complete set of QuEst features. But if you really need them, I could see if I can make them available for the task participation only.

Best, Lucia

On 17 March 2018 at 03:29, deepakagr-007 notifications@github.com wrote:

Can anyone help me on this one. @carolscarton https://github.com/carolscarton @ghpaetzold https://github.com/ghpaetzold ??

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/ghpaetzold/questplusplus/issues/43#issuecomment-373891015, or mute the thread https://github.com/notifications/unsubscribe-auth/ABRaBHX1SKRXyg5ZjI7cx0ppnJIcpAIKks5tfIMzgaJpZM4Sr3XV .

-- Lucia www.dcs.shef.ac.uk/~lucia/

deepakagr-007 commented 6 years ago

@lspecia @carolscarton @ghpaetzold : I was trying to run the Quest++ model for English-French but couldn't find the resources on WMT shared tasks. Could you please point me in the correct direction.

lspecia commented 6 years ago

Hi, you'd have to build them from WMT parallel corpora. I think there was English-French at WMT14 and if not, the years before.

Best, Lucia

On 15 September 2018 at 16:58, deepakagr-007 notifications@github.com wrote:

@lspecia https://github.com/lspecia @carolscarton https://github.com/carolscarton @ghpaetzold https://github.com/ghpaetzold : I was trying to run the Quest++ model for English-French but couldn't find the resources on WMT shared tasks. Could you please point me in the correct direction.

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/ghpaetzold/questplusplus/issues/43#issuecomment-421591036, or mute the thread https://github.com/notifications/unsubscribe-auth/ABRaBOeL6OFllMWOdDj782Awn7MJS_Dsks5ubSOIgaJpZM4Sr3XV .

-- Lucia www.dcs.shef.ac.uk/~lucia/