neulab / contextual-mt

A repository with the code related to experiments around context-aware machine translation
48 stars 9 forks source link

Why scat context data do not have <BRK> tags? #14

Open zihanlalala opened 3 years ago

zihanlalala commented 3 years ago

Dear author,

The attn_reg_transformer takes sentences separated by "< BRK >" as input, but I find that in scat's context data, there is no "< BRK >" to separate sentences. Does it affect the performance?

Meanwhile, for models which needs a separator between sentences, it might be difficult to use the scat data. Could you please provide a scat version with "< BRK >" separating sentences?