pmhalvor / fgsa

Main repo for my Masters thesis on Fine Grained Sentiment Analysis
4 stars 0 forks source link

expand labels #1

Closed pmhalvor closed 2 years ago

pmhalvor commented 2 years ago

Need to match label length to tokenized ids in Norec(Dataset).

The first iteration of this will given each sub-word the original token of the parent, meaning some BIO-sequences may have multiple B'. This could affect training capability, and should be explored further.