PyThaiNLP / attacut

A Fast and Accurate Neural Thai Word Segmenter
https://pythainlp.github.io/attacut/
MIT License
79 stars 16 forks source link

What is the format of the input data? #29

Closed so-coolboy closed 3 years ago

so-coolboy commented 3 years ago

I want to continue training attacut on my own data set, but I am not sure what the format of the data set should be? 截图 The data set link here is no longer valid, so I cannot view the format of the data set, can you help me?

p16i commented 3 years ago

@so-coolboy perhaps, this issue might be of your interest https://github.com/PyThaiNLP/attacut/issues/20.

so-coolboy commented 3 years ago

thank you, 😊