KnightZhang625 / Enhanced-RCNN_TensorFlow

TensorFlow implementation for the Enhanced-RCNN
Apache License 2.0

About the format of vocab_idx #3

Closed. victory-hsu closed this issue 4 years ago

victory-hsu commented 4 years ago

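Hello, I'd like to ask about the format and structure of your dataset, and about the form of vocab_idx. Is there any processing such as sorting by descending term frequency?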

victory-hsu commented 4 years ago

Hello, another question: does your vocab_idx have the same structure as BERT's, with CLS and SEP marker tokens?

KnightZhang625 commented 4 years ago

Data: a list of (que, ans) tuples. You can see how I create vocab_idx at the beginning of load_data.py; no term frequency is used.
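
For readers who want a concrete picture, here is a minimal sketch of one way to build a vocab_idx from a list of (que, ans) tuples without any term-frequency sorting. It is not taken from load_data.py: the function name build_vocab_idx, the whitespace tokenization, and the first-appearance index order are all assumptions made for illustration, and no special tokens are added because the thread does not say whether the repository uses any.

```python
from typing import Dict, List, Tuple


def build_vocab_idx(data: List[Tuple[str, str]]) -> Dict[str, int]:
    """Map each token to an index in order of first appearance.

    Hypothetical helper: tokens are split on whitespace (an assumption),
    and no frequency counting or sorting is performed.
    """
    vocab_idx: Dict[str, int] = {}
    for que, ans in data:
        for token in que.split() + ans.split():
            if token not in vocab_idx:
                # The next free index is simply the current vocabulary size.
                vocab_idx[token] = len(vocab_idx)
    return vocab_idx


# Toy data in the (que, ans) tuple format described above.
data = [("how are you", "i am fine"), ("who are you", "i am a bot")]
vocab_idx = build_vocab_idx(data)
print(vocab_idx)
```

With this scheme, a token's index depends only on where it first occurs in the data, so reordering the dataset changes the mapping; the vocabulary should be built once and saved if indices need to stay stable.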