texttron / tevatron

Tevatron - A flexible toolkit for neural retrieval research and development.
http://tevatron.ai
Apache License 2.0
531 stars 100 forks source link

Update dataset.py #153

Closed ChuanMeng closed 1 month ago

ChuanMeng commented 2 months ago

"--query_prefix" and "--passage prefix" already contain a space. So we should remove the space between prefix and query/doc during preprocessing.

MXueguang commented 1 month ago

thanks for the edits!