koursaros-ai / nboost

NBoost is a scalable, search-api-boosting platform for deploying transformer models to improve the relevance of search results on different platforms (e.g. Elasticsearch).
Apache License 2.0
674 stars, 69 forks

cannot index csv file into nboost #83

Open ghost opened 3 years ago

ghost commented 3 years ago

Hi @pertschuk and @colethienes,

Hope you are all well!

I cannot index this dump https://paper2code.com/public/nboost_dump.txt.tar.gz into nboost.

Here is my command line:

nboost-index --file ./nboost_dump.txt --index_name paper2code --delim , --id_col

What is wrong with the format of the dump? Is there a quotechar parameter?

By the way, is there a way to support a tab-delimited format?

Thanks in advance for any insights or inputs on that issue.

Cheers, X
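
One possible workaround for the delimiter question, independent of nboost itself, is to rewrite the dump before indexing. The sketch below assumes the dump is tab-delimited and that fields containing commas need to be quoted; neither is confirmed in this thread, and `convert_delimited` is a hypothetical helper, not part of nboost. It uses only Python's standard `csv` module.

```python
import csv
import io


def convert_delimited(src, dst, in_delim="\t", out_delim=","):
    """Copy rows from src to dst, changing the delimiter.

    Hypothetical helper (not part of nboost). Quote characters in the
    input are treated as literal text; on output, any field containing
    the output delimiter, a quote, or a newline is quoted.
    """
    reader = csv.reader(src, delimiter=in_delim, quoting=csv.QUOTE_NONE)
    writer = csv.writer(
        dst,
        delimiter=out_delim,
        quoting=csv.QUOTE_MINIMAL,
        lineterminator="\n",
    )
    count = 0
    for row in reader:
        writer.writerow(row)
        count += 1
    return count  # number of rows written


# Small in-memory demonstration with made-up rows.
demo_in = io.StringIO("1\tHello, world\n2\tplain\n")
demo_out = io.StringIO()
rows_written = convert_delimited(demo_in, demo_out)
```

For real files, open both with `newline=""` as the `csv` documentation recommends, then pass the file objects to the function. Whether `nboost-index` honors quoted fields in the converted file is not established here and would need to be checked.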

ghost commented 3 years ago

@pertschuk and @colethienes

Is it possible to add extra columns to the Elasticsearch index? That would be really useful.

Please, I need an answer ^^
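
As a rough illustration of the extra-columns idea, the sketch below builds an Elasticsearch bulk request body by hand, keeping every CSV column as a document field. It bypasses `nboost-index` entirely. The column names (`id`, `passage`, `title`) are assumptions made for the example, and `bulk_lines` is a hypothetical helper. Whether nboost passes such extra fields through when reranking is not confirmed in this thread.

```python
import csv
import io
import json


def bulk_lines(rows, index_name, id_field):
    """Yield NDJSON lines for the Elasticsearch bulk API.

    Hypothetical helper: each row (a dict) becomes an action line
    followed by a source line holding all columns except the id.
    """
    for row in rows:
        action = {"index": {"_index": index_name, "_id": row[id_field]}}
        yield json.dumps(action)
        doc = {key: value for key, value in row.items() if key != id_field}
        yield json.dumps(doc)


# Made-up sample with one extra column ("title") beyond id and passage.
sample = io.StringIO("id,passage,title\n7,some text,A title\n")
lines = list(bulk_lines(csv.DictReader(sample), "paper2code", "id"))

# The bulk API expects the body to end with a newline.
body = "\n".join(lines) + "\n"
```

The resulting `body` could be sent to the cluster's `_bulk` endpoint with the `application/x-ndjson` content type.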