Open riemann85 opened 4 years ago
Hi, I analyzed the issue: the problem lies in the format of the dataframe passed to the fit_retriever() method. QAPipeline's fit_retriever() works fine for a df formatted like the bnp one. May I ask what format the df dataframe must have (a dataframe with title and paragraphs columns)?
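To make the question concrete, here is a minimal sketch of the format I believe is expected, assuming (as far as I understand from the cdQA README) one row per document with a title string and a paragraphs list of non-empty strings. The helper find_format_problems is hypothetical and not part of cdQA; it works on plain records so it runs without pandas, and with pandas the same shape comes from df.to_dict("records").

```python
from typing import Any, Dict, List

# Column names taken from the question above; the list-of-strings layout
# for "paragraphs" is an assumption, not something confirmed in this thread.
REQUIRED_COLUMNS = ("title", "paragraphs")


def find_format_problems(rows: List[Dict[str, Any]]) -> List[str]:
    """Hypothetical helper (not part of cdQA): describe format problems per row."""
    problems = []
    for i, row in enumerate(rows):
        # Report every required column that is absent from this row.
        for col in REQUIRED_COLUMNS:
            if col not in row:
                problems.append(f"row {i}: missing column '{col}'")
        if "paragraphs" not in row:
            continue
        paragraphs = row["paragraphs"]
        if not isinstance(paragraphs, list):
            problems.append(f"row {i}: 'paragraphs' is not a list")
        elif not paragraphs:
            problems.append(f"row {i}: 'paragraphs' is empty")
        elif not all(isinstance(p, str) and p.strip() for p in paragraphs):
            problems.append(f"row {i}: 'paragraphs' contains a blank or non-string entry")
    return problems


# Illustrative records only; the titles and texts are made up.
good_rows = [{"title": "Doc A", "paragraphs": ["First paragraph.", "Second paragraph."]}]
bad_rows = [{"title": "Doc B", "paragraphs": []}, {"title": "Doc C"}]

print(find_format_problems(good_rows))
print(find_format_problems(bad_rows))
```

Whether an empty paragraphs list is what actually triggers the error here is not established; the check only helps rule that out before calling fit_retriever().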
Describe the bug
Replicating the QAPipeline from your example raises an error in fit_retriever() related to numpy.core.fromnumeric.
To Reproduce
Steps to reproduce the behavior: run tutorial-use-pdf-converter.ipynb
cdqa_pipeline = QAPipeline(reader='./models/bert_qa.joblib', max_df=1.0)
# Fit Retriever to documents
cdqa_pipeline.fit_retriever(df=df)
Screenshots
ValueError Traceback (most recent call last)