Open Zhou-Xun opened 1 year ago
Thanks Xun for the report.
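A workaround would be:
def score_and_save_text(df):
    # text_scorer() returns a transformer; apply it to the incoming results
    inner = colbert.text_scorer()(df)
    # re-attach the text column, which the scorer drops from its output
    return df[['qid', 'docno', 'text']].merge(inner, on=['qid', 'docno'])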
and replace colbert.text_scorer() in the pipeline with pt.apply.generic(score_and_save_text).
I'll have a think about how to change the implementation to fix the underlying issue.
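To illustrate why the merge in the workaround restores the text column, here is a minimal pandas-only sketch. It does not use ColBERT: `fake_scorer` is a hypothetical stand-in that, like the behaviour described in this issue, returns only `qid`, `docno` and a new `score`. The example data and the length-based score are made up for illustration.

```python
import pandas as pd

def fake_scorer(df: pd.DataFrame) -> pd.DataFrame:
    # Hypothetical stand-in for the reranker (NOT the real ColBERT scorer):
    # it keeps only qid/docno and attaches a new score, dropping "text".
    out = df[["qid", "docno"]].copy()
    out["score"] = [float(len(t)) for t in df["text"]]
    return out

def score_and_keep_text(df: pd.DataFrame) -> pd.DataFrame:
    inner = fake_scorer(df)
    # Take only the key columns plus "text" from the input, so the old
    # first-stage score does not collide with the new one during the merge.
    return df[["qid", "docno", "text"]].merge(inner, on=["qid", "docno"])

# Made-up first-stage results, e.g. from BM25 with text metadata attached
candidates = pd.DataFrame({
    "qid": ["q1", "q1"],
    "docno": ["d1", "d2"],
    "text": ["short", "a longer passage"],
    "score": [2.0, 1.0],
})

reranked = score_and_keep_text(candidates)
print(reranked)
```

Selecting `['qid', 'docno', 'text']` before merging is what keeps the result free of `score_x`/`score_y` columns. Note that this sketch does not recompute a `rank` column.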
Hello,
I'm trying to use colbert.text_scorer() to rerank results in the pipeline, but there seems to be no option to include the metadata: the output of colbert.text_scorer() only returns the docno. Therefore, even though my pipeline below has already included the text when doing BM25, I still need to go back to my data to match the text for each docno.