-
First of all, thanks for this project! On some of my pdf Documents, the exact hierarchical document structure was extracted.
Unfortunately, the result was not that good for some documents, especial…
-
## 📸 코드
```python
import faiss
from sklearn.feature_extraction.text import TfidfVectorizer
from tqdm.auto import tqdm
import pandas as pd
import pickle
import json
import os
import numpy as np
fr…
-
First, thanks for all the work on Anserini.
I might have missed it but is there no documentation/tutorial for using Anserini via the Java API? There are lots of examples in Python and there are lot…
-
**Question**
I have previously used In Memory Document Stores with tf-idf Retrieval, which was pretty quick to setup.
Now I have gone to the Faiss Document Store combined with Dense Passage Retrieva…
-
hi,大家好,非常高兴的告诉大家,百度飞桨论文复现赛第四期已经开始了,本次共将有100篇的经典&前沿论文供大家复现,详细信息可以参考[AI Studio](https://aistudio.baidu.com/aistudio/competition/detail/106),大家是否已经迫不及待了呢~
**注意:** 本次部分赛题与[人工智能创新应用大赛](https://aistudio.…
-
Hi, thank you so much for open-sourcing DPR! I have some questions about the [Retriever input data format](https://github.com/facebookresearch/DPR#retriever-input-data-format).
Given the paper, th…
-
**Bug description**
When I use following command you provided in projects/hallucination/README.md,
I cannot train retrieval-based models.
When I use the option --m instead of -m, it works.
**R…
-
# Motivation
With full confidence in the future development of Filecoin, Chinese storage providers are actively providing storage power, which accounts for a large proportion of the entire Filecoin n…
-
Hi,
I'm wondering whether there're codes for DPR + BM25 as described in your paper:
"In addition to DPR, we also present the results of BM25, the traditional retrieval method9 and BM25+DPR, usin…
-
We can fold in all the DPR collections into Pyserini, so we can do the retriever part of a QA system directly in Pyserini.