pdf2text Search Results

192 results for pdf2text

greenelab/preprint-similarity-search #38

Different Topic Preprints Close Together

here's a pair of odd ones: https://greenelab.github.io/annorxiver-journal-recommender/?doi=10.1101/2020.06.28.176180 https://greenelab.github.io/annorxiver-journal-recommender/?doi=10.1101/447862 t…

cgreene updated 4 years ago
1
pdfminer/pdfminer.six #437

How to know 2 textline is the same sentence

I am working in the domain of CV layout analysis and have a question. I am using the XML format to get the positions of chars, words, and sentences for layout analysis, but pdf2text always returns 1 line in tag . I want textlines th…

giangttbkhn updated 4 years ago
3
thp/urlwatch #183

diff(erent) data types

Thanks a lot for your work! I find it really useful. A wish: I would like to monitor a website which contains a list of links to pdf files and would like to know if the pdfs change. I could imagine…

julianuu updated 4 years ago
2
ccodwg/Covid19Canada #41

Detailed Ontario data available in CSV format

Hello; Detailed Province of Ontario data is available in CSV format at https://data.ontario.ca/dataset/f4f86e54-872d-43f8-8a86-3892fd3cb5e6/resource/ed270bb8-340b-41f9-a7c6-e8ef587e6d11/downlo…

walterdnes updated 4 years ago
3
openZH/covid_19 #590

GE: URL used in parser isn't accessible any more

The current URL https://www.ge.ch/document/point-coronavirus-maladie-covid-19/telecharger now returns a 403 error. Information is now available at https://www.ge.ch/document/covid-19-situation-epidemiolo…

dominikgehl updated 4 years ago
6
pdfminer/pdfminer.six #426

"fi" not recognized

The two letters "fi" are not recognized when they are joined together; they are converted to "??". Example PDF: https://arxiv.org/pdf/1503.03832.pdf

mengcius updated 4 years ago
1
code4sabae/covid19 #21

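Cannot access the Ministry of Health, Labour and Welfare PDF

I cannot access the PDF on the Ministry of Health, Labour and Welfare website that is declared as url in node/pdf2text.js. Also, is this URL updated on an ongoing basis?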

TBNV999 updated 4 years ago
1
LanguageMachines/foliautils #45

FoLiA-correct fails with text consistency error

Running the TICCL pipeline on [mwsel.pdf](https://github.com/LanguageMachines/ticcltools/files/5200248/mwsel.pdf) (via Piroska Lendvai), using text extracted from the PDF, FoLiA-correct fails with a t…

proycon updated 4 years ago
26
Belval/pdf2image #125

pdf2image seems not to be thread safe

Using multiprocessing.dummy.Pool, I sometimes get the following error. It does not happen often, only after hundreds of conversions, but it can happen. I think it's due to the fact that generators …

stavrakidis updated 4 years ago
6
pdfminer/pdfminer.six #93

converting to text outside of command line

Hi there, I'm currently trying to use pdfminer within a jupyter notebook to convert pdf files to text but fail miserably :/ I know that you provide the command line tool pdf2text.py, but isn't this…

cschwem2er updated 4 years ago
8