-
here's a pair of odd ones:
https://greenelab.github.io/annorxiver-journal-recommender/?doi=10.1101/2020.06.28.176180
https://greenelab.github.io/annorxiver-journal-recommender/?doi=10.1101/447862
t…
-
I working in domain: CV layout analysis.
I have a question. I using xml format to get position of char,word,sentence to layout analysis, but pdf2text always return 1 line in tag . I want textlines th…
-
Thanks a lot for your work! I find it really useful.
A wish: I would like to monitor a website which contains a list of links to pdf files and would like to know if the pdfs change. I could imagine…
-
Hello;
Detailed Province of Ontario data is available in CSV format at
https://data.ontario.ca/dataset/f4f86e54-872d-43f8-8a86-3892fd3cb5e6/resource/ed270bb8-340b-41f9-a7c6-e8ef587e6d11/downlo…
-
current URL https://www.ge.ch/document/point-coronavirus-maladie-covid-19/telecharger now returns 403 error.
Information is now available at https://www.ge.ch/document/covid-19-situation-epidemiolo…
-
"fi" The two letters are not recognized when they are joined together, to "??".
pdf eg. https://arxiv.org/pdf/1503.03832.pdf
-
node/pdf2text.jsでurlとして宣言されている厚生労働省のホームページのpdfにアクセスすることができません。
また、このURLは逐次更新されるものですか。
-
Running the TICCL pipeline on [mwsel.pdf](https://github.com/LanguageMachines/ticcltools/files/5200248/mwsel.pdf) (via Piroska Lendvai), using text extracted from the PDF, FoLiA-correct fails with a t…
-
Using multiprocessing.dummy.Pool I get sometimes the following error. This happens not very often and after hundreds of convertions, but it can happen.
I think it's due to the fact that generators …
-
Hi there,
I'm currently trying to use pdfminer within a jupyter notebook to convert pdf files to text but fail miserably :/ I know that you provide the command line tool pdf2text.py, but isn't this…