AllanCameron / PDFR

An R package to extract text from pdf.
Other
36 stars 3 forks source link

Error: Couldn't open file. #9

Open funarog opened 2 months ago

funarog commented 2 months ago

Unable to open a file.

pdfpage("Geometrical Representation of Categorial Grammar Abrusci_SYSMICS2016.pdf",1) Error: Couldn't open file.

I have been using pdftools with no issues. Just installed PDFR

funarog commented 2 months ago

More info:

Initially installed using pak, then uninstalled. Tried installing using devtools :: install_github.

Still no luck.

>  library(devtools)
Loading required package: usethis
> install_github("AllanCameron/PDFR")
Using GitHub PAT from the git credential store.
Downloading GitHub repo AllanCameron/PDFR@HEAD
Using GitHub PAT from the git credential store.
Downloading GitHub repo AllanCameron/PDFR@HEAD
── R CMD build ───────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────
✔  checking for file ‘/private/var/folders/y6/ymyjp7qs571_61dp_7kvb6n40000gn/T/RtmpI7dpDM/remotese62557e27bfb/AllanCameron-PDFR-955c122/DESCRIPTION’ (585ms)
─  preparing ‘PDFR’:
✔  checking DESCRIPTION meta-information
─  cleaning src
─  checking for LF line-endings in source and make files and shell scripts
─  checking for empty or unneeded directories
   Omitted ‘LazyData’ from DESCRIPTION
─  building ‘PDFR_0.1.0.tar.gz’

Installing package into ‘/usr/local/lib/R/4.4/site-library’
(as ‘lib’ is unspecified)
* installing *source* package ‘PDFR’ ...
** using non-staged installation via StagedInstall field
** libs
using C++ compiler: ‘Apple clang version 14.0.0 (clang-1400.0.29.202)’
using C++11
using SDK: ‘’
clang++ -std=gnu++11 -I"/usr/local/Cellar/r/4.4.1/lib/R/include" -DNDEBUG  -I'/usr/local/lib/R/4.4/site-library/Rcpp/include' -I'/usr/local/lib/R/4.4/site-library/testthat/include' -I/usr/local/opt/gettext/include -I/usr/local/opt/readline/include -I/usr/local/opt/xz/include -I/usr/local/include    -fPIC  -g -O2   -c RcppExports.cpp -o RcppExports.o
... # more clang++
clang++ -std=gnu++11 -dynamiclib -Wl,-headerpad_max_install_names -undefined dynamic_lookup -L/usr/local/Cellar/r/4.4.1/lib/R/lib -L/usr/local/opt/gettext/lib -L/usr/local/opt/readline/lib -L/usr/local/opt/xz/lib -L/usr/local/lib -o PDFR.so RcppExports.o adobetounicode.o box.o charstring.o chartounicode.o corefonts.o crypto.o deflate.o dictionary.o document.o encoding.o font.o glyphwidths.o letter_grouper.o line_grouper.o object_class.o page.o parser.o pdfr.o streams.o test-PDFR.o test-runner.o text_element.o textbox.o tokenizer.o truetype.o unicodetoadobe.o utilities.o whitespace.o word_grouper.o xref.o -L/usr/local/Cellar/r/4.4.1/lib/R/lib -lR -lintl -Wl,-framework -Wl,CoreFoundation
ld: warning: -undefined dynamic_lookup may not work with chained fixups
installing to /usr/local/lib/R/4.4/site-library/PDFR/libs
** R
** inst
** byte-compile and prepare package for lazy loading
** help
*** installing help indices
** building package indices
** testing if installed package can be loaded
* DONE (PDFR)
Error in fetch(key) : 
  lazy-load database '/usr/local/lib/R/4.4/site-library/PDFR/help/PDFR.rdb' is corrupt
Error in fetch(key) : 
  lazy-load database '/usr/local/lib/R/4.4/site-library/PDFR/help/PDFR.rdb' is corrupt
> PDFR::pdfpage("Geometrical Representation of Categorial Grammar Abrusci_SYSMICS2016.pdf",1)
Error: Couldn''t open file.
> barcodes <- system.file("extdata", "barcodes.pdf", package = "PDFR")
> PDFR::pdfpage(barcodes,1)
                              text  left right bottom   top    font size
1                             None  53.5  74.4  774.2 782.2 Courier    8
2                   Acute medicine 187.4 255.9  774.2 782.2 Courier    8
3                                / 258.8 264.8  774.2 782.2 Courier    8
4                             ward 267.8 288.6  774.2 782.2 Courier    8
5                               11 291.6 303.5  774.2 782.2 Courier    8
6 jean.cairney@ggc.scot.nhs.uk0141 318.3 470.1  774.2 782.2 Courier    8
7                              211 473.0 490.9  774.2 782.2 Courier    8
8                             5719 493.9 514.7  774.2 782.2 Courier    8