sambitdash / PDFIO.jl

PDF Reader Library for Native Julia.
Other
127 stars 13 forks source link

tocPDF #66

Open aminya opened 5 years ago

aminya commented 5 years ago

I have created a repository which the plan is to auto-generate bookmarks from the table of contents already available at the beginning of pdf files. https://github.com/aminya/tocPDF

For now, I plan to start using available software (e.g k2pdfoptdoes), and then later make the functionality Julia native (when you add pdf write capability).

Current algorithm plan: https://github.com/aminya/tocPDF#automated

I looked at the PDFIO doc, however, it is a long one, and it has many functions. Could you help me start using PDFIO?

if anyone is interested in participation, that will be awesome. (@kskyten @sambitdash )

sambitdash commented 5 years ago

@aminya thanks a lot for your interest in PDFIO project. While I will love to see the project become a PDF writer, I personally cannot devote any time on it other than reviewing and merging PRs or contributing to the library design. Moreover, PDFIO is a low level API needing detailed understanding of PDF specification to be able to do advanced programming with it. A similar discussion is also in the issue #56. Moreover, the APIs are enhanced for encrypted PDF and digital signatures. Those capabilities make writer consistency even harder. However, if you need any help in using or extending PDFIO, I will be happy to support.

aminya commented 5 years ago

@sambitdash You're welcome. I have found a Java program for now that I call using JavaCall.

To draw more attention regarding developing PDFIO consider adding your repository to https://github.com/JuliaIO?page=1 JuliaIO organization, which is the group for a unified Julia IO infrastructure.

sambitdash commented 5 years ago

@aminya making PDFIO part of FileIO interface which is most JuliaIO projects are based on have its own challenges. Most FileIO projects load a file and read it from beginning to end. While PDFIO is an object model of the underlying PDF document. PDFs are used for varying different needs. Extracting internal data is just one of the aspects. One can write an interface for FileIO but again something not highest on my priorities right now. But anyone interested to work on it is most welcome to write a shim layer for the same.