CeON / CERMINE

Content ExtRactor and MINEr
GNU Affero General Public License v3.0
484 stars 99 forks source link

no bibtex output in v1.13 #68

Open eds-slim opened 6 years ago

eds-slim commented 6 years ago

Hi, when I download the lates version 1.13 from the repository, there seems to be no bibtex option?

Unknown output types: [bibtex]

Usage: ContentExtractor -path <path> [optional parameters]

Tool for extracting metadata and content from PDF files.

Arguments:
  -path <path>           path to a directory containing PDF files
  -outputs <list>        (optional) comma-separated list of extraction
                         output(s); possible values: "jats" (document
                         metadata and content in NLM JATS format), "text"
                         (raw document text), "zones" (text zones with
                         their labels), "trueviz" (geometric structure in
                         TrueViz format), "images" (images from the
                         document); default: "jats,images"
  -exts <list>           (optional) comma-separated list of extensions of the
                         resulting files; the list has to have the same
                         length as output list; default: "cermxml,images"
  -override              override already existing files
  -timeout <seconds>     (optional) approximate maximum allowed processing
                         time for a PDF file in seconds; by default, no
                         timeout is used; the value is approximate because in
                         some cases, the program might be allowed to slightly
                         exceeded this time, say by a second or two
  -configuration <path> (optional) path to configuration properties file
                         see https://github.com/CeON/CERMINE
                         for description of available configuration properties

Is that intentional?

dtkaczyk commented 6 years ago

@eds-slim At the moment we do not export bibliographic references from PDFs in BibTeX. We might add this in the future, though.

dtkaczyk commented 6 years ago

@eds-slim The current official release, 1.13, does not contain this feature, but you will find it in the current snapshot: http://maven.ceon.pl/artifactory/webapp/#/artifacts/browse/simple/General/kdd-snapshots/pl/edu/icm/cermine/cermine-impl/1.14-SNAPSHOT I cannot say at the moment when the next release including the bibtex output feature will happen.

resourcesunite commented 3 years ago

The bibtex option also works if you build the code yourself and run with the following parameters: -path your_path_to_pdfs -outputs bibtex