ONLYOFFICE-QA / ooxml_parser

Ruby OOXML Parser
GNU Affero General Public License v3.0
86 stars 22 forks source link
docx ooxml ooxml-parser parser pptx ruby xlsx

ooxml_parser

_ooxmlparser is a Ooxml files (docx, xlsx, pptx) parser written in Ruby.

Installation

  1. Install magic lib, required by ruby-filemagic
    Mac OS:

    brew install libmagic

    Debian-Based Linux:

    sudo apt-get install libmagic-dev

    Centos-Based Linux:

    sudo yum install file-devel
  2. Install gem by command:

    gem install ooxml_parser

Usage

Parse a docx file

require 'ooxml_parser'
docx = OoxmlParser::Parser.parse('spec/document/document_properties/page_count.docx')
p docx.document_properties.pages # 2

Also check the example directory for more examples.

Configuration

Accuracy

Accuracy of digits in fraction part Default is 2 digits in fraction part

OoxmlParser.configure do |config|
  config.accuracy = 3
end

Changes

Renamed SeriesText#string to SeriesText#reference