computerline1z / okapi

Automatically exported from code.google.com/p/okapi
0 stars 0 forks source link

PPTX slides not extracted in order #325

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
The slides of PPTX files are not extracted in the order they are used.

Original issue reported on code.google.com by yves.sav...@gmail.com on 9 Apr 2013 at 2:58

GoogleCodeExporter commented 9 years ago
Very simple testcase attached.  It looks like the filter is depending on the 
ordering of the slides in the ZIP archive.  The correct way to do this is to 
parse ppt/_rels/presentation.xml.rels and look for all relationships of type 
http://schemas.openxmlformats.org/officeDocument/2006/relationships/slide, and 
use that data to map IDs to the slide XML files in the archive.  Then parse 
ppt/presentation.xml and use the IDs to figure out the correct ordering of the 
slide data.

Original comment by tingley on 10 Apr 2013 at 4:54

Attachments:

GoogleCodeExporter commented 9 years ago
Attaching fresh copy of that PPTX since it looks like I corrupted it by mistake 
when I was messing with it.

Original comment by tingley on 20 Apr 2013 at 4:26

Attachments:

GoogleCodeExporter commented 9 years ago
Fixed on dev, commit 1283327f7a31d9def3c7427ab7f6e4acc52fff70

Original comment by tingley on 24 Apr 2013 at 10:22