SheetJS / js-ppt

Pure JS PowerPoint 97-2003 (PPT) Parser
Other
105 stars 19 forks source link

Does not parse .pptx at all #3

Open AlexanderTserkovniy opened 9 years ago

AlexanderTserkovniy commented 9 years ago
c:\Users\...\AppData\Roaming\npm\node_modules\ppt\node_modules\cfb\cfb.js:76
        if(m !== hexstr) throw fld + 'Expected ' + hexstr + ' saw ' + m;
                                                                    ^
Header Signature: Expected d0cf11e0a1b11ae1 saw 504b030414000600

After converting to old .ppt type 2003 year old, throw ne type of error:

$ ppt test2.ppt

c:\Users\...\AppData\Roaming\npm\node_modules\ppt\ppt.js:10
var parsenoop = function(blob, length) { throw new Error("n"); };
                                               ^
Error: n
    at Object.parsenoop [as f] (c:\Users\...\AppData\Roaming\npm\node_mo
dules\ppt\ppt.js:10:48)
    at recordhopper (c:\Users\...\AppData\Roaming\npm\node_modules\ppt\p
pt.js:22:15)
    at Object.parse_OfficeArtClientTextbox [as f] (c:\Users\...\AppData\
Roaming\npm\node_modules\ppt\ppt.js:497:2)
    at recordhopper (c:\Users\...\AppData\Roaming\npm\node_modules\ppt\p
pt.js:22:15)
    at Object.parse_OfficeArtSpContainer [as f] (c:\Users\...\AppData\Ro
aming\npm\node_modules\ppt\ppt.js:65:2)
    at recordhopper (c:\Users\...\AppData\Roaming\npm\node_modules\ppt\p
pt.js:22:15)
    at Object.parse_OfficeArtSpgrContainer [as f] (c:\Users\...\AppData\
Roaming\npm\node_modules\ppt\ppt.js:85:2)
    at recordhopper (c:\Users\...\AppData\Roaming\npm\node_modules\ppt\p
pt.js:22:15)
    at Object.parse_OfficeArtDgContainer [as f] (c:\Users\...\AppData\Ro
aming\npm\node_modules\ppt\ppt.js:51:2)
    at recordhopper (c:\Users\...\AppData\Roaming\npm\node_modules\ppt\p
pt.js:22:15)

Could you help me please with this issue! I really need this module, it is very important for me!

dbashford commented 9 years ago

I don't believe its the job of this module to parse .pptx. That is an entirely different thing. There are other tools that'll do that for you.

It definitely doesn't quite do all it needs to in order to parse .ppt though. See #1.

AlexanderTserkovniy commented 9 years ago

Omg ... it is bad :( mb you know such plugin? I thought that your plugin has this functionality.

dbashford commented 9 years ago

You could try https://github.com/dbashford/textract, it does PPTX among other things. I'm hopeful I'll be able to add ppt support to it in the future, but it does not have it. Waiting on little issues with this lib to be ironed out or for another one to pop up.