architshukla / Plagiarism-Checker

A utility to check if a document's contents are plagiarised
GNU General Public License v3.0
256 stars 102 forks source link

DOCX #11

Closed shashankrao closed 11 years ago

shashankrao commented 11 years ago

Docx can be implemented, by using the python-docx module.

And then dumping the docx text contents into a txt file.

I shall do this in a seperate branch.

Importance: Medium

architshukla commented 11 years ago

Sounds awesome. We can also look into pdf files later cause most of the papers are in that format. Python probably has a module for that too.

shashankrao commented 11 years ago

Fixed along with utf-8 issue.