Closed peter0302 closed 9 years ago
Sorry my copy was way outdated, I see that this has already been officially added. Cheers!
// add to PdfDocument.cs
// returns text of whole pdf document public string GetText() { string strReturn = ""; for (int i = 0; i < PageSizes.Count; i++) { strReturn += GetText(i) + "\r\n"; } return strReturn; }
Hey guys - I needed simple text extraction and went ahead and added it in my copy. Very easy to do. The changes are:
NativeMethods.cs, NativeMethods, add:
PdfFile.cs, PdfFile, add:
PdfDocument.cs, PdfDocument, add:
Just want to know if you'd prefer I fork this or if you want to incorporate it yourself. If I do fork it I'll probably add some more sophisticated extraction such as obtaining the coordinates of the text.
Also, are we still under LGPL3 or did you switch to Apache? I saw the issue was closed but the license still says LGPL3. Fine either way.
Peter