ropensci / unrtf

Wrapper for 'unrtf' utility to extract text from RTF documents
https://docs.ropensci.org/unrtf

unrtf

Project Status: Active. The project has reached a stable, usable state and is being actively developed.

Extract Text from Rich Text Format (rtf) Documents

Wraps the unrtf utility to extract text from rtf files.

Installation

install.packages("unrtf")

Hello World

The package has only a single function, unrtf(). It takes either a local file path or a URL to an RTF document:

library(unrtf)
text <- unrtf("https://jeroen.github.io/files/sample.rtf", format = "text")
html <- unrtf("https://jeroen.github.io/files/sample.rtf", format = "html")
cat(text)
###  Translation from RTF performed by UnRTF, version 0.21.9 
### font table contains 11 fonts total

TITLE: It is an example test rtf-file to RTF2XML bean for testing

AUTHOR: kissj
### creation date: 17 April 2000 15:34 
### revision date: 19 April 2000 09:34 
### total pages: 2
### total words: 217
### total chars: 1240

-----------------
It is an example test rtf-file to RTF2XML bean for testing

Font size 10, plain text;
Font size 12, bold text. Underline,bold text.
 Underline,italic,bold text. 
Font size 22, plain text.
 Bold text.
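
In the text output above, UnRTF puts a header (comment lines starting with ###, plus TITLE and AUTHOR fields) before the document body, with a line of dashes between them. A minimal sketch of separating the two in base R follows. It assumes the first dashes-only line marks the end of the header, which holds for this sample but may not hold for every document. split_unrtf_text() is a hypothetical helper, not part of the unrtf package, and the input is a shortened copy of the output above so the sketch runs without the package or network access.

```r
# Shortened copy of the text output shown above; in practice this would be
# the string returned by unrtf(..., format = "text").
text <- paste(
  "###  Translation from RTF performed by UnRTF, version 0.21.9 ",
  "### font table contains 11 fonts total",
  "",
  "TITLE: It is an example test rtf-file to RTF2XML bean for testing",
  "",
  "AUTHOR: kissj",
  "### creation date: 17 April 2000 15:34 ",
  "",
  "-----------------",
  "It is an example test rtf-file to RTF2XML bean for testing",
  "",
  "Font size 10, plain text;",
  sep = "\n"
)

# Hypothetical helper (not provided by the unrtf package).
split_unrtf_text <- function(text) {
  lines <- strsplit(text, "\n", fixed = TRUE)[[1]]
  # Assumption: the first line consisting only of dashes ends the header.
  sep <- grep("^-+$", lines)[1]
  if (is.na(sep)) {
    # No separator found: treat everything as body.
    return(list(header = character(0), body = lines))
  }
  list(
    header = lines[seq_len(sep - 1)],  # everything before the separator
    body = lines[-seq_len(sep)]        # everything after the separator
  )
}

parts <- split_unrtf_text(text)

# Pull the title out of the header, dropping the "TITLE: " prefix.
title <- sub("^TITLE: ", "", grep("^TITLE: ", parts$header, value = TRUE))

cat(parts$body, sep = "\n")
```

If a document body could itself contain a dashes-only line, only the first one is treated as the separator, so the rest of the body is left intact.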