texTract

Based on a 2008 university project originally written in Perl, texTract is a lightweight, minimalist Python module that takes a .txt file and provides context for any given token in the input corpus.

It was originally meant to be used from the command line to obtain basic insights from various novels in .txt format.

texTract provides two main features:

Get Context

Outputs the 5 preceding and 5 following tokens for every occurrence of the input token.
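
The idea can be illustrated with a minimal sketch. This is not the code from textract.py: the function name `get_context`, the `window` parameter, and the naive regex tokenisation (lowercased runs of word characters) are all assumptions made for illustration.

```python
import re


def get_context(text, token, window=5):
    """Return a (left, match, right) tuple for each occurrence of token.

    Hypothetical helper, not texTract's actual API.
    """
    # Assumed tokenisation: lowercase, then keep runs of word characters.
    tokens = re.findall(r"\w+", text.lower())
    target = token.lower()
    results = []
    for i, tok in enumerate(tokens):
        if tok == target:
            # Up to `window` tokens on each side; fewer near the edges.
            left = tokens[max(0, i - window):i]
            right = tokens[i + 1:i + 1 + window]
            results.append((left, tok, right))
    return results


sample = "The whale surfaced near the ship and the crew watched the whale dive again"
for left, tok, right in get_context(sample, "whale"):
    print(" ".join(left), f"[{tok}]", " ".join(right))
```

Near the start or end of the corpus, fewer than 5 tokens are available on one side, so the sketch simply returns the shorter slice.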

Get Summary

Outputs some very basic statistics for the input token, as well as an array of other noteworthy tokens to explore.
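
A rough sketch of what such a summary could look like follows. The README does not say which statistics are reported or how noteworthy tokens are chosen, so everything here is an assumption: the function name `get_summary`, the statistics shown (occurrence count, corpus size, relative frequency), and the choice of the most frequent other tokens as the "noteworthy" ones.

```python
import re
from collections import Counter


def get_summary(text, token, top_n=5):
    """Return basic counts for token plus other frequent tokens.

    Hypothetical helper, not texTract's actual API.
    """
    # Assumed tokenisation: lowercase, then keep runs of word characters.
    tokens = re.findall(r"\w+", text.lower())
    counts = Counter(tokens)
    target = token.lower()
    total = len(tokens)
    occurrences = counts[target]
    # Assumption: "noteworthy" means the most frequent tokens other than the target.
    others = [t for t, _ in counts.most_common() if t != target][:top_n]
    return {
        "token": target,
        "occurrences": occurrences,
        "total_tokens": total,
        "relative_frequency": occurrences / total if total else 0.0,
        "other_tokens": others,
    }


sample_text = "the whale and the ship and the whale"
print(get_summary(sample_text, "whale", top_n=2))
```

On a real novel, a frequency-based list like this would be dominated by function words such as "the" and "and", so a stop-word filter would likely be needed before the suggestions become useful.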

How to use texTract

Open your terminal
Place textract.py inside a folder
Place any .txt file inside the same folder
Run textract.py

Example

TBC

julien-blanchard / texTract
