adobe / NLP-Cube

Natural Language Processing Pipeline - Sentence Splitting, Tokenization, Lemmatization, Part-of-speech Tagging and Dependency Parsing
http://opensource.adobe.com/NLP-Cube/index.html
Apache License 2.0
552 stars 93 forks source link

Major refactor + update package versions #138

Closed tiberiu44 closed 1 year ago

tiberiu44 commented 1 year ago

Overview

The PR updates the package versions used by NLP-Cube and adds some major refactoring to the code, also removing deprecated files from previous versions

Demo

In [1]: from cube.api import Cube

In [2]: cube=Cube()

In [3]: cube.load('en')

In [4]: cube('This is a simple test.')

Out[4]:
1   This    this    PRON    DT  Number=Sing|PronType=Dem    5   nsubj   _   _
2   is  be  AUX VBZ Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin   5   cop _   _
3   a   a   DET DT  Definite=Ind|PronType=Art   5   det _   _
4   simple  simple  ADJ JJ  Degree=Pos  5   amod    _   _
5   test    test    NOUN    NN  Number=Sing 0   root    _   _
6   .   .   PUNCT   .   _   5   punct   _   _

Notes

N/A

Testing Instructions