cedricrupb / code_tokenize

Fast tokenization and structural analysis of any programming language
MIT License

code_tokenize v0.2.0: Major API redesign #3

Closed cedricrupb closed 2 years ago

cedricrupb commented 2 years ago

CHANGE

Previously, code_tokenize used custom configurable tokenizers to parse code. However, that implementation was limited by design and required workarounds for language-dependent changes.

The new API introduces the visitor pattern for tokenization. Depending on the language being parsed, we can employ custom visitors to walk the AST.
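To illustrate the idea, here is a minimal sketch of visitor-based tokenization over a toy tree. It is not the actual code_tokenize implementation: `Node`, `TokenVisitor`, `NewlineAwareVisitor`, and the `#NEWLINE#` marker are hypothetical names made up for this example, and the tree is built by hand rather than produced by a real parser.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Node:
    """Hypothetical AST node: a type name, optional source text, and children."""
    type: str
    text: str = ""
    children: List["Node"] = field(default_factory=list)


class TokenVisitor:
    """Language-agnostic visitor: collects the text of leaf nodes in source order."""

    def __init__(self):
        self.tokens = []

    def visit(self, node):
        # Dispatch to visit_<node type> if a subclass defines it,
        # otherwise fall back to the generic traversal.
        handler = getattr(self, "visit_" + node.type, self.generic_visit)
        handler(node)
        return self.tokens

    def generic_visit(self, node):
        if not node.children:
            self.tokens.append(node.text)  # leaf node: emit its text as a token
        for child in node.children:
            self.visit(child)


class NewlineAwareVisitor(TokenVisitor):
    """Language-specific visitor: adds a marker token after every statement."""

    def visit_statement(self, node):
        self.generic_visit(node)
        self.tokens.append("#NEWLINE#")


# Hand-built tree for the source line `x = 1`
tree = Node("module", children=[
    Node("statement", children=[
        Node("identifier", "x"),
        Node("operator", "="),
        Node("number", "1"),
    ]),
])

generic_tokens = TokenVisitor().visit(tree)
language_tokens = NewlineAwareVisitor().visit(tree)
print(generic_tokens)
print(language_tokens)
```

The language-dependent behavior lives entirely in the subclass, so supporting another language means adding another visitor rather than reconfiguring a shared tokenizer.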

In addition, AST parsing is now outsourced to a custom library specialized for AST handling.