AnFreTh / STREAM

A versatile Python package engineered for seamless topic modeling, topic evaluation, and topic visualization. Ideal for text analysis, natural language processing (NLP), and research in the social sciences, STREAM simplifies the extraction, interpretation, and visualization of topics from large, complex datasets.
https://stream-topic.readthedocs.io
MIT License
33 stars 7 forks source link

Hongyi: modify the codes to adapt to Chinese datasets #100

Open williamlhy opened 5 days ago

williamlhy commented 5 days ago
  1. Add codes to load embedding model from local path
  2. Modify the code of the data preprocessing process to adapt to Chinese datasets
  3. Modify some of the codes in metrics and models to adapt to the Chinese dataset and Chinese stop word.