AnFreTh / STREAM

A versatile Python package engineered for seamless topic modeling, topic evaluation, and topic visualization. Ideal for text analysis, natural language processing (NLP), and research in the social sciences, STREAM simplifies the extraction, interpretation, and visualization of topics from large, complex datasets.
https://stream-topic.readthedocs.io
MIT License
33 stars 7 forks source link

Add nmf #33

Closed xillig closed 3 months ago

xillig commented 4 months ago

added NMFTM, including theta and beta as a model attribute as requested

AnFreTh commented 4 months ago

from stream.models import NMFTM

from tests.model_validation import validate_model validate_model(NMFTM, tfidf_args={"min_df": 1, "max_df": 0.5})

Returns:

AssertionError: The sum of 'theta' values along axis 1 must be all ones.

Should this not be the case for NMF?

Could you also add the logger infos for NMF similar to e.g. LDA or KmeansTM?

AnFreTh commented 3 months ago

added NMF with #87