from Korpora import NSMC
nsmc = NSMC(root_dir='./Korpora/')
print(str(nsmc))
NSMC
Reference: https://github.com/e9t/nsmc
Naver sentiment movie corpus v1.0
This is a movie review dataset in the Korean language.
Reviews were scraped from Naver Movies.
The dataset construction is based on the method noted in
[Large movie review dataset][^1] from Maas et al., 2011.
[^1]: http://ai.stanford.edu/~amaas/data/sentiment/
Attributes
NSMC.train : size=150000
NSMC.test : size=50000
print(str(nsmc.train))
NSMCData
Naver sentiment movie corpus v1.0. size of data=150000
Attributes:
NSMCData.texts (list[str]) : size=150000
NSMCData.labels (list[int]) : size=150000
usage 입니다.