kaegi / MorphMan

Anki plugin that reorders language cards based on the words you know
Other
257 stars 66 forks source link

Trying to use wareya freq list in morphman readibility analyser #259

Open elwendyr opened 2 years ago

elwendyr commented 2 years ago

freqlist.txt

Hello, i'm trying to use morphman readibility analyser with this freq list as a master frequency list, but it doesnt work (other seemlingly specially formated lists do). Anyone know what i should do to make it work?

ghost commented 2 years ago

Hey, seems to be because a working frequency list needs each item to be in a column of its own. Looks like you'll need to find a way to format each part of your frequency list in such a way that it resembles my screenshot. If you happen to need good frequency lists I've got some I found which are excellent.They're each tailored to a specific interest in learning and I'd suggest choosing just one if you're more inclined towards something specific. For example the 5109 novels one or the SoL (Slice of life) one. :) image

ghost commented 2 years ago

Top 3K taken from all japanese netflix subtitles: Morphman netflix_unidic_3011_no_names_word_freq_report.txt

Shonen: shounen_instance_freq_report.txt

Slice of life: Morphman SoL instance_freq_report.txt

Novels (also Light novels): Morphman JapFreqList_5109_Novels.txt

elwendyr commented 2 years ago

Thanks a lot! i'll try to do that. Thanks for the freq lists (also do you have a visual novel frequency list?)

ghost commented 2 years ago

Hey! No problem! Hope I've helped. If visual novel is what you're interested in I'd suggest using the Novels one. 5k worth of novels will cover every light novel vocabulary you could possibly imagine :)

elwendyr commented 2 years ago

Sadly enough the novel freq list doesn't seem to work (the others do though).

ghost commented 2 years ago

Any error? Working here

elwendyr commented 2 years ago

It's just that when setting up a minimum word, it (that's what it does with lists that do not work). At first i thought there was a gap between the first 10 words and the rest so i deleted them but it still did not work. The difference between that one is the other is that it only has two columns, one for the word and one for the word total number of occurence wheread the others have many more columns (for the word type for exemple). It's not really that much of a problem if it doesn't work though.

ghost commented 2 years ago

Really strange...for some reason I don't get this problem. Morphman has unfortunately become kind of unstable and unusable in many cases. I for example can't even make it work properly in 2.1.46 any more. Hoping that someone revives this addon some day :)

ianki commented 2 years ago

The 'frequency.txt' in the first post is in the format of a "word_freq_report", and can't be directly used as a 'frequency.txt' file.

There's a way to generate a 'frequency.txt' from this in the ReadabilityAnalyzer.

Set the list as the "Master Frequency List", check the "Set Frequency List" checkbox only, then run "Analyze!" image

That should set your frequency.txt to the right thing. If you want to limit the frequency of words included, you can also increase the "Minimum Frequency" setting.

lolzorzbbq commented 2 years ago

Hey i made a visual novel frequency list using wareya data ive used it. here