mortii / anki-morphs

A MorphMan fork rebuilt from the ground up with a focus on simplicity, performance, and a codebase with minimal technical debt.
https://mortii.github.io/anki-morphs/
Mozilla Public License 2.0
52 stars 7 forks source link

Redability report generator feautures #113

Closed Vilhelm-Ian closed 8 months ago

Vilhelm-Ian commented 8 months ago

This is a list of things I want to add to the redability report generator. I would like to eventually to add them, and not bother you mortii to do the hard work.

  1. Calculate the percentage based on the number of occurences of a word. Like in the following sentence with ten words. "I am too shy to speak to that many people." If I only knew the word "to" the percentage stat would be 20% since to appears twice and there are 10 words.

  2. Compare only the base form of words. In the following sentence "She borrowed my favorite book but returned two different books. " If I only knew the word book I would get 20% since book and books appeear and there are 10 words.

  3. In the absolute numbers ability to click on unknown morphs and to get a list of unknown morphs.

  4. (Optional) Ability to add a percentage cut-off. Like if I wanted to get a 95% comprehension it would generate a different amount of absolute unknown morphs. So when you click you wil get which words you need to know to reach that level.

mortii commented 8 months ago

Cool!

  1. Calculate the percentage based on the number of occurences of a word. Like in the following sentence with ten words. "I am too shy to speak to that many people." If I only knew the word "to" the percentage stat would be 20% since to appears twice and there are 10 words.

I thought it already did that, but I never checked. That would be a good improvement.

  1. Compare only the base form of words. In the following sentence "She borrowed my favorite book but returned two different books. " If I only knew the word book I would get 20% since book and books appeear and there are 10 words.

I'm not sure if that would be a good idea. If you have a sentence like:

She was gone last week, he went shortly after, and I go tomorrow.

If I only know 'go', but not 'went', nor 'gone', then the stats would be misleading....

  1. In the absolute numbers ability to click on unknown morphs and to get a list of unknown morphs.

Great idea. How would the list be displayed? Like the 'view morphemes' option in the browser?

  1. (Optional) Ability to add a percentage cut-off. Like if I wanted to get a 95% comprehension it would generate a different amount of absolute unknown morphs. So when you click you wil get which words you need to know to reach that level.

I don't know how I feel about that. It was the least understandable option in the morphman readability analyzer. It might cross over into the frequency file generator territory.

mortii commented 8 months ago
  1. (Optional) Ability to add a percentage cut-off. Like if I wanted to get a 95% comprehension it would generate a different amount of absolute unknown morphs. So when you click you wil get which words you need to know to reach that level.

I don't know how I feel about that. It was the least understandable option in the morphman readability analyzer. It might cross over into the frequency file generator territory.

I think I have a semi-genius idea: If you click on the list of unknown morphs, the list is ordered by frequency, and in the margin it shows the percentile of how much you much you would know if you reach that morph? Does that make sense?