StephanAkkerman / FluentAI

Automating language learning with the power of Artificial Intelligence. This repository presents FluentAI, a tool that combines Fluent Forever techniques with AI-driven automation. It streamlines the process of creating Anki flashcards, making language acquisition faster and more efficient.
https://akkerman.ai/FluentAI/
MIT License
9 stars 1 forks source link

Filter non-sense words from frequency word list #96

Open StephanAkkerman opened 4 days ago

StephanAkkerman commented 4 days ago
  1. Description:

    • Problem: There are some words that are spelling errors or are incorrect in the https://huggingface.co/datasets/StephanAkkerman/frequency-words-2018 dataset

    • Solution: Find a dictionary and remove all words that are not in there or give a warning to the user that the word is not commonly found in a dictionary and that they should check it.

    • Prerequisites: [List any requirements or dependencies needed before starting.]

  2. Tasks:

    • Look for dictionaries for languages listed in the freq words 2018 dataset
    • Check the word against the dict
  3. Additional context Add any other context or screenshots about the feature request here.