A console app, and future hosted service, for the generation of French language quizzing problems based on language features. Also, my first time writing in Python beyond simple tooling scripts. It will back a mobile app I've already made if it ends up working out.
MIT License
0
stars
0
forks
source link
🇫🇷 Large scale verb with frequencies ingestion feature #7
Right now only a subset of interesting verbs (auxiliaries, common irregulars, and a selection of pronominal verbs with non-pronominal usages) are hard coded for ingestion. Given a dump of all French words and their frequencies (we have been provided frequencies in books and movies), it should be possible to create a weighted list of infinitives in order to make the question generation more useful for the user.
[x] Extract verb infinitive list and frequencies
[ ] Update verb schema to include their usage frequencies
[ ] Tune a combination of those frequencies to something ideal for the user (tbd how)
[ ] Incorporate these into the verb ingestion process
It should be noted that this is not a blocker for sentence generation. We can continue to use the hardcoded lists for that. Every verb on that list is vitally important so frequency of selection is not at all a loss.
Right now only a subset of interesting verbs (auxiliaries, common irregulars, and a selection of pronominal verbs with non-pronominal usages) are hard coded for ingestion. Given a dump of all French words and their frequencies (we have been provided frequencies in books and movies), it should be possible to create a weighted list of infinitives in order to make the question generation more useful for the user.
It should be noted that this is not a blocker for sentence generation. We can continue to use the hardcoded lists for that. Every verb on that list is vitally important so frequency of selection is not at all a loss.