marcoagpinto / aoo-mozilla-en-dict

English Dictionaries Project (AOO+Mozilla+others)
159 stars 24 forks source link

AFF errors #33

Closed Ding-adong closed 5 years ago

Ding-adong commented 5 years ago

Suffix O is causing duplicate ial and al in some words. Only one or the other is needed. Also O adds non- because O is a prefix too. This is bad. Made corrections below, all is working.

SFX O Y 13 SFX O 0 l a SFX O 0 al [dh] SFX O e al [^bcgv]e SFX O 0 ial eon SFX O 0 ial r SFX O um al um SFX O 0 al [^u]m SFX O y al ty SFX O y ial [^t]y SFX O 0 ual [px]t SFX O 0 tal [iu]t SFX O 0 al [^ipux]t SFX O 0 al io[^aebcrtxmy]

marcoagpinto commented 5 years ago

Yes, I know that flag "O" is both a suffix and a prefix, it is in the .AFF original instructions.

It is okay with it.

Does your post mean that you made a fix to the flag "O" for suffixes?

:-)

Ding-adong commented 5 years ago

A couple of fixes. Compare with yours and you can see. Surely prefix O can have his own char like: PFX ^ Y 1 PFX ^ 0 non- .

Ding-adong commented 5 years ago

Another fix, new line and minor correction.

SFX Z Y 23 SFX Z ic y dic - new line enables w+dy SFX Z e y [^aeiouy]e SFX Z 0 y [aeiouy]e SFX Z 0 ey [aiouy] SFX Z 0 by [^aeio][aeiou]b SFX Z 0 ky [^adeio][aeiou]c add d SFX Z 0 dy [^aeio][aeiou]d SFX Z 0 fy [^aeio][aeiou]f SFX Z 0 gy [^aeio][aeiou]g SFX Z 0 ky [^aeio][aeiou]k SFX Z 0 ly [^aeio][aeiou]l SFX Z 0 my [^aeio][aeiou]m SFX Z 0 ny [^aeio][aiou]n SFX Z 0 py [^aeio][aeiou]p SFX Z 0 ry [^aeio][aiou]r SFX Z 0 sy [^aeio][aeiou]s SFX Z 0 ty [^aeio][aiou]t SFX Z 0 vy [^aeio][aeiou]v SFX Z 0 zy [^aeio][aeiou]z SFX Z 0 y [^aeio]e[nrt] SFX Z 0 y [aeio][aeiou][bcdfgklmnprstvz] SFX Z 0 y [^aeiou][bcdfgklmnprstvz] SFX Z 0 y [^aebcdfgiklmnoprstuvyz]

marcoagpinto commented 5 years ago

A couple of fixes. Compare with yours and you can see. Surely prefix O can have his own char like: PFX ^ Y 1 PFX ^ 0 non- .

I have just implemented it.

I did a DIFF to add ^ to all the "non-" words that had an "O" flag.

Soon I will implement the remaining.

Thank you!

Ding-adong commented 5 years ago

Don't have to use ^ . A small change like this: PFX f Y 2 PFX f 0 under [^r] PFX f 0 under- r to PFX ^ Y 2 PFX ^ 0 under [^r] PFX ^ 0 under- r

^ looks like under and maybe easier to remember. Put f for non-.

PFX f Y 1 PFX f 0 non- .

Your choice.

Now, what is REP for?

marcoagpinto commented 5 years ago

Don't have to use ^ . A small change like this: PFX f Y 2 PFX f 0 under [^r] PFX f 0 under- r to PFX ^ Y 2 PFX ^ 0 under [^r] PFX ^ 0 under- r

^ looks like under and maybe easier to remember. Put f for non-.

PFX f Y 1 PFX f 0 non- .

Your choice.

Now, what is REP for?

I have changed "non-" to "^".

I can't remember what the REP is for :-(

I am brain toasted.

marcoagpinto commented 5 years ago

@Ding-adong Minutes ago I released PTG build 140.

Force a page refresh if the site still shows build 139.

Basically, I have implemented partially the option that detects duplicates on the .dic, but it still doesn't have options to merge the flags.

There are 64 duplicates in the .dic but I will only fix them when I have the feature 100% ready to "test it on the field".

Ding-adong commented 5 years ago

Only 4 words at the moment with previous O for non- needs ^ for non- biocide/SO^ cuboid/SO^ fiction/MSOo^ fiction/MSOo^

The rest do not.

What is your plan. Continue typing out non- or use ^ for non- A word of warning, there are 8406 words with non-

marcoagpinto commented 5 years ago

Another fix, new line and minor correction.

SFX Z Y 23 SFX Z ic y dic - new line enables w+dy SFX Z e y [^aeiouy]e SFX Z 0 y [aeiouy]e SFX Z 0 ey [aiouy] SFX Z 0 by [^aeio][aeiou]b SFX Z 0 ky [^adeio][aeiou]c add d SFX Z 0 dy [^aeio][aeiou]d SFX Z 0 fy [^aeio][aeiou]f SFX Z 0 gy [^aeio][aeiou]g SFX Z 0 ky [^aeio][aeiou]k SFX Z 0 ly [^aeio][aeiou]l SFX Z 0 my [^aeio][aeiou]m SFX Z 0 ny [^aeio][aiou]n SFX Z 0 py [^aeio][aeiou]p SFX Z 0 ry [^aeio][aiou]r SFX Z 0 sy [^aeio][aeiou]s SFX Z 0 ty [^aeio][aiou]t SFX Z 0 vy [^aeio][aeiou]v SFX Z 0 zy [^aeio][aeiou]z SFX Z 0 y [^aeio]e[nrt] SFX Z 0 y [aeio][aeiou][bcdfgklmnprstvz] SFX Z 0 y [^aeiou][bcdfgklmnprstvz] SFX Z 0 y [^aebcdfgiklmnoprstuvyz]

Done!

Thank you!

marcoagpinto commented 5 years ago

Only 4 words at the moment with previous O for non- needs ^ for non- biocide/SO^ cuboid/SO^ fiction/MSOo^ fiction/MSOo^

The rest do not.

What is your plan. Continue typing out non- or use ^ for non- A word of warning, there are 8406 words with non-

The idea is to type "non-" because I hate prefixes, as it makes harder to find the words on the .dic .

For example, when I want to add derivates, if partly are prefixes, sometimes I can't find them or it takes a lot of time and effort.

Ding-adong commented 5 years ago

Are you kidding me. It isn't hard at all. You want to look for non-fiction, go to fiction - problem solved. Prefixes like rebut using but then prefix re is hard. non- is easy. typing out 000s of lines isn't fun.

Ding-adong commented 5 years ago

Add below to the AFF file making it 24. SFX Z us sity ous then SFX Z 0 y [aeio][aeiou][bcdfgklmnprtvz] remove s.

Ding-adong commented 5 years ago

Add below to the AFF file making it 14.

SFX O Y 14
SFX O 0 l a 
SFX O 0 al [dhn] 
SFX O e inal itude 
SFX O e al [^i][acnt][eiu][^bcgv]e 
SFX O 0 ial eon 
SFX O 0 ial r 
SFX O um al um 
SFX O 0 al [^u]m 
SFX O y al ty 
SFX O y ial [^t]y 
SFX O 0 ual [px]t 
SFX O 0 tal [iu]t 
SFX O 0 al [^ipux]t 
SFX O 0 al io[^aebcrtxmy]
marcoagpinto commented 5 years ago

Add below to the AFF file making it 14.

SFX O Y 14
SFX O 0 l a 
SFX O 0 al [dhn] 
SFX O e inal itude 
SFX O e al [^i][acnt][eiu][^bcgv]e 
SFX O 0 ial eon 
SFX O 0 ial r 
SFX O um al um 
SFX O 0 al [^u]m 
SFX O y al ty 
SFX O y ial [^t]y 
SFX O 0 ual [px]t 
SFX O 0 tal [iu]t 
SFX O 0 al [^ipux]t 
SFX O 0 al io[^aebcrtxmy]

Hello!

Your change causes dozens of duplicates.

Check the word "version" and others where it duplicates derivates.

Ding-adong commented 5 years ago

There are some duplicates due to past errors. Confusing as hell. Look at the word class etc and it's a mess. Will look at it again.

Ding-adong commented 5 years ago

Sorted.

SFX O Y 11 SFX O 0 l a SFX O 0 al [ix]on SFX O 0 al ct SFX O 0 ial eon SFX O 0 al [cdhm] SFX O 0 ial r SFX O 0 tal ut SFX O e inal ude SFX O e al [mrs]e SFX O e al ide SFX O y ial y

Ding-adong commented 5 years ago

Your Proofing tool GUI, can you change the font so that I is I and not l; i is i and not l. It would help to prevent mistakes.

marcoagpinto commented 5 years ago

Your Proofing tool GUI, can you change the font so that I is I and not l; i is i and not l. It would help to prevent mistakes.

What do you mean?: ptg30_i_not_i_20190222

Ding-adong commented 5 years ago

version/ODGMIFAS The I looks like l in PT.

marcoagpinto commented 5 years ago

version/ODGMIFAS The I looks like l in PT.

How?! Why?! When?!

This is how it looks like in my Windows 10: i_not_i_ptg_v2_20190222

Ding-adong commented 5 years ago

Yes it is l not I. It depends on the font you use. Simply change it.

marcoagpinto commented 5 years ago

Yes it is l not I. It depends on the font you use. Simply change it.

Yes, if you can suggest a font compatible with Windows, Linux and Mac (that comes built-in in those three OSes).

Ding-adong commented 5 years ago

Verdana, don't know about linux or mac. https://www.luckymanpress.com/teach/pages/Links/Guide2Fonts.html

marcoagpinto commented 5 years ago

Verdana, don't know about linux or mac. https://www.luckymanpress.com/teach/pages/Links/Guide2Fonts.html

Verdana is a Windows (Microsoft?) font.

Next week I will try to find out the better option.

Ding-adong commented 5 years ago

2 new lines adding plural

SFX W Y 23
SFX W ce tific ce 
SFX W e atic me 
SFX W se tic se 
SFX W le ic ble 
SFX W e ic [^b]le 
SFX W e ic [^clms]e 
SFX W 0 lic [ay]l 
SFX W 0 ic [^ay]l 
SFX W us ic us 
SFX W 0 tic [^u]s 
SFX W er ric er 
SFX W 0 ic [^e]r 
SFX W 0 atic [aeiou]m 
SFX W 0 ic [^aeiou]m 
SFX W 0 tic ma 
SFX W a ic [^m]a 
SFX W y etic thy 
SFX W y ic [^t]hy 
SFX W y tic sy 
SFX W y ic [^hs]y 
SFX W 0 ic [^aelmrsy] 
SFX W y ics [^hs]y 
SFX W 0 ics [^aelmrsy] 
marcoagpinto commented 5 years ago

On Monday I will take a look.

Thanks!

Ding-adong commented 5 years ago

Error.

SFX z 0 ily [^aeiou][bcdfgklmnprstvyz] remove y. No such suffix as yily

marcoagpinto commented 5 years ago

SFX O Y 11 SFX O 0 l a SFX O 0 al [ix]on SFX O 0 al ct SFX O 0 ial eon SFX O 0 al [cdhm] SFX O 0 ial r SFX O 0 tal ut SFX O e inal ude SFX O e al [mrs]e SFX O e al ide SFX O y ial y

done!

marcoagpinto commented 5 years ago

SFX W Y 23 SFX W ce tific ce SFX W e atic me SFX W se tic se SFX W le ic ble SFX W e ic [^b]le SFX W e ic [^clms]e SFX W 0 lic [ay]l SFX W 0 ic [^ay]l SFX W us ic us SFX W 0 tic [^u]s SFX W er ric er SFX W 0 ic [^e]r SFX W 0 atic [aeiou]m SFX W 0 ic [^aeiou]m SFX W 0 tic ma SFX W a ic [^m]a SFX W y etic thy SFX W y ic [^t]hy SFX W y tic sy SFX W y ic [^hs]y SFX W 0 ic [^aelmrsy] SFX W y ics [^hs]y SFX W 0 ics [^aelmrsy]

Hello!

It adds 300+ words to the list, plurals, most of which I can't find on dictionaries: acidics acidimetrics acromegalics aerometrics aetiologics alchemics allegorics etc. Can you confirm if they are valid?

Thank you!

Error.

SFX z 0 ily [^aeiou][bcdfgklmnprstvyz] remove y. No such suffix as yily

Done!

marcoagpinto commented 5 years ago

Here is the complete list of new words: --- C:/Users/marcoagpinto/Desktop/0000_BACKPEN128GB_000/0_spellers_wordlist_bugs/1-before.txt seg fev 25 10:45:50 2019 +++ C:/Users/marcoagpinto/Desktop/0000_BACKPEN128GB_000/0_spellers_wordlist_bugs/2-after.txt seg fev 25 11:33:03 2019 @@ -1559,0 +1560 @@ acidic +acidics @@ -1573,0 +1575 @@ acidimetric +acidimetrics @@ -1754,0 +1757 @@ acrobatic +acrobatics @@ -1769,0 +1773 @@ acromegalic +acromegalics @@ -3029,0 +3034 @@ aerometric +aerometrics @@ -3102,0 +3108 @@ aetiologic +aetiologics @@ -3879,0 +3886 @@ agronomic +agronomics @@ -4492,0 +4500 @@ alchemic +alchemics @@ -5099,0 +5108 @@ allegoric +allegorics @@ -5139,0 +5149 @@ allergenic +allergenics @@ -5144,0 +5155 @@ allergic +allergics @@ -5287,0 +5299 @@ allometric +allometrics @@ -5304,0 +5317 @@ alloplastic +alloplastics @@ -5530,0 +5544 @@ alphabetic +alphabetics @@ -5703,0 +5718 @@ altimetric +altimetrics @@ -5739,0 +5755 @@ altruistic +altruistics @@ -6408,0 +6425 @@ amphibrachic +amphibrachics @@ -6612,0 +6630 @@ anaglyphic +anaglyphics @@ -6707,0 +6726 @@ anapaestic +anapaestics @@ -6726,0 +6746 @@ anarchistic +anarchistics @@ -6780,0 +6801 @@ anatomic +anatomics @@ -6821,0 +6843 @@ anchoretic +anchoretics @@ -6931,0 +6954 @@ androgenic +androgenics @@ -7303,0 +7327 @@ animistic +animistics @@ -7314,0 +7339 @@ anionic +anionics @@ -7333,0 +7359 @@ anisotropic +anisotropics @@ -7704,0 +7731 @@ antagonistic +antagonistics @@ -7863,0 +7891 @@ anthropometric +anthropometrics @@ -8068,0 +8097 @@ antigenic +antigenics @@ -8254,0 +8284 @@ antisymmetric +antisymmetrics @@ -9512,0 +9543 @@ archaeometric +archaeometrics @@ -9932,0 +9964 @@ aristocratic +aristocratics @@ -10583,0 +10616 @@ artistic +artistics @@ -10977,0 +11011 @@ aspic +aspics @@ -11583,0 +11618 @@ astrometric +astrometrics @@ -11585,0 +11621 @@ astronautic +astronautics @@ -11599,0 +11636 @@ astronomic +astronomics @@ -11651,0 +11689 @@ asymmetric +asymmetrics @@ -11701,0 +11740 @@ atavistic +atavistics @@ -11741,0 +11781 @@ atheistic +atheistics @@ -11899,0 +11940 @@ atopic +atopics @@ -12587,0 +12629 @@ autarkic +autarkics @@ -12701,0 +12744 @@ autoallergic +autoallergics @@ -12778,0 +12822 @@ autocratic +autocratics @@ -12801,0 +12846 @@ autodidactic +autodidactics @@ -12834,0 +12880 @@ autographic +autographics @@ -12929,0 +12976 @@ autonomic +autonomics @@ -12963,0 +13011 @@ autoradiographic +autoradiographics @@ -13085,0 +13134 @@ auxotrophic +auxotrophics @@ -14179,0 +14229 @@ bacteriologic +bacteriologics @@ -14774,0 +14825 @@ balletic +balletics @@ -15534,0 +15586 @@ bardic +bardics @@ -16122,0 +16175 @@ basaltic +basaltics @@ -17037,0 +17091 @@ beatific +beatifics @@ -17121,0 +17176 @@ beautific +beautifics @@ -17728,0 +17784 @@ behaviouristic +behaviouristics @@ -17988,0 +18045 @@ belletristic +belletristics @@ -19878,0 +19936 @@ bioclastic +bioclastics @@ -19997,0 +20056 @@ biographic +biographics @@ -20029,0 +20089 @@ biologic +biologics @@ -20075,0 +20136 @@ biometric +biometrics @@ -22715,0 +22777 @@ Bolshevistic +Bolshevistics @@ -22785,0 +22848 @@ bombastic +bombastics @@ -23513,0 +23577 @@ botanic +botanics @@ -27475,0 +27540 @@ bureaucratic +bureaucratics @@ -28352,0 +28418 @@ Byronic +Byronics @@ -28947,0 +29014 @@ calcific +calcifics @@ -29348,0 +29416 @@ Calvinistic +Calvinistics @@ -29963,0 +30032 @@ canonic +canonics @@ -30300,0 +30370 @@ capitalistic +capitalistics @@ -30654,0 +30725 @@ carbonic +carbonics @@ -30727,0 +30799 @@ carcinogenic +carcinogenics @@ -31864,0 +31937 @@ casuistic +casuistics @@ -32099,0 +32173 @@ categoric +categorics @@ -32231,0 +32306 @@ cationic +cationics @@ -32807,0 +32883 @@ Celtic +Celtics @@ -33099,0 +33176 @@ cephalometric +cephalometrics @@ -33486,0 +33564 @@ chalcedonic +chalcedonics @@ -34374,0 +34453 @@ chauvinistic +chauvinistics @@ -34860,0 +34940 @@ cherubic +cherubics @@ -35610,0 +35691 @@ chivalric +chivalrics @@ -35900,0 +35982 @@ choreographic +choreographics @@ -36092,0 +36175 @@ chromatographic +chromatographics @@ -36142,0 +36226 @@ chronographic +chronographics @@ -38863,0 +38948 @@ cobaltic +cobaltics @@ -40068,0 +40154 @@ colonic +colonics @@ -40513,0 +40600 @@ comedic +comedics @@ -40954,0 +41042 @@ communalistic +communalistics @@ -41027,0 +41116 @@ communistic +communistics @@ -41653,0 +41743 @@ conic +conics @@ -46674,0 +46765 @@ craniometric +craniometrics @@ -46810,0 +46902 @@ cratonic +cratonics @@ -48427,0 +48520 @@ cryptic +cryptics @@ -48469,0 +48563 @@ cryptologic +cryptologics @@ -48580,0 +48675 @@ cubic +cubics @@ -49591,0 +49687 @@ cyanic +cyanics @@ -49944,0 +50041 @@ cystic +cystics @@ -49988,0 +50086 @@ cytophotometric +cytophotometrics @@ -50132,0 +50231 @@ Dadaistic +Dadaistics @@ -50148,0 +50248 @@ daemonic +daemonics @@ -52632,0 +52733 @@ deistic +deistics @@ -52964,0 +53066 @@ demagogic +demagogics @@ -53112,0 +53215 @@ democratic +democratics @@ -53146,0 +53250 @@ demonic +demonics @@ -54345,0 +54450 @@ despotic +despotics @@ -54701,0 +54807 @@ Deuteronomic +Deuteronomics @@ -55128,0 +55235 @@ dialectic +dialectics @@ -57252,0 +57360 @@ dissymmetric +dissymmetrics @@ -61089,0 +61198 @@ dyadic +dyadics @@ -61166,0 +61276 @@ dynastic +dynastics @@ -61194,0 +61305 @@ dysenteric +dysenterics @@ -61928,0 +62040 @@ echoic +echoics @@ -61938,0 +62051 @@ re-echoic +re-echoics @@ -61951,0 +62065 @@ echocardiographic +echocardiographics @@ -62073,0 +62188 @@ economic +economics @@ -62205,0 +62321 @@ Edenic +Edenics @@ -62859,0 +62976 @@ egoistic +egoistics @@ -62880,0 +62998 @@ egotistic +egotistics @@ -63061,0 +63180 @@ eisteddfodic +eisteddfodics @@ -63363,0 +63483 @@ electroencephalographic +electroencephalographics @@ -63390,0 +63511 @@ electromagnetic +electromagnetics @@ -63414,0 +63536 @@ electronic +electronics @@ -64354,0 +64477 @@ embryologic +embryologics @@ -65185,0 +65309 @@ endoblastic +endoblastics @@ -65971,0 +66096 @@ enthusiastic +enthusiastics @@ -66109,0 +66235 @@ entropic +entropics @@ -66267,0 +66394 @@ eolithic +eolithics @@ -66363,0 +66491 @@ epidemiologic +epidemiologics @@ -66517,0 +66646 @@ epithetic +epithetics @@ -67672,0 +67802 @@ ethnologic +ethnologics @@ -67785,0 +67916 @@ Eucharistic +Eucharistics @@ -67839,0 +67971 @@ eulogistic +eulogistics @@ -67872,0 +68005 @@ euphemistic +euphemistics @@ -68145,0 +68279 @@ evangelistic +evangelistics @@ -69194,0 +69329 @@ existentialistic +existentialistics @@ -69812,0 +69948 @@ expressionistic +expressionistics @@ -71507,0 +71644 @@ fascistic +fascistics @@ -71613,0 +71751 @@ fatalistic +fatalistics @@ -72611,0 +72750 @@ ferromagnetic +ferromagnetics @@ -72631,0 +72771 @@ ferric +ferrics @@ -72766,0 +72907 @@ fetishistic +fetishistics @@ -76923,0 +77065 @@ formalistic +formalistics @@ -79677,0 +79820 @@ futuristic +futuristics @@ -80352,0 +80496 @@ ganglionic +ganglionics @@ -80846,0 +80991 @@ gastronomic +gastronomics @@ -81774,0 +81920 @@ geologic +geologics @@ -81815,0 +81962 @@ geonomic +geonomics @@ -81937,0 +82085 @@ Germanic +Germanics @@ -83640,0 +83789 @@ gnomonic +gnomonics @@ -85187,0 +85337 @@ graphic +graphics @@ -87423,0 +87574 @@ gymnastic +gymnastics @@ -87786,0 +87938 @@ haematologic +haematologics @@ -88344,0 +88497 @@ hallucinogenic +hallucinogenics @@ -88992,0 +89146 @@ haplobiontic +haplobiontics @@ -91004,0 +91159 @@ hedonistic +hedonistics @@ -91081,0 +91237 @@ hegemonic +hegemonics @@ -91231,0 +91388 @@ heliographic +heliographics @@ -91299,0 +91457 @@ Hellenistic +Hellenistics @@ -91734,0 +91893 @@ heraldic +heraldics @@ -91937,0 +92097 @@ heroic +heroics @@ -92156,0 +92317 @@ heteromorphic +heteromorphics @@ -92465,0 +92627 @@ hieroglyphic +hieroglyphics @@ -93086,0 +93249 @@ histologic +histologics @@ -93157,0 +93321 @@ historic +historics @@ -93162,0 +93327 @@ prehistoric +prehistorics @@ -93749,0 +93915 @@ holographic +holographics @@ -94032,0 +94199 @@ homoeopathic +homoeopathics @@ -94107,0 +94275 @@ homomorphic +homomorphics @@ -94697,0 +94866 @@ horologic +horologics @@ -94731,0 +94901 @@ horrific +horrifics @@ -95696,0 +95867 @@ humanistic +humanistics @@ -95872,0 +96044 @@ humoristic +humoristics @@ -96643,0 +96816 @@ hymnic +hymnics @@ -97156,0 +97330 @@ iambic +iambics @@ -97261,0 +97436 @@ Icelandic +Icelandics @@ -97331,0 +97507 @@ iconic +iconics @@ -97346,0 +97523 @@ iconoclastic +iconoclastics @@ -97429,0 +97607 @@ idealistic +idealistics @@ -97491,0 +97670 @@ ideographic +ideographics @@ -97535,0 +97715 @@ idiotic +idiotics @@ -98179,0 +98360 @@ immunologic +immunologics @@ -98366,0 +98548 @@ imperialistic +imperialistics @@ -98678,0 +98861 @@ impressionistic +impressionistics @@ -99544,0 +99728 @@ individualistic +individualistics @@ -102949,0 +103134 @@ iodometric +iodometrics @@ -102965,0 +103151 @@ ionic +ionics @@ -103194,0 +103381 @@ ironic +ironics @@ -103598,0 +103786 @@ isolationistic +isolationistics @@ -103621,0 +103810 @@ isomorphic +isomorphics @@ -103658,0 +103848 @@ isotropic +isotropics @@ -104889,0 +105080 @@ jingoistic +jingoistics @@ -105345,0 +105537 @@ journalistic +journalistics @@ -105836,0 +106029 @@ juristic +juristics @@ -106203,0 +106397 @@ kaolinic +kaolinics @@ -106305,0 +106500 @@ karstic +karstics @@ -109310,0 +109506 @@ kymographic +kymographics @@ -112628,0 +112825 @@ lethargic +lethargics @@ -113643,0 +113841 @@ limbic +limbics @@ -113980,0 +114179 @@ linguistic +linguistics @@ -114393,0 +114593 @@ literalistic +literalistics @@ -114445,0 +114646 @@ lithographic +lithographics @@ -116003,0 +116205 @@ logic +logics @@ -117904,0 +118107 @@ lymphoblastic +lymphoblastics @@ -118445,0 +118649 @@ macroplanktonic +macroplanktonics @@ -118758,0 +118963 @@ magnetic +magnetics @@ -119162,0 +119368 @@ majestic +majestics @@ -119689,0 +119896 @@ manic +manics @@ -121357,0 +121565 @@ masochistic +masochistics @@ -121367,0 +121576 @@ masonic +masonics @@ -121713,0 +121923 @@ materialistic +materialistics @@ -122867,0 +123078 @@ mechanistic +mechanistics @@ -123229,0 +123441 @@ megalithic +megalithics @@ -123356,0 +123569 @@ melancholic +melancholics @@ -123516,0 +123730 @@ melodic +melodics @@ -123994,0 +124209 @@ mercuric +mercurics @@ -124229,0 +124445 @@ mesenteric +mesenterics @@ -124277,0 +124494 @@ mesomorphic +mesomorphics @@ -124479,0 +124697 @@ metallurgic +metallurgics @@ -125097,0 +125316 @@ microcephalic +microcephalics @@ -125291,0 +125511 @@ microlithic +microlithics @@ -125401,0 +125622 @@ microphotographic +microphotographics @@ -126084,0 +126306 @@ militaristic +militaristics @@ -126371,0 +126594 @@ Miltonic +Miltonics @@ -126708,0 +126932 @@ minimalistic +minimalistics @@ -127353,0 +127578 @@ misogynistic +misogynistics @@ -127587,0 +127813 @@ mitogenic +mitogenics @@ -127963,0 +128190 @@ modernistic +modernistics @@ -128373,0 +128601 @@ monadic +monadics @@ -128387,0 +128616 @@ monarchic +monarchics @@ -128394,0 +128624 @@ monarchistic +monarchistics @@ -128701,0 +128932 @@ monocratic +monocratics @@ -128717,0 +128949 @@ monodic +monodics @@ -128784,0 +129017 @@ monolithic +monolithics @@ -128874,0 +129108 @@ monopolistic +monopolistics @@ -128898,0 +129133 @@ monosomic +monosomics @@ -128912,0 +129148 @@ monotheistic +monotheistics @@ -129341,0 +129578 @@ moralistic +moralistics @@ -129541,0 +129779 @@ moronic +moronics @@ -129583,0 +129822 @@ morphologic +morphologics @@ -129591,0 +129831 @@ morphometric +morphometrics @@ -131719,0 +131960 @@ mutualistic +mutualistics @@ -131892,0 +132134 @@ myomorphic +myomorphics @@ -131997,0 +132240 @@ mythic +mythics @@ -132519,0 +132763 @@ Napoleonic +Napoleonics @@ -132559,0 +132804 @@ narcissistic +narcissistics @@ -132812,0 +133058 @@ nationalistic +nationalistics @@ -132898,0 +133145 @@ naturalistic +naturalistics @@ -134052,0 +134300 @@ neurologic +neurologics @@ -134082,0 +134331 @@ neuropathic +neuropathics @@ -134955,0 +135205 @@ nihilistic +nihilistics @@ -135434,0 +135685 @@ nomadic +nomadics @@ -135733,0 +135985 @@ non-Germanic +non-Germanics @@ -136827,0 +137080 @@ novelistic +novelistics @@ -137850,0 +138104 @@ obscurantic +obscurantics @@ -138164,0 +138419 @@ oceanic +oceanics @@ -139046,0 +139302 @@ oligarchic +oligarchics @@ -139706,0 +139963 @@ ophthalmologic +ophthalmologics @@ -139731,0 +139989 @@ opisthographic +opisthographics @@ -139766,0 +140025 @@ opportunistic +opportunistics @@ -139830,0 +140090 @@ optic +optics @@ -139862,0 +140123 @@ optimistic +optimistics @@ -139914,0 +140176 @@ optometric +optometrics @@ -140202,0 +140465 @@ organic +organics @@ -140660,0 +140924 @@ orthoepic +orthoepics @@ -140910,0 +141175 @@ osteoblastic +osteoblastics @@ -140913,0 +141179 @@ osteoclastic +osteoclastics @@ -140925,0 +141192 @@ osteopathic +osteopathics @@ -141072,0 +141340 @@ otolithic +otolithics @@ -142432,0 +142701 @@ pacifistic +pacifistics @@ -142443,0 +142713 @@ pacific +pacifics @@ -142852,0 +143123 @@ palaeoeconomic +palaeoeconomics @@ -143227,0 +143499 @@ panic +panics @@ -143421,0 +143694 @@ pantheistic +pantheistics @@ -143712,0 +143986 @@ paradoxic +paradoxics @@ -143834,0 +144109 @@ paramagnetic +paramagnetics @@ -143884,0 +144160 @@ paramorphic +paramorphics @@ -144443,0 +144720 @@ parthenocarpic +parthenocarpics @@ -144944,0 +145222 @@ paternalistic +paternalistics @@ -144976,0 +145255 @@ pathogenic +pathogenics @@ -145088,0 +145368 @@ patriotic +patriotics @@ -145708,0 +145989 @@ pedagogic +pedagogics @@ -145719,0 +146001 @@ pedantic +pedantics @@ -146988,0 +147271 @@ periodic +periodics @@ -147641,0 +147925 @@ pessimistic +pessimistics @@ -148215,0 +148500 @@ philanthropic +philanthropics @@ -148221,0 +148507 @@ philatelic +philatelics @@ -148410,0 +148697 @@ phonographic +phonographics @@ -148508,0 +148796 @@ photoallergic +photoallergics @@ -148582,0 +148871 @@ photoelectronics +photoelectronics @@ -148621,0 +148911 @@ photogrammetric +photogrammetrics @@ -148630,0 +148921 @@ photographic +photographics @@ -148690,0 +148982 @@ photoplethysmographic +photoplethysmographics @@ -148763,0 +149056 @@ phototrophic +phototrophics @@ -148907,0 +149201 @@ physiologic +physiologics @@ -148966,0 +149261 @@ pianistic +pianistics @@ -149126,0 +149422 @@ pictographic +pictographics @@ -151358,0 +151655 @@ pluralistic +pluralistics @@ -151393,0 +151691 @@ plutocratic +plutocratics @@ -151587,0 +151886 @@ poetic +poetics @@ -152264,0 +152564 @@ polyphonic +polyphonics @@ -152300,0 +152601 @@ polysomic +polysomics @@ -152322,0 +152624 @@ polytheistic +polytheistics @@ -152835,0 +153138 @@ porphyroblastic +porphyroblastics @@ -152837,0 +153141 @@ porphyroclastic +porphyroclastics @@ -153357,0 +153662 @@ post-impressionistic +post-impressionistics @@ -153680,0 +153986 @@ potentiometric +potentiometrics @@ -155845,0 +156152 @@ probabilistic +probabilistics @@ -156707,0 +157015 @@ propagandistic +propagandistics @@ -156772,0 +157081 @@ prophetic +prophetics @@ -157002,0 +157312 @@ prosodic +prosodics @@ -157656,0 +157967 @@ pseudomorphic +pseudomorphics @@ -157752,0 +158064 @@ psychic +psychics @@ -157764,0 +158077 @@ psychiatric +psychiatrics @@ -157880,0 +158194 @@ psychometric +psychometrics @@ -157893,0 +158208 @@ psychopathic +psychopathics @@ -158009,0 +158325 @@ pubic +pubics @@ -158210,0 +158527 @@ pugilistic +pugilistics @@ -158690,0 +159008 @@ puristic +puristics @@ -159166,0 +159485 @@ pyrotechnic +pyrotechnics @@ -159199,0 +159519 @@ pythonic +pythonics @@ -159530,0 +159851 @@ quartic +quartics @@ -159928,0 +160250 @@ quietistic +quietistics @@ -160013,0 +160336 @@ quintic +quintics @@ -160548,0 +160872 @@ radiologic +radiologics @@ -160590,0 +160915 @@ radioscopic +radioscopics @@ -161629,0 +161955 @@ rationalistic +rationalistics @@ -162186,0 +162513 @@ realistic +realistics @@ -163258,0 +163586 @@ reductionistic +reductionistics @@ -164177,0 +164506 @@ relativistic +relativistics @@ -164333,0 +164663 @@ relic +relics @@ -166503,0 +166834 @@ rhapsodic +rhapsodics @@ -166589,0 +166921 @@ rhinoplastic +rhinoplastics @@ -167793,0 +168126 @@ robotic +robotics @@ -169812,0 +170146 @@ rustic +rustics @@ -170219,0 +170554 @@ sadistic +sadistics @@ -170228,0 +170564 @@ sadomasochistic +sadomasochistics @@ -171293,0 +171630 @@ Sanskritic +Sanskritics @@ -174446,0 +174784 @@ seismographic +seismographics @@ -175074,0 +175413 @@ sensationalistic +sensationalistics @@ -175318,0 +175658 @@ septic +septics @@ -175439,0 +175780 @@ seraphic +seraphics @@ -181599,0 +181941 @@ socialistic +socialistics @@ -181644,0 +181987 @@ socioecologic +socioecologics @@ -181676,0 +182020 @@ sociometric +sociometrics @@ -181679,0 +182024 @@ sociopathic +sociopathics @@ -181920,0 +182266 @@ solecistic +solecistics @@ -182049,0 +182396 @@ Solomonic +Solomonics @@ -182250,0 +182598 @@ sonic +sonics @@ -182319,0 +182668 @@ sonographic +sonographics @@ -182392,0 +182742 @@ sophistic +sophistics @@ -183291,0 +183642 @@ specialistic +specialistics @@ -183332,0 +183684 @@ specific +specifics @@ -183909,0 +184262 @@ spiritualistic +spiritualistics @@ -187024,0 +187378 @@ stoichiometric +stoichiometrics @@ -187667,0 +188022 @@ strategic +strategics @@ -188554,0 +188910 @@ stylistic +stylistics @@ -188570,0 +188927 @@ stylographic +stylographics @@ -188578,0 +188936 @@ stylometric +stylometrics @@ -190713,0 +191072 @@ supersymmetric +supersymmetrics @@ -191109,0 +191469 @@ surrealistic +surrealistics @@ -191994,0 +192355 @@ sycophantic +sycophantics @@ -192127,0 +192489 @@ symmetric +symmetrics @@ -192166,0 +192529 @@ symphonic +symphonics @@ -192173,0 +192537 @@ symplastic +symplastics @@ -192378,0 +192743 @@ synodic +synodics @@ -192738,0 +193104 @@ tactic +tactics @@ -193058,0 +193425 @@ talismanic +talismanics @@ -193132,0 +193500 @@ Talmudic +Talmudics @@ -193485,0 +193854 @@ Taoistic +Taoistics @@ -194058,0 +194428 @@ taxonomic +taxonomics @@ -194282,0 +194653 @@ technocratic +technocratics @@ -194539,0 +194911 @@ telegraphic +telegraphics @@ -195192,0 +195565 @@ teratogenic +teratogenics @@ -195372,0 +195746 @@ terrific +terrifics @@ -195420,0 +195795 @@ terroristic +terroristics @@ -195658,0 +196034 @@ Teutonic +Teutonics @@ -195925,0 +196302 @@ theistic +theistics @@ -196194,0 +196572 @@ thermostatic +thermostatics @@ -197922,0 +198301 @@ titanic +titanics @@ -198502,0 +198882 @@ tonic +tonics @@ -198589,0 +198970 @@ tonometric +tonometrics @@ -198712,0 +199094 @@ topic +topics @@ -199612,0 +199995 @@ traditionalistic +traditionalistics @@ -200728,0 +201112 @@ triadic +triadics @@ -200971,0 +201356 @@ trigonometric +trigonometrics @@ -201362,0 +201748 @@ trophoblastic +trophoblastics @@ -201995,0 +202382 @@ tunic +tunics @@ -202190,0 +202578 @@ Turkic +Turkics @@ -215810,0 +216199 @@ xenocrystic +xenocrystics @@ -215818,0 +216208 @@ xenolithic +xenolithics @@ -217286,0 +217677 @@ zwitterionic +zwitterionics

Ding-adong commented 5 years ago

If you are not bothered about adjectives with plural s but want plural noun, keep it. Otherwise simply remove the two line then you will have to add plural noun as another entry.

Ding-adong commented 5 years ago

https://www.systutorials.com/docs/linux/man/4-hunspell/#index

Explains the AFF codes including REP.

Ding-adong commented 5 years ago

Are you aware of suffix plus another suffix? SFX W 0 ic/S [^aelmrsy] using /S adds plural again. alphabet/SW alphabets alphabetic alphabetics

Also you can set the flag to use 2 chars for affixes. This would allow more options.

marcoagpinto commented 5 years ago

@Ding-adong

I have released V2.70 of the speller for Mozilla, AOO and LO.

I still haven't had the chance of adding the whole wordlists you provided, but I have added around 1000 words from one of them.

Notice that I am not adding all the words as I check word by word in the official British spellers, the reason why it takes longer.

Also, I haven't added most of the US ones but added some archaic ones provided they can't be confused with recent words.

I have given you some "credit" in the words, which means that if I can't find them on Oxford and Collins, I try on Wiktionary and add them if found in it. As a last resource I try to find them on Wikipedia, just because I "assume" that you are certain about them.

Thanks!

Ding-adong commented 5 years ago

I use Oxford then Cambridge and sometimes Merriam dictionaries. Not too keen on wiki as they tend to add affixes not recognised by the first 3. 's is not just a possessive noun but also contractions for is has and you won't find them in the dictionary.

Ding-adong commented 5 years ago

A problem. Why does B flag have able and ability?

Ding-adong commented 5 years ago

After 3/4 weeks the suffix ous is now completed. I need the latest dictionary and I will update it. You can then compare it. In the meantime do not add or make any changes to the dictionary otherwise comparison will be complicated. Is that ok with you?

marcoagpinto commented 5 years ago

After 3/4 weeks the suffix ous is now completed. I need the latest dictionary and I will update it. You can then compare it. In the meantime do not add or make any changes to the dictionary otherwise comparison will be complicated. Is that ok with you?

We can test with the latest release in GitHub.

Ding-adong commented 5 years ago

Yup. It will probably take 3/4 days. We need to discuss how to improve the AFF file. Ran out of chars to use as flags. ous used 8. Did you read https://www.systutorials.com/docs/linux/man/4-hunspell/#index ?

Ding-adong commented 5 years ago

I am halfway through. I used your great PT software and created a wordlist of the latest dic and a wordlist of my work so far. Also created a list of duplicate file. I went through the capitalised words only and removed all the duplicates - easy fix and few compare to the rest.

The dic should have common British names and places and some very common foreign names and places, such as Paris, New York and not the rest such as Labourd, Laboureur etc. All it does is expand the dic with irrelevant words. If users want to save odd foreign words, then they can save it into their personal dic provided by various software, such as SubtitleEdit, Open Office and Firefox, etc.

Ding-adong commented 5 years ago

Since noun can take possessive, would it be easier if: SFX n Y 14 SFX n 0 tion/S a becomes SFX n Y 14 SFX n 0 tion/MS a instead of repeatedly typing words again and again.

marcoagpinto commented 5 years ago

Since noun can take possessive, would it be easier if: SFX n Y 14 SFX n 0 tion/S a becomes SFX n Y 14 SFX n 0 tion/MS a instead of repeatedly typing words again and again.

Well, I am not sure if mass nouns can all have plural/possessive, so, in order not to risk, I prefer the more laborious task.

Ding-adong commented 5 years ago

OK, it was just a though. Now this is correct. Plural or names etc ending with s is s'. SFX M Y 2 SFX M 0 's [^s] SFX M 0 ' s

marcoagpinto commented 5 years ago

OK, it was just a though. Now this is correct. Plural or names etc ending with s is s'. SFX M Y 2 SFX M 0 's [^s] SFX M 0 ' s

@Ding-adong

The idea is good, but Hunspell already accepts the s' for English.

If you add an ' to any word ending with an "s", no typo will appear.

Your idea would create 1000s of new words in the wordlist, but they wouldn't do anything except increasing the wordlist.

Ding-adong commented 5 years ago

Which hunspell version and does it work only for s' ? In my experience spellchecker kept flagging up 's errors.

Then in that case any word ending with s should not have /M in it. That's 2678 extra false words.

marcoagpinto commented 5 years ago

Which hunspell version and does it work only for s' ? In my experience spellchecker kept flagging up 's errors.

Well, type: "Marcos'" in Thunderbird and LibreOffice and it won't be flagged as an error.

Nor it happened here in Firefox, but I am using the LanguageTool add-on for it that overrides the spellers.

marcoagpinto commented 5 years ago

Then in that case any word ending with s should not have /M in it. That's 2678 extra false words.

Well, I noticed it is common use of "s's": "Jesus's student was a great teacher".

Although it doesn't look nice, lots of people write this way, even if official documents.

Ding-adong commented 5 years ago

Officially in UK English s' is the way to go. s's is putted-up because people kept on adding 's to everything, even as contraction. But you have thousands of incorrect 's such as gas's etc. After all the spellchecker is there to correct human's mistakes eh? Any 's for contraction outside normal rules goes into my personal dic to save repetition, mostly by subtitle files to conserve space by avoiding has is etc.

Ding-adong commented 5 years ago

3GPP's - is that a contraction or should it be 3GPP/M ?

marcoagpinto commented 5 years ago

3GPP's - is that a contraction or should it be 3GPP/M ?

Nice finding, I will fix it tomorrow :-)