Rebuild hyphenated words, using Wikpedia abstract as a lexicon.
In french, césure means hyphen, décésure is a neologism meaning unhyphen.
Go get some abstract.xml.gz file somewhere in http://ftp.acc.umu.se/mirror/wikimedia.org/dumps/enwiki/
python -m decesure.index ~/Downloads/enwiki-20230320-abstract.xml.gz
Reading the abstract is really slow, about 5 minutes.
You can compress the lexicon
gzip -9 en.txt
python -m decesure en.txt.gz deploy- ment
from decesure import Decesure
cesure = Decesure("path/to/en.txt.gz")
cesure.unhyphen("deploy-", "ment")