imsky / wordlists

Words categorized by topic.
MIT License
296 stars 128 forks source link

Consolidate all words in a structured file and generate individual wordlists from the file #2

Open imsky opened 5 years ago

imsky commented 5 years ago

a root-level JSON should work

bcopeland commented 4 years ago

In case it is of any use, you can get pretty far towards a consolidated file with just a bunch of unix tools:

 grep -r '' . | grep .txt | awk -F '/' '{print $NF}' |
    sed -e "s/.txt//g" | awk -F: '{print $2 " " $1}' | sort

excerpt:

young age
young chicago
young english
you're* reddit
you scary reddit
you wouldn't download reddit
yr hipster
yukon microsoft
yverdon-les-bains switzerland
zaanstad netherlands
imsky commented 4 years ago

ok might give that a try @bcopeland as a first pass, thanks

origintopleft commented 1 year ago

I have a Python script within one of my own projects that takes all the files of wordlists and renders them as a module file for the Rant procgen language. I'd like to fix it up, add support for rendering JSON files using Python's standard library functions, and submit it directly to this repository. Do you have a preferred location for utility scripts like this, or should I just copy it to the repository root?

imsky commented 1 year ago

@origintopleft add to root for now