Update dictionary to remove "comma"

dax-westerman commented 1 month ago

Need to remove comma entry from res/dicts/dict.txt

This involves the following steps:

[x] Delete line from dict.txt
```
2|,|PUNCT|boundry
```
[ ] Renumber index
- The manual method would involve changing the 'semicolon' entry's index from 3 to 2.

The following will be moved to a feature branch to explore; it is left struck here for posterity:
- ~~A more automated mechanism might leverage a Pandas DataFrame, in order to avoid manual manipulation:~~
- ~~A common load method (pandas.read_csv)~~
- ~~A common means of updating using DataFrame/Series methods~~
- ~~A common means of validating using DataFrame masks and a framework to eval~~
- ~~A common means to persist the output~~
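
For reference, a minimal sketch of the manual renumbering step in plain Python, not the project's actual tooling. It assumes the pipe-delimited layout seen in the deleted line (index, term, and two further columns) and that rows are sorted by index. The function name `remove_entry` and every sample row other than the comma line are hypothetical. Only the consecutive indices directly after the removed row are shifted, so entries numbered in a separate block are left alone.

```python
def remove_entry(lines, term):
    """Drop the row whose term column equals `term`, then shift the
    consecutive indices that follow it down by one.

    Assumes rows look like `index|term|...` and are sorted by index.
    """
    rows = [line.rstrip("\n").split("|") for line in lines if line.strip()]
    matches = [int(row[0]) for row in rows if row[1] == term]
    if not matches:
        raise ValueError(f"term {term!r} not found")
    gap = matches[0]  # index freed by the removed row (first match)

    result = []
    for row in rows:
        index = int(row[0])
        if row[1] == term:
            continue  # skip the row being removed
        if index == gap + 1:
            # This row directly follows the gap: move it down, advance the gap.
            index, gap = gap, gap + 1
        result.append("|".join([str(index)] + row[1:]))
    return result


sample = [
    "1|.|PUNCT|boundry",      # hypothetical row
    "2|,|PUNCT|boundry",      # the row quoted in this issue
    "3|;|PUNCT|boundry",      # hypothetical layout for the 'semicolon' entry
    "1000|and|CONJ|boundry",  # hypothetical row in a later numbering block
]
print("\n".join(remove_entry(sample, ",")))
```

With this sample, the 'semicolon' row moves from 3 to 2 and the row at 1000 keeps its index.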

suzytamang commented 1 month ago

The terms are not enumerated from 0, but we can do that. Right now they are grouped in chunks of 1000s, but it's more than fine to enumerate if we have a system.

dax-westerman commented 1 month ago

> The terms are not enumerated from 0, but we can do that. Right now they are grouped in chunks of 1000s, but it's more than fine to enumerate if we have a system.

As for the second point, I should have left that as a separate "idea" rather than including it as part of this effort, so sorry for any confusion. It was an area of exploration I'd wanted to include as a potential means of managing the dictionary, one that would also provide a method for validation. I'm going to strike it through to keep the issue clean :)

Thanks!

suzytamang / clever-rockies

Update dictionary to remove "comma" #15