direct-phonology/jdsw
Parsing the "Jingdian Shiwen" with spaCy
MIT License
Prep for automated alignment #20 (Closed)
thatbudakguy closed 1 year ago
thatbudakguy commented 1 year ago
Convert manually aligned Lunyu to test fixture
Ignore prodigy binaries
Create top-level project directory
Delete old run.py script
Revert directory structure changes
Generate and use .csv instead of .tsv files
Rename src/ to txt/
Regenerate split output as .csv
Make alignment script work with .csv input
Add fixture and tests for alignment script
Fix some indexing bugs in alignjdsw
Update tests for alignjdsw
Formatting cleanup
Implement variant-aware fuzzy_find function
Stub alignjdsw test until it is re-implemented as pipe
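
The commit list mentions a variant-aware fuzzy_find function but does not show how it works. As a rough illustration only, a function of that kind could normalize variant characters to one canonical form and then slide a window over the text, scoring each window against the query. Everything below is an assumption made for this sketch and is not taken from the repository: the VARIANTS table, the signature, the threshold default, and the use of difflib for scoring.

```python
from difflib import SequenceMatcher

# Hypothetical variant table (an assumption, not the project's data):
# maps a variant character to the canonical form used for comparison.
VARIANTS = {"說": "説", "爲": "為"}


def normalize(text, variants=VARIANTS):
    """Replace each variant character with its canonical form.

    The mapping is one character to one character, so offsets into the
    normalized text are also valid offsets into the original text.
    """
    return "".join(variants.get(ch, ch) for ch in text)


def fuzzy_find(haystack, needle, variants=VARIANTS, threshold=0.7):
    """Return (start, end, score) for the best window, or None.

    Slides a window the length of the needle across the haystack and
    keeps the first window with the highest similarity ratio. Returns
    None when that ratio is below the threshold.
    """
    if not needle or len(needle) > len(haystack):
        return None
    hay = normalize(haystack, variants)
    ndl = normalize(needle, variants)
    width = len(ndl)
    best = None
    for start in range(len(hay) - width + 1):
        window = hay[start:start + width]
        score = SequenceMatcher(None, window, ndl, autojunk=False).ratio()
        if best is None or score > best[2]:
            best = (start, start + width, score)
    if best is not None and best[2] >= threshold:
        return best
    return None


if __name__ == "__main__":
    text = "學而時習之不亦說乎"
    # The query uses the variant form 説; it still matches 說 in the text.
    print(fuzzy_find(text, "不亦説乎"))
```

A fixed-width window keeps the sketch short, but it cannot match spans where characters were inserted or dropped; a real aligner would likely need variable-width matching.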