nipunsadvilkar/pySBD
🐍💯 pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection module that works out of the box.
MIT License
813 stars
84 forks
issues
Add SBD tests from Pragmatic Segmenter
#24
nipunsadvilkar
closed
5 years ago
1
[Tests] 12 Tests passing
#23
nipunsadvilkar
closed
5 years ago
0
Initial ListItemReplacer class and method implementations
#22
nipunsadvilkar
closed
5 years ago
0
Fixed regex escapes and more
#21
nipunsadvilkar
closed
5 years ago
1
Stable punctuation_replacer and between_punctuation
#20
nipunsadvilkar
closed
5 years ago
0
Update process_text function
#19
nipunsadvilkar
closed
5 years ago
1
Add initial project structure and rules
#18
nipunsadvilkar
closed
5 years ago
0
Use Timex format (tag) to substitute word/spaces/delimiters
#17
nipunsadvilkar
closed
5 years ago
0
Make changes in regex use `\\` in special characters
#16
nipunsadvilkar
closed
5 years ago
1
Divide processor task into small steps
#15
nipunsadvilkar
closed
5 years ago
0
Integrating rule for consecutive characters occurrence
#14
nipunsadvilkar
closed
5 years ago
0
Check and write rule for handling no space in between sentences
#13
nipunsadvilkar
closed
5 years ago
0
Replacing Table of contents kind of text having multiple periods
#12
nipunsadvilkar
closed
5 years ago
0
Handling multiple types of quotations
#11
nipunsadvilkar
closed
5 years ago
0
Integrating Inline formatting rule
#10
nipunsadvilkar
closed
5 years ago
0
Handling punctuation within brackets
#9
nipunsadvilkar
closed
5 years ago
0
Removing HTML tags rule
#8
nipunsadvilkar
closed
5 years ago
0
Integrating newlines handling rule
#7
nipunsadvilkar
closed
5 years ago
0
Explore Abbreviation handling and write rules for it
#6
nipunsadvilkar
closed
5 years ago
0
Deciding preprocessing steps needed for handling clean text
#5
nipunsadvilkar
closed
5 years ago
0
Listing rules for cleaning unwanted formatting
#4
nipunsadvilkar
closed
5 years ago
0
Making it compatible with multiple document type format
#3
nipunsadvilkar
closed
5 years ago
1
Adding Multiple Language Support
#2
nipunsadvilkar
closed
4 years ago
6
Deciding API structure for module like scikit-learn
#1
nipunsadvilkar
closed
5 years ago
1