alpheios-project/tokenizer
Alpheios Tokenizer Service
issues
Update onesegment -> no segment
#48
irina060981
closed
2 years ago
0
Update template to support all kinds of XML as a source of a text
#46
irina060981
closed
2 years ago
6
process all elements in xml other than cts and dts wrappers
#45
balmas
closed
2 years ago
1
Update xslt template to support parsing not only by namespace
#44
irina060981
closed
2 years ago
0
Add onesegment property
#43
irina060981
closed
2 years ago
0
Add segmentation type - one segment
#42
irina060981
closed
2 years ago
1
Spacy update
#41
irina060981
closed
3 years ago
0
Force tokenizer to recognize every character as a separate token for Chinese
#40
irina060981
closed
2 years ago
1
fix #38
#39
balmas
closed
3 years ago
0
DTSAPI: Duplicate reference in alignment group
#38
monzug
closed
3 years ago
5
fix #36
#37
balmas
closed
3 years ago
1
TEI XML from Betamasaheft API returns empty result
#36
irina060981
closed
3 years ago
3
Additional language dependencies
#34
irina060981
closed
3 years ago
3
additional language dependencies
#33
balmas
opened
3 years ago
3
fix #31
#32
balmas
closed
3 years ago
7
Problems with Chinese text
#31
irina060981
closed
3 years ago
0
Question on tokenization errors
#30
monzug
opened
3 years ago
5
empty alignment group with TEI.2 tag
#29
monzug
closed
2 years ago
12
I22 i26
#28
balmas
closed
3 years ago
0
I23 i24 i25
#27
balmas
closed
3 years ago
0
report more meaningful errors
#26
balmas
closed
3 years ago
2
problems with changing starting segment index
#25
balmas
closed
3 years ago
4
text in elements outside the segment element gets included in the output
#24
balmas
closed
3 years ago
2
don't fail on XML declaration
#23
balmas
closed
3 years ago
2
normalize > 2 new lines in a row in plain text input
#22
balmas
closed
3 years ago
3
I20
#21
balmas
closed
3 years ago
0
input parameter fixes and improvements
#20
balmas
closed
3 years ago
0
make handling of new lines more robust
#19
balmas
closed
3 years ago
1
"id" should not be tokenized in Latin
#18
balmas
closed
3 years ago
1
Doubleline segments mode doesn't work inside a request from the Alignment Editor
#17
irina060981
closed
3 years ago
1
id handling
#16
balmas
closed
3 years ago
1
Support more granular specification of element filters for TEI XML
#15
balmas
opened
3 years ago
0
add support for bibliographic metadata
#14
balmas
opened
3 years ago
0
add support for DTS urls
#13
balmas
opened
3 years ago
0
production deployment
#12
balmas
opened
3 years ago
0
finish OpenAPI schema
#11
balmas
opened
3 years ago
1
Externalize language map
#10
balmas
opened
3 years ago
0
urns and language models
#9
balmas
closed
3 years ago
0
do we need to handle mid-word hyphenation?
#8
balmas
opened
3 years ago
0
handle Latin -ne and -ve enclitics
#7
balmas
opened
3 years ago
0
citation is not applied correctly without the first empty line.
#6
balmas
closed
3 years ago
0
Make sure linebreaks are preserved in plain text input
#5
balmas
closed
3 years ago
1
rtl/ltr input parameter needed?
#4
balmas
closed
3 years ago
1
review initial design
#3
balmas
closed
3 years ago
27
add token exception for cts urns and uris
#2
balmas
closed
3 years ago
0
sentencizer handle new lines
#1
balmas
opened
4 years ago
1