cboulanger / excite-docker

Docker image with tools for the annotation of ML training docs for reference extraction based on the EXparser tools
https://cboulanger.github.io/excite-docker
GNU General Public License v3.0
0 stars 0 forks source link

Handling of ders. in German citations #2

Open iurshina opened 2 years ago

iurshina commented 2 years ago

Add a rule for "use the last author name previously recognized" when "ders." is encountered

cboulanger commented 2 years ago

maybe it could be implemented as a regular expression which is passed as a command line option

docker run -v $(pwd):/app excite_toolchain segmentation --repeated-author-symbol="ders[.]?|[-]{2,}"

This would make it possible to configure the list in the Web UI frontend, depending on language.