Closed nsch0e closed 4 months ago
Sry, was a user error: the default suggester for spans uses ngrams of sizes 1,2,3. The bigger spans contain 5 tokens, so the suggester never suggested the complete span.
Fixed by using this config:
config = {
"suggester": {"@misc": "spacy.ngram_suggester.v1", "sizes": [1, 2, 3, 4, 5]}
}
pipe = model.add_pipe("spancat", config=config)
No problem, thanks for reporting back! 🙏
This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.
A span containing more than one INFIX "tokens" will not be recognized.
Expected behaviour
all 4 date strings should be recognized.
How to reproduce the behaviour
Was run in a jupyter notebook for displaying purpose
Your Environment