code-kern-ai / bricks

Open-source natural language enrichments at your fingertips.
Apache License 2.0
447 stars 23 forks source link

[BUG] - Time Extraction extracts numbers #204

Open FelixKirsch opened 1 year ago

FelixKirsch commented 1 year ago

What module did you encounter a bug with?

https://bricks.kern.ai/extractors/428

Please describe the bug you observed with this module.

Description

The time extraction module extracts numbers that are not part of any time indication.

Input:

{
    "text": "18 Problems Only People Who Are Always Cold Understand",
    "spacyTokenizer": "en_core_web_sm"
}

Expected output:

{"times":[]}

Actual output:

{"times":[["time",0,1]]}
LeonardPuettmann commented 1 year ago

Will look into this! Thanks for mentioning it.

vvkrddy commented 1 year ago

I have created a pull request (#282) addressing this issue. Please take a look.