tetherless-world / mowgli-etl

DARPA Machine Common Sense (MCS) Multi-modal Open World Grounded Learning and Inference (MOWGLI) Extract-Transform-Load sub-project
MIT License

Source: WebDataCommons products #18

Open gordom6 opened 4 years ago

gordom6 commented 4 years ago

A student (Jody Sunray) did initial work on this. Her tasks were to (1) use regular expressions to parse dimensions from the WebDataCommons product corpus and (2) infer generic product types (e.g., "table", "bowling ball") from descriptions using natural language processing in order to (3) create qualitative spatial relations such as (bowlingBall, smallerThan, table).

I've moved the start of her work to a "wdc" pipeline in the usual structure.
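
As a rough illustration of tasks (1) and (3) above, here is a minimal Python sketch. It is not the code in the "wdc" pipeline: the regular expression, the unit table, and the function names `parse_dimensions` and `smaller_than` are all hypothetical. It assumes dimensions appear in the text as "W x D x H unit", and it compares objects by their largest extent, which is only one of several possible definitions of "smaller". Task (2), inferring the generic product type from a description, is not shown; the type labels are simply passed in.

```python
import re
from typing import Optional, Tuple

# Hypothetical pattern: two or three numbers separated by "x", followed by a unit,
# e.g. "150 x 90 x 75 cm" or "8.5x8.5 in". Real product text is far messier.
_NUMBER = r"(\d+(?:\.\d+)?)"
_DIMENSIONS_RE = re.compile(
    _NUMBER + r"\s*[x×]\s*" + _NUMBER
    + r"(?:\s*[x×]\s*" + _NUMBER + r")?"
    + r"\s*(inches|inch|in|cm|mm)\b",
    re.IGNORECASE,
)

# Conversion factors to centimetres (assumed unit vocabulary).
_TO_CM = {"inches": 2.54, "inch": 2.54, "in": 2.54, "cm": 1.0, "mm": 0.1}


def parse_dimensions(text: str) -> Optional[Tuple[float, ...]]:
    """Return the first dimensions found in text, in centimetres, or None."""
    match = _DIMENSIONS_RE.search(text)
    if match is None:
        return None
    *numbers, unit = match.groups()
    factor = _TO_CM[unit.lower()]
    # The third number is optional, so its group may be None.
    return tuple(float(n) * factor for n in numbers if n is not None)


def smaller_than(
    subject: str,
    subject_dims: Tuple[float, ...],
    obj: str,
    object_dims: Tuple[float, ...],
) -> Optional[Tuple[str, str, str]]:
    """Emit (subject, smallerThan, obj) if subject's largest extent is smaller."""
    if max(subject_dims) < max(object_dims):
        return (subject, "smallerThan", obj)
    return None


if __name__ == "__main__":
    ball = parse_dimensions("Bowling ball, 8.5 x 8.5 x 8.5 in")
    table = parse_dimensions("Oak table 150 x 90 x 75 cm")
    if ball is not None and table is not None:
        print(smaller_than("bowlingBall", ball, "table", table))
```

Comparing by largest extent keeps the sketch short; comparing volume or bounding-box containment would produce different relations for long, thin products.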