keras-team / keras-hub

Pretrained model hub for Keras 3
Apache License 2.0
805 stars 245 forks source link

Scope `BertSpanLabeler` and supporting infra #625

Open jbischof opened 1 year ago

jbischof commented 1 year ago

BertClassifier supports one output for the entire input sequence, but span labeling is an important research benchmark. This will require some scoping and design:

  1. Do we need a separate BertSpanLabelingPreprocessor or just a more complicated label to pass through?
  2. Do we want to offer a script that can turn a standard dataset like squadv2 into something our preprocessor can use? The "answer text" must be turned into indicies in the tokenized input.
  3. Create a BertSpanLabeler task model with the correct task-specific layers and preprocessing.
jbischof commented 1 year ago

References: