caretech-owl / gerd

Generating and evaluating relevant documentation (GERD)
https://towardsdatascience.com/running-llama-2-on-cpu-inference-for-document-q-a-3d636037a3d8
MIT License
4 stars 0 forks source link

TODO: Label GRASCCO #26

Closed aleneum closed 7 months ago

aleneum commented 10 months ago

Software

Labelstud.io

Label:

Labels 'Section' (Prio A)

Ergänzungen 'Context' (Prio A)

Labels 'Medication Information' (Prio B)

Ressourcen

Traget Format: NER (TODO)

{
    'train': {
        'tokens': [
            ['@paulwalk', 'It', "'s", 'the', 'view', 'from', 'where', 'I', "'m", 'living', 'for', 'two', 'weeks', '.', 'Empire', 'State', 'Building', '=', 'ESB', '.', 'Pretty', 'bad', 'storm', 'here', 'last', 'evening', '.'],
            ['From', 'Green', 'Newsfeed', ':', 'AHFA', 'extends', 'deadline', 'for', 'Sage', 'Award', 'to', 'Nov', '.', '5', 'http://tinyurl.com/24agj38'], ...
        ],
        'tags': [
            [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 2, 2, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0],
            [0, 0, 0, 0, 3, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0], ...
        ]
    },
    'validation': ...,
    'test': ...,
}

source: t-ner

Depo14 commented 10 months ago

Have added our defined labels

aleneum commented 7 months ago

labeled everything we wanted (for now)