sillsdev / silnlp

A set of pipelines for performing experiments on various NLP tasks with a focus on resource-poor/minority languages.

Guided Decoding #178

Open johnml1135 opened 11 months ago

johnml1135 commented 11 months ago

Survey the existing papers to determine the best path forward, then research and implement guided decoding. Assess the improvement in BLEU score and in user-assessed quality. Address concerns for languages that attach prefixes and suffixes to proper names and key terms.

johnml1135 commented 11 months ago

https://huggingface.co/blog/constrained-beam-search
https://github.com/huggingface/transformers/pull/15761

johnml1135 commented 11 months ago

We should probably just use the Hugging Face implementation from 2022 (see the links above), but it may need to be modified, for a few reasons:

It does not use alignment to guide when to add tokens.
It cannot handle wildcards, which are useful in many of the languages we deal with, though we may be able to build wildcard support on top of disjunctive constraints.
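
To make the wildcard idea concrete, here is a minimal sketch, not anything that exists in silnlp. It assumes we already have a list of known surface forms for each key term (for example, collected from existing translations), expands a pattern such as `Petr*` against that list, and emits one group of alternatives per key term. The nested list has the shape that the Hugging Face `generate` documentation describes for disjunctive `force_words_ids` (a list of constraints, each a list of alternative token-id sequences). The function names, the toy character-level encoder, and the sample words are all hypothetical.

```python
from fnmatch import fnmatchcase
from typing import Callable, List


def expand_wildcard(pattern: str, known_forms: List[str]) -> List[str]:
    """Return the known surface forms that match a key-term pattern like 'Petr*'."""
    return [form for form in known_forms if fnmatchcase(form, pattern)]


def build_disjunctive_constraints(
    patterns: List[str],
    known_forms: List[str],
    encode: Callable[[str], List[int]],
) -> List[List[List[int]]]:
    """Build one group of alternative token-id sequences per key-term pattern.

    Each inner group means "at least one of these forms must appear".
    """
    constraints = []
    for pattern in patterns:
        forms = expand_wildcard(pattern, known_forms)
        if not forms:
            continue  # no attested form, so there is nothing to constrain on
        constraints.append([encode(form) for form in forms])
    return constraints


def toy_encode(text: str) -> List[int]:
    """Hypothetical stand-in for a real tokenizer: one id per character."""
    return [ord(ch) for ch in text]


# Hypothetical inflected forms of two proper names.
known_forms = ["Petr", "Petra", "Petrovi", "Pavel"]
constraints = build_disjunctive_constraints(["Petr*", "Pavel"], known_forms, toy_encode)
```

With a real model, `encode` would presumably be something like `lambda s: tokenizer(s, add_special_tokens=False).input_ids`, and the result would be passed to `generate` together with `num_beams` greater than 1. This only covers forms we have already seen, so it does not address the alignment concern above, and it has not been tested against the problems reported with the HF implementation.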

ddaspit commented 11 months ago

I added preliminary support for using HF constrained beam search to silnlp. From the experiments I have run, it doesn't work very well.

johnml1135 commented 11 months ago

@ddaspit - do you know why the tests didn't go that well? Do you have the results documented somewhere? Is it "keyterms with asterisks don't work well" or "their algorithm is poor" or "certain languages don't do well with this"? Is it worth more research now, or do we want someone else to lead the charge? Do we need to add alignment data to enhance it? LILT appears to have been able to get this working well enough to integrate into their main offering - so I am inclined to believe it is possible to have it be advantageous.

ddaspit commented 11 months ago

There definitely seems to be something wrong with the implementation in HF. Here is an issue that describes the problems I was seeing.

johnml1135 commented 2 months ago

Although we have not implemented it, this approach may do better than the current Hugging Face implementation: https://arxiv.org/pdf/2112.08726.pdf - with this code: https://github.com/GXimingLu/a_star_neurologic.