hfthair / emerald_crawler

10 stars 2 forks source link

Question about the implementation of BART-Facet #4

Closed JaniceXiong closed 2 years ago

JaniceXiong commented 3 years ago

I am confused about "indicator" amd "section tags" described in the implementation of BART-Facet. image Could you give an example like if I want to predict the Purpose of the article, how to change the src (full or I+C) and tgt? image

We train the model for purpose, method, findings, value separately, or just train a model? If only a model, how can I tell the model which facet to generate?

memray commented 3 years ago

Hey @JaniceXiong ,

We append a special token (e.g. PURPOSE) at the beginning of the source text to indicate the facet of summary to generate. And we train a single model to generate all facets.

Best, Rui

JaniceXiong commented 3 years ago

Thanks a lot @memray But how about the "section tags" at the begining of each training input? I read the paper "Tldr: Extreme summarization of scientific documents" which describes CATTS. CATTS means a scaffold task to predict titles indicated by control codes. How to use CATTS in your model? I want to build the model described in your papar, and these details seem important :)

memray commented 3 years ago

Oh I only mean that we use the control code to indicate the facet to output, as illustrated in the figure below. We didn't use the title generation as an auxiliary training task. facet_sum

We are organizing our code for data processing and it will be available very soon. Also, our slides might be useful for clarification.

Best, Rui

JaniceXiong commented 3 years ago

Ok I got it! Thanks again for your work and reply !!! @memray

JaniceXiong commented 3 years ago

I have another question about "Full" in result tables. @memray image

BART-Faceted summarize a specific facet at a time, but BART-CAT generate a long summary by concatenating all facets. When comparing these two model, did you concatenate all the facets summarized by BART-Faceted?

memray commented 3 years ago

@JaniceXiong Yes, that's right.

JaniceXiong commented 3 years ago

@memray Thanks! By the way, do you have a plan to release the model code and pretrained model?