Extractive and abstractive summarization designs have historically been fragmented, limiting the benefits that often arise from compatible model architectures. In this paper, we explore the potential synergies of modeling extractive summarization with an abstractive summarization system and propose three novel inference algorithms using the sequence-to-sequence architecture. We evaluate them on the CNN \& Dailymail dataset and show that recent advancements in abstractive system designs enable abstractive systems to not only compete, but even surpass the performance of extractive systems with custom architectures. To our surprise, abstractive systems achieve this without being exposed to extractive oracle summaries and, therefore, for the first time allow a single model to produce both abstractive and extractive summaries. This evidence questions our fundamental understanding of extractive system design, and the necessity for extractive labels while pathing the way for promising research directions in hybrid models.

Translation (by gpt-3.5-turbo)

これまで、抽出型と要約型の要約設計は分断されており、互換性のあるモデルアーキテクチャから生じる利点が制限されていました。本論文では、抽出型要約と要約型要約システムのモデリングの潜在的な相乗効果を探求し、シーケンス・トゥ・シーケンス・アーキテクチャを使用した3つの新しい推論アルゴリズムを提案します。CNN＆Dailymailデータセットで評価し、最近の要約システム設計の進歩により、要約型システムがカスタムアーキテクチャを持つ抽出型システムのパフォーマンスを超えることができることを示します。驚くべきことに、要約型システムは抽出型のオラクル要約にさらされることなく、要約型と抽出型の要約の両方を単一のモデルで生成することができます。この証拠は、抽出型システム設計の基本的な理解と、ハイブリッドモデルの有望な研究方向を示す上で、抽出型ラベルの必要性に疑問を投げかけます。
Summary (by gpt-3.5-turbo)
本研究では、抽出型要約と要約型要約の相乗効果を探求し、シーケンス・トゥ・シーケンス・アーキテクチャを使用した3つの新しい推論アルゴリズムを提案しています。これにより、要約型システムが抽出型システムを超えることができることを示しました。また、要約型システムは抽出型のオラクル要約にさらされることなく、両方の要約を単一のモデルで生成できることも示しました。これは、抽出型ラベルの必要性に疑問を投げかけるものであり、ハイブリッドモデルの有望な研究方向を示しています。

AkihikoWatanabe / paper_notes

Abstractive Summarizers are Excellent Extractive Summarizers, ACL'23 #859

Translation (by gpt-3.5-turbo)

Summary (by gpt-3.5-turbo)