Revision in Continuous Space: Fine-Grained Control of Text Style Transfer

codertimo commented 4 years ago

어떤 내용의 논문인가요? 👋

기존 연구들에서는 unsupervised text style transfer 문제를 풀기 위해 주로 두가지 방법을 사용합니다. 1) contents와 attribute 간의 얽힘 문제를 푸는 방법 2) adversarial learning 을 활용하는 방법. 하지만 두가지 방법 모두 필수적으로 필요하지 않다는 것을 보여 줍니다.
본 논문에서는 text style transfer 를 만들기 위해, gradient 를 조정하여 inference 시 문장을 continuous 한 space 에 위치할 수 있도록 하는 새로운 프레임워크를 제안합니다.
이 프레임워크는 다음 3가지의 구성요소로 이루어져 있습니다. 1) VAE(variational autoencoder), 2) attribute predictors (one for each attribute) 3) content predictor
VAE와 두개의 predictor 를 통해서 target sentence의 representation 을 찾을 수 있는 gradient-based optimization이 가능해 집니다. (원래 target은 string이기 때문에 discrete 해서 학습하기 어려움, 예: seq2seq)
또한 제안한 방법의 자연스럽운 특성에 의해 여러개의 attribute(문장 길이, 특정 단어 필수 등장 등) 를 동시에 사용하는 style-transfer 기법을 사용할 수 있습니다.
기존의 adversarial 을 활용한 학습 기법보다 제안하는 방법이 보다 직관적이고, 제어하기 편하며, 학습하기도 쉽다는 이점을 갖습니다.
3개의 유명한 style-transfer task에서 모두 유의미한 격차로 SOTA를 달성했음.

Abstract (요약) 🕵🏻‍♂️

Typical methods for unsupervised text style transfer often rely on two key ingredients: 1) seeking the explicit disentanglement of the content and the attributes, and 2) troublesome adversarial learning. In this paper, we show that neither of these components is indispensable. We propose a new framework that utilizes the gradients to revise the sentence in a continuous space during inference to achieve text style transfer. Our method consists of three key components: a variational auto-encoder (VAE), some attribute predictors (one for each attribute), and a content predictor. The VAE and the two types of predictors enable us to perform gradient-based optimization in the continuous space, which is mapped from sentences in a discrete space, to find the representation of a target sentence with the desired attributes and preserved content. Moreover, the proposed method naturally has the ability to simultaneously manipulate multiple fine-grained attributes, such as sentence length and the presence of specific words, when performing text style transfer tasks. Compared with previous adversarial learning based methods, the proposed method is more interpretable, controllable and easier to train. Extensive experimental studies on three popular text style transfer tasks show that the proposed method significantly outperforms five state-of-the-art methods.

이 논문을 읽어서 무엇을 배울 수 있는지 알려주세요! 🤔

SEQ2SEQ과 같은 discrete 한 value를 target으로 학습하는 방식이 아닌, target 문장의 representation에 생성된 문장의 representation이 가까워지도록 하는 학습 방법을 이해할 수 있다.
제안하는 방식을 통해서 어떻게 multi-attribute 를 자연스럽게 학습할 수 있는지 이해할 수 있다.
VAE를 style transfer에 사용하는 방식을 알고, 이에 대한 인사이트를 얻을 수 있습니다.
두 attribute, sentiment predictor 의 각 역할에 대해 분석해보고, 인과관계를 파악할 수 있습니다.

레퍼런스의 URL을 알려주세요! 🔗

https://arxiv.org/abs/1905.12304

codertimo commented 4 years ago

Motivatation

Latent space 를 content 와 style 로 Distangle 하는 방식은 여러가지 문제가 있음을 주장합니다.
unsupervised style transfer 에서 Adversarial 을 기반으로 하는 학습은 학습하기 어렵다는 문제가 있습니다.

Method

단순하게 VAE, BoW, Attribute Classifier 를 동시에 학습 합니다.
그럼 style transfer 를 어떻게 하냐? (이 부분이 가장 강한 novelty 입니다)
- decoder 에 줄 latent variable 을 만들기 위해서 latent variable optimization 을 수행합니다.

위 수식을 보면, decoding 에 넣을 z' 을 iteration 을 돌면서 update 를 하는 것을 볼 수 있습니다. 이를 해석하면 z' 을 구하기 위해서 각 attribute classifier 의 loss 와 BoW loss 를 z' 에 대하여 gradient 를 구해 우리가 원하는 style과 content 를 보유한 z' 을 만들도록 하고 있습니다. 그리고 이 loss 가 일정 threshold 이하가 되도록 학습을 하고 있습니다.

이를 통해서 우리가 원하는 style 을 갖으며, 기존의 content 를 preservation 한 결과를 얻을 수 있습니다.

Experiment

스크린샷 2020-01-13 오전 12 44 17

전체적인 실험을 통해서 style-classification rate 는 기존과 비슷하게 가져가면서, fluency 에서는 높은 성능을 보여주는 성과를 거두었습니다. Amazone Review 데이터셋에서는 사람이 레이블링 한 데이터보다 fluency가 높은 성과를 기록하였습니다.

codertimo commented 4 years ago

본 논문이 #18 과 접근 방식이 매우 동일하다는 점에 있어서 novelty 에 대한 의심이 있습니다. 다만 두 논문중 어떤 연구가 선행 되었는지는 확실하지 않습니다.

codertimo / paper-log