Back to Interests
NIPS 2014

Sequence to Sequence Learning with Neural Networks .

NLP Machine Translation LSTM

Authors

Sutskever et al.

Conference

NIPS 2014

Abstract

Introduced the encoder-decoder architecture for sequence-to-sequence tasks like machine translation.

Architecture

  • Encoder LSTM: Reads input sequence, produces fixed-size context vector
  • Decoder LSTM: Generates output sequence conditioned on context vector

Impact

Before Transformers, this was the dominant architecture for NLP tasks. Laid groundwork for attention mechanisms and modern NLP.