Learning Deep Transformer Models for Machine Translation

The Transformer is the state-of-the-art model in recent machine translation evaluations. Two strands of research show promise for improving models of this kind: the first uses wide networks (a.k.a. Transformer-Big) and has been the de facto standard for developing Transformer systems, and the other uses deeper language representation but …