Introduction
Introduction to the Transformer Model
An overview of the Transformer, a sequence transduction model based solely on attention mechanisms, dispensing with recurrence and convolutions entirely.
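
To make "based solely on attention" concrete, here is a minimal sketch of a single attention step. It assumes the scaled dot-product form, softmax(QK^T / sqrt(d_k)) V, which this overview does not itself spell out. The function names `softmax` and `attention` are illustrative only, and the sketch omits learned projections, multiple heads, and masking.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def attention(queries, keys, values):
    """Scaled dot-product attention on plain Python lists.

    queries: list of n_q vectors of length d_k
    keys:    list of n_k vectors of length d_k
    values:  list of n_k vectors of length d_v
    Returns a list of n_q vectors of length d_v.
    """
    d_k = len(keys[0])
    scale = 1.0 / math.sqrt(d_k)
    d_v = len(values[0])
    outputs = []
    for q in queries:
        # Similarity of this query to every key, scaled by 1/sqrt(d_k).
        scores = [scale * sum(qi * ki for qi, ki in zip(q, k)) for k in keys]
        # Turn the scores into weights that sum to 1.
        weights = softmax(scores)
        # Each output is a weighted average of the value vectors.
        out = [sum(w * v[j] for w, v in zip(weights, values)) for j in range(d_v)]
        outputs.append(out)
    return outputs
```

Every output position is computed directly from all input positions in one step, with no recurrent state carried from one position to the next and no convolution window. That is the property the summary above refers to.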