1. Attention is all you need
  2. Sequence to Sequence Encoder - Decoder Models (Text 2 Text)
  3. Encoder Only Models Auto Encoding Models (Masked Language Modelling)
  4. Decoder Only Models Auto Regressive Models (Causal Language Modelling)
  5. Language Models