Attention Is All You Need
Sequence-to-Sequence Encoder-Decoder Models (Text-to-Text)
Encoder-Only Models: Autoencoding Models (Masked Language Modelling)
Decoder-Only Models: Autoregressive Models (Causal Language Modelling)
Language Models