Transformer Decoder Architecture
A detailed view of a Transformer decoder architecture with 100 billion parameters.

Overall Architecture
Inputs
Decoders
Linear Layer
Outputs
Attention Mechanism
Number of Parameters
Last updated