Latest episodes of the podcast AI: post transformers
Showing page 17 of 18
GQA: Grouped Query Attention
07/08/2025
Longformer: A Transformer for Long Documents
07/08/2025
RoPE
07/08/2025
Batch Normalization
07/08/2025
Chinchilla: Optimal Language Model Scaling
07/08/2025
Transformer Scaling
07/08/2025
Learning from repeated data
07/08/2025
Scaling Laws
07/08/2025
LSTM: the forget gate
07/08/2025
GPT4 Technical Report
07/08/2025
GPT3
07/08/2025
GPT2
07/08/2025
GELU
07/08/2025
Dropout
07/08/2025
ResNets - residual block
07/08/2025
BERT
07/08/2025