Home
Categories
LoRA
Category
Cancel
LoRA
1
Rethink LoRA initialisations for faster convergence
Jun 7, 2024
Trending Tags
FFNN
Transformer
Attention
Math
activations
Data Parallelism
Differential Transformer
Fine tuning
GPU
GQA