🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Fix RoPE inner product equation & add note on the difference in implementation with the original paper
thanhtcptit committed
8c84d6ee46985cc1bd98831443c7c4af8d1a2321
Parent: 89a3ae8