🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
transformer mha chinese translation
Varuna Jayasiri committed
f6e913eb09cabca03d7c015867ec4929de8c3d1b
Parent: d3f0bd3