CUDA Transformer Implementation
We’ve all heard of the famous 2017 AI paper “Attention is all you Need”. I’ve re-implemented the paper in CUDA C++ to make the most efficient use of the GPU!
We’ve all heard of the famous 2017 AI paper “Attention is all you Need”. I’ve re-implemented the paper in CUDA C++ to make the most efficient use of the GPU!