CUDA Transformer Implementation

We’ve all heard of the famous 2017 AI paper “Attention is all you Need”. I’ve re-implemented the paper in CUDA C++ to make the most efficient use of the GPU!

Demo Video

Previous
Previous

Fashion Search Agent

Next
Next

Chat Memex