This repo aims to provide a collection of efficient Triton-based implementations of state-of-the-art linear attention models. Pull requests are welcome!