Skip to content

simondanielsson/custom-triton-kernels

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Custom GPU kernel implementations in Triton for DL workflows

Project is structured as follows:

.
├── src
│   └── kernels
│       ├── __init__.py
│       ├── dropout.py
│       └── softmax.py
│       └── ...
├── tests
│   ├── __init__.py
│   └── dropout_test.py
│   └── softmax_test.py
│   └── ...

Currently supported kernels:

  • Softmax
  • Dropout
  • Vector addition

More to be implemented. Coming up:

  • [] Matrix addition
  • [] Block-based fused softmax
  • [] Layer norm
  • [] Matrix multiplication
  • [] Fused attention

Testing kernel correctness

To run tests, simply run

make test

Prepare your environment

python3 -m venv venv
source venv/bin/activate
make install_dev

About

Custom-built triton kernels

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published