Kernels

Commit History

Add built binary [skip-build]
f2471cd

github-actions[bot] committed on

fix(rms_norm.py): add assertion for input gradients to handle unsupported cases in backward pass
f19f8f4

wyldecat committed on

feat: support sequence parallel with fused_add_rms_norm
151bb5a

wyldecat committed on

refactor(activation): change fused_add_rms_norm and fused_add_rms_norm_backward to out-place operations
7e4334d

wyldecat committed on

refactor(rms_norm): move RMS normalization logic to a new module for better organization and maintainability
66b3c5e

wyldecat committed on

feat: support sequence parallel with rms_norm
06d6367

wyldecat committed on

feat: add assert is_contiguous
a2a2501

wyldecat committed on

feat: make rms_norm as out-place
9d0a235

wyldecat committed on

feat(workflow): add Slack notifications for build start, success, and failure [skip-build]
ab05e35

wyldecat committed on

Revert "fix typo in readme (#7)" (#8)
ddd119c
unverified

TaehyunKim committed on

fix typo in readme (#7)
2d926c3
unverified

TaehyunKim, github-actions[bot] committed on

chore: add build action [skip-build]
5dde6fa

wyldecat committed on

Fix fused add rms norm (#4)
a1e5ca8
unverified

TaehyunKim, TaehyunKimMotif committed on

chore: add license
e677f62

wyldecat committed on

chore: add push-to-hf workflow
269223c

wyldecat committed on

Merge pull request #1 from MotifTechnologies/add_action
9c49e08
unverified

TaehyunKim committed on

torch 2.8 support (#1)
43629b7
verified

iamwyldecat committed on

feat: add cuda build
cf68df1

iamwyldecat committed on

feat: support reset_parameters()
605f22e

iamwyldecat committed on

feat(rms-norm): Impl fused RMSNorm
f3b99fb

iamwyldecat committed on

feat(poly-norm): add perf test
d14fd4d

iamwyldecat committed on

fix(poly-norm): calc param grad explicitly
704692b

iamwyldecat committed on

fix(poly-norm): fix bug in reduce sum
883cc1c

iamwyldecat committed on

refactor(poly-norm): fix indentation
32c2bde

iamwyldecat committed on

refactor(poly-norm): use const for immutable args
e85ecc9

iamwyldecat committed on

chore(poly-norm): remove unnecessary file
552d415

iamwyldecat committed on

chore: use latest build image and misc
f5a7d38

iamwyldecat committed on

chore(poly-norm): update README and build artifacts
f72121c

iamwyldecat committed on

feat(poly-norm): add default value for eps argument
afd2a56

iamwyldecat committed on

chore(poly-norm): add ROCm build artifacts
4b70498

iamwyldecat committed on

feat(poly-norm): Add PolyNorm
44e9845

iamwyldecat committed on

initial commit
7a7d761
verified

iamwyldecat committed on