This website requires JavaScript.
Explore
Help
Sign in
simcop2387
/
AutoGPTQ
Watch
1
Star
0
Fork
You've already forked AutoGPTQ
0
Code
Issues
Pull requests
Projects
Releases
Packages
1
Wiki
Activity
3ff6ab18cb
AutoGPTQ
/
auto_gptq
/
nn_modules
History
qwopqwop200
3ff6ab18cb
Merge branch 'main' into faster-llama
2023-05-06 00:20:29 +09:00
..
triton_utils
add triton support
2023-04-25 20:05:22 +08:00
__init__.py
refactor file structure
2023-04-25 18:58:20 +08:00
fused_attn.py
Fix 'groupsize' -> 'group_size' in all other .py files. I haven't touched any CUDA kernels in case there's any complexity there I don't understand
2023-05-05 14:44:16 +01:00
fused_mlp_triton.py
add fused mlp ,fused attn
2023-05-02 18:55:44 +09:00
qlinear.py
Merge branch 'main' into faster-llama
2023-05-06 00:20:29 +09:00
qlinear_old.py
Merge branch 'main' into faster-llama
2023-05-06 00:20:29 +09:00
qlinear_triton.py
Fix 'groupsize' -> 'group_size' in all other .py files. I haven't touched any CUDA kernels in case there's any complexity there I don't understand
2023-05-05 14:44:16 +01:00