simcop2387/AutoGPTQ
AutoGPTQ/auto_gptq/nn_modules (at commit fe5f5d12ed)
Latest commit: fe5f5d12ed by PanQiWei, "Merge branch 'main' into peft_integration", 2023-05-26 09:48:06 +08:00
Name                 Last commit message                                    Date
qlinear              support faster vecquant4matmul cuda kernel             2023-05-26 08:55:05 +08:00
triton_utils         Update kernels.py                                      2023-05-25 23:15:33 +09:00
__init__.py          refactor file structure                                2023-04-25 18:58:20 +08:00
_fused_base.py       fix ImportError when triton is not installed           2023-05-20 16:15:20 +08:00
fused_gptj_attn.py   add GPTJ fused attention module                        2023-05-14 16:17:21 +08:00
fused_llama_attn.py  compatible with older pytorch version                  2023-05-14 16:17:03 +08:00
fused_llama_mlp.py   fix not return directly when triton is not installed   2023-05-20 16:21:52 +08:00