simcop2387/AutoGPTQ
AutoGPTQ/auto_gptq/nn_modules/qlinear (ref 8c7c806d36)
Latest commit: 71f23268eb by fxmarty, 2023-08-05 00:15:22 +09:00
Merge pull request #1 from qwopqwop200/exllama-q4-kernel
Exllama q4 kernel
File                   Last commit message                      Date
__init__.py            regist buffer of general quant linear    2023-08-03 05:15:09 +00:00
qlinear_cuda.py        fix fused attn                           2023-07-31 13:46:32 +00:00
qlinear_cuda_old.py    fix fused attn                           2023-07-31 13:46:32 +00:00
qlinear_exllama.py     change pcak func support only 4 bit      2023-08-01 20:01:45 +09:00
qlinear_triton.py      fix fused attn                           2023-07-31 13:46:32 +00:00