This website requires JavaScript.
Explore
Help
Sign in
simcop2387
/
AutoGPTQ
Watch
1
Star
0
Fork
You've already forked AutoGPTQ
0
Code
Issues
Pull requests
Projects
Releases
Packages
1
Wiki
Activity
8c7c806d36
AutoGPTQ
/
auto_gptq
/
nn_modules
History
qwopqwop200
8c7c806d36
if exllama auto diable fused attention
2023-08-07 19:24:16 +09:00
..
qlinear
Merge pull request
#1
from qwopqwop200/exllama-q4-kernel
2023-08-05 00:15:22 +09:00
triton_utils
support 32dim triton kernel
2023-06-02 19:04:12 +09:00
__init__.py
refactor file structure
2023-04-25 18:58:20 +08:00
_fused_base.py
add trainable mode
2023-05-26 13:11:30 +08:00
fused_gptj_attn.py
if exllama auto diable fused attention
2023-08-07 19:24:16 +09:00
fused_llama_attn.py
if exllama auto diable fused attention
2023-08-07 19:24:16 +09:00
fused_llama_mlp.py
update FusedLlamaMLPForQuantizedModel for general usage purpose
2023-05-27 07:47:20 +08:00