AutoGPTQ

History

PanQiWei fe5f5d12ed Merge branch 'main' into peft_integration		2023-05-26 09:48:06 +08:00
..
qlinear	support faster vecquant4matmul cuda kernel	2023-05-26 08:55:05 +08:00
triton_utils	Update kernels.py	2023-05-25 23:15:33 +09:00
__init__.py	refactor file structure	2023-04-25 18:58:20 +08:00
_fused_base.py	fix ImportError when triton is not installed	2023-05-20 16:15:20 +08:00
fused_gptj_attn.py	add GPTJ fused attention module	2023-05-14 16:17:21 +08:00
fused_llama_attn.py	compatible with older pytorch version	2023-05-14 16:17:03 +08:00
fused_llama_mlp.py	fix not return directly when triton is not installed	2023-05-20 16:21:52 +08:00