AutoGPTQ/auto_gptq/nn_modules
2023-05-25 23:15:33 +09:00
..
triton_utils Update kernels.py 2023-05-25 23:15:33 +09:00
__init__.py refactor file structure 2023-04-25 18:58:20 +08:00
_fused_base.py fix ImportError when triton is not installed 2023-05-20 16:15:20 +08:00
fused_gptj_attn.py add GPTJ fused attention module 2023-05-14 16:17:21 +08:00
fused_llama_attn.py compatible with older pytorch version 2023-05-14 16:17:03 +08:00
fused_llama_mlp.py fix not return directly when triton is not installed 2023-05-20 16:21:52 +08:00
qlinear.py half out 2023-05-23 16:08:28 +08:00
qlinear_old.py remove duplicate code 2023-05-23 23:48:15 +08:00
qlinear_triton.py half out 2023-05-23 16:08:28 +08:00