AutoGPTQ/auto_gptq/nn_modules/qlinear
2023-08-11 14:52:26 +08:00
..
__init__.py extend to support qlinear_exllama's fusion 2023-08-11 14:52:26 +08:00
qlinear_cuda.py Merge branch 'main' into xformers_integration 2023-08-10 15:27:11 +08:00
qlinear_cuda_old.py Merge branch 'main' into xformers_integration 2023-08-10 15:27:11 +08:00
qlinear_exllama.py patch for transformers compatiblity 2023-08-09 14:23:59 +00:00
qlinear_triton.py fix fused attn 2023-07-31 13:46:32 +00:00