AutoGPTQ/auto_gptq/nn_modules
qwopqwop200 a88cd16d65
fix bug
2023-05-03 22:36:14 +09:00
..
triton_utils add triton support 2023-04-25 20:05:22 +08:00
__init__.py refactor file structure 2023-04-25 18:58:20 +08:00
fused_attn.py fix bug 2023-05-02 19:13:40 +09:00
fused_mlp_triton.py add fused mlp ,fused attn 2023-05-02 18:55:44 +09:00
qlinear.py fix bug 2023-05-03 22:36:14 +09:00
qlinear_old.py fix bug 2023-05-01 18:09:39 +09:00
qlinear_triton.py Multi-GPU, allocate output tensor 2023-05-02 17:51:41 +09:00