AutoGPTQ/auto_gptq/nn_modules
2023-05-02 21:53:13 +09:00
triton_utils         add triton support                 2023-04-25 20:05:22 +08:00
__init__.py          refactor file structure            2023-04-25 18:58:20 +08:00
fused_attn.py        fix bug                            2023-05-02 19:13:40 +09:00
fused_mlp_triton.py  add fused mlp ,fused attn          2023-05-02 18:55:44 +09:00
qlinear.py           support fused_attn                 2023-05-02 21:53:13 +09:00
qlinear_old.py       fix bug                            2023-05-01 18:09:39 +09:00
qlinear_triton.py    Multi-GPU, allocate output tensor  2023-05-02 17:51:41 +09:00