AutoGPTQ/auto_gptq/nn_modules
2023-05-05 14:44:16 +01:00
Name | Last commit message | Last updated
triton_utils | add triton support | 2023-04-25 20:05:22 +08:00
__init__.py | refactor file structure | 2023-04-25 18:58:20 +08:00
fused_attn.py | Fix 'groupsize' -> 'group_size' in all other .py files. I haven't touched any CUDA kernels in case there's any complexity there I don't understand | 2023-05-05 14:44:16 +01:00
fused_mlp_triton.py | add fused mlp ,fused attn | 2023-05-02 18:55:44 +09:00
qlinear.py | Fix 'groupsize' -> 'group_size' in all other .py files. I haven't touched any CUDA kernels in case there's any complexity there I don't understand | 2023-05-05 14:44:16 +01:00
qlinear_old.py | Fix 'groupsize' -> 'group_size' in all other .py files. I haven't touched any CUDA kernels in case there's any complexity there I don't understand | 2023-05-05 14:44:16 +01:00
qlinear_triton.py | Fix 'groupsize' -> 'group_size' in all other .py files. I haven't touched any CUDA kernels in case there's any complexity there I don't understand | 2023-05-05 14:44:16 +01:00