AutoGPTQ/auto_gptq/quantization
2023-05-05 14:44:16 +01:00
..
__init__.py refactor file structure 2023-04-25 18:58:20 +08:00
ACKNOWLEDGEMENT.md first init 2023-04-14 01:09:40 +08:00
gptq.py Fix 'groupsize' -> 'group_size' in all other .py files. I haven't touched any CUDA kernels in case there's any complexity there I don't understand 2023-05-05 14:44:16 +01:00
quantizer.py add triton support 2023-04-25 20:05:22 +08:00