AutoGPTQ/auto_gptq
2023-05-15 00:00:05 -03:00
..
eval_tasks make data_utils.py as global utils 2023-04-28 18:08:58 +08:00
modeling Look for .pt files 2023-05-15 00:00:05 -03:00
nn_modules add GPTJ fused attention module 2023-05-14 16:17:21 +08:00
quantization Fix 'groupsize' -> 'group_size' in all other .py files. I haven't touched any CUDA kernels in case there's any complexity there I don't understand 2023-05-05 14:44:16 +01:00
utils add library version comparison help functions 2023-05-14 16:16:06 +08:00
__init__.py remove LlamaGPTQForCausalLM 2023-05-02 22:18:17 +09:00