AutoGPTQ/auto_gptq
2023-05-24 11:32:45 +08:00
..
eval_tasks make data_utils.py as global utils 2023-04-28 18:08:58 +08:00
modeling make_sure_not_tensor_in_meta_device before load checkpoint 2023-05-24 11:32:45 +08:00
nn_modules remove duplicate code 2023-05-23 23:48:15 +08:00
quantization Fix 'groupsize' -> 'group_size' in all other .py files. I haven't touched any CUDA kernels in case there's any complexity there I don't understand 2023-05-05 14:44:16 +01:00
utils fix ImportError when triton is not installed 2023-05-20 16:15:20 +08:00
__init__.py remove LlamaGPTQForCausalLM 2023-05-02 22:18:17 +09:00