AutoGPTQ

161 commits 7 branches 13 tags 8.1 MiB

Author	SHA1	Message	Date
ZXED	b3f19a7ba7	support custom model name when loading the model	2023-04-29 09:52:11 +03:00
ZXED	ea8ab73343	support custom quantize_config when loading the model	2023-04-29 09:51:50 +03:00
qwopqwop200	ac41f68532	add gpt2	2023-04-28 09:14:05 +09:00
PanQiWei	51d2e53130	add support to cpu offloading and multi gpus inference on quantized model	2023-04-28 00:53:57 +08:00
PanQiWei	498de923f2	support multi gpus quantization	2023-04-27 18:48:43 +08:00
PanQiWei	c9bb427546	align 'from_pretrained' api	2023-04-27 02:29:32 +08:00
PanQiWei	f2359f56cb	add support to use push_to_hub to upload and share quantized model	2023-04-26 16:55:01 +08:00
PanQiWei	832dc4a7a1	refactor file structure	2023-04-25 18:58:20 +08:00
PanQiWei	a259fb06bb	add support to MOSS model	2023-04-25 11:54:29 +08:00
PanQiWei	7ba0edffe0	refactor file structure of modeling module	2023-04-23 17:33:09 +08:00

Renamed from auto_gptq/modeling_auto.py (Browse further)