AutoGPTQ

210 commits 7 branches 13 tags 8.1 MiB

Author	SHA1	Message	Date
LaaZa	b8187ff05a	Add support for CodeGen/2	2023-05-08 17:34:00 +03:00
ZXED	24a371d14a	use the same Optional style as in other params	2023-04-29 09:52:11 +03:00
ZXED	c22770188d	allow user to set trust_remote_code flag manually	2023-04-29 09:52:11 +03:00
ZXED	b3f19a7ba7	support custom model name when loading the model	2023-04-29 09:52:11 +03:00
ZXED	ea8ab73343	support custom quantize_config when loading the model	2023-04-29 09:51:50 +03:00
qwopqwop200	ac41f68532	add gpt2	2023-04-28 09:14:05 +09:00
PanQiWei	51d2e53130	add support to cpu offloading and multi gpus inference on quantized model	2023-04-28 00:53:57 +08:00
PanQiWei	498de923f2	support multi gpus quantization	2023-04-27 18:48:43 +08:00
PanQiWei	c9bb427546	align 'from_pretrained' api	2023-04-27 02:29:32 +08:00
PanQiWei	f2359f56cb	add support to use push_to_hub to upload and share quantized model	2023-04-26 16:55:01 +08:00
PanQiWei	832dc4a7a1	refactor file structure	2023-04-25 18:58:20 +08:00
PanQiWei	a259fb06bb	add support to MOSS model	2023-04-25 11:54:29 +08:00
PanQiWei	7ba0edffe0	refactor file structure of modeling module	2023-04-23 17:33:09 +08:00

Renamed from auto_gptq/modeling_auto.py (Browse further)