
56 commits

Author  SHA1  Message  Date
PanQiWei  498de923f2  support multi gpus quantization  2023-04-27 18:48:43 +08:00
PanQiWei  c9bb427546  align 'from_pretrained' api  2023-04-27 02:29:32 +08:00
PanQiWei  f2359f56cb  add support to use push_to_hub to upload and share quantized model  2023-04-26 16:55:01 +08:00
PanQiWei  832dc4a7a1  refactor file structure  2023-04-25 18:58:20 +08:00
PanQiWei  a259fb06bb  add support to MOSS model  2023-04-25 11:54:29 +08:00
PanQiWei  7ba0edffe0  refactor file structure of modeling module  2023-04-23 17:33:09 +08:00
Renamed from auto_gptq/modeling_auto.py