潘其威(William)
|
2245fad095
|
Update auto.py
fix None type error
|
2023-05-30 07:35:15 +08:00 |
|
潘其威(William)
|
61a4ea035f
|
Update auto.py
add back save_dir for backward compatible
|
2023-05-30 06:43:00 +08:00 |
|
Tom Jobbins
|
492255b400
|
Merge branch 'main' into TheBloke_support-HF-download
|
2023-05-25 07:02:13 +01:00 |
|
PanQiWei
|
10347fdd7b
|
remove full_cpu_offload argument and unify model dispatch strategy
|
2023-05-24 17:41:04 +08:00 |
|
PanQiWei
|
6476ee4235
|
add options: 'low_cpu_mem_usage' and 'full_cpu_offload'
|
2023-05-23 22:51:00 +08:00 |
|
TheBloke
|
908b338436
|
Initial support for model loading from HF hub
|
2023-05-19 15:57:05 +01:00 |
|
PanQiWei
|
759d6953d4
|
support user customize device_map
|
2023-05-15 13:26:38 +08:00 |
|
PanQiWei
|
f159aeabb6
|
refactor .from_quantized api and improve model loading strategy
|
2023-05-12 18:09:50 +08:00 |
|
qwopqwop200
|
d49281bc5d
|
support faster and model load strict
|
2023-05-04 09:07:34 +09:00 |
|
qwopqwop200
|
24251d1397
|
check kwargs
|
2023-05-02 22:32:54 +09:00 |
|
qwopqwop200
|
ccd87e5800
|
add Auto model parameter
|
2023-05-02 22:15:56 +09:00 |
|
ZXED
|
24a371d14a
|
use the same Optional style as in other params
|
2023-04-29 09:52:11 +03:00 |
|
ZXED
|
c22770188d
|
allow user to set trust_remote_code flag manually
|
2023-04-29 09:52:11 +03:00 |
|
ZXED
|
b3f19a7ba7
|
support custom model name when loading the model
|
2023-04-29 09:52:11 +03:00 |
|
ZXED
|
ea8ab73343
|
support custom quantize_config when loading the model
|
2023-04-29 09:51:50 +03:00 |
|
qwopqwop200
|
ac41f68532
|
add gpt2
|
2023-04-28 09:14:05 +09:00 |
|
PanQiWei
|
51d2e53130
|
add support to cpu offloading and multi gpus inference on quantized model
|
2023-04-28 00:53:57 +08:00 |
|
PanQiWei
|
498de923f2
|
support multi gpus quantization
|
2023-04-27 18:48:43 +08:00 |
|
PanQiWei
|
c9bb427546
|
align 'from_pretrained' api
|
2023-04-27 02:29:32 +08:00 |
|
PanQiWei
|
f2359f56cb
|
add support to use push_to_hub to upload and share quantized model
|
2023-04-26 16:55:01 +08:00 |
|
PanQiWei
|
832dc4a7a1
|
refactor file structure
|
2023-04-25 18:58:20 +08:00 |
|
PanQiWei
|
a259fb06bb
|
add support to MOSS model
|
2023-04-25 11:54:29 +08:00 |
|
PanQiWei
|
7ba0edffe0
|
refactor file structure of modeling module
|
2023-04-23 17:33:09 +08:00 |
|