56 commits

Author SHA1 Message Date
Automation Pipeline
9fb99f61e7 Merge remote-tracking branches 'laaza/Mistral' and 'laaza/MPT' 2023-10-22 07:53:59 -04:00
LaaZa
4b7389ddb7 Merge branch 'main' into MPT
# Conflicts:
#	auto_gptq/modeling/__init__.py
#	auto_gptq/modeling/_const.py
#	auto_gptq/modeling/auto.py
2023-10-04 20:21:49 +03:00
LaaZa
99acbead42 Add support for Mistral models. 2023-10-04 01:07:55 +03:00
Marc Sun
c912bf361a exllamav2 integration 2023-09-25 16:51:18 +00:00
TheBloke
02a87dce76 Add support for Falcon as part of Transformers 4.33.0, including new Falcon 180B 2023-09-06 18:03:33 +01:00
PanQiWei
172deae049 expose disable_exllama argument 2023-08-09 12:03:31 +08:00
qwopqwop200
fe244503e0 add "," 2023-08-08 19:57:23 +09:00
qwopqwop200
d22f89c524 support qwen 2023-08-08 19:27:43 +09:00
LaaZa
6ff6bc8dfc Merge branch 'main' into MPT
# Conflicts:
#	auto_gptq/modeling/__init__.py
#	auto_gptq/modeling/_const.py
#	auto_gptq/modeling/auto.py
2023-07-26 20:41:19 +03:00
PanQiWei
ff1f100ded remove argument 'save_dir' in method from_quantized 2023-07-26 17:58:04 +08:00
TheBloke
c9124e3fc7 Fix revision and other huggingface_hub args for .from_quantized(), which were not being passed through 2023-07-25 12:48:33 +00:00
tc
e28e8ee809 Add support for InternLM 2023-07-07 09:25:40 -07:00
LaaZa
03577a7698 Rename the class to match reference capitalisation 2023-06-18 21:01:07 +03:00
LaaZa
9fd558f2ba Add support for Baichuan 2023-06-18 20:13:29 +03:00
LaaZa
bf47892b81 Merge branch 'main' into MPT
# Conflicts:
#	auto_gptq/modeling/__init__.py
#	auto_gptq/modeling/_const.py
#	auto_gptq/modeling/auto.py
2023-06-02 15:01:10 +03:00
潘其威(William)
b4fdd8d264 Merge branch 'main' into peft_integration 2023-06-02 19:11:59 +08:00
潘其威(William)
defc96ff04 Merge pull request #91 from TheBloke/TheBloke_support-HF-download
Add support for HF Hub download, and `push_to_hub`
2023-05-30 07:37:15 +08:00
潘其威(William)
2245fad095 Update auto.py
fix None type error
2023-05-30 07:35:15 +08:00
潘其威(William)
61a4ea035f Update auto.py
add back save_dir for backward compatible
2023-05-30 06:43:00 +08:00
PanQiWei
86f060c74b Merge branch 'main' into peft_integration 2023-05-28 16:23:38 +08:00
PanQiWei
491da62402 fix signature at import time 2023-05-27 17:49:58 +08:00
潘其威(William)
23998345f5 Merge branch 'main' into falcon 2023-05-27 16:23:16 +08:00
Bill Cai
0729760234 Update auto.py 2023-05-27 11:16:43 +08:00
潘其威(William)
269ef7335c Merge branch 'main' into falcon 2023-05-27 08:15:52 +08:00
潘其威(William)
3c3b0e1e79 Merge branch 'main' into GPTBigCode 2023-05-27 08:03:03 +08:00
潘其威(William)
eab728b263 Merge branch 'main' into Codegen 2023-05-27 08:00:19 +08:00
潘其威(William)
f6fd314d5a Merge branch 'main' into GPTBigCode 2023-05-27 07:57:25 +08:00
qwopqwop200
277809381b fix bug 2023-05-27 08:53:47 +09:00
qwopqwop200
bcb345fb35 support falcon 2023-05-27 07:53:39 +09:00
PanQiWei
2b532f9453 add trainable mode 2023-05-26 13:11:30 +08:00
Tom Jobbins
492255b400 Merge branch 'main' into TheBloke_support-HF-download 2023-05-25 07:02:13 +01:00
PanQiWei
10347fdd7b remove full_cpu_offload argument and unify model dispatch strategy 2023-05-24 17:41:04 +08:00
PanQiWei
6476ee4235 add options: 'low_cpu_mem_usage' and 'full_cpu_offload' 2023-05-23 22:51:00 +08:00
TheBloke
908b338436 Initial support for model loading from HF hub 2023-05-19 15:57:05 +01:00
PanQiWei
759d6953d4 support user customize device_map 2023-05-15 13:26:38 +08:00
潘其威(William)
bdb08c16fc Merge branch 'main' into Codegen 2023-05-14 13:10:52 +08:00
潘其威(William)
e24c5122db Merge branch 'main' into GPTBigCode 2023-05-14 13:10:10 +08:00
LaaZa
fb380fb9c2 Add initial support for MPT 2023-05-12 14:46:52 +03:00
PanQiWei
f159aeabb6 refactor .from_quantized api and improve model loading strategy 2023-05-12 18:09:50 +08:00
LaaZa
b8187ff05a Add support for CodeGen/2 2023-05-08 17:34:00 +03:00
LaaZa
63247a0669 Add support for GPTBigCode 2023-05-08 12:28:29 +03:00
qwopqwop200
d49281bc5d support faster and model load strict 2023-05-04 09:07:34 +09:00
qwopqwop200
24251d1397 check kwargs 2023-05-02 22:32:54 +09:00
qwopqwop200
ccd87e5800 add Auto model parameter 2023-05-02 22:15:56 +09:00
ZXED
24a371d14a use the same Optional style as in other params 2023-04-29 09:52:11 +03:00
ZXED
c22770188d allow user to set trust_remote_code flag manually 2023-04-29 09:52:11 +03:00
ZXED
b3f19a7ba7 support custom model name when loading the model 2023-04-29 09:52:11 +03:00
ZXED
ea8ab73343 support custom quantize_config when loading the model 2023-04-29 09:51:50 +03:00
qwopqwop200
ac41f68532 add gpt2 2023-04-28 09:14:05 +09:00
PanQiWei
51d2e53130 add support to cpu offloading and multi gpus inference on quantized model 2023-04-28 00:53:57 +08:00