Commit graph

558 commits

Author SHA1 Message Date
PanQiWei
ec6603d0ab support older version python 2023-05-31 22:11:16 +08:00
潘其威(William)
93698e027d
Merge pull request #116 from PanQiWei/pytorch-qlinear
switch to use pytorch backend when triton is not available at train mode
2023-05-31 00:10:55 +08:00
qwopqwop200
b1a8cc28e8
remove raise 2023-05-31 00:03:51 +09:00
qwopqwop200
c381958a5f
add warning 2023-05-30 23:53:33 +09:00
qwopqwop200
0f2841cb13
remove log 2023-05-30 23:51:55 +09:00
qwopqwop200
33809a8e59
remove log 2023-05-30 23:51:39 +09:00
qwopqwop200
dfd9dc0e6b
change if trainable backend pytorch 2023-05-30 23:43:55 +09:00
qwopqwop200
5274313067
change if trainable backend pytorch 2023-05-30 23:40:58 +09:00
PanQiWei
d0769c1a39 update README 2023-05-30 08:01:16 +08:00
PanQiWei
e826d89dbc update basic_usage.py 2023-05-30 07:47:10 +08:00
PanQiWei
df8672ce75 update README 2023-05-30 07:44:25 +08:00
PanQiWei
448a53e6a7 delete push to hub example script 2023-05-30 07:39:58 +08:00
潘其威(William)
defc96ff04
Merge pull request #91 from TheBloke/TheBloke_support-HF-download
Add support for HF Hub download, and `push_to_hub`
2023-05-30 07:37:15 +08:00
潘其威(William)
2245fad095
Update auto.py
fix None type error
2023-05-30 07:35:15 +08:00
潘其威(William)
15db2cdc44
Update _base.py
fix problem that recursively adding file extension to model_base_name
2023-05-30 07:26:42 +08:00
潘其威(William)
cfa7271617
Update _base.py
fix variable not found error
2023-05-30 07:22:10 +08:00
潘其威(William)
e5771fb206
Update _base.py
fix key mismatch
2023-05-30 06:44:45 +08:00
潘其威(William)
61a4ea035f
Update auto.py
add back save_dir for backward compatible
2023-05-30 06:43:00 +08:00
潘其威(William)
ea74e15199
Update _base.py
add model_name_or_path and model_file_base_name to BaseQuantizeConfig for better model file management; add back save_dir to .from_quantized() for backward compatible
2023-05-30 06:40:31 +08:00
潘其威(William)
0021417050
Update README.md 2023-05-30 05:56:46 +08:00
潘其威(William)
243bb2d56e
Update README_zh.md
fix typo
2023-05-30 05:56:04 +08:00
潘其威(William)
9a10b8496a
Update README_zh.md
merge the example code of downloading from and uploading to HF Hub into simplest usage code above to keep README compact.
2023-05-30 05:53:27 +08:00
潘其威(William)
17db71491f
Update README.md
merge the example code of downloading from and uploading to HF Hub into simplest usage code above to keep README compact.
2023-05-30 05:49:29 +08:00
PanQiWei
9dd7784e6a update lr 2023-05-29 21:27:35 +08:00
PanQiWei
539682e951 update lr 2023-05-28 22:54:53 +08:00
PanQiWei
788128f0a6 rename example scripts 2023-05-28 22:49:01 +08:00
PanQiWei
4296ced96d add example script for AdaptionPrompt peft type 2023-05-28 22:44:32 +08:00
PanQiWei
6c64b0b361 raise NotImplementedError when model with fused attention injected try to use ADAPTION_PROMPT peft type 2023-05-28 22:35:34 +08:00
PanQiWei
7af93624d7 remove useless code 2023-05-28 22:30:38 +08:00
PanQiWei
a1f5204bfd fix ppl calculation error 2023-05-28 22:20:49 +08:00
PanQiWei
def084bf0e reset value of AdaptionPromptConfig.adapter_layers to number of model's hidden layers when exceeds 2023-05-28 22:11:02 +08:00
PanQiWei
7bb01ae1cd remove useless prints 2023-05-28 22:01:35 +08:00
PanQiWei
956029fcb2 update README.md 2023-05-28 21:42:59 +08:00
PanQiWei
271ae2926b add training example scripts for Lora and AdaLora 2023-05-28 21:35:53 +08:00
PanQiWei
801b1c13ca update example script 2023-05-28 21:30:53 +08:00
PanQiWei
ad10c13d40 support AdaLora 2023-05-28 21:30:45 +08:00
PanQiWei
3ee2daa73c make GPTQLoraModel to inherit from LoraModel to simplify code 2023-05-28 17:36:18 +08:00
PanQiWei
22d1d8dcaa add 'auto_find_all_linears' argument to get_gptq_peft_model function 2023-05-28 17:04:38 +08:00
PanQiWei
83132a663a add warning to guide users interact with lora properly 2023-05-28 16:57:31 +08:00
PanQiWei
86f060c74b Merge branch 'main' into peft_integration 2023-05-28 16:23:38 +08:00
PanQiWei
4b8524ffb1 update README 2023-05-27 23:04:00 +08:00
PanQiWei
f703c9ac98 update generation_speed.py 2023-05-27 20:15:43 +08:00
PanQiWei
1ff000658e update README.md 2023-05-27 19:26:56 +08:00
PanQiWei
e8bd3c33c4 add generation speed benchmark example script 2023-05-27 19:16:42 +08:00
PanQiWei
491da62402 fix signature at import time 2023-05-27 17:49:58 +08:00
PanQiWei
0327ac8f42 update README 2023-05-27 17:42:16 +08:00
PanQiWei
3cb1bf5a6d add trust_remote_code command line flag 2023-05-27 17:09:10 +08:00
PanQiWei
c040617a94 update README 2023-05-27 17:03:50 +08:00
PanQiWei
ceacd59e4b update NEWS_OR_UPDATE 2023-05-27 16:37:51 +08:00
潘其威(William)
0a40581270
Merge pull request #111 from PanQiWei/falcon
Falcon support
2023-05-27 16:23:30 +08:00