PanQiWei
bc61e51394
update README
2023-06-01 10:35:17 +08:00
PanQiWei
7ae89f282a
update build_wheels.yml
2023-06-01 01:48:29 +08:00
PanQiWei
31b8c1313e
update build_wheels.yml
2023-06-01 01:34:46 +08:00
PanQiWei
d53a30d351
update build_wheels.yml
2023-06-01 01:16:10 +08:00
PanQiWei
ac7dd9bc1f
update build_wheels.yml
2023-06-01 01:03:35 +08:00
潘其威(William)
a63f8fd523
Merge pull request #120 from PanQiWei/add_build_wheels_workflow
...
Add build wheels workflow
2023-06-01 00:43:38 +08:00
PanQiWei
d780ef5eef
update build_wheels.yml
2023-06-01 00:42:31 +08:00
PanQiWei
407e5d8133
add workflow to build wheels
2023-06-01 00:39:09 +08:00
PanQiWei
0ece40ca25
update setup.py
2023-06-01 00:38:35 +08:00
PanQiWei
402973259f
update setup.py
2023-06-01 00:18:43 +08:00
PanQiWei
ec6603d0ab
support older version python
2023-05-31 22:11:16 +08:00
潘其威(William)
93698e027d
Merge pull request #116 from PanQiWei/pytorch-qlinear
...
switch to use pytorch backend when triton is not available at train mode
2023-05-31 00:10:55 +08:00
qwopqwop200
b1a8cc28e8
remove raise
2023-05-31 00:03:51 +09:00
qwopqwop200
c381958a5f
add warning
2023-05-30 23:53:33 +09:00
qwopqwop200
0f2841cb13
remove log
2023-05-30 23:51:55 +09:00
qwopqwop200
33809a8e59
remove log
2023-05-30 23:51:39 +09:00
qwopqwop200
dfd9dc0e6b
change if trainable backend pytorch
2023-05-30 23:43:55 +09:00
qwopqwop200
5274313067
change if trainable backend pytorch
2023-05-30 23:40:58 +09:00
PanQiWei
d0769c1a39
update README
2023-05-30 08:01:16 +08:00
PanQiWei
e826d89dbc
update basic_usage.py
2023-05-30 07:47:10 +08:00
PanQiWei
df8672ce75
update README
2023-05-30 07:44:25 +08:00
PanQiWei
448a53e6a7
delete push to hub example script
2023-05-30 07:39:58 +08:00
潘其威(William)
defc96ff04
Merge pull request #91 from TheBloke/TheBloke_support-HF-download
...
Add support for HF Hub download, and `push_to_hub`
2023-05-30 07:37:15 +08:00
潘其威(William)
2245fad095
Update auto.py
...
fix None type error
2023-05-30 07:35:15 +08:00
潘其威(William)
15db2cdc44
Update _base.py
...
fix problem that recursively adding file extension to model_base_name
2023-05-30 07:26:42 +08:00
潘其威(William)
cfa7271617
Update _base.py
...
fix variable not found error
2023-05-30 07:22:10 +08:00
潘其威(William)
e5771fb206
Update _base.py
...
fix key mismatch
2023-05-30 06:44:45 +08:00
潘其威(William)
61a4ea035f
Update auto.py
...
add back save_dir for backward compatible
2023-05-30 06:43:00 +08:00
潘其威(William)
ea74e15199
Update _base.py
...
add model_name_or_path and model_file_base_name to BaseQuantizeConfig for better model file management; add back save_dir to .from_quantized() for backward compatible
2023-05-30 06:40:31 +08:00
潘其威(William)
0021417050
Update README.md
2023-05-30 05:56:46 +08:00
潘其威(William)
243bb2d56e
Update README_zh.md
...
fix typo
2023-05-30 05:56:04 +08:00
潘其威(William)
9a10b8496a
Update README_zh.md
...
merge the example code of downloading from and uploading to HF Hub into simplest usage code above to keep README compact.
2023-05-30 05:53:27 +08:00
潘其威(William)
17db71491f
Update README.md
...
merge the example code of downloading from and uploading to HF Hub into simplest usage code above to keep README compact.
2023-05-30 05:49:29 +08:00
PanQiWei
9dd7784e6a
update lr
2023-05-29 21:27:35 +08:00
PanQiWei
539682e951
update lr
2023-05-28 22:54:53 +08:00
PanQiWei
788128f0a6
rename example scripts
2023-05-28 22:49:01 +08:00
PanQiWei
4296ced96d
add example script for AdaptionPrompt peft type
2023-05-28 22:44:32 +08:00
PanQiWei
6c64b0b361
raise NotImplementedError when model with fused attention injected try to use ADAPTION_PROMPT peft type
2023-05-28 22:35:34 +08:00
PanQiWei
7af93624d7
remove useless code
2023-05-28 22:30:38 +08:00
PanQiWei
a1f5204bfd
fix ppl calculation error
2023-05-28 22:20:49 +08:00
PanQiWei
def084bf0e
reset value of AdaptionPromptConfig.adapter_layers to number of model's hidden layers when exceeds
2023-05-28 22:11:02 +08:00
PanQiWei
7bb01ae1cd
remove useless prints
2023-05-28 22:01:35 +08:00
PanQiWei
956029fcb2
update README.md
2023-05-28 21:42:59 +08:00
PanQiWei
271ae2926b
add training example scripts for Lora and AdaLora
2023-05-28 21:35:53 +08:00
PanQiWei
801b1c13ca
update example script
2023-05-28 21:30:53 +08:00
PanQiWei
ad10c13d40
support AdaLora
2023-05-28 21:30:45 +08:00
PanQiWei
3ee2daa73c
make GPTQLoraModel to inherit from LoraModel to simplify code
2023-05-28 17:36:18 +08:00
PanQiWei
22d1d8dcaa
add 'auto_find_all_linears' argument to get_gptq_peft_model function
2023-05-28 17:04:38 +08:00
PanQiWei
83132a663a
add warning to guide users interact with lora properly
2023-05-28 16:57:31 +08:00
PanQiWei
86f060c74b
Merge branch 'main' into peft_integration
2023-05-28 16:23:38 +08:00