Commit graph

34 commits

Author SHA1 Message Date
潘其威(William)
04db761eed
Merge pull request #347 from alex4321/peft-model-use-adapter-name
Use `adapter_name` for `get_gptq_peft_model` with `train_mode=True`
2023-09-26 13:55:06 +08:00
Marc Sun
c912bf361a exllamav2 integration 2023-09-25 16:51:18 +00:00
Alexander Pozharskii
0185095402 Use adapter_name for get_gptq_peft_model with train_mode=True 2023-09-24 17:11:19 +04:00
qwopqwop200
45a1ee4d84
install check qigen 2023-08-31 14:37:39 +09:00
qwopqwop200
6a9d80eddc Merge remote-tracking branch 'qwopqwop200/main' into main 2023-08-25 18:06:03 +09:00
Felix Marty
04730ac66c expose api to set exllama max length 2023-08-24 11:22:15 +00:00
qwopqwop200
084c9d8860
name change 2023-08-17 15:17:09 +09:00
qwopqwop200
7ba78af3ae support cpu 2023-08-10 22:48:04 +09:00
PanQiWei
44c7a1a184 make exllama_kernels compilation as optional 2023-08-09 17:42:22 +08:00
Félix Marty
4fb3e20c5e Merge branch 'main' into exllama-q4-kernel 2023-08-04 15:13:34 +02:00
Felix Marty
38447262c0 fix fused attn 2023-07-31 13:46:32 +00:00
Felix Marty
179776bd1d exllama kernel 2023-07-31 11:50:45 +00:00
Felix Marty
23eb519e68 typo 2023-07-28 17:45:34 +00:00
Felix Marty
caf6625b68 warning about triton 2023-07-28 17:42:37 +00:00
PanQiWei
722a621aaa simplified code 2023-07-26 17:53:47 +08:00
Casper
992a0ab102 Reference Perplexity class 2023-06-19 20:03:32 +02:00
Casper
b351c8c547 Add perplexity calculation class 2023-06-19 20:03:22 +02:00
qwopqwop200
b1a8cc28e8
remove raise 2023-05-31 00:03:51 +09:00
PanQiWei
6c64b0b361 raise NotImplementedError when model with fused attention injected try to use ADAPTION_PROMPT peft type 2023-05-28 22:35:34 +08:00
PanQiWei
def084bf0e reset value of AdaptionPromptConfig.adapter_layers to number of model's hidden layers when exceeds 2023-05-28 22:11:02 +08:00
PanQiWei
ad10c13d40 support AdaLora 2023-05-28 21:30:45 +08:00
PanQiWei
3ee2daa73c make GPTQLoraModel to inherit from LoraModel to simplify code 2023-05-28 17:36:18 +08:00
PanQiWei
22d1d8dcaa add 'auto_find_all_linears' argument to get_gptq_peft_model function 2023-05-28 17:04:38 +08:00
PanQiWei
83132a663a add warning to guide users interact with lora properly 2023-05-28 16:57:31 +08:00
PanQiWei
5bc5325920 add find_all_linear_names help function, make customized lora module more general 2023-05-27 07:49:17 +08:00
PanQiWei
8bf21a7e4c set xavier_uniform_ as lora_A's init function 2023-05-26 14:06:53 +08:00
PanQiWei
2b532f9453 add trainable mode 2023-05-26 13:11:30 +08:00
PanQiWei
cfd27e8caa refactor file structure of qlinears 2023-05-26 07:18:16 +08:00
PanQiWei
f6a34137e9 lora compatibility 2023-05-25 19:44:53 +08:00
PanQiWei
d293bf3a04 first upload peft_utils.py 2023-05-25 15:11:11 +08:00
PanQiWei
86b3b52c63 fix ImportError when triton is not installed 2023-05-20 16:15:20 +08:00
PanQiWei
5445c67190 add library version comparison help functions 2023-05-14 16:16:06 +08:00
PanQiWei
d718d63e9c add import_utils.py for commonly used module importation 2023-05-12 19:58:48 +08:00
PanQiWei
41564a48db make data_utils.py as global utils 2023-04-28 18:08:58 +08:00