潘其威(William)
|
04db761eed
|
Merge pull request #347 from alex4321/peft-model-use-adapter-name
Use `adapter_name` for `get_gptq_peft_model` with `train_mode=True`
|
2023-09-26 13:55:06 +08:00 |
|
Marc Sun
|
c912bf361a
|
exllamav2 integration
|
2023-09-25 16:51:18 +00:00 |
|
Alexander Pozharskii
|
0185095402
|
Use adapter_name for get_gptq_peft_model with train_mode=True
|
2023-09-24 17:11:19 +04:00 |
|
qwopqwop200
|
45a1ee4d84
|
install check qigen
|
2023-08-31 14:37:39 +09:00 |
|
qwopqwop200
|
6a9d80eddc
|
Merge remote-tracking branch 'qwopqwop200/main' into main
|
2023-08-25 18:06:03 +09:00 |
|
Felix Marty
|
04730ac66c
|
expose api to set exllama max length
|
2023-08-24 11:22:15 +00:00 |
|
qwopqwop200
|
084c9d8860
|
name change
|
2023-08-17 15:17:09 +09:00 |
|
qwopqwop200
|
7ba78af3ae
|
support cpu
|
2023-08-10 22:48:04 +09:00 |
|
PanQiWei
|
44c7a1a184
|
make exllama_kernels compilation as optional
|
2023-08-09 17:42:22 +08:00 |
|
Félix Marty
|
4fb3e20c5e
|
Merge branch 'main' into exllama-q4-kernel
|
2023-08-04 15:13:34 +02:00 |
|
Felix Marty
|
38447262c0
|
fix fused attn
|
2023-07-31 13:46:32 +00:00 |
|
Felix Marty
|
179776bd1d
|
exllama kernel
|
2023-07-31 11:50:45 +00:00 |
|
Felix Marty
|
23eb519e68
|
typo
|
2023-07-28 17:45:34 +00:00 |
|
Felix Marty
|
caf6625b68
|
warning about triton
|
2023-07-28 17:42:37 +00:00 |
|
PanQiWei
|
722a621aaa
|
simplified code
|
2023-07-26 17:53:47 +08:00 |
|
Casper
|
992a0ab102
|
Reference Perplexity class
|
2023-06-19 20:03:32 +02:00 |
|
Casper
|
b351c8c547
|
Add perplexity calculation class
|
2023-06-19 20:03:22 +02:00 |
|
qwopqwop200
|
b1a8cc28e8
|
remove raise
|
2023-05-31 00:03:51 +09:00 |
|
PanQiWei
|
6c64b0b361
|
raise NotImplementedError when model with fused attention injected try to use ADAPTION_PROMPT peft type
|
2023-05-28 22:35:34 +08:00 |
|
PanQiWei
|
def084bf0e
|
reset value of AdaptionPromptConfig.adapter_layers to number of model's hidden layers when exceeds
|
2023-05-28 22:11:02 +08:00 |
|
PanQiWei
|
ad10c13d40
|
support AdaLora
|
2023-05-28 21:30:45 +08:00 |
|
PanQiWei
|
3ee2daa73c
|
make GPTQLoraModel to inherit from LoraModel to simplify code
|
2023-05-28 17:36:18 +08:00 |
|
PanQiWei
|
22d1d8dcaa
|
add 'auto_find_all_linears' argument to get_gptq_peft_model function
|
2023-05-28 17:04:38 +08:00 |
|
PanQiWei
|
83132a663a
|
add warning to guide users interact with lora properly
|
2023-05-28 16:57:31 +08:00 |
|
PanQiWei
|
5bc5325920
|
add find_all_linear_names help function, make customized lora module more general
|
2023-05-27 07:49:17 +08:00 |
|
PanQiWei
|
8bf21a7e4c
|
set xavier_uniform_ as lora_A's init function
|
2023-05-26 14:06:53 +08:00 |
|
PanQiWei
|
2b532f9453
|
add trainable mode
|
2023-05-26 13:11:30 +08:00 |
|
PanQiWei
|
cfd27e8caa
|
refactor file structure of qlinears
|
2023-05-26 07:18:16 +08:00 |
|
PanQiWei
|
f6a34137e9
|
lora compatibility
|
2023-05-25 19:44:53 +08:00 |
|
PanQiWei
|
d293bf3a04
|
first upload peft_utils.py
|
2023-05-25 15:11:11 +08:00 |
|
PanQiWei
|
86b3b52c63
|
fix ImportError when triton is not installed
|
2023-05-20 16:15:20 +08:00 |
|
PanQiWei
|
5445c67190
|
add library version comparison help functions
|
2023-05-14 16:16:06 +08:00 |
|
PanQiWei
|
d718d63e9c
|
add import_utils.py for commonly used module importation
|
2023-05-12 19:58:48 +08:00 |
|
PanQiWei
|
41564a48db
|
make data_utils.py as global utils
|
2023-04-28 18:08:58 +08:00 |
|