PanQiWei
|
86f060c74b
|
Merge branch 'main' into peft_integration
|
2023-05-28 16:23:38 +08:00 |
|
PanQiWei
|
491da62402
|
fix signature at import time
|
2023-05-27 17:49:58 +08:00 |
|
潘其威(William)
|
23998345f5
|
Merge branch 'main' into falcon
|
2023-05-27 16:23:16 +08:00 |
|
Bill Cai
|
0729760234
|
Update auto.py
|
2023-05-27 11:16:43 +08:00 |
|
潘其威(William)
|
269ef7335c
|
Merge branch 'main' into falcon
|
2023-05-27 08:15:52 +08:00 |
|
潘其威(William)
|
3c3b0e1e79
|
Merge branch 'main' into GPTBigCode
|
2023-05-27 08:03:03 +08:00 |
|
潘其威(William)
|
eab728b263
|
Merge branch 'main' into Codegen
|
2023-05-27 08:00:19 +08:00 |
|
潘其威(William)
|
f6fd314d5a
|
Merge branch 'main' into GPTBigCode
|
2023-05-27 07:57:25 +08:00 |
|
qwopqwop200
|
277809381b
|
fix bug
|
2023-05-27 08:53:47 +09:00 |
|
qwopqwop200
|
bcb345fb35
|
support falcon
|
2023-05-27 07:53:39 +09:00 |
|
qwopqwop200
|
4d5b4fa5c6
|
add dtype
|
2023-05-27 07:49:28 +09:00 |
|
qwopqwop200
|
c14b4c1567
|
change find layer algorithm
|
2023-05-27 07:48:50 +09:00 |
|
PanQiWei
|
f7e705848a
|
move peft compatible model injection to the last step
|
2023-05-26 14:29:33 +08:00 |
|
PanQiWei
|
2b532f9453
|
add trainable mode
|
2023-05-26 13:11:30 +08:00 |
|
PanQiWei
|
cfd27e8caa
|
refactor file structure of qlinears
|
2023-05-26 07:18:16 +08:00 |
|
PanQiWei
|
4d157a3b64
|
add hack of __getattr__
|
2023-05-25 15:10:33 +08:00 |
|
TheBloke
|
b7bb50b4d5
|
Fix bug added after merge
|
2023-05-25 07:05:51 +01:00 |
|
Tom Jobbins
|
492255b400
|
Merge branch 'main' into TheBloke_support-HF-download
|
2023-05-25 07:02:13 +01:00 |
|
PanQiWei
|
096749fe9d
|
generalize QuantLinear
|
2023-05-25 13:33:09 +08:00 |
|
PanQiWei
|
94ef4d5ada
|
update basic usage example code
|
2023-05-24 17:56:46 +08:00 |
|
PanQiWei
|
c89bb6450c
|
correct typo of function name
|
2023-05-24 17:43:38 +08:00 |
|
PanQiWei
|
10347fdd7b
|
remove full_cpu_offload argument and unify model dispatch strategy
|
2023-05-24 17:41:04 +08:00 |
|
PanQiWei
|
379f24c2a5
|
remove add_align_logits_hook_to_model
|
2023-05-24 17:01:57 +08:00 |
|
PanQiWei
|
749dba1a7e
|
disable add_align_logits_hook_to_model for now
|
2023-05-24 13:42:06 +08:00 |
|
PanQiWei
|
58c1b509f0
|
support add_align_logits_hook_to_model
|
2023-05-24 12:50:30 +08:00 |
|
PanQiWei
|
21ab7c435a
|
make comments more readable
|
2023-05-24 11:38:29 +08:00 |
|
PanQiWei
|
c31b370228
|
make_sure_not_tensor_in_meta_device before load checkpoint
|
2023-05-24 11:32:45 +08:00 |
|
PanQiWei
|
63f1b4e073
|
remove comment
|
2023-05-24 11:23:07 +08:00 |
|
PanQiWei
|
057c39e3f2
|
fix meta device bug when use low_cpu_mem_usage
|
2023-05-24 11:19:59 +08:00 |
|
PanQiWei
|
e2e7809a1f
|
always to enable QuantLinear bias to make compatible with model quantized from other frameworks
|
2023-05-24 10:56:31 +08:00 |
|
PanQiWei
|
191da8141e
|
fix device mismatch
|
2023-05-23 23:22:52 +08:00 |
|
PanQiWei
|
e4e90e8b0a
|
add warmup_triton method
|
2023-05-23 23:18:46 +08:00 |
|
PanQiWei
|
ed14d3a786
|
fix save quantized model failed when load pretrained model using CPU offload
|
2023-05-23 23:17:11 +08:00 |
|
PanQiWei
|
6476ee4235
|
add options: 'low_cpu_mem_usage' and 'full_cpu_offload'
|
2023-05-23 22:51:00 +08:00 |
|
PanQiWei
|
1b2159bd4c
|
add more help functions
|
2023-05-23 19:30:28 +08:00 |
|
TheBloke
|
bf633c298e
|
Clean up some unused params
|
2023-05-20 10:32:27 +01:00 |
|
PanQiWei
|
86b3b52c63
|
fix ImportError when triton is not installed
|
2023-05-20 16:15:20 +08:00 |
|
潘其威(William)
|
13defe253a
|
Merge pull request #84 from TheBloke/TheBloke_forward-positional-args
Forward position args to allow `model(tokens)` syntax
|
2023-05-20 15:04:27 +08:00 |
|
潘其威(William)
|
1ef0af824a
|
Merge pull request #80 from PanQiWei/user_customized_device_map
Support users customize `device_map`
|
2023-05-20 15:00:05 +08:00 |
|
TheBloke
|
e5c8479100
|
Remove debugging print line
|
2023-05-19 17:50:48 +01:00 |
|
TheBloke
|
735f7df4cc
|
Add push_to_hub for HF hub uploading
|
2023-05-19 17:10:57 +01:00 |
|
TheBloke
|
908b338436
|
Initial support for model loading from HF hub
|
2023-05-19 15:57:05 +01:00 |
|
TheBloke
|
a397f00cc3
|
Implement HF cached download for quantize_config
|
2023-05-19 15:15:43 +01:00 |
|
TheBloke
|
7f165337ed
|
Forward position args to allow syntax
|
2023-05-16 12:19:52 +01:00 |
|
PanQiWei
|
759d6953d4
|
support user customize device_map
|
2023-05-15 13:26:38 +08:00 |
|
PanQiWei
|
07e06fa08c
|
make compatible with older transformers version
|
2023-05-15 13:26:18 +08:00 |
|
oobabooga
|
86c7021285
|
Look for .pt files
|
2023-05-15 00:00:05 -03:00 |
|
PanQiWei
|
d5429441ef
|
add GPTJ fused attention module
|
2023-05-14 16:17:21 +08:00 |
|
PanQiWei
|
5445c67190
|
add library version comparison help functions
|
2023-05-14 16:16:06 +08:00 |
|
潘其威(William)
|
bdb08c16fc
|
Merge branch 'main' into Codegen
|
2023-05-14 13:10:52 +08:00 |
|