PanQiWei | ff1f100ded | remove argument 'save_dir' in method from_quantized | 2023-07-26 17:58:04 +08:00
潘其威(William) | bf521cbe7b | Merge pull request #134 from TheBloke/TB_benchmark: add command flags inject_fused_attention and inject_fused_mlp | 2023-06-05 23:02:36 +08:00
TheBloke | edb13d493e | Default inject_fused_attention and mlp to True, matching defaults | 2023-06-03 17:58:40 +01:00
TheBloke | 4617629f0c | Support setting inject_fused_attention and inject_fused_mlp to False | 2023-06-03 17:48:36 +01:00
PanQiWei | 801b1c13ca | update example script | 2023-05-28 21:30:53 +08:00
PanQiWei | f703c9ac98 | update generation_speed.py | 2023-05-27 20:15:43 +08:00
PanQiWei | e8bd3c33c4 | add generation speed benchmark example script | 2023-05-27 19:16:42 +08:00