Commit graph

8 commits

Author SHA1 Message Date
PanQiWei
db9eabfc4b add disable_exllama argument 2023-08-09 12:05:15 +08:00
PanQiWei
ff1f100ded remove argument 'save_dir' in method from_quantized 2023-07-26 17:58:04 +08:00
潘其威(William)
bf521cbe7b
Merge pull request #134 from TheBloke/TB_benchmark
add command flags inject_fused_attention and inject_fused_mlp
2023-06-05 23:02:36 +08:00
TheBloke
edb13d493e Default inject_fused_attention and mlp to True, matching defaults 2023-06-03 17:58:40 +01:00
TheBloke
4617629f0c Support setting inject_fused_attention and inject_fused_mlp to False 2023-06-03 17:48:36 +01:00
PanQiWei
801b1c13ca update example script 2023-05-28 21:30:53 +08:00
PanQiWei
f703c9ac98 update generation_speed.py 2023-05-27 20:15:43 +08:00
PanQiWei
e8bd3c33c4 add generation speed benchmark example script 2023-05-27 19:16:42 +08:00