AutoGPTQ/examples/benchmark
潘其威(William) bf521cbe7b
Merge pull request #134 from TheBloke/TB_benchmark
add command flags inject_fused_attention and inject_fused_mlp
2023-06-05 23:02:36 +08:00
..
generation_speed.py Merge pull request #134 from TheBloke/TB_benchmark 2023-06-05 23:02:36 +08:00