AutoGPTQ

History

PanQiWei fdb8c4500a extend to support qlinear_exllama's fusion		2023-08-11 14:52:26 +08:00
..
__init__.py	add FusedGeneralQuantLinear	2023-08-04 19:10:32 +08:00
attention.py	support inherit one of the three fused attention class and customize attn_bias building logic	2023-08-07 18:59:04 +08:00
linear.py	extend to support qlinear_exllama's fusion	2023-08-11 14:52:26 +08:00
mlp.py	doing 'memory_efficient_fusion' in __init__	2023-08-06 17:23:57 +08:00