AutoGPTQ/auto_gptq/nn_modules
2023-04-28 16:41:46 +08:00
..
triton_utils add triton support 2023-04-25 20:05:22 +08:00
__init__.py refactor file structure 2023-04-25 18:58:20 +08:00
qlinear.py fix device mismatch when directly using model to inference after quantization 2023-04-28 16:41:46 +08:00
qlinear_triton.py support conv1d,conv2d 2023-04-28 09:15:42 +09:00