This website requires JavaScript.
Explore
Help
Sign in
simcop2387
/
AutoGPTQ
Watch
1
Star
0
Fork
You've already forked AutoGPTQ
0
Code
Issues
Pull requests
Projects
Releases
Packages
1
Wiki
Activity
a69a73a22c
AutoGPTQ
/
auto_gptq
/
nn_modules
History
PanQiWei
a69a73a22c
fix device mismatch when directly using model to inference after quantization
2023-04-28 16:41:46 +08:00
..
triton_utils
add triton support
2023-04-25 20:05:22 +08:00
__init__.py
refactor file structure
2023-04-25 18:58:20 +08:00
qlinear.py
fix device mismatch when directly using model to inference after quantization
2023-04-28 16:41:46 +08:00
qlinear_triton.py
support conv1d,conv2d
2023-04-28 09:15:42 +09:00