| Felix Marty | c203a85dee | fix | 2023-08-04 15:00:12 +00:00 |
| Felix Marty | d0608b09db | rocm support | 2023-08-04 13:38:02 +00:00 |
| Félix Marty | 4fb3e20c5e | Merge branch 'main' into exllama-q4-kernel | 2023-08-04 15:13:34 +02:00 |
| Felix Marty | 179776bd1d | exllama kernel | 2023-07-31 11:50:45 +00:00 |
| Felix Marty | 677d23be2d | style | 2023-07-28 15:14:46 +00:00 |
| Felix Marty | 2cb191e114 | fix bugs | 2023-07-28 14:10:44 +00:00 |
| Felix Marty | 547fb198d1 | fix | 2023-07-27 12:36:25 +00:00 |
| qwopqwop200 | ed2aa9368e | fix cuda buf | 2023-07-25 16:46:32 +09:00 |
| qwopqwop200 | e04c3b86cc | add cuda | 2023-06-03 07:28:35 +09:00 |
| qwopqwop200 | 5fc2064e1a | Rename autogptq_cuda_kernel.cu to autogptq_cuda_kernel_64.cu | 2023-06-03 07:27:45 +09:00 |
| qwopqwop200 | 446e12d3de | Rename autogptq_cuda.cpp to autogptq_cuda_64.cpp | 2023-06-03 07:27:31 +09:00 |
| qwopqwop200 | 90106d7c34 | support cuda 64dim | 2023-06-02 19:49:38 +09:00 |
| PanQiWei | 69609c4bc7 | support faster vecquant4matmul cuda kernel | 2023-05-26 08:55:05 +08:00 |
| TheBloke | 898f1ef62d | Rename 'quant_cuda' to 'autogptq_cuda' to avoid conflicts with existing GPTQ-for-LLaMa installations. | 2023-05-20 09:33:51 +01:00 |