Commit graph

14 commits

Author SHA1 Message Date
Felix Marty
c203a85dee fix 2023-08-04 15:00:12 +00:00
Felix Marty
d0608b09db rocm support 2023-08-04 13:38:02 +00:00
Félix Marty
4fb3e20c5e Merge branch 'main' into exllama-q4-kernel 2023-08-04 15:13:34 +02:00
Felix Marty
179776bd1d exllama kernel 2023-07-31 11:50:45 +00:00
Felix Marty
677d23be2d style 2023-07-28 15:14:46 +00:00
Felix Marty
2cb191e114 fix bugs 2023-07-28 14:10:44 +00:00
Felix Marty
547fb198d1 fix 2023-07-27 12:36:25 +00:00
qwopqwop200
ed2aa9368e fix cuda buf 2023-07-25 16:46:32 +09:00
qwopqwop200
e04c3b86cc add cuda 2023-06-03 07:28:35 +09:00
qwopqwop200
5fc2064e1a Rename autogptq_cuda_kernel.cu to autogptq_cuda_kernel_64.cu 2023-06-03 07:27:45 +09:00
qwopqwop200
446e12d3de Rename autogptq_cuda.cpp to autogptq_cuda_64.cpp 2023-06-03 07:27:31 +09:00
qwopqwop200
90106d7c34 support cuda 64dim 2023-06-02 19:49:38 +09:00
PanQiWei
69609c4bc7 support faster vecquant4matmul cuda kernel 2023-05-26 08:55:05 +08:00
TheBloke
898f1ef62d Rename 'quant_cuda' to 'autogptq_cuda' to avoid conflicts with existing GPTQ-for-LLaMa installations. 2023-05-20 09:33:51 +01:00