AutoGPTQ

Author	SHA1	Message	Date
Felix Marty	c203a85dee	fix	2023-08-04 15:00:12 +00:00
Felix Marty	d0608b09db	rocm support	2023-08-04 13:38:02 +00:00
Félix Marty	4fb3e20c5e	Merge branch 'main' into exllama-q4-kernel	2023-08-04 15:13:34 +02:00
Felix Marty	179776bd1d	exllama kernel	2023-07-31 11:50:45 +00:00
Felix Marty	677d23be2d	style	2023-07-28 15:14:46 +00:00
Felix Marty	2cb191e114	fix bugs	2023-07-28 14:10:44 +00:00
Felix Marty	547fb198d1	fix	2023-07-27 12:36:25 +00:00
qwopqwop200	ed2aa9368e	fix cuda buf	2023-07-25 16:46:32 +09:00
qwopqwop200	e04c3b86cc	add cuda	2023-06-03 07:28:35 +09:00
qwopqwop200	5fc2064e1a	Rename autogptq_cuda_kernel.cu to autogptq_cuda_kernel_64.cu	2023-06-03 07:27:45 +09:00
qwopqwop200	446e12d3de	Rename autogptq_cuda.cpp to autogptq_cuda_64.cpp	2023-06-03 07:27:31 +09:00
qwopqwop200	90106d7c34	support cuda 64dim	2023-06-02 19:49:38 +09:00
PanQiWei	69609c4bc7	support faster vecquant4matmul cuda kernel	2023-05-26 08:55:05 +08:00
TheBloke	898f1ef62d	Rename 'quant_cuda' to 'autogptq_cuda' to avoid conflicts with existing GPTQ-for-LLaMa installations.	2023-05-20 09:33:51 +01:00