AutoGPTQ

Author	SHA1	Message	Date
潘其威(William)	04db761eed	Merge pull request #347 from alex4321/peft-model-use-adapter-name Use `adapter_name` for `get_gptq_peft_model` with `train_mode=True`	2023-09-26 13:55:06 +08:00
Marc Sun	c912bf361a	exllamav2 integration	2023-09-25 16:51:18 +00:00
Alexander Pozharskii	0185095402	Use `adapter_name` for `get_gptq_peft_model` with `train_mode=True`	2023-09-24 17:11:19 +04:00
qwopqwop200	45a1ee4d84	install check qigen	2023-08-31 14:37:39 +09:00
qwopqwop200	6a9d80eddc	Merge remote-tracking branch 'qwopqwop200/main' into main	2023-08-25 18:06:03 +09:00
Felix Marty	04730ac66c	expose api to set exllama max length	2023-08-24 11:22:15 +00:00
qwopqwop200	084c9d8860	name change	2023-08-17 15:17:09 +09:00
qwopqwop200	7ba78af3ae	support cpu	2023-08-10 22:48:04 +09:00
PanQiWei	44c7a1a184	make exllama_kernels compilation as optional	2023-08-09 17:42:22 +08:00
Félix Marty	4fb3e20c5e	Merge branch 'main' into exllama-q4-kernel	2023-08-04 15:13:34 +02:00
Felix Marty	38447262c0	fix fused attn	2023-07-31 13:46:32 +00:00
Felix Marty	179776bd1d	exllama kernel	2023-07-31 11:50:45 +00:00
Felix Marty	23eb519e68	typo	2023-07-28 17:45:34 +00:00
Felix Marty	caf6625b68	warning about triton	2023-07-28 17:42:37 +00:00
PanQiWei	722a621aaa	simplified code	2023-07-26 17:53:47 +08:00
Casper	992a0ab102	Reference Perplexity class	2023-06-19 20:03:32 +02:00
Casper	b351c8c547	Add perplexity calculation class	2023-06-19 20:03:22 +02:00
qwopqwop200	b1a8cc28e8	remove raise	2023-05-31 00:03:51 +09:00
PanQiWei	6c64b0b361	raise NotImplementedError when model with fused attention injected try to use ADAPTION_PROMPT peft type	2023-05-28 22:35:34 +08:00
PanQiWei	def084bf0e	reset value of AdaptionPromptConfig.adapter_layers to number of model's hidden layers when exceeds	2023-05-28 22:11:02 +08:00
PanQiWei	ad10c13d40	support AdaLora	2023-05-28 21:30:45 +08:00
PanQiWei	3ee2daa73c	make GPTQLoraModel to inherit from LoraModel to simplify code	2023-05-28 17:36:18 +08:00
PanQiWei	22d1d8dcaa	add 'auto_find_all_linears' argument to get_gptq_peft_model function	2023-05-28 17:04:38 +08:00
PanQiWei	83132a663a	add warning to guide users interact with lora properly	2023-05-28 16:57:31 +08:00
PanQiWei	5bc5325920	add find_all_linear_names help function, make customized lora module more general	2023-05-27 07:49:17 +08:00
PanQiWei	8bf21a7e4c	set xavier_uniform_ as lora_A's init function	2023-05-26 14:06:53 +08:00
PanQiWei	2b532f9453	add trainable mode	2023-05-26 13:11:30 +08:00
PanQiWei	cfd27e8caa	refactor file structure of qlinears	2023-05-26 07:18:16 +08:00
PanQiWei	f6a34137e9	lora compatibility	2023-05-25 19:44:53 +08:00
PanQiWei	d293bf3a04	first upload peft_utils.py	2023-05-25 15:11:11 +08:00
PanQiWei	86b3b52c63	fix ImportError when triton is not installed	2023-05-20 16:15:20 +08:00
PanQiWei	5445c67190	add library version comparison help functions	2023-05-14 16:16:06 +08:00
PanQiWei	d718d63e9c	add import_utils.py for commonly used module importation	2023-05-12 19:58:48 +08:00
PanQiWei	41564a48db	make data_utils.py as global utils	2023-04-28 18:08:58 +08:00

34 commits