AutoGPTQ

Author	SHA1	Message	Date
LaaZa	9fd558f2ba	Add support for Baichuan	2023-06-18 20:13:29 +03:00
Angainor Development	e75611e1b7	Fix error message	2023-06-05 22:19:09 +02:00
lunar	618a5f50ee	Add transpose operator when replace Conv1d with qlinear_cuda_old	2023-06-05 23:11:18 +08:00
潘其威(William)	023bb1c593	Merge pull request #125 from PanQiWei/support-32dim Support 32dim	2023-06-03 19:08:29 +08:00
qwopqwop200	f4820f2988	change qlinear cuda support 64dim	2023-06-03 07:30:34 +09:00
潘其威(William)	b4fdd8d264	Merge branch 'main' into peft_integration	2023-06-02 19:11:59 +08:00
qwopqwop200	2df7d7105d	support 64 cuda dim	2023-06-02 19:54:37 +09:00
qwopqwop200	b03f53294f	support 64dim cuda	2023-06-02 19:53:50 +09:00
qwopqwop200	0891ea4036	support 32dim triton]	2023-06-02 19:05:55 +09:00
qwopqwop200	b3654a68c3	support 32dim triton kernel	2023-06-02 19:04:12 +09:00
PanQiWei	ec6603d0ab	support older version python	2023-05-31 22:11:16 +08:00
qwopqwop200	b1a8cc28e8	remove raise	2023-05-31 00:03:51 +09:00
qwopqwop200	c381958a5f	add warning	2023-05-30 23:53:33 +09:00
qwopqwop200	0f2841cb13	remove log	2023-05-30 23:51:55 +09:00
qwopqwop200	33809a8e59	remove log	2023-05-30 23:51:39 +09:00
qwopqwop200	dfd9dc0e6b	change if trainable backend pytorch	2023-05-30 23:43:55 +09:00
qwopqwop200	5274313067	change if trainable backend pytorch	2023-05-30 23:40:58 +09:00
潘其威(William)	defc96ff04	Merge pull request #91 from TheBloke/TheBloke_support-HF-download Add support for HF Hub download, and `push_to_hub`	2023-05-30 07:37:15 +08:00
潘其威(William)	2245fad095	Update auto.py fix None type error	2023-05-30 07:35:15 +08:00
潘其威(William)	15db2cdc44	Update _base.py fix problem that recursively adding file extension to model_base_name	2023-05-30 07:26:42 +08:00
潘其威(William)	cfa7271617	Update _base.py fix variable not found error	2023-05-30 07:22:10 +08:00
潘其威(William)	e5771fb206	Update _base.py fix key mismatch	2023-05-30 06:44:45 +08:00
潘其威(William)	61a4ea035f	Update auto.py add back save_dir for backward compatible	2023-05-30 06:43:00 +08:00
潘其威(William)	ea74e15199	Update _base.py add model_name_or_path and model_file_base_name to BaseQuantizeConfig for better model file management; add back save_dir to .from_quantized() for backward compatible	2023-05-30 06:40:31 +08:00
PanQiWei	6c64b0b361	raise NotImplementedError when model with fused attention injected try to use ADAPTION_PROMPT peft type	2023-05-28 22:35:34 +08:00
PanQiWei	def084bf0e	reset value of AdaptionPromptConfig.adapter_layers to number of model's hidden layers when exceeds	2023-05-28 22:11:02 +08:00
PanQiWei	ad10c13d40	support AdaLora	2023-05-28 21:30:45 +08:00
PanQiWei	3ee2daa73c	make GPTQLoraModel to inherit from LoraModel to simplify code	2023-05-28 17:36:18 +08:00
PanQiWei	22d1d8dcaa	add 'auto_find_all_linears' argument to get_gptq_peft_model function	2023-05-28 17:04:38 +08:00
PanQiWei	83132a663a	add warning to guide users interact with lora properly	2023-05-28 16:57:31 +08:00
PanQiWei	86f060c74b	Merge branch 'main' into peft_integration	2023-05-28 16:23:38 +08:00
PanQiWei	491da62402	fix signature at import time	2023-05-27 17:49:58 +08:00
潘其威(William)	23998345f5	Merge branch 'main' into falcon	2023-05-27 16:23:16 +08:00
Bill Cai	0729760234	Update auto.py	2023-05-27 11:16:43 +08:00
潘其威(William)	269ef7335c	Merge branch 'main' into falcon	2023-05-27 08:15:52 +08:00
潘其威(William)	3c3b0e1e79	Merge branch 'main' into GPTBigCode	2023-05-27 08:03:03 +08:00
潘其威(William)	eab728b263	Merge branch 'main' into Codegen	2023-05-27 08:00:19 +08:00
潘其威(William)	f6fd314d5a	Merge branch 'main' into GPTBigCode	2023-05-27 07:57:25 +08:00
qwopqwop200	277809381b	fix bug	2023-05-27 08:53:47 +09:00
PanQiWei	5bc5325920	add find_all_linear_names help function, make customized lora module more general	2023-05-27 07:49:17 +08:00
PanQiWei	eb9c0b140f	update FusedLlamaMLPForQuantizedModel for general usage purpose	2023-05-27 07:47:20 +08:00
qwopqwop200	bcb345fb35	support falcon	2023-05-27 07:53:39 +09:00
qwopqwop200	4d5b4fa5c6	add dtype	2023-05-27 07:49:28 +09:00
qwopqwop200	c14b4c1567	change find layer algorithm	2023-05-27 07:48:50 +09:00
PanQiWei	f7e705848a	move peft compatible model injection to the last step	2023-05-26 14:29:33 +08:00
PanQiWei	8bf21a7e4c	set xavier_uniform_ as lora_A's init function	2023-05-26 14:06:53 +08:00
PanQiWei	2b532f9453	add trainable mode	2023-05-26 13:11:30 +08:00
PanQiWei	fe5f5d12ed	Merge branch 'main' into peft_integration	2023-05-26 09:48:06 +08:00
PanQiWei	69609c4bc7	support faster vecquant4matmul cuda kernel	2023-05-26 08:55:05 +08:00
PanQiWei	cfd27e8caa	refactor file structure of qlinears	2023-05-26 07:18:16 +08:00

1 2 3 4 5 ...

284 commits