AutoGPTQ

Author	SHA1	Message	Date
qwopqwop200	ac41f68532	add gpt2	2023-04-28 09:14:05 +09:00
qwopqwop200	dad249990c	add gpt2	2023-04-28 09:13:22 +09:00
qwopqwop200	435eebee4b	support conv1d,conv2d	2023-04-28 09:13:00 +09:00
qwopqwop200	cc0f71a568	add gpt2	2023-04-28 09:11:50 +09:00
qwopqwop200	3f90a22632	fix bug	2023-04-28 08:26:58 +09:00
qwopqwop200	9c38393e31	fix bug about wf meta device	2023-04-28 08:26:11 +09:00
PanQiWei	d0cd5af5d3	make code more robust	2023-04-28 01:29:12 +08:00
PanQiWei	51d2e53130	add support to cpu offloading and multi gpus inference on quantized model	2023-04-28 00:53:57 +08:00
PanQiWei	b14dca9207	disk offload assertion	2023-04-27 21:31:53 +08:00
PanQiWei	7a3397e7ba	add cpu offload when doing quantization	2023-04-27 21:25:24 +08:00
PanQiWei	ac3f7054e0	big fix	2023-04-27 19:33:25 +08:00
PanQiWei	498de923f2	support multi gpus quantization	2023-04-27 18:48:43 +08:00
qwopqwop200	8b6ee04aee	add option	2023-04-27 17:29:36 +09:00
PanQiWei	c9bb427546	align 'from_pretrained' api	2023-04-27 02:29:32 +08:00
PanQiWei	a2abff983e	support dispatch layers to different devices when loading pretrained model before quantization	2023-04-27 02:24:08 +08:00
PanQiWei	950f203260	add 'n_positions' to sequence length search list	2023-04-27 01:09:10 +08:00
PanQiWei	893c3264cb	make layer ignorance more robust	2023-04-26 19:35:19 +08:00
PanQiWei	f2359f56cb	add support to use push_to_hub to upload and share quantized model	2023-04-26 16:55:01 +08:00
PanQiWei	bf2ae6768d	bug fix	2023-04-26 13:33:56 +08:00
PanQiWei	73cb1dbf09	optimize import and format code	2023-04-26 13:08:47 +08:00
PanQiWei	975f100d0f	init Quantizer() at GPTQ() init stage	2023-04-25 23:13:09 +08:00
PanQiWei	c35dce525e	format code	2023-04-25 22:58:52 +08:00
PanQiWei	9f7f44146f	format code	2023-04-25 22:45:27 +08:00
PanQiWei	b71211b4c3	format code	2023-04-25 22:36:28 +08:00
PanQiWei	062b34f31a	add inference_mode and autocast context manager to generate function	2023-04-25 20:47:33 +08:00
PanQiWei	7915278e5f	bug fix	2023-04-25 20:43:40 +08:00
PanQiWei	31d683f85b	add option to choose whether autotune warmup or not after quantization	2023-04-25 20:29:05 +08:00
PanQiWei	9c405b1628	add triton support	2023-04-25 20:05:22 +08:00
PanQiWei	832dc4a7a1	refactor file structure	2023-04-25 18:58:20 +08:00
PanQiWei	0bb84053a2	update setup.py	2023-04-25 18:50:21 +08:00
PanQiWei	419160b733	always trust remote code	2023-04-25 12:52:49 +08:00
PanQiWei	6b6dd3e1e3	always trust remote code	2023-04-25 12:15:32 +08:00
PanQiWei	f748dad2e1	always trust remote code	2023-04-25 12:13:46 +08:00
PanQiWei	a259fb06bb	add support to MOSS model	2023-04-25 11:54:29 +08:00
PanQiWei	7d3a625cee	fix mismatch GPTNeoxForCausalLM's lm_head	2023-04-24 20:51:56 +08:00
PanQiWei	1a8c460262	fix problem that some models required more positional arguments in transformer layer's forward function	2023-04-24 14:46:21 +08:00
PanQiWei	37c0a80092	fix problem that some models can't get seqlen from model.config.max_position_embeddings	2023-04-24 14:24:00 +08:00
PanQiWei	4763c0b9a1	fix bugs	2023-04-23 19:27:16 +08:00
PanQiWei	7ba0edffe0	refactor file structure of modeling module	2023-04-23 17:33:09 +08:00
PanQiWei	3af35a05f0	add text summarization eval task	2023-04-23 16:31:17 +08:00
PanQiWei	6850a1e5bf	refactor file structure of utils	2023-04-23 16:11:54 +08:00
PanQiWei	07ede4c0e5	specifically using needed fields instead of dropping unused ones	2023-04-23 14:25:51 +08:00
PanQiWei	baaf044dd0	add sequence classification evaluation task	2023-04-23 14:25:51 +08:00
PanQiWei	110b6f57e2	force to merge_prompt_label in language modeling task	2023-04-23 14:25:51 +08:00
PanQiWei	c305496e36	add language modeling evaluation task	2023-04-23 14:25:51 +08:00
PanQiWei	15fc47e482	add utils for dataset and dataloader	2023-04-23 14:25:51 +08:00
PanQiWei	a830a62bc3	fix bugs for attention_mask and position_ids	2023-04-20 18:32:21 +08:00
PanQiWei	bcc7e0a051	make BaseGPTQForCausalLM as nn.Module, add more shortcut apis and fix some bugs	2023-04-17 01:15:30 +08:00
PanQiWei	969ec250ad	add shortcut to model.to method	2023-04-17 00:34:14 +08:00
PanQiWei	12ae4d024c	fix gptj forward and add torch.no_grad context manager	2023-04-17 00:15:41 +08:00

... 3 4 5 6 7

308 commits