AutoGPTQ

Author	SHA1	Message	Date
潘其威(William)	1acb0c5eba	Merge pull request #18 from PanQiWei/push_to_hub_integration push_to_hub integration	2023-04-26 17:52:45 +08:00
PanQiWei	e5e9296617	update README.md	2023-04-26 17:51:07 +08:00
PanQiWei	f2359f56cb	add support to use push_to_hub to upload and share quantized model	2023-04-26 16:55:01 +08:00
qwopqwop200	2d85af5f6d	add simple demo ppl test with wikitext2	2023-04-26 17:19:54 +09:00
PanQiWei	a8e748c511	empty cache before switch model	2023-04-26 15:22:30 +08:00
PanQiWei	8f93e97137	update example code	2023-04-25 20:41:27 +08:00
PanQiWei	0a7ee423b8	update example code	2023-04-25 20:24:00 +08:00
PanQiWei	c6af495da2	add --use_triton flag	2023-04-25 20:23:52 +08:00
PanQiWei	f748dad2e1	always trust remote code	2023-04-25 12:13:46 +08:00
PanQiWei	3e074e6ea2	update README.md	2023-04-23 19:39:30 +08:00
PanQiWei	4b506f3dd4	add examples to use eval_tasks module	2023-04-23 19:30:46 +08:00
PanQiWei	1dc723dac5	refactor examples file structure	2023-04-23 19:30:11 +08:00
PanQiWei	be0c894d89	do not use fast tokenizer by default	2023-04-20 18:33:49 +08:00
PanQiWei	a8e80f4a47	optimize data preprocess logic	2023-04-17 01:55:30 +08:00
PanQiWei	21e5438479	add quant_with_alpaca.py and update README.md	2023-04-17 01:27:37 +08:00
PanQiWei	f394250718	update README.md and add some examples	2023-04-16 22:45:28 +08:00

1 2