Commit graph

66 commits

Author SHA1 Message Date
潘其威(William)
1acb0c5eba
Merge pull request #18 from PanQiWei/push_to_hub_integration
push_to_hub integration
2023-04-26 17:52:45 +08:00
PanQiWei
e5e9296617 update README.md 2023-04-26 17:51:07 +08:00
PanQiWei
f2359f56cb add support to use push_to_hub to upload and share quantized model 2023-04-26 16:55:01 +08:00
qwopqwop200
2d85af5f6d
add simple demo ppl test with wikitext2 2023-04-26 17:19:54 +09:00
PanQiWei
a8e748c511 empty cache before switch model 2023-04-26 15:22:30 +08:00
PanQiWei
8f93e97137 update example code 2023-04-25 20:41:27 +08:00
PanQiWei
0a7ee423b8 update example code 2023-04-25 20:24:00 +08:00
PanQiWei
c6af495da2 add --use_triton flag 2023-04-25 20:23:52 +08:00
PanQiWei
f748dad2e1 always trust remote code 2023-04-25 12:13:46 +08:00
PanQiWei
3e074e6ea2 update README.md 2023-04-23 19:39:30 +08:00
PanQiWei
4b506f3dd4 add examples to use eval_tasks module 2023-04-23 19:30:46 +08:00
PanQiWei
1dc723dac5 refactor examples file structure 2023-04-23 19:30:11 +08:00
PanQiWei
be0c894d89 do not use fast tokenizer by default 2023-04-20 18:33:49 +08:00
PanQiWei
a8e80f4a47 optimize data preprocess logic 2023-04-17 01:55:30 +08:00
PanQiWei
21e5438479 add quant_with_alpaca.py and update README.md 2023-04-17 01:27:37 +08:00
PanQiWei
f394250718 update README.md and add some examples 2023-04-16 22:45:28 +08:00