Commit graph

618 commits

Author SHA1 Message Date
PanQiWei
a259fb06bb add support to MOSS model 2023-04-25 11:54:29 +08:00
PanQiWei
73606a3a16 release v0.0.3 2023-04-25 01:09:30 +08:00
PanQiWei
7d3a625cee fix mismatch GPTNeoxForCausalLM's lm_head 2023-04-24 20:51:56 +08:00
PanQiWei
1a8c460262 fix problem that some models required more positional arguments in transformer layer's forward function 2023-04-24 14:46:21 +08:00
PanQiWei
5657bd2db7 set package version to v0.0.3-dev 2023-04-24 14:24:52 +08:00
PanQiWei
8b487af045 Merge remote-tracking branch 'origin/main' 2023-04-24 14:24:28 +08:00
PanQiWei
37c0a80092 fix problem that some models can't get seqlen from model.config.max_position_embeddings 2023-04-24 14:24:00 +08:00
潘其威(William)
9279602cea fix typo 2023-04-24 13:49:39 +08:00
潘其威(William)
a19f58a870 Merge pull request #10 from eltociear/patch-1 (Update README.md) 2023-04-24 10:14:12 +08:00
Ikko Eltociear Ashimine
cad46eec02 Update README.md (huggingface -> Hugging Face) 2023-04-24 02:22:56 +09:00
潘其威(William)
4f84d21684 Merge pull request #5 from PanQiWei/evaluation_and_benchmark ([WIP]Evaluation and benchmark) 2023-04-23 20:03:00 +08:00
PanQiWei
4d3795b3a5 upgrade package version to v0.0.2 2023-04-23 20:02:35 +08:00
PanQiWei
254d23f5d4 update README.md 2023-04-23 20:02:16 +08:00
PanQiWei
3e074e6ea2 update README.md 2023-04-23 19:39:30 +08:00
PanQiWei
4b506f3dd4 add examples to use eval_tasks module 2023-04-23 19:30:46 +08:00
PanQiWei
1dc723dac5 refactor examples file structure 2023-04-23 19:30:11 +08:00
PanQiWei
4763c0b9a1 fix bugs 2023-04-23 19:27:16 +08:00
PanQiWei
7ba0edffe0 refactor file structure of modeling module 2023-04-23 17:33:09 +08:00
PanQiWei
3af35a05f0 add text summarization eval task 2023-04-23 16:31:17 +08:00
PanQiWei
6850a1e5bf refactor file structure of utils 2023-04-23 16:11:54 +08:00
PanQiWei
f5352fac1b Merge remote-tracking branch 'origin/evaluation_and_benchmark' into evaluation_and_benchmark 2023-04-23 14:26:09 +08:00
PanQiWei
07ede4c0e5 specifically using needed fields instead of dropping unused ones 2023-04-23 14:25:51 +08:00
PanQiWei
687f6272e0 update setup.py 2023-04-23 14:25:51 +08:00
PanQiWei
baaf044dd0 add sequence classification evaluation task 2023-04-23 14:25:51 +08:00
PanQiWei
110b6f57e2 force to merge_prompt_label in language modeling task 2023-04-23 14:25:51 +08:00
PanQiWei
c305496e36 add language modeling evaluation task 2023-04-23 14:25:51 +08:00
PanQiWei
15fc47e482 add utils for dataset and dataloader 2023-04-23 14:25:51 +08:00
潘其威(William)
715fc0788d Update README.md 2023-04-23 10:58:50 +08:00
潘其威(William)
db95aa1f32 Update README.md 2023-04-23 10:58:07 +08:00
潘其威(William)
611f228186 Update README.md 2023-04-22 21:43:10 +08:00
PanQiWei
8dd7ae0dc9 specifically using needed fields instead of dropping unused ones 2023-04-22 21:33:05 +08:00
PanQiWei
99d41ec4aa update setup.py 2023-04-22 21:22:57 +08:00
PanQiWei
19adba7a9b add sequence classification evaluation task 2023-04-22 21:22:51 +08:00
PanQiWei
d9c0c73ffc force to merge_prompt_label in language modeling task 2023-04-22 20:52:11 +08:00
PanQiWei
4908449a28 add language modeling evaluation task 2023-04-22 19:37:51 +08:00
PanQiWei
21437d8b1c add utils for dataset and dataloader 2023-04-22 19:37:12 +08:00
PanQiWei
0a1f19352d gptj no longer requires transformers<=4.27.4 2023-04-20 18:35:22 +08:00
PanQiWei
be0c894d89 do not use fast tokenizer by default 2023-04-20 18:33:49 +08:00
PanQiWei
a830a62bc3 fix bugs for attention_mask and position_ids 2023-04-20 18:32:21 +08:00
PanQiWei
4d34fc11d0 update README.md 2023-04-20 12:21:39 +08:00
PanQiWei
06ab6e7d3d fix typo 2023-04-20 12:16:16 +08:00
PanQiWei
9c93b95df7 update README.md 2023-04-20 12:15:45 +08:00
PanQiWei
0ef6f66168 update README.md 2023-04-20 12:09:56 +08:00
PanQiWei
a8e80f4a47 optimize data preprocess logic 2023-04-17 01:55:30 +08:00
PanQiWei
21e5438479 add quant_with_alpaca.py and update README.md 2023-04-17 01:27:37 +08:00
PanQiWei
bcc7e0a051 make BaseGPTQForCausalLM as nn.Module, add more shortcut apis and fix some bugs 2023-04-17 01:15:30 +08:00
PanQiWei
4331f8a6e4 update README.md 2023-04-17 00:40:28 +08:00
PanQiWei
ecaa0f7389 update README.md 2023-04-17 00:36:26 +08:00
PanQiWei
24258123fd update README.md 2023-04-17 00:35:03 +08:00
PanQiWei
969ec250ad add shortcut to model.to method 2023-04-17 00:34:14 +08:00