Commit graph

663 commits

Author SHA1 Message Date
PanQiWei
6395e4b301 update setup.py 2023-07-26 18:58:49 +08:00
PanQiWei
d6b6ec83ef Merge remote-tracking branch 'origin/main' 2023-07-26 18:41:01 +08:00
PanQiWei
1138240385 update version to 0.3.2 2023-07-26 18:40:44 +08:00
潘其威(William)
b0889e4dab
Merge pull request #212 from casperbh96/main
Fix build on non-CUDA machines after #206
2023-07-26 18:35:53 +08:00
Casper
c68b4492f6 Fix build on non-CUDA machines after #206 2023-07-26 12:21:58 +02:00
PanQiWei
ff1f100ded remove argument 'save_dir' in method from_quantized 2023-07-26 17:58:04 +08:00
PanQiWei
722a621aaa simplified code 2023-07-26 17:53:47 +08:00
PanQiWei
5d6862ee8d update README 2023-07-26 14:18:26 +08:00
潘其威(William)
22748dd2b7
Merge pull request #209 from PanQiWei/fix_no_cuda_kernel
Fix error raised when CUDA kernels are not installed
2023-07-26 14:07:30 +08:00
潘其威(William)
fd24e84eb2
Merge pull request #166 from casperbh96/main
[FEATURE] Implement perplexity metric to compare against llama.cpp
2023-07-26 14:04:51 +08:00
PanQiWei
5883b45d73 fix error raised when cuda kernels are not installed 2023-07-26 13:59:28 +08:00
潘其威(William)
bbc4a7c455
Merge pull request #208 from TheBloke/TB_Add_SafeTensors_Metadata
Add Safetensors metadata saving, with some values saved to each .safetensor file
2023-07-26 11:54:47 +08:00
潘其威(William)
228867a753
Merge pull request #207 from TheBloke/TB_version
Add a central version number
2023-07-26 11:27:23 +08:00
潘其威(William)
cbc319b4c8
Merge pull request #206 from TheBloke/TB_InstallScript
Change the install script so it attempts to build the CUDA extension in all cases
2023-07-26 11:20:53 +08:00
潘其威(William)
2456f71125
Merge pull request #205 from TheBloke/TB_fix_revision
Fix `revision` and other huggingface_hub kwargs in .from_quantized()
2023-07-26 10:34:43 +08:00
潘其威(William)
df4c4312ff
Merge pull request #202 from PanQiWei/fix-cuda-bug
Fix cuda bug that causes group_size and desc_act can't be used together
2023-07-26 10:32:18 +08:00
TheBloke
2647c92743 safetensors_metadata: add conversion to str() for input metadata to avoid errors from save_safe. Warn if this results in keys being overwritten. 2023-07-25 21:14:21 +00:00
TheBloke
ee7d80945b Add version to metadata using new value 2023-07-25 14:25:24 +00:00
TheBloke
3817d154af Merge branch 'TB_version' into TB_Add_SafeTensors_Metadata 2023-07-25 14:09:29 +00:00
TheBloke
7575eae6ab Added to __init__.py to show a central version number. Also slightly adjust way version is stored in setup.py to make it easier to edit on version update. Bump version to 0.3.1 in both 2023-07-25 14:06:51 +00:00
TheBloke
eeaf5ebc53 Extend huggingface_hub features to AutoGPTQForCausalLM.from_pretrained() so models can be quantised from the hub including using a private token and revision/branch etc 2023-07-25 13:26:37 +00:00
TheBloke
593d32cb45 Typo in version joining 2023-07-25 13:18:52 +00:00
TheBloke
c9124e3fc7 Fix revision and other huggingface_hub args for .from_quantized(), which were not being passed through 2023-07-25 12:48:33 +00:00
TheBloke
6fc69c5b83 Fix check for Torch CUDA version 2023-07-25 12:45:27 +00:00
TheBloke
29da6c239f setup.py now builds CUDA ext unless BUILD_CUDA_EXT=0. Also add a check of CUDA_VERSION from Torch, if available. GITHUB_ACTIONS=true is no longer needed. 2023-07-25 11:44:43 +00:00
TheBloke
3f359fc778 Add support for Safetensors metadata 2023-07-25 11:30:39 +00:00
qwopqwop200
9578c59d31 fix cuda bug 2023-07-25 16:50:05 +09:00
qwopqwop200
ed2aa9368e fix cuda buf 2023-07-25 16:46:32 +09:00
PanQiWei
45576f0933 0.3.0 release 2023-07-16 15:24:06 +08:00
潘其威(William)
c2c5a74f4b
Merge pull request #158 from MarisaKirisame/main
Fix stale documentation
2023-07-11 10:43:09 +08:00
潘其威(William)
79f8a08a6d
Merge pull request #189 from cczhong11/main
Add support for InternLM
2023-07-11 10:42:30 +08:00
tc
e28e8ee809 Add support for InternLM 2023-07-07 09:25:40 -07:00
PanQiWei
590219d048 update README 2023-07-06 17:15:50 +08:00
Casper
1949e8607d Fix usage of device 2023-06-19 20:16:16 +02:00
Casper
5b88f03bba Create example of how to evaluate perplexity 2023-06-19 20:03:42 +02:00
Casper
992a0ab102 Reference Perplexity class 2023-06-19 20:03:32 +02:00
Casper
b351c8c547 Add perplexity calculation class 2023-06-19 20:03:22 +02:00
潘其威(William)
046c031139
Merge pull request #141 from AngainorDev/patch-1
Fix error message
2023-06-19 10:11:10 +08:00
潘其威(William)
93368c4e36
Merge pull request #164 from LaaZa/Baichuan
Add support for BaiChuan model
2023-06-19 10:08:21 +08:00
LaaZa
03577a7698 Rename the class to match reference capitalisation 2023-06-18 21:01:07 +03:00
LaaZa
9fd558f2ba Add support for Baichuan 2023-06-18 20:13:29 +03:00
Marisa Kirisame
ae80f2dc72 fix stale documentation 2023-06-14 20:04:27 +00:00
PanQiWei
9baff43f6f remove upload sdist step 2023-06-08 14:07:38 +08:00
PanQiWei
1b226c7bcf revert use absolute path in include_dirs 2023-06-08 14:03:08 +08:00
PanQiWei
7520133a74 upload sdist at every job 2023-06-08 14:02:46 +08:00
PanQiWei
0d4c54add9 remove build_sdist_wheel_cpu_only.yml 2023-06-08 14:02:21 +08:00
PanQiWei
bb4924e9a8 change workflow name 2023-06-08 12:33:59 +08:00
PanQiWei
8801cdf340 build_wheels.yml -> build_wheels_cuda.yml 2023-06-08 12:32:42 +08:00
PanQiWei
67bb388bf8 remove build_sdst job 2023-06-08 12:32:17 +08:00
PanQiWei
15d1981f25 add build_sdist_wheel_cpu_only.yml 2023-06-08 12:31:35 +08:00