潘其威(William)
|
b0889e4dab
|
Merge pull request #212 from casperbh96/main
Fix build on non-CUDA machines after #206
|
2023-07-26 18:35:53 +08:00 |
|
Casper
|
c68b4492f6
|
Fix build on non-CUDA machines after #206
|
2023-07-26 12:21:58 +02:00 |
|
PanQiWei
|
ff1f100ded
|
remove argument 'save_dir' in method from_quantized
|
2023-07-26 17:58:04 +08:00 |
|
PanQiWei
|
722a621aaa
|
simplified code
|
2023-07-26 17:53:47 +08:00 |
|
PanQiWei
|
5d6862ee8d
|
update README
|
2023-07-26 14:18:26 +08:00 |
|
潘其威(William)
|
22748dd2b7
|
Merge pull request #209 from PanQiWei/fix_no_cuda_kernel
Fix error raised when CUDA kernels are not installed
|
2023-07-26 14:07:30 +08:00 |
|
潘其威(William)
|
fd24e84eb2
|
Merge pull request #166 from casperbh96/main
[FEATURE] Implement perplexity metric to compare against llama.cpp
|
2023-07-26 14:04:51 +08:00 |
|
PanQiWei
|
5883b45d73
|
fix error raised when cuda kernels are not installed
|
2023-07-26 13:59:28 +08:00 |
|
潘其威(William)
|
bbc4a7c455
|
Merge pull request #208 from TheBloke/TB_Add_SafeTensors_Metadata
Add Safetensors metadata saving, with some values saved to each .safetensor file
|
2023-07-26 11:54:47 +08:00 |
|
潘其威(William)
|
228867a753
|
Merge pull request #207 from TheBloke/TB_version
Add a central version number
|
2023-07-26 11:27:23 +08:00 |
|
潘其威(William)
|
cbc319b4c8
|
Merge pull request #206 from TheBloke/TB_InstallScript
Change the install script so it attempts to build the CUDA extension in all cases
|
2023-07-26 11:20:53 +08:00 |
|
潘其威(William)
|
2456f71125
|
Merge pull request #205 from TheBloke/TB_fix_revision
Fix `revision` and other huggingface_hub kwargs in .from_quantized()
|
2023-07-26 10:34:43 +08:00 |
|
潘其威(William)
|
df4c4312ff
|
Merge pull request #202 from PanQiWei/fix-cuda-bug
Fix cuda bug that causes group_size and desc_act can't be used together
|
2023-07-26 10:32:18 +08:00 |
|
TheBloke
|
2647c92743
|
safetensors_metadata: add conversion to str() for input metadata to avoid errors from save_safe. Warn if this results in keys being overwritten.
|
2023-07-25 21:14:21 +00:00 |
|
TheBloke
|
ee7d80945b
|
Add version to metadata using new value
|
2023-07-25 14:25:24 +00:00 |
|
TheBloke
|
3817d154af
|
Merge branch 'TB_version' into TB_Add_SafeTensors_Metadata
|
2023-07-25 14:09:29 +00:00 |
|
TheBloke
|
7575eae6ab
|
Added to __init__.py to show a central version number. Also slightly adjust way version is stored in setup.py to make it easier to edit on version update. Bump version to 0.3.1 in both
|
2023-07-25 14:06:51 +00:00 |
|
TheBloke
|
eeaf5ebc53
|
Extend huggingface_hub features to AutoGPTQForCausalLM.from_pretrained() so models can be quantised from the hub including using a private token and revision/branch etc
|
2023-07-25 13:26:37 +00:00 |
|
TheBloke
|
593d32cb45
|
Typo in version joining
|
2023-07-25 13:18:52 +00:00 |
|
TheBloke
|
c9124e3fc7
|
Fix revision and other huggingface_hub args for .from_quantized(), which were not being passed through
|
2023-07-25 12:48:33 +00:00 |
|
TheBloke
|
6fc69c5b83
|
Fix check for Torch CUDA version
|
2023-07-25 12:45:27 +00:00 |
|
TheBloke
|
29da6c239f
|
setup.py now builds CUDA ext unless BUILD_CUDA_EXT=0. Also add a check of CUDA_VERSION from Torch, if available. GITHUB_ACTIONS=true is no longer needed.
|
2023-07-25 11:44:43 +00:00 |
|
TheBloke
|
3f359fc778
|
Add support for Safetensors metadata
|
2023-07-25 11:30:39 +00:00 |
|
qwopqwop200
|
9578c59d31
|
fix cuda bug
|
2023-07-25 16:50:05 +09:00 |
|
qwopqwop200
|
ed2aa9368e
|
fix cuda buf
|
2023-07-25 16:46:32 +09:00 |
|
PanQiWei
|
45576f0933
|
0.3.0 release
|
2023-07-16 15:24:06 +08:00 |
|
潘其威(William)
|
c2c5a74f4b
|
Merge pull request #158 from MarisaKirisame/main
Fix stale documentation
|
2023-07-11 10:43:09 +08:00 |
|
潘其威(William)
|
79f8a08a6d
|
Merge pull request #189 from cczhong11/main
Add support for InternLM
|
2023-07-11 10:42:30 +08:00 |
|
tc
|
e28e8ee809
|
Add support for InternLM
|
2023-07-07 09:25:40 -07:00 |
|
PanQiWei
|
590219d048
|
update README
|
2023-07-06 17:15:50 +08:00 |
|
Casper
|
1949e8607d
|
Fix usage of device
|
2023-06-19 20:16:16 +02:00 |
|
Casper
|
5b88f03bba
|
Create example of how to evaluate perplexity
|
2023-06-19 20:03:42 +02:00 |
|
Casper
|
992a0ab102
|
Reference Perplexity class
|
2023-06-19 20:03:32 +02:00 |
|
Casper
|
b351c8c547
|
Add perplexity calculation class
|
2023-06-19 20:03:22 +02:00 |
|
潘其威(William)
|
046c031139
|
Merge pull request #141 from AngainorDev/patch-1
Fix error message
|
2023-06-19 10:11:10 +08:00 |
|
潘其威(William)
|
93368c4e36
|
Merge pull request #164 from LaaZa/Baichuan
Add support for BaiChuan model
|
2023-06-19 10:08:21 +08:00 |
|
LaaZa
|
03577a7698
|
Rename the class to match reference capitalisation
|
2023-06-18 21:01:07 +03:00 |
|
LaaZa
|
9fd558f2ba
|
Add support for Baichuan
|
2023-06-18 20:13:29 +03:00 |
|
Marisa Kirisame
|
ae80f2dc72
|
fix stale documentation
|
2023-06-14 20:04:27 +00:00 |
|
PanQiWei
|
9baff43f6f
|
remove upload sdist step
|
2023-06-08 14:07:38 +08:00 |
|
PanQiWei
|
1b226c7bcf
|
revert use absolute path in include_dirs
|
2023-06-08 14:03:08 +08:00 |
|
PanQiWei
|
7520133a74
|
upload sdist at every job
|
2023-06-08 14:02:46 +08:00 |
|
PanQiWei
|
0d4c54add9
|
remove build_sdist_wheel_cpu_only.yml
|
2023-06-08 14:02:21 +08:00 |
|
PanQiWei
|
bb4924e9a8
|
change workflow name
|
2023-06-08 12:33:59 +08:00 |
|
PanQiWei
|
8801cdf340
|
build_wheels.yml -> build_wheels_cuda.yml
|
2023-06-08 12:32:42 +08:00 |
|
PanQiWei
|
67bb388bf8
|
remove build_sdst job
|
2023-06-08 12:32:17 +08:00 |
|
PanQiWei
|
15d1981f25
|
add build_sdist_wheel_cpu_only.yml
|
2023-06-08 12:31:35 +08:00 |
|
PanQiWei
|
590685cad5
|
use absolute path in include_dirs
|
2023-06-08 12:30:14 +08:00 |
|
潘其威(William)
|
2ea23297c6
|
Merge pull request #140 from geekinglcq/fix_issue95
fix weights not transpose for Conv1D/2D in qlinear_cuda_old
|
2023-06-06 19:54:57 +08:00 |
|
Angainor Development
|
e75611e1b7
|
Fix error message
|
2023-06-05 22:19:09 +02:00 |
|