Commit graph

618 commits

Author SHA1 Message Date
PanQiWei
1b226c7bcf revert use absolute path in include_dirs 2023-06-08 14:03:08 +08:00
PanQiWei
7520133a74 upload sdist at every job 2023-06-08 14:02:46 +08:00
PanQiWei
0d4c54add9 remove build_sdist_wheel_cpu_only.yml 2023-06-08 14:02:21 +08:00
PanQiWei
bb4924e9a8 change workflow name 2023-06-08 12:33:59 +08:00
PanQiWei
8801cdf340 build_wheels.yml -> build_wheels_cuda.yml 2023-06-08 12:32:42 +08:00
PanQiWei
67bb388bf8 remove build_sdst job 2023-06-08 12:32:17 +08:00
PanQiWei
15d1981f25 add build_sdist_wheel_cpu_only.yml 2023-06-08 12:31:35 +08:00
PanQiWei
590685cad5 use absolute path in include_dirs 2023-06-08 12:30:14 +08:00
潘其威(William)
2ea23297c6
Merge pull request #140 from geekinglcq/fix_issue95
fix weights not transpose for Conv1D/2D in qlinear_cuda_old
2023-06-06 19:54:57 +08:00
Angainor Development
e75611e1b7
Fix error message 2023-06-05 22:19:09 +02:00
lunar
618a5f50ee
Add transpose operator when replace Conv1d with qlinear_cuda_old 2023-06-05 23:11:18 +08:00
潘其威(William)
bf521cbe7b
Merge pull request #134 from TheBloke/TB_benchmark
add command flags inject_fused_attention and inject_fused_mlp
2023-06-05 23:02:36 +08:00
潘其威(William)
8a1616c63c
Merge pull request #102 from PanQiWei/peft_integration
Peft integration
2023-06-05 22:55:34 +08:00
PanQiWei
129884c598 update version to 0.3.0.dev0 2023-06-05 22:53:50 +08:00
PanQiWei
b132d774e3 update README 2023-06-05 22:53:17 +08:00
TheBloke
edb13d493e Default inject_fused_attention and mlp to True, matching defaults 2023-06-03 17:58:40 +01:00
TheBloke
4617629f0c Support setting inject_fused_attention and inject_fused_mlp to False 2023-06-03 17:48:36 +01:00
PanQiWei
923fc87a11 Merge branch 'main' into peft_integration 2023-06-03 19:10:41 +08:00
潘其威(William)
023bb1c593
Merge pull request #125 from PanQiWei/support-32dim
Support 32dim
2023-06-03 19:08:29 +08:00
潘其威(William)
95a4381f50
Merge pull request #126 from PanQiWei/support-cuda-64dim
Support cuda 64dim
2023-06-03 19:08:12 +08:00
潘其威(William)
810ed4de66
Merge pull request #132 from EliEron/patch-1
Specify UTF-8 encoding for README.md in setup.py
2023-06-03 10:57:52 +08:00
qwopqwop200
f4820f2988
change qlinear cuda support 64dim 2023-06-03 07:30:34 +09:00
qwopqwop200
8951212ab3
change setup 2023-06-03 07:29:19 +09:00
qwopqwop200
e04c3b86cc
add cuda 2023-06-03 07:28:35 +09:00
qwopqwop200
5fc2064e1a
Rename autogptq_cuda_kernel.cu to autogptq_cuda_kernel_64.cu 2023-06-03 07:27:45 +09:00
qwopqwop200
446e12d3de
Rename autogptq_cuda.cpp to autogptq_cuda_64.cpp 2023-06-03 07:27:31 +09:00
EliEron
eeb8b78a55
Specify Encoding when reading README.md
Prevents UnicodeDecodeError from being raised in certain locals.
2023-06-02 20:54:32 +02:00
潘其威(William)
b4fdd8d264
Merge branch 'main' into peft_integration 2023-06-02 19:11:59 +08:00
PanQiWei
7206705456 set version to 0.2.1 2023-06-02 19:07:56 +08:00
qwopqwop200
2df7d7105d
support 64 cuda dim 2023-06-02 19:54:37 +09:00
qwopqwop200
b03f53294f
support 64dim cuda 2023-06-02 19:53:50 +09:00
qwopqwop200
90106d7c34
support cuda 64dim 2023-06-02 19:49:38 +09:00
PanQiWei
65c0115b86 update README 2023-06-02 18:18:11 +08:00
PanQiWei
0e609bec40 only append CUDA_VERSION to release version string when in github actions 2023-06-02 18:16:38 +08:00
qwopqwop200
0891ea4036
support 32dim triton] 2023-06-02 19:05:55 +09:00
qwopqwop200
b3654a68c3
support 32dim triton kernel 2023-06-02 19:04:12 +09:00
PanQiWei
50ac2ad4bc update README 2023-06-02 10:59:36 +08:00
PanQiWei
113884d976 Merge remote-tracking branch 'origin/main' 2023-06-02 10:57:16 +08:00
PanQiWei
b248a2655a update README 2023-06-02 10:56:57 +08:00
潘其威(William)
f948b56c07
Merge pull request #123 from jllllll/main
Fix and extend build_wheels.yml workflow
2023-06-02 10:12:20 +08:00
jllllll
0f1793b554
Revert "Remove workflow restriction for testing"
This reverts commit e62bda1c1e.
2023-06-01 20:49:42 -05:00
jllllll
3c6a002be5
Clean up workflow sdist creation 2023-06-01 20:35:30 -05:00
jllllll
e62bda1c1e
Remove workflow restriction for testing 2023-06-01 20:27:40 -05:00
jllllll
996382788b
Finalize workflow fix 2023-06-01 13:58:24 -05:00
jllllll
198e079da4
Restrict build_wheels.yml to minimum compute 6.0 2023-06-01 13:25:04 -05:00
jllllll
a0063fc9db
Add GitHub Actions bypass for cuda check to setup.py 2023-06-01 13:07:00 -05:00
jllllll
3084422095
Merge branch 'PanQiWei:main' into main 2023-06-01 12:50:34 -05:00
PanQiWei
b5db750c00 update setup.py 2023-06-02 01:39:56 +08:00
jllllll
2b96343e87
Update build_wheels.yml (#1) 2023-06-01 12:39:56 -05:00
PanQiWei
6a37f7c266 update setup.py 2023-06-02 00:03:44 +08:00