PanQiWei
|
1b226c7bcf
|
revert use absolute path in include_dirs
|
2023-06-08 14:03:08 +08:00 |
|
PanQiWei
|
7520133a74
|
upload sdist at every job
|
2023-06-08 14:02:46 +08:00 |
|
PanQiWei
|
0d4c54add9
|
remove build_sdist_wheel_cpu_only.yml
|
2023-06-08 14:02:21 +08:00 |
|
PanQiWei
|
bb4924e9a8
|
change workflow name
|
2023-06-08 12:33:59 +08:00 |
|
PanQiWei
|
8801cdf340
|
build_wheels.yml -> build_wheels_cuda.yml
|
2023-06-08 12:32:42 +08:00 |
|
PanQiWei
|
67bb388bf8
|
remove build_sdst job
|
2023-06-08 12:32:17 +08:00 |
|
PanQiWei
|
15d1981f25
|
add build_sdist_wheel_cpu_only.yml
|
2023-06-08 12:31:35 +08:00 |
|
PanQiWei
|
590685cad5
|
use absolute path in include_dirs
|
2023-06-08 12:30:14 +08:00 |
|
潘其威(William)
|
2ea23297c6
|
Merge pull request #140 from geekinglcq/fix_issue95
fix weights not transpose for Conv1D/2D in qlinear_cuda_old
|
2023-06-06 19:54:57 +08:00 |
|
Angainor Development
|
e75611e1b7
|
Fix error message
|
2023-06-05 22:19:09 +02:00 |
|
lunar
|
618a5f50ee
|
Add transpose operator when replace Conv1d with qlinear_cuda_old
|
2023-06-05 23:11:18 +08:00 |
|
潘其威(William)
|
bf521cbe7b
|
Merge pull request #134 from TheBloke/TB_benchmark
add command flags inject_fused_attention and inject_fused_mlp
|
2023-06-05 23:02:36 +08:00 |
|
潘其威(William)
|
8a1616c63c
|
Merge pull request #102 from PanQiWei/peft_integration
Peft integration
|
2023-06-05 22:55:34 +08:00 |
|
PanQiWei
|
129884c598
|
update version to 0.3.0.dev0
|
2023-06-05 22:53:50 +08:00 |
|
PanQiWei
|
b132d774e3
|
update README
|
2023-06-05 22:53:17 +08:00 |
|
TheBloke
|
edb13d493e
|
Default inject_fused_attention and mlp to True, matching defaults
|
2023-06-03 17:58:40 +01:00 |
|
TheBloke
|
4617629f0c
|
Support setting inject_fused_attention and inject_fused_mlp to False
|
2023-06-03 17:48:36 +01:00 |
|
PanQiWei
|
923fc87a11
|
Merge branch 'main' into peft_integration
|
2023-06-03 19:10:41 +08:00 |
|
潘其威(William)
|
023bb1c593
|
Merge pull request #125 from PanQiWei/support-32dim
Support 32dim
|
2023-06-03 19:08:29 +08:00 |
|
潘其威(William)
|
95a4381f50
|
Merge pull request #126 from PanQiWei/support-cuda-64dim
Support cuda 64dim
|
2023-06-03 19:08:12 +08:00 |
|
潘其威(William)
|
810ed4de66
|
Merge pull request #132 from EliEron/patch-1
Specify UTF-8 encoding for README.md in setup.py
|
2023-06-03 10:57:52 +08:00 |
|
qwopqwop200
|
f4820f2988
|
change qlinear cuda support 64dim
|
2023-06-03 07:30:34 +09:00 |
|
qwopqwop200
|
8951212ab3
|
change setup
|
2023-06-03 07:29:19 +09:00 |
|
qwopqwop200
|
e04c3b86cc
|
add cuda
|
2023-06-03 07:28:35 +09:00 |
|
qwopqwop200
|
5fc2064e1a
|
Rename autogptq_cuda_kernel.cu to autogptq_cuda_kernel_64.cu
|
2023-06-03 07:27:45 +09:00 |
|
qwopqwop200
|
446e12d3de
|
Rename autogptq_cuda.cpp to autogptq_cuda_64.cpp
|
2023-06-03 07:27:31 +09:00 |
|
EliEron
|
eeb8b78a55
|
Specify Encoding when reading README.md
Prevents UnicodeDecodeError from being raised in certain locals.
|
2023-06-02 20:54:32 +02:00 |
|
潘其威(William)
|
b4fdd8d264
|
Merge branch 'main' into peft_integration
|
2023-06-02 19:11:59 +08:00 |
|
PanQiWei
|
7206705456
|
set version to 0.2.1
|
2023-06-02 19:07:56 +08:00 |
|
qwopqwop200
|
2df7d7105d
|
support 64 cuda dim
|
2023-06-02 19:54:37 +09:00 |
|
qwopqwop200
|
b03f53294f
|
support 64dim cuda
|
2023-06-02 19:53:50 +09:00 |
|
qwopqwop200
|
90106d7c34
|
support cuda 64dim
|
2023-06-02 19:49:38 +09:00 |
|
PanQiWei
|
65c0115b86
|
update README
|
2023-06-02 18:18:11 +08:00 |
|
PanQiWei
|
0e609bec40
|
only append CUDA_VERSION to release version string when in github actions
|
2023-06-02 18:16:38 +08:00 |
|
qwopqwop200
|
0891ea4036
|
support 32dim triton]
|
2023-06-02 19:05:55 +09:00 |
|
qwopqwop200
|
b3654a68c3
|
support 32dim triton kernel
|
2023-06-02 19:04:12 +09:00 |
|
PanQiWei
|
50ac2ad4bc
|
update README
|
2023-06-02 10:59:36 +08:00 |
|
PanQiWei
|
113884d976
|
Merge remote-tracking branch 'origin/main'
|
2023-06-02 10:57:16 +08:00 |
|
PanQiWei
|
b248a2655a
|
update README
|
2023-06-02 10:56:57 +08:00 |
|
潘其威(William)
|
f948b56c07
|
Merge pull request #123 from jllllll/main
Fix and extend build_wheels.yml workflow
|
2023-06-02 10:12:20 +08:00 |
|
jllllll
|
0f1793b554
|
Revert "Remove workflow restriction for testing"
This reverts commit e62bda1c1e .
|
2023-06-01 20:49:42 -05:00 |
|
jllllll
|
3c6a002be5
|
Clean up workflow sdist creation
|
2023-06-01 20:35:30 -05:00 |
|
jllllll
|
e62bda1c1e
|
Remove workflow restriction for testing
|
2023-06-01 20:27:40 -05:00 |
|
jllllll
|
996382788b
|
Finalize workflow fix
|
2023-06-01 13:58:24 -05:00 |
|
jllllll
|
198e079da4
|
Restrict build_wheels.yml to minimum compute 6.0
|
2023-06-01 13:25:04 -05:00 |
|
jllllll
|
a0063fc9db
|
Add GitHub Actions bypass for cuda check to setup.py
|
2023-06-01 13:07:00 -05:00 |
|
jllllll
|
3084422095
|
Merge branch 'PanQiWei:main' into main
|
2023-06-01 12:50:34 -05:00 |
|
PanQiWei
|
b5db750c00
|
update setup.py
|
2023-06-02 01:39:56 +08:00 |
|
jllllll
|
2b96343e87
|
Update build_wheels.yml (#1)
|
2023-06-01 12:39:56 -05:00 |
|
PanQiWei
|
6a37f7c266
|
update setup.py
|
2023-06-02 00:03:44 +08:00 |
|