| Author | Commit | Message | Date |
|---|---|---|---|
| 潘其威(William) | beab695c5b | Merge branch 'main' into xformers_integration | 2023-08-10 15:27:11 +08:00 |
| PanQiWei | 2826729e73 | use pytorch normal forward logic when trainable is True | 2023-08-06 11:44:29 +08:00 |
| Felix Marty | 38447262c0 | fix fused attn | 2023-07-31 13:46:32 +00:00 |
| Felix Marty | 179776bd1d | exllama kernel | 2023-07-31 11:50:45 +00:00 |
| PanQiWei | 5883b45d73 | fix error raised when cuda kernels are not installed | 2023-07-26 13:59:28 +08:00 |
| qwopqwop200 | 9578c59d31 | fix cuda bug | 2023-07-25 16:50:05 +09:00 |
| qwopqwop200 | f4820f2988 | change qlinear cuda support 64dim | 2023-06-03 07:30:34 +09:00 |
| qwopqwop200 | b03f53294f | support 64dim cuda | 2023-06-02 19:53:50 +09:00 |
| qwopqwop200 | 0f2841cb13 | remove log | 2023-05-30 23:51:55 +09:00 |
| qwopqwop200 | 5274313067 | change if trainable backend pytorch | 2023-05-30 23:40:58 +09:00 |
| PanQiWei | 2b532f9453 | add trainable mode | 2023-05-26 13:11:30 +08:00 |
| PanQiWei | 69609c4bc7 | support faster vecquant4matmul cuda kernel | 2023-05-26 08:55:05 +08:00 |
| PanQiWei | cfd27e8caa | refactor file structure of qlinears | 2023-05-26 07:18:16 +08:00 |