Commit graph

15 commits

Author SHA1 Message Date
PanQiWei
cfd27e8caa refactor file structure of qlinears 2023-05-26 07:18:16 +08:00
PanQiWei
db63c0876a half out 2023-05-23 16:08:28 +08:00
PanQiWei
86b3b52c63 fix ImportError when triton is not installed 2023-05-20 16:15:20 +08:00
PanQiWei
2273f9ef39 refactor file structure for triton kernels 2023-05-14 11:49:10 +08:00
TheBloke
1b3329b399 Fix 'groupsize' -> 'group_size' in all other .py files. I haven't touched any CUDA kernels in case there's any complexity there I don't understand 2023-05-05 14:44:16 +01:00
qwopqwop200
50c0fd13c5 Multi-GPU, allocate output tensor 2023-05-02 17:51:41 +09:00
qwopqwop200
c9215a1b5b change div num 2023-04-28 22:42:29 +09:00
qwopqwop200
19f167e58b add raise-exception 2023-04-28 22:24:44 +09:00
qwopqwop200
329a64ed40 support conv1d,conv2d 2023-04-28 09:15:42 +09:00
PanQiWei
bf2ae6768d bug fix 2023-04-26 13:33:56 +08:00
PanQiWei
73cb1dbf09 optimize import and format code 2023-04-26 13:08:47 +08:00
PanQiWei
c35dce525e format code 2023-04-25 22:58:52 +08:00
PanQiWei
9f7f44146f format code 2023-04-25 22:45:27 +08:00
PanQiWei
b71211b4c3 format code 2023-04-25 22:36:28 +08:00
PanQiWei
9c405b1628 add triton support 2023-04-25 20:05:22 +08:00