AutoGPTQ/auto_gptq/modeling
2023-04-25 20:47:33 +08:00
..
__init__.py refactor file structure of modeling module 2023-04-23 17:33:09 +08:00
_base.py add inference_mode and autocast context manager to generate function 2023-04-25 20:47:33 +08:00
_const.py add support to MOSS model 2023-04-25 11:54:29 +08:00
_utils.py add triton support 2023-04-25 20:05:22 +08:00
auto.py refactor file structure 2023-04-25 18:58:20 +08:00
bloom.py first init 2023-04-14 01:09:40 +08:00
gpt_neox.py fix mismatch GPTNeoxForCausalLM's lm_head 2023-04-24 20:51:56 +08:00
gptj.py first init 2023-04-14 01:09:40 +08:00
llama.py fix bugs for attention_mask and position_ids 2023-04-20 18:32:21 +08:00
moss.py add support to MOSS model 2023-04-25 11:54:29 +08:00
opt.py fix bugs for attention_mask and position_ids 2023-04-20 18:32:21 +08:00