AutoGPTQ/auto_gptq/modeling
2023-04-26 16:55:01 +08:00
File         Last commit message                                                 Date
__init__.py  refactor file structure of modeling module                          2023-04-23 17:33:09 +08:00
_base.py     add support to use push_to_hub to upload and share quantized model  2023-04-26 16:55:01 +08:00
_const.py    add support to MOSS model                                           2023-04-25 11:54:29 +08:00
_utils.py    bug fix                                                             2023-04-26 13:33:56 +08:00
auto.py      add support to use push_to_hub to upload and share quantized model  2023-04-26 16:55:01 +08:00
bloom.py     first init                                                          2023-04-14 01:09:40 +08:00
gpt_neox.py  fix mismatch GPTNeoxForCausalLM's lm_head                           2023-04-24 20:51:56 +08:00
gptj.py      first init                                                          2023-04-14 01:09:40 +08:00
llama.py     fix bugs for attention_mask and position_ids                        2023-04-20 18:32:21 +08:00
moss.py      add support to MOSS model                                           2023-04-25 11:54:29 +08:00
opt.py       fix bugs for attention_mask and position_ids                        2023-04-20 18:32:21 +08:00