Does the framework support evaluation of quantized models, such as AWQ models? #1393
Unanswered
watermelon-lee
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I attempted to test the quantized Qwen (AWQ) model, but the performance dropped significantly. Even after downloading the official quantized model, the results were still far from satisfactory. Does this framework support the evaluation of quantized models?
my script is:
python run.py --datasets ceval_ppl_578f8d
--hf-type base
--hf-path $MODEL_PATH
--model-kwargs torch_dtype='torch.float16' trust_remote_code=True device_map='auto'
--max-out-len 1024
--min-out-len 1
--hf-num-gpus 1
--batch-size 1
--max-num-workers 1
--stop-words "<|im_end|>" "<|endoftext|>"
--w "outputs/${SAVE_NAME}/p1"
--debug
Beta Was this translation helpful? Give feedback.
All reactions