# Qwen2.5-VL-W4A16-G128 / recipe.yaml
# Uploaded by jeffcookio using huggingface_hub (commit d40da64)
DEFAULT_stage:
  DEFAULT_modifiers:
    GPTQModifier:
      sequential_targets: [Qwen2_5_VLDecoderLayer]
      scheme: W4A16
      targets: Linear
      ignore: [lm_head, 're:visual.*']
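
# Note (a hedged sketch, not part of the uploaded recipe): the preset name
# W4A16 is commonly understood to mean 4-bit integer weights with activations
# left at 16-bit precision, and the G128 in the repository name suggests a
# group size of 128. Assuming those preset values, the `scheme` and `targets`
# lines above could be written out explicitly, roughly as follows. The field
# names follow the compressed-tensors quantization config; the specific values
# (in particular `symmetric: true`) are assumptions to verify against the
# installed library version.
#
#     GPTQModifier:
#       sequential_targets: [Qwen2_5_VLDecoderLayer]
#       ignore: [lm_head, 're:visual.*']
#       config_groups:
#         group_0:
#           targets: [Linear]
#           weights:
#             num_bits: 4
#             type: int
#             symmetric: true
#             strategy: group
#             group_size: 128
#
# The `re:` prefix in `ignore` marks a regular expression, so 're:visual.*'
# is intended to leave modules whose names start with "visual" (the vision
# tower) unquantized, alongside lm_head.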