fix attention in qwen model #222

atticusg · 2025-06-15T20:59:47Z

Description

A quick bugfix: the qwen file referenced "head_dim", which does not exist. It is now replaced with the per-head dimension derived from "hidden_size" and "num_attention_heads" (that is, hidden_size / num_attention_heads).

With this fix, my pyvene code runs with qwen.
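For illustration only, here is a minimal sketch of the idea behind the fix: derive the per-head dimension from the hidden size and the number of attention heads when "head_dim" is not available. The helper name `resolve_head_dim` and the `SimpleNamespace` stand-in config are hypothetical and are not taken from the pyvene source or from this PR's diff; the numeric values are made up.

```python
from types import SimpleNamespace


def resolve_head_dim(config):
    """Return the per-head dimension for an attention config.

    Hypothetical helper (not part of pyvene): prefers an explicit
    `head_dim` attribute and otherwise falls back to
    hidden_size // num_attention_heads.
    """
    head_dim = getattr(config, "head_dim", None)
    if head_dim is not None:
        return head_dim
    if config.hidden_size % config.num_attention_heads != 0:
        raise ValueError(
            "hidden_size must be divisible by num_attention_heads"
        )
    return config.hidden_size // config.num_attention_heads


# Stand-in for a model config that has no `head_dim` attribute.
# The values are illustrative, not those of any specific qwen checkpoint.
cfg = SimpleNamespace(hidden_size=1024, num_attention_heads=16)
print(resolve_head_dim(cfg))  # 64
```

The fallback keeps an explicit "head_dim" when one is present, so the sketch would also cover configs that define the attribute directly.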

fix attention in qwen model

ac44a84

aryamanarora approved these changes Jul 25, 2025

aryamanarora merged commit 3c6cb78 into stanfordnlp:main Jul 25, 2025
2 checks passed