-
Notifications
You must be signed in to change notification settings - Fork 231
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Phi-3 mini support? #459
Comments
Hi @vackosar, I am open to PRs from the community. For now, I will not have time to include it. |
@casper-hansen thank you. Is there any guidance on how to do that for architecture like Phi-3? |
I checked its architecture and it shouldn't be very hard to implement basic quantization. But its position encoding is special (LongRoPE) and implementing the fusion layer might need more work. |
It seems that the 4k version is the base and maybe uses standard Rope. That would reduce needed effort. Is there any way to test? https://huggingface.co/microsoft/Phi-3-mini-4k-instruct/tree/main |
This can easily be ported to AutoAWQ if someone has the time. |
Somebody implemented a merge request here: But it was not merged yet. |
Not the most powerful, but a useful model:
https://huggingface.co/microsoft/Phi-3-mini-128k-instruct
The text was updated successfully, but these errors were encountered: