Phi-3 mini support? #459

Open
vackosar opened this issue Apr 26, 2024 · 6 comments


@vackosar

Not the most powerful, but a useful model:

https://huggingface.co/microsoft/Phi-3-mini-128k-instruct

@casper-hansen
Owner

Hi @vackosar, I am open to PRs from the community. For now, I will not have time to include it.

@vackosar
Author

@casper-hansen thank you. Is there any guidance on how to do that for an architecture like Phi-3?

@TechxGenus
Contributor

I checked its architecture, and implementing basic quantization shouldn't be very hard. However, its position encoding is special (LongRoPE), so implementing the fused layers might need more work.

@vackosar
Author

It seems that the 4k version is the base and may use standard RoPE. That would reduce the effort needed. Is there any way to test this?

https://huggingface.co/microsoft/Phi-3-mini-4k-instruct/tree/main
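
One rough way to check might be to look at the `rope_scaling` field in each model's `config.json`. The sketch below assumes the common Hugging Face convention that this field is null or absent when a model uses standard RoPE and holds a dictionary when a scaled variant is used. I have not confirmed the field's contents for either Phi-3 checkpoint here. The helper name `uses_scaled_rope` and the two config fragments are made up for illustration, so the real files should be downloaded and inspected instead.

```python
import json


def uses_scaled_rope(config: dict) -> bool:
    """Return True if the config declares a non-empty `rope_scaling` entry.

    Assumption: a null or missing `rope_scaling` means standard RoPE.
    """
    return bool(config.get("rope_scaling"))


# Hypothetical, trimmed-down config fragments for illustration only.
# They are NOT copies of the real Phi-3 config.json files.
plain_cfg = json.loads('{"model_type": "phi3", "rope_scaling": null}')
scaled_cfg = json.loads(
    '{"model_type": "phi3", "rope_scaling": {"type": "example"}}'
)

print(uses_scaled_rope(plain_cfg))
print(uses_scaled_rope(scaled_cfg))
```

If the 4k checkpoint's real config reports no scaling, that would support treating it as the simpler target for a first port.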

@casper-hansen
Owner

This can easily be ported to AutoAWQ if someone has the time:
mit-han-lab/llm-awq#183

@vackosar
Author

vackosar commented Jun 3, 2024

Someone has opened a pull request for this:
#481

It has not been merged yet.
