Quality of VM #24

Open
RewindL opened this issue Jan 21, 2025 · 0 comments

RewindL commented Jan 21, 2025

Thanks for your impressive work.

I trained the Mistral VM following PRM/train_VM_mistral.py and used it to guide ToT evaluation in evaluate.py. However, after training for 2 epochs with the recommended settings, the test accuracy is only 0.1582, and the model assigns unreliable scores to tree nodes.

Since the default depth and branch settings are limited and the nodes to explore are ranked by their values, a relatively accurate score seems necessary. Is this result expected, and if so, how do you handle it?
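
To illustrate the concern, here is a minimal sketch of value-ranked node selection. It is not the repository's code: `select_top_k`, the candidate tuples, and both scoring functions are hypothetical and exist only to show how a miscalibrated value model can prune the best branch when the branch budget is small.

```python
import heapq

def select_top_k(nodes, value_fn, k):
    """Keep the k nodes with the highest score under value_fn."""
    return heapq.nlargest(k, nodes, key=value_fn)

# Hypothetical candidates: (name, true_quality). Values are made up for illustration.
candidates = [("a", 0.9), ("b", 0.5), ("c", 0.1)]

def accurate_value(node):
    # An idealized value model that scores a node by its true quality.
    return node[1]

def inverted_value(node):
    # A badly miscalibrated value model that ranks nodes in reverse order.
    return 1.0 - node[1]

# With a branch budget of 1, the kept node depends entirely on the scorer.
kept_good = select_top_k(candidates, accurate_value, k=1)
kept_bad = select_top_k(candidates, inverted_value, k=1)
```

With a budget of one node per step, the accurate scorer keeps `a` while the miscalibrated one keeps `c`, so the search cannot recover the better branch later.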

Looking forward to your help, thanks.
