Quality of VM #24

RewindL · 2025-01-21T07:46:53Z

Thanks for your impressive work.

I have trained mistral VM following PRM/train_VM_mistral.py and use it to guide ToT evaluation in evaluate.py. But after training 2 epochs following recommended setting, the test accuracy is only 0.1582. And it outputs unreliable scores to tree nodes.

Since the default depth and branch is limited and exploring nodes are ranked by values, an relatively accurate score seems necessary. So I wonder if this is a normal situation, and how do you handle this problem?

Looking forward for your help, thanks.

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Quality of VM #24

Quality of VM #24

RewindL commented Jan 21, 2025

Quality of VM #24

Quality of VM #24

Comments

RewindL commented Jan 21, 2025