You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Describe the bug
The observation and info returned at the last step in PointMaze with continuing_task=True, aren't updated (i.e. they contain the old goal). This is not the intended general semantics: in a common RL loop, the agent will use the old observation to predict the action to go to the old goal, instead of the new one.
The text was updated successfully, but these errors were encountered:
younik
changed the title
[Bug Report] Info semantics in PointMaze with continuing_task
[Bug Report] Obs and info semantics in PointMaze with continuing_task
Nov 29, 2024
Describe the bug
The observation and info returned at the last step in PointMaze with
continuing_task=True
, aren't updated (i.e. they contain the old goal). This is not the intended general semantics: in a common RL loop, the agent will use the old observation to predict the action to go to the old goal, instead of the new one.See related issue: Farama-Foundation/Minari#265
See:
Gymnasium-Robotics/gymnasium_robotics/envs/maze/point_maze.py
Lines 392 to 406 in 3719d9d
Code example
You need an expert policy to see this; check https://github.com/Farama-Foundation/minari-dataset-generation-scripts/blob/main/scripts/pointmaze/create_pointmaze_dataset.py
The text was updated successfully, but these errors were encountered: