I was using the “done” information from the environment as part of the observation. This means I made a minor modification to the challenge.py file:
action = agent.act(state, done)  # originally: action = agent.act(state)
Can I pass the binary variable “done” to the agent? Or should the agent itself be aware of when it reaches the target?
Thanks for your question. You should not use the binary variable “done” as part of the observation. The agent does not need to be aware that it has reached the goal. Once it converges to the goal, the episode will finish automatically. So passing “done” as part of the observation will not help in solving the task.
Hi @Ruddra_dev, the target position is given by the first two dimensions of obs['sensor']. This gives the goal location in the agent’s frame. You don’t need to change the challenge.py file.
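
To make that concrete, here is a rough sketch of an agent that reads the goal from the observation and keeps the original act(state) signature, so “done” is never passed in. It assumes the observation is a dict whose 'sensor' entry can be indexed like a list or array. The SketchAgent class, the goal_from_obs helper, and the sample values are made up for illustration and are not part of the challenge code.

```python
def goal_from_obs(obs):
    """Return the first two entries of obs['sensor'] as a tuple.

    Per the reply above, these two values are the goal location
    in the agent's frame.
    """
    sensor = obs["sensor"]
    return float(sensor[0]), float(sensor[1])


class SketchAgent:
    """Hypothetical agent stub; not the challenge's real agent class."""

    def __init__(self):
        self.last_goal = None

    def act(self, state):
        # Only the observation is used here; no "done" flag is needed.
        self.last_goal = goal_from_obs(state)
        # Placeholder: a real policy would compute an action from the goal.
        return None


# Made-up observation, for illustration only.
example_obs = {"sensor": [1.5, -0.5, 0.0, 0.0]}
agent = SketchAgent()
agent.act(example_obs)
print(agent.last_goal)
```

With something like this, the call in challenge.py can stay as action = agent.act(state).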