Skip to content

Question about perfomance and hyperparameters #450

@Rodyarad

Description

@Rodyarad

Hello, thanks for your work!

We have implemented a custom environment with discrete action spaces. We’ve observed that after reaching a certain level of performance (in terms of reward or success rate), the results begin to degrade during further training (we’re training with UniZero).

We have also encountered similar behavior with the original EfficientZero and EfficientZeroV2 repositories when running other custom environments.

Have you encountered performance degradation after a plateau? Are there any specific hyperparameters or strategies for solving it?

Thank you in advance for your response.

Metadata

Metadata

Assignees

No one assigned

    Labels

    discussionDiscussion of a typical issue or concept

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions