Skip to content

After some steps, all the NNs always output same action #75

@Eify666666

Description

@Eify666666

I'm training a A3C these days, but the NN always take the same action, after some steps.
The game I train for is similar to playing Go. There will be few reward in the short term. So it hard to learn something useful for the NN form the game. Maybe that is where the problem is. I tried ' torch.nn.utils.clip_grad_norm(lnet.parameters(), 50) ', and used relu as activate function. But it doesn't work.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions