I have a custom environment and the following standard code for training a PPO model

    import os

    import torch
    import wandb
    from stable_baselines3 import PPO
    from wandb.integration.sb3 import WandbCallback

    if __name__ == "__main__":
        torch.set_num_threads(1)
        # These must be set as environment variables to take effect;
        # bare module-level assignments are ignored by wandb.
        os.environ["WANDB_NOTEBOOK_NAME"] = "name"  # CHANGE
        os.environ["WANDB_API_KEY"] = "code"  # CHANGE

        # CustomOfflineEnv is my custom environment, defined elsewhere.
        vec_env = CustomOfflineEnv(data="data.csv")
    
        verbosity=2
        
        config = {
            "policy_type": "MlpPolicy",
            "total_timesteps": 66000
        }
    
        name='66000-ppo'
        run = wandb.init(
            project="projectName",
            config=config,
            sync_tensorboard=True,
            monitor_gym=True,
            name=name
        )
    
        model = PPO(policy=config["policy_type"], env=vec_env,
                    batch_size=256,
                    verbose=verbosity, 
                    tensorboard_log=f"wandb/runs/{name}{run.id}",
                    )
           
        model.learn(
            total_timesteps=config["total_timesteps"],
            callback=[WandbCallback(
                gradient_save_freq=100,
                model_save_freq=100,
                model_save_path=f"../../training/models/ppo/{name}{run.id}",
                verbose=verbosity),]
        )
        run.finish()

When I open my wandb dashboard, I only see the loss-related plots. I would like to see the reward the model collects so I can judge how it performs, but I have not found any useful material online.
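For context on where the reward plot comes from: stable-baselines3 only logs `rollout/ep_rew_mean` to TensorBoard (and therefore to wandb via `sync_tensorboard=True`) when the environment reports per-episode statistics, which is what its `Monitor` wrapper adds. A minimal sketch of the bookkeeping that wrapper performs (pure Python, old-style `step` signature assumed for brevity, the inner env is hypothetical):

```python
class EpisodeRewardTracker:
    """Sketch of the episode-return bookkeeping done by SB3's Monitor
    wrapper, whose output feeds the rollout/ep_rew_mean chart."""

    def __init__(self, env):
        self.env = env
        self._current_return = 0.0
        self.episode_returns = []  # one entry per finished episode

    def reset(self, **kwargs):
        self._current_return = 0.0
        return self.env.reset(**kwargs)

    def step(self, action):
        obs, reward, done, info = self.env.step(action)
        self._current_return += reward
        if done:
            # Monitor exposes this under info["episode"]["r"]; SB3's logger
            # averages these values into rollout/ep_rew_mean.
            info = dict(info, episode={"r": self._current_return})
            self.episode_returns.append(self._current_return)
            self._current_return = 0.0
        return obs, reward, done, info
```

In practice the likely fix is simply wrapping the environment, e.g. `vec_env = Monitor(CustomOfflineEnv(data="data.csv"))` with `from stable_baselines3.common.monitor import Monitor` (or `VecMonitor` from `stable_baselines3.common.vec_env` if the env is already vectorized), before passing it to `PPO`.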
