Shivang Shrivastav – Medium

Shivang Shrivastav

TD3 Code Implementation: Taming Continuous Control with Twin Delayed DDPG

From theory to practice: Translating TD3 into code for robust and stable continuous control

2d ago

Unveiling the Nuances: Dissecting the Differences Between DDPG and TD3

Understanding how TD3 addresses inefficiencies of DDPG and improves performance in continuous control tasks

2d ago

DDPG Code Implementation: Applying Actor-Critic Methods for Continuous Control

Training Ant to Run: Implementing DDPG for Mastering Locomotion in a Simulated Environment

4d ago

DQN Code Implementation: Lunar Lander Descent with DQN and PyTorch Lightning

Lunar Lander: An AI Playground for Deep Reinforcement Learning

Mar 28

A2C Code Implementation: Your Code Companion for Deep RL

Acrobat Robot trained by Advantage Actor Critic in Reinforcement Learning

Mar 28

REINFORCE Code Implementation: Mastering Policy Gradients for Reinforcement Learning

Balancing the Cart-Pole: A Practical Implementation of the REINFORCE Algorithm

Mar 28

Generalized Advantage Estimation (GAE): A Deep Dive into Bias, Variance, and Policy Gradients

GAE: Balancing Bias and Variance for Efficient Policy Gradients

Mar 28

Proximal Policy Optimization (PPO): Clipping, Stability, and Sample Efficiency in Reinforcement…

Beyond Vanilla Policy Gradients: PPO for Enhanced Performance in Complex Environments

Mar 28

DDPG: DQN Re-engineered for Continuous Action Spaces

How the DDPG algorithm is built by modifying DQN in Reinforcement Learning

Mar 27

SAC vs. A2C: A Comparative Analysis of Reinforcement Learning Algorithms

Soft Actor-Critic (SAC): Unveiling the Power of Stochastic Policies for Optimal Control

Jan 5

Shivang Shrivastav

I am an AI enthusiast, passionate about learning Deep Learning and its applications. “Inquisitive by mind and explorer by soul”: this is my approach.
