Rafael Stekolshchik – Medium

Rafael Stekolshchik

Rafael Stekolshchik
in
Towards Data Science

Entropy in Soft Actor-Critic (Part 2)

10 min readJun 7, 2021

--

Entropy in Soft Actor-Critic (Part 2)

--

Rafael Stekolshchik
in
Towards Data Science

Entropy in Soft Actor-Critic (Part 1)

9 min readMay 4, 2021

--

1

Entropy in Soft Actor-Critic (Part 1)

--

1

Rafael Stekolshchik
in
Towards Data Science

Three aspects of Deep RL: noise, overestimation and exploration

Noise can be harmful, it can lead to a systematic overestimation. However, noise can be useful, such as noise for exploration.

11 min readMay 1, 2020

--

Three aspects of Deep RL: noise, overestimation and exploration

--

Rafael Stekolshchik
in
Towards Data Science

A pair of interrelated neural networks in DQN

DQN and Double DQN models. In these models, comparing two interrelated neural networks is crucial.

10 min readMar 18, 2020

--

A pair of interrelated neural networks in DQN

--

Rafael Stekolshchik
in
Towards Data Science

How does the Bellman equation work in Deep RL?

The connection between Bellman equation and Neural Networks, with formulas, examples and Python code

11 min readFeb 11, 2020

--

2

Artifician Intelligence and Neural Network Learning System Art

--

2

Rafael Stekolshchik

Rafael Stekolshchik

Ph.D. in Math, Algorithm and SW developer, Researcher. Fan of Deep Learning and Neural Networks. @r.stekol

Following

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams