GitHub - pondbooks/CDQL_with_Sim

Source Code of Continuous Deep Q-Learning with Simulator for Stabilization of Uncertain Discrete-Time Systems (Sim-CDQL)

The pre-trained DNN parameter vectors are in weight_DNN.

The example discrete-time system is a pendulum whose dynamics are discretized by the Euler method with step size 2**(-4). The dynamics are described in Pendulum.pdf (https://github.com/pondbooks/CDQL_with_Sim/blob/main/Pendulum.pdf). The angle ranges over [-np.pi, np.pi].
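As a rough illustration of what an Euler discretization with step size 2**(-4) looks like, here is a minimal sketch. The pendulum model and every parameter in it (m, l, g, xi) and the function names pendulum_rhs and euler_step are hypothetical placeholders, not taken from Pendulum.pdf; only the step size comes from this README.

```python
import numpy as np

DT = 2 ** (-4)  # Euler step size stated in this README


def pendulum_rhs(x, u, m=1.0, l=1.0, g=9.81, xi=0.01):
    # Hypothetical continuous-time pendulum model (an assumption for
    # illustration; the model actually used is described in Pendulum.pdf).
    theta, omega = x
    omega_dot = (g / l) * np.sin(theta) - (xi / (m * l**2)) * omega + u / (m * l**2)
    return np.array([omega, omega_dot])


def euler_step(x, u, dt=DT):
    # Forward Euler update: x[k+1] = x[k] + dt * f(x[k], u[k])
    return x + dt * pendulum_rhs(x, u)


# Simulate 16 steps (one unit of time) from a small initial angle with zero input.
x = np.array([0.1, 0.0])
for _ in range(16):
    x = euler_step(x, u=0.0)
```

Swapping in the dynamics from Pendulum.pdf only requires replacing the body of pendulum_rhs; the update rule in euler_step stays the same.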

Normalize an angle

We normalize the angle with the np.arctan2(y, x) function (https://numpy.org/doc/stable/reference/generated/numpy.arctan2.html). The normalized angle lies in [-np.pi, np.pi]. First, angle_normalize receives an angle theta. Second, it computes the x-coordinate x = np.cos(theta) and the y-coordinate y = np.sin(theta) of the corresponding point on the unit circle. Finally, it returns the normalized angle np.arctan2(y, x).

import numpy as np

def angle_normalize(theta):
  # Map theta to the point (x, y) on the unit circle.
  x = np.cos(theta)
  y = np.sin(theta)
  # arctan2 recovers the equivalent angle in [-np.pi, np.pi].
  return np.arctan2(y, x)
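The following sketch checks the wrapping behaviour on a few sample angles and compares it with a modular-arithmetic variant. The name angle_normalize_mod and the sample values are assumptions added for illustration; they are not part of the repository. The two versions agree away from the boundary, and differ only in how they treat angles that land exactly on an odd multiple of np.pi.

```python
import numpy as np


def angle_normalize(theta):
    # Same approach as the function above: unit-circle coordinates, then arctan2.
    return np.arctan2(np.sin(theta), np.cos(theta))


def angle_normalize_mod(theta):
    # Hypothetical alternative using the modulo operator; maps into [-pi, pi).
    return (theta + np.pi) % (2 * np.pi) - np.pi


# Sample angles: already in range, one turn ahead, one turn behind, just past pi.
thetas = np.array([0.5, 2 * np.pi + 0.5, -2 * np.pi - 0.5, np.pi + 0.3])
wrapped = angle_normalize(thetas)
wrapped_mod = angle_normalize_mod(thetas)
```

Both functions accept scalars as well as NumPy arrays, since every operation used is element-wise.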

The figure shows the angle_normalize function over the range [-5*np.pi, 5*np.pi]. The red lines mark the angles -5*np.pi, -3*np.pi, -np.pi, np.pi, 3*np.pi, 5*np.pi. The blue lines mark the normalized angles -3*np.pi, 3*np.pi.

warning (2022/4/10)

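In this source code, we use A**(-1) to compute an inverse matrix. However, A**(-1) is a matrix inverse only when A is defined as np.matrix; for an ordinary np.ndarray it takes the reciprocal of each element. A**(-1) should therefore be changed to np.linalg.inv(A). Fortunately, this example considers a 1-dimensional problem, where the element-wise reciprocal and the matrix inverse coincide, so we obtain the same result.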
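The difference can be seen in a small sketch. The matrices A and A1 below are made-up examples, not values from the repository.

```python
import numpy as np

# A 2x2 example: on an ndarray, ** acts element by element.
A = np.array([[2.0, 1.0],
              [1.0, 3.0]])
elementwise = A ** (-1)        # reciprocal of each entry, not the inverse
inverse = np.linalg.inv(A)     # true matrix inverse

# A 1x1 example: here the two computations give the same number.
A1 = np.array([[4.0]])
same_in_1d = np.allclose(A1 ** (-1), np.linalg.inv(A1))
```

For the 2x2 case, A @ inverse is the identity matrix while A @ elementwise is not, which is why np.linalg.inv is the safer choice once the problem has more than one dimension.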

PC

CPU: Intel Core i7-10700 (1200/2.9/16M/C8/T16)

Memory: Samsung M378A2K43CB1-CTD (DDR4 PC4-21300) 16GB×2

Motherboard: ASUS PRIME H470-PLUS (H470 1200 DDR4 ATX)

GPU: NVIDIA(R) GeForce RTX 2070 SUPER

OS: Windows 10

Python: 3.6.10

PyTorch: 1.5.1

matplotlib: 3.3.0

numpy: 1.18.5

