Trajectory tracking control of a quadrotor with reinforcement learning

Çakmak, Eren

Trajectory tracking control of a quadrotor with reinforcement learning

dc.contributor.advisor	Doğan, Mustafa
dc.contributor.author	Çakmak, Eren
dc.contributor.authorID	504181134
dc.contributor.department	Control Engineering
dc.date.accessioned	2024-06-26T06:55:12Z
dc.date.available	2024-06-26T06:55:12Z
dc.date.issued	2023-01-23
dc.description	Thesis (M.Sc.) -- İstanbul Technical University, Graduate School, 2023
dc.description.abstract	Drone control algorithms are usually broken down into several steps. The innermost parts of a drone control algorithm are angle and angular velocity control loops. Whether it is fixed-wing or rotary-wing, these control loops conventionally consist of PID based controllers. Although a PID controller can control these loops successfully, it may not lead the outer loops to desired positions or velocities. An outer loop designed to manage these situations can be done with conventional controller loops. However, these kinds of controllers are heavily model-dependent and often require tuning. Motivated by this situation, the aim of the presented study is to show that reinforcement learning based algorithms can control a quadrotor drone without prior knowledge of the model. The most preferred model-free reinforcement algorithms in the literature are DDPG, TRPO, and PPO. The studies that use state-of-the-art reinforcement learning methods for quadcopter control are compared, and it is concluded that PPO is the best choice to begin with. An actor-critic neural network for PPO-clip, the most successful version of PPO, is built and trained on a custom Gym environment. The environment is a quadrotor model that covers fundamental dynamics. This study is composed of six chapters. In the first chapter, motivation of research and literature review are given. In the second chapter, the theoretical background to construct a quadrotor model is given, and a general picture of reinforcement learning and model-free algorithms is drawn. In the third chapter, a custom simulation environment using the features of Gym library is designed. Then, the neural network based controller is designed, in the fourth chapter. Next, the agent is trained in the custom environment, in the fifth chapter. The simulation results of hovering and trajectory tracking tests are given. In the last chapter, it is concluded that a model-free reinforcement learning-based neural network without any additional control loop can control a quadrotor, and possible future works for this study are discussed.
dc.description.degree	M.Sc.
dc.identifier.uri	http://hdl.handle.net/11527/24995
dc.language.iso	en_US
dc.publisher	Graduate School
dc.sdg.type	Goal 9: Industry, Innovation and Infrastructure
dc.subject	drone
dc.subject	insansız hava aracı
dc.subject	control theory
dc.subject	kontrol teorisi
dc.subject	orbit control
dc.subject	yörünge kontrolü
dc.subject	learning algorithms
dc.subject	öğrenme algoritmaları
dc.title	Trajectory tracking control of a quadrotor with reinforcement learning
dc.title.alternative	Pekiştirmeli öğrenme ile bir quadrotor'un yörünge takip kontrolü
dc.type	Master Thesis

Dosyalar

Orijinal seri

Şimdi gösteriliyor 1 - 1 / 1

Ad:: 504181134.pdf
Boyut:: 2.09 MB
Format:: Adobe Portable Document Format
Açıklama

İndir

Lisanslı seri

Şimdi gösteriliyor 1 - 1 / 1

Ad:: license.txt
Boyut:: 1.58 KB
Format:: Item-specific license agreed upon to submission
Açıklama

İndir

Koleksiyonlar

LEE- Kontrol ve Otomasyon Mühendisliği-Yüksek Lisans