Engineering Design Automation via Imitation Learning and Reinforcement Learning

dc.contributor.advisorThekinen, Joseph
dc.contributor.authorBozorgmehry Boozarjomehry, Ghazal
dc.contributor.committeememberBisheban, Mahdis
dc.contributor.committeememberYanushkevich, Svetlana
dc.date2025-06
dc.date.accessioned2025-02-05T23:28:14Z
dc.date.available2025-02-05T23:28:14Z
dc.date.issued2025-01-31
dc.description.abstractReinforcement Learning (RL) has achieved notable success in robotics and gaming, yet its application to automating engineering design faces significant challenges, including slow training times and poor generalization. Traditional RL methods require exploring millions of design states, which is computationally expensive, especially when dealing with complex physics models. In contrast, behavioral cloning, which enables RL agents to mimic human designers based on their decision data, presents a more resource-efficient alternative. This thesis investigates the use of both RL and imitation learning to automate engineering design, using aircraft design as a surrogate task to model engineering design. We evaluate the performance of a behavioral cloning agent trained on human design decision data, employing recurrent neural networks such as GRU, LSTM, and simple-RNN. We define a metric Q-score, which quantifies design quality on a scale between 0 and 1, with higher values indicating better design quality. Our findings demonstrate that the GRU architecture outperforms both LSTM and simple-RNN in terms of accuracy, achieving a Q-score of 0.8 after training on a relatively small dataset. The GRU model strikes an optimal balance between accuracy, simplicity, and computational efficiency, making it particularly suitable for surrogate design tasks like aircraft design optimization. Additionally, we assess the performance of RL agents, specifically Proximal Policy Optimization and Advantage Actor-Critic ,in the same design task. Both RL approaches achieved higher Q-scores (up to 0.99) but incurred significant computational costs and required extensive training time. In contrast, behavioral cloning provided a faster, more computationally efficient approach, though its performance was constrained by the availability of labeled human decision data. The results suggest that while RL methods excel in exploration and optimization, imitation learning offers a faster and more resource-efficient solution, albeit with reduced exploration and adaptability.
dc.identifier.citationBozorgmehry Boozarjomehry, G. (2025). Engineering design automation via imitation learning and reinforcement learning (Master's thesis, University of Calgary, Calgary, Canada). Retrieved from https://prism.ucalgary.ca.
dc.identifier.urihttps://hdl.handle.net/1880/120726
dc.language.isoen
dc.publisher.facultySchulich School of Engineering
dc.publisher.institutionUniversity of Calgary
dc.rightsUniversity of Calgary graduate students retain copyright ownership and moral rights for their thesis. You may use this material in any way that is permitted by the Copyright Act or through licensing that has been assigned to the document. For uses that are not allowable under copyright legislation or licensing, you are required to seek permission.
dc.subject.classificationEngineering--Mechanical
dc.subject.classificationArtificial Intelligence
dc.titleEngineering Design Automation via Imitation Learning and Reinforcement Learning
dc.typemaster thesis
thesis.degree.disciplineEngineering – Mechanical & Manufacturing
thesis.degree.grantorUniversity of Calgary
thesis.degree.nameMaster of Science (MSc)
ucalgary.thesis.accesssetbystudentI do not require a thesis withhold – my thesis will have open access and can be viewed and downloaded publicly as soon as possible.
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ucalgary_2025_bozorgmehryboozarjomehry_ghazal.pdf
Size:
1.08 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.62 KB
Format:
Item-specific license agreed upon to submission
Description: