Posted more than 30 days ago

Master's Thesis - Enhance Performance of Neural-Network-Based Action Masking

Technische Universität München

Universities and colleges

München

Employment type: Full-time
On-site

About this job

17.08.2025, Student assistants, internships, student theses

Provably safe reinforcement learning is critical for real-world safety-critical applications. One of the core challenges is to ensure that the agent does not take unsafe actions during either training or deployment. Action masking is a common technique for preventing the agent from selecting unsafe actions. Current methods often rely on hand-crafted rules or heuristics to define and compute safe actions, which can be conservative and difficult to scale. Neural networks have shown promise in learning to mask unsafe actions directly from data, and the learned masks can then be used to train safe reinforcement learning agents. However, the performance of neural-network-based action masking is limited, especially in complex and dynamic environments.
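To make the idea of action masking concrete, here is a minimal sketch, assuming a discrete action space. It is not taken from the existing pipeline: the function names `masked_action_probs` and `sample_safe_action` are hypothetical, and the safety mask is simply passed in as a list of booleans, whereas in the thesis setting it would come from hand-crafted rules or a learned network.

```python
import math
import random


def masked_action_probs(logits, safe_mask):
    """Softmax over policy logits with unsafe actions forced to probability zero.

    Hypothetical helper: `safe_mask[i]` is True if action i is considered safe.
    """
    if not any(safe_mask):
        raise ValueError("no safe action available")
    # Subtract the largest safe logit for numerical stability.
    max_logit = max(l for l, ok in zip(logits, safe_mask) if ok)
    weights = [math.exp(l - max_logit) if ok else 0.0
               for l, ok in zip(logits, safe_mask)]
    total = sum(weights)
    return [w / total for w in weights]


def sample_safe_action(logits, safe_mask, rng=random):
    """Sample an action index; masked (unsafe) actions can never be drawn."""
    probs = masked_action_probs(logits, safe_mask)
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]
```

For example, `masked_action_probs([2.0, 1.0, 0.5], [False, True, True])` assigns zero probability to the first action even though it has the largest logit, and renormalizes over the remaining two.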

In this thesis, we aim to enhance the performance of neural-network-based action masking for reinforcement learning. The goal is to improve and extend the existing pipeline for neural-network-based action masking, implement and test curriculum learning techniques, and finally evaluate the performance of the enhanced action masking network in an autonomous driving scenario based on CommonRoad and CommonRoad-RL.
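One simple form of curriculum learning can be sketched as follows. This is only an illustration under stated assumptions, not the method used in the project: the class `SuccessRateCurriculum`, the success-rate criterion, and the default `threshold` and `window` values are all hypothetical. The scheduler moves to a harder difficulty level once the recent success rate on the current level is high enough.

```python
from collections import deque


class SuccessRateCurriculum:
    """Advance to a harder level once the recent success rate is high enough.

    Hypothetical scheduler; levels are indexed 0 .. num_levels - 1.
    """

    def __init__(self, num_levels, threshold=0.8, window=20):
        self.num_levels = num_levels
        self.threshold = threshold
        self.level = 0
        self.results = deque(maxlen=window)  # outcomes of the most recent episodes

    def record(self, success):
        """Store one episode outcome and return the level for the next episode."""
        self.results.append(bool(success))
        window_full = len(self.results) == self.results.maxlen
        rate = sum(self.results) / len(self.results)
        if window_full and rate >= self.threshold and self.level < self.num_levels - 1:
            self.level += 1
            self.results.clear()  # judge the new level on fresh episodes only
        return self.level
```

In a training loop, `record` would be called once per finished episode, and the returned level would select the difficulty of the next scenario.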

This thesis offers an opportunity to engage in practical applications of autonomous driving. The project also aims for a publication in a peer-reviewed conference or journal.

Your tasks:
- Familiarize yourself with our current action masking techniques.
- Familiarize yourself with the existing code base for neural-network-based action masking in CommonRoad-RL.
- Enhance the efficiency and performance of the existing action masking pipeline.
- Implement curriculum learning techniques to improve the performance of the action masking method.
- Evaluate the performance in an autonomous driving scenario.
- Document your results.

Required skills:
- Knowledge of Reinforcement Learning and Curriculum Learning.
- Good Python programming skills and experience with PyTorch.

Please see the attached PDF for a detailed topic description.

If you are interested in this topic, please send an email with your CV and transcript to shuaiyi.li@tum.de, using the title "[Bachelor/Master Thesis Application] ..." :D

Contact: shuaiyi.li@tum.de

ThesisProposal (Type: application/pdf, Size: 40.1 kB)

Company details

Technische Universität München

Universities and colleges

5,001-10,000 employees

München, Germany
