Tayari, H. (2026). Tiny deep reinforcement learning for compute constrained agents : solving the inverted pendulum problem on less than 520kB SRAM using skill-oriented autonomous real-world E2E deep reinforcement learning [Diploma Thesis, Technische Universität Wien]. reposiTUm. https://doi.org/10.34726/hss.2026.128166