I am currently a Systems Engineer at Scania Group, working on algorithms for battery management systems. Previously, I earned my M.Sc. in Robotics (with honors) from Warsaw University of Technology, where I was supervised by Prof. Elżbieta Jarzębowska. For my master’s thesis, I explored deep reinforcement learning techniques to improve the navigation performance of mobile robots in dynamic environments. Before that, I completed my undergraduate studies (with honors) in Electrical Engineering at Addis Ababa Science and Technology University in Ethiopia.
We present a framework for mobile robot navigation in dynamic environments using Deep Reinforcement Learning (DRL) and the Robot Operating System (ROS). Traditional navigation methods often lack the real-time adaptability required in highly dynamic settings. To address this, we leverage the TD7 algorithm—an extension of the Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm incorporating state and state-action embeddings—to directly map raw sensor inputs to control actions. These embeddings, trained to minimize the mean squared error (MSE) between the encoded state-action representation and the transition-predicted next state, enhance the system’s ability to model environment dynamics and improve navigation performance. Extensive simulations were conducted in custom Gazebo environments of increasing complexity, ranging from open spaces to scenarios with static obstacles and moving actors. Performance was evaluated based on navigation success rate, time to goal, path efficiency, and collision rate. Results indicate that this approach consistently improves navigation performance, particularly in highly dynamic environments.
@inproceedings{Nurye-2025,author={Nurye, Ahmed Yesuf and Jarzebowska, Elzbieta},booktitle={2025 29th International Conference on Methods and Models in Automation and Robotics (MMAR)},title={Deep Reinforcement Learning for Mobile Robot Navigation in Dynamic Environments},year={2025},keywords={Mobile Robot Navigation; Deep Reinforcement Learning; TD3; TD7; ROS; Gazebo},}