M. Sadegh Talebi

    Mohammad Sadegh Talebi


     PhD
     Department of Automatic Control
     School of Electrical Engineering and Computer Science
     KTH Royal Institute of Technology

     Email: mstms (at) kth (dot) se





I am a researcher in the Department of Automatic Control at the School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology. I received my PhD in Electrical Engineering under the supervision of Prof. Alexandre Proutiere and Prof. Mikael Johansson. In summer 2017, I was a visiting PhD student with the SequeL team at INRIA Lille-Nord Europe, working under the supervision of Odalric-Ambrym Maillard on reinforcement learning in Markov decision processes.

I received my BSc degree in Electrical Engineering (minor: Electronics) from Iran University of Science and Technology (IUST) in 2004, and my MSc degree in Electrical Engineering (minor: Communication Systems) from Sharif University of Technology in 2006. Prior to starting my PhD, I worked for a few years as a research engineer in the School of Computer Science at the Institute for Research in Fundamental Sciences (IPM).


My research interests include:
  • Reinforcement learning in Markov decision processes
  • Stochastic multi-armed bandits
  • Resource allocation in networks

RECENT WORK
  • Variance-aware regret bounds for undiscounted reinforcement learning in MDPs
    with O.-A. Maillard (equal contribution).
    Proc. International Conference on Algorithmic Learning Theory (ALT), 2018.

  • Stochastic online shortest path routing: The value of feedback
    with Z. Zou, R. Combes, A. Proutiere, and M. Johansson.
    IEEE Transactions on Automatic Control (forthcoming) [doi][arXiv].


Last updated in January 2018.