M. Sadegh Talebi

    Mohammad Sadegh Talebi

     PhD Student
     Department of Automatic Control
     School of Electrical Engineering
     KTH The Royal Institute of Technology

     Email: mstms (at) kth (dot) se

  Publications    Teaching    Students    Curriculum Vitae 

I am currently pursuing my PhD in Electrical Engineering at the Department of Automatic Control of KTH The Royal Institute of Technology under the supervision of Prof. Alexandre Proutiere and Prof. Mikael Johansson. During summer 2017, I have been a visiting PhD student at SequeL team of INRIA Lille-Nord Europe, working under the supervision of Odalric-Ambrym Maillard on reinforcement learning in Markov decision processes.

I presented my licentiate seminar on "online combinatorial optimization under bandit feedback" on Februaray 5, 2016. My licentiate thesis and the slides can be found here.

I received my BSc degree in Electrical Engineering from Iran University of Science and Technology (IUST) in 2004, and my MSc degree in Electrical Engineering from Sharif University of Technology in 2006. Prior to starting my PhD, I worked a few years as a research engineer in the School of Computer Science of Institute for Research in Fundamental Sciences (IPM).

My research interests include:
  • Reinforcement learning in Markov decision processes
  • Stochatic multi-armed bandits
  • Online decision making
  • Resource allocation in networks

  • Variance-aware regret bounds for undiscounted reinforcement learning in MDPs
    with O.-A. Maillard.

  • Stochastic online shortest path routing: The value of feedback
    with Z. Zou, R. Combes, A. Proutiere, and M. Johansson.
    IEEE Transactions on Automatic Control (forthcoming) [doi][arXiv].

top |
Last updated on October 2017.