M. Sadegh Talebi

    Mohammad Sadegh Talebi


     PhD Student
     Department of Automatic Control
     School of Electrical Engineering
     KTH The Royal Institute of Technology

     Email: mstms (at) kth (dot) se





I am pursuing my PhD in Electrical Engineering at the Department of Automatic Control of KTH The Royal Institute of Technology under the supervision of Prof. Alexandre Proutiere and Prof. Mikael Johansson. I am currently a visiting PhD student with the SequeL team of INRIA Lille-Nord Europe, working under the supervision of Odalric-Ambrym Maillard on reinforcement learning in Markov decision processes.

I presented my licentiate seminar on "online combinatorial optimization under bandit feedback" on February 5, 2016. My licentiate thesis and the slides can be found here.

I received my BSc degree in Electrical Engineering from Iran University of Science and Technology (IUST) in 2004, and my MSc degree in Electrical Engineering from Sharif University of Technology in 2006. Prior to starting my PhD, I worked for a few years as a research engineer in the School of Computer Science of the Institute for Research in Fundamental Sciences (IPM).


My research interests include:
  • Reinforcement learning in Markov decision processes
  • Stochastic multi-armed bandits
  • Online decision making
  • Resource allocation in networks

RECENT WORK
  • Stochastic online shortest path routing: The value of feedback
    with Z. Zou, R. Combes, A. Proutiere, and M. Johansson.
    IEEE Transactions on Automatic Control (forthcoming) [arXiv].

  • Multi-period network rate allocation with end-to-end delay constraints
    with M. H. Hajiesmaili and A. Khonsari.
    IEEE Transactions on Control of Network Systems (forthcoming) [doi].


Last updated in July 2017.