Position: | Assistant Professor in Intelligent Systems with spec. in Machine Learning Division of Speech, Music and Hearing (TMH) School of Electrical Engineering and Computer Science KTH Royal Institute of Technology |
|
E-mail: | ghe at kth se |
|
Address: | KTH Royal Institute of Technology Lindstedtsvägen 24, office 518 SE-100 44 Stockholm SWEDEN |
Research
My research interests are probabilistic modelling and machine learning for data-generation tasks, most prominently speech synthesis as well as motion and gesture. Particular interests include deep generative modelling, the perceptual effects of modelling assumptions and design decisions, statistical robustness, and how to perform and analyse subjective evaluations.
Publications
A list of my publications is provided on a separate page, grouped by type:
- Books and book chapters
- Journal papers
- Conference papers
- Non-peer reviewed conference contributions
- Preprints
- Theses
Degrees and positions
- 2023: Degree of Docent in computer science (machine learning) from KTH
- 2020–: Assistant professor at the Division of Speech, Music and Hearing (TMH) at KTH
- 2018–2019: Post-doc at the Division of Speech, Music and Hearing (TMH) at KTH
- 2016–2018: Post-doc at Prof. Junichi Yamagishi's lab at the National Institute of Informatics (NII), Tokyo, Japan
- 2013–2016: Post-doc at the Centre for Speech Technology Research (CSTR) at the University of Edinburgh, UK
- 2013: Ph.D. in electrical engineering (telecommunications) from KTH (main supervisor Prof. W. Bastiaan Kleijn)
- 2007: M.Sc. in engineering physics from KTH
Students
I am fortunate to be supervising the following doctoral students:
- 2023–: Haotian Qi (co-supervisor; main supervisor Gabriel Skantze)
- 2021–: Rajmund Nagy (main supervisor)
- 2020–: Shivam Mehta (main supervisor
- 2020–: Ulme Wennberg (main supervisor)
- 2019–: Ghazaleh Esfandiari Baiat (co-supervisor; main supervisor Jens Edlund)
I previously supervised the following students:
- 2022–2022: Olga Mikheeva (co-supervisor; main supervisor Hedvig Kjellström)
- 2020–2022: Dr. Patrik Jonell (co-supervisor; main supervisor Jonas Beskow)
- 2019–2021: Dr. Taras Kucherenko (co-supervisor; main supervisor Hedvig Kjellström)
Awards and recognition
Awards and recognition since my return to KTH in 2018:- Apr. 2022: Gustavo Teodoro Döhler Beck, whose thesis I informally co-supervised, won the SAIS Master's Thesis Award 2022
- Sept. 2021: Received an Extended Abstract Honourable Mention at IVA 2021 (given to the top 2 of 8 accepted extended abstracts, among approximately twice as many submissions)
- Oct. 2020: Received a Best Paper Award at ICMI 2020 (given to 2 of 159 long paper submissions)
- Oct. 2020: Received the Best Paper Award at IVA 2020 (given to 1 of 137 full paper submissions)
- May 2020: Received an Honourable Mention at EUROGRAPHICS 2020 (best paper award nominee, top 4 of 141 submissions)
- May 2020: Ulme Wennberg, whose thesis I co-supervised, won the SAIS Master's Thesis Award 2020
- Sept. 2019: Received the Best Show and Tell Paper Award at Interspeech 2019 (given to 1 of 37 demos presented at the conference)
Selected presentation materials
-
Move over, MSE! – New probabilistic models of motion
A presentation most recently given at the Department of Computer Science, University of California, Davis, USA, in April 2021.
[ video presentation | slides w/o video (22.3 MiB) | abstract ]
-
Practical classifier design
A lecture from the M.Sc. course EQ2341 Pattern Recognition and Machine Learning at KTH Royal Institute of Technology, Stockholm, Sweden, most recently given in May 2019.
[ slides ]
-
Cyborgs and other controllable synthesisers: an update on past and planned research
A presentation given at the Centre for Speech Technology Research, the University of Edinburgh, UK, in November 2018.
[ slides w/o audio | abstract ]
-
Wagging speech by the tail: The case for robust data generation
A presentation most recently given at the ACCESS Linnaeus Centre, KTH Royal Institute of Technology, Stockholm, Sweden, in October 2018.
[ slides w/ audio (2.6 MiB) | slides w/o audio | abstract ]
-
Introduction to modern and controllable speech synthesis
Excerpt from a presentation given at the annual EACare project retreat in Sigtuna, Sweden, in April 2018.
[ slides w/ audio (6.6 MiB) | slides w/o audio ]
-
Perceptual debugging of speech synthesis: Using speech from the future to pinpoint the design decisions that matter most
A presentation most recently given at the Tokuda and Nankaku Laboratory, Nagoya Institute of Technology, Japan, in January 2018.
[ slides w/ audio (7.6 MiB) | slides w/o audio | abstract ]
Note: Some audio files have been removed from the slides and presentations on this page due to strict Japanese regulations regarding personal data. Audio playback from pdf is known to work in Adobe Acrobat Reader (from trusted documents), but not in Preview on macOS.
Data releases
-
Taras Kucherenko, Patrik Jonell, Youngwoo Yoon, Pieter Wolfert, and Gustav Eje Henter (2020)
Data from the GENEA Challenge 2020
Motion data, video stimuli, visualisation code, evaluation code, user responses, analysis code, and scientific papers from the first-ever challenge in automatic, speech-driven gesture generation.
[ main page | workshop page | gesture data (direct link) | gesture data readme file | Zenodo repository | .bib | publication ]
-
Gustav Eje Henter, Thomas Merritt, Matt Shannon, Catherine Mayo, and Simon King (2014)
Repeated Harvard Sentence Prompts (REHASP) corpus version 0.5
Studio recordings of 30 Harvard sentences, each read aloud 40 times by a female native speaker of British English. Released under a CC BY 4.0 license.
[ official repository | readme file | .bib | publication ]
Events organised
- GENEA Workshop 2021 on the generation and evaluation of non-verbal behaviour for embodied agents (co-organised together with Taras Kucherenko, Zerrin Yumak, Pieter Wolfert, Youngwoo Yoon, and Patrik Jonell)
- GENEA Challenge 2020 on speech-driven gesture generation (co-organised together with Taras Kucherenko, Pieter Wolfert, Youngwoo Yoon, and Patrik Jonell)
- GENEA Workshop 2020 on the generation and evaluation of non-verbal behaviour for embodied agents (co-organised together with Taras Kucherenko, Pieter Wolfert, Youngwoo Yoon, and Patrik Jonell)
Links and additional information
- Download my curriculum vitæ.
- Download a .bib file with all my publications.
- Access my profile on Google Scholar.
- Find my ORCID iD (0000-0002-1643-1054).
- Access my page on GitHub. (Not currently up to date.)
- Access my official profile on KTH Social.
- Disclaimer for personal web pages at KTH.
This page was last updated 2023-10-06.
[ back to the top of the page | contact the author ]
❧