Wednesday, May 12, 2021

Fwd: [DMRN-LIST] Vacancy: Research Fellow on Machine Learning for Audio Captioning @ University of Surrey, UK

Applications are invited for a Research Fellow (RF) position for 22
months within the Centre for Vision Speech and Signal Processing
(CVSSP), University of Surrey, UK, to work in the area of machine
learning and acoustic signal processing.

The post is funded by British Council under Newton Institutional Links
Award, under a project titled "Automated Captioning of Image and Audio
for Visually and Hearing Impaired", which is a joint project between
the University of Surrey and the Izmir Katip Celebi University (IKCU),
Turkey, with project partners from charities and industrial sectors
working with the hearing and visually impaired.

The focus at Surrey will be to develop machine learning and signal
processing algorithms for information extraction from audio data,
recognize audio classes (i.e. tags and labels), and generate text
description of audio content. This work is built on the recent
contributions of CVSSP in the area of acoustic scene analysis, audio
event detection, environmental sound recognition, and audio tagging,
together with some latest results on audio captioning. The algorithms
developed will be integrated by the partner university IKCU into a
smartphone app to prototype and demonstrate the concept.

The post-holder is expected to have a PhD degree (or equivalent) in
the area of machine learning, acoustic signal processing, audio
engineering, or a related area in electronic engineering, applied
mathematics, computer science, artificial intelligence, and
statistics. The post-holder is expected to have good analytical
skills, and programming skills in Python, Matlab or C/C++. Preference
will be given to those who have experience on audio classification,
audio tagging, audio captioning, or cross modal translations (such as,
audio<->texts, image<->texts, or audio<->video), but candidates who
have experience in machine learning and audio are welcome to apply.

The post-holder will be based in CVSSP, and work under the direction
of the Principal Investigator Prof Wenwu Wang, with co-supervision by
Prof Sabine Braun, Director of the Centre for Translation Studies, at
University of Surrey, and in collaboration with Dr Volkan Kilic, from
the IKCU, Turkey.

CVSSP is an International Centre of Excellence for research in
Audio-Visual Machine Perception, with over 150 researchers, a grant
portfolio of £24M (£17.5M EPSRC) from EPSRC, EU, InnovateUK, charity
and industry, and a turnover of £7M/annum. The Centre has
state-of-the-art acoustic capture and analysis facilities and a Visual
Media Lab with video and audio capture facilities supporting research
in real-time video and audio processing and visualisation. CVSSP has a
compute facility with 120 GPUs and >1PB of high-speed secure storage.

This post is available to start immediately, or as soon as possible.

To apply online, please visit the following page:

https://jobs.surrey.ac.uk/vacancy.aspx?ref=015821-R


Many thanks.

Best wishes,

Wenwu



--
Professor Wenwu Wang
Centre for Vision Speech and Signal Processing
Department of Electronic Engineering
University of Surrey
Guildford GU2 7XH
United Kingdom
Phone: +44 (0) 1483 686039
Fax: +44 (0) 1483 686031
Email: w.wang@surrey.ac.uk
http://personal.ee.surrey.ac.uk/Personal/W.Wang/