Dr Theodoros Giannakopoulos was born in Athens, Greece, in 1980. He received the Degree in Informatics and Telecommunications from the University of Athens (UOA), Athens, Greece, in 2002, the M.Sc. (Honors) Diploma in signal and image processing from the University of Patras, Patras, Greece, in 2004 and his Ph.D. in the field of Multimodal Machine Learning from the department of Informatics and Telecommunications, UOA, in 2009. He is the coauthor of more than 100 publications in journals and conferences in the fields of pattern recognition and multimedia analysis and the coauthor of a book titled "Introduction to Audio Analysis: A MATLAB Approach". He is an active member of the open source community, author of the pyAudioAnalysis and deep_audio_features libraries, and he is the top Python contributor in Greece and in the top 0.1% worldwide. He is currently a Tenured Researcher Institute of Informatics and Telecommunications, NCSR “Demokritos”, Greece. He has several years of experience in tutoring, mostly in Master Programs organized by NCSR Demokritos, courses such as: Machine Learning, Deep Learning, Data Programming and Multimodal Data Analysis. His research interests lie in the fields of multimodal machine learning, music information retrieval and speech analytics.
This presentation demonstrates how signal processing, machine and deep learning can be utilised to build applications that analyse and recognise sounds. Apart from a brief intro to the basics of speech and audio processing, it examines a range of open source tools and libraries for building audio analysis applications.