Speaker Diarization for Audio Transcription

This research is concerned with speaker diarization for the purpose of facilitating automated speech transcription. This problem has multiple depths depending on the prior knowledge provided to the system. The type and amount of information about the number and characteristics of the speakers can differentiate this problem in a range from a 1-to-N matching, where the voice is compared against different templates, to a clustering problem, where no prior knowledge is available. We intend to find a solution for the speaker diarization problem by incorporating state-of-the-art supervised and unsupervised machine learning methods. This internship will help the interns gain professional work experience in the field of natural language processing with the help from experts in both industry and academia. It is an opportunity for them to practice and improve their industry skills and gain a better understanding of what they are learning in the academia.

Faculty Supervisor:

Otman Basir

Student:

Pouya Mehrannia;Nada Gohider

Partner:

TRINT NORTH AMERICA INC.

Discipline:

Engineering - computer / electrical

Sector:

Administrative and support, waste management and remediation services

University:

University of Waterloo

Program:

Accelerate

Current openings

Find the perfect opportunity to put your academic skills and knowledge into practice!

Find Projects