To communicate with their end users, businesses regularly produce written documents such as letters, notices, statements, etc.., in various languages. A set of rules are usually used to ensure that information in these documents is ‘correct’ and consistent across languages and communication channels. However, with the increasing volume and variety of information being sent out to clients, it becomes difficult to preserve the semantics of client messages across vocabulary and language variations. This project aims at creating algorithms capable of measuring semantic similarity of two text documents regardless of the natural language being used for each document. The set of similarity algorithms must scale with the size of the corpus being used.
Information and communications technologies
Find the perfect opportunity to put your academic skills and knowledge into practice!Find Projects
The strong support from governments across Canada, international partners, universities, colleges, companies, and community organizations has enabled Mitacs to focus on the core idea that talent and partnerships power innovation — and innovation creates a better future.