Merging different sub-companies into TELUS caused some of customer records to be repeated through the merged data-set. Algorithms are needed to determine the duplicate records. Currently a deterministic algorithm is being used in TELUS. In this project, we will investigate if machine learning can help to detect duplicates. Solving this problem has several parts. We have to preprocess the data and select some features from the TELUS records that help us in our model. A probabilistic model should be selected, implemented and tuned. Then, it is necessary to test the proposed model and compare that with the current systems.
Information and communications technologies
University of British Columbia
Find the perfect opportunity to put your academic skills and knowledge into practice!Find Projects
The strong support from governments across Canada, international partners, universities, colleges, companies, and community organizations has enabled Mitacs to focus on the core idea that talent and partnerships power innovation — and innovation creates a better future.