Related projects
Discover more projects across a range of sectors and disciplines, from AI to cleantech to social innovation.
Portable Document Format (PDF) is the de facto standard for presenting combined textual and visual content. In this project, we aim to develop a machine learning framework for PDF document understanding. Despite the recent proliferation of deep learning-based methods for analyzing and processing natural images, considerably less effort has gone into designing similar approaches for highly structured data such as documents. Our project will explore two novel ideas. First, we will develop a structured, organizational representation of PDF documents built on labeled content blocks (e.g., heading, figure, list, caption). Second, we will investigate how recursive neural networks (RvNNs), a type of deep neural network that has been used for language parsing, can be adapted and formulated for learning PDF document structures.
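To make the idea of a representation built on labeled content blocks more concrete, here is a minimal sketch in Python. It is not the project's actual design: the `Block` class, the `summarize` function, and the label names beyond those listed in the abstract are all hypothetical. The recursion only counts labels per subtree, but it follows the same bottom-up order in which an RvNN would presumably merge child encodings into a parent encoding.

```python
from collections import Counter
from dataclasses import dataclass, field
from typing import List


@dataclass
class Block:
    """Hypothetical labeled content block; not the project's actual schema."""
    label: str                      # e.g. "heading", "figure", "list", "caption"
    text: str = ""
    children: List["Block"] = field(default_factory=list)


def summarize(block: Block) -> Counter:
    """Bottom-up fold over the tree: merge child summaries, then add this node.

    An RvNN would combine learned child vectors at this point; label counts
    are used here only as a stand-in (an assumption for illustration).
    """
    total: Counter = Counter()
    for child in block.children:
        total += summarize(child)
    total[block.label] += 1
    return total


# A toy page: one heading, one captioned figure, and a two-item list.
doc = Block("document", children=[
    Block("heading", "Results"),
    Block("figure", children=[Block("caption", "Figure 1: Accuracy.")]),
    Block("list", children=[
        Block("list_item", "first"),
        Block("list_item", "second"),
    ]),
])

counts = summarize(doc)
```

Calling `summarize(doc)` visits every block exactly once, children before parents, so each node's result depends only on its own label and the results of its subtree.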
Richard Zhang
Chenyang Zhu
PDFTron Systems
Computer science
Information and communications technologies
Accelerate