Web Page Contextual Sub-Components Recognition and Identification

The intent of this project is to build a web-page content subsection identification and analysis service. The subsections of a web-page are considered to be meaningful page components (articles, images. media blocks) that can be identified and uniquely fingerprinted.
The intent of the project is fourfold:
a) sub-component identification and fingerprinting
b) sub-component content change tracking and precedence history identification
c) sub-component analysis disregarding non-relevant component
d) compact service component implementation with the emphasis of user-side execution (rust/web assembly)
The research conducted in this project will enable key functionality of the Scrawlr platform, including ability to build content-aware features, enabling richer user interaction capabilities.

Faculty Supervisor:

Shurui Zhou

Student:

Partner:

Scrawlr Development Inc.

Discipline:

Computer science

Sector:

Information and Communications Technology; Other

University:

University of Toronto

Program:

Accelerate

Current openings

Find the perfect opportunity to put your academic skills and knowledge into practice!

Find Projects