The goal of cross-modal recipe retrieval is to design systems that can find a digital recipe given a user’s image of the food, or find the food’s image given its ingredients or cooking instructions. For such a cross-modal retrieval task, a common image-text representation space is needed to embed the semantic information of […]