Order-Embeddings: Иерархия изображений и текста в ИИ

Q: Кто разработал Order-Embeddings of Images and Language?

Модель Order-Embeddings of Images and Language разработана компанией University of Toronto (Canada).

// задачи

Image captioning

// описание

Модель Order-Embeddings выстраивает четкую визуально-семантическую иерархию, связывая изображения и текст через структуру частичного порядка. Такой подход позволяет ИИ глубже понимать контекст и взаимосвязи между объектами, что критически важно для точного создания описаний к картинкам.

// abstract

Hypernymy, textual entailment, and image captioning can be seen as special cases of a single visual-semantic hierarchy over words, sentences, and images. In this paper we advocate for explicitly modeling the partial order structure of this hierarchy. Towards this goal, we introduce a general method for learning ordered representations, and show how it can be applied to a variety of tasks involving images and language. We show that the resulting representations improve performance over current approaches for hypernym prediction and image-caption retrieval.

// faq

Что такое Order-Embeddings of Images and Language?+

Кто разработал Order-Embeddings of Images and Language?+

Какие задачи решает Order-Embeddings of Images and Language?+

// похожие модели