ICMR 2021: Special Session

Delving into Vision and Language Intelligence

The natural world is abundant with concepts expressed via visual, acoustic, tactile, and linguistic modalities. Cross-modal learning has emerged as an important research direction in computer vision and multimedia. It refers to the adaptive, synergistic integration of complex perceptions from multiple sensory modalities, such that the learning that occurs within any individual sensory modality, e.g., vision, can be enhanced with information from one or more other modalities, e.g., text. This session focuses on understanding, reasoning, and generation across language/text and vision. It prompts the creation of intelligent services, including vision-to-text captioning, text-to-vision generation, and question answering/dialog about images and videos. This special session invites papers that will be complimentary to all ICMR 2021 conference registrants.

Prospective submissions should address topics including, but not limited to, the following:

Vision/text translation
Image/video captioning
Dialog generation for video understanding
Cross-modal retrieval
Cross-modal data learning
Deep generative models for images and texts
Visual question answering
Scene image generation from language
Cross-modal learning and semantic correlation
Auxiliary knowledge for images/videos
Weakly supervised cross-modal learning/integration
Deep learning for cross-modal embedding

Maximum Length of a Paper

Each full paper should be limited to 6-8 pages (6-page limit for content, plus references).

Important Dates

Paper Submission: March 3, 2021
Notification of Acceptance: April 11, 2021
Camera-Ready Papers Due: May 1, 2021

Submission Instructions

See the ICMR 2021 Paper submission section.

Organizers

Lin Wu (jolin.lwu@gmail.com), Hefei University of Technology, China
Zongyuan Ge (zongyuan.ge@monash.edu.au), Monash University, Australia
Yang Wang (yangwang@hfut.edu.cn), Hefei University of Technology, China
Zhao Zhang (cszzhang@gmail.com), Hefei University of Technology, China
Jialie Shen (j.shen@qub.ac.uk), Queen's University Belfast, UK