CVPR 2026  ·  Workshop
Vision for Intelligent
Task Assistants
VITA 2026  ·  June 3, 1:00 PM  ·  Denver, CO
About VITA 2026

About VITA 2026

Advances in computer vision, multimodal learning, and AR/VR/XR technologies and smart glasses are converging toward Virtual Intelligent Task Assistants (VITAs)—systems that observe, interpret, and guide humans in complex real-world activities. This workshop bridges computer vision foundations and interactive AR/VR/XR research to enable long-term task understanding and assistance. Topics include learning from long streaming egocentric and exocentric videos, multimodal reasoning, task and step prediction, procedure planning and correction, human-AI collaboration and coaching, and new datasets and benchmarks. By fostering dialogue across disciplines, the workshop aims to define the core challenges and opportunities for building practical and generalizable VITAs.

When

June 3, 2026
1:00 PM

Where

Room 108
Colorado Convention Center
Denver, CO

Speakers

Kristen Grauman

Kristen Grauman

University of Texas at Austin

Ivan Laptev

Ivan Laptev

MBZUAI

Marc Pollefeys

Marc Pollefeys

ETH Zurich

Antonino Furnari

Antonino Furnari

University of Catania

Gedas Bertasius

Gedas Bertasius

University of North Carolina, Chapel Hill

Steven Feiner

Steven Feiner

Columbia University

Organizers

Mohsen Moghaddam

Mohsen Moghaddam

Georgia Institute of Technology

🔗
Angela Yao

Angela Yao

National University of Singapore

🔗
Jason Corso

Jason Corso

University of Michigan

🔗
Ehsan Elhamifar

Ehsan Elhamifar

Northeastern University

🔗

Schedule

[in Denver local time · MDT / UTC−6]

1:30 – 2:00
Opening Talk
2:00 – 2:30
Invited Talk 1
2:30 – 3:00
Invited Talk 2
3:00 – 3:30
Invited Talk 3
3:30 – 4:00
Break
4:00 – 4:30
Invited Talk 4
4:30 – 5:00
Invited Talk 5
5:00 – 5:30
Invited Talk 6
5:30 – 5:40
Concluding Remarks