Large-scale Video Object Segmentation

Latest News

[🔥Update - Jul 10] The evaluation servers for all three challenge tracks are now available: [MOSEv2] / [MeViSv2-Text] / [MeViSv2-Audio].

[News] Workshop paper submission link released: OpenReview submission portal.

Introduction

The 8th LSVOS challenge will be held in conjunction with ECCV 2026 in Malmö, Sweden. This year, we will continue the same setup as last year and still have two tracks: VOS and RVOS. In the Video Object Segmentation (VOS) track, we will utilize LVOS and MOSE. to study the VOS under more challenging complex environments. LVOS is designed for long-term videos, dealing with complex object motion and long-term reappearance, while MOSE focuses on complex scenes, covering aspects such as object disappearance and reappearance, inconspicuous small objects, heavy occlusions, and crowded environments. For the Referring Video Object Segmentation (RVOS) track, we will continue to use MeViS. MeViS focuses on the identification of the target object in a video based on motion-related descriptions rather than static attributes. This innovative approach subverts the foundational design principles of existing RVOS methods, compelling researchers to engage in a more in - depth exploration and reevaluation of motion modeling. In addition, we will hold a series of talks by the leading experts in video understating and embodied intelligence. In this year, the following topics will be covered:

Semantic/panoptic segmentation for images/videos
Video Object Segmentation in Complex Scenes
Long-term Video Object Segmentation
Referring Video Object Segmentation
Video Segmentation with Motion Expressions
Vision and Language
Cognitive Models of Object Perception
Real-world Understanding and embodied intelligence

Challenge Timeline

Event	Date
Challenge Release	Jul 01, 2026
Validation Server Online	Jul 01, 2026
Test Server Online	Jul 20, 2026
Submission Deadline	Jul 27, 2026
Notification of Results	Aug 01, 2026

*All dates are in UTC, 23:59 of the specified day.

Call for Paper

[Update] LSVOS 2026 workshop paper submission is now open on OpenReview.

We invite authors to submit unpublished papers (8-page ECCV format) to our workshop, to be presented at a poster session upon acceptance. All submissions will go through a double-blind review process. Please submit your paper through the workshop paper submission portal.

Accepted papers will be published in the official ECCV Workshops proceedings and the Computer Vision Foundation (CVF) Open Access archive.

Paper Submission Timeline

Event	Date
Submission portal open	Jun 20, 2026
Regular paper submission deadline	Jul 20, 2026
Supplemental material deadline	Jul 20, 2026
Notification of paper acceptance	Aug 05, 2026
Camera ready deadline	Aug 10, 2026

*All dates are in UTC, 23:59 of the specified day.

Challenge Tracks & Submission

The 8th LSVOS challenge includes three tracks: MOSEv2, MeViSv2-Text, and MeViSv2-Audio.

Below are the task descriptions for the three tracks:

Track 1: Complex Video Object Segmentation (MOSEv2) Track

MOSEv2 focuses on tracking and segmenting objects in videos captured in complex environments. Submission server [click here].

Track 2: Text-based Referring Motion Expression Video Segmentation (MeViSv2 - Text) Track

MeViSv2-Text focuses on segmenting video objects guided by a sentence that describes the motion of the target objects. Submission server [click here].

Track 3: Audio-based Referring Motion Expression Video Segmentation (MeViSv2 - Audio) Track

MeViSv2-Audio focuses on segmenting video objects guided by an audio clip that describes the motion of the target objects. Submission server [click here].