COCO + LVIS
Joint Recognition Challenge Workshop at ECCV 2020

Table of Contents

  1. Overview
  2. Dates
  3. Organizers
  4. Rules and Awards
  5. Challenge tracks
  6. Invited Speaker

1. Overview

As ECCV goes online this year, our workshop will be held online. We will update this web-page with relevant details as soon as ECCV organizers will share more organizational details.

The goal of the joint COCO and LVIS Workshop is to study object recognition in the context of scene understanding. This workshop will host the COCO suite of challenges and a new challenge on large vocabulary instance segmentation (LVIS). While both the COCO and LVIS challenges look at the general problem of visual recognition, the specific tasks in the challenges probe different aspects of the problem.

COCO is a widely used visual recognition dataset, designed to spur object detection research with a focus on full scene understanding. In particular: detecting non-iconic views of objects, localizing objects in images with pixel level precision, and detection of objects in complex scenes. The COCO dataset includes 330K images of complex scenes exhaustively annotated with 80 object categories with segmentation masks, 91 stuff categories with segmentation masks, person keypoint annotations, and 5 captions per image.

Large Vocabulary Instance Segmentation (LVIS) includes high-quality instance segmentations for more than 1000 entry-level object categories. The LVIS dataset contains a long-tail of categories with few examples, making it a distinct challenge from COCO and exposes shortcomings and new opportunities in machine learning. We expect this dataset to inspire new methods in the detection research community. This year we plan to host the first challenge for LVIS, a new large vocabulary dataset.

2. Challenge Dates

July 31, 2020
Submission deadline (11:59 PM PST)
August 3, 2020
Technical report submission deadline (11:59 PM PST)
August 10, 2020
Challenge winners notified
August 15, 2020
Presenter's video upload deadline
August 23, 2020
ECCV 2020 Workshop

3. Organizers

3.1. COCO

3.2. LVIS

4. Rules and Awards

5. COCO Challenges

COCO is an image dataset designed to spur object detection research with a focus on detecting objects in context. The annotations include instance segmentations for object belonging to 80 categories, stuff segmentations for 91 categories, keypoint annotations for person instances, and five image captions per image. The specific tracks in the COCO 2018 Challenges are (1) object detection with segmentation masks (instance segmentation), (2) panoptic segmentation, (3) person keypoint estimation, and (4) DensePose. We describe each next. Note: neither object detection with bounding-box outputs nor stuff segmentation will be featured at the COCO 2020 challenge (but evaluation servers for both tasks remain open).

5.1. COCO Object Detection Task

The COCO Object Detection Task is designed to push the state of the art in object detection forward. Note: only the detection task with object segmentation output (that is, instance segmentation) will be featured at the COCO 2019 challenge. For full details of this task please see the COCO Object Detection Task.

5.2. COCO Panoptic Segmentation Task

The COCO Panoptic Segmentation Task has the goal of advancing the state of the art in scene segmentation. Panoptic segmentation addresses both stuff and thing classes, unifying the typically distinct semantic and instance segmentation tasks. For full details of this task please see the COCO Panoptic Segmentation Task.

5.3. COCO Keypoint Detection Task

The COCO Keypoint Detection Task requires localization of person keypoints in challenging, uncontrolled conditions. The keypoint task involves simultaneously detecting people and localizing their keypoints (person locations are not given at test time). For full details of this task please see the COCO Keypoint Detection Task.

5.4. COCO DensePose Task

The COCO DensePose Task requires dense estimation of human pose in challenging, uncontrolled conditions. The DensePose task involves simultaneously detecting people, segmenting their bodies and mapping all image pixels that belong to a human body to the 3D surface of the body. For full details of this task please see the COCO DensePose Task.

6. LVIS Challenge

LVIS is a new, large-scale instance segmentation dataset that features > 1000 object categories, many of which have very few training examples. LVIS presents a novel low-shot object detection challenge to encourage new research in object detection. For more information, please see LVIS challenge page.

6. Invited Speaker

TBD