Overview

VisDrone is one of the most comprehensive benchmarks for drone-based computer vision tasks, including object detection and tracking. The dataset was collected by the AISKYEYE team at Lab of Machine Learning and Data Mining, Tianjin University, China.

288
Video Clips
261,908
Video Frames
10,209
Static Images
2.6M+
Bounding Box Annotations

Dataset Statistics

Tasks Supported

DET
Object Detection in Images
VID
Object Detection in Videos
SOT
Single-Object Tracking
MOT
Multi-Object Tracking
CC
Crowd Counting

Object Detection in Images (VisDrone-DET)

Object Detection in Videos (VisDrone-VID)

Single-Object Tracking (VisDrone-SOT)

Multi-Object Tracking (VisDrone-MOT)

Strengths and Limitations

Strengths

  • Exceptional scale and diversity
  • Supports multiple tasks (detection, tracking, counting)
  • Well-documented with established benchmarks
  • Regular updates and challenges
  • Realistic drone-captured footage

Limitations

  • Primarily focused on urban environments
  • Limited to visible spectrum imagery
  • Annotation density varies across dataset
  • Some classes have limited representation

Characteristics for Drone Deployment

Back to Datasets