Outils pour utilisateurs

Outils du site


datasets

Computer vision datasets

(kept by the computer vision and ML group of the LIRIS laboratory, INSA-Lyon)

Robotics simulators

Mobile agents, navigation

A useful classification on the AI-Thor website

  • 2019 3D Deep-RL and reasoning w/o supercomputer by Inria-Chroma and LIRIS: high-level reasoning with VizDoom arxiv paper
  • 2019 StreetLearn Dataset Navigation, from StreetView, by DeepMind.
  • 2019 Habitat-Sim by FAIR. Built on SUNCG. Different tasks, including language, or not.
  • 2019 RobotIx Robot interactions, Pepper robot
  • 2018 CHALET Cornell House Agent Learning Environment. CG, not photo-realistic, manipulable, 58 rooms, 10 houses. Still at version 0.1, i.e. unstable.
  • 2018 SYnCity Vehicles, Unity. Commercial: Prize?
  • 2018 Holodeck Agent or Drone, Unreal-4, single or multi-agent
  • 2018 Matterport Indoor. Photo-realism, no physics, 90 buildings. Natural language interaction with robots.
  • 2018 Gibson Real world images, indoor
  • 2018 Chalet rendered images, indoor
  • 2017 Minos Photo-realism, 90 buildings. goal-directed navigation in complex indoor environments.
  • 2017 Minos-SunCG No-Photorealsim; customable. 45 000 houses.
  • 2017 Minos-Matterport3D Photorealsim; 90 buildings.
  • 2017 HoME simulated indoor, sun-cg (procedurally created)
  • 2017 AI-Thor Allen I., Stanford, CMU. Photo realism, physics/actionable, customable, 32 rooms.
  • 2017 Udacity Self driving car simulator based on Unity
  • 2017 CARLA Autonomous driving. Python+ROS support. Depth,LIDAR,Semantic segmentation. Dynamic weather, multiple cars.
  • 2017  SynCity Drones & driving.
  • 2017 MS Airsim Drone or car, game engine.
  • 2002 Gazebo Very general robotics simulator
Walking, crawling etc.
Robotic arms

Other dataset lists and surveys

Physics simulators

Aerial imagerie

Semantic Full Scene Labelling

  • Places dataset Places 2,5 millions d’images avec 205 Scènes Labellisées.

Synthetically created datasets

Gesture recognition

Full body pose estimation

Hand pose estimation

Action recognition

Surveys and dataset lists
Datasets
  • Multiview datasets:
    • 2010 VideoWeb Dataset focusus on interactions (people meeting, people following, vehicles turning, people dispersing, shaking hands, gesturing, waving, hugging, and pointing)
    • 2009 i3DPost Multi-view Dataset Groundtruth inclues 3D mesh models (walking, running, jumping, bending, hand-waving, jumping in place, sitting-stand up, running-falling, walking-sitting, running-jumping-walking, handshaking, pulling, and facial-expressions)
    • 2009 PETS 2009
    • 2007 PETS 2007
    • 2006 INRIA IXMAS dataset 5 cameras, daily living (nothing, checking watch, crossing arms, scratching head, sitting down, getting up, turning around, walking, waving, punching, kicking, pointing, picking up, throwing (over head), and throwing (from bottom up))
    • 2006 HumanEva dataset Multi-camera RGB / Grayscale images

Body part segmentation

Touch gesture recognition

Object recognition and segmentation

Pedestrian detection

Motion capture

Visual Question Answering

  • 2016 VQA Dataset 200,000 real scene images from MSCOCO along with 1 million questions. Versions: v1, v2, VQA-CP.
  • 2016 CLEVR Diagnostic dataset for compositional language and elementary visual reasoning (synthetic data). Versions: CLEVR, CLEVR Humans, CoGenT.
  • 2016 Visual Genome 108,000 real scene images (MSCOCO & YFCC100M intersection) along with 1.7 million questions. It is a general purpose dataset as it proposes many annotations in addition to question/answer paires: object instances, relationships, etc…
  • 2016 Visual Dialog 123,000 images from MSCOCO. Each image is annotated with a dialog composed of 10 question answer paires.
  • 2018 VizWiz answering visual questions from blind people.
  • 2017 GuessWhat?! visual object discovery through multi-modal dialogue. 66,000 images from MSCOCO along with 160,000 dialogues (822,000 question answer paires).
  • 2016 Visual7W grounded question answering in images. It is a subset of Visual Genome, 47,000 images from MSCOCO along with 328,000 question answer paires.

Main : VOIR - a smart vision platform

datasets.txt · Dernière modification: 2019/04/09 10:55 de cwolf

Outils de la page