TorontoCity: Seeing the World
with a Million Eyes
Authors
Shenlong Wang, Min Bai, Gellert Mattyus, Hang Chu, Wenjie Luo, Bin Yang
Justin Liang, Joel Cheverie, Sanja Fidler, Raquel Urtasun
* Project Completed by Summer 2016
Why Toronto? The best place to live in the world*
• Toronto: #4
The places where you work:
• Boston: #36
• Pittsburgh: #39
• San Francisco: #49
• Los Angeles: #51
*According to the 2015 Global Liveability Ranking
A dataset covering over 700 km²!
From all the views!
Dataset
Data sources:
• Aerial imagery
• Airborne LIDAR
• Ground-level panoramas
• Ground-level LIDAR and stereo
• Drone imagery
Why do we need this?
• Mapping for Autonomous Driving
• Smart City
• Benchmarking:
  • Large-Scale Machine Learning / Deep Learning
  • 3D Vision
  • Remote Sensing
  • Robotics
Image sources: Here 360; Toronto SmartCity Summit
Annotations
• Manual annotation? Impossible!
• Suppose each 500x500 image costs $1 to annotate with pixel-wise labels: we would need to pay $1,139,200 to create ground truth for the aerial images alone.
• However, humans already collect rich knowledge about the world!
Use maps!
("I'm not as rich as Jensen")
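As a sanity check on that number, here is a minimal back-of-the-envelope sketch in Python. The $1-per-tile cost and the 712.5 km² area come from the slides; the ~5 cm/pixel ground sampling distance is my assumption, chosen because it reproduces the quoted total.

```python
# Back-of-the-envelope annotation cost. Area and per-tile cost are from the
# slides; the 5 cm/pixel ground sampling distance is an assumption chosen
# because it reproduces the quoted ~$1.14M total.
AREA_KM2 = 712.5        # total covered area (from the slides)
GSD_M = 0.05            # assumed ground sampling distance: 5 cm per pixel
TILE_PX = 500           # 500x500-pixel annotation tiles
COST_PER_TILE = 1.0     # $1 per pixel-wise annotated tile (from the slides)

total_pixels = AREA_KM2 * 1e6 / GSD_M**2     # m^2 -> pixels
num_tiles = total_pixels / TILE_PX**2
print(f"{num_tiles:,.0f} tiles -> ${num_tiles * COST_PER_TILE:,.0f}")
# ~1,140,000 tiles -> ~$1,140,000
```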
Map as Annotations
Maps provide:
• HD Map
• 3D Building
• Meta Data
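To illustrate the idea of turning maps into annotations, here is a minimal sketch that rasterizes a building footprint polygon into a pixel-wise label mask. The tile origin, resolution, and polygon coordinates are hypothetical placeholders; this is not the paper's pipeline.

```python
# Minimal sketch: rasterize a building footprint from a map into a pixel-wise
# label mask for one aerial tile. Tile origin, resolution, and the polygon
# are hypothetical placeholders (and the y-axis flip of real georeferenced
# rasters is ignored for brevity).
import numpy as np
from PIL import Image, ImageDraw

TILE_PX = 500                     # 500x500 aerial tile
ORIGIN = (628000.0, 4841000.0)    # hypothetical UTM corner of the tile
GSD_M = 0.05                      # assumed 5 cm per pixel

def world_to_pixel(x, y):
    """Convert world (UTM) coordinates to pixel coordinates in the tile."""
    return ((x - ORIGIN[0]) / GSD_M, (y - ORIGIN[1]) / GSD_M)

# One hypothetical building footprint in world coordinates.
footprint = [(628002.0, 4841003.0), (628010.0, 4841003.0),
             (628010.0, 4841012.0), (628002.0, 4841012.0)]

mask = Image.new("L", (TILE_PX, TILE_PX), 0)                    # 0 = background
ImageDraw.Draw(mask).polygon(
    [world_to_pixel(x, y) for x, y in footprint], fill=1)       # 1 = building

labels = np.asarray(mask)     # pixel-wise ground truth, "for free" from the map
print(labels.sum(), "building pixels")
```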
Together, the rich sources of data enable a plethora of exciting tasks!
Building Footprint Extraction
Road Curb and Centerline Extraction
Building Instance Segmentation
Zoning Prediction (e.g., Institutional, Residential, Commercial)
Technical Difficulties: Misalignment and Data Noise
• Aerial and ground images are misaligned due to raw GPS location data
• Road centerlines are shifted
• Building shapes/locations are not accurate
Data Pre-processing and Alignment: Appearance-based Ground-aerial Alignment
[Figure: panoramas before vs. after alignment]
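As a rough illustration of the general idea of appearance-based alignment (not the authors' actual method), one can search for the 2D shift that maximizes cross-correlation between edge maps derived from the two sources:

```python
# Rough illustration of appearance-based alignment (not the paper's method):
# find the 2D translation that maximizes the cross-correlation between two
# same-size edge maps, e.g. one from ground imagery and one from the aerial view.
import numpy as np
from scipy.signal import fftconvolve

def best_shift(edges_a, edges_b):
    """Return the (dy, dx) translation of edges_b that best matches edges_a."""
    a = edges_a - edges_a.mean()
    b = edges_b - edges_b.mean()
    # Correlation is convolution with a flipped kernel.
    corr = fftconvolve(a, b[::-1, ::-1], mode="same")
    dy, dx = np.unravel_index(np.argmax(corr), corr.shape)
    # Express the peak as an offset from the image center (zero shift).
    return dy - a.shape[0] // 2, dx - a.shape[1] // 2
```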
Data Pre-processing and Alignment: Instance-wise Aerial-map Alignment
[Figure: building instances before vs. after alignment]
Data Pre-processing and Alignment: Robust Road Surface Generation
[Figure: noisy input road curb and centerline vs. polygonized road surface]
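One simple way to get a road-surface polygon from centerlines, sketched below with shapely, is to buffer each centerline by an assumed half-width and merge the strips; the paper's robust procedure is more involved, so treat this only as an illustration.

```python
# Illustration only: buffer each road centerline by an assumed half-width and
# merge the strips into one road-surface polygon. The widths and segments are
# made up; the paper's robust generation handles noisy curbs and centerlines.
from shapely.geometry import LineString
from shapely.ops import unary_union

centerlines = [LineString([(0, 0), (100, 0)]),       # hypothetical segments (m)
               LineString([(50, -50), (50, 50)])]
HALF_WIDTH = 3.5                                     # assumed half road width (m)

# Buffering with flat caps turns each centerline into a road strip; the union
# merges overlapping strips, so intersections come out as one clean polygon.
surface = unary_union([c.buffer(HALF_WIDTH, cap_style=2) for c in centerlines])
print(round(surface.area, 1), "m^2 of road surface")
```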
Pilot Study with Neural Networks: Building Contour and Road Curb/Centerline Extraction
[Figure: ground truth (GT) vs. ResNet predictions]
Pilot Study with Neural Networks: Semantic Segmentation

Method      Road     Building   Mean
FCN         74.94%   73.88%     74.41%
ResNet-56   82.72%   78.80%     80.76%

Metric: Intersection-over-Union (IoU); higher is better
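For reference, a minimal sketch of the IoU metric used above, computed for one class of a predicted label mask:

```python
# Minimal sketch of per-class Intersection-over-Union for label masks.
import numpy as np

def iou(pred, gt, cls):
    """IoU of class `cls` between two integer label masks of equal shape."""
    p, g = pred == cls, gt == cls
    union = np.logical_or(p, g).sum()
    return np.logical_and(p, g).sum() / union if union else float("nan")
```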
Pilot Study with Neural Networks: Building Instance Segmentation
[Figure: input aerial image vs. Deep Watershed Transform (DWT) predictions]

Method                     Weighted Coverage   Average Precision   Recall-50%   Precision-50%
FCN                        41.92%              11.37%              21.50%       36.00%
ResNet-56                  40.65%              12.13%              18.90%       45.36%
Deep Watershed Transform   56.22%              21.22%              67.16%       63.67%

Metric: Weighted Coverage, Average Precision, Recall-50%, Precision-50%; higher is better

Join the other talk today to learn more about deep watershed instance segmentation: Wednesday, May 10, 4:00 PM - 4:25 PM, Room 210G
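A sketch of how Precision-50% and Recall-50% can be computed: a predicted instance counts as a true positive if it overlaps an unmatched ground-truth instance with mask IoU above 0.5. The greedy matching here is my simplification; the benchmark's exact protocol may differ.

```python
# Sketch of Precision-50% / Recall-50%: a prediction is a true positive if it
# overlaps an unmatched ground-truth instance with mask IoU > 0.5. Greedy
# matching is a simplification of the benchmark's protocol.
import numpy as np

def mask_iou(a, b):
    """IoU of two boolean instance masks."""
    union = np.logical_or(a, b).sum()
    return np.logical_and(a, b).sum() / union if union else 0.0

def precision_recall_50(pred_masks, gt_masks):
    matched, tp = set(), 0
    for p in pred_masks:
        for i, g in enumerate(gt_masks):
            if i not in matched and mask_iou(p, g) > 0.5:
                matched.add(i)
                tp += 1
                break
    precision = tp / len(pred_masks) if pred_masks else 0.0
    recall = tp / len(gt_masks) if gt_masks else 0.0
    return precision, recall
```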
Pilot Study with Neural Networks: Ground-view Road Segmentation
[Figure: true positives in yellow, false negatives in green, false positives in red]

Method      Non-Road IoU   Road IoU   Mean IoU
FCN         97.3%          95.8%      96.5%
ResNet-56   97.8%          96.6%      97.2%

Metric: Intersection-over-Union; higher is better
Pilot Study with Neural Networks: Ground-view Zoning Classification

Method      From Scratch   Pre-trained on ImageNet
AlexNet     66.48%         75.49%
GoogLeNet   75.08%         77.95%
ResNet      75.65%         79.33%

Metric: Top-1 accuracy; higher is better
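The "pre-trained from ImageNet" column corresponds to the standard fine-tuning recipe, sketched below with torchvision; the network choice, classifier head, and class count are illustrative assumptions, not the authors' training code.

```python
# Generic "pre-trained from ImageNet" recipe with torchvision (not the
# authors' code): load ImageNet weights, swap the classifier head for the
# zoning classes, then fine-tune. NUM_ZONING_CLASSES is a placeholder.
import torch.nn as nn
from torchvision import models

NUM_ZONING_CLASSES = 6                         # hypothetical class count

model = models.resnet18(pretrained=True)       # ImageNet initialization
model.fc = nn.Linear(model.fc.in_features, NUM_ZONING_CLASSES)
# Fine-tune with cross-entropy on ground-level panoramas; the "From Scratch"
# column corresponds to pretrained=False with the same training setup.
```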
Statistics
• Number of buildings: 397,846
• Total area: 712.5 km²
• Total length of road: 8,439 km
[Figures: building height distribution; zoning type distribution]
Conclusion
• We propose a large-scale dataset captured from different views and sensors
• Maps are used to create ground-truth annotations
• Many more exciting tasks are coming in the future
• Check our paper for more details: https://arxiv.org/abs/1612.00423
• Data available soon. Stay tuned, and you are welcome to over-fit!