12
NSF Industry/University Cooperative Research Center High Performance Database Research Center Naphtali D. Rishe, Director, [email protected] HPDRC.FIU.edu TerraFly.com

NSF Industry/University Cooperative Research Center High Performance Database Research Center Naphtali D. Rishe, Director, [email protected] HPDRC.FIU.edu

Embed Size (px)

Citation preview

Page 1: NSF Industry/University Cooperative Research Center High Performance Database Research Center Naphtali D. Rishe, Director, rishe@fiu.edu HPDRC.FIU.edu

NSF Industry/University Cooperative Research Center

High Performance Database Research CenterNaphtali D. Rishe, Director, [email protected] TerraFly.com

Page 2: NSF Industry/University Cooperative Research Center High Performance Database Research Center Naphtali D. Rishe, Director, rishe@fiu.edu HPDRC.FIU.edu

HPDRC Expertise:HPDRC Expertise:

• Data visualization

• Spatial databases

• Internet-distributed heterogeneous databases

• Database design methodologies

• Information analysis

• GIS

• Location Data

• Health Informatics

Director: Dr. Naphtali David Rishe Director: Dr. Naphtali David Rishe

The Inaugural Outstanding University Professor of FIU

Awards: $40M, Patents: 4, Books: 4, Papers: 250

HPDRC.FIU.edu TerraFly.com

Page 3: NSF Industry/University Cooperative Research Center High Performance Database Research Center Naphtali D. Rishe, Director, rishe@fiu.edu HPDRC.FIU.edu

GIS Solutions Based On Geo-Spatial Research GIS Solutions Based On Geo-Spatial Research TechnologyTechnology• Next generation GIS: Internet geo-visualization & spatial

cloud-computing platform and service

• Advanced geo-spatial computing engine

• Open architecture, API provided

• 40 TB database of aerial imagery and spatial data

• Rich datasets in a user-friendly environment

• Professionally customizable to domain requirements

• NASA, NSF, IBM and USGS funded technology

HPDRC.FIU.edu TerraFly.com

Page 4: NSF Industry/University Cooperative Research Center High Performance Database Research Center Naphtali D. Rishe, Director, rishe@fiu.edu HPDRC.FIU.edu

Web-BasedWeb-BasedWeb-BasedWeb-Based

Open Architecture APIOpen Architecture APIOpen Architecture APIOpen Architecture API

Geo-Spatial Mapping SolutionGeo-Spatial Mapping SolutionGeo-Spatial Mapping SolutionGeo-Spatial Mapping Solution

HPDRC.FIU.edu TerraFly.com

Page 5: NSF Industry/University Cooperative Research Center High Performance Database Research Center Naphtali D. Rishe, Director, rishe@fiu.edu HPDRC.FIU.edu

Address LocatorAddress LocatorAddress LocatorAddress Locator

Vicinity InformationVicinity InformationVicinity InformationVicinity Information

Demographic DataDemographic DataDemographic DataDemographic Data

Nationwide Layers: 2000 US Census, Businesses, Schools, Travel, Features (airports, heliports, public buildings, churches, hospitals, libraries, post offices, towers, tunnels, water, etc.), Property Lines and much more

Nationwide Layers: 2000 US Census, Businesses, Schools, Travel, Features (airports, heliports, public buildings, churches, hospitals, libraries, post offices, towers, tunnels, water, etc.), Property Lines and much more

HPDRC.FIU.edu TerraFly.com

Page 6: NSF Industry/University Cooperative Research Center High Performance Database Research Center Naphtali D. Rishe, Director, rishe@fiu.edu HPDRC.FIU.edu

HPDRC.FIU.edu TerraFly.com

Page 7: NSF Industry/University Cooperative Research Center High Performance Database Research Center Naphtali D. Rishe, Director, rishe@fiu.edu HPDRC.FIU.edu

Your System Your System Your DataYour Data

TerraFly TerraFly systemsystem

TerraFly TerraFly IntegrationIntegration

Solution Solution

HPDRC.FIU.edu TerraFly.com

Page 8: NSF Industry/University Cooperative Research Center High Performance Database Research Center Naphtali D. Rishe, Director, rishe@fiu.edu HPDRC.FIU.edu

Query Date Range

Time Series

Animation Player

Census Block

Groups are Selectable

Layer Control

HPDRC.FIU.edu TerraFly.com

Page 9: NSF Industry/University Cooperative Research Center High Performance Database Research Center Naphtali D. Rishe, Director, rishe@fiu.edu HPDRC.FIU.edu

MapReduce use casesMapReduce use cases● Spatial data indexing [SSDBM’09]● Geospatial query support [SSDBM’10]● Parallel spatial data processing

[GrC’10]● Parallel set-similarity spatial joins● Real Estate data analytics

HPDRC.FIU.edu TerraFly.com

Page 10: NSF Industry/University Cooperative Research Center High Performance Database Research Center Naphtali D. Rishe, Director, rishe@fiu.edu HPDRC.FIU.edu

● Objective: Provide computational analytics for estimating how an event influences property values

● Dataset: Miami-Dade county records (~ 20M)● Community Boundaries: U.S. Census Blocks and Tracts

(polygons)● Property transaction geo-database: Join of property

public records (Deeds, Tax Roll)

HPDRC.FIU.edu TerraFly.com

Parcel

Date Value Area Type Location

...

1001 03-10-2005 100K 840 Condo 1,2

1002 03-15-2005 130K 2000 Single-Family 2,3

1003 12-22-2005 80K 1085 Single-Family 2,2

… … … … … …

Page 11: NSF Industry/University Cooperative Research Center High Performance Database Research Center Naphtali D. Rishe, Director, rishe@fiu.edu HPDRC.FIU.edu

● MethodMethod• Virtual Community (VC): a set of homes with similar

characteristics within a geographical area• Consider unit price per VC, e.g. median of $/sq-foot• Compute VC’s unit price rate change to compare communities• Hadoop MapReduce: Temporal, in parallel, self-join of the

dataset to compute property value rate change

Parcel Date Value … Loc.

1001 03-10-2005 100K 1,2

1002 03-15-2005 130K 2,3

1003 05-22-2005 80K 3,2

1004 06-20-2005 80K 4,2

1005 07-10-2006 100K 5,2

1006 03-15-2005 130K 6,3

1007 12-22-2005 80K 7,2

1008 12-22-2005 80K 8,2

… … … …

9000 03-15-2005 130K 9,3

9001 12-22-2005 80K 9,2

9002 01-22-2006 80K 2,2

9003 02-10-2006 100K 1,2

9004 03-15-2005 130K 2,3

9005 12-22-2005 80K 2,2

9006 12-22-2005 80K 9,2

Reduce

Map

Map

(Partitions records by: Community, Type, Date)

(Computes community values and change

rate)

Reduce

Community Rate Value …

300 -10% 100

301 -60% 80

302 15% 55

303 2% 60

… … …

Input: dataset, tstart, tend

Output: Community value change rate

Page 12: NSF Industry/University Cooperative Research Center High Performance Database Research Center Naphtali D. Rishe, Director, rishe@fiu.edu HPDRC.FIU.edu

Hurricane Andrew, August 1992, Q2’92 vs. Q4’92

Drop

Raise

No change

HPDRC.FIU.edu TerraFly.com