Spatial Indexing

Spatial Queries

Given a collection of geometric objects (points, lines, polygons, ...)

organize them on disk, to answer: point queries, range queries, k-nn queries, and spatial joins (‘all pairs’ queries)

Spatial Joins

Spatial joins: find (quickly) all counties intersecting

R-trees – spatial join

We assume that both datasets are organized in R-trees, using their MBRs. Find the MBRs that intersect; then check the original objects.

R-tree – Spatial Joins

SPJ1(T1, T2)
  for each parent P1 of tree T1
    for each parent P2 of tree T2
      if their MBRs intersect, process them recursively (i.e., check their children)

We assume that the trees have the same height. The traversal is done in DFS order.
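The SPJ1 traversal can be sketched in Python as follows. This is a minimal in-memory illustration, not a disk-based implementation: the `Node` class, the `(xmin, ymin, xmax, ymax)` rectangle layout, and the `obj_id` field are assumptions made for the example, and data entries are modeled as nodes without children.

```python
from dataclasses import dataclass, field
from typing import List, Tuple

Rect = Tuple[float, float, float, float]  # (xmin, ymin, xmax, ymax), assumed layout


def intersects(a: Rect, b: Rect) -> bool:
    """True if the two rectangles share at least one point."""
    return a[0] <= b[2] and b[0] <= a[2] and a[1] <= b[3] and b[1] <= a[3]


@dataclass
class Node:
    """Hypothetical R-tree node; a node with no children is a data entry."""
    mbr: Rect
    children: List["Node"] = field(default_factory=list)
    obj_id: int = -1  # meaningful only for data entries


def spj1(n1: Node, n2: Node, out: List[Tuple[int, int]]) -> None:
    """DFS over both trees at once, assuming they have the same height."""
    for c1 in n1.children:
        for c2 in n2.children:
            if not intersects(c1.mbr, c2.mbr):
                continue  # prune: nothing below c1 can meet anything below c2
            if not c1.children and not c2.children:
                # Both are data entries: report a candidate pair.
                # The original objects still have to be checked afterwards.
                out.append((c1.obj_id, c2.obj_id))
            else:
                spj1(c1, c2, out)
```

Calling `spj1(root1, root2, pairs)` with an empty list `pairs` fills it with candidate pairs of object ids whose MBRs intersect.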

Optimization (SPJ2): first compute the intersection of the MBRs of the two nodes from T1 and T2. Then check for intersection only the child rectangles that overlap this common region.

Huge improvement on CPU time!
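A minimal sketch of the SPJ2 restriction for one pair of nodes, assuming rectangles are `(xmin, ymin, xmax, ymax)` tuples and each node is given as its MBR plus a list of child rectangles (the function names are illustrative, not taken from the slides):

```python
def intersects(a, b):
    """True if rectangles a and b share at least one point."""
    return a[0] <= b[2] and b[0] <= a[2] and a[1] <= b[3] and b[1] <= a[3]


def intersection(a, b):
    """Common region of two rectangles, or None if they are disjoint."""
    xmin, ymin = max(a[0], b[0]), max(a[1], b[1])
    xmax, ymax = min(a[2], b[2]), min(a[3], b[3])
    if xmin > xmax or ymin > ymax:
        return None
    return (xmin, ymin, xmax, ymax)


def spj2_pairs(mbr1, entries1, mbr2, entries2):
    """Index pairs (i, j) of intersecting children, testing only the
    children that overlap the intersection of the two node MBRs."""
    common = intersection(mbr1, mbr2)
    if common is None:
        return []
    # Children lying entirely outside the common region cannot join.
    keep1 = [(i, r) for i, r in enumerate(entries1) if intersects(r, common)]
    keep2 = [(j, s) for j, s in enumerate(entries2) if intersects(s, common)]
    return [(i, j) for i, r in keep1 for j, s in keep2 if intersects(r, s)]
```

The filtering pass is linear in the number of children, while the pairwise test that follows runs only over the survivors, which is where the CPU saving comes from.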

Is there any way to do better? Yes, using plane sweep! To check n rectangles for intersection, the naïve approach costs O(n^2); with plane sweep it is O(n log n), plus the number of pairs reported.

Move a vertical line (sweep line) from left to right. Objects are sorted by their x-coordinate, and every time the line reaches a new object we do some processing.
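The sweep can be sketched as below for two lists of rectangles. This is one simple variant, written under the assumption that rectangles are `(xmin, ymin, xmax, ymax)` tuples; it sorts both lists by `xmin` and, for each rectangle the sweep line reaches, scans forward in the other list only while the x-ranges can still overlap. It does not use an interval tree, so its worst case is not O(n log n); it is meant to show the idea only.

```python
def sweep_pairs(rs, ss):
    """Index pairs (i, j) such that rs[i] intersects ss[j]."""
    R = sorted(range(len(rs)), key=lambda i: rs[i][0])  # order by xmin
    S = sorted(range(len(ss)), key=lambda j: ss[j][0])
    out = []
    i = j = 0
    while i < len(R) and j < len(S):
        r, s = rs[R[i]], ss[S[j]]
        if r[0] <= s[0]:
            # The sweep line reaches r first: scan S while xmin <= r.xmax.
            k = j
            while k < len(S) and ss[S[k]][0] <= r[2]:
                t = ss[S[k]]
                if r[1] <= t[3] and t[1] <= r[3]:  # y-ranges overlap
                    out.append((R[i], S[k]))
                k += 1
            i += 1
        else:
            # The sweep line reaches s first: scan R while xmin <= s.xmax.
            k = i
            while k < len(R) and rs[R[k]][0] <= s[2]:
                t = rs[R[k]]
                if s[1] <= t[3] and t[1] <= s[3]:
                    out.append((R[k], S[j]))
                k += 1
            j += 1
    return out
```

Each intersecting pair is reported exactly once, at the moment the sweep line reaches whichever of the two rectangles starts further left.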

What happens if only one relation has an index?

Build another index on the other relation, then join.

Use the first tree to build the second one: since we want to compute the join, we can filter out some rectangles during the construction of the second tree!

Spatial Joins

Similar idea if we have z-ordering/quadtrees: merge the lists of z-values, using the properties of z-values (10* encloses 1001).
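The enclosure property means that one z-value is a prefix of the other, so a merge over the two lists can find all enclosing pairs with a stack of currently "open" prefixes. The sketch below assumes z-values are stored as bit strings without the trailing `*` (so `10*` is written `"10"`); `z_join` is a hypothetical name, and for brevity it sorts the concatenated lists instead of merging two pre-sorted ones.

```python
def z_join(zs1, zs2):
    """Pairs (z1, z2), z1 from zs1 and z2 from zs2, where one z-value is a
    prefix of the other (i.e., one cell encloses the other)."""
    # Tag each z-value with its source list, then process in sorted order.
    # In lexicographic order a prefix comes before all of its extensions.
    items = sorted([(z, 1) for z in zs1] + [(z, 2) for z in zs2])
    stack = []  # chain of nested prefixes enclosing the current z-value
    out = []
    for z, src in items:
        # Drop cells that do not enclose z; they cannot enclose later values.
        while stack and not z.startswith(stack[-1][0]):
            stack.pop()
        # Every cell left on the stack encloses z.
        for pz, psrc in stack:
            if psrc != src:
                out.append((pz, z) if psrc == 1 else (z, pz))
        stack.append((z, src))
    return out
```

For example, `z_join(["10"], ["1001"])` pairs `"10"` with `"1001"`, mirroring the slide's "10* encloses 1001".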

R-trees - performance analysis

How many disk (= node) accesses will we need for range queries, nn queries, and spatial joins?

Why does it matter? A: because we can design split etc. algorithms accordingly; also, do query-optimization

motivating question: on, e.g., split, should we try to minimize the area (volume)? the perimeter? the overlap? or a weighted combination? why?

How many disk accesses for range queries? It depends on the query distribution:

wrt location: uniform (or biased)

wrt size: uniform

easier case: we know the positions of parent MBRs, eg:

How many times will P1 be retrieved (unif. POINT queries)? A: x1*x2, where x1 and x2 are the side lengths of P1. With the address space normalized to the unit square, this is the area of P1, i.e., the fraction of point queries that fall inside it.

How many times will P1 be retrieved (unif. queries of size q1 x q2)? A: (x1+q1)*(x2+q2), the area of the Minkowski sum of P1 and the query rectangle: a q1 x q2 query touches P1 exactly when a point query would fall inside P1 enlarged by q1 and q2.
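The Minkowski-sum argument can be checked numerically with a small Monte Carlo simulation. The setup is an assumption made for illustration: the address space is the unit square, the lower-left corner of each query is drawn uniformly from it, and P1 lies far enough from the borders that boundary effects can be ignored. `hit_fraction` is a hypothetical helper name.

```python
import random


def hit_fraction(p1, q1, q2, trials=100_000, seed=0):
    """Fraction of random q1 x q2 queries that intersect node P1.

    p1 = (px, py, x1, x2): lower-left corner and side lengths of P1.
    """
    px, py, x1, x2 = p1
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    hits = 0
    for _ in range(trials):
        a, b = rng.random(), rng.random()  # lower-left corner of the query
        # The query [a, a+q1] x [b, b+q2] meets P1 iff both ranges overlap.
        if a <= px + x1 and px <= a + q1 and b <= py + x2 and py <= b + q2:
            hits += 1
    return hits / trials


# Node with sides x1 = 0.2, x2 = 0.1; queries of size 0.1 x 0.2.
estimate = hit_fraction((0.4, 0.5, 0.2, 0.1), 0.1, 0.2)
predicted = (0.2 + 0.1) * (0.1 + 0.2)
```

With enough trials, `estimate` should come out close to `predicted`; setting `q1 = q2 = 0` recovers the point-query case, where the fraction approaches the area x1*x2.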

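Thus, given a tree with N nodes (i=1, ... N) we expect

\[
\begin{aligned}
\#\mathrm{DiskAccesses}(q_1, q_2) &= \sum_{i=1}^{N} (x_{i,1} + q_1)(x_{i,2} + q_2) \\
&= \sum_{i=1}^{N} x_{i,1}\, x_{i,2} \;+\; q_2 \sum_{i=1}^{N} x_{i,1} \;+\; q_1 \sum_{i=1}^{N} x_{i,2} \;+\; q_1 q_2 N
\end{aligned}
\]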

In this sum, the first term (the sum of the products xi,1 * xi,2) is the total ‘volume’ (area) of the nodes, and the two middle terms, which involve the sums of the side lengths, correspond to their surface area (perimeter).
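As a small sketch of how the formula can be evaluated, the two functions below compute the expected number of disk accesses both directly and in the expanded four-term form. The node list is made-up example data, with each node given by its side lengths `(x1, x2)` in a unit-square address space.

```python
def expected_accesses(nodes, q1, q2):
    """Sum over nodes of (x1 + q1) * (x2 + q2)."""
    return sum((x1 + q1) * (x2 + q2) for x1, x2 in nodes)


def expected_accesses_expanded(nodes, q1, q2):
    """Same quantity, split into volume, side-length, and count terms."""
    volume = sum(x1 * x2 for x1, x2 in nodes)   # total area of the nodes
    sum_x1 = sum(x1 for x1, _ in nodes)         # total horizontal length
    sum_x2 = sum(x2 for _, x2 in nodes)         # total vertical length
    return volume + q2 * sum_x1 + q1 * sum_x2 + q1 * q2 * len(nodes)


# Hypothetical tree with two nodes, queried with 0.1 x 0.2 rectangles.
nodes = [(0.2, 0.1), (0.4, 0.3)]
direct = expected_accesses(nodes, 0.1, 0.2)
expanded = expected_accesses_expanded(nodes, 0.1, 0.2)
```

Both forms agree up to floating-point rounding. Comparing a square node `(0.2, 0.2)` with an elongated node of the same area, `(0.8, 0.05)`, for `q1 = q2 = 0.1` shows the effect of the perimeter terms: the elongated node is expected to be retrieved more often.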

Observations:

for point queries: only volume matters

for horizontal-line queries (q2=0): vertical length matters

for large queries (q1, q2 >> 0): the count N matters

Observations (cont’ed):

overlap: does not seem to matter

formula: easily extendible to n dimensions

(for even more details: [Pagel+, PODS93], [Kamel+, CIKM93])

Conclusions:

splits should try to minimize area and perimeter, i.e., we want few, small, square-like parent MBRs

rule of thumb: shoot for queries with q1 = q2 = 0.1 (or = 0.5 or so)

Range queries - how many disk accesses, if we just know that we have N points in n-d space? A: cannot tell! We need to know the distribution.

What are obvious and/or realistic distributions?

A: uniform

A: Gaussian / mixture of Gaussians

A: self-similar / fractal (fractal dimension ~ intrinsic dimension)

Formulas for range queries and k-nn queries: use fractal dimension [Kamel+, PODS94], [Korn+ ICDE2000] [Kriegel+, PODS97]

Formulas for spatial joins of regions: open research question

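Assuming uniform distribution, [TS96] gives a formula for the expected number of disk accesses DA of a query of size q1 x q2, in terms of N, D and f, where D is the density of the dataset and f the fanout. [Equation garbled in the source; see [TS96].]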
