Spatial Indexing I Point Access Methods. Spatial Indexing Point Access Methods (PAMs) vs Spatial...

Spatial Indexing I

Point Access Methods

Spatial Indexing Point Access Methods (PAMs) vs Spatial

Access Methods (SAMs) PAM: index only point data

Hierarchical (tree-based) structures Multidimensional Hashing Space filling curve

SAM: index both points and regions Transformations Overlapping regions Clipping methods (non-overlapping)

Data partitioning vs Space partitioning

The problem Given a point set and a rectangular query, find

the points enclosed in the query We allow insertions/deletions on line

Grid File Hashing methods for

multidimensional points (extension of Extensible hashing)

Idea: Use a grid to partition the space each cell is associated with one page

Two disk access principle (exact match)

Grid File

Start with one bucket for the whole space.

Select dividers along each dimension. Partition space into cells

Dividers cut all the way.

Grid File Each cell corresponds

to 1 disk page. Many cells can point to

the same page. Cell directory

potentially exponential in the number of dimensions

Grid File Implementation

Dynamic structure using a grid directory Grid array: a 2 dimensional array with

pointers to buckets (this array can be large, disk resident) G(0,…, nx-1, 0, …, ny-1)

Linear scales: Two 1 dimensional arrays that used to access the grid array (main memory) X(0, …, nx-1), Y(0, …, ny-1)

Example

Linear scale X

Linear scale

Grid Directory

Buckets/Disk

Blocks

Grid File Search

Exact Match Search: at most 2 I/Os assuming linear scales fit in memory.

First use liner scales to determine the index into the cell directory

access the cell directory to retrieve the bucket address (may cause 1 I/O if cell directory does not fit in memory)

access the appropriate bucket (1 I/O) Range Queries:

use linear scales to determine the index into the cell directory.

Access the cell directory to retrieve the bucket addresses of buckets to visit.

Access the buckets.

Grid File Insertions

Determine the bucket into which insertion must occur.

If space in bucket, insert. Else, split bucket

how to choose a good dimension to split? If bucket split causes a cell directory to split do

so and adjust linear scales. insertion of these new entries potentially

requires a complete reorganization of the cell directory--- expensive!!!

Grid File Deletions Deletions may decrease the space

utilization. Merge buckets We need to decide which cells to merge

and a merging threshold Buddy system and neighbor system

A bucket can merge with only one buddy in each dimension

Merge adjacent regions if the result is a rectangle

Tree-based PAMs Most of tb-PAMs are based on kd-tree kd-tree is a main memory binary tree

for indexing k-dimensional points Needs to be adapted for the disk model

Levels rotate among the dimensions, partitioning the space based on a value for that dimension

kd-tree is not necessarily balanced

kd-tree At each level we use a different dimension

y=3 y=6

x<5 x>=5

Kd-tree properties Height of the tree O(log2 n) Search time for exact match:

O(log2 n) Search time for range query: O(n1/2

kd-tree example

y=5 y=6

x=8 x=7

X=5 X=8

X=7X=3

External memory kd-trees (kdB-tree)

Pack many interior nodes (forming a subtree) into a block using BFS-travesal.

it may not be feasible to group nodes at lower level into a block productively.

Many interesting papers on how to optimally pack nodes into blocks recently published.

Similar to B-tree, tree nodes split many ways instead of two ways

insertion becomes quite complex and expensive. No storage utilization guarantee since when a

higher level node splits, the split has to be propagated all the way to leaf level resulting in many empty blocks.

LSD-tree Local Split Decision – tree Use kd-tree to partition the space.

Each partition contains up to B points. The kd-tree is stored in main-memory.

If the kd-tree (directory) is large, we store a sub-tree on disk

Goal: the structure must remain balanced: external balancing property

Example: LSD-tree

x1 x2 x3

N2 N6 N7

N2 N3 N4 N5 N6 N7N1 N8

y:y1 y:y2

x:x2 x:x3

y:y4y:y3

buckets

Spatial Indexing I Point Access Methods. Spatial Indexing Point Access Methods (PAMs) vs Spatial...

Documents

Spatial Indexing II

Spatial and Geographic Databases. Spatial databases store information related to spatial locations, and support efficient storage, indexing and querying

Spatial Indexing for Data Searching in Mobile Sensing

AN OVERVIEW OF SPATIAL INDEXING WITHIN RDBMS

Multidimensional Indexing: Spatial Data Management & High Dimensional Indexing

2003 June 301 Spatial Indexing Clive Page. 2003 June 302

VO Course 11: Spatial indexing

Spatial Indexing for Data Searching in Mobile Sensing ...epubs.surrey.ac.uk/841473/1/sensors-17-01427.pdf · Spatial Indexing for Data Searching in Mobile Sensing Environments Yuchao

Distributed Spatial Indexing for the Internet of Things ...epubs.surrey.ac.uk/813879/13/PID4703763.pdf · A. Spatial indexing Building indexing structure for single-dimensional data

Chap4: Spatial Storage and Indexing 4.1 Storage:Disk and Files 4.2 Spatial Indexing 4.3 Trends 4.4 Summary

Complexity of Adaptive Spatial Indexing for Robust Distributed Data

Multidimensional Indexing: Spatial Data Management & High Dimensional Indexing

Spatial Indexing SAMs. Spatial Indexing Point Access Methods can index only points. What about regions? Z-ordering and quadtrees Use the transformation

Spatial Information Systems (SIS) COMP 30110 Spatial access methods: Indexing

Large Spatial Database Indexing with aX-tree

Spatial Databases and Spatial Indexing Techniques - … Databases and Spatial Indexing Techniques ... Spatial Databases and Spatial Indexing Techniques ... •Query Processing and

Ordered indexing methods for data streaming applications

Simonas Šaltenis Aalborg Universitypeople.cs.aau.dk/~torp/Teaching/E10/presentations/dat5_10_rtree.pdf · Simonas Šaltenis Aalborg University daisy.aau.dk Spatial Indexing Indexing

New SPECIAL MACHINES 512 October 2019 · 2020. 2. 6. · Plain indexing head Universal indexing head Optical indexing head Methods of indexing Direct indexing Simple or plain indexing

A Sorting Approach to Indexing Spatial Datahjs/pubs/ijsm08.pdf · 2. Methods Based on Spatial Occupancy The indexing methods that are based on sorting the spatial objects by spatial