6
Implementing Deduplication Tool For Storage Device Using Sparse Indexing

Chunking ………… Data Stream ………… ………… Chunks (static size) Chunks (variable size)

Embed Size (px)

Citation preview

Page 1: Chunking ………… Data Stream ………… ………… Chunks (static size) Chunks (variable size)

Implementing Deduplication Tool For Storage Device Using

Sparse Indexing

Page 2: Chunking ………… Data Stream ………… ………… Chunks (static size) Chunks (variable size)

Chunking

Introduction

……

Data Stream

……

……

Chunks(static size)

Chunks(variable size)

Page 3: Chunking ………… Data Stream ………… ………… Chunks (static size) Chunks (variable size)

Fingerprinting-Deduplicating

Introduction

A

Storage

A G T

DataNew Data

Compare

If ‘A’ exists in the storage, ‘A’ won’t be written.

A C ……

Remove a copyand Share ‘A’.

Original Data

Fingerprinting(using hash function)

Page 4: Chunking ………… Data Stream ………… ………… Chunks (static size) Chunks (variable size)

Overall Architecture

Dataset- 파일의 모음

- (*.exe, *.txt, *.jpg, *.mp3 …)

중복 데이터 분석 툴(CHUNKING - FINGERPRINTING -

SAMPLING - DEDUPLICATING)

입력

출력

분석 결과(deduplication factor, elpased

time …by variable criteria)

Experiment Criteria- Chunk Size(4kb, 8kb…)- Segment Size- Sampling Rate- Variability of Chunk, Seg-ment

Page 5: Chunking ………… Data Stream ………… ………… Chunks (static size) Chunks (variable size)

Windows XP, 7C++

Development Environment

Page 6: Chunking ………… Data Stream ………… ………… Chunks (static size) Chunks (variable size)

Schedule

1주 2주 3주 4주

3 월 Background Study(Reading Paper… ETC)

4 월 Implementing & Debugging Deduplication Tool

5 월 Implementing & Debug-ging

Experiments & Report

6 월