Challenges in Using Lifetime Personal Information Stores based on MyLifeBits

Gordon Bell, Jim Gemmell, Roger Lueder

SIGIR, University of Sheffield, July 26, 2004

“I have watched as hundreds of millions of dollars have been invested to re-invent the wheel - often badly.”

- Marcia Bates

The 1 TB Life

1 TB gives you 65+ years of:

• 100 email messages a day (5 KB each)
• 100 web pages a day (50 KB each)
• 5 scanned pages a day (100 KB each)
• 1 book every 10 days (1 MB each)
• 10 photos per day (400 KB JPEG each)
• 8 hours per day of sound, e.g. telephone, voice annotations, and meeting recordings (8 Kb/s)
• 1 new music CD every 10 days (45 min each at 128 Kb/s)

It will take you 5 years to fill up your 80 GB drive.

Want video? Buy more cheap drives (1 TB/year lets you record 4 hours/day of 1.5 Mb/s video).
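The figures on this slide can be checked with a few lines of arithmetic. The sketch below is only a back-of-the-envelope check, assuming decimal units (1 KB = 1000 bytes) and 365 recording days per year; the helper `stream_bytes` and the variable names are illustrative and do not come from the talk.

```python
# Back-of-the-envelope check of the "1 TB Life" numbers.
# Assumption: decimal units (1 KB = 1000 bytes), 365 days per year.
KB, MB, GB, TB = 10**3, 10**6, 10**9, 10**12


def stream_bytes(hours, kilobits_per_second):
    """Bytes produced by recording `hours` of media at the given bit rate."""
    return hours * 3600 * kilobits_per_second * 1000 / 8


# Average bytes captured per day, item by item, as listed on the slide.
DAILY_BYTES = {
    "email": 100 * 5 * KB,
    "web pages": 100 * 50 * KB,
    "scanned pages": 5 * 100 * KB,
    "books": 1 * MB / 10,                        # 1 book every 10 days
    "photos": 10 * 400 * KB,
    "sound": stream_bytes(8, 8),                 # 8 hours at 8 Kb/s
    "music CDs": stream_bytes(0.75, 128) / 10,   # 45 min at 128 Kb/s, every 10 days
}

daily_total = sum(DAILY_BYTES.values())
yearly_total = daily_total * 365
years_per_tb = TB / yearly_total
years_per_80gb = 80 * GB / yearly_total
video_per_year = stream_bytes(4, 1500) * 365     # 4 hours/day at 1.5 Mb/s

print(f"per day:  {daily_total / MB:.2f} MB")
print(f"per year: {yearly_total / GB:.2f} GB")
print(f"1 TB lasts about {years_per_tb:.0f} years")
print(f"80 GB lasts about {years_per_80gb:.1f} years")
print(f"video adds about {video_per_year / TB:.2f} TB per year")
```

Under these assumptions the non-video load is roughly 43 MB per day, which fills 80 GB in about 5 years and 1 TB in about 63 years, in the same range as the slide's "65+ years". Sound dominates the total; video alone comes to just under 1 TB per year.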

Everything goes in a database

You need all the features of a database (consistency, indexing, pivoting, queries, speed/scalability, backup, replication).

If you don’t use one, you will find yourself creating one!

Files as blobs, also sync with file system for legacy apps.

SQL
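As a rough illustration of "files as blobs" plus indexing and queries, here is a minimal sketch using Python's built-in `sqlite3` module. It is not the MyLifeBits schema: the `item` table, its columns, the helper names, and the sample rows are all assumptions made for the example.

```python
import sqlite3

# Hypothetical single-table store: every captured item is a row,
# with the file content kept as a BLOB and an index on capture time.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE item (
    id      INTEGER PRIMARY KEY,
    kind    TEXT NOT NULL,      -- e.g. 'email', 'photo', 'web page'
    title   TEXT,
    created TEXT NOT NULL,      -- ISO 8601 timestamp, sorts chronologically
    content BLOB
);
CREATE INDEX item_created ON item (created);
""")


def add_item(kind, title, created, content):
    """Store one item and return its database id."""
    cur = conn.execute(
        "INSERT INTO item (kind, title, created, content) VALUES (?, ?, ?, ?)",
        (kind, title, created, content),
    )
    conn.commit()
    return cur.lastrowid


def items_between(start, end):
    """Return (id, kind, title) for items created in [start, end)."""
    rows = conn.execute(
        "SELECT id, kind, title FROM item "
        "WHERE created >= ? AND created < ? ORDER BY created",
        (start, end),
    )
    return rows.fetchall()


# Made-up sample data, for illustration only.
add_item("photo", "Farewell lunch", "2000-08-25T12:30:00", b"\xff\xd8")
add_item("email", "Re: slides", "2004-07-20T09:15:00", b"Hello")
print(items_between("2000-01-01", "2001-01-01"))
```

A real store would also need the backup, replication, and file-system synchronization the slide mentions; none of that is attempted here.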

MyLifeBits Software

[Architecture diagram. Recoverable labels: MyLifeBits store (database, files); voice annotation tool; text annotation tool; telephone capture tool; TV capture tool; TV EPG download tool; radio capture & EPG; PocketPC transfer tool; PocketRadio player; import files; MyLifeBits shell; legacy applications; browser tool; Internet; IM capture; MAPI interface; legacy email client; GPS import & map display; SenseCam; screen saver.]

Memex

As We May Think, Vannevar Bush, 1945

“A memex is a device in which an individual stores all his books, records, and communications, and which is mechanized so that it may be consulted with exceeding speed and flexibility”

Full-text search, text & audio annotations, and hyperlinks

I am data

The guinea pig

Gordon Bell is digitizing his life. Has now scanned virtually all:

• Books written (and read when possible)
• Personal documents (correspondence including memos and email, bills, legal documents, papers written, …)
• Photos
• Posters, paintings, photo of things (artifacts, …medals, plaques)
• Home movies and videos
• CD collection
• And, of course, all PC files

Now recording: phone, radio, TV (movies), web pages… conversations and meetings to come

Paperless throughout 2002. 12” scanned, 12’ discarded. Only 30 GB!!!

Capture and encoding

I mean everything

50+ year old newspaper clippings

400 year old books

O(100s) tapes from videotape “black hole”

Personal LifeLog Applications

[Chart of LifeLog applications. Axes: "Application controlled by: Others / Self" and "Application used by: Others / Self". Applications shown: Conservator; Baby Book; Companion; Caretaker; Babysitter; Advisor; Mentor; Tutor; Autobiography; Photo Album; Personal Assistant; Diary/Journal; Biography; Medical Manager; Executor; Obituary; Assistant for Elderly; Personal Proxy; Parole Officer; Pers Flight Recorder; Meeting Prep; Captain’s Log; Trustee; Financial Manager.]

Why bother? ..some reasons

• Technology creates an opportunity, e.g. 1 TB disks
• Technology creates a need, e.g. jpg
• It will decay or disappear if you don’t save it
• To eliminate physical storage (paper, CDs…)
• It costs more (in time) to delete than it costs to store
• The mantra of the squirrel: “I may need it some day.”
• For posterity and nostalgia: “Maybe others will want it.”
• For memory enhancement & faster search (search your LifeBits rather than the web or your colleagues … a single source to look for “stuff I’ve seen”)
• Let content analysis and data mining discover trends and correlations in our lives…that even we don’t know.
• Aid to aging or failed memories

So you’ve got it – now what do you do with it?

“A record if it is to be useful … must be continuously extended, it must be stored, and above all it must be consulted”

“The difficulty seems to be, not so much that we publish unduly … but rather that publication has been extended far beyond our present ability to make real use of the record”

- Vannevar Bush

Trying to use my life bits

#1: Folders

One item. One place.

It worked for 1000s of years.

My docs and archive

[Folder hierarchy diagram. Recoverable labels: Self; Employer; X-Employer; Active Employer; Project; Library/file cab (repeated under each branch); <1995 Library/file cab; Business; Invests, family $s, & Legal; Personal, including Medical.]

Freedom from hierarchy

c:\my documents\talks\MyLifeBits.ppt

• ID=location=organization=display string
• Don’t make me invent unique names
• Don’t make me file everything
• Or let me pick multiple folders

“multiple categorization not only improves organization and retrieval times but also matches more closely with the way users naturally think about organizing their information” - Quan et al (MIT’s Haystack)
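The request to "pick multiple folders" amounts to a many-to-many relation between items and collections, instead of one path per item. A minimal in-memory sketch follows; the `Collections` class and the collection names are hypothetical and are not taken from MyLifeBits.

```python
from collections import defaultdict


class Collections:
    """Many-to-many membership: an item may sit in any number of collections."""

    def __init__(self):
        self._members = defaultdict(set)     # collection name -> item ids
        self._containing = defaultdict(set)  # item id -> collection names

    def add(self, item_id, *collections):
        """File one item under every collection named."""
        for name in collections:
            self._members[name].add(item_id)
            self._containing[item_id].add(name)

    def items_in(self, collection):
        return sorted(self._members.get(collection, set()))

    def collections_of(self, item_id):
        return sorted(self._containing.get(item_id, set()))


# The item keeps one identity while appearing in three places.
shelf = Collections()
shelf.add("MyLifeBits.ppt", "talks", "SIGIR 2004", "MyLifeBits project")
print(shelf.items_in("talks"))
print(shelf.collections_of("MyLifeBits.ppt"))
```

Because the item id is independent of any collection, nothing has to be renamed or duplicated when it is filed a second time, and an item filed nowhere is still a valid item.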

MyLifeBits collection dialog

Of course, aliases and shortcuts can be used, albeit painfully, to file by time and/or event, subject, location, or type.

Trying to use my life bits

#2: Text annotations

Making bits more valuable and retrievable.

“It’s just bits until it is annotated”

Getting the user to tell a story is the ultimate in media value

• A story is a “layout” in time and space
• Most valuable content (by selection, and by being well annotated)
• Stories must include links to any media they use (for future navigation/search: “transclusion”).
• Cf: MovieMaker; Creative Memories PhotoAlbums

Dapeng was an intern at BARC for the summer of 2000

We took him to lunch at our favorite Dim Sum place to say farewell

At table L-R: Dapeng, Gordon, Tom, Jim, Don, Vicky, Patrick, Jim

Annotation like this…

VoiceAnnotation

Annotation when you feel like it, how you feel like it

Screensaver is the killer app!

Trying to use my life bits

#3: “I remember when…”

The 1st or 2nd most important retrieval handle.

MyLifeBits time overlap

MyLifeBits on-the-fly time clustering
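The slide does not say how MyLifeBits forms its time clusters. One common way to cluster items by time on the fly is to sort their timestamps and start a new cluster whenever the gap to the previous item exceeds a threshold. The sketch below shows that gap-based approach purely as an illustration; the function name and the one-hour default are assumptions.

```python
from datetime import datetime, timedelta


def cluster_by_gap(timestamps, max_gap=timedelta(hours=1)):
    """Group timestamps into clusters separated by gaps longer than max_gap."""
    clusters = []
    for ts in sorted(timestamps):
        if clusters and ts - clusters[-1][-1] <= max_gap:
            clusters[-1].append(ts)   # close enough to the previous item
        else:
            clusters.append([ts])     # gap too large: start a new cluster
    return clusters


# Made-up capture times: three photos in the morning, two in the afternoon.
shots = [
    datetime(2004, 7, 26, 9, 0),
    datetime(2004, 7, 26, 9, 10),
    datetime(2004, 7, 26, 9, 40),
    datetime(2004, 7, 26, 14, 0),
    datetime(2004, 7, 26, 14, 5),
]
clusters = cluster_by_gap(shots)
print([len(c) for c in clusters])
```

Since the threshold is just a parameter, the same pass can be rerun with a larger or smaller `max_gap` as the user zooms the time view out or in.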

MSR Next Media Team

Mark Stewart’s Lifeline (M Stewart Lifeline v2)

Copyright Mark Stewart, 2004

Trying to use my life bits

#4: Relationships (links)

Using something near “it” to find “it”.

Mark Stewart’s first page

Copyright Mark Stewart, 2004

The Stew family tree

Copyright Mark Stewart, 2004

PhotoFinder - Shneiderman and Kang

MyLifeBits Entities & Links

Annotates

Caller in Phone Call

Photo of Event

Transcludes
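The link types named on this slide can be modeled as typed edges between items, which is what makes "using something near it to find it" possible. The following is a minimal sketch under that assumption; `LinkStore`, the item ids, and the sample links are hypothetical and do not reflect the actual MyLifeBits schema.

```python
from collections import defaultdict


class LinkStore:
    """Typed links between items, navigable from either end."""

    def __init__(self):
        # item -> set of (link type, item at the other end, direction)
        self._links = defaultdict(set)

    def link(self, source, link_type, target):
        self._links[source].add((link_type, target, "out"))
        self._links[target].add((link_type, source, "in"))

    def near(self, item, link_type=None):
        """Items one link away, optionally restricted to one link type."""
        found = self._links.get(item, set())
        return sorted(
            other for kind, other, _ in found
            if link_type is None or kind == link_type
        )


# Made-up items, using link types from the slide.
store = LinkStore()
store.link("note-1", "annotates", "photo-7")
store.link("photo-7", "photo of event", "farewell lunch")
store.link("story-2", "transcludes", "photo-7")

print(store.near("photo-7"))
print(store.near("photo-7", "annotates"))
```

Starting from the event, one hop reaches the photo, and a second hop reaches the note and the story that use it, so remembering any one of them is enough to find the others.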

Trying to use my life bits

#5: I remember where

Just essential.

Trying to use my life bits

#6: more meta-data (properties)

I remember something about the content

(understanding a person’s work)

Lederberg Finder page

Dublin core of a given item

Trying to use my life bits

#7: classification

Moving toward the ultimate time sink.

Is traditional classification required?

…at OCLC there was unanimous agreement among faculty and participants that

“access to electronic resources requires controlled vocabulary and classification”

OCLC Institute, “Knowledge Access Management: Tools and Concepts for Next Generation Catalogers”, 17-19 November 1997, Dublin, Ohio.

www.alberteinstein.info

Professional Life:

Organizations

Administrivia

Projects

Library

Lederberg papers official reports

Number of document segments

Lederberg Artifact types Abstracts Agendas not Announcements m; Application forms Articles m Autobiographies m Bibliographies m Biographies m Brochures m Certificates m Correspondence m Diaries m Drafts (documents) Drawings m Electronic images m Essays m Eulogies Excerpts Grant proposals Interviews m Invitations

Laboratory notebooks m Laboratory notes Lecture notes Lectures m Legal documents m Legislative records Lists Manifestoes Memoirs m Minutes Monographs m Narratives Newsletters Newspaper columns m Notebooks m Notes Obituaries Official reports Oral histories m Petitions Photographic prints m

Press releases m Procedures Proceedings m Programs m Proposals m Questionnaires Reminiscences Reports m Resolutions Resumes Reviews m School records Speeches m Summaries Tables (documents) Technical reports m Transcripts m Typescripts Video recordings m

Species: Animals: Chordata: Vertebrata: bony fish

Computer structures: digital computer: minicomputer

Computer structures: digital computer: minicomputer (refined: Digital Equipment Corp.)

Computer structures taxonomy: computers

Trying to use my life bits

#8: “ontology”???

“Succumbing to the ‘ontology’ fallacy” - Bates

[Diagram of a personal “ontology”. Recoverable labels, in source order, with note numbers in parentheses: Company (1); Employer (2); Non-profit (3); Library (4); Health; Legal; Organizations; Academic Inst. (2); Financial Assets; Family & related social; Ancestors, Parents, Siblings; Media; Artifacts; Comm.; Library & archives: info & records; Personal archives (Ambiance…); Children; Spouse/Significant Other; Friends; Articles, bio, books, interviews, talks, … web pages; Auto, home & other “things”; Property; Diaries; Family Business (2); Self Family ($, property, legal, health) potentially private…; Institution type: academic, … companies, family, other Orgs … self.]

Notes:

1. Generic organization: Correspondence, financial, manuals, notebooks, org chart, plans, products, stocks, etc. Facets: doc type, dissemination, institution type
2. Generic org. plus projects x roles; facets: financial; legal
3. Generic organization for club, foundation, museum, professional org, religious, sport, etc.
4. Books, CDs, papers, videos. Facets: media type,

MyLifeBits: Some Lives(t)

Personal
• Parents, children, grandkids
• CGB himself
• GKB
• SSF
• Close friends

GB $s; Legal entities
• Personal incl. several legal structures
• Properties: autos, real estate,
• Investments & contracts

Past prof. companies/organiz’ns
• DEC
• Carnegie-Mellon U.
• DEC, NSF, Encore, Ardent, Me Inc., Bell-Mason

Bell-Mason Director
• Diamond & Vanguard Brds.
• Startups & boards

CGB @ Microsoft
• MLB
• Clusters
• Telepresence
• WWW presence

Computer History Museum
• BOD member
• Fund-raising
• CyberMuseum

GB Timeline

[Timeline figure: decade axis from 1900 to 2010, with a row of markers labeled F, E, W, and O.]

Roles & Institutions

I <am son of> ….

I <am father of> Brigham <1960->, Laura <1963->

I <studied at> MIT <1952-1957; 1959-1960>

I <worked for> DEC <1960-1966; 1972-1983>

I <am a member of> ACM <1960- ->… NAE

I <am on the board of> Computer Museum…
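Each statement above is a relation between "I" and an entity, qualified by one or more time intervals. One way to hold such statements is a list of records with a start year and an optional end year, which makes a question like "which roles held in a given year?" a simple filter. The sketch below assumes that representation; the `Role` type and `roles_in` helper are illustrative names, and only statements with dates given on the slide are encoded.

```python
from typing import NamedTuple, Optional


class Role(NamedTuple):
    relation: str
    entity: str
    start: int
    end: Optional[int]  # None means the role is still ongoing


# Statements from the slide; multi-interval roles become one record per interval.
ROLES = [
    Role("am father of", "Brigham", 1960, None),
    Role("am father of", "Laura", 1963, None),
    Role("studied at", "MIT", 1952, 1957),
    Role("studied at", "MIT", 1959, 1960),
    Role("worked for", "DEC", 1960, 1966),
    Role("worked for", "DEC", 1972, 1983),
    Role("am a member of", "ACM", 1960, None),
]


def roles_in(year, roles=ROLES):
    """Return (relation, entity) pairs whose interval includes `year`."""
    return [
        (r.relation, r.entity)
        for r in roles
        if r.start <= year and (r.end is None or year <= r.end)
    ]


print(roles_in(1955))
print(roles_in(1975))
```

Keeping the interval on the relation, not on the entity, is what lets the same institution appear twice (DEC in 1960-1966 and again in 1972-1983) without any special casing.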

Things

Can everything be part of the model?

• Pets
• Houses
• Cars
• Assets

Trying to use my life bits

#9: logging & reports

Interface to xls

TV Usage

MyLifeBits Log of a video file

Open Problems

The “dear appy” problem

Dear Appy,
How committed are you?
Please come back to me.
Forever yours truly,
Lost and forgotten data

Who’s responsible?

• Media or 8 track cassette, 8” floppy
• Evolving platform, file, and database
• Evolving, incompatible standards & formats for legacy data that disregard ancestors
• Evolving and/or disappearing apps

A Storocratic Oath

1. Do no harm to dates (File creation, Photo taken)

2. Do no harm to device created & other meta-data.

• Camera data & location data are sacred.

3. Support & aid the creation of critical meta-data.

• When/how the user feels like it
• Auto-magically!

4. Maintain user confidentiality
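As a small illustration of the first rule, a tool that copies files into an archive can at least carry the modification time across instead of stamping the copy with the current time. The sketch below uses Python's standard `shutil.copy2`, which attempts to preserve file metadata; the `archive_copy` helper and the file names are hypothetical. Note that creation time and other platform-specific metadata may still not be carried over, so this covers only part of the oath.

```python
import os
import shutil
import tempfile


def archive_copy(src, dst_dir):
    """Copy src into dst_dir, keeping its modification time where possible."""
    os.makedirs(dst_dir, exist_ok=True)
    dst = os.path.join(dst_dir, os.path.basename(src))
    shutil.copy2(src, dst)  # copies content, then attempts to copy metadata
    return dst


with tempfile.TemporaryDirectory() as workdir:
    # Create a sample file and backdate it to a known modification time.
    original = os.path.join(workdir, "photo.jpg")
    with open(original, "wb") as f:
        f.write(b"\xff\xd8 sample bytes")
    taken = 1_000_000_000  # seconds since the epoch (September 2001)
    os.utime(original, (taken, taken))

    copy = archive_copy(original, os.path.join(workdir, "archive"))
    preserved = abs(os.path.getmtime(copy) - taken) < 1
    print("modification time preserved:", preserved)
```

A plain `shutil.copy` or a read-then-write loop would leave the copy dated at the moment of copying, which is exactly the kind of harm to dates the oath warns against.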

Classification wish list

• Download classifications rather than build them
• Definitions & synonyms should help find what I want
• Today it is too expensive to manually classify my scanned paper. E.g. “right time” meta-data is critical!
• Next year I hope “the system” can classify my papers
• In 10 years I expect all documents to appear electronically & classified with a little help from me

Personal Search is not Professional or Web search

• System sees every entry & access
• Everything, not just a professional life
• Limited to SIS, not an infinite amount, covers a profession & personal life

Web as seen by search engines

MyLifeBits

Knowledge breadth e.g. Dewey classification

Depth e.g. information item types & coverage

Professional user

The killer app??

Input, File, Classify, and Find…

Observe every action…

Operational

• SIS (e.g. msg, name, paper, fact, birthday, phone call,
• Time & motion (routing, communicating, scheduling … thinking)

Archival one’s self

• Finder aka Table of Contents aka Site Map
• Story telling.
• Screen saver & personal ambience

The A/V/real time data

Future: new capture modes/devices

SenseCam

Deja View

Body Media

Quindi

SenseCam & interactive jewellery

www.MyLifeBits.com
