View
216
Download
0
Category
Tags:
Preview:
Citation preview
Challenges in UsingLifetime Personal Information
Storesbased on MyLifeBits
Gordon BellGordon BellAlpbach Forum 26 August 2004Alpbach Forum 26 August 2004
The 1 TB Life
1TB gives you 65+ years of:1TB gives you 65+ years of: 100 email messages a day (5KB each)100 email messages a day (5KB each) 100 web pages day (50KB each)100 web pages day (50KB each) 5 scanned pages a day (100KB each)5 scanned pages a day (100KB each) 1 book every 10 days (1 MB each)1 book every 10 days (1 MB each) 10 photos per day (400 KB JPEG each)10 photos per day (400 KB JPEG each) 8 hours per day of sound - e.g. telephone,8 hours per day of sound - e.g. telephone,
voice annotations, and meeting recordings (8 Kb/s)voice annotations, and meeting recordings (8 Kb/s) 1 new music CD every 10 days (45 min each at 128 Kb/s)1 new music CD every 10 days (45 min each at 128 Kb/s)
It will take you 5 years to fill up your 80 GB driveIt will take you 5 years to fill up your 80 GB drive Want video? Buy more cheap drives (1 TB/year lets Want video? Buy more cheap drives (1 TB/year lets
you record 4 hours/day of 1.5 Mb/s video)you record 4 hours/day of 1.5 Mb/s video)
Everything goes in a database
You need all the features of a databaseYou need all the features of a database(Consistency, Indexing, Pivoting, Queries, Speed/scalability, Backup, (Consistency, Indexing, Pivoting, Queries, Speed/scalability, Backup, replication)replication)
If you don’t use one, you will find yourself creating one!If you don’t use one, you will find yourself creating one! Files as blobs, also sync with file system for legacy appsFiles as blobs, also sync with file system for legacy apps
SQLSQL
MyLifeBits Software
MyLifeBits store
database
Voice Voice annotation annotation tooltool
Text Text annotation annotation tooltool
Telephone Telephone capture toolcapture tool
TV capture TV capture tooltool
TV EPG TV EPG download download tooltool
Radio Radio capture capture & EPG& EPG
PocketPC PocketPC transfer transfer tooltool
PocketRadio PocketRadio playerplayer
Import filesImport files
MyLifeBits MyLifeBits ShellShell
files
Legacy Legacy applicationsapplications
Browser Browser tooltool
InternetInternet
IM captureIM capture
MAPI MAPI interfaceinterface
Legacy Legacy email clientemail client
GPS import & GPS import & Map displayMap display
SenseCamSenseCam
Screen saverScreen saver
MemexAs We May Think, Vannevar Bush, 1945
““A memex is a device in which an individual stores all A memex is a device in which an individual stores all his books, records, and communications, and which his books, records, and communications, and which is mechanized so that it may be consulted with is mechanized so that it may be consulted with exceeding speed and flexibility”exceeding speed and flexibility”
Full-text search, text & audio annotations, and Full-text search, text & audio annotations, and hyperlinkshyperlinks
I am data
Capture and encoding
I mean everything
Personal Search is notProfessional or Web search
System sees every entry & accessSystem sees every entry & accessEverything, not just a professional life Everything, not just a professional life Limited to SIS, not an infinite amount, Limited to SIS, not an infinite amount,
covers a profession & personal lifecovers a profession & personal life
Web as seen by search engines
MyLifeBits
Knowledge breadth e.g. Dewey classification
Depth e.g. information item types & coverage
Professional user
Why bother? ..some reasons Technologist: “we can” an opportunity e.g. 1 TB disksTechnologist: “we can” an opportunity e.g. 1 TB disks For all of us with new media: a need e.g. jpg. Mp3For all of us with new media: a need e.g. jpg. Mp3 Environmentalist: eliminates “atoms” (paper, CDs…)Environmentalist: eliminates “atoms” (paper, CDs…) For business--memory enhancement & faster search:For business--memory enhancement & faster search:
Let content analysis and data mining discover trends Let content analysis and data mining discover trends and correlations in our lives…that even we don’t know.and correlations in our lives…that even we don’t know.
Business: It costs more to delete than it costs to storeBusiness: It costs more to delete than it costs to store Preservationist: decays or disappears unless its savedPreservationist: decays or disappears unless its saved For the human pack rat: “I may need it some day.” For the human pack rat: “I may need it some day.” For posterity and nostalgia: “Maybe others will want it.”For posterity and nostalgia: “Maybe others will want it.” Stories and ambience: basis for creating contentStories and ambience: basis for creating content For the aging & failed memory: surrogate memoryFor the aging & failed memory: surrogate memory
Using my life bits: beyond folders
#1: Folders#1: Folders
One item. One place.One item. One place.
It worked for 1000s of years.It worked for 1000s of years.
My docs and archive
S
Self
EE
X- Employer
EmployerEmployer
X-EmployerProjectProject
ProjectProject
Employer
Library/file cab
Library/file cab
Library/file cab
Library/file cab
Library/file cab
Library/file cab
Active Employer
Library/file cab
Library/file cabLibrary/file cab
<1995 Library/file cabLibrary/file cab
Project
BusinessInvests, family $s, & Legal
Personal, including Medical
Library/file cab
Freedom from hierarchyc:\my documents\talks\MyLifeBits.pptc:\my documents\talks\MyLifeBits.ppt
ID=location=organization=display stringID=location=organization=display stringDon’t make me invent unique namesDon’t make me invent unique namesDon’t make me file everythingDon’t make me file everythingOr let me pick multiple foldersOr let me pick multiple folders
“ “multiple categorization not only improves multiple categorization not only improves organization and retrieval times but also organization and retrieval times but also matches more closely with the way users matches more closely with the way users naturally think about organizing their naturally think about organizing their information” – Quan et al (MIT’s Haystack)information” – Quan et al (MIT’s Haystack)
MyLifeBits collection dialog
Of course Aliases and Shortcuts can be used albeit painfully to file by time and/or event, subject, location, type.
Using my life bits: easily adding valuable content
#2: Text annotations#2: Text annotations
Making bits more valuable and retrievable. Making bits more valuable and retrievable.
“Its just bits until it is annotated”
Getting the user to tell a story is the ultimate in media value
A story is a “layout” in time and spaceA story is a “layout” in time and space Most valuable content (by selection, and by being well annotated)Most valuable content (by selection, and by being well annotated) Stories must include links to any media they use (for future navigation/search – Stories must include links to any media they use (for future navigation/search –
“transclusion”).“transclusion”). Cf: MovieMaker; Creative Memories PhotoAlbumsCf: MovieMaker; Creative Memories PhotoAlbums
Dapeng was an Dapeng was an intern at BARC intern at BARC for the summer for the summer of 2000of 2000
We took him to We took him to lunch at our lunch at our favorite Dim Sum favorite Dim Sum place to say place to say farewellfarewell
At table L-R: Dapeng, Gordon, Tom, Jim, Don, At table L-R: Dapeng, Gordon, Tom, Jim, Don, Vicky, Patrick, JimVicky, Patrick, Jim
Using my life bits:the value of time & time posts
#3: “I remember when…”#3: “I remember when…”
The 1The 1stst or 2 or 2ndnd most important retrieval handle. most important retrieval handle.
190019101920193019401950196019701980199020002010
Chester Bell
Lola Bell
Gordon Bell
Sharon (Smith)
Kirksville, MO
M.I.T.
U. of N.S.W.
Gwen Druyor Bell
Brigham (son)
Fiona Bell
Bridget Bell
Laura (daughter)
Kolbe Schultz
Stryker Schultz
Sheridan Forbes
M.I.T. Speech Lab
Digital (DEC)
CMU
Encore
NSF
Ardent
Bell Ltd.
Microsoft Res.
Computer Museum
F: father
F: mother
F: self
F: Sister
Education
Education
Education
F: spouse
F: son
F: grandChild
F: grandChild
F: daughter
F: grandchild
F: grandchild
F: Significant Other
W/Education
Work
Work
Work
Work
Work
Work
Work
Organization
M Stewart Lifeline v2Mark Stewart’s Lifeline
Copyright Mark Stewart, 2004
MSR Next Media Team
Using my life bits:Where, an essential attribute
#4: I remember where#4: I remember where
Just essentialJust essential..
Using my life bits: pivoting on data to aid recall
#5: Relationships (links)#5: Relationships (links)
Using something near ‘it”, to find “it”.Using something near ‘it”, to find “it”.
MyLifeBits Entities & Links
AnnotatesAnnotates
Caller in Phone CallCaller in Phone Call
Photo of EventPhoto of Event
TranscludesTranscludes
PhotoFinder - Schneiderman and Kang
Using my life bits:never enough meta-data …
but, can you afford it?b
#6: more meta-data (properties)#6: more meta-data (properties)
I remember something about the contentI remember something about the content
(understanding a person’s work)(understanding a person’s work)
Lederberg Finder page
Dublin core of a given item
Using my life bits:classification of everything
#7: classification#7: classification
Is any gain from non-automated classification Is any gain from non-automated classification worth the cost and pain?worth the cost and pain?
Is traditional classification required?
……at OCLC there was unanimous agreement at OCLC there was unanimous agreement among faculty and participants thatamong faculty and participants that
“access to electronic resources “access to electronic resources requires controlled vocabulary and requires controlled vocabulary and classification”classification”
OCLC Institute, “Knowledge Access Management: Tools OCLC Institute, “Knowledge Access Management: Tools and Concepts for Next Generation Catalogers”, 17-19 and Concepts for Next Generation Catalogers”, 17-19 November 1997, Dublin, Ohio.November 1997, Dublin, Ohio.
www.alberteinstein.info
Professional Life:
Organizations
Administrivia
Projects
Library
Lederberg Artifact types Abstracts Agendas not Announcements m; Application forms Articles m Autobiographies m Bibliographies m Biographies m Brochures m Certificates m Correspondence m Diaries m Drafts (documents) Drawings m Electronic images m Essays m Eulogies Excerpts Grant proposals Interviews m Invitations
Laboratory notebooks m Laboratory notes Lecture notes Lectures m Legal documents m Legislative records Lists Manifestoes Memoirs m Minutes Monographs m Narratives Newsletters Newspaper columns m Notebooks m Notes Obituaries Official reports Oral histories m Petitions Photographic prints m
Press releases mProcedures Proceedings mPrograms mProposals mQuestionnaires Reminiscences Reports mResolutions Resumes Reviews mSchool records Speeches mSummaries Tables (documents) Technical reports mTranscripts mTypescripts Video recordings m
Species: Animals: Chordata: Vertebrata: bony fish
Computer structures: digital computer: minicomputer
Classification wish list Download classifications rather than build themDownload classifications rather than build them Definitions & synonyms should help find what I wantDefinitions & synonyms should help find what I want Today it is too expensive to manually classify my Today it is too expensive to manually classify my
scanned paper. E.g. “right time” meta-data is critical!scanned paper. E.g. “right time” meta-data is critical! Next year we hope “the system” can classify papers Next year we hope “the system” can classify papers
and other documents e.g. billsand other documents e.g. bills In 10 years we expect all documents to appear In 10 years we expect all documents to appear
electronically & classified electronically & classified with a little help from mewith a little help from me
Using my life bits:Ontologies…
useful? or fool’s errand?#8: “ontology”???#8: “ontology”???
““Succumbing to the ‘ontology’ fallacy”Succumbing to the ‘ontology’ fallacy”-Bates-Bates
Company1
1. Generic organization: Correspondence, financial, manuals, notebooks, org chart, plans, products, stocks, etc.. Facets: doc type, dissemination, institution type
2. Generic org. plus projects x roles; facets: financial; legal3. Generic organization for club, foundation, museum,
professional org, religious, sport, etc.4. Books, CDs, papers, videos Facets: media type
Employer2
Non-profit3
Library4
HealthLegal
Organizations
Academic Inst.2
Financial Assets
Family & related social
Ancestors, Parents,Siblings
Media
ArtifactsComm.
Library & archives: info & records.Personal archives (Ambiance…)
ChildrenSpouse |
Significant Other
Friends
Articles, bio, books,interviews, talks,
…web pages
Auto, home& other “things”
Property
Diaries
Family Business2
SelfFamily ($,property, legal, health)
potentially private…
Institution type: academic,… companies, family, other organizations…selfvs. complex contact??
Using my life bits:Providing insight, including…Where did I spend my time? What has been by output?
#9: logging & reports#9: logging & reports
Interface to xls
TV Usage
MyLifeBits Log of a video file
Using my life bits:Recording everything!
#10: CARPE#10: CARPEContinuous archival recording of Continuous archival recording of
personal experiencespersonal experiences
The A/V/real time data Future: new capture modes/devices
SenseCam
Deja View
Body Media
Quindi
www.joshgemmell.com
Open Problems
The Agenda for the Tbyte(s), Lifetime, PC:The killer app after office and mail.searching
1.1. Guarantee that data will live forever! “dear appy” problemGuarantee that data will live forever! “dear appy” problem2.2. Cheap, easy, and data-rich (e.g. time, place) capture:Cheap, easy, and data-rich (e.g. time, place) capture:
GPS and time everywhereGPS and time everywherePaper capture has to be as easy as discarding (scanner/shredder)Paper capture has to be as easy as discarding (scanner/shredder)Personal meeting capture...perhaps by the roomPersonal meeting capture...perhaps by the roomE-book…e-magazines & journals need to have critical mass! E-book…e-magazines & journals need to have critical mass! Telephony and audio capture with indexing (telephonic speech-to-text needed)Telephony and audio capture with indexing (telephonic speech-to-text needed)Media Center compatible for entertainment (photos, video, TV, radio)Media Center compatible for entertainment (photos, video, TV, radio)
3.3. Content analysis (critical for photo & video!); doable for text. Content analysis (critical for photo & video!); doable for text. 4.4. Information control: privacy, security, expunge/deniability,…Information control: privacy, security, expunge/deniability,… 5.5. Having to be schizophrenic or have a lobotomy when leaving a “life” or Having to be schizophrenic or have a lobotomy when leaving a “life” or
being a part of some other person’s life recordingbeing a part of some other person’s life recording6.6. One One dbase for everything (articles, books, conversations, ... financial dbase for everything (articles, books, conversations, ... financial
transactions) …vs. long-term use of hierarchical files. transactions) …vs. long-term use of hierarchical files. Is dbase intuitive?Is dbase intuitive?7.7. Annotations/meta-information add every-increasing value at high cost!Annotations/meta-information add every-increasing value at high cost!
Easy annotation for aiding search and Easy annotation for aiding search and it becomes the contentit becomes the content8.8. Other “killer apps”: Alzheimer, immortality, surrogate memory?Other “killer apps”: Alzheimer, immortality, surrogate memory?9.9. GUI’s to improve use (e.g. time to learn, use, aid in retention)GUI’s to improve use (e.g. time to learn, use, aid in retention)
www.MyLifeBits.com
Recommended