Upload
lisa-henry
View
213
Download
0
Embed Size (px)
Citation preview
of 49
lecture 18: tagging and folksonomy
of 49ece 627, winter ‘13 2
metadata
is:“structured information that describes, explains, locates, or otherwise makes it easier to retrieve, use, or manage an information resource”
(NISO)
it allows systems to collocate related information, and helps users find relevant information
of 49ece 627, winter ‘13 3
metadataways of creation
generally in two ways:
professional creation (professionals working with complex, detailed rule sets and vocabularies)
author creation (authors of documents provide metadata along with their creations)
are ontologies the result of that???
of 49ece 627, winter ‘13 4
metadatathird (new) way
user- created metadata
users of the documents and media create metadata for their own individual use that is also shared throughout a community
of 49ece 627, winter ‘13 5
folksonomywhat is it?
it is a people's taxonomy
is composed of terms in a flat namespacethere is no hierarchy, no parent- child or sibling relationships between these terms
of 49ece 627, winter ‘13 6
folksonomywhat is it?
the set of terms (called tags) that a group of users tagged content with, they are not a predetermined set of classification terms or labels
of 49ece 627, winter ‘13 7
folksonomy…
the cumulative force of all the individual tags can produce a bottom-up, self-organized system for classifying items on the web
of 49ece 627, winter ‘13 8
what is tagging?introduction
a tag is a non-hierarchical keyword or term assigned to a piece of information - such as an internet bookmark, digital image, or computer file
(Wikipedia)
tagging – to mark with a tag; to label, identify, or recognize with or as if with a tag- a unique and powerful way of organizing information
of 49ece 627, winter ‘13 9
what is tagging?tagging system
three “components”-users-resources-tags
of 49ece 627, winter ‘13 10
tagging systemusers
the people who employ a tagging system (sometimes also called taggers) – they create the tags, and sometimes they add resources
have a variety of different interests, needs, goals, and motivations – but they are trying to achieve some larger goal – such as sharing a photo or labeling a document so they can find it later
of 49ece 627, winter ‘13 11
tagging systemresources
are items that users tag
a resource can be just about anything – a book, a Web page, a video, or even a location
within each tagging system, resources often share some common properties – they are books, or photos, or …
of 49ece 627, winter ‘13 12
tagging systemtags
the keywords added by users are tags
can be just about any kind of term, they can be descriptions of the resource’s subject matter, its location, its intended user, a reminder, or something else entirely – can be individual words or phrases
tags are essentially metadata about the resource
of 49ece 627, winter ‘13 13
tagging systemtags
tags are more than just metadata in an application – they are a tool people use to track, share, and find information
of 49ece 627, winter ‘13 14
tagging systemtag cloud
is a method of presenting tags where the more frequently used tags are emphesized
of 49ece 627, winter ‘13 15
tagging systemtag cloud – example
www.wordle.net
of 49ece 627, winter ‘13 16
tagging system…
all tagging happens in the context of a system, and the system defines what kind of tagging can take place
for example, the system may allow users to add their own resources or not, may allow to tag any resource or not, may forbid certain kinds of tags
of 49ece 627, winter ‘13 17
tagging systemperspectives …
tagging sitsat the intersection of three establishedfields
Social Software
Personal Information Management
InformationArchitecture
tagging
of 49ece 627, winter ‘13 18
tagging systeminformation architecture
the structural design of shared information environments andthe art and science of organizing and labeling web sites, intranet, online communities, and software to support usability and findability
information architects focus on using controlled vocabularies, search-and-browse systems
of 49ece 627, winter ‘13 19
tagging systemsocial software
applications that people use to communicate, collaborate, and share online
people who design social software are interested in facilitating group interaction within the system
of 49ece 627, winter ‘13 20
tagging systempersonal information management
“refers to the practice and study of the activities people perform in order to acquire, organize, maintain, retrieve, and use information items such as documents, web pages, e-mail messages …”
(Wikipedia)
they are programs for managing information and methods for keeping yourself on track – help you file, track, and find your information when you need it
of 49ece 627, winter ‘13 21
tagging systemtensions …
personal <-> social
do people tag primary for their own benefit?or are they motivated to share information with a group …?
of 49ece 627, winter ‘13 22
tagging systemtensions …
idiosyncratic<-> standard
should tags be unique?or should be standardized so they can be used for browsing and searching?
of 49ece 627, winter ‘13 23
tagging systemtensions …
freedom<-> control
does the system give users complete freedom?or does it influence or control their tags?
of 49ece 627, winter ‘13 24
tagging systemtensions …
amateur<-> expert
how qualified are the people who do tagging?should tags contributed by amateurs count as much as tags created by experts?
of 49ece 627, winter ‘13 25
tagging why matters
it is popularit is multifaceted it is flexibleit is also made for the stream – the constant flow of information we experience online
of 49ece 627, winter ‘13 26
tagging motivation
ease of use-tags are simple
just typing few words-tags are flexible
tags can be whatever you need them to be
-tags are extensibleyou can always add new tags
-tags can be aggregated
… can be messy and may not conform to any recognizable pattern
of 49ece 627, winter ‘13 27
tagging motivation
managing personal information- do not need to consider the whole
categorization scheme, you just add tags- you can add any tags, instead of finding the
one category that is the best fit- re-categorization is easy if we make a mistake
of 49ece 627, winter ‘13 28
tagging motivation
collaborating and sharing- you can explore topics using the tags of other
users- other users may be experts- you may use tags to connect with other users
who share interests
having fun
expressing yourself
of 49ece 627, winter ‘13 29
tagging system architecture
requires to set up rules about your users (who they are and how they join the system), your resources (how they are added to the system), and tags (who can tag which resources)
how users interact with each other
of 49ece 627, winter ‘13 30
tags as metadatakinds of metadata
metadata:-helps you (or others) find data you want-helps you manage your data-lets you relate your data to other data you own, as well as other data out there in the world
of 49ece 627, winter ‘13 31
tags as metadatakinds of metadata
descriptive – provide details about the resourceadministrative – used to manage a collection of resources (for example, date a resource was acquired, the person who owns the rights to the resource)structural – used to associate the resource with other resources (for example, volume of books, maps of how individual files relate to each other)
of 49ece 627, winter ‘13 32
tags as metadatakinds of metadata
tag type exampledescriptive webdesign, drama, sushi
gardening, musicresource blog, book, video, photoownership/source nytimes, genesmith (author)opinion cool, funny, lameself-reference mystuff, minetask organizing todo, workplay/performance helo3, poetry
of 49ece 627, winter ‘13 33
tags …taxonomies and controlled vocabularies
two kinds of classification systems – define relationships between terms
help us understand and navigate concepts by making language less ambiguous, by connecting concepts, and by capturing the relationships between objects observed in the real world
of 49ece 627, winter ‘13 34
tags …controlled vocabularies
a system for managing the meaning of words – it removes ambiguity of language
synonym rings – give two or more words an equivalent meaningauthority files – as above but one of the words is identified as a preferred term
of 49ece 627, winter ‘13 35
tags …taxonomies
establishes parent-child relationships between terms, are typically hierarchical
of 49ece 627, winter ‘13 36
tags …enriching taxonomy with tags
bubble-up approachtags are attached to a resource, for example, a songthose tags are “bubble-up” from several songs to describe their parent item, albumalbum tags are then bubbled up again to describe the artist
relationships between resources are preserved while capturing the descriptive terms of users
of 49ece 627, winter ‘13 37
folksonomyintroduction
it is a term used to describe the bottom-up classification systems that emerge from social tagging
of 49ece 627, winter ‘13 38
folksonomyintroduction
the relationships between tags are inferred based on their usage patterns
no formal relationships parent-child like in taxonomyno equivalences between terms as in a controlled vocabulary
of 49ece 627, winter ‘13 39
folksonomyintroduction
ajax
webdesign
css HIV
cxcr4
ccr5
of 49ece 627, winter ‘13 40
folksonomy- independence
users are free to choose their tags
some systems offer suggestions – a tool aimed to help users add tags more easily and efficiently
of 49ece 627, winter ‘13 41
folksonomy- aggregation
pulling all the tags together in an automated way – this creates folksonomy
manual sampling of tags, few users – not a folksonomy
– based on users’ activities and interests
of 49ece 627, winter ‘13 42
folksonomy- inference
relationships between tags are inferred from their use
they are based on the language and usage patterns of real users
of 49ece 627, winter ‘13 43
folksonomy- methods to infer semantic relationships
-counting tags to see which is most popular-co-occurrence counts which tags are used together (loose approximation of the associative relationships)-clustering of tags that have a high probability of co-occurence
of 49ece 627, winter ‘13 44
folksonomywhen to use
-nomenclature is uncertain or evolving-dynamic information space-semantic relationships are not critical-multiple viewpoints are desirable-you can tap in an active base of users
of 49ece 627, winter ‘13 45
from folksonomy to ontologysuper-class relationships
tags that co-occur with other tags often are thought to be more general than more specific-tags that co-occur with other tags less often
of 49ece 627, winter ‘13 46
from folksonomy to ontologysuper-class relationships
for example"music" co-occurs with both "piano" and "guitar", and as such can be suspected being a super-class of bothon the other hand, "piano" probably does not co-occur with more possible tags than "music" and usually co-occurs with "music" and so it likely is a subclass
of 49ece 627, winter ‘13 47
from folksonomy to ontologysynonym relationships
detecting synonyms is actually counter-intuitive, since I believe that the same user will not tag a URI both "computer" and "PC," but will probably only pick one of those
however, groups of users will use different synonyms, and over time most of the convergence will come from synonyms being merged.
of 49ece 627, winter ‘13 48
from folksonomy to ontologystructured relationships
tags that co-occur often might have a facet, or structured relationshipthese may be pairs or trids
of 49ece 627, winter ‘13 49
from folksonomy to ontologystructured relationships
for example"book" and "author" and "Mark Twain" is a triadic ("triple" on the Semantic Web) relationship, and if these co-occur quite often they are probably a relationshipin fact, one would suspect that most co-occurences are pairs, like "author" and "Zadie Smith," or "book" and "Mark Twain," and making these work with the Semantic Web would be slightly more difficult