44
IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web http://musicspace.mspace.fm

IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

Embed Size (px)

Citation preview

Page 1: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

IAML Annual Study Weekend12 April 2010

musicSpace: Music and the Semantic Webhttp://musicspace.mspace.fm

Page 2: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

2

Page 3: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

Contents

1. Motivation

2. Solutions

3. Evaluation

4. The Future

3

Page 4: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

1. Motivation (the problem we’re addressing)

4

Page 5: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

5

Centuries of material...

Page 6: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

6

... is now increasingly digitised

Page 7: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

Yet, with exceptions like Copac, ...

7

Page 8: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

... data is typically siloed.

8

Page 9: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

... data is typically siloed.

Geographical dispersal has been replaced by a virtual dispersal on the web. Data is now segregated into a plethora of online resources by: – Media type (text, image, audio,

video)– Date of creation/publication– Subject

9

Page 10: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

... data is typically siloed.

Geographical dispersal has been replaced by a virtual dispersal on the web. Data is now segregated into a plethora of online resources by: – Language– Copyright holder– Ad hoc/insecure nature of project

funding

10

Page 11: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

... data is typically siloed.

Crucially:

Data sources are not usually interoperable.

Data and metadata from one source cannot be used (automatically) as the basis for a query of another source

11

Page 12: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

12

The Google/textbox search paradigm is limiting.

Page 13: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

13

You’d better know what you want!

Page 14: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

14

What tickles your fancy?

Page 15: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

So, interacting with current data resources present barriers

at all stages of the research process:

15

Page 16: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

16

Inchoate ideas Hum… something on Monteverdi’s madrigals?

Specific complex questions Which scribes have created manuscripts of

Monteverdi’s works, and which other composers’ works have they inscribed?

What recording of works by Cage exist, which performers have recorded a particular work by Cage, and what else have they recorded?

Page 17: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

17

Which scribes have created manuscripts of Monteverdi’s works, and which other composers’ works have they inscribed?

We’d use RISM for this question.

1. Execute a ‘People’ search for ‘Monteverdi’.

2. Manually filter out results returned where the composer’s name appears in reference to a role other than that of composer (although for Monteverdi this isn’t likely to take long).

3. Examine the remaining records to identify the scribe in each case.

Page 18: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

18

Which scribes have created manuscripts of Monteverdi’s works, and which other composers’ works have they inscribed?

4. Execute a ‘People’ search for each scribe of interest.

5. Manually filter out records where the scribe is named in reference to role other than that of scribe (e.g. ‘former owner’, ‘composer’ etc.).

6. Collate a list of other composers whose works the scribes have inscribed.

Page 19: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

19

What recording of works by Cage exist, which performers have recorded a

particular work by Cage, and what else have they recorded?

We’d use BLSA, Copac and Naxos.

1. Search for recordings where the composer is ‘Cage’ in: i. BLSA

ii. Copac

iii. Naxos

(Each requiring a different search formulation)

2. Collate results and make a list of performers.

Page 20: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

20

What recording of works by Cage exist, which performers have recorded a

particular work by Cage, and what else have they recorded?

3. Search Naxos for recordings where these performers are given as the ‘performer’ and ‘Cage’ is given as the ‘composer’.

4. Search BLSA and Copac for records that name these performers and Cage, and then manually filter out results where names do not occur in relation the appropriate role.

5. Manually collate repertoire lists for each Cage performer.

Page 21: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

21

The barriers to tractability and their solutions

Need to consult several sources … and metadata from one source cannot guide searches of another source.

Insufficient granularity of data and/or search option.

Multi-part queries have to be broken down and results collated manually.Pen and paper!

Solutions:

Integration

Increase granularity

Optimally interactive UI (‘mSpace’)

Page 22: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

2. Solutions

22

Page 23: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

Integration

23

Page 24: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

24

Rather than using many portals ...

Page 25: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

25

... what if you could use just one?

Page 26: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

26

Our partners use a variety of data formats

MARC-XML

MODS-XML

Custom MARC

Source-specific XML

Tables/CSV

We import these as RDF

Why RDF? 1.Standard format for the

Semantic Web.

2.It’s modular; we can add records and record fields without having to start from scratch.

3.RDF can be created using lots of different tools.

Page 27: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

Granularity

27

Page 28: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

28

Metadata hierarchy

We use a two-level hierarchy based on metadata type.

Person

Composer Scribe etc.PerformerAuthor

Crucially, our search UI exposes this hierarchy so that both broad and narrow searching is possible.

Page 29: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

29

Adding/exposing granularity

Where possible we add to/expose the granularity of the metadata.

Person ‘Immyns, John [scr]’.

Scribe ‘Immyns, John’.

Book ‘Come Death, I shall not fear thee ...’ with author ‘Monteverdi, Claudio’.

Book of printed music ‘Come Death, I shall not fear thee ...’ with composer ‘Monteverdi, Claudio’.

(Because of marc-leader info.)

Page 30: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

30

Generating metadata: Grove works lists

Page 31: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

31

Our Tool for the Works Lists

Page 32: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

User Interface

32

Page 33: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

33

‘musicSpace’ is a faceted browser

Page 34: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

34

Screencast 1: Which scribes have created manuscripts of Monteverdi’s works, and which other composers’ works have they inscribed?

Page 35: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

35

Screencast 2: What recording of works by Cage exist, which performers have recorded a particular work by Cage, and what else have they recorded?

Page 36: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

3. Evaluation

36

Page 37: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

37

Feedback on speed and ease of use:

‘All the information showed up very quickly, and it was easy to find material. It was really good to have different kinds of material in the same place.’

‘[musicSpace offers] a speedier way to research crossed search pathways.’

‘Excellent interface – very simple to understand.’

Page 38: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

38

Feedback on browsing around a subject or changing the search

paradigm:

‘I would recommend musicSpace for its ability to manipulate queries in order to get results that you wouldn’t otherwise be able to get [without starting over].’

‘I liked the ability to explore around a topic once you’ve identified something of interest.’

‘The ability to switch columns around and add new columns was most useful.’

Page 39: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

39

Feedback on improved data granularity:

‘[Without using musicSpace] it would not be at all easy to do an opera character search. You would have to use printed reference books like Pipers Enzyklopädie des Musiktheaters, but even this does not have an index of characters, so you’d have to look at the entry for each opera and manually collate information. You would also have to know what you were looking for before starting out!’

‘I used musicSpace to explore how many operas have a character named Alceste. This information simply isn’t get-at-able using other search interfaces – you’d have to sort through the information by hand.’

Page 40: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

An invitation to try our demo

Musicologists: – Monteverdi recordings– C19th opera buffa– Schubert’s songs – C20th electroacoustic music

Music librarians / library scientists

Music technologists / web scientists

40

Page 41: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

An invitation to try our demo

musicspace.mspace.fm

(needs Firefox)

41

Page 42: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

4. The Future

42

Page 43: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

43

Works lists project with Grove Music Online

Composer URI project

Page 44: IAML Annual Study Weekend 12 April 2010 musicSpace: Music and the Semantic Web

44

Thank you!