29
NITF Maintenance www.NITF.org “All news is an exaggeration of life.” - Daniel Schorr, CBS News Stuart Myles Alan Karben Dow Jones XML Team Solutions Prague / October 17, 2007

NITF Working Group October 2007

Embed Size (px)

DESCRIPTION

IPTC's NITF is the XML standard for marking up news articles. These slides are from the October meeting of the NITF Working Group.

Citation preview

Page 1: NITF Working Group October 2007

NITF Maintenance www.NITF.org

“All news is an exaggeration of life.”- Daniel Schorr, CBS News

Stuart Myles Alan KarbenDow Jones XML Team Solutions

Prague / October 17, 2007

Page 2: NITF Working Group October 2007

© IPTC – www.iptc.org 2

Agenda

• Approval of minutes from previous meeting• Matters Arising• Chairman’s Report• State of NITF Today• NITF 4.0• Future of NITF• Other business• The next meeting

Page 3: NITF Working Group October 2007

© IPTC – www.iptc.org 3

NITF Minutes

• Approval of Minutes from previous meeting: Tokyo 30th May 2007

NM0704.1

Page 4: NITF Working Group October 2007

© IPTC – www.iptc.org 4

NITF Matters

• Matters arising?- I took over!- Děkuji to Alan Karben.

Page 5: NITF Working Group October 2007

© IPTC – www.iptc.org 5

State of NITF Today

• News Industry Text Format

• “A solution for sharing news”

• Developed by News Publishers, for News Publishers

• Defines the content and structure of news articles

• NITF v3.4 in DTD and XML Schema

• http://www.nitf.org

Page 6: NITF Working Group October 2007

© IPTC – www.iptc.org 6

State of NITF Today

• IPTC’s most widely-used XML standard– 455 members on the Y! list– Since June, 9 subscribed, 4 unsubs

• Recent history– NITF largely the same since 90’s– Consolidation: removed HTML formats,

streamlined enriched text, etc.– XML schema

• Stability

Page 7: NITF Working Group October 2007

© IPTC – www.iptc.org 7

State of NITF Today

• Some activity– Only 27 Y! messages since June– No one responded to NITF profiles or

contributed to the G2 map

• Emerging Alternatives– XHTML, Microformats, RDFa, PRISM, etc.

• NITF is strong, but with looming challenges

Page 8: NITF Working Group October 2007

© IPTC – www.iptc.org 8

NITF 4.0

• NITF Profiles: Core– Inline and structural markup

– No metadata that conflicts with G2– Slimmed-down set of NITF elements– http://tinyurl.com/ywzawr

• NITF Profiles: Power– Map Power metadata to G2 metadata– http://tinyurl.com/2rgfx6

Page 9: NITF Working Group October 2007

© IPTC – www.iptc.org 9

NITF 4.0: Core, Power, G2

G2

PowerCore

Metadata not in NITF

G2 expansion of NITF possible

Map to G2

No map to G2

Page 10: NITF Working Group October 2007

© IPTC – www.iptc.org 10

Containers:• paragraph• sub-headline• table• media• list• sidebar• preformatted block• editorial note

Closers:• credit-line• biographical blurb

Many Containers hold Enriched Text:– phrases (people, titles, etc)– highlights (stylistic emphasis)– link– break

Core Conceptual Modelarticle

abstract

headlinesuper-headline

sub-headline

main-headline

byline

Containers

Closer

G2

main-headline

byline

phrases

link

Page 11: NITF Working Group October 2007

© IPTC – www.iptc.org 11

NITF Power Modelnitf

head

title

body

Most NITF <head> elements and attributes map to G2 (or make no sense)

Notable exceptions are<iim><pubdata> and<revision-history>

Map details athttp://tinyurl.com/2rgfx6

meta

tobject

iim

docdata

pubdata

revision-history

iim

pubdata

revision-history

Page 12: NITF Working Group October 2007

© IPTC – www.iptc.org 12

NITF G2: Embedded Media

• NITF articles often embed a media item, e.g. image, audio, video, etc

• The media item often has associated metadata, e.g. copyright, mime-type, height, width, etc.

• When switching to NITF+G2, what do you do with the media item and its metadata?

media

body.content

nitf

media-metadata

media-reference

media-caption

Page 13: NITF Working Group October 2007

© IPTC – www.iptc.org 13

NITF G2: Embedded Media

It depends.

Page 14: NITF Working Group October 2007

© IPTC – www.iptc.org 14

NITF G2: Embedded Media

• Three options:– Create an item for the media object and

use packageItem to glue the NITF article and media object together

– Use links in the item header to reference media objects needed for the item to display correctly

– Embed the media object in the NITF article

Page 15: NITF Working Group October 2007

© IPTC – www.iptc.org 15

NITF G2: Embedded Media

Create an item for the media object and use packageItem to glue the NITF article and media object together

newsMessage

itemSet

packageItem

newsItem

newsItem

nitf

media

Page 16: NITF Working Group October 2007

© IPTC – www.iptc.org 16

NITF G2: Embedded Media

Use a link in the item header to reference the media object

(e.g. a graphic / audio / video on the web).

newsMessage

newsItem

nitf

media

link

Page 17: NITF Working Group October 2007

© IPTC – www.iptc.org 17

NITF G2: Embedded Media

– Embed the media object in the NITF article

– Provide context– Exact placement– Specific order– Captions often

specific to the article

media

body.content

nitf

media-reference

newsItem

media-caption

link

Page 18: NITF Working Group October 2007

© IPTC – www.iptc.org 18

NITF G2

• The choices with embedded media illustrate a general conclusion

• When using NITF within NewsML-G2 there will be a number of choices that news publishers need to make

• How much to embrace the “G2 Way”?

Page 19: NITF Working Group October 2007

© IPTC – www.iptc.org 19

NITF G2: qcodes

• The NAR makes extensive use of qcodes

• Should NITF adopt qcodes too?

• qcodes are “qualified codes”

• Scheme identifier followed by a colon followed by a code (which can contain a colon), e.g.

qcode=“org:DJ”

qcode=“poi:cz:praha”

Page 20: NITF Working Group October 2007

© IPTC – www.iptc.org 20

NITF G2: qcodesIn NAR’s PCL text markup:

<headline>The

<inline qcode=“org:DJ”>

Dow Jones</inline> representative visited <inline qcode=“poi:cz:praha”>

Prague

</inline>

</headline>

In NITF

<hl>The

<org idsrc=“org” value=“DJ”>

Dow Jones</org> representative visited <city code-src=“poi” city-code=“cz:praha”>

Prague

</city>

</hl>

Page 21: NITF Working Group October 2007

© IPTC – www.iptc.org 21

NITF G2: qcodes

• NAR provides powerful metadata capabilities using qcodes– Hooks into conceptItem (path to ontology)– Ambiguous assertions of identity

• Nothing to stop providers using qcode-like values for existing NITF attributes<org value=“org:DJ”>

• Or could add non backward compatible qcode attribute to relevant elements<city qcode=“poi:cz:praha”>

Page 22: NITF Working Group October 2007

© IPTC – www.iptc.org 22

NITF G2: qcodes

• Proposal: Do not add qcodes to NITF.Instead, use NAR’s inlineRef and NITF’s id.

• NAR’s <inlineRef> mechanism allows qcodes to be applied to any element that sports an XML id attribute

• All NITF elements support id• Including the useful “catchall” <classifier>

Page 23: NITF Working Group October 2007

© IPTC – www.iptc.org 23

NITF G2: qcodes example

<newsItem><inlineRef idrefs=“e1” qcode=“e:happy” confidence=“77”>

<name>Happiness</><description>Mirth.</><inlineRef idrefs=“p7” qcode=“p:buddha”><name>Gautama Buddha</></inlineRef>…<nitf><person id=“p7”>Buddha</> discussed the role of the mind in the pursuit of <classifier id=“e1”>happiness</> through the practice of the eightfold path…

</nitf></newsItem>

Page 24: NITF Working Group October 2007

© IPTC – www.iptc.org 24

NITF 4.0: Next Steps

1. Re-solicit comments on Yahoo groups

2. Resolve and harmonize discussion points

3. Draft an NITF-in-G2 User Guide

4. Vote on NITF 4.0 Proposal in Beijing

Page 25: NITF Working Group October 2007

© IPTC – www.iptc.org 25

NITF Future

• Stability– Few changes recently– Not even to integrate with NewsML-G2

• Requests– Inline markup– More complex and richer– Inspired by microformats, XHTML

Page 26: NITF Working Group October 2007

© IPTC – www.iptc.org 26

NITF Future: Drive Adoption

• Text article model is close to universal

• And yet it is being re-invented:– PRISM, microformats,

RDFa, XHTML, etc.

• This hurts interoperability and therefore competition and costs

article

abstract

headlinesuper-headline

sub-headline

main-headline

byline

Containers

Closer

Page 27: NITF Working Group October 2007

© IPTC – www.iptc.org 27

NITF Future: Drive Adoption

• What is the best way to drive adoption?• We could promote the NITF format

– Evangelize the NITF tags amongst publishers of other kinds

– Adopt more and more of the complex markup of XHTML

• We could promote the common article model– Map from the model to XHTML, RDFA, etc.– De-emphasize the NITF tags themselves

Page 28: NITF Working Group October 2007

© IPTC – www.iptc.org 28

NITF Future: Drive Adoption

• In my view, the best way to drive adoption is to do a combination of the two

• We should create an explicit article model (similar to the NAR)

• We would then create maps from the article model to various formats

• The IPTC would declare support for NITF as “first among equals”

Page 29: NITF Working Group October 2007

© IPTC – www.iptc.org 29

NITF

• Any other business?

• Date and place of next meeting:– Beijing, China Spring 2008

Děkuji!