Mdst3703 culturomics-2012-11-01

Lecture/Studio:Culturomics

Prof. AlvaradoMDST 3703/77031 November 2012

Business

• Everyone’s families and friends OK?

Review

• The New Epistemology– Rise of Big Data: massive, available, social– Shifts our relationship to primary sources– From reading to quantitative methods and

visualizations– Example of media determinism

• Manovich– Consistent with database logic– Applies spirit of Big Data methods to art

Review

• Rationalization Effects– What are we looking at?– What is theory?– What are models?– What is culture?– What are the humanities?

Overview

• Combined Studio and Lecture• Lecture– Google’s NGram Viewer– Culturomics

• Studio: – Collaborative Topic Index

Google Does the Humanities

Google NGrams

• Google Books comprises 11% of the corpus of published books, about 2 trillion words

• NGrams uses 5.2 million books (4% of the corpus)

• 500 billion words• Published between 1500-1800• In English, French, Spanish, German, Chinese

and Russian (Hebrew too)

Erez Lieberman Aiden and Jean-Baptiste Michel

What’s an NGram?

A space-delimited string

N = number of strings

Case sensitivePurely syntactic

Very hard to index

Culturomics

• A method more than a model (like Anderson argues)

• Analogy is to genomics– Does this make sense? – What is the analog to the gene?

Parallel

Crossing

Convergent/Divergent

American

British

“There’s not even a historian of the book connected to the project,” Mr. Menand noted.

Anthony Grafton, History, Princeton

Studio

• We are now at the point where we have all the pieces in place– HTML markup, CSS, JavaScript– Structured data (table in Google Docs)– Visualization tools

• Create Character Index– We will use everything we have done so far – notes,

network visualizations, etc.– Today we begin to collaboratively create the Character

Index (a subset of a full topic index)

Mdst3703 culturomics-2012-11-01

Documents

Visions and open challenges for a knowledge-based culturomicsmogren.one/publications/2015/visions/tahmasebi-visions-2015.pdf · Visions and open challenges for a knowledge-based culturomics

Mdst3703 2013-08-29-hello-world

Mdst3703 2013-09-03-plato2

Mdst3703 visualization-2012-10-23

COMPOSIÇÃO DE JOGOS DE MOTOR - … Volks.pdf · gol 1.6-1.8 - retificaÇÃo c/ retentor 01 01 01 01 04 01 01 01 01 08 01 01 01 01 01 01 01 01 02 01 01 01 02 02 j. cabeÇote j. tampa

mizuho-topics-2020-2 02142002 ／ 01 03 ／ 01 04 ／ 01 05 ／ 01 06 ／ 01 07 ／ 01 08 ／ 01 09 ／ 01 10 ／ 01 11 ／ 01 12 ／ 01 13 ／ 01 14 ／ 01 15 ／ 01 16 ／ 01 17

1 · Web view2017/18. 2000/01. 2000/01. 2000/01. 2000/01. 2000/01. 2000/01. 2000/01. 2001/02. 2000/01. 2000/01. 2000/01. 2000/01. 2000/01. 2000/01. 2000/01. 2001/02. 2000/01. 2000

Ascri 05_2017.pdf · CST em Gestão ambiental 01 01 01 01 01 Engenharia ElétriCa, Engenharia Eletrônica, Engenharia de Telecomunicações e Engenharia de 01 01 01 01 01 01 01 01

شهداء الضفة الغربية، 1980- 1989info.wafa.ps/pdf/west_bank_1980-1989.pdf1 08980891 1 01/01/1980 01/01/1954 2 01/01/1980 01/01/1958 3 01/01/1980 01/01/1960 4 01/01/1980

w3.ufsm.brw3.ufsm.br/cachoeira/images/2018/editais/Edital_Monitoria_1sem... · 01 01 01 01 01 01 01 01 01 01 01 01 01 01 02 01 01 01 01 Disciplinas ... Circuitos Elétricos I e Il

Analyzing Text at the Middle Distance between the Close Read and Culturomics Marti A. Hearst U.C Berkeley Joint Work with Aditi Muralidharan

Mdst3703 2013-09-24-hypertext

laayoune.ma · -av.f.a.r b.p:495 laayoune 70000 sakia el hamm - maroc web page : . 72 01 01 02 01 01 01 01 02 04 01 01 04 01 01 04 01 01 15 11 01 01 2m 02 . 04 04 01 01 01 02 11 02

Digital archives, big data and image-based culturomics for social … · 2017-12-14 · 1 Digital archives, big data and image-based culturomics for social impact assessment: opportunities

SUPPLEMENTARY INFORMATION - Nature · 1 Supplementary informations: 2 Culture of previously uncultured members of the human gut microbiota by culturomics 3 Jean-Christophe Lagier

Culturomics: Reflections on the Potential of Big Data Discourse … · 2019-10-22 · Big data discourse analysis is a current trend in ... methods on the available massive volumes

Mdst3703 2013-10-01-hypertext-and-history

BigData@Chalmers Machine Learning Business Intelligence ... · Machine Learning Business Intelligence, Culturomics and Life Sciences Devdatt Dubhashi LAB (Machine Learning. Algorithms,

Mdst3703 graph-theory-11-20-2012

Mdst3703 2013-09-17-text-models