23
1 Class presentations, Epilogue CS 249B: Science of Networks Week 15: Monday, 05/05/08 Daniel Bilar Wellesley College Spring 2008

Class presentations, Epilogue - Wellesley CScs.wellesley.edu/~cs249B/lecture/05.05.08.LastLecture.CS249B.pdf · Matthew effect) Hierarchy generation ... development , application

Embed Size (px)

Citation preview

Page 1: Class presentations, Epilogue - Wellesley CScs.wellesley.edu/~cs249B/lecture/05.05.08.LastLecture.CS249B.pdf · Matthew effect) Hierarchy generation ... development , application

1

Class presentations, Epilogue

CS 249B: Science of NetworksWeek 15: Monday, 05/05/08Daniel BilarWellesley CollegeSpring 2008

Page 2: Class presentations, Epilogue - Wellesley CScs.wellesley.edu/~cs249B/lecture/05.05.08.LastLecture.CS249B.pdf · Matthew effect) Hierarchy generation ... development , application

2

Goals today

� Epilogue of CS249B and review

� (Network science redux: Up one level of scale to Motifs)

Page 3: Class presentations, Epilogue - Wellesley CScs.wellesley.edu/~cs249B/lecture/05.05.08.LastLecture.CS249B.pdf · Matthew effect) Hierarchy generation ... development , application

3

Epilogue: Have we done what we set out to do?

Metrics, models, processes and algorithms of networks

� Graph theory� Probability theory� Power laws � Phenomenological models

� Random graphs (Erdos-Reniy)� Small world (Watts-Strogatz)

� Generative (dynamic) models� Preferential attachment (aka Yule

process , Gibrat principle, Matthew effect)

� Hierarchy generation (fractal, self-similarity)

� [Self-organized criticality, EOC]� HOT systems

� Processes and Consequences� Attack, Error tolerances� Diffusion, Epidemics

Learning to learn� Dealing with unknowns,

confusion, abundance of new material

� Efficient reading of difficult papers

� Mind maps/ Concept maps� Presentation to audience� Culminating project

Scientific paradigm� Witness the unfolding of the

scientific paradigm� Course material� Self

� A first-hand look at the acquisition, discussion, correction and evaluation of scientific models to explain phenomena

Page 4: Class presentations, Epilogue - Wellesley CScs.wellesley.edu/~cs249B/lecture/05.05.08.LastLecture.CS249B.pdf · Matthew effect) Hierarchy generation ... development , application

4

You� You showed consistent progress and development, application of our class concepts, and most of all inquisitive couragein your final project

� You took chances, dared to be bold, staked out a position, then argued scientifically, empirically and logically – this is key

� Imagination as the first step is much more important than knowledge “Think like Einstein”

� Don’t fear being wrong, you cannot grow if you do not take that chance

It was a pleasure and a real treat to work with you this semester ☺

Page 5: Class presentations, Epilogue - Wellesley CScs.wellesley.edu/~cs249B/lecture/05.05.08.LastLecture.CS249B.pdf · Matthew effect) Hierarchy generation ... development , application

5

Have a great summer!� You might forget most material of this course in time -> not so tragic� More important are practice of skills that you learned are universally

applicable� Dealing with unknowns, confusion, abundance of new material� Efficient reading of difficult papers, negotiating unknowns, impasses� Clearly expressing yourself in English

� The Scientific Paradigm: In order of importance� Being bold, Primacy of data, “Consillience of Induction” and logic

looks like a duckquacks like a duckhas duck genome

hangs out with ducksPope tells you it is a duck

lives in waterdoes ducky things“I am your father” Duck Vader ☺

Page 6: Class presentations, Epilogue - Wellesley CScs.wellesley.edu/~cs249B/lecture/05.05.08.LastLecture.CS249B.pdf · Matthew effect) Hierarchy generation ... development , application

6

Comparing Apples with Oranges

� A spectrographic analysis of ground, desiccated samples of a Granny Smith apple and a Sunkist navel orange

Sandford S. Apples and oranges: a comparison. Annals of Improbable Research 1995;1(3)

Read other ground breaking research: “Is Kansas flatter than a pancake?” at

http://www.improbable.com/airchives/paperair/volume9/v9i3/kansas.html

Page 7: Class presentations, Epilogue - Wellesley CScs.wellesley.edu/~cs249B/lecture/05.05.08.LastLecture.CS249B.pdf · Matthew effect) Hierarchy generation ... development , application

7

Addendum

Page 8: Class presentations, Epilogue - Wellesley CScs.wellesley.edu/~cs249B/lecture/05.05.08.LastLecture.CS249B.pdf · Matthew effect) Hierarchy generation ... development , application

8

Motifs

� We discussed some properties of network and some conclusions that researchers drew

� Some processes (which are temporal, i.e. over time) may be driven or influenced by re-occurring structures or subgraphpatterns

� In the network context, these structures are called motifs

Page 9: Class presentations, Epilogue - Wellesley CScs.wellesley.edu/~cs249B/lecture/05.05.08.LastLecture.CS249B.pdf · Matthew effect) Hierarchy generation ... development , application

9

Recurring Patterns are motifsIdea: If a pattern occurs more often in real world

networks than random networks, it is called a motifWhere have we seen this type of approach before?

Technique:1. Construct many random graphs with the same number of

nodes and edges ( and same node degree distribution)

2. Count the number of motifs in those graphs

3. Calculate the Z score: the probability that the given number of motifs in the real world network could have occurred by chance

� Software available:� http://www.weizmann.ac.il/mcb/UriAlon/

Page 10: Class presentations, Epilogue - Wellesley CScs.wellesley.edu/~cs249B/lecture/05.05.08.LastLecture.CS249B.pdf · Matthew effect) Hierarchy generation ... development , application

10

Motif count criteria

� Three concepts are considered for the determination of motif frequency, which have different applications and restrictions on counting overlapping matches (i.e. matches sharing edges or vertices) for motif frequency.

� F1 has no restrictions

� F2 allows the sharing of vertices, but not of edges

� F3 does not allow any sharing of network elements

Page 11: Class presentations, Epilogue - Wellesley CScs.wellesley.edu/~cs249B/lecture/05.05.08.LastLecture.CS249B.pdf · Matthew effect) Hierarchy generation ... development , application

11

Application of F1, F2 and F3

motif matches in the target graph

motif to be foundgraph

one of {M1,M2,M3,M4,M5}

F3

{M1,M4} or {M3,M4}F2

{M1,M2,M3,M4,M5}F1

MatchConcept

Page 12: Class presentations, Epilogue - Wellesley CScs.wellesley.edu/~cs249B/lecture/05.05.08.LastLecture.CS249B.pdf · Matthew effect) Hierarchy generation ... development , application

12

3-node Feed-forward loop

Idea: Processes decision-making based on fluctuating external signals

Shen-Orr, Milo, Mangan, Alon. Network motifs in the transcriptional regulation network of Escherichia coli. Nature Genetics, 31:64-68 (2002)

“coherent FF loop act as a circuit that rejects transient activation signals from the general transcription factor and responds only to persistent signals, while allowing a rapid system shutdown”

What does this mean in English?

Page 13: Class presentations, Epilogue - Wellesley CScs.wellesley.edu/~cs249B/lecture/05.05.08.LastLecture.CS249B.pdf · Matthew effect) Hierarchy generation ... development , application

13

4-node: Single-Input module

“motif can show a temporal program of expression according to a hierarchy of activation thresholds of the genes”

What does this mean in English?

Shen-Orr, Milo, Mangan, Alon. Network motifs in the transcriptional regulation network of Escherichia coli. Nature Genetics, 31:64-68 (2002)

Idea: Temporal ordering useful in processes that require several stages to complete

Page 14: Class presentations, Epilogue - Wellesley CScs.wellesley.edu/~cs249B/lecture/05.05.08.LastLecture.CS249B.pdf · Matthew effect) Hierarchy generation ... development , application

14

What is a Z-score?

� A common statistical way of standardizing data on one scale so a ‘normalized’ comparison can take place� “yard stick” for all types of data

� Each z-score corresponds to a point in a normal distribution

� z-score will describe how much a point deviates from a mean or specification point.

Page 15: Class presentations, Epilogue - Wellesley CScs.wellesley.edu/~cs249B/lecture/05.05.08.LastLecture.CS249B.pdf · Matthew effect) Hierarchy generation ... development , application

15

What the Z score meansµ =mean number of times the motifappeared in the random graph

# of times motif

appeared in random graph

zx=x - µµµµx

σσσσx

σ standard deviationthe probability observing a Z

score of 2 is 0.02275

In the context of motifs:

Z > 0, motif occurs more often

than for random graphs

Z < 0, motif occurs less often

than in random graphs

|Z| > 1.65, only a 5% chance of

random occurence

Page 16: Class presentations, Epilogue - Wellesley CScs.wellesley.edu/~cs249B/lecture/05.05.08.LastLecture.CS249B.pdf · Matthew effect) Hierarchy generation ... development , application

16

Z-scores and motif profiling

Milo et al. Superfamilies of designed and evolved networks.Science, 303:1538-42 (2004)

Page 17: Class presentations, Epilogue - Wellesley CScs.wellesley.edu/~cs249B/lecture/05.05.08.LastLecture.CS249B.pdf · Matthew effect) Hierarchy generation ... development , application

17

General motif detection approach

Milo, Shen-Orr, Itzkovitz, Kashtan, Chklovskii & Alon. Network Motifs: Simple Building Blocks of Complex NetworksScience, 298:824-827 (2002)

Page 18: Class presentations, Epilogue - Wellesley CScs.wellesley.edu/~cs249B/lecture/05.05.08.LastLecture.CS249B.pdf · Matthew effect) Hierarchy generation ... development , application

18

Milo (2002) Motif discovery� Discovery of recurring patterns – motifs in broad empirical networks

� Can randomly remove, add, change 20% of edges and set of significant motifs do not change!

� Speculation on classification of network families based on motifs

� “Information processing”, computational circuits?

Page 19: Class presentations, Epilogue - Wellesley CScs.wellesley.edu/~cs249B/lecture/05.05.08.LastLecture.CS249B.pdf · Matthew effect) Hierarchy generation ... development , application

19

Mangan (2003) FFL motif

� Investigation of one motif, the Feed Forward loop� Most significant network motifs in both E. coli and yeast� A three-gene pattern, is composed of two input transcription factors, one of which

regulates the other, both jointly regulating a target gene.� The FFL has eight possible structural types (why?), because each of the three

interactions in the FFL can be activating or repressing� Mathematically modeled the 8 types

� All are sign sensitive� Both coherent and incoherent types delay responses to stimulus steps, but only in one

direction

Page 20: Class presentations, Epilogue - Wellesley CScs.wellesley.edu/~cs249B/lecture/05.05.08.LastLecture.CS249B.pdf · Matthew effect) Hierarchy generation ... development , application

20

Milo (2004) Family classification� Superfamilies can be constructed

based on Triad Significance Profile (TSP) acting as a “Motif signature”� Motif and antimotif

� Examples

Sensory transcription networks� “rate limited networks” by

individual component, faster� FFL over-, 3-chain under-

represented

Signal transduction networks� Not “rate limited”, slower� Triads 7,9,10 (+), 1,2,4,5 (-)

Social networks, languageWWW

Similarities between networks visualized bylooking at the correlation between theTSPs of different networks. Correlationsused to cluster the networks into distinctsuperfamilies

Page 21: Class presentations, Epilogue - Wellesley CScs.wellesley.edu/~cs249B/lecture/05.05.08.LastLecture.CS249B.pdf · Matthew effect) Hierarchy generation ... development , application

21

Finding classes on graphs based on motif signatures

Page 22: Class presentations, Epilogue - Wellesley CScs.wellesley.edu/~cs249B/lecture/05.05.08.LastLecture.CS249B.pdf · Matthew effect) Hierarchy generation ... development , application

22

Papers of interest

� Shen-Orr (2002) “Network motifs in the transcriptional regulation network of Escherichia coli”

� Przulj (2006) “Biological Network Comparison Using Graphlet Degree Distribution”

� Gupta (2007) “Quantifying similarities betweeen motifs”

� Learn more at http://www.weizmann.ac.il/mcb/UriAlon

Page 23: Class presentations, Epilogue - Wellesley CScs.wellesley.edu/~cs249B/lecture/05.05.08.LastLecture.CS249B.pdf · Matthew effect) Hierarchy generation ... development , application

23

Comparing Apples with Oranges

� A spectrographic analysis of ground, desiccated samples of a Granny Smith apple and a Sunkist navel orange

Sandford S. Apples and oranges: a comparison. Annals of Improbable Research 1995;1(3)

Read other ground breaking research: “Is Kansas flatter than a pancake?” at

http://www.improbable.com/airchives/paperair/volume9/v9i3/kansas.html