Control Biology, Wolkenhauersub

Embed Size (px)

Citation preview

  • 8/2/2019 Control Biology, Wolkenhauersub

    1/11

    Systems Biology

    Looking at opportunitiesand challengesin applyingsystemstheory to molecularand cell biology.

    S

    ystems theory or systems science has never

    managed to achieve widespread and inde-

    pendent status in curricula, departments, and

    journals but instead acts as an umbrella for a

    number of research activities across the physi-

    calandengineeringsciences. Now, withrevolu-

    tionary developments in the life sciences, there is renewedinterest in systems thinking. In this article, we survey op-

    portunities and challenges for the application of systems

    theory to biology in the postgenomic eraa new area of re-

    search also referred to as systems biology.

    With the sequencing of DNA for a number of genomes,

    scientists now have an inventory of genes available to em-

    bark on the study of the organization and control of genetic

    pathways. This new phase in the biological revolution, the

    postgenomic era, is closely associated with the fields

    genomics, transcriptomics, proteomics, and metabolo-

    mics (called the omics for short). These fields take us

    from the DNA sequence of a gene to the structure of the

    product forwhich it codes (usually a protein)to theactivity

    of that protein and its function within a cell, the tissue, and

    ultimatively the organism. A series of articles in Nature [1]

    are recommended for an introduction to these research ar-

    eas of the biomedical sciences.

    With the emergence of the omics, molecular biology

    currently witnesses a shift of focus from molecular charac-

    terization to the understanding of functional activity. The

    two central questions scientists investigate are What are

    the genes functional role? and How do genes and/or pro-

    teins interact? Answeringthesequestions hasbecome pos-

    sible with new high-throughput technologies to take

    measurements at the molecular level. In the past, single

    genes were studied, but with DNA microarray technology

    we can now measure the activity levels of thousands of

    38 IEEE Control Systems Magazine August 20030272-1708/03/$17.002003IEEE

    EYEWIRE

    By Olaf Wolkenhauer, Hiroaki Kitano, and Kwang-Hyun Cho

  • 8/2/2019 Control Biology, Wolkenhauersub

    2/11

    genes at thesame time. Thus it becomes possible to identify

    interrelationships between groups of genes (with respectto

    their functional role) and to analyze dynamic interactions

    among genes (gene networks). Similarly, proteomics re-

    search shows that most proteinsinteractwith several other

    proteins, andit is increasingly understood that the function

    of a protein is appropriately described in the context of its

    interactions with other proteins.Most of these interactions are the

    consequence of dynamic and con-

    trolled processes, and it is not sur-

    prising thatthere is renewedinterest

    in the application of systems think-

    ing to biology.

    The rest of this article is orga-

    nized as follows. We first introduce systems biology in the

    context of the study of complex systems, reviewing a num-

    berof relatedandrelevant areasof research anddefine com-

    plexity in thecontextof biological systems. Systems biology

    has a history, and its early stages in the 1960s involved emi-

    nent researchers, including Wiener, Kalman, Bertalanffy,

    Rosen, and Mesarovic. We discuss why these attempts dis-

    appeared from the research agendas and why there is re-

    newed interest in the postgenomic era of the life sciences.

    We then followwithtwoexamples of systems biology before

    outlining current activities from groups around the world.

    We conclude by listing some of the challenges and hurdles

    for this (re)emerging field.

    Genomic CyberneticsThe understanding of causality and coping with complexity

    is not the holy grail of science but part of its very essence.

    Notsurprisinglythen, complexity studies have remainedas

    elusive as inconclusive.

    Weaver [2] defined disorganized complexity as a prob-

    lemin whichthenumber ofvariablesis verylarge andany of

    these variablesis best describedas a randomprocess. Here

    we are at themolecular level, andthe most successful for-

    mal methods for representing phenomena at this level de-

    rive from statistical considerations. In the context of the

    cell, at the cellular level, matters are complicated by the

    fact that organization becomes an essential feature of the

    processes under consideration. Weaver referred to prob-

    lems in which a large numberof factors are interrelatedinto

    a wholeas organized complexity. The number of variables

    is too large to be dealt with in the Newtonian realm of phys-

    ics and mathematical modeling, and the systems are too or-

    ganized to allow statistical techniques. At the time, Weaver

    described organized complexity as the challenge for sci-

    ence in the coming 50 years. The enthusiasm he expressed

    in 1948 is very similar to how one feels today in the

    postgenomic era of the life sciences: It is doubtless true

    that we are only scratching the surface of the cancer prob-

    lem, but at least there are now some tools to dig with and

    there have been located some spots beneath which almost

    surely there is pay-dirt [2].

    Following Hakens synergetics, chaos theory, and frac-

    tals, the science of self-organized criticality [3], nonequili-

    brium physics, power laws, and emergent phenomena

    renewed the interest in complexity studies over the last de-

    cade orso.Thesestudiesdevelopedmostlywithintheareas

    of physics and mathematics. They are seeking general prin-

    ciple of phenomena that can be observed in a wide range of

    disciplines. Kauffmans workon genetic networks [4] paved

    the way for complexity studies in biology. The work of

    Goodwin [5], [6], Harrison [7], and Meinhardt [8] marked a

    trend toward an approach more focused on specific organ-

    isms but continued to investigate cellular processes and

    morphological development in evolutionary terms. As Har-

    old pointed out in his recent book: Complexity studies is a

    fresh label for a well-known pigeonhole: general systems

    theory, that waspioneered by LudwigvonBertalanffy in the

    1930s [9, p. 222]. Systems biology is an emerging field that

    continues this research into the postgenomic era of the life

    sciences [10]-[12]. Complexity studies and systems biology

    are different in that the latter takes a signal- and sys-

    tems-oriented approachto describe the dynamic processes

    within and between biological cells. As we shall see later,

    systems biology has more to do with the application of sys-tems and control theory to cellular systems than with the

    application of physics to biology.

    Systems biology provides a vital interface between cell

    biology and biotechnological applications. Before we dis-

    cuss this area in greater detail in subsequent sections, we

    note that complexity in the context of biological systems

    can be defined as

    a propertyof an encoding(mathematicalmodel) (e.g.,

    its dimensionality, order, or number of variables)

    an attribute of the natural system under consider-

    ation (e.g., the number of components, descriptive

    and organizational levels that ensure its integrity)

    our ability to interact with the system and to observe

    it (i.e., to make measurements and generate experi-

    mental data).

    On all three accounts,genes,cells, tissues,organs,organ-

    isms, and populations are individually and as a functional

    whole a complex system. It is the availability of experimen-

    tal techniques, modern microscopy, laser tweezers, and

    nanotechnology, as well as DNA microarrays, gel technol-

    ogy, and mass spectrometry that drives this renewed inter-

    est in complexity studies and systems biology. While the

    August 2003 IEEE Control Systems Magazine 39

    Systems biologyapplyingsystems theory to biology

    in the postgenomic era.

  • 8/2/2019 Control Biology, Wolkenhauersub

    3/11

    technology to generate and manage data races ahead, it be-

    comes apparent that methodologicaladvances in the analy-

    sis of data are urgently required if wewantto turn the newly

    available data into information and knowledge. This need

    for research into new methodologies and the development

    of novel conceptual frameworks has been neglected in the

    euphoria about new technology. Problems in the post-

    genomic era of the life sciences will not only be experimen-

    tal or technical, but also conceptual. The interpretation of

    data, turning information into knowledge, is as important

    for scientific and biotechnological progress as the possibil-

    ity of generating and managing data.

    With the generation of vast amounts of data, computer

    scientists have been the natural allies of biologists in the

    management of these data. The growth of bioinformatics

    parallels the excitingdevelopments in biology. However the

    availabilityof genomesequencedata has ledto a focus shift

    from molecular characterization and sequence analysis to

    an understanding of functional activity and now interac-

    tions of genes and proteins in pathways. Gene expression

    and regulation, i.e., to understand the organization and dy-

    namicsof genetic, signaling, andmetabolicpathways, is the

    challenge for the next 50 years. The nature of the experi-ments and the data thereby generated requires an alliance

    of the biological and biomedical sciences with physical sci-

    entists (engineers, mathematicians, and physicists). The

    following discussionon the challenges andhurdleswillclar-

    ify why such an alliance is so important.

    (Not) A New Kid on the BlockAlthough generallyconsidered to be a newareaof research,

    systems biology is not without history, and as early as the

    1960s the term was used to describe the application of sys-

    tems and control theory to biology [12]. At the time,

    Mesarovic wrote: In spite of the considerable interest andefforts, theapplication of systems theory in biology has not

    quite lived up to expectations. ...oneof themain reasons for

    the existing lag is that systems theory has not been directly

    concernedwith some of theproblemsof vital importancein

    biology. Today, scientists in this field are motivated by the

    availability of experimental data, including, for example,

    DNA microarray time series, and interdisciplinary collabo-

    rations are widely supported. In fact, the importance of in-

    terdisciplinary research and close collaborations between

    biologists and physical scientists is evident in the many

    multidisciplinary research centers that are being built

    around the world, gently forcing researchers to interact by

    confining them into purpose-built housing.

    Mesarovic further suggested that progress could be

    made by more directand stronger interactions of biologists

    with systemscientists: The real advance in the application

    of systems theory to biology will come about only when the

    biologists start asking questionswhich are based on the system-theo-

    retic concepts rather than using

    these concepts to represent in still

    another way the phenomena which

    are already explainedin terms of bio-

    physical or biochemical principles.

    ... then we will not have the applica-

    tion of engineering principles to bio-

    logical problems butrather a field of

    systems biologywith its own identity and in its own right.

    Molecular characterization has led to very accurate spatial

    representations of cellular components, and biochemical

    modeling has been the main approach to studying cellular

    processes. However, the future lies in extending this knowl-

    edge to observations at higher organizational levels. There

    are fewexamplesof a concertedeffort to translate biologi-

    cal representations of gene expression and regulation into

    thelanguage of thesystem scientist[13], [14], andall indica-

    tions are that the field is going to provide the vital interface

    between basic cell biology, physiology, and biotechnologi-

    cal applications such as in metabolic engineering.

    Systems biology has new technologies available to gen-

    erate data from the genome, transcriptome, proteome, and

    metabolome, in addition to the physiome. However, while

    bioinformatics is usually associated with vast amounts ofdata available in databases, the systems-biological descrip-

    tion of cellular processes often suffers from a lack of data.

    Gene Expression and RegulationEach cell of a (multicellular) organism holds the genome

    with theentire genetic material, represented by a large dou-

    ble-stranded DNA molecule with the famous double-helix

    structure. Cells are therefore the fundamental unit of living

    matter. They take up chemical substances from their envi-

    ronmentandtransformthem. Thefunctionsofa cell are sub-

    ject to regulation such that the cell acts and interacts in an

    optimal relationship with its environment. The centraldogma of biology describes how information, stored in

    DNA, is transformed into proteinsvia an intermediateprod-

    uct, called RNA. Transcription is the process by which cod-

    ing regions of DNA (called genes) synthesize RNA

    molecules. This is followed by a process referred to as

    translation, synthesizing proteins using the genetic infor-

    mationin RNAas a template.Mostproteinsareenzymesand

    carry out the reactions responsible for the cells metabo-

    lismthe reactions that allow it to process nutrients, to

    build new cellular material, to grow, and to divide.

    40 IEEE Control Systems Magazine August 2003

    Systems biology has more to do withthe application of systems and controltheory to cellular systems than with theapplication of physics to biology.

  • 8/2/2019 Control Biology, Wolkenhauersub

    4/11

    Researchconductedin the1960s showedthatmost basic

    cellular processes are dynamic and feedback regulated.

    Whileinvestigatingregulatory proteinsand the interactions

    of allosteric enzymes, Jacob and Monod introduced the dis-

    tinction between structural genes (coding for proteins)

    and regulatory genes, which control the rate at which

    structural genes are transcribed. This control of the rate of

    protein synthesis was the first indication that such pro-cesses are most appropriately viewed as dynamic systems.

    Figure 1 illustrates the processes of gene ex-

    pression and regulation in bacterial cells.

    Although bacterial cells are capable of pro-

    ducing several thousand different proteins, not

    all are producedat the same time or inthe same

    quantity. The energy consumption for protein

    synthesisandtherelatively short half-lifeof the

    RNA molecules are reasons for the cell to con-

    trolboth thetypes andamountsof each protein.

    One example of a global regulatory network is

    the heat-shock response. When proteins are ex-

    posed to extremes of heat, they are said to un-

    dergo denaturation. Denaturation is the

    destruction of the folding properties of a pro-

    tein leading (usually) to loss of biological activ-

    ity. To counteract possible toxic effects from

    insoluble aggregates in the cell, the change in

    temperature and the quantity of denatured pro-

    teins are sensed by the cell and specific heat-

    shock proteins are produced. Figure 2 illus-

    trates heat-shock regulationof theDnaKoperon

    in the bacterium Bacillus subtilis. The protein

    DnaK is such a chaperone, one of a group of

    proteins called molecular chaperones, whichhelp other proteins to fold properly. These spe-

    cialist proteins produce barrel-like structures,

    providing an environment for the denatured

    proteins to refold. The described mechanism is

    referred to as negative control through

    repressor deactivation. A repressor protein is a

    regulatory protein that binds to a specific site

    on the DNA and thereby blocks transcription.

    Drawings like Figure 1 are frequently used in

    biology textbooks to illustrate structural as-

    pects and the spatial organization of compo-

    nents in the cell. Figure 2, on the other hand,

    also shows the signal flow and temporal effectscomponents have. The next section introduces

    an area in which signal- and systems-oriented

    representations play an even greater role.

    Intra- and IntercellularDynamics: CellularWeather ForecastingThepreviousexample described howgenes act

    and interact within the context of the cell. Bio-

    logical cells are not running a program but rather continu-

    ally sensing their environment and making decisions on the

    basis of that information. To determinehow cells act and in-

    teract within the context of the organism to generate coher-

    ent and functional wholes, we need to understand how

    information is transferred betweenand within cells. Cellsig-

    naling, or signal transduction, is the study of the mecha-

    nisms that enable the transfer of biological information.Signaling impinges on all aspects of biology, from develop-

    August 2003 IEEE Control Systems Magazine 41

    Figure 1. Gene expression and regulation in bacteria. Information, stored inthe DNA, is transformed into proteins via an intermediate product called mRNA.

    The short half-life of mRNA and the energy consumption of protein synthesis form

    the basis for a sophisticated hierarchy of a control mechanism.

    Figure 2. Negative regulation of the dnaK and GroESL operons in Bacillussubtilis. The HrcA repressor is a regulatory protein that binds at specific sites on

    DNA and blocks transcription. In the absence of stress, the GroESL co-repressor

    binds withHrcAand thereby increases repressionof thednaKoperon genes.Upon

    heat shock, the transcription rate of a group of heat shock proteins, called

    chaperones, is increased. They build barrel-type structures, which helpdenatured

    protein to refold and regain its function.

  • 8/2/2019 Control Biology, Wolkenhauersub

    5/11

    mentto disease.Manydiseases, suchas cancer, involvemal-

    function of signal transduction pathways. Downward [15]

    provided an excellent account of this field.

    Figure 3 illustrates a very basic signaling model. As indi-

    cated in the previoussection, bacteria regulatecellmetabo-

    lism in response to a wide variety of environmental

    fluctuations, including the heat-shock example above.

    Thus, there must be mechanisms by which the cells receive

    signals from the environment and transmit them to the spe-

    cific targetto be regulated. Receptorsare proteinsthat span

    the membrane, with a site for binding the signaling com-

    pound on the outer surface. Binding of the extracellular sig-

    naling compound to the outer surface of the receptor

    results in an activation of an intracellular protein (the re-sponse regulator), forexample,by phosphorylation. Signal

    transduction pathways commonly consist of many more

    cascaded modules between receptor and genome. There

    can be numerous intermediate steps before the signal

    transduction process ends, often with a change in the gene

    expression program of the cell. In the figure, the phos-

    phorylated response regulator is a DNA binding protein,

    which serves as a repressor, preventing the RNA polymer-

    ase from transcribing the adjacent gene(s).

    In addition to crosstalk between pathways, negative

    feedback systems canoccur, and thetime courseof a signal

    transduction pathway can be critical. It is therefore impor-

    tant to develop experimental techniques that allow quanti-

    tative measurements of proteins and protein interactions.

    Mathematical modeling and simulation in this field has the

    purpose to help and guide the biologist in designing experi-

    ments andgenerally to establisha conceptual framework in

    which to think. The article by Hasty [16] provides a survey

    of such in numero molecularbiology (see also [17]).Smolen

    [18]surveysmathematical modeling of transcriptional con-

    trol and future directions. The signal-oriented approach to

    cellular models by Kremling et al. [13] is an example of sys-

    tems and control theory bridging the gap between cellular

    biology and metabolic engineering. Progressing from

    merely descriptive models to predictive models will require

    the integration of data analysis and mathematical modeling

    with information stored in biological

    databases. The data we currently have

    available do not allow parametric sys-

    temsidentification techniquesto build

    predictive models. Instead, it is the

    systems thinking, the modeling pro-

    cess itself, that often proves useful.

    Example: Modeling ofRas/Raf-1/MEK/ERK

    Signal TransductionPathwayThe Ras/Raf-1/MEK/ERK module in

    Figure 4 is a ubiquitously expressed

    signaling pathway that conveys mito-

    genic and differentiation signals from

    the cell membrane to the nucleus

    [19]-[22]. This kinase cascade appears

    to be spatially organized in a signaling

    complex nucleated by Ras proteins.

    ThesmallG protein Rasis activatedby

    42 IEEE Control Systems Magazine August 2003

    EnvironmentalSignal Cell Surface Receptor

    (Sensor Kinase)

    IntracellularProtein

    Cell Membrane

    Phosphatase

    ResponseRegulator

    PhosphorylatedProtein

    RNAPolymerase

    Promoter Operator Gene(s)

    P

    P

    DNA

    Figure 3. Cell signaling (signal transduction). Intracellulardynamics (geneexpression)can be affected by extracellular signals.

    Receptors spanning the cell membrane receive signals and transmit

    the information to activate intracellular proteins (the response

    regulator). In the figure, theresponse regulatorbinds to theoperator

    region of a gene and prevents the RNA polymerase from

    transcription of the adjacent gene. A phosphatase ensures that theprocess is continuous.

    Figure 4. Biologistsdrawingfor theRas/Raf-1/MEK/ERKsignaltransductionpathway.

  • 8/2/2019 Control Biology, Wolkenhauersub

    6/11

    many growth factor receptors and binds to the Raf-1 kinase

    with high affinity when activated. This induces the recruit-

    ment of Raf-1 from the cytosol to the cell membrane. Acti-

    vated Raf-1 then phosphorylates and activates MAPK/ERK

    kinase(MEK), a kinasethat in turn phosphorylates andacti-

    vates extracellularsignal regulated kinase (ERK), the proto-

    typic mitogen-activated protein kinase (MAPK). Activated

    ERKs can translocate to the nucleus and regulate gene ex-pression by the phosphorylation of transcription factors.

    This kinase cascade controls the proliferation and differen-

    tiation of different cell types. The specific biological effects

    arecrucially dependenton theamplitudeandhistoryof ERK

    activity. The adjustment of these parameters involves the

    regulation of protein interactions within this pathway and

    motivates a systems biological study. Figures 5 and 6 de-

    scribe the circuit diagrams of the biokinetic reactions for

    which a mathematical model is used to simulate the influ-

    ence of ligand variations on the pathway

    S E SE E P k

    k

    k+

    +

    1

    2

    3 .

    Signal transduction pathways can be represented as se-

    quences of enzyme kinetics reactions which turn a sub-

    strate S into a product P via an intermediate complex SE

    and regulated by an enzyme E. The rate by which the en-

    zyme- substrate complexSE is formed is denoted by k1. The

    complexSE holds twopossibleoutcomesin thenext step. It

    can be dissociated into E and S with a rate constant k2 or it

    canfurther proceed to form a productPwith a rate constant

    k3. It is required to express the relations between the rate of

    catalysisandthechangeof concentration for thesubstrate,

    theenzyme, thecomplex,and theproduct.Based on this re-

    action kinetics[23], wefirstconsidera basicmodeling block

    August 2003 IEEE Control Systems Magazine 43

    Figure 5. Basic pathway modeling block. The pathway model isconstructed from basic reaction modules like this enzyme kinetic

    reaction for which a set of four differential equations is required.

    Figure 6. Graphical representation of the Ras/Raf-1/MEK/ERK signal transduction pathway (the shadowed part represents thesuppression by RKIP): a circle represents a state for the concentration of a protein and a bar a kinetic parameter of the reaction to be

    estimated. The directed arc (arrows) connecting a circle and a bar represents a direction of a signal flow. The bidirectional thick arrows

    represent anassociationand a dissociationrateat thesame time. Thethin unidirectionalarrows represent a productionrateofproducts.

  • 8/2/2019 Control Biology, Wolkenhauersub

    7/11

    of signal transduction pathways. This basic modeling block

    is illustrated in Figure 5 and can be described by the follow-

    ing set of nonlinear ordinary differential equations:

    dm t

    dtk m t m t k m t

    dm t

    dt

    k m t m

    11 1 2 2 3

    21 1 2

    ( )( ) ( ) ( )

    ( )( ) (

    = +

    = t k m t k m t

    dm t

    dtk m t m t k m t k

    ) ( ) ( )

    ( )( ) ( ) ( )

    + +

    =

    2 3 3 3

    31 1 2 2 3 3m t

    dm t

    dtk m t

    3

    43 3

    ( )

    ( )( ).=

    From these we have

    m t m t C

    m t m t m t C

    2 3 1

    1 3 4 2

    ( ) ( ) ,

    ( ) ( ) ( ) .

    + =

    + + =

    Hence we can describe the basic reaction module by two

    nonlinear equations subject to two algebraic conditions.

    In general, for a given signal transduction system, the

    whole pathway can be modeled by a set of nonlinear dif-

    ferentialequationsand a setof algebraicconditions in the

    following form:

    d t

    dtt t

    m t Cii p

    j

    mf m k

    ( )( ( ), ( )),

    ( ) ,{ , }

    =

    =

    1

    where m( ) [ ( ), ( ), , ( )]t m t m t m t p= 1 2 K , k( ) [ ( ), ( ), ,t k t k t = 1 2 K

    k tq ( )], p is thenumber ofproteinsinvolved inthe pathway,q

    is the required number of parameters, and j J{ , , }1K with

    the number of algebraic conditions J p< .

    Parameterestimation is widely regarded as a majorprob-

    lem in dynamic pathway modeling [25], [26]. A simple

    method firstdiscretizes the nonlinear differential equations

    into algebraic difference equations that are linear with re-

    44 IEEE Control Systems Magazine August 2003

    Figure 7. Illustration of parameter estimation from time series data: Each parameter is determined from the value to which the estimatesconverge (shown by the horizontal line). (Note that any experimental noise can be further eliminated by regression techniques if multiple

    experimental replicates at each time point are available.)

  • 8/2/2019 Control Biology, Wolkenhauersub

    8/11

    spect to theparameters andthen solve the transformed lin-

    ear algebraic difference equations to obtain the parameter

    values at each sampling time point. We can then estimate

    the required parameter values by employing curve fitting,

    calculation of steady state values, and regression tech-

    niques. Forthis purpose of parameterestimation, theprevi-

    ous equations are transformed into

    k g mm

    ( ) ( ),( )

    t td t

    dt=

    ,

    and this can be further transformed into a set of algebraic

    difference equationsby approximatingthe differential oper-

    ator vector g via a difference operator vector h as

    k h m m m( ) ( ( ), ( ), , ( ))t t t t r 1 K

    where r depends on the order of approximation. Without

    lossof generality, k( )t canbe approximated by k since most

    of the signal transduction systems can be regarded as

    slowly time varying systems compared with the measure-

    ment windows in time scale. Hence we have

    k h m m m ( ( ), ( ), , ( ))t t t r 1 K ,

    which implies the parameter estimates based on time

    course measurements.

    The entire model, as shown in Figure 6, is constructed in

    this way, leading to what usually becomes a relatively large

    set of differential equations for which parameter values

    have to be identified. As illustrated inFigure 7, in theestima-

    tion ofparameters fromwestern blot data, theparameteres-

    timates usually appearas a time dependentprofilesince the

    time course data include various uncertainties. However,

    since the signal transduction system itself can be consid-

    ered as time invariant, the estimated parameter profile

    shouldconverge to a constant valueat steadystate. Figure 7

    illustrates this estimation procedure.

    August 2003 IEEE Control Systems Magazine 45

    Figure 8. Thesimulationresults forfixed initial conditions: (a)shows the bindingof RKIP to Raf-1*,(b) shows thebindingof MEK-PPtoERK-P, (c) shows the binding of ERK-PP to Raf-1*/RKIP, and (d) shows the binding of RP to RKIP-P.

  • 8/2/2019 Control Biology, Wolkenhauersub

    9/11

    If a reasonable model is constructed, this can then be

    used in a variety of ways to validate and generate hypothe-

    ses, or to help experimental design [22], [24]. Based on the

    mathematical model illustrated in Figure 6 and the esti-

    mated parameter values as, for example, obtained using a

    discretization of the nonlinear ordinary differential equa-

    tions (as illustrated in Figure 7), we can perform simulation

    studies to validate the signal transduction mechanism as il-lustrated in Figure 5 and also to analyze the signal

    transduction system with respect to the sensitivity for the

    ligand (via simulation of variable initial conditions) as illus-

    trated in Figure 9.

    Current Research:An International PerspectiveThe interest in systems biology is documented by the in-

    creasing number of conferences, research groups, and insti-

    tutes dedicatedto thisarea. Funding initiatives in theUnited

    States, Japan, South Korea, and Germany provide new op-

    portunities to the engineering sciences. Systems biologyprovidesevidenceof the growinginvolvement of control en-

    gineers, notjustat thetechnological level,but also playing a

    vital role in the development of novel methodological ap-

    proaches in mathematical modeling, simulation, and data

    analysis [27]. To allow large scalesimulations, international

    projectsaredevelopinga systems biology markup language

    (SBML) [28] and Systems Biology Workbench to allow the

    integration of models and simulation tools (visit

    http://www.sbw-sbml.org for further information).

    The principal challenge for the biomedical sciences is to

    answer the following questions [6]:

    How do cells actandinteract withinthe context of the

    organism to generatecoherent andfunctional wholes?

    How do genes act and interact within the context of

    the cell to bring about structure and function?

    Forsystems biology we cansummarize thechallenges as

    follows:

    methodologies for parameter estimation

    experimental and formal methods for model valida-

    tion

    identification of causal relationships, feedback, and

    circularity from experimental data

    modular representationsand simulation of large scaledynamic systems

    investigations into the stabilityand robustness of cel-

    lular systems

    46 IEEE Control Systems Magazine August 2003

    Figure 9. The simulation results for variable initial conditions: (a) shows the variation of Raf-1*, (b) shows the variation of ERK, (c)shows the variation of RKIP, and (d) shows the variation of RKIP-P.

  • 8/2/2019 Control Biology, Wolkenhauersub

    10/11

    visualization and fusion of information, integration of

    models and simulators

    scalingmodels acrosstimescales anddescription lev-

    els (from genes, transcripts, and proteins to cells and

    organisms).

    Gene expression takes place within the context of a cell

    and between cel ls , organs, and organisms. The

    reductionist approach is to isolate a system, conceptu-

    ally closingit off from its environment through the defini-

    tionof inputs and outputs. We inevitably loose information

    using this approach since conceptual closure amounts to

    the assumption of constancy for the external factors and

    the fact that external forces are described as a function of

    something inside the system. Different levels of cellular

    systems may require different modeling strategies, and ul-

    timately we require a common conceptual framework that

    integrates different models, forexample,differential (mass

    actionor rate) equations providea realistic modelingpara-

    digm for a single-gene or single-cell representation, but

    cell-to-cell and large-scale gene interaction networkscould, for example, be represented by finite-state models

    or using agent-based simulation.

    One should not jump to the conclusion, however, that

    systems and control theory could provide all answers to

    the challenges given by dynamic systems in molecular bi-

    ology. In dynamic systems theory, one can often ignore

    spatial aspects. However, time andspace are essential for

    explaining the physical reality of gene expression. The

    same component of a pathway may have a different func-

    tional role depending on its location within the cell. Al-

    though components of cells have specific locations, these

    locations lack exact coordinates. Not only signals are being

    transmitted but components also move around in a non-random fashion. Without spatial entailment there canbe no

    livingcell,andfor systems biology it is thus necessary to in-

    tegrate a topological representation of this organization

    with models of the dynamic behavior.

    The biologist Frank Harold presented an excellent dis-

    cussion of the complexity of cellular processes and pro-

    vided a compelling argument for the need for more

    research in complexity studies: From the chemistry of

    macromolecules and the reactions that they catalyze, lit-

    tle can be inferred regarding their articulation into physi-

    ological functions at the cellular level, and nothing

    whatever can be said regarding the form of developmentof these cells. It therefore seems to me self-evident that

    the quest for the nature of life cannot be conducted exclu-

    sively on the biochemists horizon. We must also inquire

    how molecules are organized into larger structures, how

    direction and function and form arise, and how parts are

    integrated into wholes [9].

    In general, causation is a principle of explanation of

    change in the realm of matter. In systems biology causation

    is defined as a (mathematical) relationship, not between

    materialobjects(molecules),butbetweenchanges of states

    within and between elements of a system. Instead of trying

    to identify genes as causalagents forsome function,role,or

    change in phenotype we relate these observations to se-

    quences of events. In other words, instead of looking for a

    gene that is the reason, explanation, or cause of some phe-

    nomenon we seek an explanation in the dynamics (se-

    quences of events ordered by time) that led to it. It is

    systems dynamics, not a genetic program that gives rise tobiological forms and function [9, p. 199].

    In analyzing experimental data, we usually rely on as-

    sumptions made about the ensemble of samples. A statisti-

    cal or average perspective, however, may hide short-term

    effects that are thecausefor a whole sequence of eventsin a

    genetic pathway. What in statistical terms is considered an

    outlier may just be the phenomenon the biologist is looking

    for. It is verydifficult to obtainsufficiently large andreliable

    data sets for pathway modeling; it is therefore important to

    compare different methodologies, their implicit assump-

    tions, and the consequences of the biological questions

    asked.To allow reasoning in thepresenceof uncertainty, wehave to be precise about uncertainty, and if we cannot be

    precise about uncertainty, modeling (generating hypothe-

    ses), and model validation (for hypothesis testing) become

    complementary aspects of an iterative process. It is thus of

    paramount importance that we strive to bridge the gap be-

    tween data and models. In the words of Bertalanffy: Thus

    even supposedly unadulterated facts of observation al-

    ready are interfused with all sorts of conceptual pictures,

    model concepts, theories, or whatever expression you

    choose. The choice is not whether to remain in the field of

    data or to theorize; the choice is only between models that

    are more or less abstract, generalized, near or more remote

    from direct observation, more or less suitable to representobserved phenomena [29].

    ConclusionsSystems biology marks a shift away from an often obses-

    sively reductionist (molecular) approach to providing a

    causal and dynamic account of cellular form and function.

    The biggest if not the principal hurdle for the systems ap-

    proach is Zadehs uncertainty principle, which states

    that as the complexity of a system increases, our ability to

    make precise and yet significant statements about its be-

    havior diminishes until a threshold is reached beyond

    which precision and significance (or relevance) become al-most exclusive characteristics.

    The cell is a self-controlled andself-regulatingdynamicsys-

    tem consisting of components that are interacting in space

    and time. The relationships that prevail between structure,

    function, and regulation in cellular networks are still largely

    unknown. Systems biology aims to identify and explain these

    relationships through an integrated effort of both experimen-

    tal and theoretical methodologies. For this we require scien-

    tists who are preparedto investtimeand effort intomore than

    one discipline and scientific culture. This will necessitate a

    August 2003 IEEE Control Systems Magazine 47

  • 8/2/2019 Control Biology, Wolkenhauersub

    11/11

    change in the education, training, and career prospects of in-

    terdisciplinaryscientists. Forthose whopersist, the rewardis

    a better understanding of life itself.

    References[1] O.G. Vukmirovic et al., InsightFunctional genomics, Nature, vol. 405,

    no. 15, pp. 819-846, June 2000.

    [2] W. Weaver, Science and complexity, Amer. Scientist, vol. 36, pp. 536-544,

    Oct. 1948.

    [3] P. Bak,How Nature Works: The Science of Self-Organized Criticality. Oxford,

    U.K.: Oxford Univ. Press, 1997.

    [4] S. Kauffman, At Home in the Universe: The Search for Laws of

    Self-Organization and Complexity. Oxford, U.K.: Oxford Univ. Press, 1995.

    [5] B. Goodwin, How the Leopard Changed Its Spots:The Evolution of Complex-

    ity. Princeton, NJ: Princeton Univ. Press, 2001.

    [6]R.V. Soleand B.C.Goodwin,Signsof Life:How Complexity Pervades Biology.

    New York: Basic Books, 2002.

    [7] L.G. Harrison, Kinetic Theory of Living Pattern, Cambridge, U.K.: Cam-

    bridge Univ. Press, 1993.

    [8] H. Meinhardt, The Algorithmic Beauty of Sea Shells. Berlin, Germany:

    Springer-Verlag, 1988.

    [9] F.M. Harold,TheWayof theCell:Molecules, Organismsand theOrderof Life.

    Oxford, U.K.: Oxford Univ. Press, 2001.

    [10]H. Kitano,Ed.,Foundations of Systems Biology. Cambridge,MA: MITPress,2000.

    [11]H. Kitano,Systemsbiology: A briefoverview,Science,vol.295,no.5,pp.

    1662-1664, Mar. 2002.

    [12] O. Wolkenhauer, Systems biology: The reincarnation of systems theory

    applied in biology?, Briefings Bioinform., vol. 2, no. 3, pp. 258-270, 2001.

    [13]A. Kremling,K. Jahreis,J.W. Lengeler, andE.D. Gilles,The organizationof

    metabolicreaction networks:A signal-oriented approachto cellularmodels,

    Metabolic Eng., vol. 2, no. 3, pp. 190-200, July 2000.

    [14] Caltech ERATO Kitano, Systems Biology Workbench, [Online].

    http://www.cds.caltech.edu/erato/

    [15] J. Downward, The ins and outs of signaling, Nature, vol. 411, no. 14, pp.

    759-762, June 2001.

    [16]J. Hasty,D. McMillen,F.Isaacs,and J.J.Collins, Computationalstudiesof

    gene regulatory networks: In numero molecular biology,Nature Reviews Ge-

    netics, vol. 2, no 4, pp. 268-279, Apr. 2001.

    [17]J.J. Tyson andM.C.Mackey, Molecular, metabolic, and genetic control,Chaos, vol. 11, no. 1, pp. 81-282, Mar. 2001.

    [18]P.Smolen,D.A. Baxter, and J.H.Byrne, Modeling transcriptionalcontrol

    in gene networksMethods, recent results, and future directions, Bull.

    Math. Biol., vol. 62, pp. 247-292, 2000.

    [19] K. Yeung, P. Janosch, D.W. Rose, H. Mischak, J.M. Sedivy, and W. Kolch,

    Mechanism of suppression of the Raf/MEK/Extracellular Signal-Regulated

    Kinase pathway by the Raf Kinase Inhibitor Protein, Mol. Cell. Biol., vol. 20,

    pp. 3079-3085, May 2000.

    [20] K. Yeung, T. Seitz, S. Li, P. Janosch, B. McFerran, C. Kaiser, F. Fee, K.D.

    Katsanakis, D.W. Rose,H. Mischak, J.M.Sedivy, andW. Kolch, Suppressionof

    Raf-1 kinase activity and MAP kinase signaling by RKIP, Nature, vol. 401, pp.

    173-177, Sept. 1999.

    [21] C.J. Marshall, Specificity of receptor tyrosine kinase signaling: Tran-

    sient versus sustained extracellular signal-regulated kinase activation, Cell,

    vol. 80, pp. 179-185, Jan. 1995.

    [22] K.-H. Cho, S.-Y. Shin, H.-W. Kim, O. Wolkenhauer, B. McFerran, and W.

    Kolch, Mathematical Modeling of the Influence of RKIP on the ERK Signaling

    Pathway(Lecture Notes in Computer Science). Milan, Italy: Springer-Verlag,

    2003.

    [23] D.P. Robert and M. Tom, Kinetic modeling approaches to in vivo imag-

    ing, Nature Rev.: Molecular Cell Biol., vol. 2, pp. 898-907, Dec. 2001.

    [24] K.-H. Cho, S.-Y. Shin, H.-W. Lee, and O. Wolkenhauer, Investigations into

    the analysis and modeling of the TNF mediated NF-B signaling pathway,

    in Proc. 3rd Int. Conf. Systems Biology, Stockholm, Sweden, p. 87, 2002 and Ge-

    nome Research (Special Issue on Systems Biology), to be published.

    [25] H.G. Bock, Numerical treatment of inverse problems in chemical reac-

    tion kinetics, in Modeling Chemical Reaction Systems, K.H. Ebert, Ed.

    Springer-Verlag: New York, vol. 18, pp. 102-125, 1981.

    [26] R. Hegger, H. Kantz, F. Schmuser, M. Diestelhorst, R.P. Kapsch, and H.

    Beige, Dynamical properties of a ferroelectric capacitor observed through

    nonlinear time series analysis, Chaos, vol. 8, pp. 727-736, 1998.

    [27]H. Kitano,Computationalsystems biology,Nature, vol.420, pp.206-210,

    14 Nov. 2002.

    [28] M. Hucka, A. Finney, H. Sauro, H. Bolouri, J. Doyle, and H. Kitano, The

    ERATO systems biology workbench: An integrated environment for

    multiscale and multitheoretic simulations in systems biology, in Founda-

    tions of Systems Biology, H. Kitano, Ed. Cambridge: MIT Press, 2001, chap. 6,

    pp. 125-143.

    [29] L. Bertalanffy, General Systems Theory. New York: Braziller, 1969.

    Olaf Wolkenhauerreceived the Dipl.-Ing.and BEng. degree

    incontrol engineering in 1994 fromtheUniversity ofApplied

    Sciences, Hamburg, Germany and the University of

    Portsmouth, U.K., respectively. He received the Ph.D. from

    the University of Manchester Institute of Science and Tech-

    nology (UMIST). From 1997 to 2002 he held a research lec-

    tureship at the Control Systems Centre, UMIST, and since

    2002 he has held a joint senior lectureship between the De-

    partment of Biomolecular Sciences and the Department of

    Electrical Engineering and Electronics. In 1999 and 2000 he

    was a visiting research fellow at Delft University of Technol-ogy, The Netherlands. He has authored two books, Possibil-

    ity Theory with Applications to Data Analysis (RSP, 1998) and

    Data Engineering(Wiley, 2001). Hisresearch interestis in the

    application of systems andcontrolmethodologiesto molec-

    ular and cell biology. Since July 2003 he has held the chair in

    Bioinformatics and Systems Biology at the University of

    Rostock. He can be contacted at the Department of

    Biomolecular Sciences and Department of Electrical Engi-

    neeringandElectronics,ControlSystems Centre,University

    of Manchester Instituteof ScienceandTechnology , P.O. Box

    88,Manchester M601QD,U.K.,[email protected].

    Hiroaki Kitano is a director at Sony Computer Science

    Laboratories, Inc., director of ERATOKitano Symbiotic Sys-

    tems Project, JST, and president of The Systems BiologyIn-

    stitute. He received his B.A. in physics from International

    Christian University, Tokyo, in 1984 and a Ph.D. in com-

    puter science from Kyoto University in 1991. He received

    theComputersand ThoughtAwardin 1993 and thePrixArs

    Electronica in 2000.

    Kwang-Hyun Cho receivedhisB.S., M.S., andPh.D. degrees

    in electrical engineering from the Korea Advanced Institute

    of Science and Technology in 1993, 1995, and 1998, respec-

    tively. He joined the School of Electrical Engineering at theUniversity of Ulsan,Korea, in 1999 as an assistantprofessor.

    From 2002 to 2003, he was a research fellow at the Control

    Systems Centre, Department of Electrical Engineering and

    Electronics at the University of Manchester Institute of Sci-

    ence and Technology, U.K. His research interests cover the

    areas of systems science and control engineering including

    analysis andsupervisory control of discreteevent systems,

    nonlinear dynamics, hybrid systems, and applications to

    complex systems suchas communicationnetworksand bio-

    logical systems.

    48 IEEE Control Systems Magazine August 2003