8
Endemic and emerging infectious diseases caused by microbial pathogens pose a con- siderable threat to human and animal health worldwide 1–5 . At the molecular level, patho- gen evolution, transmission dynamics and genome plasticity, which account for many of the pathogenic properties of infectious agents, have been the focus of much research 2,3,5,6 . Functional and comparative genomics have improved both our understanding of micro- bial pathogenesis 7,8 and what constitutes the border between commensal and pathogenic organisms 9 . It is becoming increasingly evi- dent that genome-sequence changes have an impact on the pathogenic potential and host tropism of many bacterial pathogens 10 . With the availability of complete genome sequences for many strains of different bacte- rial pathogens, the development of genetic tools to screen for genome polymorphisms is being rigorously pursued. In this Opinion article, we discuss how genetic markers that are associated with bacterial genome fluidity can be developed and harnessed for diag- nostics, molecular epidemiology and vaccine production. Mechanisms of genome alteration There are three main mechanisms of large-scale genome alteration by which pathogens alter their genomes to evolve into pathotypes — subgroups of strains that cause disease using common sets of virulence factors — that are adapted to specific host niches: gene acquisition by horizontal gene transfer (HGT); gene duplication followed by amplification and genome decay, which can occur through HGT; and DNA dele- tions, rearrangements and point mutations. These alterations create genetic variability that, upon selection, shapes the content of the genome in response to environmental conditions (FIG. 1). Genome evolution is a continuous process that comprises long-term ‘macroevolution’, which leads to the development of new spe- cies or subspecies over millions of years, and short-term ‘microevolution’, which spans days or weeks and leads to the alteration of genes and traits in short time frames 11 . With respect to their susceptibility to these evolutionary processes, bacterial genomes comprise stable regions that form the core genome and variable regions that form the flexible gene pool 12 . The flexible component of the genome can accommodate and amel- iorate rearrangements owing to homologous recombination and the activities of phages, plasmids and transposons, and can also accommodate large mobile regions that are known as genomic islands (GEIs) 13,14 . It seems that some GEIs are acquired once during the evolutionary lifetime of a lineage, and are subsequently subject to mutation to prevent further transmission and integra- tion. Pathogenicity islands (PAIs) — a sub- group of GEIs — were originally described in uropathogenic Escherichia coli (UPEC) as clusters of virulence genes that are absent in closely related strains or species 15 . There is substantial evidence indicating that HGT shapes the acquisition, integration and maintenance of PAIs in microbial genomes 12,16,17 . Gene duplication and HGT are intercon- nected phenomena in which HGT enhances the propensity for gene amplification by augmenting the availability of multiple copies of identical DNA regions that can serve as targets for homologous recombina- tion 18 . Although a reversible phenomenon that does not result in permanent genetic change, amplification facilitates adaptation to changing environmental conditions 18 , which affects the fitness and immunogenic properties of many bacterial pathogens. In addition, short tandem DNA-sequence repeats that originate as a result of misalign- ment during replication can have an impact on gene regulation and protein production 19 . Extensive genomic reduction is an additional evolutionary force that is active on a variable evolutionary timescale. One possible outcome of this process is the phe- nomenon of endosymbiosis, which occurs as the organism that is undergoing genomic reduction loses its metabolic and regula- tory versatility. Mechanisms for genomic ‘downsizing’ can permanently alter bacterial genotypes and result in adaptation to the environment through genome optimization. This can eventually result in a minimal genome that is sufficient to support only the basic metabolic activities of the pathogen or to allow successful exploitation of the host. An interesting example of bacterial- genome optimization mediated by gene loss and increasing metabolic specialization is that of Yersinia pestis and its closest relative Yersinia pseudotuberculosis. In Y. pestis, 466 genes are inactivated or deleted compared with Y. pseudotuberculosis 1,20,21 . Y. pestis- specific phenotypes, such as cysteine requirement, the stimulatory effect of carbon dioxide on growth, the inability to OPINION Genomic fluidity and pathogenic bacteria: applications in diagnostics, epidemiology and intervention Niyaz Ahmed, Ulrich Dobrindt, Jörg Hacker and Seyed E. Hasnain Abstract | The increasing availability of DNA-sequence information for multiple pathogenic and non-pathogenic variants of individual bacterial species has indicated that both DNA acquisition and genome reduction have important roles in genome evolution. Such genomic fluidity, which is found in human pathogens such as Escherichia coli, Helicobacter pylori and Mycobacterium tuberculosis, has important consequences for the clinical management of the diseases that are caused by these pathogens and for the development of diagnostics and new molecular epidemiological methods. NATURE REVIEWS | MICROBIOLOGY VOLUME 6 | MAY 2008 | 387 PERSPECTIVES © 2008 Nature Publishing Group

Genomic fluidity and pathogenic bacteria: applications in diagnostics, epidemiology and intervention

Embed Size (px)

Citation preview

Endemic and emerging infectious diseases caused by microbial pathogens pose a con-siderable threat to human and animal health worldwide1–5. At the molecular level, patho-gen evolution, transmission dynamics and genome plasticity, which account for many of the pathogenic properties of infectious agents, have been the focus of much research2,3,5,6. Functional and comparative genomics have improved both our understanding of micro-bial pathogenesis7,8 and what constitutes the border between commensal and pathogenic organisms9. It is becoming increasingly evi-dent that genome-sequence changes have an impact on the pathogenic potential and host tropism of many bacterial pathogens10.

With the availability of complete genome sequences for many strains of different bacte-rial pathogens, the development of genetic tools to screen for genome polymorphisms is being rigorously pursued. In this Opinion article, we discuss how genetic markers that are associated with bacterial genome fluidity can be developed and harnessed for diag-nostics, molecular epidemiology and vaccine production.

Mechanisms of genome alterationThere are three main mechanisms of large-scale genome alteration by which pathogens alter their genomes to evolve

into pathotypes — subgroups of strains that cause disease using common sets of virulence factors — that are adapted to specific host niches: gene acquisition by horizontal gene transfer (HGT); gene duplication followed by amplification and genome decay, which can occur through HGT; and DNA dele-tions, rearrangements and point mutations. These alterations create genetic variability that, upon selection, shapes the content of the genome in response to environmental conditions (FIG. 1).

Genome evolution is a continuous process that comprises long-term ‘macroevolution’, which leads to the development of new spe-cies or subspecies over millions of years, and short-term ‘microevolution’, which spans days or weeks and leads to the alteration of genes and traits in short time frames11. With respect to their susceptibility to these evolutionary processes, bacterial genomes comprise stable regions that form the core genome and variable regions that form the flexible gene pool12. The flexible component of the genome can accommodate and amel-iorate rearrangements owing to homologous recombination and the activities of phages, plasmids and transposons, and can also accommodate large mobile regions that are known as genomic islands (GEIs)13,14. It seems that some GEIs are acquired once

during the evolutionary lifetime of a lineage, and are subsequently subject to mutation to prevent further transmission and integra-tion. Pathogenicity islands (PAIs) — a sub-group of GEIs — were originally described in uropathogenic Escherichia coli (UPEC) as clusters of virulence genes that are absent in closely related strains or species15. There is substantial evidence indicating that HGT shapes the acquisition, integration and maintenance of PAIs in microbial genomes12,16,17.

Gene duplication and HGT are intercon-nected phenomena in which HGT enhances the propensity for gene amplification by augmenting the availability of multiple copies of identical DNA regions that can serve as targets for homologous recombina-tion18. Although a reversible phenomenon that does not result in permanent genetic change, amplification facilitates adaptation to changing environmental conditions18, which affects the fitness and immunogenic properties of many bacterial pathogens. In addition, short tandem DNA-sequence repeats that originate as a result of misalign-ment during replication can have an impact on gene regulation and protein production19.

Extensive genomic reduction is an additional evolutionary force that is active on a variable evolutionary timescale. One possible outcome of this process is the phe-nomenon of endosymbiosis, which occurs as the organism that is undergoing genomic reduction loses its metabolic and regula-tory versatility. Mechanisms for genomic ‘downsizing’ can permanently alter bacterial genotypes and result in adaptation to the environment through genome optimization. This can eventually result in a minimal genome that is sufficient to support only the basic metabolic activities of the pathogen or to allow successful exploitation of the host.

An interesting example of bacterial-genome optimization mediated by gene loss and increasing metabolic specialization is that of Yersinia pestis and its closest relative Yersinia pseudotuberculosis. In Y. pestis, 466 genes are inactivated or deleted compared with Y. pseudotuberculosis1,20,21. Y. pestis-specific phenotypes, such as cysteine requirement, the stimulatory effect of carbon dioxide on growth, the inability to

o p i n i o n

Genomic fluidity and pathogenic bacteria: applications in diagnostics, epidemiology and interventionNiyaz Ahmed, Ulrich Dobrindt, Jörg Hacker and Seyed E. Hasnain

Abstract | The increasing availability of DNA-sequence information for multiple pathogenic and non-pathogenic variants of individual bacterial species has indicated that both DNA acquisition and genome reduction have important roles in genome evolution. Such genomic fluidity, which is found in human pathogens such as Escherichia coli, Helicobacter pylori and Mycobacterium tuberculosis, has important consequences for the clinical management of the diseases that are caused by these pathogens and for the development of diagnostics and new molecular epidemiological methods.

NATUrE rEvIEWs | microbiology vOlUME 6 | MAy 2008 | 387

PerSPecTiveS

© 2008 Nature Publishing Group

Nature Reviews | Microbiology

DNA acquisitionby HGT (transformation, transduction and conjugation)

Genome reduction(deletions) Genome

optimization(rearrangementsand mutations)

Genomeoptimization(rearrangementsand mutations)

use hexose through the pentose–phosphate pathway, altered lipid A acylation, the gene-ral loss of adhesins and altered motility, can be explained by gene inactivation that con-tributes to the evolution of a highly virulent epidemic clone from a less-virulent, closely related progenitor21.

In addition, obligate symbiotic bacteria, as well as obligate intracellular bacterial pathogens, often undergo reductive genomic evolution through the generation of pseudo-genes and gene-deletion events22,23. Genomic reduction has been extensively studied in mycobacteria by monitoring the presence or absence of regions of difference (rD)24 in the context of the evolution of different clonal lineages of the tubercle bacilli (FIG. 2). rDs are genomic regions that have been deleted from the genomes of ancestral forms of myco-bacteria, thereby generating more successful forms (‘modern’ species). The ancestor of the Mycobacterium tuberculosis complex under-went a series of precisely timed genomic-deletion events to increase its fitness in new hosts. It is thought that rD1 is the cause of adaptation of M. tuberculosis to human hosts (that is, the creation of a ‘specialist’ mycobacterium), as the Mycobacterium avium complex, from which rD1 has been deleted, have remained ‘generalists’ (ReF. 25).

However, until the role of a given rD dele-tion in either virulence or host tropism is experimentally proven, its absence could simply be a marker that coincides with other genomic changes (for example, sin-gle-nucleotide polymorphisms (sNPs) or microdeletions) that are the actual cause of the functional adaptation.

Harnessing genomic fluidityE. coli. E. coli is an excellent representative of the full spectrum of pathogen genomic fluidity. As one of the model organisms that was initially used to analyse bacterial genome fluidity and its impact on the evolu-tion of pathogenic variants, the contribution of plasmid-mediated and phage-encoded virulence factors and PAIs has been fully described for E. coli16,26. Plasmids, phages and PAIs all play a crucial part in the evolution of different E. coli pathotypes (FIG. 3).

One main feature of the different intestinal E. coli pathotypes is the presence of pathotype-specific plasmids that often encode toxins. The characteristic protein toxins of enterotoxigenic, enteroaggrega-tive, enteroinvasive, enterohaemorrhagic and enteropathogenic E. coli (and also extraintestinal pathotypes) are plasmid-encoded27. Additionally, the coding regions

for identical or closely related protein toxins, such as alpha-haemolysin, cytotoxic necrotizing factor and autotransporter serine proteases, can be located on plasmids, as well as on chromosomal PAIs28. Furthermore, large so-called colicin plasmids seem to contain several gene clusters, including the salmochelin determinant, that can also be found within chromosomal PAIs in E. coli and closely related species.

A large fraction of the genomic dif-ferences between closely related E. coli strains can be accounted for by the activ-ity of phages. Approximately 40% of the enterohaemorrhagic E. coli (EHEC) O157-specific elements are present in 1 of the 18 cryptic prophages or in the only intact phage (933W), which also contains the virulence genes that are most characteristic of O157:H7 — those that encode the shiga toxin29. The fact that E. coli O157 prophages exhibit extensive structural and positional diversity suggests that prophage variation is one of the most important factors in generating genome diversity among O157 strains30,31.

As a result of the marked E. coli genome fluidity, the stepwise acquisition of ‘foreign’ DNA and loss or inactivation of DNA regions has resulted in different metabolic and pathogenic features that can distin-guish different strains and pathotypes. For example, the parallel gain and loss of mobile genetic elements, such as phages, virulence plasmids and the locus of enterocyte effacement PAI, in different E. coli lineages enabled the evolution of separate clones that belong to different E. coli pathotypes (enterohaemorrhagic and enteropathogenic E. coli) and are associated with specific disease symptoms32. similarly, enteroinva-sive E. coli and Shigella spp. evolved from commensal E. coli by the uptake of a large virulence plasmid and inactivation of the cad determinant that encodes a lysine decar-boxylase. This pathoadaptive mutation has also been termed a black hole33,34.

What are the potential consequences of our increasing knowledge of the genetic diversity and genome content of E. coli? The identification of novel GEI-encoded proteins that are involved in bacterial colonization and/or pathogenesis should aid in the devel-opment of new strategies to tackle E. coli infection35. Knowledge of the prevalence of genes among certain subgroups of E. coli should also facilitate specific preventive approaches; for example, development of a UPEC-specific vaccine36,37.

DNA polymorphisms that are a result of repeated DNA acquisition and gene loss can be useful for strain typing and

Figure 1 |mechanismsthatcontributetobacterialgenomeevolution.Genome plasticity results from DNA acquisition by horizontal gene transfer (HGT; for example, through the uptake of plasmids, phages and naked DNA) and genome reduction by DNA deletions, rearrangements and point muta-tions. The concerted action of DNA acquisition and gene loss results in a genome-optimization process that frequently occurs in response to certain growth conditions, including host infection or colonization.

P e r s P e c t i v e s

388 | MAy 2008 | vOlUME 6 www.nature.com/reviews/micro

© 2008 Nature Publishing Group

Nature Reviews | Microbiology

Common ancestor Mycobacterium leprae

Mycobacterium canettii

RD9

TbD1

RD8 RD7

RD12

RD14

Mycobacterium tuberculosis (ancestral)

M. tuberculosis

M. africanumMycobacteriumafricanum

Mycobacterium pinnipedii

Mycobacterium microti

Mycobacterium bovis

Mycobacterium caprae

M. bovis bacille Calmette–Guérin

Decay (pseudogenization)

RD10

RD13

RD4

RD1

RD2

RDsealRDMic

RDcan

risk assessment (that is, evaluation of the likelihood and severity of an infection that can be caused by a bacterial isolate). For example, the mutS–rpoS genomic region of E. coli isolates differs with phylogeny and pathotype38. GEIs and other mobile genetic elements can also be used as markers for diagnostic purposes to identify certain E. coli populations or subpopulations that have pathogenic potential and, in some cases, to predict their antibiotic resistance. The presence or absence of PAIs or GEIs can be confirmed without culturing the bacterium of interest by PCr and/or DNA sequencing. The detection of certain virulence-associated marker genes also facilitates risk assessment in routine diagnostics; for example, in assays to detect the shiga-toxin-encoding genes in certain intestinal pathogenic E. coli (shiga-toxin-expressing E. coli (sTEC)), as well as the K1-capsule determinant of E. coli isolates in newborn meningitis. Promising candidate virulence markers for subgroups of E. coli that lack common major virulence factors, such as extraintestinal pathogenic E. coli, can also be identified39–41. This removes the need to culture organisms in the laboratory and can prevent hazards that are related to the handling and transportation of live cultures.

Increased knowledge of genome fluid-ity, including information on the stability of PAIs or prophages, can be used to develop approaches that specifically atten-uate strains of pathogenic E. coli without affecting their viability. As such strains are also becoming increasingly antibiotic resistant, development of antimicrobial strategies that target bacterial virulence and reduce selective pressure for resist-ance is of substantial importance42. Data continue to accumulate regarding PAI-deletion frequencies in E. coli in response to different growth conditions and envi-ronmental stimuli, as well as information on the specificity of the P4-like phage integrases that are responsible for the chromosomal insertion and excision of PAIs43–45. Consequently, future preventive and therapeutic approaches could aim to selectively induce PAI deletion. It is tempt-ing to speculate whether an established bacterial infection could be treated by inducing virulence-gene deletion in the colonizing bacteria. Future attenuated vac-cines could also be developed by inducing PAI deletion, thereby offering prophylactic interventions.

E. coli genome fluidity could also be exploited by using attenuated E. coli strains as probiotics. Examples of naturally occurring attenuated variants that are

derived from a virulent extraintestinal pathogenic ancestor include E. coli strain Nissle 1917 and asymptomatic bacteriuria (ABU) isolates. E. coli strain Nissle 1917 and ABU strain 83972 have been shown to be closely related to UPEC isolates from symptomatic urinary-tract infections. These attenuated strains have evolved by reductive genome evolution, which included deletion of a PAI (and other genomic-DNA regions) and virulence-gene inactivation owing to point muta-tion9,46–48. Interestingly, both Nissle 1917 and ABU strain 83972 are used as probiot-ics and are able to out-compete pathogenic bacteria under certain conditions49–51.

Genome fluidity that is manifested by the loss and transfer of shiga-toxin-gene (stx)-containing phages during the course of an EHEC infection could result in dif-ferent E. coli pathotypes (EHEC versus atypical enteropathogenic E. coli), and thus has clinical, diagnostic and epide-miological consequences52. shiga toxin is considered to be the crucial factor in the development of haemolytic uraemic syndrome, a leading cause of acute renal failure in children. recently, it has been shown that loss of the stx-carrying phages by EHEC O157:H7 does occur during human infection53. As stx detection is

routinely used to screen for EHEC, stx-negative variants of E. coli (which are still able to cause diarrhoea in humans) are not detected. This is crucial from a public-health perspective, as it hampers appropriate clinical management and epidemiological investigations54.

Helicobacter pylori. H. pylori is an impor-tant human pathogen that has evolved by aggregating different GEIs in its chromo-some, including GEIs that encode three major type Iv secretion systems55–57. The first secretion system identified comprises 29 genes that are located on the cag PAI58. Because of its geographically conserved insertion, deletion and substitution patterns, the cag PAI has recently been used as a marker of human migration59. sequence polymorphisms in the form of insertion and deletion (indel) changes within an unstable region between the right junction of the cag PAI and the gene that encodes glutamate racemase (glr)60 have been widely used as a signature for strain identification. Based on the instabil-ity of the cagA right junction, several inde-pendent molecular-epidemiology studies have been carried out that involve the gen-otyping of hundreds of strains and isolates from different continents to decipher the

Figure 2 |currentevolutionaryscenarioforpathogenicmycobacteria.extensive genomic downsizing has been a dominant trend in the evolution of the fittest forms of mycobacteria with vari-ous host tropisms. Polymorphisms in genomic regions of differences (rDs), which are indicated by differently coloured boxes, have recently become the most accepted method for the identifi-cation of tubercle bacilli24,25. Figure adapted, with permission, from ReF. 24 (2002) National Academy of Sciences.

P e r s P e c t i v e s

NATUrE rEvIEWs | microbiology vOlUME 6 | MAy 2008 | 389

© 2008 Nature Publishing Group

Nature Reviews | Microbiology

LEE

LEE

Black hole

PhageStxφ

Plasmids PAls

Invasion plasmidpEAFpENT

Commensal E. coliEnterotoxigenic E. coli

Enteroinvasive E. coli andShigella spp.

Uropathogenic E. coli

Enteropathogenic E. coli

LEE

Enterohaemorrhagic E. coli

geographical partitioning of H. pylori with reference to gene-pool characteristics and their impact on transmission patterns in individual countries58,59,61–65. For example, the impact of human migration in Peru and India on the survival and persistence of H. pylori has recently been reviewed using core-genome haplotyping through the analysis of multilocus sequence types and cag-PAI genotypes59,66. H. pylori can import DNA from strains from differ-ent populations during co-infection65. However, it seems that the cag PAI and the region that surrounds it was exchanged by the Amerindian strains in Peru59, where the human population underwent major changes in recent history with the arrival of European conquerors and settlers. The European H. pylori strains harboured an intact, functional cag PAI, and the ‘endogenous’ H. pylori in Peru could have been out-competed by the newly arrived cag-PAI-positive strains59.

The relatedness of cagA-gene sequences from Indians and Europeans was exam-ined to test the hypothesis of gene flow in India through Indo–Aryans and the arrival of Neolithic practices and languages from

the Fertile Crescent66. The cag PAI of the Indian strains was found to be completely evolved (that is, an intact PAI without any indels), and was probably acquired from a European source well before the arrival of H. pylori in India66. These observations support the notion that anciently acquired genomic landmarks could be used as markers in studying pathogen biology at the interface of human anthropology.

The CagA antigen, one of the principal virulence factors of H. pylori, is encoded on the cag PAI and is a universally accepted virulence marker that is linked to gastric and peptic ulcer disease and gastric cancer67–69. H. pylori strains are often determined to be carcinogenic or ulcerogenic based on the composition of their cag PAI; for example, using the pres-ence or absence of a particular motif (the EPIyA motif; type C or D) within the cagA gene. EPIyA motifs are crucially involved in the phosphorylation of CagA, which is linked to the severity of atrophic gastritis and gastric carcinoma in patients who are infected with CagA-positive strains of H. pylori67. Therefore, several diagnostic and phenotypic studies have been based

on this phosphorylation event70–72, and many molecular diagnostic methods target genes or gene products that are encoded by the cag PAI, including CagA70,73,74.

Mycobacteria. Genomic deletions repre-sent an important mechanism of genetic variability in mycobacteria75. Analysis of such deletions at the population level and comparison with reference strains is a useful method for identifying genes that could be associated with transmissibility and virulence. Other methods that detect small changes in the genome, using a range of molecular markers, are also important in the context of assessing the most predominant strain in an epidemiological setting76–79.

reductive genomic evolution markers have now been validated for the lineage identification of certain pathogens, including members of the M. tuberculosis complex. Members of this complex, such as M. tuberculosis, Mycobacterium africanum or Mycobacterium bovis, can be differenti-ated by the presence or absence of certain rDs (FIG. 2), which represent markers of unidirectional deletion events (once a

Figure 3 |contributionofhorizontalacquisitionofmobilegeneticelementstotheevolutionofEscherichia colipathotypes.The uptake of mobile genetic elements (phages, virulence plasmids and pathogenic-ity islands), as well as the loss of chromosomal-DNA regions in different E. coli lineages, has enabled the evolution of separate clones, which

belong to different E. coli pathotypes and are associated with specific disease symptoms. Lee, locus of enterocyte effacement; PAi, pathogenic-ity island; peAF, enteropathogenic E. coli adhesion-factor plasmid; peNT, enterotoxin-encoding plasmids; Stx, Shiga-toxin-encoding bacteriophage.

P e r s P e c t i v e s

390 | MAy 2008 | vOlUME 6 www.nature.com/reviews/micro

© 2008 Nature Publishing Group

region is lost it cannot be regained80,81), and by certain sNPs82–85. These rDs24 or sNPs82, together with minisatellite and spoligotype markers, permit identifica-tion of ancestral or modern forms of M. tuberculosis2,86.

As discussed above, the evidence sug-gests that successive genome deletions in pathogenic mycobacteria have led to an increasingly defined host preference. For example, genomic deletions that are specific to M. tuberculosis (TbD1) might have ren-dered the organism more human-specific and aggressive in terms of outbreak poten-tial, compared with its ancestors. similarly, a specific genomic deletion could have

contributed to the greater pathogenic suc-cess of Mycobacterium pinnipedii in seals compared with other hosts76,87. several dele-tions in the M. bovis genome are suggested to be the cause of the successful attenuation of virulence in M. bovis bacille Calmette–Guérin (BCG) vaccine strains24. The deletion marker rD1 is always absent in attenuated strains such as M. bovis BCG24, and has been shown to be a powerful virulence-linked marker that carries virulence-associated genes, such as those that encode the EsAT6 and CFP10 antigens (discussed below)88–90.

Given that the population structure of members of the M. tuberculosis complex is highly clonal, and that HGT between strains

does not occur or is rare, rDs can serve as unidirectional markers for differentiating between closely related species, such as M. bovis, M. africanum and M. tuberculosis, and for gaining deeper insights into phe-notypic traits that have relevance to patho-genicity or vaccine efficacy91–93. In addition to deletion markers88 and conserved genes94, regions of the M. tuberculosis genome that cause strand slippage during replication, which results in strain diversity and gene-function redundancies, also have diagnostic applications95,96.

short-tandem repeats in the form of minisatellites97 and microsatellites98 have robust clinical applications in the manage-

Table 1 | Exploitation of bacterial genome fluidity for diagnostic and health-care applications

molecularmarkerorassay geneticormolecularchange Applicationtohealthcare

escherichia coli

virulence genes (for example, stx and kpsK1)

Presence or absence of the locus; expression of functional virulence factors

Diagnostic markers and antigens; markers of infection

Genomic islands, plasmids and phages

Presence or absence of the locus; expression of functional virulence factors

Diagnostic markers and antigens; markers of infection

Black holes Presence or absence of the locus; expression of functional virulence factors

Diagnostic markers and antigens; markers of infection

core-genome MLST evolution of housekeeping genes Lineage identification

SNPs Base substitution in candidate genes Screening for antimicrobial resistance, molecular epidemiology and evolution

Helicobacter pylori

indel of cag right junction Small-scale insertions and deletions at the right junction of the cag PAi

Markers of human migration and H. pylori lineage identification

cagA Presence or absence of the locus; expression of functional toxin

Diagnostic markers and antigens; markers of invasive gastric inflammation

cagA ePiYA motif Strain-specific tyrosine phosphorylation of cagA Geographical markers of gastric-cancer predisposition

Plasticity-region cluster Presence or absence of loci Putative virulence genes; possible interventional targets

core-genome MLST evolution of housekeeping genes Human-migration studies; lineage identification

Mycobacteria

rDs reductive evolution Diagnostic and genotyping applications for epidemiology; vaccine candidates (rD1)

Drs indel in Dr region Spoligotyping for epidemiology; strain identification

iS6110 rFLPs Mobile-element instability Outbreak investigation; strain identification

DUs Tandem duplication Quality control of BcG vaccines

SNPs Base substitution in candidate genes Screening for antimicrobial resistance, molecular epidemiology and evolution

MirUs (minisatellites) Tandem-repeat tract expansion or shrinkage Large-scale, high-throughput analysis that can monitor spread and transmission dynamics

Microsatellites replication errors Molecular epidemiology; strain identification

Pe–PPe protein family variable-repeat motifs and antigenic variation Diagnostic markers and antigens; putative virulence markers

erp protein family Plasticity of nucleotide-repeat motifs Diagnostic antigens and virulence markers; vaccine candidates (rD1)

BcG, Mycobacterium bovis bacille calmette–Guérin; Dr, direct repeat; DU, genome-duplication marker; MirU, mycobacterial repetitive interspersed unit; MLST, multilocus sequence typing; PAi, pathogenicity island; rD, region of difference; rFLP, restriction-fragment length polymorphism; SNP, single-nucleotide polymorphism.

P e r s P e c t i v e s

NATUrE rEvIEWs | microbiology vOlUME 6 | MAy 2008 | 391

© 2008 Nature Publishing Group

ment of tuberculosis outbreaks. shrinkage and expansion of these repeats, as well as their gain and loss, creates sufficient allelic diversity97 to allow the individualization of strains, thereby allowing investiga-tors to track the movement of clones during tuberculosis epidemics. Dozens of mycobacterial interspersed repetitive unit (MIrU) loci have now been tested in different laboratories, and independent databases exist for strain comparisons (for example, MIrU–vNTrplus; see Further information). MIrU-based markers2,97, together with broad classification markers, such as Is6110 restriction-fragment length polymorphisms99 and the spacer types100 in the direct repeat regions, are widely used in mycobacterial epidemiological studies2,101 (TABLe 1).

The PE–PPE gene families of M. tuberculosis constitute approximately 10% of the coding capacity of its genome, and are probably expanding through gene duplication and amplification. Importantly, members of these families are functionally linked to the EsAT6 (esx) gene cluster, which encodes a system for the secretion of members of the EsAT6 family of potent T-cell antigens102,103. It has recently been sug-gested that the esx-gene cluster facilitates the presentation of important antigens, including members of the PE–PPE protein families, to the host immune system. The preponderance of repetitive tracts within these gene families could contribute to the antigenic variation that is displayed by M. tuberculosis. It has also been speculated that the EsAT6 and CFP10 loci, coupled with their associated PE–PPE genes, might constitute a putative immuno-genicity island89,102,104,105. Two independent members of the PE–PPE families (rv2430c106 and rv2608 (ReF. 95)) have been shown to elicit a strong host humoral immune response that allows the specific detection of active M. tuberculosis infection in some patients. The PE–PPE family genes are non-randomly organized in the M. tuberculosis genome, and follow a defined pattern that has a role not only in function but also in pathogenesis107. The predicted cell-surface localization of the PE–PPE proteins also sug-gests a role for these proteins in host–patho-gen interactions. Moreover, in addition to rD1, other rDs that are deleted in the BCG strains have been shown to encode genes from the PE–PPE families. Finally, a com-parative genomic analysis of laboratory and clinical isolates of M. tuberculosis revealed the presence of sequence polymorphisms within the PE–PPE genes, a finding that is also suggestive of an important role for these

proteins in pathogenesis and immunity108.The M. tuberculosis Erp family of

exported proteins encode variable-repeat motifs within the central region of the protein109, and variations in the repeat length between the different mycobacterial species have potential uses in diagnostics. Homologues of the erp gene are present in Mycobacterium leprae, Mycobacterium smegmatis, M. avium, Mycobacterium marinum, Mycobacterium xenopi and the extracellular toxin-producing Mycobacterium ulcerans. Interestingly, the Erp protein has been reported to be asso-ciated with mycobacterial pathogenesis110, and deletion mutants of the erp gene can stimulate an immune response against mycobacteria. In another study, it was sug-gested that initial bacterial growth and the consequent histological outcome of infec-tion are conditioned by both the nature and number of Erp repeats103.

Nilsson et al.111 observed in Salmonella enterica serovar Typhimurium that extensive genomic deletions can occur over a short evolutionary timescale — possibly within weeks — leading to a reduction in microbial metabolic and regulatory versatility. Another process of in vitro genome evolution was observed for mycobacteria by analysis of suc-cessive generations of the original BCG vac-cine strain, which evolved frequently through various genomic-deletion and tandem-duplication events91. Consequently, because they are frequently sub-cultured, present-day BCG vaccine strains rapidly lose their protec-tive efficiency. Conversely, older versions of the BCG vaccine strain are considered to be superior in providing protection against tuberculosis compared with more commonly used variants. Genome instability in these bacteria can result in a variable antigenic repertoire that differs from M. tuberculosis or M. bovis and therefore seriously impacts on the protective efficacy of vaccine strains91. In addition, duplication and amplification of certain genetic elements could lead to the emergence of novel functions in BCG strains that could affect their antigenic repertoire and subsequent immunological proper-ties91. Molecular markers that are based on genomic deletions and tandem duplications could therefore be employed to validate older versions of the BCG vaccine strain. In addi-tion, these markers could be useful in quality validation and the genetically controlled pro-duction of BCG strains. Genome-duplication (DU) markers, for example DU1 and DU2 (ReF. 91), constitute potential candidates for the evaluation of protective efficacy of dif-ferent BCG strains. The potential uses of

molecular methods in routine quality control and identification of BCG vaccine strains are being discussed112,113.

ConclusionsIn the study of bacterial genome fluidity, many questions remain, including: how is bacterial genome fluidity regulated; what environmental stimuli are responsible for this fluidity; what is the in vivo relevance of bacterial genome fluidity; and how can bacterial genome fluidity be exploited for the generation and selection of optimally adapted microorganisms? Answering these questions will not only increase our knowledge of the mechanisms that are involved in the evolution and adaptation of bacterial pathogens, but will also have an impact on the development of accurate diagnostics and timely therapeutic interventions against infectious diseases. As the magnitude of bacterial genome fluidity becomes clear, the key research goals are evident: first, the exploitation of these data to design, improve and adapt appropriate diagnostics; second, the expansion of our knowledge of the underlying biology of each pathogen; and, finally, the exploitation of bacterial genome dynamics to develop new approaches to combat infectious diseases.

Niyaz Ahmed is at the Pathogen Evolution Laboratory, Center for DNA Fingerprinting and Diagnostics,

Nacharam, Hyderabad 500 076, India

Ulrich Dobrindt and Jörg Hacker are at the University of Würzburg, Institute for Molecular Biology of Infectious

Diseases, Röntgenring 11, 97070 Würzburg, Germany.

Jörg Hacker is also at the Robert-Koch Institut, Nordufer 20, 13353 Berlin, Germany.

Seyed E. Hasnain is at the University of Hyderabad, Hyderabad, 500 046, India, The Institute of Life

Sciences, HCU Campus, Hyderabad, 500 046, India and the Jawaharlal Nehru Centre for Advanced

Scientific Research, Jakkur, Bangalore 560 064, India.

N.A. and U.D. contributed equally to this work. Correspondence to J.H. and S.E.H.

e-mails: [email protected]; [email protected]

doi:10.1038/nrmicro1889 Published online 8 April 2008.

1.  Achtman, M. et al. Yersinia pestis, the cause of plague, is a recently emerged clone of Yersinia pseudotuberculosis. Proc. Natl Acad. Sci. USA 96, 14043–14048 (1999).

2.  Gutierrez, M. C. et al. Predominance of ancestral lineages of Mycobacterium tuberculosis in India. Emerg. Infect. Dis. 12, 1367–1374 (2006).

3.  Hammerschmidt, S., Hacker, J. & Klenk, H. D. Threat of infection: microbes of high pathogenic potential — strategies for detection, control and eradication. Int. J. Med. Microbiol. 295, 141–151 (2005).

4.  Hasnain, S. E. Molecular epidemiology of infectious diseases: a case for increased surveillance. Bull. World Health Organ. 81, 474 (2003).

5.  Hasnain, S. E. & Ahmed, N. Leptospirosis. Lancet Infect. Dis. 4, 543 (2004).

6.  Morschhäuser, J. et al. Evolution of microbial pathogens. Philos. Trans. R. Soc. Lond. B 355, 695–704 (2000).

P e r s P e c t i v e s

392 | MAy 2008 | vOlUME 6 www.nature.com/reviews/micro

© 2008 Nature Publishing Group

7.  Chakhaiyar, P. & Hasnain, S. E. Defining the mandate of tuberculosis research in a postgenomic era. Med. Princ. Pract. 13, 177–184 (2004).

8.  Prakash, P., Yellaboina, S., Ranjan, A. & Hasnain, S. E. Computational prediction and experimental verification of novel IdeR binding sites in the upstream sequences of Mycobacterium tuberculosis open reading frames. Bioinformatics 21, 2161–2166 (2005).

9.  Grozdanov, L. et al. Analysis of the genome structure of the nonpathogenic probiotic Escherichia coli strain Nissle 1917. J. Bacteriol. 186, 5432–5441 (2004).

10.  Doolittle, R. F. Biodiversity: microbial genomes multiply. Nature 416, 697–700 (2002).

11.  Ziebuhr, W., Ohlsen, K., Karch, H., Korhonen, T. & Hacker, J. Evolution of bacterial pathogenesis. Cell Mol. Life Sci. 56, 719–728 (1999).

12.  Dobrindt, U., Hochhut, B., Hentschel, U. & Hacker, J. Genomic islands in pathogenic and environmental microorganisms. Nature Rev. Microbiol. 2, 414–424 (2004).

13.  Hacker, J. & Carniel, E. Ecological fitness, genomic islands and bacterial pathogenicity. A Darwinian view of the evolution of microbes. EMBO Rep. 2, 376–381 (2001).

14.  Hacker, J. & Kaper, J. B. Pathogenicity islands and the evolution of microbes. Annu. Rev. Microbiol. 54, 641–679 (2000).

15.  Blum, G. et al. Excision of large DNA regions termed pathogenicity islands from tRNA-specific loci in the chromosome of an Escherichia coli wild-type pathogen. Infect. Immun. 62, 606–614 (1994).

16.  Dobrindt, U. (Patho-)genomics of Escherichia coli. Int. J. Med. Microbiol. 295, 357–371 (2005).

17.  Hacker, J., Hentschel, U. & Dobrindt, U. Prokaryotic chromosomes and disease. Science 301, 790–793 (2003).

18.  Craven, S. H. & Neidle, E. L. Double trouble: medical implications of genetic duplication and amplification in bacteria. Future Microbiol. 2, 309–321 (2007).

19.  Lovett, S. T. Encoded errors: mutations and rearrangements mediated by misalignment at repetitive DNA sequences. Mol. Microbiol. 52, 1243–1253 (2004).

20.  Achtman, M. et al. Microevolution and history of the plague bacillus, Yersinia pestis. Proc. Natl Acad. Sci. USA 101, 17837–17842 (2004).

21.  Chain, P. S. et al. Insights into the evolution of Yersinia pestis through whole-genome comparison with Yersinia pseudotuberculosis. Proc. Natl Acad. Sci. USA 101, 13826–13831 (2004).

22.  Moran, N. A. Microbial minimalism: genome reduction in bacterial pathogens. Cell 108, 583–586 (2002).

23.  Sallström, B. & Andersson, S. G. Genome reduction in the α-Proteobacteria. Curr. Opin. Microbiol. 8, 579–585 (2005).

24.  Brosch, R. et al. A new evolutionary scenario for the Mycobacterium tuberculosis complex. Proc. Natl Acad. Sci. USA 99, 3684–3689 (2002).

25.  Ahmed, N. et al. Molecular analysis of a leprosy immunotherapeutic bacillus provides insights into Mycobacterium evolution. PLoS ONE 2, e968 (2007).

26.  Kaper, J. B., Nataro, J. P. & Mobley, H. L. Pathogenic Escherichia coli. Nature Rev. Microbiol. 2, 123–140 (2004).

27.  Schmidt, H. & Hensel, M. Pathogenicity islands in bacterial pathogenesis. Clin. Microbiol. Rev. 17, 14–56 (2004).

28.  Dobrindt, U. & Hacker, J. in The Comprehensive Sourcebook of Bacterial Protein Toxins (eds Alouf, J. E. & Popoff, M. R.) 44–63 (Academic, London, 2006).

29.  Perna, N. T. et al. Genome sequence of enterohaemorrhagic Escherichia coli O157:H7. Nature 409, 529–533 (2001).

30.  Ohnishi, M., Kurokawa, K. & Hayashi, T. Diversification of Escherichia coli genomes: are bacteriophages the major contributors? Trends Microbiol. 9, 481–485 (2001).

31.  Ohnishi, M. et al. Genomic diversity of enterohemorrhagic Escherichia coli O157 revealed by whole genome PCR scanning. Proc. Natl Acad. Sci. USA 99, 17043–17048 (2002).

32.  Reid, S. D., Herbelin, C. J., Bumbaugh, A. C., Selander, R. K. & Whittam, T. S. Parallel evolution of virulence in pathogenic Escherichia coli. Nature 406, 64–67 (2000).

33.  Maurelli, A. T., Fernandez, R. E., Bloch, C. A., Rode, C. K. & Fasano, A. “Black holes” and bacterial pathogenicity: a large genomic deletion that enhances the virulence of Shigella spp. and enteroinvasive Escherichia coli. Proc. Natl Acad. Sci. USA 95, 3943–3948 (1998).

34.  Prosseda, G. et al. The two-faced role of cad genes in the virulence of pathogenic Escherichia coli. Res. Microbiol. 158, 487–493 (2007).

35.  Nougayrede, J. P. et al. Escherichia coli induces DNA double-strand breaks in eukaryotic cells. Science 313, 848–851 (2006).

36.  Durant, L. et al. Identification of candidates for a subunit vaccine against extraintestinal pathogenic Escherichia coli. Infect. Immun. 75, 1916–1925 (2007).

37.  Hagan, E. C. & Mobley, H. L. Uropathogenic Escherichia coli outer membrane antigens expressed during urinary tract infection. Infect. Immun. 75, 3941–3949 (2007).

38.  Herbelin, C. J., Chirillo, S. C., Melnick, K. A. & Whittam, T. S. Gene conservation and loss in the mutS–rpoS genomic region of pathogenic Escherichia coli. J. Bacteriol. 182, 5381–5390 (2000).

39.  Bidet, P. et al. Detection and identification by PCR of a highly virulent phylogenetic subgroup among extraintestinal pathogenic Escherichia coli B2 strains. Appl. Environ. Microbiol. 73, 2373–2377 (2007).

40.  Brzuszkiewicz, E. et al. How to become a uropathogen: comparative genomic analysis of extraintestinal pathogenic Escherichia coli strains. Proc. Natl Acad. Sci. USA 103, 12879–12884 (2006).

41.  Plainvert, C. et al. A new O-antigen gene cluster has a key role in the virulence of the Escherichia coli meningitis clone O45:K1:H7. J. Bacteriol. 189, 8528–8536 (2007).

42.  Cegelski, L., Marshall, G. R., Eldridge, G. R. & Hultgren, S. J. The biology and future prospects of antivirulence therapies. Nature Rev. Microbiol. 6, 17–27 (2008).

43.  Middendorf, B. et al. Instability of pathogenicity islands in uropathogenic Escherichia coli 536. J. Bacteriol. 186, 3086–3096 (2004).

44.  Hochhut, B. et al. Role of pathogenicity island-associated integrases in the genome plasticity of uropathogenic Escherichia coli strain 536. Mol. Microbiol. 61, 584–595 (2006).

45.  Wilde, C. et al. Delineation of the recombination sites necessary for integration of pathogenicity islands II and III into the Escherichia coli 536 chromosome. Mol. Microbiol. 68, 139–151 (2008).

46.  Grozdanov, L. et al. A single nucleotide exchange in the wzy gene is responsible for the semirough O6 lipopolysaccharide phenotype and serum sensitivity of Escherichia coli strain Nissle 1917. J. Bacteriol. 184, 5912–5925 (2002).

47.  Klemm, P., Roos, V., Ulett, G. C., Svanborg, C. & Schembri, M. A. Molecular characterization of the Escherichia coli asymptomatic bacteriuria strain 83972: the taming of a pathogen. Infect. Immun. 74, 781–785 (2006).

48.  Zdziarski, J., Svanborg, C., Wullt, B., Hacker, J. & Dobrindt, U. Molecular basis of commensalism in the urinary tract: low virulence or virulence attenuation? Infect. Immun. 76, 695–703 (2008).

49.  Altenhoefer, A. et al. The probiotic Escherichia coli strain Nissle 1917 interferes with invasion of human intestinal epithelial cells by different enteroinvasive bacterial pathogens. FEMS Immunol. Med. Microbiol. 40, 223–229 (2004).

50.  Sartor, R. B. Probiotic therapy of intestinal inflammation and infections. Curr. Opin. Gastroenterol. 21, 44–50 (2005).

51.  Sunden, F., Hakansson, L., Ljunggren, E. & Wullt, B. Bacterial interference — is deliberate colonization with Escherichia coli 83972 an alternative treatment for patients with recurrent urinary tract infection? Int. J. Antimicrob. Agents 28 (Suppl. 1), 26–29 (2006).

52.  Bielaszewska, M. et al. Shiga toxin gene loss and transfer in vitro and in vivo during enterohemorrhagic Escherichia coli O26 infection in humans. Appl. Environ. Microbiol. 73, 3144–3150 (2007).

53.  Friedrich, A. W. et al. Prevalence, virulence profiles, and clinical significance of Shiga toxin-negative variants of enterohemorrhagic Escherichia coli O157 infection in humans. Clin. Infect. Dis. 45, 39–45 (2007).

54.  Bielaszewska, M. et al. Shiga toxin-mediated hemolytic uremic syndrome: time to change the diagnostic paradigm? PLoS ONE 2, e1024 (2007).

55.  Ahmed, N. & Sechi, L. A. Helicobacter pylori and gastroduodenal pathology: new threats of the old friend. Ann. Clin. Microbiol. Antimicrob. 4, 1 (2005).

56.  Alm, R. A. et al. Genomic-sequence comparison of two unrelated isolates of the human gastric pathogen Helicobacter pylori. Nature 397, 176–180 (1999).

57.  Tomb, J. F. et al. The complete genome sequence of the gastric pathogen Helicobacter pylori. Nature 388, 539–547 (1997).

58.  Censini, S. et al. cag, a pathogenicity island of Helicobacter pylori, encodes type I-specific and

disease-associated virulence factors. Proc. Natl Acad. Sci. USA 93, 14648–14653 (1996).

59.  Devi, S. M. et al. Genomes of Helicobacter pylori from native Peruvians suggest admixture of ancestral and modern lineages and reveal a western type cag-pathogenicity island. BMC Genomics 7, 191 (2006).

60.  Kersulyte, D. et al. Cluster of type IV secretion genes in Helicobacter pylori’s plasticity zone. J. Bacteriol. 185, 3764–3772 (2003).

61.  Chattopadhyay, S. et al. Virulence genes in Helicobacter pylori strains from West Bengal residents with overt H. pylori-associated disease and healthy volunteers. J. Clin. Microbiol. 40, 2622–2625 (2002).

62.  Kauser, F. et al. Comparing genomes of Helicobacter pylori strains from the high-altitude desert of Ladakh, India. J. Clin. Microbiol. 43, 1538–1545 (2005).

63.  Mukhopadhyay, A. K. et al. Distinctiveness of genotypes of Helicobacter pylori in Calcutta, India. J. Bacteriol. 182, 3219–3227 (2000).

64.  Saribasak, H., Salih, B. A., Yamaoka, Y. & Sander, E. Analysis of Helicobacter pylori genotypes and correlation with clinical outcome in Turkey. J. Clin. Microbiol. 42, 1648–1651 (2004).

65.  Carroll, I. M., Khan, A. A. & Ahmed, N. Revisiting the pestilence of Helicobacter pylori: insights into geographical genomics and pathogen evolution. Infect. Genet. Evol. 4, 81–90 (2004).

66.  Devi, S. M. et al. Ancestral European roots of Helicobacter pylori in India. BMC Genomics 8, 184 (2007).

67.  Argent, R. H. et al. Determinants and consequences of different levels of CagA phosphorylation for clinical isolates of Helicobacter pylori. Gastroenterology 127, 514–523 (2004).

68.  Hatakeyama, M. Oncogenic mechanisms of the Helicobacter pylori CagA protein. Nature Rev. Cancer 4, 688–694 (2004).

69.  Saadat, I. et al. Helicobacter pylori CagA targets PAR1/MARK kinase to disrupt epithelial cell polarity. Nature 447, 330–333 (2007).

70.  Argent, R. H., Zhang, Y. & Atherton, J. C. Simple method for determination of the number of Helicobacter pylori CagA variable-region EPIYA tyrosine phosphorylation motifs by PCR. J. Clin. Microbiol. 43, 791–795 (2005).

71.  Higashi, H. et al. Biological activity of the Helicobacter pylori virulence factor CagA is determined by variation in the tyrosine phosphorylation sites. Proc. Natl Acad. Sci. USA 99, 14428–14433 (2002).

72.  Zhang, Y., Argent, R. H., Letley, D. P., Thomas, R. J. & Atherton, J. C. Tyrosine phosphorylation of CagA from Chinese Helicobacter pylori isolates in AGS gastric epithelial cells. J. Clin. Microbiol. 43, 786–790 (2005).

73.  Owen, R. J., Sharp, S. I., Chisholm, S. A. & Rijpkema, S. Identification of cagA tyrosine phosphorylation DNA motifs in Helicobacter pylori isolates from peptic ulcer patients by novel PCR-restriction fragment length polymorphism and real-time fluorescence PCR assays. J. Clin. Microbiol. 41, 3112–3128 (2003).

74.  Yamazaki, S. et al. Identification of Helicobacter pylori and the cagA genotype in gastric biopsies using highly sensitive real-time PCR as a new diagnostic tool. FEMS Immunol. Med. Microbiol. 44, 261–268 (2005).

75.  Tsolaki, A. G. et al. Functional and evolutionary genomics of Mycobacterium tuberculosis: insights from genomic deletions in 100 strains. Proc. Natl Acad. Sci. USA 101, 4865–4870 (2004).

76.  Ahmed, N. et al. Genome sequence based, comparative analysis of the fluorescent amplified fragment length polymorphisms (FAFLP) of tubercle bacilli from seals provides molecular evidence for a new species within the Mycobacterium tuberculosis complex. Infect. Genet. Evol. 2, 193–199 (2003).

77.  Ahmed, N. et al. Molecular genotyping of a large, multicentric collection of tubercle bacilli indicates geographical partitioning of strain variation and has implications for global epidemiology of Mycobacterium tuberculosis. J. Clin. Microbiol. 42, 3240–3247 (2004).

78.  Ahmed, N. et al. Distinctiveness of Mycobacterium tuberculosis genotypes from human immunodeficiency virus type 1-seropositive and -seronegative patients in Lima, Peru. J. Clin. Microbiol. 41, 1712–1716 (2003).

79.  Siddiqi, N. et al. Molecular characterization of multidrug-resistant isolates of Mycobacterium tuberculosis from patients in North India. Antimicrob. Agents Chemother. 46, 443–450 (2002).

80.  Dagan, T., Blekhman, R. & Graur, D. The “domino theory” of gene death: gradual and mass gene extinction events in three lineages of obligate

P e r s P e c t i v e s

NATUrE rEvIEWs | microbiology vOlUME 6 | MAy 2008 | 393

© 2008 Nature Publishing Group

symbiotic bacterial pathogens. Mol. Biol. Evol. 23, 310–316 (2006).

81.  Cole, S. T. et al. Massive gene decay in the leprosy bacillus. Nature 409, 1007–1011 (2001).

82.  Filliol, I. et al. Global phylogeny of Mycobacterium tuberculosis based on single nucleotide polymorphism (SNP) analysis: insights into tuberculosis evolution, phylogenetic accuracy of other DNA fingerprinting systems, and recommendations for a minimal standard SNP set. J. Bacteriol. 188, 759–772 (2006).

83.  Huard, R. C. et al. Novel genetic polymorphisms that further delineate the phylogeny of the Mycobacterium tuberculosis complex. J. Bacteriol. 188, 4271–4287 (2006).

84.  Marmiesse, M. et al. Macro-array and bioinformatic analyses reveal mycobacterial ‘core’ genes, variation in the ESAT-6 gene family and new phylogenetic markers for the Mycobacterium tuberculosis complex. Microbiology 150, 483–496 (2004).

85.  Rao, K. R. et al. Analysis of genomic downsizing on the basis of region-of-difference polymorphism profiling of Mycobacterium tuberculosis patient isolates reveals geographic partitioning. J. Clin. Microbiol. 43, 5978–5982 (2005).

86.  Rao, K. R., Ahmed, N., Srinivas, S., Sechi, L. A. & Hasnain, S. E. Rapid identification of Mycobacterium tuberculosis Beijing genotypes on the basis of the mycobacterial interspersed repetitive unit locus 26 signature. J. Clin. Microbiol. 44, 274–277 (2006).

87.  Cousins, D. V. et al. Tuberculosis in seals caused by a novel member of the Mycobacterium tuberculosis complex: Mycobacterium pinnipedii sp. nov. Int. J. Syst. Evol. Microbiol. 53, 1305–1314 (2003).

88.  Harboe, M., Oettinger, T., Wiker, H. G., Rosenkrands, I. & Andersen, P. Evidence for occurrence of the ESAT-6 protein in Mycobacterium tuberculosis and virulent Mycobacterium bovis and for its absence in Mycobacterium bovis BCG. Infect. Immun. 64, 16–22 (1996).

89.  Lewis, K. N. et al. Deletion of RD1 from Mycobacterium tuberculosis mimics bacille Calmette–Guérin attenuation. J. Infect. Dis. 187, 117–123 (2003).

90.  Pym, A. S. et al. Recombinant BCG exporting ESAT-6 confers enhanced protection against tuberculosis. Nature Med. 9, 533–539 (2003).

91.  Brosch, R. et al. Genome plasticity of BCG and impact on vaccine efficacy. Proc. Natl Acad. Sci. USA 104, 5596–5601 (2007).

92.  Marri, P. R., Bannantine, J. P. & Golding, G. B. Comparative genomics of metabolic pathways in Mycobacterium species: gene duplication, gene decay and lateral gene transfer. FEMS Microbiol. Rev. 30, 906–925 (2006).

93.  Newton, S. M. et al. A deletion defining a common Asian lineage of Mycobacterium tuberculosis associates with immune subversion. Proc. Natl Acad. Sci. USA 103, 15594–15598 (2006).

94.  Banerjee, S. et al. Mycobacterium tuberculosis (Mtb) isocitrate dehydrogenases show strong B cell response and distinguish vaccinated controls from TB patients. Proc. Natl Acad. Sci. USA 101, 12652–12657 (2004).

95.  Chakhaiyar, P. et al. Regions of high antigenicity within the hypothetical PPE major polymorphic tandem repeat open-reading frame, Rv2608, show a differential humoral response and a low T cell response in various categories of patients with tuberculosis. J. Infect. Dis. 190, 1237–1244 (2004).

96.  Choudhary, R. K. et al. PPE antigen Rv2430c of Mycobacterium tuberculosis induces a strong B-cell response. Infect. Immun. 71, 6338–6343 (2003).

97.  Mazars, E. et al. High-resolution minisatellite-based typing as a portable approach to global analysis of Mycobacterium tuberculosis molecular epidemiology. Proc. Natl Acad. Sci. USA 98, 1901–1906 (2001).

98.  Sreenu, V. B., Kumar, P., Nagaraju, J. & Nagarajaram, H. A. Microsatellite polymorphism across the M. tuberculosis and M. bovis genomes: implications on genome evolution and plasticity. BMC Genomics 7, 78 (2006).

99.  Narayanan, S. et al. Molecular epidemiology of tuberculosis in a rural area of high prevalence in South India: implications for disease control and prevention. J. Clin. Microbiol. 40, 4785–4788 (2002).

100. Singh, U. B. et al. Predominant tuberculosis spoligotypes, Delhi, India. Emerg. Infect. Dis. 10, 1138–1142 (2004).

101. Supply, P. et al. Automated high-throughput genotyping for study of global epidemiology of Mycobacterium tuberculosis based on mycobacterial interspersed repetitive units. J. Clin. Microbiol. 39, 3563–3571 (2001).

102. Gey van Pittius, N. C. et al. Evolution and expansion of the Mycobacterium tuberculosis PE and PPE multigene families and their association with the duplication of the ESAT-6 (esx) gene cluster regions. BMC Evol. Biol. 6, 95 (2006).

103. Okkels, L. M. et al. PPE protein (Rv3873) from DNA segment RD1 of Mycobacterium tuberculosis: strong recognition of both specific T-cell epitopes and epitopes conserved within the PPE family. Infect. Immun. 71, 6116–6123 (2003).

104. Hsu, T. et al. The primary mechanism of attenuation of bacillus Calmette–Guérin is a loss of secreted lytic function required for invasion of lung interstitial tissue. Proc. Natl Acad. Sci. USA 100, 12420–12425 (2003).

105. Pallen, M. J. The ESAT-6/WXG100 superfamily — and a new Gram-positive secretion system? Trends Microbiol. 10, 209–212 (2002).

106. Choudhary, R. K., Pullakhandam, R., Ehtesham, N. Z. & Hasnain, S. E. Expression and characterization of Rv2430c, a novel immunodominant antigen of Mycobacterium tuberculosis. Protein Expr. Purif. 36, 249–253 (2004).

107. Tundup, S., Akhter, Y., Thiagarajan, D. & Hasnain, S. E. Clusters of PE and PPE genes of Mycobacterium tuberculosis are organized in operons: evidence that PE Rv2431c is co-transcribed with PPE Rv2430c and their gene products interact with each other. FEBS Lett. 580, 1285–1293 (2006).

108. Fleischmann, R. D. et al. Whole-genome comparison of Mycobacterium tuberculosis clinical and laboratory strains. J. Bacteriol. 184, 5479–5490 (2002).

109. Cole, S. T. et al. Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence. Nature 393, 537–544 (1998).

110.  Berthet, F. X., Rasmussen, P. B., Rosenkrands, I., Andersen, P. & Gicquel, B. A Mycobacterium tuberculosis operon encoding ESAT-6 and a novel low-molecular-mass culture filtrate protein (CFP-10). Microbiology 144, 3195–3203 (1998).

111.  Nilsson, A. I. et al. Bacterial genome size reduction by experimental evolution. Proc. Natl Acad. Sci. USA 102, 12112–12116 (2005).

112. Knezevic, I. & Corbel, M. J. WHO discussion on the improvement of the quality control of BCG vaccines. Vaccine 24, 3874–3877 (2006).

DATABASESentrez Genome Project: http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=genomeprjEscherichia coli | Helicobacter pylori | Mycobacterium avium | Mycobacterium bovis | Mycobacterium leprae | Mycobacterium marinum | Mycobacterium smegmatis | Mycobacterium tuberculosis | Mycobacterium ulcerans | Yersinia pestis | Yersinia pseudotuberculosis

FURTHER inFoRMATionJörg Hacker and Ulrich Dobrindt’s homepage: http://www.infektionsforschung.uni-wuerzburg.de/Niyaz Ahmed’s homepage: http://www.pathogen-evolution.orgseyed e. Hasnain’s homepage: http://www.isogem.org/hasnain.htmlinstitute Pasteur (colibri database): http://genolist.pasteur.fr/colibri/institute Pasteur (PyloriGene database): http://genolist.pasteur.fr/PyloriGene/J. craig venter institute (comprehensive microbial resource): http://www.jcvi.org/cms/research/projects/cmr/MirU–vNtrplus : http://www.miru-vntrplus.orgWellcome trust sanger institute (pathogen sequencing unit): http://www.sanger.ac.uk/Projects/Pathogens/

AlllinksAreActiveintheonlinepdf

113. Shin, J., Wood, D., Robertson, J., Minor, P. & Peden, K. WHO informal consultation on the application of molecular methods to assure the quality, safety and efficacy of vaccines, Geneva, Switzerland, 7–8 April 2005. Biologicals 35, 63–71 (2007).

AcknowledgementsResearch in the laboratories of N.A. and S.E.H. was supported by several grants from the Department of Biotechnology, Ministry of Science & Technology, Government of India. Research in the laboratories of U.D. and J.H. was supported by grants from the German Research Foundation (SFB479, TP A1), the European Community (European virtual institute for functional genomics of bacterial pathogens; CEE LSHB-CT-2005-512,061) and the Bavarian Research Foundation.

o p i n i o n

Searching for the cause of Kawasaki disease — cytoplasmic inclusion bodies provide new insightAnne H. Rowley, Susan C. Baker, Jan M. Orenstein and Stanford T. Shulman

Abstract | Kawasaki disease (KD) has emerged as the most common cause of acquired heart disease in children in the developed world. The cause of KD remains unknown, although an as-yet unidentified infectious agent might be responsible. By determining the causative agent, we can improve diagnosis, therapy and prevention of KD. recently, identification of an antigen-driven igA response that was directed at cytoplasmic inclusion bodies in KD tissues has provided new insights that could unlock the mysteries of KD.

Kawasaki disease (KD) is an acute child-hood illness that usually affects previously healthy infants and children. The disease is manifested by a high spiking fever and, in classic cases, four of five additional features: rash; red eyes; red lips and mouth; swollen and red hands and feet;

and swollen glands in the neck. These symptoms resolve spontaneously within 1–3 weeks, or sooner after treatment with intravenous gammaglobulin and aspirin. However, inflammation of medium-sized arteries throughout the body, particularly of the coronary arteries, can occur during

P e r s P e c t i v e s

394 | MAy 2008 | vOlUME 6 www.nature.com/reviews/micro

© 2008 Nature Publishing Group