20
REVIEW ARTICLE Origins of bacterial diversity through horizontal genetic transfer and adaptation to new ecological niches Jane Wiedenbeck & Frederick M. Cohan Department of Biology, Wesleyan University, Middletown, CT, USA Correspondence: Frederick M. Cohan, Department of Biology, Wesleyan University, Middletown, CT 06459-0170, USA. Tel.: 11 860 685 3482; fax: 11 860 685 3279; e-mail: [email protected] Received 9 November 2010; accepted 15 June 2011. Final version published online July 2011. DOI:10.1111/j.1574-6976.2011.00292.x Editor: Fernando Baquero Keywords species concept; niche-transcending adaptation; ecotype; evolution; genome content; amelioration. Abstract Horizontal genetic transfer (HGT) has played an important role in bacterial evolution at least since the origins of the bacterial divisions, and HGT still facilitates the origins of bacterial diversity, including diversity based on antibiotic resistance. Adaptive HGT is aided by unique features of genetic exchange in bacteria such as the promiscuity of genetic exchange and the shortness of segments transferred. Genetic exchange rates are limited by the genetic and ecological similarity of organisms. Adaptive transfer of genes is limited to those that can be transferred as a functional unit, provide a niche-transcending adaptation, and are compatible with the architecture and physiology of other organisms. Horizontally transferred adaptations may bring about fitness costs, and natural selection may ameliorate these costs. The origins of ecological diversity can be analyzed by comparing the genomes of recently divergent, ecologically distinct populations, which can be discovered as sequence clusters. Such genome comparisons demon- strate the importance of HGT in ecological diversification. Newly divergent populations cannot be discovered as sequence clusters when their ecological differences are coded by plasmids, as is often the case for antibiotic resistance; the discovery of such populations requires a screen for plasmid-coded functions. Introduction The evolution of bacteria is not just the evolution of animals and plants writ small. The origin of bacterial species is accelerated by unique features of bacterial genetics, perhaps the most important being the ability of bacteria to readily acquire genes from other organisms (Ochman & Davalos, 2006; Cohan & Koeppel, 2008). Fully sequenced genomes reveal that a substantial fraction of ORFs have been horizon- tally transferred (Nakamura et al., 2004; McDaniel et al., 2010), and many of these acquisitions are thought to have driven the origins of new bacterial species (Gogarten et al., 2002). Early evidence of the importance of horizontal genetic transfer (HGT) in bacterial evolution was seen in the spread of penicillin resistance through plasmid transfer across the Enterobacteriaceae (Datta & Kontomichalou, 1965). Bacterio- logists might have interpreted this rapid spread of resistance as evidence for the general importance of HGT in bacterial evolution, but the extent and impact of HGT were not fully appreciated until much later. With eventual access to genome sequences, it became clear that HGT occurs throughout the genome, and has been responsible for the origins of extremely diverse adaptations (Ochman et al., 2000). Moreover, HGT has played a role in bacterial evolu- tion at least since the origins of the bacterial divisions (Gogarten et al., 2002). For example, methanotrophs acquired the ability to synthesize some of the cofactors for methane utilization from methanogenic Archaea (Chistoserdova et al., 1998; Gogarten et al., 2002). More recent HGT events have resulted in important ecological differences between closely related species and between populations within a single recognized species taxon (Welch et al., 2002). For example, the virulence factors that distinguish Salmonella from Escherichia coli were largely acquired by HGT (Groisman & Ochman, 1996; Gogarten et al., 2002). Also, antibiotic resistance factors coded on plasmids distinguish extremely close relatives within recog- nized species taxa. For many bacterial pathogens, the acquisition of antibiotic resistance is likely necessary to survive in the ecological niche of agricultural animals and humans frequently treated by antibiotics (O’Brien, 2002). FEMS Microbiol Rev 35 (2011) 957–976 c 2011 Federation of European Microbiological Societies Published by Blackwell Publishing Ltd. All rights reserved MICROBIOLOGY REVIEWS

plugin-j.1574-6976.2011.00292.x

Embed Size (px)

DESCRIPTION

plugin-j.1574-6976.2011.00292.x.

Citation preview

R E V I E W A R T I C L E

Originsof bacterial diversity through horizontal genetictransferandadaptation tonewecological nichesJane Wiedenbeck & Frederick M. Cohan

Department of Biology, Wesleyan University, Middletown, CT, USA

Correspondence: Frederick M. Cohan,

Department of Biology, Wesleyan University,

Middletown, CT 06459-0170, USA.

Tel.: 11 860 685 3482;

fax: 11 860 685 3279;

e-mail: [email protected]

Received 9 November 2010; accepted 15 June

2011.

Final version published online July 2011.

DOI:10.1111/j.1574-6976.2011.00292.x

Editor: Fernando Baquero

Keywords

species concept; niche-transcending

adaptation; ecotype; evolution; genome

content; amelioration.

Abstract

Horizontal genetic transfer (HGT) has played an important role in bacterial

evolution at least since the origins of the bacterial divisions, and HGT still

facilitates the origins of bacterial diversity, including diversity based on antibiotic

resistance. Adaptive HGT is aided by unique features of genetic exchange in

bacteria such as the promiscuity of genetic exchange and the shortness of segments

transferred. Genetic exchange rates are limited by the genetic and ecological

similarity of organisms. Adaptive transfer of genes is limited to those that can be

transferred as a functional unit, provide a niche-transcending adaptation, and are

compatible with the architecture and physiology of other organisms. Horizontally

transferred adaptations may bring about fitness costs, and natural selection may

ameliorate these costs. The origins of ecological diversity can be analyzed by

comparing the genomes of recently divergent, ecologically distinct populations,

which can be discovered as sequence clusters. Such genome comparisons demon-

strate the importance of HGT in ecological diversification. Newly divergent

populations cannot be discovered as sequence clusters when their ecological

differences are coded by plasmids, as is often the case for antibiotic resistance; the

discovery of such populations requires a screen for plasmid-coded functions.

Introduction

The evolution of bacteria is not just the evolution of animals

and plants writ small. The origin of bacterial species is

accelerated by unique features of bacterial genetics, perhaps

the most important being the ability of bacteria to readily

acquire genes from other organisms (Ochman & Davalos,

2006; Cohan & Koeppel, 2008). Fully sequenced genomes

reveal that a substantial fraction of ORFs have been horizon-

tally transferred (Nakamura et al., 2004; McDaniel et al., 2010),

and many of these acquisitions are thought to have driven the

origins of new bacterial species (Gogarten et al., 2002).

Early evidence of the importance of horizontal genetic

transfer (HGT) in bacterial evolution was seen in the spread

of penicillin resistance through plasmid transfer across the

Enterobacteriaceae (Datta & Kontomichalou, 1965). Bacterio-

logists might have interpreted this rapid spread of resistance

as evidence for the general importance of HGT in bacterial

evolution, but the extent and impact of HGT were not fully

appreciated until much later. With eventual access to

genome sequences, it became clear that HGT occurs

throughout the genome, and has been responsible for the

origins of extremely diverse adaptations (Ochman et al.,

2000). Moreover, HGT has played a role in bacterial evolu-

tion at least since the origins of the bacterial divisions

(Gogarten et al., 2002). For example, methanotrophs

acquired the ability to synthesize some of the cofactors

for methane utilization from methanogenic Archaea

(Chistoserdova et al., 1998; Gogarten et al., 2002).

More recent HGT events have resulted in important

ecological differences between closely related species and

between populations within a single recognized species

taxon (Welch et al., 2002). For example, the virulence factors

that distinguish Salmonella from Escherichia coli were largely

acquired by HGT (Groisman & Ochman, 1996; Gogarten

et al., 2002). Also, antibiotic resistance factors coded on

plasmids distinguish extremely close relatives within recog-

nized species taxa. For many bacterial pathogens, the

acquisition of antibiotic resistance is likely necessary to

survive in the ecological niche of agricultural animals and

humans frequently treated by antibiotics (O’Brien, 2002).

FEMS Microbiol Rev 35 (2011) 957–976 c� 2011 Federation of European Microbiological SocietiesPublished by Blackwell Publishing Ltd. All rights reserved

MIC

ROBI

OLO

GY

REV

IEW

S

Through HGT, divergent populations can share an adap-

tation whose value transcends their differences in physio-

logical capabilities, cellular structures, and ecological niches.

This sharing of niche-transcending genes does not generally

make the two populations more similar ecologically. Instead,

HGT allows the recipient to build on its unique, pre-existing

adaptations to either invade a new niche or to improve its

performance in its current niche (Cohan & Koeppel, 2008).

For example, enterotoxigenic E. coli, which attacks the

epithelial cells of the small intestine, has shared the Class 5

fimbriae by HGT with Burkholderia cepacia (Anantha et al.,

2004), which can reside in human lungs of cystic fibrosis

patients, attacking the respiratory epithelium. Acquiring

these niche-transcending genes thus allows each lineage to

better attack cells of its respective niche, but does not cause

the donor and the recipient to converge ecologically (Cohan

& Koeppel, 2008; Cohan, 2011). Likewise, when ecologically

disparate human pathogens acquire the same antibiotic

resistance factors by HGT (Fondi & Fani, 2010), their

ecological niches are not converging beyond their response

to natural selection by antibiotics.

Here, we will review the properties of bacteria that make

HGT so effective in fostering adaptations and the origins of

new ecological populations. We will review the classes of

adaptations most and least likely to be delivered by HGT,

and the ecological and phylogenetic limits on genetic

transfer. Also, we will review the evolutionary challenges on

the recipient to accommodate horizontally acquired adapta-

tions. Finally, we will review evidence from genome content

comparisons that demonstrate a prominent role of HGT in

the origins of bacterial diversity, and how genome compar-

isons have led to the discovery of the ecological dimensions

by which bacterial divergence occurs.

The qualities of bacterial geneticexchange that foster adaptive HGT

Exchange of genes in bacteria is both rare and promiscuous.

A broad survey of recombination rates has shown that

recombination, even among close relatives, occurs at a per

capita per gene rate that is generally close to the rate of

mutation, and rarely more than about 10 times the rate of

mutation (Vos & Didelot, 2009). The rarity of recombina-

tion on a per capita basis does not prevent acquisition of

adaptations from other organisms, as the enormous popula-

tion sizes of many bacteria can bring unlikely recombination

events within reach (Levin & Bergstrom, 2000). Moreover,

recombination in bacteria is too rare to hinder the ecological

diversification of closely related populations. This is because

natural selection against maladaptive, niche-specifying

genes (genes that are adaptive only in the context of their

home population) from other populations easily keeps these

foreign genes at negligible frequencies (Cohan, 1994; Cohan

& Koeppel, 2008) (Box 1).

Bacterial recombination is much more promiscuous than

recombination in higher organisms, where it is always

limited to very close relatives (usually members of the same

species or very closely related species) (Mallet et al., 2007;

Mallet, 2008). Bacterial recombination can extend across the

bacterial divisions and even across the three domains of life

(Garcia-Vallve et al., 2000; Rest & Mindell, 2003). Thus,

bacterial recombination can foster the acquisition of adap-

tations from both close and distant relatives.

Transferred DNA is generally short, often the length of

one to several genes. A short length helps to enable the

success of adaptive HGT between deeply divergent bacteria,

as it allows a recipient to pick up a niche-transcending gene

(or set of genes) without also acquiring the niche-specifying

Box 1. Why recombination does not hinder the adaptive divergence

of bacterial populations

Haldane long ago showed why the rare introduction of niche-

specifying alleles from one population to another cannot reverse the

adaptive divergence between populations: the equilibrium frequency

(p�) of a maladaptive, niche-specifying allele from another population is

equal to its rate of entry into the population (m) divided by the selection

intensity disfavoring the foreign allele (s), i.e. p�= m/s (Haldane, 1932).

In the case of bacteria, the rate of entry of another population’s allele is

equal to the rate of recombination between populations (cb) (Cohan,

1994). A broad survey of recombination rates in the Bacteria and

Archaea showed that recombination generally occurs at about the

same rate as mutation and never greater than about 10 times the rate

of mutation (per gene per generation) (Vos & Didelot, 2009), which is

about 10�6 per gene per generation (Drake, 2009). This analysis of

recombination rates took special care to estimate only recombination

rates among closest relatives, and so the recombination rates may be

taken as estimates of the rate of recombination within populations (cw).

Thus, the estimate of recombination rate at 10�6 per gene per

generation may be taken as an upper limit for the rate of recombination

between populations (cb � cw). If foreign alleles are disfavored even by

weak selection, for example around 10�2, the equilibrium frequency of

foreign alleles would be negligible, in this case around p�= cb/s � cw/s,

or 10�6/10�2 = 10�4.

Thus, even if recombination between populations is occurring at as

high a rate as recombination within populations, each population will

be able to hold onto its respective combinations of genes, which adapt

it to its respective way of making a living. Thus, adaptive divergence in

bacteria does not require sexual isolation (Cohan, 1994; Cohan &

Koeppel, 2008). This argument has been ignored by workers claiming a

central role for recombination and sexual isolation in bacterial

divergence (Fraser et al., 2007; Sheppard et al., 2008), but the

argument has never been refuted.

Nevertheless, recombination is an important force fostering adaptive

evolution in bacteria. Recombination of niche-transcending genes has

been shown to be an important means of introduction of adaptations

into bacterial populations. The difference is that maladaptive

recombination of niche-specifying genes can be rejected by natural

selection, but rare, adaptive introductions of niche-transcending genes

are amplified by natural selection.

FEMS Microbiol Rev 35 (2011) 957–976c� 2011 Federation of European Microbiological SocietiesPublished by Blackwell Publishing Ltd. All rights reserved

958 J. Wiedenbeck & F.M. Cohan

genes of the donor, which would be maladaptive in the

recipient (Zawadzki & Cohan, 1995; Cohan & Koeppel,

2008). This is in contrast to adaptive introgression in

animals and plants. Because eukaryotic meiosis involves a

50 : 50 mix of the two parents’ genomes, several generations

of backcrossing are required to yield a pure transfer of niche-

transcending genes across species (Rieseberg et al., 2003); in

bacteria, a set of niche-transcending genes can be trans-

ferred, without other genes, in a single transfer event

(Cohan, 2001).

Donor--recipient similarities required forgenetic transfer

The promiscuity of bacteria and the opportunity to transfer

adaptations across taxa are not absolute. Certain shared

characteristics between donor and recipient are required for

transfer to take place (Cohan, 2002). In some cases, genetic

exchange takes place through a plasmid or a phage vector,

whereby the vector incorporates genetic material from one

host, the vector then infects another host, and the genetic

material becomes recombined into the new host (Majewski,

2001). Clearly, vector-based genetic exchange between bac-

teria can take place only if the bacteria share their vectors

(Cohan, 2002). The host ranges of phage and plasmids are

generally narrow, but the host ranges of certain phage

extend across bacterial genera (Jensen et al., 1998), and

some plasmids have extremely broad host ranges that extend

across bacterial divisions (Norman et al., 2009).

Sharing of vectors between hosts is challenged by bacterial

restriction–modification systems. Many bacteria are

equipped with a restriction endonuclease that recognizes

one specific DNA sequence, usually a palindromic tetramer

(e.g. GGCC), hexamer, or octamer; these bacteria are also

equipped with a modification enzyme that methylates the

recognition sequence and thereby protects the cell’s DNA

from its own restriction enzyme (Weiserova & Ryu, 2008).

Thus, bacteria that share the same restriction–modification

system can more easily share a phage or a plasmid, whereas

bacteria with different restriction–modification systems will

effectively digest the DNA of one another’s vectors (Jeltsch,

2003; Budroni et al., 2011). Nevertheless, short plasmid and

phage genomes may be shared across bacteria differing in

their restriction–modification systems, as short vectors are

less likely to contain a given recognition sequence and are

thereby more protected from cleavage (Lacks & Springhorn,

1984). Even some larger plasmids, including the broad-host-

range Inc-P1 plasmids, are able to defend against restriction

endonucleases by removing restriction target sites from their

sequences (Wilkins et al., 1996), whereas others simulate the

host modification system, protecting their sequences from

cleavage (Kruger & Bickle, 1983), thus contributing to their

promiscuity.

In some taxa (e.g. Bacillus and Neisseria), genetic

exchange can occur by transformation, through uptake of

naked DNA from the environment. Most transforming

bacteria are able to take up DNA indiscriminately (Lorenz

& Wackernagel, 1994), but some species are selective about

the DNA sequences that are allowed to cross the membrane.

For example, in Neisseria gonorrhoeae, successful transfor-

mation requires a 10-bp uptake sequence that occurs

abundantly in the genome of this species taxon. The

requirement for this uptake sequence effectively prevents

genetic transfer from divergent organisms (Hamilton &

Dillard, 2006).

Once present in the cytoplasm, transferred DNA must

ensure its replication so that it is not lost. For DNA that

enters a recipient cell on a plasmid, replication may occur by

simply remaining on the plasmid, as seen in many plasmids

with antibiotic resistance factors (Fondi & Fani, 2010).

Alternatively, if transferred DNA is contained within a pair

of insertion sequences on a plasmid or a phage, the DNA

flanked by the insertion sequences may replicate and then

transfer to the recipient chromosome, without a need for

homology between donor and recipient DNA (Vo et al.,

2010).

Otherwise, entering DNA may integrate into the recipient

chromosome through homologous recombination. This

requires a ‘minimum efficient processing segment’ (MEPS)

consisting of near-identical sequences of at least 25 bp at one

or both ends of a donor segment (Shen & Huang, 1986;

Majewski & Cohan, 1999). The requirement of near identity

limits the divergence of genetic material transferred through

homologous recombination, effectively allowing only close

relatives (with o 20% nucleotide divergence) to exchange

genes in this manner (Majewski & Cohan, 1999). The MEPS

requirement has resulted in an exponential decay in the

recombination rate as sequence divergence increases, as seen

in Bacillus (Roberts & Cohan, 1993; Zawadzki et al., 1995;

Majewski & Cohan, 1999), Escherichia (Vulic et al., 1997,

1999), and Streptococcus (Majewski et al., 2000a), although

requirements of near identity may be relaxed in hypermu-

tator strains lacking mismatch repair (Matic et al., 2000).

The requirement for near identity in homologous recom-

bination is expected to reduce the transfer of homologous

adaptations, such as penicillin resistance in Streptococcus,

which occurs through a mutational change in the target

protein (the so-called ‘penicillin-binding protein’) (May-

nard Smith et al., 1991). Resistance alleles have been

transferred within the genus through homologous recombi-

nation (Maynard Smith et al., 1991; Hoffman-Roberts

et al., 2005; Barlow, 2009), and presumably, the rate of

transfer across species taxa was reduced by their sequence

divergence.

Sequence divergence of shared genes may also limit the

transfer of heterologous genes. This is because homologous

FEMS Microbiol Rev 35 (2011) 957–976 c� 2011 Federation of European Microbiological SocietiesPublished by Blackwell Publishing Ltd. All rights reserved

959Origins of diversity through horizontal transfer

recombination can facilitate the transfer of heterologous

genes, particularly when a heterologous gene is sandwiched

between homologous genes shared by the donor and the

recipient [i.e. homology-facilitated illegitimate recombina-

tion (HFIR)] (de Vries & Wackernagel, 2002). Provided that

a donor segment’s end sequences are homologous with

recipient sequences and they pass the MEPS criterion, any

genetic material sandwiched between the end segments may

be cotransferred (Majewski & Cohan, 1999).

While acquisition of novel genes by HGT is possible

between extremely divergent organisms, HGT occurs much

more frequently between close relatives than between more

distant relatives. For example, in the gammaproteobacter-

ium Legionella pneumophila, the number of genes acquired

by HGT events from a donor taxon is correlated with the

phylogenetic distance (based on 16S rRNA gene divergence)

between L. pneumophila and the donor (Coscolla et al.,

2011). More broadly, a recent analysis of 657 sequenced

prokaryotic genomes clearly showed that HGT events are

much more common among close relatives than distant

relatives (Popa et al., 2011) (Fig. 1).

A close phylogenetic relationship fosters HGT in at least

two ways. First, close relatives will have greater sequence

identity in their shared genes, thus increasing their rate of

homologous recombination (Zawadzki et al., 1995; Vulic

et al., 1997; Majewski & Cohan, 1998, 1999; Majewski et al.,

2000b), as well as HFIR (de Vries & Wackernagel, 2002).

Second, close relatives are more likely to share the same

habitat than distant relatives; even members of different

families, but the same order are more likely to share habitats

than more distant relatives (Philippot et al., 2010). Because

organisms can exchange genes only when they are in the

same habitat (Matte-Tailliez et al., 2002), phylogenetic

closeness tends to promote genetic exchange through a

sharing of environments.

What adaptations can and cannot bewidely transferred?

Beyond the requirement of similarity between donor and

recipient, there are limitations on the kinds of adaptations

that can be transferred. The various genes must fit on a

chromosomal segment short enough to be transferred or

they must fit on a mobile extrachromosomal element; also,

the set of genes must be functional when it arrives in a new

organism (Lawrence, 1999). The selfish operon model pre-

dicts that natural selection will favor the contiguous ar-

rangement of a functionally related set of genes as an

operon, enabling the gene set to be successfully transferred

as a single, functional element across taxa (Lawrence &

Roth, 1996; Lawrence, 1997, 1999). In this model, natural

selection acts at the level of the operon, preserving the

transferability of the unit, such that an operon can spread

widely across the bacteria. Indeed, certain operons have

passed between organisms at a high rate (Homma et al.,

2007).

An operon may expand to include more functions

through HGT acquisition of additional genes (Homma

et al., 2007); when this occurs, it is generally the first step in

the pathway that is gained, such as the gaining of the fucP

gene in the fucPIKUR operon in E. coli (for the digestion of

fucose) (Pal et al., 2005). In addition, regulators of newly

added genes may be gained along with the genes they control

(Price et al., 2008). The co-incorporation of newly acquired

protein-coding genes along with their regulators may in-

crease the successful transferability of an enlarged operon

(Price et al., 2008).

Transferable gene clusters often contain a set of genes that

are all involved in processing a single resource molecule.

This is exemplified by the lac operon, providing inducible

metabolism of environmental lactose (Jacob & Monod,

1961). The lac operon includes protein-coding genes for a

lactose-binding repressor, which shuts down the operon in

the absence of lactose; a lactose permease, allowing for the

uptake of lactose; and two proteins for catabolizing lactose

in different directions: one that cleaves lactose to its con-

stituent monosaccharides and another that transacetylates

lactose. Another motif for a transferable cluster is an entire

biochemical pathway, for example the full synthetic pathway

for cytochrome c biogenesis (Goldman & Kranz, 1998).

0.5

0.6

0.7

Donor-recipient pairs

Disconnected pairs

0.0

0.1

0.2

0.3

0.4

Pro

port

ion

of s

peci

es p

airs

0.1 1 10 100Genome sequence similarity (%)

Fig. 1. The relationship between genome sequence similarity and the

proportion of species pairs that engaged in HGT. The ‘donor–recipient’

pairs represent the pairs of species that were connected by at least one

HGT event, based on 32 028 HGT events that could be attributed to a

recipient and a donor organism, among 657 sequenced prokaryotic

genomes; ‘disconnected pairs’ represent those species pairs that did not

show evidence of any HGTevents. The decrease in the number of species

pairs apparently engaging in HGT among closest relatives (from 50% to

100% genome sequence similarity) was attributed to the difficulty in

identifying HGT events among close relatives (Popa et al., 2011) (used

with permission from Cold Spring Harbor Laboratory Press).

FEMS Microbiol Rev 35 (2011) 957–976c� 2011 Federation of European Microbiological SocietiesPublished by Blackwell Publishing Ltd. All rights reserved

960 J. Wiedenbeck & F.M. Cohan

In both of these cases, a contiguous set of genes constitutes a

transferable adaptation.

Multiple antibiotic resistance genes are frequently ar-

ranged in clusters on plasmids (Gomez-Lus, 1998; Barlow,

2009). While not strictly operons in the sense that the

various contiguously arranged antimicrobial-resistance

genes on a plasmid may be regulated independently

(Gomez-Lus, 1998), the contiguous organization of such

gene sets can be explained through an extension of the

selfish operon theory. The association of different antimi-

crobial resistance genes may contribute to a single, selectable

function. Carrying resistance for multiple antibiotics is a

highly successful strategy for human pathogens likely to be

targeted by multiantibiotic therapy, either simultaneously or

in a cycling regimen (Lawrence, 2000; Summers, 2006).

Surveys of the genes and adaptations that have been most

and least frequently transferred demonstrate some of these

principles of transferability. In a survey of completely

sequenced genomes, the majority of genes identified as

recently transferred were classified into four functional

categories: plasmid, phage, and transposon functions; cell

envelope functions; regulatory functions; and cellular pro-

cesses (Nakamura et al., 2004). Within the cell envelope and

regulatory function categories, commonly transferred genes

include those encoding cell surface structures, biosynthesis

and degradation of surface polysaccharides, and DNA inter-

actions. Frequently transferred cell surface proteins may be

understood as niche-transcending adaptations, as they are

often the targets of immune systems, and a newly acquired

surface protein may allow immune escape (Stein et al.,

2010). The most commonly transferred cellular process

genes were those involved in DNA transformation, patho-

genesis, toxin production, and resistance (Nakamura et al.,

2004). In the case of genes coding for pathogenesis, toxin

production, and resistance, we may interpret their transfer-

ability in terms of providing niche-transcending adaptations

for invading new environments or for resisting new chal-

lenges appearing in an organism’s present environment,

such as antibiotics.

The survey by Nakamura et al. (2004) also showed that

plasmid, phage, and transposon functions comprised nearly

a third of the transferred genes. Many of these transferred

genes are caused by infection by these subcellular particles

and provide no adaptation for bacteria. In one special case,

bacteria can acquire defense against phage through HGT of

gene segments from phage that have infected them; this is

the modus operandi of the clustered regularly interspaced

short repeats (CRISPR) system (Marraffini & Sontheimer,

2008) (Box 2). CRISPR units are comprised of a series of

short repeats separated by spacer sequences that are derived

most often from bacteriophage or plasmids (Bolotin et al.,

2005; Mojica et al., 2005; Pourcel et al., 2005), and provide

defense against the phages or plasmids from which they are

derived (Barrangou et al., 2007). The CRISPR loci are

transcribed into RNA segments, which are cleaved into

smaller units that each target the complementary sequence

on the phage (Brouns et al., 2008) by base-pairing with

either the phage’s DNA (Marraffini & Sontheimer, 2008) or

RNA (Hale et al., 2009). Thus, the CRISPR system repre-

sents a form of acquired defense, where infection of a

bacterium by a phage may be followed by the incorporation

of phage DNA into new spacer regions of the CRISPR

system, thereby providing defense against related phage in

the future (Sorek et al., 2008; van der Oost et al., 2009;

Horvath & Barrangou, 2010).

Genes that have been acquired by a particular lineage

through HGT cannot be assumed to be adaptations, and the

adaptive value of an acquired gene should be confirmed, for

example by showing that it is expressed under natural

conditions in a way that is consistent with its likely adaptive

value (Steunou et al., 2008). Acquired genes are most likely

to be nonadaptive ‘craters’ that have fallen onto the genome

under a variety of circumstances, particularly if the genes

were newly acquired. One possibility, which genome ana-

lyses abundantly support, is that many transferred genes

have entered the genome nonadaptively with invading

phage or transposons (Nakamura et al., 2004). Another

Box 2. Evolution of the CRISPR system through HGT

CRISPR modules are an acquired immunity defense of prokaryotes

against phage and plasmids, and are possessed by most Archaea and

many Bacteria (Makarova et al., 2011). Each CRISPR module within a

genome contains a number of spacer regions, each derived from the

sequence of a previously infecting phage or plasmid through HGT,

and the spacer regions are separated by repeat regions. Resistance to

phages through the CRISPR system is very precise. If the spacer

sequences are not a 100% match to the phage, the phage may

retain the ability to infect the bacterium (Barrangou et al., 2007).

Thus, the incorporation of spacer sequences into CRISPR loci

contributes to the evolutionary arms race between the bacterium

and the phage, where selective pressure imposed on the phage

through the CRISPR system may drive high evolutionary rates in the

phage (Sorek et al., 2008). Spacer regions in CRISPR loci are also

rapidly gained and lost – the most newly acquired spacer regions are

often unique to individual bacterial isolates (Pourcel et al., 2005).

Large-scale evolutionary changes have also been noted in CRISPR

systems. Although sequences of repeats in CRISPR loci vary among

different bacterial species, there are occurrences of unexpectedly

similar repeat sequences between diverse bacteria (Makarova et al.,

2002; Haft et al., 2005; Godde & Bickerton, 2006). These similarities

point to the propagation of CRISPR systems by HGT. The transfer of

CRISPR systems is hypothesized to be mediated by megaplasmids

(plasmids 4 40 kb in size), as CRISPR loci in some genomes are

located on megaplasmids rather than on the bacterial chromosome

(Godde & Bickerton, 2006). The HGTof CRISPR loci may have helped

to establish the widespread presence of CRISPR systems in bacteria

[�40% of bacterial genomes contain CRISPR loci (Godde &

Bickerton, 2006)], and indicates the importance of the CRISPR

system for phage defense.

FEMS Microbiol Rev 35 (2011) 957–976 c� 2011 Federation of European Microbiological SocietiesPublished by Blackwell Publishing Ltd. All rights reserved

961Origins of diversity through horizontal transfer

possibility is that a nonadaptive, transferred gene has

randomly reached a high frequency within a population or

a taxon by genetic drift or by periodic selection. Finally, a

gene may have been adaptive in the past, but is no longer

adaptive. This could be the case for an antibiotic resistance

gene that was adaptive when a particular antibiotic was

applied, but in the absence of the antibiotic, the gene is no

longer useful.

Among the cellular processes that are transferred the least

are informational genes, involved in translation and tran-

scription (Jain et al., 1999; Nakamura et al., 2004). Within E.

coli in particular, Davids & Zhang (2008) found genes

encoding for informational processes to be dominated by

core genes shared by all E. coli strains, with no significant

contribution from HGT. The low rates of transfer of

informational genes [as well as photosynthetic genes (Aris-

Brosou, 2005)] can be understood most generally by Riedl’s

‘burden’ hypothesis (Riedl, 1978). The burden hypothesis

predicts (for animal evolution) that organs burdened by

connections to many other organs would be slow to evolve;

their burden of complex interactions with other functional

units would make their evolution difficult. Aris-Brosou

(2005) has argued that the property of burden (or connec-

tivity) that makes adaptive evolution difficult for complex

structures also renders their HGT unlikely to succeed. More

specifically, the burden hypothesis predicts a low rate of

transfer of genes coding for parts of the transcription and

translation machinery, as these functions are burdened with

many complex interactions; the transfer of only one part of a

complex set of co-adapted structures would likely bring

about an incompatibility and loss of function (Jain et al.,

1999). The degree of burden, and thus the predicted

resistance to the transfer of a gene, may be approximated

by the number of protein–protein interactions of the gene’s

product (Wellner et al., 2007; Price et al., 2008). This

measure of a protein’s connectivity has been shown to be

an important factor for the transferability of genes across all

functional categories, in both ancient and recent transfers

(Cohen et al., 2011) (Fig. 2).

Although uncommon, the transfer of highly connected

informational genes can occur (e.g. the transfer of an rRNA

operon or a gene encoding a ribosomal protein) (Gogarten

et al., 2002; Lind et al., 2010). The burden hypothesis (or the

‘complexity’ hypothesis) predicts that for informational

genes to be successfully transferred, the recipient and the

donor should be closely related. As predicted, the fitness

costs of transferred ribosomal genes in Salmonella in experi-

mental manipulations were generally much higher from

distant relatives than from close relatives (Lind et al., 2010).

In contrast, the most phylogenetically distant transfers, such

as those between the Bacteria and the Archaea domains,

almost always involve metabolic genes (Kanhere & Vingron,

2009). This pattern is consistent with informational genes

being incompatible with the physiology and structures of

any but the most closely related organisms.

Among the most complex structures that show no

evidence of HGT are features of cell architecture, which are

profoundly different across the most anciently divergent

bacterial taxa (Cohan & Koeppel, 2008; Cohan, 2010). The

prokaryotic domains of Bacteria and Archaea differ funda-

mentally in the structure of their cell membranes, owing to

their use of ester vs. ether linkages in their membranes’ fatty

acids. Some Archaean lineages build upon the ether linkage

to produce a tetraether diglycerol structure yielding a lipid

monolayer. This monolayer structure is of great ecological

importance to hyperthermophiles, as the monolayer does

not peel apart at high temperatures, in contrast to the lipid

bilayer of Bacteria (Madigan et al., 2009). Thus, this

profound structural difference between these domains is at

least partly responsible for the ecological success of Archaea

at extremely high temperatures. While this architecture-

based ability to resist high temperature might be of value

for some Bacterial lineages, such differences in the ‘bau-

plans’ of cells, however adaptive, are not likely to be

packaged and transferred with success (Cohan & Koeppel,

2008). Because the lipid monolayer membrane interacts

with many aspects of cell physiology, including the function

of many trans-membrane proteins, transfer of such an

adaptation is unlikely, even if it were succinctly packaged as

a string of genes on a plasmid.

Fig. 2. The relationship between the number of HGT events and the

number of protein–protein interactions. Each dot represents a gene

family, and the number of protein–protein interactions for a given family

is quantified as the number of other gene families with which the given

family interacts. The negative relationship between the number of HGT

events and the protein–protein interactions was quantified with a Spear-

man correlation coefficient of �0.422, with Po 8.18�10�105 (Cohen

et al., 2011) (used with permission from Oxford University Press).

FEMS Microbiol Rev 35 (2011) 957–976c� 2011 Federation of European Microbiological SocietiesPublished by Blackwell Publishing Ltd. All rights reserved

962 J. Wiedenbeck & F.M. Cohan

Other nontransferable adaptations include architectural

structures of the Firmicutes and the Planctomyces. The

Firmicutes are built with an extremely strong cell wall,

containing multiple layers of cross-linked peptidoglycan

(Madigan et al., 2009), conferring effective and constitutive

resistance to osmotic stress (Schimel et al., 2007). The gram-

positive cell wall thus confers to the Firmicutes the ability to

thrive in drought-prone environments, which can deliver an

osmotic shock by rapidly re-wetting the cell. The Plancto-

myces build a stalked morphology, unique in the bacterial

world, which lifts the cell above the surface of sediment and

brings the cell within reach of much higher levels of

nutrients (Lawler & Brun, 2007). In all these cases, bau-

plan-level adaptations would in principle be gratefully

received by other lineages for their ecological advantages,

but they have not been transferred, presumably because such

adaptations would integrate into the existing physiology and

development of a recipient cell only with extreme difficulty,

if at all (Cohan & Koeppel, 2008; Cohan, 2010; Philippot

et al., 2010).

Accommodation of transferredadaptations

While a set of genes acquired through HGT may provide an

important adaptation to a recipient, the transfer may also

incur harmful pleiotropic effects (Cohan et al., 1994; No-

gueira et al., 2009). In the case of animals and plants, large

genetic changes often disrupt the organism’s physiology and

development (Fisher, 1958), and we should likewise expect

HGT events to be disruptive in bacteria. It is easy to imagine

that the expression of novel genes would disrupt existing

physiology, but even acquiring an adaptive allele by homo-

logous recombination could disrupt the smooth functioning

of a cell, especially in the case of antibiotic resistance alleles.

This is because many antibiotics target informational pro-

teins, which are difficult to alter, as we have discussed. Thus,

many resistance-conferring mutations in targeted informa-

tion molecules bear a severe cost to the growth rate and

competitiveness of a cell (Cohan et al., 1994; Andersson &

Levin, 1999; Levin et al., 2000).

The deleterious side effects of a new adaptation can drive

natural selection toward ‘domesticating’ the adaptation, that

is, toward ameliorating its negative fitness effects. One mode

of ameliorating the cost of a new adaptation is through

compensatory evolution, whereby natural selection favors

modifiers at other loci that compensate for the deleterious

effects of the adaptation (Cohan et al., 1994; Andersson &

Levin, 1999). These other loci are expected to code for

functions that interact with the new adaptation. For exam-

ple, a costly streptomycin-resistance allele of the S12 riboso-

mal protein locus can be ameliorated by compensatory

mutations in the L19 locus, coding for another ribosomal

protein (Maisnier-Patin et al., 2007).

The potential for compensatory evolution may be eval-

uated by observing spontaneous evolution among the

descendants of a cell that has integrated an adaptation

through HGT. This reveals the potential of new mutations

in the genome’s already-existing genes to ameliorate an

adaptation’s harmful effects (Andersson & Levin, 1999;

Levin et al., 2000). Alternatively, one may place an adaptive

gene into different genetic backgrounds within a species

taxon and observe the extent to which the fitness effect of the

adaptive gene varies between genetic backgrounds (Cohan

et al., 1994). This provides an estimate of the potential for a

species’ standing genetic variation to ameliorate the adapta-

tion. For example, placing rifampicin-resistance alleles of

the rpoB locus into different strains of Bacillus subtilis

yielded a large range of fitness costs across different genetic

backgrounds (Cohan et al., 1994).

Amelioration can also occur through change in the

adaptation itself. Newly acquired genes have higher rates of

evolution than other genes in the genome (Daubin & Och-

man, 2004; Hao & Golding, 2006; Marri et al., 2007; Davids

& Zhang, 2008). For example, Hao & Golding (2006) found

that within the Bacillaceae, recently transferred genes had

longer tree lengths and higher Ka/Ks ratios than native genes,

illustrating rapid evolution. As transferred genes persist in

the genome, the Ka/Ks ratio decreases, suggesting adaptation

of the new genes to the new environment of the host (Hao &

Golding, 2006). Additionally, horizontally transferred genes

are more likely to be regulated by multiple factors; this

complex regulation is believed to evolve after the genes are

transferred (Price et al., 2008).

Another mechanism for domesticating a horizontally

acquired adaptation involves altering the amount of protein

product of the transferred gene, by either gene amplification

or gene repression. In full-genome studies of paralogous

genes, it has been shown that genes newly acquired by

the genome are more likely to undergo gene duplication

(Hooper & Berg, 2003a). Although the reason for the

duplication of horizontally transferred genes is not well

established, it is thought that gene duplication functions to

amplify the amount of gene product for genes with subopti-

mal levels of protein product, as genes with large amounts of

protein product are rarely duplicated (Hooper & Berg, 2003b;

Lind et al., 2010). Gene amplification may be especially

important when HGT results in the homologous replacement

of essential genes. For example, Lind et al. (2010) found

that in Salmonella typhimurium, gene amplification was

common following the homologous replacement of riboso-

mal genes from a variety of sources. The growth rates of

these mutants with gene amplifications were significantly

higher than those without, indicating that the gene duplica-

tion, in this case, served as amelioration following HGT.

FEMS Microbiol Rev 35 (2011) 957–976 c� 2011 Federation of European Microbiological SocietiesPublished by Blackwell Publishing Ltd. All rights reserved

963Origins of diversity through horizontal transfer

Amelioration can also occur by way of gene repression.

Horizontally acquired sequences are often initially repressed

in the host genome by histone-like nucleoid-structuring

proteins (H-NS) (Schechter et al., 2003; Dorman, 2004,

2009; Oshima et al., 2006). H-NS appears to have the ability

to repress horizontally transferred DNA by recognizing

sequences with an aberrant GC content (Dorman, 2009). In

some strains, upwards of 40% of horizontally transferred

genes are associated with H-NS proteins (Lucchini et al.,

2006). In the case of Salmonella enterica, an Hns-like gene on

a plasmid expresses H-NS-like proteins that target horizon-

tally acquired genes only, allowing the silencing of horizon-

tally transferred genes upon transfer (Beloin et al., 2003;

Doyle et al., 2007; Banos et al., 2009).

The initial silencing of transferred genes may foster the

incorporation and domestication of transferred DNA into

the genome, as repression serves to limit the fitness costs of

expression of the novel DNA on the host cell (Dorman,

2007; Banos et al., 2009). Dorman has argued that this initial

silencing can be lessened over time as compensatory changes

occur, allowing a potential benefit of a transferred gene

while minimizing its disruptive effects (Dorman, 2009).

Finally, one other form of amelioration involves changes

in an acquired set of genes to fit the nucleotide composition

of the host genome. Organisms in different taxa above the

genus level frequently are different in their frequencies of

single nucleotides, dinucleotides, trinucleotides, and so on.

The compositional differences between a donor segment

and the recipient are diminished over time as incorporated

genes are subjected to the host’s pattern of mutation

(Lawrence & Ochman, 1998), as well as natural selection to

better integrate the acquired genes with host genome

regulation (Lercher & Pal, 2008).

HGTand ecological divergence amongclose relatives

How did the bacterial world diversify from a single common

ancestor to the hundred or more divisions of today, repre-

senting profoundly different cell architectures, metabolisms,

and ecologies? While the full answer is buried in the vast

expanses of time, we can approach a partial answer by

investigating a much more modest process, that of specia-

tion, in which one lineage splits into two irreversibly

separate lineages that are at least subtly different in ecologi-

cal requirements and capabilities. One can approach the

origins of new diversity through a genetic analysis of newly

divergent bacterial populations. The availability of full

genome sequences from closely related bacteria has enabled

microbiologists to discover the ecological differences among

closely related groups as well as the genetic differences

underlying the ecological divergence.

In cases where habitat differences suggest ecological

differentiation between close relatives, a genome-based

analysis can reveal the physiological basis for the ecological

divergence and the role of HGT. For example, a recent study

by Luo et al. (2011) aimed to find the physiological basis of

differences between E. coli clades associated with external

environments (e.g. freshwater beaches and sediments) vs.

clades associated with mammals and birds, either as com-

mensals or as pathogens. Comparisons of genome content

confirmed that clades associated with external environments

were indeed ecologically distinct from clades associated with

endosymbiont lifestyles within mammals and birds. In

contrast to commensal and pathogenic clades, the environ-

mental clades were found consistently to contain lysozyme,

for breaking down the cell walls of other bacteria in the

external environment, as well as the biochemical pathway

for diol utilization, apparently for the use of diol as an

energy substrate. The gut-associated clades were found to

have transport and metabolic capability related to various

substrates that are known to be common in the gut,

including N-acetylglucosamine, gluconate, and five- and

six-carbon sugars. Most importantly, by virtue of genome

content differences, the environmental clades are not likely

to compete well in the gut and are not expected to be of

concern for human health (Luo et al., 2011).

Likewise, closely related gut and dairy taxa within Lacto-

bacillus have acquired genes that foster adaptation to their

respective environments (O’Sullivan et al., 2009). The gut-

adapted taxon Lactobacillus acidophilus bears the gene for

maltose-6-phosphate glucosidase, for degradation of the

abundant maltose in the gut; also, the gut-adapted taxon

has a gene for bile salt hydrolase, which contributes to

resistance to bile in the gut. Both these functions are absent

in the dairy-adapted taxon Lactobacillus helveticus. The

dairy-adapted taxon has a carboxypeptidase gene not found

in the gut-adapted taxon, which contributes to survival in

environments, such as milk, where amino acid levels are low.

Genome comparisons among other extremely closely

related populations in other taxa also identify the physiolo-

gical basis of ecological divergence and point to a role for

HGT in the origins of ecological diversification. Genome

comparisons among soil populations of Pseudomonas putida

associated with polluted and unpolluted environments show

the polluted populations to hold metal resistance genes not

found in unpolluted sites (Wu et al., 2010). Also, closely

related populations within Prochlorococcus marinus have

diverged in their phosphate-acquiring adaptations: those

populations adapted to marine environments with very low

levels of phosphate have acquired a set of genes involved in

the uptake, regulation, and utilization of organic phosphates

(Martiny et al., 2009a). In studies like these, comparisons of

genome contents among close relatives point to the physio-

logical basis by which populations diverge ecologically and

FEMS Microbiol Rev 35 (2011) 957–976c� 2011 Federation of European Microbiological SocietiesPublished by Blackwell Publishing Ltd. All rights reserved

964 J. Wiedenbeck & F.M. Cohan

implicate a role for HGT as the motive force by which this

diversification has proceeded.

Genome content comparisons can also add to our knowl-

edge about the ecological dimensions by which populations

have diverged. This is seen in the case of two clades of

Synechococcus in a Yellowstone hot spring: one clade asso-

ciated with hotter, upstream environments closest to the

source of the spring (the A clade, 60–65 1C) and the other

associated with cooler, downstream environments (B0 clade,

55–61 1C). Comparison of the genome content of these

groups showed that beyond differences in their temperature

optima (Allewalt et al., 2006), the groups were also different

in their abilities to take up and store mineral nutrients

(Bhaya et al., 2007). The genomes suggest that the cooler,

downstream clade can accommodate lower concentrations

of mineral nutrients (owing to consumption by upstream

populations) by being able to take up phosphonate and store

nitrogen, adaptations acquired by HGT.

Metagenomic approaches have also yielded evidence for

HGT in the origin of ecological diversity. A shotgun

sequencing of random clones from the microbial mat of a

Yellowstone hot spring indicated an adaptive divergence

among extremely closely related populations of Synechococcus

(Bhaya et al., 2007). Whereas the complete genome of one

isolate showed no genome content dedicated to transport of

the reduced ferrous ion, several closely related organisms

identified from the metagenome contained the ferrous trans-

port genes feoA and feoB. This suggested that some popula-

tions, but not others are able to scavenge reduced iron during

the nighttime when the mat becomes anoxic (Bhaya et al.,

2007). Thus, niche adaptation strategies can be inferred by

comparing metagenome sequences with an ‘anchor’ isolate

whose genome has been fully sequenced.

Genome-based analyses have also shown how ecological

diversity can emerge from changes other than HGT, through

changes in existing genes. Petersen et al. (2007) searched the

set of genes shared by six E. coli strains for genes whose

amino acid divergence was accelerated by natural selection.

This search identified several genes coding for cell surface

proteins, and the amino acids under selection were consis-

tently found to be in the extracellular region, indicating that

diversification has resulted from the arms race between the

bacteria and their enemies, such as phage and the host

immune system.

Analyses of genome-wide gene expression have also

helped to characterize how changes in existing genes have

led to niche partitioning among close relatives. Denef et al.

(2010) found that one clade of the Archaean Leptospirillum,

which is typically a late colonizer of acid-mine pools, differs

from a closely related, early-colonizing clade in the expres-

sion of several shared proteins that may be important in

growth at low nutrient concentrations, including proteins

for cobalamin synthesis and glycine cleavage. It will be

fascinating to find the extent to which ecological differences

among close relatives are caused by the horizontal acquisi-

tion of novel genes vs. changes in the sequences and

expression patterns of shared genes.

In summary, genome-based comparisons have confirmed

that closely related populations are ecologically distinct; they

have identified the ecological dimensions by which the

populations have diverged; and they have determined the

genetic and physiological bases of ecological divergence,

including the role of HGT. Approaches adopted up to now,

however, have not directly addressed the origins of species,

by which one lineage irreversibly splits into two lineages, as

we next discuss.

HGTand the origins of bacterial species

To discover the ecological and genetic changes that occur

during speciation, it is not enough to compare closely

related species taxa or even close relatives within a species

taxon. To investigate the dynamics of lineage splitting and

the origin of irreversible separateness, we need to identify

the most newly divergent, ecologically distinct populations.

Focusing on these most newly divergent populations will

allow us to identify the features responsible for the earliest

splitting of lineages. If we instead compare more divergent

lineages, we cannot distinguish the features responsible for

lineage splitting from those features added on after the

lineages have become established. Studies of the speciation

process in bacteria have not yet attempted to identify and

compare the genomes of the most recent products of

speciation.

The established systematics of bacteria does not help

identify the most newly divergent, irreversibly separate,

ecologically distinct bacterial populations. The problem is

that the taxa recognized as species by bacterial systematics

are extremely broadly defined, with enormous levels of

divergence in physiology, genome content, and most im-

portantly, ecology (Welch et al., 2002; Whittam & Bum-

baugh, 2002; Tettelin et al., 2005; Lefebure & Stanhope,

2007; Rasko et al., 2008; Touchon et al., 2009; Paul et al.,

2010). For example, isolates within the species taxon E. coli

show huge differences in habitat (Walk et al., 2007, 2009)

and genome content (Welch et al., 2002; Touchon et al.,

2009; Luo et al., 2011). While there are increasing numbers

of studies comparing the genomes of close relatives within

the recognized species taxa, little attention has been paid as

to whether the populations being compared represent the

most newly divergent, irreversibly separate, ecologically

distinct populations.

One approach to identifying such populations takes into

account the dynamic properties that are widely understood

to characterize species, at least outside of the systematics of

bacteria (de Queiroz, 2005). Foremost among these dynamic

FEMS Microbiol Rev 35 (2011) 957–976 c� 2011 Federation of European Microbiological SocietiesPublished by Blackwell Publishing Ltd. All rights reserved

965Origins of diversity through horizontal transfer

properties is that species are cohesive, in the sense that

genetic diversity within a species is limited by a force of

evolution (Templeton, 1989; Cohan & Perry, 2007). In

contrast, different species are viewed to be irreversibly

separate, with no cohesion holding them together. Also,

species are expected to be ecologically distinct, which allows

the species to coexist into the indefinite future. Finally, each

species is expected to be invented only once (de Queiroz,

2005).

The forces of species cohesion in bacteria potentially

include genetic exchange between populations (Papke et al.,

2007; Sheppard et al., 2008; Fraser et al., 2009; Retchless &

Lawrence, 2010), genetic drift (Wernegreen & Moran, 1999;

Kuo et al., 2009), and periodic selection, a diversity-purging

process occurring in populations with low recombination

rates (Koch, 1974; Levin, 1981; Cohan, 1994). Genetic

exchange has been widely considered to be a cohesive force

preventing ecological divergence in obligately sexual animals

and plants; in some models, permanent, adaptive divergence

between animal and plant populations is impossible unless

barriers to genetic exchange have evolved (Mayr, 1963;

Coyne & Orr, 2004), although the necessity of barriers to

genetic exchange in animal and plant speciation has been

recently challenged (Dieckmann et al., 2004; Mallet, 2008).

In the case of bacteria, the rate of exchange of genes

between populations, even between differently adapted

populations of the same microhabitat, is extremely low. This

is because the flow of genes between populations is limited

by the bacterial rate of recombination, which is consistently

near the mutation rate (Vos & Didelot, 2009). At such low

rates of gene flow between populations, genetic exchange is

not a significant force preventing ecological divergence

between bacterial populations (Cohan, 1994; Cohan &

Koeppel, 2008) (Box 1).

Genetic drift and periodic selection are the most likely

forces of cohesion within bacterial species. Cohesion by

these forces is limited to the set of ecologically similar

organisms within an ‘ecotype’ (Kopac & Cohan, 2011).

Genetic drift is most likely to be an important force of

cohesion for ecotypes with low effective population sizes, for

example, obligate endosymbionts that are transmitted be-

tween hosts in extremely small numbers (Wernegreen &

Moran, 1999). Periodic selection can act to purge the

diversity within any ecotype, regardless of its population

size (Levin & Bergstrom, 2000). In a periodic selection

event, an adaptive mutation or a gene acquisition by HGT

within an ecotype outcompetes to extinction all other

members of the ecotype; owing to the low recombination

rates in bacteria, selection favoring the adaptive mutation or

recombinant can bring nearly the entire genome sequence of

the mutant cell to 100% frequency within the ecotype

(Cohan, 2005). Thus, the diversity within an ecotype is

ephemeral, lasting only until the next periodic selection

event (Fig. 3).

The origin of permanent diversity requires the origin of a

new, ecologically distinct population, that is, the origin of an

Fig. 3. The dynamics of ecotype formation and periodic selection within an ecotype. Circles represent different genotypes, and asterisks represent

adaptive mutations. (a) Ecotype-formation event. A mutation or a recombination event allows the cell to occupy a new ecological niche, founding a new

ecotype. A new ecotype can be formed only if the founding organism has undergone a fitness trade-off, whereby it cannot compete successfully with

the parental ecotype in the old niche. (b) Periodic selection event. A periodic selection mutation improves the fitness of an individual such that the

mutant and its descendants outcompete all other cells within the ecotype; these mutations do not affect the diversity within other ecotypes because

ecological differences between ecotypes prevent direct competition. Periodic selection leads to the distinctness of ecotypes by purging the divergence

within, but not between ecotypes. (Used with permission from Landes Publishers.)

FEMS Microbiol Rev 35 (2011) 957–976c� 2011 Federation of European Microbiological SocietiesPublished by Blackwell Publishing Ltd. All rights reserved

966 J. Wiedenbeck & F.M. Cohan

ecotype (Cohan & Perry, 2007). A new ecotype is formed

when an organism invades a new ecological niche, by virtue

of a mutation or an HGT event. The diversity-purging

power of an ecotype’s periodic selection events cannot reach

beyond the ecological similarity of ecotype members; the

ecological distinctness across ecotypes will generally protect

an ecotype from periodic selection stemming from another

ecotype (Fig. 3). An ecotype may be most precisely under-

stood as the domain of competitive superiority of an

adaptive mutant (Cohan, 1994).

The formation of a new ecotype requires that the new

niche-invading organism must be able to coexist with the

parental ecotype. This requires a fitness trade-off, where the

success of the niche-invading organism in its new niche

comes at a cost, usually in its ability to compete with the

parental ecotype in the old niche. The fitness trade-off leads

to two alternatively specialized ecotypes that can coexist

(Fig. 4). A different dynamic ensues when a mutant or a

recombinant organism acquires the ability to use new

resources at no cost in the old niche. In this case, the new

organism becomes more of an ecological generalist, and will

simply outcompete the members of its ecotype to extinction,

and no new ecotype is created (Fig. 4).

Thus, the origin of bacterial ecotypes that can coexist can

be viewed as the origin of species: a nascent ecotype is

irreversibly separate from the parental ecotype from which it

is derived, because the divergence of different ecotypes is not

limited either by periodic selection (Fig. 3) or by recombi-

nation (Box 1); also, each ecotype is genetically cohesive,

owing to the effect of genetic drift and/or periodic selection

in purging the diversity among the ecologically interchange-

able membership. Ecotypes thus hold the dynamic attributes

of species, and the origin of species can be understood by

exploring the origins of ecological divergence among newly

formed ecotypes.

Fortunately, the most newly divergent ecotypes can

potentially be identified even when we do not know the

ecological dimensions by which they have diverged,

Event Consequence

(a)

(b)

A

B

Ecotype 1 Ecotype 1

Ecotype 1 Ecotype 1 Ecotype 2

Loss

Ecotypeformation

Periodicselection

Fig. 4. The consequences of a change in ecological niche following an HGT event. Acquisition of a new ecological function by HGT (indicated by the

green triangle) can lead to either a periodic selection event (a) or an ecotype formation event (b). (a) The new ecological function is added to the

ecological repertoire of the recipient strain, and the resultant strain is now able to outcompete the membership of its ecotype by virtue of its greater

repertoire. An example of such a niche-expanding HGT event would be the acquisition of one more antibiotic resistance factor in a human pathogen

that is already resistant to a number of antibiotics; this would expand the set of clinical conditions under which the strain could succeed. (b) Gain of one

function is incompatible with an existing function or reduces the performance of an existing function. Thus, acquisition of the new ecological capability

comes at the expense of some pre-existing function. In this case, acquisition of the new function leads to a new ecotype, which can coexist with the pre-

existing ecotype. An example of this would be the acquisition of a new symbiosis plasmid in Rhizobium, conferring the recipient with the ability to infect

a new plant species; however, two symbiosis plasmids within a cell are incompatible. Thus, the plasmid transfer event would create a splitting of the

ecotype into two ecotypes. Alternatively, a mutation to better utilize a new carbon source could lead to the invention of a new ecotype, provided that

the mutation causes lower performance in utilizing the old carbon source. This has been seen repeatedly in experimental populations of Escherichia coli

that primarily used glucose for carbon; a mutation to utilize secreted acetate created a new ecotype because the acetate-utilizing bacteria were less able

to utilize glucose (Treves et al., 1998).

FEMS Microbiol Rev 35 (2011) 957–976 c� 2011 Federation of European Microbiological SocietiesPublished by Blackwell Publishing Ltd. All rights reserved

967Origins of diversity through horizontal transfer

provided that a particular model of bacterial speciation is

assumed (Cohan & Perry, 2007). Under the Stable Ecotype

model (Fig. 5a), ecotypes are formed rarely and each ecotype

endures many periodic selection events within its lifetime.

This model therefore allows the most newly divergent

ecotypes to accumulate a unique set of neutral sequence

mutations since diverging from their most recent common

ancestor; moreover, the recurrent periodic selection events

ensure that each ecotype appears as a sequence cluster of

very close relatives, based on a phylogeny of any gene in the

genome. Different ecotypes will therefore appear to be much

more distantly related than members of the same ecotype

and will correspond to sequence clusters (Fig. 5a).

For decades, sequence clustering has formed the basis for

discovering ecotypes. For example, a diversity of ecotypes

was hypothesized within the species taxon Mycobacterium

bovis based on the clustering of rapidly evolving sequences,

and these clusters were confirmed to be ecologically distinct

on the basis of differences in host range (Smith et al., 2006).

Most closely related ecotypes have similarly been hypothe-

sized and independently confirmed to be ecologically dis-

tinct in many other taxa, including the photosynthetic

marine cyanobacteria of Prochlorococcus (Martiny et al.,

2009b), on the basis of temperature and nitrogen require-

ments.

The demarcation of ecotypes can be aided by theory-

based algorithms, which do not rely on an investigator’s

intuition about the phylogenetic size of a sequence cluster

most likely to correspond to an ecotype. The algorithms

ECOTYPE SIMULATION (Koeppel et al., 2008) and ADAPTML (Hunt

et al., 2008) infer ecotype demarcations using universal

molecular methods, generally interpreting ecotypes as se-

quence clusters for any gene shared among organisms. These

algorithms have inferred ecotypes from sequence data in

various bacterial systems, and the ecotypes have consistently

been confirmed to be ecologically distinct. For example,

ECOTYPE SIMULATION has identified extremely closely related

ecotypes in soil Bacillus that are ecologically distinct in their

ABDABC ABD Abc aBc abC

Stable ecotype

(a) (b) (c)

Nano-niche Recurrentniche invasion

ABC ABC

Fig. 5. Models of bacterial speciation. Ecotypes are represented by different colors; periodic selection events are indicated by asterisks and extinct

lineages are represented by dashed lines. The letters represent the resources that each group of organisms can utilize. In cases where ecotypes utilize the

same set of resources, but in different proportions, the predominant resource of each ecotype is denoted by a capital letter. (a) The Stable Ecotype

model. The Stable Ecotype model is marked by a much higher rate of periodic selection than ecotype formation, such that each ecotype endures many

periodic selection events during its lifetime. The Stable Ecotype model generally yields a one-to-one correspondence between ecotypes and sequence

clusters. The ecotypes are able to coexist indefinitely because each has a resource not shared with the other. (b) The Nano-Niche model of bacterial

speciation. In the figure, there are three Nano-Niche ecotypes (denoted by Abc, aBc, and abC) that use the same set of resources, but in different

proportions. Each Nano-Niche ecotype can coexist with the other two because they have partitioned their resources, at least quantitatively. However,

because the ecotypes share all their resources, each is vulnerable to a possible speciation-quashing mutation that may occur in the other ecotypes. This

could be a mutation that increases efficiency in the utilization of all resources. These speciation-quashing mutations are indicated by a large asterisk;

each of these extinguishes the other Nano-Niche ecotypes. Thus, in the Nano-Niche model, cohesion can cut across ecologically distinct populations,

provided that they are only quantitatively different in their resource utilization. (c) The Recurrent Niche Invasion model. Here, a lineage may move,

frequently and recurrently, from one ecotype to another, usually by acquisition and loss of niche-determining plasmids. In the figure, the red lines

indicate the times in which a lineage is in the plasmid-containing ecotype; the blue lines indicate the times when the lineage is in the plasmid-absent

ecotype. Periodic selection events within one ecotype extinguish only the lineages of the same ecotype. For example, in the most ancient periodic

selection event shown, which is in the plasmid-absent (blue) ecotype, only the lineages missing the plasmid at the time of periodic selection are

extinguished, while the plasmid-containing lineages (red) persist. Ecotypes determined by a plasmid are not likely to be discoverable as sequence clusters

(Cohan, 2011).

FEMS Microbiol Rev 35 (2011) 957–976c� 2011 Federation of European Microbiological SocietiesPublished by Blackwell Publishing Ltd. All rights reserved

968 J. Wiedenbeck & F.M. Cohan

associations with solar exposure (Koeppel et al., 2008), soil

texture (Connor et al., 2010), rhizospheres, elevation, and

salinity (S. Kopac, unpublished data), as well as in hot spring

Synechococcus that are ecologically distinct in their associa-

tions with horizontal (temperature and nutrient availability)

and vertical (light level and quality) dimensions of the mat

(Ward et al., 2006; Melendrez et al., 2011), and in the

pathogen L. pneumophila, where ecotypes differ in their

amoebic host ranges and in their gene expression patterns

(Cohan et al., 2006); ADAPTML has identified ecotypes of

marine Vibrio that are associated with different particle sizes

and with seasons (Hunt et al., 2008).

These most closely related ecotypes, whether discovered

by an intuitive or an algorithmic demarcation, are the key to

discovering the ecological dimensions by which lineages

split and the genetic basis of ecotype formation, as well as

the role of HGT in the origins of ecological diversity.

Genome-based comparisons of the most closely related

ecotypes could confirm that the ecotypes are ecologically

different, and could reveal the ecological dimensions of

ecotype formation; moreover, they could identify the role

of HGT in the origins of species. If genome investigations

could be directed in the future toward comparing popula-

tions identified by algorithms such as ECOTYPE SIMULATION and

ADAPTML to be the most closely related, ecologically distinct

populations, we would make better progress toward study-

ing the dynamics of bacterial speciation and identifying all

the ecological diversity within bacterial taxa of interest

(Cohan & Perry, 2007). To our knowledge, clades that have

been identified as most closely related ecotypes have yet to

be compared by genome-based analyses.

Genome comparisons among close relatives can also test

the validity of alternative models of speciation. Of foremost

importance is the need to test whether bacterial ecotypes are

cohesive, and genome comparisons provide an opportunity

to do this. Doolittle & Zhaxybayeva (2009) have argued that

there is no fundamental reason why bacterial species should

be cohesive, and they suggest that frequent HGT events may

continuously cause a lineage to switch from one ecological

niche to another. In the most extreme form of this argu-

ment, ecological interchangeability may extend only as far as

a bacterium and its immediate offspring.

Doolittle and Zhaxybayeva (2009) have likely exaggerated

the importance of HGT in fostering ecological diversity

among closest relatives (Kopac & Cohan, 2011). While

closest relatives are inevitably different in their rosters of

genes, careful analyses of genome content differences have

shown that nearly all HGT events among closest relatives

involve transfers of genes not known to be involved in the

ecological divergence of bacteria, such as phage-related

genes, genes for transposition activity, and genes whose

function is not known (the genes of so-called ‘hypothetical’

function) (Touchon et al., 2009; Wiedenbeck, 2011).

Nevertheless, the phylogenetic groups identified as eco-

types can be cohesive only if their members are ecologically

interchangeable, as genetic drift and periodic selection can

purge diversity only within populations of the same ecolo-

gical niche; it is therefore essential to test whether the

members of an identified ecotype are ecologically inter-

changeable (Kopac & Cohan, 2011). Genome-based com-

parisons, whether focusing on genome content differences,

positive selection on shared genes, or changes in gene

expression, can test whether the members of a putative

ecotype (identified by sequence-based algorithms) are eco-

logically interchangeable and can identify the phylogenetic

breadth of ecological interchangeability.

We have recently tested for ecological interchangeability

within one putative ecotype hypothesized by ECOTYPE SIMULA-

TION and ADAPTML, to our knowledge the only test of

ecological interchangeability at this small phylogenetic scale

(Wiedenbeck, 2011). This was an analysis of five strains

from a putative ecotype (PE15) within B. subtilis ssp.

spizizenii, including four strains isolated from a Death Valley

canyon (Connor et al., 2010) and one well-characterized

reference strain (Zeigler et al., 2008). The five strains were

shown to be heterogeneous in their genome content, largely

in genes related to phage and transposition as well as genes

classified as hypothetical (Wiedenbeck, 2011). Remarkably,

none of the remaining unshared genes provided any novel

ecological functions; all these genes were either duplicates of

genes already shared by every strain or they were novel genes

contributing to functions already possessed by all strains.

For example, all five strains had multiple genes involved in

the transport and utilization of maltose, and additional

genes unique to some strains were either copies of existing

maltose genes or they provided additional capability for

metabolizing maltose. Every nonphage, nontransposing,

nonhypothetical gene that was unique to a subclade of

strains within our sample shared this pattern seen for

maltose.

Thus, while the putative ecotype’s members appear not to

be ecologically interchangeable, the heterogeneity appears to

be limited. We hypothesized that the divergence among

genome types represents a kind of quantitative divergence,

where each strain sampled has its own ecological niche, but

no strain has a unique resource that protects it completely

from competition (Wiedenbeck, 2011). In this case, the

various members of PE15 may utilize maltose under differ-

ent conditions, and each strain’s unique set of maltose genes

may allow that strain to utilize maltose best under its own

optimal conditions.

This kind of quantitative divergence among ecotypes is

different from the Stable Ecotype model, where each ecotype

is more completely protected from competition from other

ecotypes because each ecotype has some unique resources

(Fig. 5a). In the case of the variants within PE15, each

FEMS Microbiol Rev 35 (2011) 957–976 c� 2011 Federation of European Microbiological SocietiesPublished by Blackwell Publishing Ltd. All rights reserved

969Origins of diversity through horizontal transfer

quantitatively different ecotype may have its own periodic

selection events, but it is still possible that a periodic

selection event in one ecotype could drive the other ecotypes

to extinction, as in the Nano-Niche model of bacterial

speciation (Cohan & Perry, 2007) (Fig. 5b). Thus, while the

putative ecotype analyzed here turned out to be ecologically

heterogeneous, it still may be cohesive owing to the incom-

plete and quantitative nature of the ecological divergence

among populations. This study shows the potential for gene

content comparisons to study both the phylogenetic breadth

of ecological interchangeability and the extent to which a

demarcated, putative ecotype forms a cohesive unit.

An alternative model of bacterial speciation, the Recur-

rent Niche Invasion model, takes into account the role of

mobile genetic elements, such as plasmids or phage, in

determining bacterial niches (Cohan & Perry, 2007)

(Fig. 5c). For example, in the case of Rhizobium, a bacterial

lineage may acquire a symbiosis plasmid that adapts it as an

endosymbiotic mutualist to one set of legume hosts; then

the lineage may lose that plasmid and acquire another

symbiosis plasmid, thereby adapting it to another set of

legumes (Segovia et al., 1991). Also, a pathogenic lineage

may acquire a multiple-resistance plasmid, which will adapt

it to habitats where antibiotics are free flowing, and then lose

it when living in habitats where antibiotics are rarely used.

In either case, a cell can be converted from one ecotype to

another by acquiring and/or losing a niche-specifying plas-

mid or phage, and so a lineage moves back and forth

between memberships in different, previously existing eco-

types. In contrast to chromosomally based ecotypes, eco-

types that are distinguished only by the presence of different

plasmids cannot be discovered as sequence clusters in genes

unrelated to ecological divergence (Cohan, 2011) (Fig. 5c).

We therefore cannot predict and demarcate plasmid-based

ecotypes on the basis of sequence divergence in randomly

chosen chromosomal genes. The ecological diversity created

by plasmids can be discovered only by screening for the

functions known or expected to be coded by the plasmids

ahead of time. As we next discuss, such a functional

approach has been successful in identifying ecological diver-

sity in antibiotic resistance factors, which are commonly

carried and transferred on plasmids (Fondi & Fani, 2010).

The antibiotic resistance of our future

What are the resistance genes of our future? To find the

resistance factors most likely to enter human pathogens

through HGT in the near future, we should take into

account that environmental and phylogenetic proximity of

two organisms contribute to their sharing of genes, as well as

the complexity and compatibility of resistance factors.

One approach screens for resistance factors with environ-

mental proximity to potential human pathogens; here,

bacteria associated with humans have been surveyed for

their antibiotic resistance genes. Sommer et al. (2009)

recently expressed DNA cloned from the human micro-

biome and selected clones for resistance to various antibio-

tics. This approach does not require homology or sequence

similarity of resistance genes to any known resistance factor.

Indeed, Sommer and colleagues found a broad diversity of

resistance factors that were extremely divergent from, and

some not even homologous with, previously known factors

(Box 3). It is not understood why these abundant resistance

factors have not yet made their way into E. coli and other

familiar bacteria in our guts. While this approach identifies

possible resistance factors in our future, it is important to

note that the proximity of these factors to possible human

pathogens might not be enough to foster their transfer.

Beyond searching for antibiotic resistance genes in the

human microbiome, it will be informative to screen the

metagenomes of other organisms from which we frequently

acquire pathogens, including agricultural animals, mice,

ticks, and mosquitoes. We will increasingly be able to

identify the organism sources of resistance genes discovered

in this way as more bacterial genome sequences become

Box 3. Identification of antibiotic resistance factors in human gut

bacteria

The widespread occurrence of antibiotic resistance factors among

bacteria, as well as their propagation by HGT, has led to an increased

interest in characterizing the possible reservoirs for resistance factors

(Riesenfeld et al., 2004; D’Costa et al., 2006). As the majority of

microorganisms are not presently cultivable, metagenomic analyses

of these possible reservoirs may supplement previous analyses of

resistance factors present in bacterial isolates. Sommer et al. (2009)

analyzed a possible resistance reservoir using a metagenomic analysis

of human microbial communities. The authors isolated DNA from

human saliva and fecal samples, and created clone libraries from

these DNA isolates. The clones were then screened for resistance to a

number of antibiotics, and the metagenomic inserts conferring

antibiotic resistance were sequenced and annotated. Interestingly,

many of the resistance inserts identified in this way did not match

previously characterized antibiotic resistance genes (Sommer et al.,

2009). Some of these genes encoded proteins that were identical to

proteins annotated as hypothetical. This finding is especially

intriguing, as it indicates that many hypothetical or poorly

characterized proteins from sequenced bacterial genomes may

indeed be resistance factors. Identifying these proteins will not only

help to further characterize modes of antibiotic resistance in bacteria,

but will also help to improve sequence annotation by identifying the

functions of hypothetical proteins. The findings of the Sommer and

colleagues study indicated that metagenomic analyses are a

successful way to identify new resistance factors. The human

microbiome, however, is only one of many possible reservoirs for

antibiotic resistance. Other important sources for resistance factors

include agricultural sources, water environments (Baquero et al.,

2008), and soil (Riesenfeld et al., 2004). Further investigations into

these reservoirs, including metagenomic analyses, will help to better

characterize resistance factors likely to be medically significant.

FEMS Microbiol Rev 35 (2011) 957–976c� 2011 Federation of European Microbiological SocietiesPublished by Blackwell Publishing Ltd. All rights reserved

970 J. Wiedenbeck & F.M. Cohan

available. Those resistance factors that are in phylogenetic

and environmental proximity to human pathogens will be of

greatest concern (Matte-Tailliez et al., 2002).

As we discover increasing numbers of resistance factors,

we may test them for their efficacy in human pathogens. (Or

for safety purposes, we may test them in closely related,

nonpathogenic model systems.) This would include quanti-

tative measures of the resistance benefit as well as the fitness

cost of resistance factors, perhaps by following the high-

throughput approach of Sorek et al. (2007) for introducing

thousands of genes into a strain and measuring the fitness

effects of gene acquisition. It will be interesting to observe

the extent to which antibiotic resistance genes tend to be

specialized to different phylogenetic groups of pathogens

and how easily a resistance factor can co-evolve with a new

pathogen to overcome the initial incompatibility between

resistance and organism.

In summary, predicting our future of antibiotic resistance

will involve identifying the resistance factors that are within

reach by HGT of our species’ pathogens. It will also be

important to use the approaches outlined here to determine

which of these resistance factors are compatible with our

pathogens and which can become compatible through

ameliorative evolution.

The population biology approaches we have outlined may

also help to identify all the bacteria whose acquisition of

antibiotic resistance may harm us and our agricultural

dependents. We have illustrated how the broadly defined

species taxa of bacterial systematics can obscure within them

multiple clades differing in their pathogenic properties;

predicting the next epidemic may require going beyond the

existing taxonomy, through discovering new pathogenic

ecotypes within the established taxa (Cohan & Perry, 2007)

and by not being distracted by ecotypes that are not

pathogenic (Cohan et al., 2006; Smith et al., 2006; Luo

et al., 2011). We suggest using the universal approaches

described here to discover all the ecotypes within known

pathogenic taxa and to characterize their ecological capabil-

ities through genome comparisons. The discovery of poten-

tial pathogens by these universal approaches, followed by

surveillance of their acquisitions of antibiotic resistance, will

be an important preemptive public health strategy.

Acknowledgements

This work was funded by a NASA-funded Connecticut

Space Grant to J.W. and by Wesleyan University research

funds to F.M.C. We thank Jessica Sherry for alerting us to

several examples of niche-specifying HGT events. We thank

Fernando Baquero and Dieter Haas, as well as two anon-

ymous reviewers, for their valuable suggestions for improv-

ing the manuscript.

References

Allewalt JP, Bateson MM, Revsbech NP, Slack K & Ward DM

(2006) Effect of temperature and light on growth of and

photosynthesis by Synechococcus isolates typical of those

predominating in the octopus spring microbial mat

community of Yellowstone National Park. Appl Environ Microb

72: 544–550.

Anantha RP, McVeigh AL, Lee LH, Agnew MK, Cassels FJ, Scott

DA, Whittam TS & Savarino SJ (2004) Evolutionary and

functional relationships of colonization factor antigen i and

other class 5 adhesive fimbriae of enterotoxigenic Escherichia

coli. Infect Immun 72: 7190–7201.

Andersson DI & Levin BR (1999) The biological cost of antibiotic

resistance. Curr Opin Microbiol 2: 489–493.

Aris-Brosou S (2005) Determinants of adaptive evolution at the

molecular level: the extended complexity hypothesis. Mol Biol

Evol 22: 200–209.

Banos RC, Vivero A, Aznar S, Garcıa J, Pons M, Madrid C &

Juarez A (2009) Differential regulation of horizontally

acquired and core genome genes by the bacterial modulator

H-NS. PLoS Genet 5: e1000513.

Baquero F, Martinez JL & Canton R (2008) Antibiotics and

antibiotic resistance in water environments. Curr Opin Biotech

19: 260–265.

Barlow M (2009) What antimicrobial resistance has taught us

about horizontal gene transfer. Methods Mol Biol 532:

397–411.

Barrangou R, Fremaux C, Deveau H, Richards M, Boyaval P,

Moineau S, Romero DA & Horvath P (2007) CRISPR provides

acquired resistance against viruses in prokaryotes. Science 315:

1709–1712.

Beloin C, Deighan P, Doyle M & Dorman CJ (2003) Shigella

flexneri 2a strain 2457T expresses three members of the

H-NS-like protein family: characterization of the Sfh protein.

Mol Genet Genomics 270: 66–77.

Bhaya D, Grossman AR, Steunou AS et al. (2007) Population level

functional diversity in a microbial community revealed by

comparative genomic and metagenomic analyses. ISME J 1:

703–713.

Bolotin A, Quinquis B, Sorokin A & Ehrlich SD (2005) Clustered

regularly interspaced short palindrome repeats (CRISPRs)

have spacers of extrachromosomal origin. Microbiology 151:

2551–2561.

Brouns SJ, Jore MM, Lundgren M et al. (2008) Small CRISPR

RNAs guide antiviral defense in prokaryotes. Science 321:

960–964.

Budroni S, Siena E, Hotopp JC et al. (2011) Neisseria meningitidis

is structured in clades associated with restriction modification

systems that modulate homologous recombination. P Natl

Acad Sci USA 108: 4494–4499.

Chistoserdova L, Vorholt JA, Thauer RK & Lidstrom ME (1998)

C1 transfer enzymes and coenzymes linking methylotrophic

bacteria and methanogenic Archaea. Science 281: 99–102.

FEMS Microbiol Rev 35 (2011) 957–976 c� 2011 Federation of European Microbiological SocietiesPublished by Blackwell Publishing Ltd. All rights reserved

971Origins of diversity through horizontal transfer

Cohan FM (1994) The effects of rare but promiscuous genetic

exchange on evolutionary divergence in prokaryotes. Am

Naturalist 143: 965–986.

Cohan FM (2001) Bacterial species and speciation. Syst Biol 50:

513–524.

Cohan FM (2002) Sexual isolation and speciation in bacteria.

Genetica 116: 359–370.

Cohan FM (2005) Periodic selection and ecological diversity in

bacteria. Selective Sweep (Nurminsky D, ed), pp. 78–93. Landes

Bioscience, Georgetown, TX.

Cohan FM (2010) Synthetic biology: now that we’re creators,

what should we create? Curr Biol 20: R675–R677.

Cohan FM (2011) Are species cohesive? – A view from

bacteriology. Bacterial Population Genetics: A Tribute to

Thomas S Whittam (Walk S & Feng P, eds), pp. 43–65.

American Society for Microbiology Press, Washington, DC.

Cohan FM & Koeppel AF (2008) The origins of ecological

diversity in prokaryotes. Curr Biol 18: R1024–R1034.

Cohan FM & Perry EB (2007) A systematics for discovering the

fundamental units of bacterial diversity. Curr Biol 17:

R373–R386.

Cohan FM, King EC & Zawadzki P (1994) Amelioration of the

deleterious pleiotropic effects of an adaptive mutation in

Bacillus subtilis. Evolution 48: 81–95.

Cohan FM, Koeppel A & Krizanc D (2006) Sequence-based

discovery of ecological diversity within Legionella. Legionella:

State of the Art 30 Years after Its Recognition (Cianciotto NP,

Abu Kwaik Y & Edelstein PH, et al, eds), pp. 367–376. ASM

Press, Washington, DC.

Cohen O, Gophna U & Pupko T (2011) The complexity

hypothesis revisited: connectivity rather than function

constitutes a barrier to horizontal gene transfer. Mol Biol Evol

28: 1481–1489.

Connor N, Sikorski J, Rooney AP et al. (2010) The ecology of

speciation in Bacillus. Appl Environ Microb 76: 1349–1358.

Coscolla M, Comas I & Gonzalez-Candelas F (2011) Quantifying

nonvertical inheritance in the evolution of Legionella

pneumophila. Mol Biol Evol 28: 985–1001.

Coyne JA & Orr HA (2004) Speciation. Sinauer Associates,

Sunderland, MA.

Datta N & Kontomichalou P (1965) Penicillinase synthesis

controlled by infectious R factors in Enterobacteriaceae.

Nature 208: 239–241.

Daubin V & Ochman H (2004) Bacterial genomes as new gene

homes: the genealogy of ORFans in E. coli. Genome Res 14:

1036–1042.

Davids W & Zhang Z (2008) The impact of horizontal gene

transfer in shaping operons and protein interaction

networks–direct evidence of preferential attachment. BMC

Evol Biol 8: 23.

D’Costa VM, McGrann KM, Hughes DW & Wright GD (2006)

Sampling the antibiotic resistome. Science 311: 374–377.

Denef VJ, Kalnejais LH, Mueller RS, Wilmes P, Baker BJ, Thomas

BC, VerBerkmoes NC, Hettich RL & Banfield JF (2010)

Proteogenomic basis for ecological divergence of closely

related bacteria in natural acidophilic microbial communities.

P Natl Acad Sci USA 107: 2383–2390.

de Queiroz K (2005) Ernst Mayr and the modern concept of

species. P Natl Acad Sci USA 102 (suppl 1): 6600–6607.

de Vries J & Wackernagel W (2002) Integration of foreign DNA

during natural transformation of Acinetobacter sp. by

homology-facilitated illegitimate recombination. P Natl Acad

Sci USA 99: 2094–2099.

Dieckmann U, Metz JAJ, Doebeli M & Tautz D (2004)

Introduction. Adaptive Speciation (Dieckmann U, Doebeli M,

Metz JAJ & Tautz D, eds), pp. 1–16. Cambridge University

Press, Cambridge.

Doolittle WF & Zhaxybayeva O (2009) On the origin of

prokaryotic species. Genome Res 19: 744–756.

Dorman CJ (2004) H-NS: a universal regulator for a dynamic

genome. Nat Rev Microbiol 2: 391–400.

Dorman CJ (2007) H-NS, the genome sentinel. Nat Rev Microbiol

5: 157–161.

Dorman CJ (2009) Regulatory integration of horizontally-

transferred genes in bacteria. Front Biosci 14: 4103–4112.

Doyle M, Fookes M, Ivens A, Mangan MW, Wain J & Dorman CJ

(2007) An H-NS-like stealth protein aids horizontal DNA

transmission in bacteria. Science 315: 251–252.

Drake JW (2009) Avoiding dangerous missense: thermophiles

display especially low mutation rates. PLoS Genet 5: e1000520.

Fisher RA (1958) The Genetical Theory of Natural Selection.

Dover, New York.

Fondi M & Fani R (2010) The horizontal flow of the plasmid

resistome: clues from inter-generic similarity networks.

Environ Microbiol 12: 3228–3242.

Fraser C, Hanage WP & Spratt BG (2007) Recombination and the

nature of bacterial speciation. Science 315: 476–480.

Fraser C, Alm EJ, Polz MF, Spratt BG & Hanage WP (2009) The

bacterial species challenge: making sense of genetic and

ecological diversity. Science 323: 741–746.

Garcia-Vallve S, Romeu A & Palau J (2000) Horizontal gene

transfer of glycosyl hydrolases of the rumen fungi. Mol Biol

Evol 17: 352–361.

Godde JS & Bickerton A (2006) The repetitive DNA elements

called CRISPRs and their associated genes: evidence of

horizontal transfer among prokaryotes. J Mol Evol 62:

718–729.

Gogarten JP, Doolittle WF & Lawrence JG (2002) Prokaryotic

evolution in light of gene transfer. Mol Biol Evol 19:

2226–2238.

Goldman BS & Kranz RG (1998) Evolution and horizontal

transfer of an entire biosynthetic pathway for cytochrome c

biogenesis: Helicobacter, Deinococcus, Archae and more. Mol

Microbiol 27: 871–873.

Gomez-Lus R (1998) Evolution of bacterial resistance to

antibiotics during the last three decades. Int Microbiol 1:

279–284.

Groisman EA & Ochman H (1996) Pathogenicity islands:

bacterial evolution in quantum leaps. Cell 87: 791–794.

FEMS Microbiol Rev 35 (2011) 957–976c� 2011 Federation of European Microbiological SocietiesPublished by Blackwell Publishing Ltd. All rights reserved

972 J. Wiedenbeck & F.M. Cohan

Haft DH, Selengut J, Mongodin EF & Nelson KE (2005) A guild

of 45 CRISPR-associated (Cas) protein families and multiple

CRISPR/Cas subtypes exist in prokaryotic genomes. PLoS

Comput Biol 1: e60.

Haldane JBS (1932) The Causes of Evolution. Longmans, Green,

and Co., London.

Hale CR, Zhao P, Olson S, Duff MO, Graveley BR, Wells L, Terns

RM & Terns MP (2009) RNA-guided RNA cleavage by a

CRISPR RNA-Cas protein complex. Cell 139: 945–956.

Hamilton HL & Dillard JP (2006) Natural transformation of

Neisseria gonorrhoeae: from DNA donation to homologous

recombination. Mol Microbiol 59: 376–385.

Hao W & Golding GB (2006) The fate of laterally transferred

genes: life in the fast lane to adaptation or death. Genome Res

16: 636–643.

Hoffman-Roberts H, Babcock E & Mitropoulos I (2005)

Investigational new drugs for the treatment of resistant

pneumococcal infections. Expert Opin Inv Drug 14: 973–995.

Homma K, Fukuchi S, Nakamura Y, Gojobori T & Nishikawa K

(2007) Gene cluster analysis method identifies horizontally

transferred genes with high reliability and indicates that they

provide the main mechanism of operon gain in 8 species of

gamma-Proteobacteria. Mol Biol Evol 24: 805–813.

Hooper SD & Berg OG (2003a) Duplication is more common

among laterally transferred genes than among indigenous

genes. Genome Biol 4: R48.

Hooper SD & Berg OG (2003b) On the nature of gene

innovation: duplication patterns in microbial genomes. Mol

Biol Evol 20: 945–954.

Horvath P & Barrangou R (2010) CRISPR/Cas, the immune

system of bacteria and archaea. Science 327: 167–170.

Hunt DE, David LA, Gevers D, Preheim SP, Alm EJ & Polz MF

(2008) Resource partitioning and sympatric differentiation

among closely related bacterioplankton. Science 320:

1081–1085.

Jacob F & Monod J (1961) Genetic regulatory mechanisms in the

synthesis of proteins. J Mol Biol 3: 318–356.

Jain R, Rivera MC & Lake JA (1999) Horizontal gene transfer

among genomes: the complexity hypothesis. P Natl Acad Sci

USA 96: 3801–3806.

Jeltsch A (2003) Maintenance of species identity and controlling

speciation of bacteria: a new function for restriction/

modification systems? Gene 317: 13–16.

Jensen EC, Schrader HS, Rieland B, Thompson TL, Lee KW,

Nickerson KW & Kokjohn TA (1998) Prevalence of broad-

host-range lytic bacteriophages of Sphaerotilus natans,

Escherichia coli, and Pseudomonas aeruginosa. Appl Environ

Microb 64: 575–580.

Kanhere A & Vingron M (2009) Horizontal gene transfers in

prokaryotes show differential preferences for metabolic and

translational genes. BMC Evol Biol 9: 9.

Koch AL (1974) The pertinence of the periodic selection

phenomenon to prokaryote evolution. Genetics 77: 127–142.

Koeppel A, Perry EB, Sikorski J et al. (2008) Identifying the

fundamental units of bacterial diversity: a paradigm shift to

incorporate ecology into bacterial systematics. P Natl Acad Sci

USA 105: 2504–2509.

Kopac S & Cohan FM (2011) A theory-based pragmatism for

discovering and classifying newly divergent bacterial species.

Genetics and Evolution of Infectious Diseases (Tibayrenc M, ed),

pp. 21–41. Elsevier, London.

Kruger DH & Bickle TA (1983) Bacteriophage survival: multiple

mechanisms for avoiding the deoxyribonucleic acid restriction

systems of their hosts. Microbiol Rev 47: 345–360.

Kuo CH, Moran NA & Ochman H (2009) The consequences of

genetic drift for bacterial genome complexity. Genome Res 19:

1450–1454.

Lacks SA & Springhorn SS (1984) Transfer of recombinant

plasmids containing the gene for DpnII DNA methylase into

strains of Streptococcus pneumoniae that produce DpnI or

DpnII restriction endonucleases. J Bacteriol 158: 905–909.

Lawler ML & Brun YV (2007) Advantages and mechanisms of

polarity and cell shape determination in Caulobacter

crescentus. Curr Opin Microbiol 10: 630–637.

Lawrence J (1999) Selfish operons: the evolutionary impact of

gene clustering in prokaryotes and eukaryotes. Curr Opin

Genet Dev 9: 642–648.

Lawrence J (2000) Clustering of antibiotic resistance genes:

Beyond the selfish operon. ASM News 66: 281–286.

Lawrence JG (1997) Selfish operons and speciation by gene

transfer. Trends Microbiol 5: 355–359.

Lawrence JG & Ochman H (1998) Molecular archaeology of the

Escherichia coli genome. P Natl Acad Sci USA 95: 9413–9417.

Lawrence JG & Roth JR (1996) Selfish operons: horizontal

transfer may drive the evolution of gene clusters. Genetics 143:

1843–1860.

Lefebure T & Stanhope MJ (2007) Evolution of the core and pan-

genome of Streptococcus: positive selection, recombination,

and genome composition. Genome Biol 8: R71.

Lercher MJ & Pal C (2008) Integration of horizontally transferred

genes into regulatory interaction networks takes many million

years. Mol Biol Evol 25: 559–567.

Levin BR (1981) Periodic selection, infectious gene exchange and

the genetic structure of E. coli populations. Genetics 99: 1–23.

Levin BR & Bergstrom CT (2000) Bacteria are different:

observations, interpretations, speculations, and opinions

about the mechanisms of adaptive evolution in prokaryotes.

P Natl Acad Sci USA 97: 6981–6985.

Levin BR, Perrot V & Walker N (2000) Compensatory mutations,

antibiotic resistance and the population genetics of adaptive

evolution in bacteria. Genetics 154: 985–997.

Lind PA, Tobin C, Berg OG, Kurland CG & Andersson DI (2010)

Compensatory gene amplification restores fitness after inter-

species gene replacements. Mol Microbiol 75: 1078–1089.

Lorenz MG & Wackernagel W (1994) Bacterial gene transfer by

natural genetic transformation in the environment. Microbiol

Rev 58: 563–602.

Lucchini S, Rowley G, Goldberg MD, Hurd D, Harrison M &

Hinton JC (2006) H-NS mediates the silencing of laterally

acquired genes in bacteria. PLoS Pathog 2: e81.

FEMS Microbiol Rev 35 (2011) 957–976 c� 2011 Federation of European Microbiological SocietiesPublished by Blackwell Publishing Ltd. All rights reserved

973Origins of diversity through horizontal transfer

Luo C, Walk ST, Gordon DM, Feldgarden M, Tiedje JM &

Konstantinidis KT (2011) Genome sequencing of

environmental Escherichia coli expands understanding of the

ecology and speciation of the model bacterial species. P Natl

Acad Sci USA 108: 7200–7205.

Madigan MT, Martinko JM, Dunlap PV & Clark DP (2009) Brock

Biology of Microorganisms. 12th edn. Pearson Benjamin

Cummings, San Francisco.

Maisnier-Patin S, Paulander W, Pennhag A & Andersson DI

(2007) Compensatory evolution reveals functional

interactions between ribosomal proteins S12, L14 and L19.

J Mol Biol 366: 207–215.

Majewski J (2001) Sexual isolation in bacteria. FEMS Microbiol

Lett 199: 161–169.

Majewski J & Cohan FM (1998) The effect of mismatch repair

and heteroduplex formation on sexual isolation in Bacillus.

Genetics 148: 13–18.

Majewski J & Cohan FM (1999) DNA sequence similarity

requirements for interspecific recombination in Bacillus.

Genetics 153: 1525–1533.

Majewski J, Zawadzki P, Pickerill P, Cohan FM & Dowson CG

(2000a) Barriers to genetic exchange between bacterial species:

Streptococcus pneumoniae transformation. J Bacteriol 182:

1016–1023.

Majewski J, Zawadzki P, Pickerill P, Cohan FM & Dowson CG

(2000b) Barriers to genetic exchange between bacterial species:

Streptococcus pneumoniae transformation. J Bacteriol 182:

1016–1023.

Makarova KS, Aravind L, Grishin NV, Rogozin IB & Koonin EV

(2002) A DNA repair system specific for thermophilic Archaea

and bacteria predicted by genomic context analysis. Nucleic

Acids Res 30: 482–496.

Makarova KS, Haft DH, Barrangou R et al. (2011) Evolution and

classification of the CRISPR-Cas systems. Nat Rev Microbiol 9:

467–477.

Mallet J (2008) Hybridization, ecological races and the nature of

species: empirical evidence for the ease of speciation. Philos T

Roy Soc B 363: 2971–2986.

Mallet J, Beltran M, Neukirchen W & Linares M (2007) Natural

hybridization in heliconiine butterflies: the species boundary

as a continuum. BMC Evol Biol 7: 28.

Marraffini LA & Sontheimer EJ (2008) CRISPR interference

limits horizontal gene transfer in staphylococci by targeting

DNA. Science 322: 1843–1845.

Marri PR, Hao W & Golding GB (2007) The role of laterally

transferred genes in adaptive evolution. BMC Evol Biol 7

(suppl 1): S8.

Martiny AC, Huang Y & Li W (2009a) Occurrence of phosphate

acquisition genes in Prochlorococcus cells from different ocean

regions. Environ Microbiol 11: 1340–1347.

Martiny AC, Tai AP, Veneziano D, Primeau F & Chisholm SW

(2009b) Taxonomic resolution, ecotypes and the biogeography

of Prochlorococcus. Environ Microbiol 11: 823–832.

Matic I, Taddei F & Radman M (2000) No genetic barriers

between Salmonella enterica serovar Typhimurium and

Escherichia coli in SOS-induced mismatch repair-deficient

cells. J Bacteriol 182: 5922–5924.

Matte-Tailliez O, Brochier C, Forterre P & Philippe H (2002)

Archaeal phylogeny based on ribosomal proteins. Mol Biol Evol

19: 631–639.

Maynard Smith JM, Dowson CG & Spratt BG (1991) Localized

sex in bacteria. Nature 349: 29–31.

Mayr E (1963) Animal Species and Evolution. Belknap Press of

Harvard University Press, Cambridge.

McDaniel LD, Young E, Delaney J, Ruhnau F, Ritchie KB & Paul

JH (2010) High frequency of horizontal gene transfer in the

oceans. Science 330: 50.

Melendrez MC, Lange RK, Cohan FM & Ward DM (2011)

Influence of molecular resolution on sequence-based

discovery of ecological diversity among Synechococcus

populations in an alkaline siliceous hot spring microbial mat.

Appl Environ Microb 77: 1359–1367.

Mojica FJ, Dıez Villasenor C, Garcıa-Martınez J & Soria E (2005)

Intervening sequences of regularly spaced prokaryotic repeats

derive from foreign genetic elements. J Mol Evol 60: 174–182.

Nakamura Y, Itoh T, Matsuda H & Gojobori T (2004) Biased

biological functions of horizontally transferred genes in

prokaryotic genomes. Nat Genet 36: 760–766.

Nogueira T, Rankin DJ, Touchon M, Taddei F, Brown SP & Rocha

EP (2009) Horizontal gene transfer of the secretome drives the

evolution of bacterial cooperation and virulence. Curr Biol 19:

1683–1691.

Norman A, Hansen LH & Sorensen SJ (2009) Conjugative

plasmids: vessels of the communal gene pool. Philos T Roy Soc

B 364: 2275–2289.

O’Brien TF (2002) Emergence, spread, and environmental effect

of antimicrobial resistance: how use of an antimicrobial

anywhere can increase resistance to any antimicrobial

anywhere else. Clin Infect Dis 34 (suppl 3): S78–S84.

Ochman H & Davalos LM (2006) The nature and dynamics of

bacterial genomes. Science 311: 1730–1733.

Ochman H, Lawrence JG & Groisman EA (2000) Lateral gene

transfer and the nature of bacterial innovation. Nature 405:

299–304.

Oshima T, Ishikawa S, Kurokawa K, Aiba H & Ogasawara N

(2006) Escherichia coli histone-like protein H-NS preferentially

binds to horizontally acquired DNA in association with RNA

polymerase. DNA Res 13: 141–153.

O’Sullivan O, O’Callaghan J, Sangrador-Vegas A et al. (2009)

Comparative genomics of lactic acid bacteria reveals a niche-

specific gene set. BMC Microbiol 9: 50.

Pal C, Papp B & Lercher MJ (2005) Adaptive evolution of

bacterial metabolic networks by horizontal gene transfer. Nat

Genet 37: 1372–1375.

Papke RT, Zhaxybayeva O, Feil EJ, Sommerfeld K, Muise D &

Doolittle WF (2007) Searching for species in haloarchaea.

P Natl Acad Sci USA 104: 14092–14097.

Paul S, Dutta A, Bag SK, Das S & Dutta C (2010) Distinct,

ecotype-specific genome and proteome signatures in the

marine cyanobacteria Prochlorococcus. BMC Genomics 11: 103.

FEMS Microbiol Rev 35 (2011) 957–976c� 2011 Federation of European Microbiological SocietiesPublished by Blackwell Publishing Ltd. All rights reserved

974 J. Wiedenbeck & F.M. Cohan

Petersen L, Bollback JP, Dimmic M, Hubisz M & Nielsen R (2007)

Genes under positive selection in Escherichia coli. Genome Res

17: 1336–1343.

Philippot L, Andersson SG, Battin TJ, Prosser JI, Schimel JP,

Whitman WB & Hallin S (2010) The ecological coherence of

high bacterial taxonomic ranks. Nat Rev Microbiol 8: 523–529.

Popa O, Hazkani-Covo E, Landan G, Martin W & Dagan T

(2011) Directed networks reveal genomic barriers and DNA

repair bypasses to lateral gene transfer among prokaryotes.

Genome Res 21: 599–609.

Pourcel C, Salvignol G & Vergnaud G (2005) CRISPR elements in

Yersinia pestis acquire new repeats by preferential uptake of

bacteriophage DNA, and provide additional tools for

evolutionary studies. Microbiology 151: 653–663.

Price MN, Dehal PS & Arkin AP (2008) Horizontal gene transfer

and the evolution of transcriptional regulation in Escherichia

coli. Genome Biol 9: R4.

Rasko DA, Rosovitz MJ, Myers GS et al. (2008) The pangenome

structure of Escherichia coli: comparative genomic analysis of

E. coli commensal and pathogenic isolates. J Bacteriol 190:

6881–6893.

Rest JS & Mindell DP (2003) Retroids in archaea: phylogeny and

lateral origins. Mol Biol Evol 20: 1134–1142.

Retchless AC & Lawrence JG (2010) Phylogenetic incongruence

arising from fragmented speciation in enteric bacteria. P Natl

Acad Sci USA 107: 11453–11458.

Riedl R (1978) Order in Living Systems. Wiley, Chichester.

Rieseberg LH, Raymond O, Rosenthal DM et al. (2003) Major

ecological transitions in wild sunflowers facilitated by

hybridization. Science 301: 1211–1216.

Riesenfeld CS, Goodman RM & Handelsman J (2004)

Uncultured soil bacteria are a reservoir of new antibiotic

resistance genes. Environ Microbiol 6: 981–989.

Roberts MS & Cohan FM (1993) The effect of DNA sequence

divergence on sexual isolation in Bacillus. Genetics 134: 401–408.

Schechter LM, Jain S, Akbar S & Lee CA (2003) The small

nucleoid-binding proteins H-NS, HU, and Fis affect hilA

expression in Salmonella enterica serovar Typhimurium. Infect

Immun 71: 5432–5435.

Schimel J, Balser TC & Wallenstein M (2007) Microbial stress-

response physiology and its implications for ecosystem

function. Ecology 88: 1386–1394.

Segovia L, Pinero D, Palacios R & Martınez-Romero E (1991)

Genetic structure of a soil population of nonsymbiotic

Rhizobium leguminosarum. Appl Environ Microb 57: 426–433.

Shen P & Huang HV (1986) Homologous recombination in

Escherichia coli: dependence on substrate length and

homology. Genetics 112: 441–457.

Sheppard SK, McCarthy ND, Falush D & Maiden MC (2008)

Convergence of Campylobacter species: implications for

bacterial evolution. Science 320: 237–239.

Smith NH, Kremer K, Inwald J, Dale J, Driscoll JR, Gordon SV,

van Soolingen D, Hewinson RG & Smith JM (2006) Ecotypes

of the Mycobacterium tuberculosis complex. J Theor Biol 239:

220–225.

Sommer MO, Dantas G & Church GM (2009) Functional

characterization of the antibiotic resistance reservoir in the

human microflora. Science 325: 1128–1131.

Sorek R, Zhu Y, Creevey CJ, Francino MP, Bork P & Rubin EM

(2007) Genome-wide experimental determination of barriers

to horizontal gene transfer. Science 318: 1449–1452.

Sorek R, Kunin V & Hugenholtz P (2008) CRISPR – a widespread

system that provides acquired resistance against phages in

bacteria and archaea. Nat Rev Microbiol 6: 181–186.

Stein DC, Patrone JB & Bish S (2010) Innate immune recognition

of Neisseria meningitidis and Neisseria gonorrhoeae. Neisseria:

Molecular Mechanisms of Pathogenesis (Genco C & Wetzler L,

eds), pp. 95–122. Caister Academic, Norwich.

Steunou AS, Jensen SI, Brecht E et al. (2008) Regulation of nif

gene expression and the energetics of N2 fixation over the diel

cycle in a hot spring microbial mat. ISME J 2: 364–378.

Summers AO (2006) Genetic linkage and horizontal gene

transfer, the roots of the antibiotic multi-resistance problem.

Anim Biotechnol 17: 125–135.

Templeton A (1989) The meaning of species and speciation: a

genetic perspective. Speciation and its Consequences (Otte D &

Endler J, eds), pp. 3–27. Sinauer Associates, Sunderland, MA.

Tettelin H, Masignani V, Cieslewicz MJ et al. (2005) Genome

analysis of multiple pathogenic isolates of Streptococcus

agalactiae: implications for the microbial ‘pan-genome’. P Natl

Acad Sci USA 102: 13950–13955.

Touchon M, Hoede C, Tenaillon O et al. (2009) Organised

genome dynamics in the Escherichia coli species results in

highly diverse adaptive paths. PLoS Genet 5: e1000344.

Treves DS, Manning S & Adams J (1998) Repeated evolution of an

acetate-crossfeeding polymorphism in long-term populations

of Escherichia coli. Mol Biol Evol 15: 789–797.

van der Oost J, Jore MM, Westra ER, Lundgren M & Brouns SJ

(2009) CRISPR-based adaptive and heritable immunity in

prokaryotes. Trends Biochem Sci 34: 401–407.

Vo AT, van Duijkeren E, Gaastra W & Fluit AC (2010)

Antimicrobial resistance, class 1 integrons, and genomic island

1 in Salmonella isolates from Vietnam. PLoS One 5: e9440.

Vos M & Didelot X (2009) A comparison of homologous

recombination rates in bacteria and archaea. ISME J 3:

199–208.

Vulic M, Dionisio F, Taddei F & Radman M (1997) Molecular

keys to speciation: DNA polymorphism and the control of

genetic exchange in enterobacteria. P Natl Acad Sci USA 94:

9763–9767.

Vulic M, Lenski RE & Radman M (1999) Mutation,

recombination, and incipient speciation of bacteria in the

laboratory. P Natl Acad Sci USA 96: 7348–7351.

Walk ST, Alm EW, Calhoun LM, Mladonicky JM & Whittam TS

(2007) Genetic diversity and population structure of

Escherichia coli isolated from freshwater beaches. Environ

Microbiol 9: 2274–2288.

Walk ST, Alm EW, Gordon DM, Ram JL, Toranzos GA, Tiedje JM

& Whittam TS (2009) Cryptic lineages of the genus

Escherichia. Appl Environ Microb 75: 6534–6544.

FEMS Microbiol Rev 35 (2011) 957–976 c� 2011 Federation of European Microbiological SocietiesPublished by Blackwell Publishing Ltd. All rights reserved

975Origins of diversity through horizontal transfer

Ward DM, Bateson MM, Ferris MJ, Kuhl M, Wieland A, Koeppel

A & Cohan FM (2006) Cyanobacterial ecotypes in the

microbial mat community of Mushroom Spring (Yellowstone

National Park, Wyoming) as species-like units linking

microbial community composition, structure and function.

Philos T Roy Soc B 361: 1997–2008.

Weiserova M & Ryu J (2008) Characterization of a restriction

modification system from the commensal Escherichia coli

strain A0 34/86 (O83:K24:H31). BMC Microbiol 8: 106.

Welch RA, Burland V, Plunkett G III et al. (2002) Extensive

mosaic structure revealed by the complete genome sequence of

uropathogenic Escherichia coli. P Natl Acad Sci USA 99:

17020–17024.

Wellner A, Lurie MN & Gophna U (2007) Complexity,

connectivity, and duplicability as barriers to lateral gene

transfer. Genome Biol 8: R156.

Wernegreen JJ & Moran NA (1999) Evidence for genetic drift in

endosymbionts (Buchnera): analyses of protein-coding genes.

Mol Biol Evol 16: 83–97.

Whittam TS & Bumbaugh AC (2002) Inferences from whole-

genome sequences of bacterial pathogens. Curr Opin Genet

Dev 12: 719–725.

Wiedenbeck JK (2011) Genomic and ecolological heterogeneity

among extremely close relatives in Bacillus. MSc Thesis,

Wesleyan University, Middletown, CT.

Wilkins BM, Chilley PM, Thomas AT & Pocklington MJ (1996)

Distribution of restriction enzyme recognition sequences on

broad host range plasmid RP4: molecular and evolutionary

implications. J Mol Biol 258: 447–456.

Wu X, Monchy S, Taghavi S, Zhu W, Ramos J & van der Lelie D

(2010) Comparative genomics and functional analysis of

niche-specific adaptation in Pseudomonas putida. FEMS

Microbiol Rev 35: 299–323.

Zawadzki P & Cohan FM (1995) The size and continuity of DNA

segments integrated in Bacillus transformation. Genetics 141:

1231–1243.

Zawadzki P, Roberts MS & Cohan FM (1995) The log-linear

relationship between sexual isolation and sequence

divergence in Bacillus transformation is robust. Genetics 140:

917–932.

Zeigler DR, Pragai Z, Rodriguez S, Chevreux B, Muffler A, Albert

T, Bai R, Wyss M & Perkins JB (2008) The origins of 168, W23,

and other Bacillus subtilis legacy strains. J Bacteriol 190:

6983–6995.

FEMS Microbiol Rev 35 (2011) 957–976c� 2011 Federation of European Microbiological SocietiesPublished by Blackwell Publishing Ltd. All rights reserved

976 J. Wiedenbeck & F.M. Cohan