8
COMMENTARY The molecular cytogenetics of plants J. S. HESLOP-HARRISON Karyobiology Group, Cell Biology Department, John Innes Centre for Plant Science Research, Colney Lane, Norwich NR4 7UH, UK Introduction Over the last ten years, major advances have been made in the molecular understanding of plant genomes -both with respect to cloning of genes and with better insight into the structural organization of the DNA. The results achieved now enable the molecular genetics of defined systems to be tackled directly, and the directed manipulation of gene expression during development - whether of flower development (see Coen, 1991), wound responses (e.g. see Stanford et al. 1990) or phytohormome action (e.g. see Huttly and Baulcombe, 1989). The increasing size of the sequence database permits comparison of the DNA from different species through coding regions, introns and repetitive sequences. It ia not always easy to recognize fully many of the answers to major biological questions that have been published following the plethora of technical developments that have enabled the research - in cloning and transformation, PCR (polymerase chain reaction), transposon tagging, and in sequence analysis. In the rapid acquisition of information about DNA sequences, a gap has appeared between our knowledge of the genome at the molecular level and that at the chromosome level. Molecular cytogenetics is now permit- ting the linking of molecular genetics with cytogenetics, using a combination of tools from both disciplines, and, in particular, direct in situ hybridization of labelled probes to interphase nuclei and chromosomes prepared for light (Trask, 1991) or electron microscopy (Leitch et al. 1990). The work is providing novel answers about genome and gene organization, gene activities and even meiotic recombination. Here, I aim to discuss some of the recent progress in this area from the viewpoint of a plant cytogeneticist. Many of my examples are taken from two contrasting groups of species: the small-grained cereals are grasses, and include barley, rye and wheat, while Arabidopsis thaliana, usually referred to as Arabidopsis, is a small rapidly growing species in the Brassica family. The Nucleotype Cells of different eukaryotic plants are involved in the same set of basic processes, and probably contain DNA that encodes about 50000 genes. Higher animals have a similar number of genes, if those related to nervous and immunological systems are excluded. Assuming that each gene encodes a protein of average length 300 amino acids, and that introns, regulatory or other sequences flanking the DNA encoding these genes are about 500 base pairs Journal of Cell Science 100, 15-21 (1991) Printed in Great Britain © The Company of Biologists Limited 1991 (bp) for each gene, the minimum number of base pairs of DNA required is some 80 000 000 bp, or 80 Mbp (megabase pairs). Various trees (including horse chestnut, Aesculus hippocastanum, and oaks, Quercus) and Arabidopsis species have amounts of DNA in their nucleus that are little greater than this value (Bennett and Smith, 1976; Bennett et al. 1982); the nematode Caenorhabditis elegans has a similar genome size. At the opposite extreme of DNA contents, the diploid lily Fritillaria davisii has some 86 000 Mbp. Between the extremes lie diploid species such as mung bean {Vigna sinensis; 530 Mbp), rice (Oryza sativa; 960 Mbp), human (Homo sapiens; 3000 Mbp), barley (Hordeum vulgare; 5300 Mbp), rye (Secale cereale; 7000 Mbp) and wheat (Triticum aestivum; 17 000 Mbp). The vast range of DNA contents, and the lack of correlation with the perceived complexity of an organism, make compari- sons at the DNA level between species difficult. There must be significant differences in the organization and positioning of the genes, and in the non-coding DNA that is responsible for most of the variation between the species. The huge variation in DNA amounts also necessitates differences in technical approaches to genome studies. For example, in Arabidopsis, the complete genome is cloned in yeast artificial chromosomes (YACs) and available as a library in a few dozen 96-well microtitre plates (Ward and Jen, 1990; Grill and Somerville, 1991). In the small-grained cereals, the molecular genetics is restricted to a few dozen cloned genes (e.g. see Futers et al. 1990), augmented by a few hundred repetitive DNA clones (see Flavell, 1985) and short cloned DNA sequences that detect restriction fragment length polymorphisms (RFLPs; e.g. see Liu et al. 1991). The molecular genetics of rice and maize, with intermediate genome sizes, is slightly more extensive (see e.g. Coe, E. (ed.), 1991; Maize Genetics Cooperation Newsletter, 65; unpublished), but still limited compared with that in Arabidopsis. Structural sequences of chromosomes The number of chromosomes, which in the diploid species mentioned above ranges from 10 to 46, has little correlation with the species DNA content. Chromosome structure, in both plants and animals, is similar, with a few exceptions. The chromatin, visualized in whole nuclear preparations using the light or electron micro- scope, represents the DNA double helix wrapped around Key words: cytogenetics, plant genome, genome organization. 15

COMMENTARY The molecular cytogenetics of plantsCOMMENTARY The molecular cytogenetics of plants J. S. HESLOP-HARRISON Karyobiology Group, Cell Biology Department, John Innes Centre

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

  • COMMENTARY

    The molecular cytogenetics of plants

    J. S. HESLOP-HARRISON

    Karyobiology Group, Cell Biology Department, John Innes Centre for Plant Science Research, Colney Lane, Norwich NR4 7UH, UK

    Introduction

    Over the last ten years, major advances have been made inthe molecular understanding of plant genomes -both withrespect to cloning of genes and with better insight into thestructural organization of the DNA. The results achievednow enable the molecular genetics of defined systems to betackled directly, and the directed manipulation of geneexpression during development - whether of flowerdevelopment (see Coen, 1991), wound responses (e.g. seeStanford et al. 1990) or phytohormome action (e.g. seeHuttly and Baulcombe, 1989). The increasing size of thesequence database permits comparison of the DNA fromdifferent species through coding regions, introns andrepetitive sequences. It ia not always easy to recognizefully many of the answers to major biological questionsthat have been published following the plethora oftechnical developments that have enabled the research -in cloning and transformation, PCR (polymerase chainreaction), transposon tagging, and in sequence analysis.

    In the rapid acquisition of information about DNAsequences, a gap has appeared between our knowledge ofthe genome at the molecular level and that at thechromosome level. Molecular cytogenetics is now permit-ting the linking of molecular genetics with cytogenetics,using a combination of tools from both disciplines, and, inparticular, direct in situ hybridization of labelled probes tointerphase nuclei and chromosomes prepared for light(Trask, 1991) or electron microscopy (Leitch et al. 1990).The work is providing novel answers about genome andgene organization, gene activities and even meioticrecombination. Here, I aim to discuss some of the recentprogress in this area from the viewpoint of a plantcytogeneticist. Many of my examples are taken from twocontrasting groups of species: the small-grained cerealsare grasses, and include barley, rye and wheat, whileArabidopsis thaliana, usually referred to as Arabidopsis,is a small rapidly growing species in the Brassica family.

    The Nucleotype

    Cells of different eukaryotic plants are involved in thesame set of basic processes, and probably contain DNAthat encodes about 50000 genes. Higher animals have asimilar number of genes, if those related to nervous andimmunological systems are excluded. Assuming that eachgene encodes a protein of average length 300 amino acids,and that introns, regulatory or other sequences flankingthe DNA encoding these genes are about 500 base pairsJournal of Cell Science 100, 15-21 (1991)Printed in Great Britain © The Company of Biologists Limited 1991

    (bp) for each gene, the minimum number of base pairs ofDNA required is some 80 000 000 bp, or 80 Mbp (megabasepairs). Various trees (including horse chestnut, Aesculushippocastanum, and oaks, Quercus) and Arabidopsisspecies have amounts of DNA in their nucleus that arelittle greater than this value (Bennett and Smith, 1976;Bennett et al. 1982); the nematode Caenorhabditis eleganshas a similar genome size. At the opposite extreme of DNAcontents, the diploid lily Fritillaria davisii has some 86 000Mbp. Between the extremes lie diploid species such asmung bean {Vigna sinensis; 530 Mbp), rice (Oryza sativa;960 Mbp), human (Homo sapiens; 3000 Mbp), barley(Hordeum vulgare; 5300 Mbp), rye (Secale cereale; 7000Mbp) and wheat (Triticum aestivum; 17 000 Mbp). The vastrange of DNA contents, and the lack of correlation withthe perceived complexity of an organism, make compari-sons at the DNA level between species difficult. Theremust be significant differences in the organization andpositioning of the genes, and in the non-coding DNA thatis responsible for most of the variation between thespecies. The huge variation in DNA amounts alsonecessitates differences in technical approaches to genomestudies. For example, in Arabidopsis, the complete genomeis cloned in yeast artificial chromosomes (YACs) andavailable as a library in a few dozen 96-well microtitreplates (Ward and Jen, 1990; Grill and Somerville, 1991). Inthe small-grained cereals, the molecular genetics isrestricted to a few dozen cloned genes (e.g. see Futers et al.1990), augmented by a few hundred repetitive DNA clones(see Flavell, 1985) and short cloned DNA sequences thatdetect restriction fragment length polymorphisms(RFLPs; e.g. see Liu et al. 1991). The molecular genetics ofrice and maize, with intermediate genome sizes, is slightlymore extensive (see e.g. Coe, E. (ed.), 1991; Maize GeneticsCooperation Newsletter, 65; unpublished), but still limitedcompared with that in Arabidopsis.

    Structural sequences of chromosomes

    The number of chromosomes, which in the diploid speciesmentioned above ranges from 10 to 46, has littlecorrelation with the species DNA content. Chromosomestructure, in both plants and animals, is similar, with afew exceptions. The chromatin, visualized in wholenuclear preparations using the light or electron micro-scope, represents the DNA double helix wrapped around

    Key words: cytogenetics, plant genome, genome organization.

    15

  • the nucleosome core and then coiled several times;metaphase and interphase chromosomes differ in one levelof coiling (Manuelidis and Chen, 1990). Hence examin-ation of chromosomes at the structural level in one speciescan give more general conclusions. Nevertheless, there aresignificant, and not fully explained, differences betweenplant and animal chromosomes; for example, only thelatter show banding patterns after particular treatments(Greilhuber, 1977), which presumably reflects differencesin DNA organization, protein associations or replication.

    At metaphase, cytological preparations of chromosomesshow the telomeres ('ends') and primary constrictions atthe centromeres. The telomere is now among the bestcharacterized DNA sequences at the molecular level andconsists of a short tandemly repeated DNA sequence(where the same 6 or 7 base pair unit is repeated manyhundreds of times; Zakian, 1989). In Arabidopsis, theconsensus sequence is TTTAGGG (Richards and Ausubel,1988), and in situ hybridization of the labelled telomeresequence localizes the sequence to the physical ends ofcereal chromosomes (Schwarzacher and Heslop-Harrison,1991). The telomeric DNA does not have a Watson-Crickstructure, and it is not replicated by a template-dependentDNA polymerase, but rather by a reverse transcriptasewhere the RNA template is an integral part of theribonucleoprotein (a telomerase; e.g. see Blackburn, 1991).Some of the information revealed by in situ localization ofthe telomere sequence would be very difficult to obtain byother methods. For example, the location at everychromosome end, and a few intercalary sites, would bedifficult to conclude from Southern hybridization data, andthe variation in copy number of the telomere repeat unitfrom cell to cell and chromosome to chromosome could notbe measured using techniques that only analyse thesequence from large numbers of pooled nuclei.

    The molecular cytogenetics of other structures on thechromosome is being investigated. The centromere, wheremicrotubules bind at metaphase before separation of thetwo chromatids at anaphase, has a key structural role (seeRattner, 1991) where DNA sequence organization relatesto associated proteins and to centromere function. Many ofthe sequences near or at the centromere are highlyrepeated, and hence can be localized at interphase andused to elucidate interphase nuclear architecture (Heslop-Harrison and Bennett, 1990; Maluszynska and Heslop-Harrison, 1991).

    Origins of replication have been a major area of researchin yeast (see Murray and Szostak, 1983), and there aremany replication origins that are under tight geneticcontrol in plants (Kidd et al. 1989). The molecularcharacterization of sequences that associate with DNAbinding proteins is far ahead of their characterization atthe cytogenetical level. Where are these sequences alongthe chromosome? Where are they spatially within thenucleus?

    A final class of structural features of chromosomes isthat relating to meiotic pairing and recombination. Moensand Pearlman (1990) have shown the association oftelomere and centromere DNA with the synaptonemalcomplex core of meiotic prophase chromosomes, but whatare the sequences associated with the synaptonemalcomplex along the whole chromosome length? Are se-quences related to chromosome structure also associatedwith recombination sites? New evidence suggests that therecombination events may occur prior to synapsis me-diated by the synaptonemal complex, at least in yeast(Haber et al. 1991); if these results prove general, then

    different classes of structural sequences may be associatedwith homologue recognition, recombination and thenpairing.

    Physical organization of repetitive sequences

    At interphase, light and low-power electron-microscopestudies allow the nucleolus (reviewed by Jordan, 1991) andchromatin to be resolved. Pieces of condensed chromatin,usually referred to as heterochromatin, are visible, and (atleast in species with more than some 1000 Mbp of DNA)chromatin axes can be followed. The physical organizationof repetitive DNA is important, since such sequencesmake up most of the DNA in most higher organisms — eventhe Arabidopsis genome is 25 % repetitive DNA (Leutwileret al. 1986), while the wheat genome (17 000 Mbp) consistsof more than 80% middle and highly repetitive DNA(Flavell, 1985).

    Studies of gross nuclear morphology in many, althoughnot all, tissues, show a configuration recognized as early as1885 by Rabl, where the nucleus has a pole at which thecentromeres are preferentially located opposite the telo-mere pole. In species with higher DNA contents, there isoften a strong gradient in the proportion of the nucleusfilled by chromatin (see Anamthawat-J6nsson and Heslop-Harrison, 1990; and Fig. 1), with about half the volume ofthe nucleus near the centromere pole filled by chromatin,while the telomere zone is only 15 % filled.

    Many of the heterochromatic segments are supercoiled,tandemly repeated DNA sequences. Staining propertiesindicate that they contain repetitive DNA (Schweizer,1981), and in situ hybridization confirms the co-location ofparticular repetitive sequences. For example, light micro-graphs of stained DNA and electron microscope picturesshow large heterochromatic segments near the centro-meres at metaphase. Using in situ hybridization toArabidopsis, Maluszynska and Heslop-Harrison (1991)have shown by in situ hybridization that a repetitivesequence (cloned by Martinez-Zapater et al. 1986) co-localizes with many of the heterochromatic knobs, andthat it appears as similarly condensed units both onmetaphase chromosomes and at interphase. The probeproduces such discrete hybridization signals at interphasethat it can be used to count centromeres, and hencechromosome, numbers in non-dividing tissues.

    The type of DNA sequence found in knobs consists oflarge numbers of copies of the same (or closely similar)sequence in tandem arrays, but another class of highlyrepetitive DNA is found dispersed throughout the genome.An example of such a sequence in plants was described byMoore et al. (1991), who found an element of the genomethat comprised some 5 % of the total DNA of barley. Usingin situ hybridization, they were able to show that thesequence was interspersed and located over most chromo-some arms, although excluded from the telomeric, nu-cleolar organizing and paracentromeric regions (Fig. 2).Again, without direct examination of chromosomal lo-cation, the organization of the element would be difficult todetermine, and many questions arise from the discovery.Why is the sequence excluded from some chromosomalregions? Does it tend to flank particular types or classes ofsequence? What differences are there in locations andtypes of sequences in the excluded regions? Indications ofthe answer to another question, *How does a dispersedrepeat become distributed on all chromosomes?', come

    16 J. S. Heslop-Harrison

  • Fig. 1. A electron micrograph ofa section through an interphasenucleus from a rye root-tip. Thenucleus shows a strong gradientwith the centromeres (arrows)lying in a region where a largeproportion of the nuclear volumeis filled by chromatin. Thetelomeres (arrowhead indicatessub-telomeric heterochromatin)lie in a zone with a muchsmaller proportion of chromatin(see Anamthawat-J6nsson andHeslop-Harrison, 1990) x 12 000.

    from sequence analysis, since it has similarities toretrotransposon-like elements in animals.

    In situ hybridization of both tandemly repeated anddispersed sequences is proving to be not only of use forplant genome analysis but also of practical value in plantbreeding. Lapitan et al. (1986) have demonstrated the useof a species-specific cloned dispersed repeat from rye toidentify rye chromosomes and chromosome segments inhybrids with wheat, which are of potential agriculturalimportance. Total genomic DNA, where many repetitive

    sequences act as the probe, can also be used to identify theparental origin of chromosomes or chromosome segmentsin hybrids and breeding lines (Fig. 3; Anamthawat-J6nsson et al. 1990), and the method is now beingdeveloped further for use in screening plant breeding lines(Heslop-Harrison, Miller and colleagues, unpublisheddata). Tandem arrays are of interest in this context. In rye,the size of the sub-terminal knobs (and hence copy numberof the tandem repeat) on many chromosome arms can vary(Gustafson et al. 1983), and may be selected in rye itself

    Molecular cytogenetics of plants 17

  • Fig. 2. In situ localization of a cloned DNA sequence, BIS1, tobarley chromosomes. DAPI staining (left) shows the DNA onthe chromosomes, while a biotin-labelled fragment of BIS1(right) hybridizes in situ over most of the chromosomes exceptat centromeric and telomeric regions. The sequence may be (ormay have been) capable of movement and amplification in thegenome through reverse transcription, as is indicated by itsdispersion over the chromosomes (Moore et al. 1991).

    Fig. 3. A partial metaphase spread from a root tip of a wheatvariety, Custom, which includes a rye chromosome armtranslocated onto a wheat arm. The spread has been probed insitu with total genomic DNA from rye, with non-species-specific hybridization blocked with wheat DNA. Allchromosomes fluoresce with the counterstain (light grey), whilethe strongly labelled chromosome arm from rye fluorescesbrightly (white; Heslop-Harrison et al. 1990). Despite therelatively close relationship of the two species, there areenough different dispersed sequences to permit discriminationof the rye chromosome arm, and the point of recombinationand presence of other recombination events can be examined.

    and in the hybrid crop, triticale, with potential effects onagronomic performance.

    Chromosomal localization of genes

    For mapping genes in human, Ferguson-Smith (1991) hasrecently stated that in situ hybridization 'is now themethod of choice for assigning a cloned DNA sequence toits parent chromosomes'. In plants, this statement is nowonly true for one class of genes, those present in multiplecopies in tandem arrays, such as the 18 S, 26 S and 5.8 SrRNA gene subunits (Mukai et al. 1991) and the 5 S rRNAgenes (Mukai et al. 1990). These are present in tandemarrays of thousands of copies at one or more chromosomallocus, but some whole loci may be unexpressed, while onlya subset of the genes is expressed at other loci (Flavell,1986; Flavell and O'Dell, 1990). There is clear cytologicalevidence for the expression of rRNA genes at interphase,through the production of the nucleolus where the sizecorrelates with activity, and at metaphase chromosomesthat have recently expressed the rDNA genes normallyshow a secondary constriction at the nucleolar organizingregion (NOR). Thus the expressed sequences can belocated on chromosomes by cytology and hence mappedphysically at metaphase. The presence of non-expressingrRNA genes in rDNA-containing heterochromatin lyingadjacent to the nucleolus or elsewhere in the interphasenucleus has been shown by in situ hybridization (Appels etal. 1986; Leitch et al. unpublished data). A similarrepression of genes by condensation may be found infemale mammals, where one X chromosome is largelyinactive, and sometimes visible as a condensed Barr body.Because of the large number of tandem repeats, and thelimited variation within the repetitive element, and lackof expression of some loci, the total number of chromo-somes carrying rRNA genes cannot easily be found usingsegregation or RFLP analysis or by examination of thechromosomes. Thus, the presence and locations of rDNAon five pairs of wheat chromosomes (Mukai et al. 1991) andtwo Arabidopsis chromosomes (Maluszynska and Heslop-Harrison, 1991) have only been shown by in situhybridization.

    There are two basic maps of a genome: the physical orcytogenetic map, and the genetic map. The physical mapshows the morphology of the chromosomes and thephysical location of genes and other chromosome markersalong the length of the chromosomes. The genetic map isbased on the chromosome linkage groups on which genes,restriction fragment length polymorphisms (RFLPs), andother markers, lie. The genetic or recombination distancesbetween the markers on each chromosome give the mapdistances (see O'Brien, 1990). As well as using analysis oflarge segregating families for genetic mapping, genes andRFLPs can be mapped to chromosomes using the widerange of aneuploid stocks available in plants. Arabidopsishas a complete set of trisomic lines (with one chromosomepresent in an extra copy) that permit direct gene mapping.Wheat has many aneuploids that enable each gene ormarker to be assigned to a particular chromosome orchromosome arm. Below the level of the chromosome arm,connection of genetical map to the physical map isdifficult. Although, of course, the physical and geneticmaps show markers in the same order along thechromosome, the physical location of most markers isunknown except for specific points such as the centromeresand secondary constrictions. When physical and genetical

    18 J. S. Heslop-Harrison

  • locations are known, there is often little correlationbetween the separation distances of markers on the twotypes of map. One of the first studies of the relationships ofthe maps in cereals was by Linde-Laursen (1979), whoexamined barley populations segregating for C-bands. Hefound almost no recombination between two bands thatwere widely separated on one chromosome arm, whileneither physical band was linked with an esterase locus onthe same arm, implying a high level of recombination.Now, physical technologies are beginning to link the mapsmore closely, and to answer questions about genelocalization.

    Using the estimated total length of the Arabidopsisgenetic map, in cM (centiMorgans), derived from recombi-nation studies, and dividing the length by the genome sizeindicates that each cM occupies an average of 150 kb. Grilland Somerville (1991) have cloned the genome of Arabi-dopsis in YACs, and are probing the genome with manyRFLP markers (Chang et al. 1988), that have been mappedin segregating populations. One clone of 170 kb containedthree markers that mapped over 2.9 cM, while a second180 kb clone included two RFLP markers that mapped1.6 cM apart, both quite similar to the expectation.Nevertheless, correlation of the genetic and physicaldistances suggests that there are probably regions,perhaps near the centromeres and telomeres, whererecombination is suppressed, and the frequency isincreased in other regions of the genome.

    In cereals, with genomes 50 or more times larger thenArabidopsis, a similar physical mapping approach isimpractical. Even measurement of the genetic length ofthe chromosomes is difficult, since there is no simplemethod to map genetically the end of a chromosome;chance discovery of a terminal RFLP can greatly increasegenetic length. Data from deletion mapping and in situhybridization of cloned genes are tending to show thatgenes are not randomly distributed along chromosomearms. Certain genes, and other markers, that are mappedgenetically midway along an arm, or towards thecentromere, seem to be physically located in the terminalchromosome region, and hence all markers distal to thesemust also be located in this region. For example, Tsujimotoand Noda (1990) examined lines of wheat that had shortdeletions, detected by C-banding, on the long arm of thechromosome designated 5A. They found that a deletion of13 % of the physical length of the arm caused loss ofmarkers representing at least 83 % of the genetic length ofthe arm. Similarly, Curtis and Lukaszewski (1991) foundthat the distal half of the physical arm of another wheatchromosome accounted for almost 90% of the recombi-nation. A clone for the secalin genes in rye can be used toprobe a major gene on the nucleolar organizing chromo-some. Gustafson et al. (1990) found that the gene wasphysically located close to the NOR, although genetic datashow that recombination is frequent between the two(Fig. 4). High-resolution mapping of a short chromosomalregion was possible with the in situ hybridizationtechnique. There is very limited recombination in the 90 %of the chromosome length that physically lies between thecentromere and the NOR (Gustafson et al. 1990) in rye, inagreement with Snape et al. (1985), who found thatrecombination was much more frequent in regions ofwheat chromosomes distal to the centromere. Anotherapproach to correlation of genetic and physical maps wasused in another cereal, maize, by Dooner (1986; Dooner etal. 1985), i.e. a transposon-tagging strategy. He estimatedthat one cM in the region of the gene Bronze represented

    •XGIi

    -Gpi

    —XNor-Xpsr161

    -Xpsr325-XPpdk XEm

    —XLec XGIu

    -Xpsr330

    -XAdh

    Fig. 4. A comparison of two physical, cytogenetic maps andthe genetic map of chromosome 1R of rye; C-band map (left, R.Schlegel and R. Kynast, personal communication), in situhybridization map (centre, from Gustafson et al. 1990) andgenetic or RFLP map (right, from Wang et al. 1991). All mapsshow the nucleolar organizing region (NOR) where rRNAgenes are tandemly arrayed, the centromere and telomeres.Telomeres have not been mapped genetically (right). The twophysical maps (left, centre) separate the centromere and NORwidely, while genetic linkage analysis places them closetogether, indicating little recombination between them.

    only 14 kb of DNA, compared with an average value ofmore than 2300 kb in the genome as a whole. Not all areasnear genes have greatly elevated recombination fre-quencies (recombination hot spots); Cheung et al. (1991)have mapped the small multi-gene family of a-amylasegenes in wheat by pulsed-field gel electrophoresis, andfound that 1 cM approximates to 1000 kb within the locus,which is close to the 3000 kb average over the genome as awhole.

    It is becoming clear that physical and genetic distancesbetween loci along chromosomes need not be correlated inlarger plant genomes, as in the human genome (e.g. seePetersen et al. 1991). The discovery and significance ofsuch discrepancies, and examination of differences be-tween male and female recombination maps, will beimportant for understanding genetic recombination andphysical sequence organization on chromosomes. Only byknowing the physical positions of genes along chromo-somes can genetic linkage, and gene and sequenceinterdependence, be understood. Genes are not randomlyspaced along the chromosome arms, so division of genomesize by number of cM may give little indication of thephysical separation of genes - although knowledge of thephysical distance is important for, say, cloning strategiesinvolving cloning (walking) between an RFLP marker anda linked gene.

    Conclusions

    Technical progress to increase both the resolution andsensitivity of gene, marker and sequence mapping is now

    Molecular cytogenetics of plants 19

  • required. Current research using in situ hybridization isleading to better labelling technology (e.g. multiple targetlabelling; Leitch et al. 1991), and the detection of shortersequences. Flow cytometry provides a complementaryapproach to molecular cytogenetics. Although the tech-nology has proved to be of major importance in mam-malian cytogenetics, it has not yet been usefully applied toplants. Gray and Cram (1990) described why the analysisand sorting of plant chromosomes would be useful: flowkaryotyping is accurate and fast, while chromosomesorting enables the construction of chromosome-specificlibraries and hence gene mapping. They also consider thatthe work is of considerable economic interest, sincechromosome transplantation via fusion techniques andmicroinjection would be possible with sorted plant chromo-somes. While some progress towards the sorting of alimited number of plant species has been made, there areformidable difficulties in generating the high metaphaseindex required for maintaining stable cell cultures, and inisolation of chromosomes from cells with walls; inmammalian cytogenetics all these are established tech-niques or are not required. Nevertheless, there is immensepotential in the method for analysing large genomes, suchas those of the cereals.

    The production of stable transgenic plants requires thephysical insertion of genes into the chromosomes. Does thesite of incorporation affect gene expression (see Stief et al.1989)? Do features, such as co-suppression of pairs ofinserted genes (Matzke and Matzke, 1990), relate to geneposition? Hager and Miller (1991) reported tight linkage ofcertain developmentally co-regulated Drosophila genes,and speculate whether such grouping is a commonoccurrence. In a plant breeding context, perhaps areas oflow recombination could be used to keep desirable geneclusters together through a crossing programme.

    Developments in gene cloning and antibody technologyover the last decade have denned structures and localizedDNA and proteins within the nucleus, enabling aspects ofthe physical organization of the nucleus to be elucidated inthree dimensions (Heslop-Harrison and Bennett, 1990).Knowledge from molecular cytogenetics can help answermany of the fundamental problems of both plant geneticsand molecular biology. Physical knowledge of locations ofgenes and DNA sequences, whether active in control,expression, recombination or evolution, is helping in theunderstanding of genome behaviour and gene action.When will we be able to look directly at a chromosome andread off the sequence of DNA pairs, while assaying theactivity and purpose of each sequence?

    I am grateful to Dr Andrew Leitch, Dr Trude Schwarzacher andKesara Anamthawat-J6nsson for their immense help in ourmolecular cytogenetics reseach programme and providing theplates. I also thank Ilia Leitch, Jola Maluszynska, Shi Min,Mingli Wang, Michael Bennett, Michael Gale, David Laurie,Graham Moore and many other colleagues for helpful discussionsabout the uses of molecular cytogenetics and for access to pre-publication data, and Gill Harrison for drawing Fig. 4. The workwas supported by BP and Venture Research International, andAFRC through grants 111/569 and 570 under the PMB initiative.

    References

    ANAMTKAWAT-J6NSSON, K AND HESLOP-HARRISON, J. S (1990).Centromeres, telomeres and chromatin in the interphase nucleus ofcereals. Caryologia 43, 205-213.

    ANAMTHAWAT-J6NSSON, K., SCHWARZACHER, T., LEITCH, A R., BENNBTT,M. D. AND HESLOP-HAREISON, J. S (1990). Discrimination between

    closely related Triticeae species using genomic DNA as a probe.Theor. appl. Genet. 79, 721-728.

    APPELS, R., MORAN, L. B. AND GUSTAFSON, J. P. (1986) The structure ofDNA from the rye (Secale cereale) NOR Rl locus and its behaviour inwheat backgrounds. Can. J. Genet. Cytol. 28, 673-685.

    BENNETT, M. D AND SMITH, J. B (1976) Nuclear DNA amounts inangiosperms. Phil. Trans. R. Soc. Land. B 274, 227-274.

    BENNETT, M. D., SMITH, J. B. AND HESLOP-HARRISON, J. S. (1982).Nuclear DNA amounts in angiosperms. Proc. R. Soc. Land. B 216,179-190

    BLACKBURN, E. H (1991) Structure and function of telomeres. Nature350, i and 569-573.

    CHANG, C, BOWMAN, J. L., DEJOHN, A. W., LANDER, E. S. ANDMEYEROWITZ, E. M. (1988). Restriction fragment length polymorphismmap for Arabidopsis thaliana. Proc. natn. Acad. Set. U.S.A. 86,6856-6860.

    CHEUNG, W. Y., CHAO, S. AND GALE, M. D. (1991). Long-range physicalmapping of the amylase (a-Amy-1) loci on homoeologous group 6chromosomes of wheat. Mol. gen Genet (in press).

    COEN, E. S. (1991). The role of homeotic genes in flower developmentand evolution. A. Rev. PI. Physiol. PI. molec. Biol. 42, 241-279.

    CURTIS, C. A. AND LUKASZBWSKI, A. J. (1991). Genetic linkage betweenC-bands and storage protein genes in chromosome IB of tetraploidwheat. Theor. appl. Genet. 81, 245-252.

    DOONER, H. K. (1986). Genetic fine structure of the Bronze locus inmaize. Genetics 113, 1021-1036.

    DOONER, H. K., WECK, E., ADAMS, S., RALSTON, E., FAVREAU, M ANDENGLISH, J. (1985). A molecular genetic analysis of insertions in thebronze locus in maize. Mol. gen. Genet. 200, 240-246.

    FERGUSON-SMITH, M. A. (1991). Putting the genetics back intocytogenetics. Am. J. hum. Genet. 48, 179-182

    FLAVELL, R. B. (1986) Repeated sequences and genome change. InGenetic Flux in Plants (ed. B. Hohn and E. S. Dennis), pp. 129-166.Springer Verlag. Wien, New York.

    FLAVELL, R. B. (1986). The structure and control of expression ofribosomal RNA genes. Oxford Surveys of PI. mol. Cell Biol 3,261-274.

    FLAVELL, R. B. AND O'DELL, M. (1990) Variation and inheritance ofcytosine methylation patterns in wheat at the high molecular weightglutenin and ribosomal RNA gene loci. Development 1990 Supplement,15-20.

    FITTERS, T. S., VAUGHAN, T. J., SHARP, P. J. AND CUMING, A. C (1990)Molecular cloning of genes encoding the Em polypeptide of Triticumaestivum L. var. Chinese Spring: Evidence for close physical linkageon the long arms of homoeologous group 1 chromosomes. Theor. appl.Genet. 80, 43^48.

    GRAY, J W AND CRAM, L. S. (1990). Flow karyotyping and chromosomesorting. In Flow Cytometry and Sorting, 2nd edn, pp 503-529. Wiley-Liss, New York

    GREILHUBER, J (1977) Why plant chromosomes do not show G-bands.Theor. appl. Genet. 50, 121-124.

    GRILL, E AND SOMERVILLE, C. (1991). Construction and characterizationof a yeast artificial chromosome library of Arabidopsis which issuitable for chromosome walking. Mol. gen Genet 228, 484-490.

    GUSTAFSON, J P , BUTLER, E. AND MCINTYRE, C. L. (1990). Physicalmapping of a low-copy DNA sequence in rye (Secale cereale L.). Procnatn. Acad. Sci. U.S.A. 87, 1899-1902.

    GUSTAFSON, J. P., LUKASZEWSKI, A. J. AND BENNETT, M. D. (1983).Somatic deletion and redistribution of telomeric heterochromatin inthe genus Secale and in triticale Chromosoma 88, 293-298

    HABER, J. E., LEUNG, W-Y., BORTS, R. H. AND LICHTEN, M. (1991). Thefrequency of meiotic recombination in yeast is independent of thenumber and position of homologous donor sequences: implications forchromosome pairing Proc. natn. Acad. Sci. U.S.A. 88, 1120-1124.

    HAGER, E. J. AND MILLER, O. L. (1991) Ultrastructural analysis ofpolytene chromatin of Drosophila melanogaster reveals clusters oftightly linked co-expressed genes. Chromosoma 100, 173-186

    HESLOP-HARRISON, J. S. AND BENNETT, M. D. (1990). Nucleararchitecture in plants. Trends Genet. 6, 401-405.

    HESLOP-HARRISON, J. S., LEITCH, A. R., SCHWARZACHER, T. ANDANAMTHAWAT-J6NSSON, K. (1990). Detection and characterization of1B/1R translocations in hexaploid wheat. Heredity 65, 385-392

    HUTTLY, A. K. AND BAULCOMBE, D. C (1989). A wheat cr-Amy2 promoteris regulated by gibberellin in transformed oat aleurone protoplastsEMBO J 8, 1907-1913

    JORDAN, E. G. (1991). Interpreting nucleolar structure: where are thetranscribing genes? J. Cell Sci. 98, 437-442.

    KIDD, A. D., FRANCIS, D. AND BENNETT, M. D. (1989). Replicon size andrate of DNA replication fork movement are correlated in grasses. ExplCell Res. 184, 262-267

    LAPITAN, N. L. V., SEARS, R. G., RAYBURN, A L AND GILL, B. S. (1986).Wheat-rye translocations. J. Hered. 77, 415-419.

    LEITCH, A. R., MOSGOLLER, W., SCHWARZACHER, T, BENNETT, M. D. AND

    20 J. S. Heslop-Harrison

  • HESLOP-HARRISON, J. S. (1990). Genomic in situ hybridization tosectioned nuclei shows chromosome domains in grass hybrids. J. CellSci. 95, 335-341.

    LEITCH, I. J., LBITCH, A. R. AND HBSLOP-HARRISON, J. S. (1991). Physicalmapping of plant DNA sequences by simultaneous in situhybridization of two differently labelled fluorescent probes. Genome34, 329-333.

    LEUTWILER, L. S., HOUGH-EVANS, B. R. AND MEYEROWITZ, E. M. (1984).The DNA of Arabidopsis thaliana. Mol. gen. Genet. 194, 15-23.

    LINDE-LAURSEN, I. (1979). Giemsa C-banding of barley chromosomesIII. Segregation and linkage of C-bands on chromosomes 3, 6, and 7.Hereditas 91, 73-77.

    Liu, C. J., ATKINSON, M. D., CHINOY, C. N., DEVOS, K. M. AND GALE, M.D. (1991). Non-homoeologous translocations between group 4, 5 and 7chromosomes within wheat. Theor. appl Genet, (in press).

    MALUSZYNSKA, J. AND HESLOP-HARRISON, J. S. (1991). Localization oftandemly repeated DNA sequences in Arabidopsis thaliana. The PlantJournal 1(2) (in press).

    MANUELJDIS, L. AND CHEN, T. L. (1990). A unified model of eukaryoticchromosomes. Cytometry 11, 8-25.

    MARTINEZ-ZAPATER, J. M., EBTELLE, M. A. AND SOMERVILLB, C. R. (1986).A highly repeated DNA sequence in Arabidopsis thaliana. Mol. gen.Genet. 204, 417-423.

    MATZKI, M. A. AND MATZKK, A. J. M. (1990). Gene interactions andepigenetic variation in transgenic plants. Devi Genet. 11, 214-223

    MOENS, P. B. AND PKARLMAN, R. E. (1990). Telomere and centromereDNA are associated with the cores of meiotic prophase chromosomes.Chromomma 100, 8-14.

    MOORE, G., CHEUNG, W., SCHWARZACHER, T. AND FLAVBLL, R. (1991).BIS1, a major component of the cereal genome and a tool for studyinggenomic organization. Genonucs 10, 469—476.

    MUKAI, Y., ENDO, T. R. AND GILL, B. S. (1990). Physical mapping of the5S rRNA multigene family in common wheat. J. Hered. 81, 290-295.

    MUKAI, Y., ENDO, T. R. AND GILL, B. S. (1991). Physical mapping of the18S.26S rRNA multigene family in common wheat: Identification of anew locus. Chromosoma 100, 71-78.

    MURRAY, A. W. AND SZOSTAK, J. W. (1983). Construction of artificialchromosomes in yeast. Nature 305, 189-193.

    O'BRIEN, S. J. (ed.) (1990) Genetic Maps: Locus Maps of ComplexGenomes, 5th edn. Cold Spring Harbor Press, New York.

    PETERSEN, M. B., SLAUGENHAUPT, S. A , LEWIS, J. G., WARREN, A. C,CHAKRAVARTI, A. AND ANTONARAKIS, S. E. (1991). A genetic linkagemap of 27 markers on human chromosome 21 Genomics 9, 407-419.

    RABL, C. (1885). Ober Zelltheilung. Morph Jb 10, 214-330.RATTNER, J. B. (1991). The structure of the mammalian centromere.

    BioEssays 13, 51-56.RICHARDS, E. J. AND AUSUBEL, F. M. (1988) Isolation of a higher

    eukaryotic telomere from Arabidopsis thaliana. Cell 53, 127-136.SCHWARZACHER, T. AND HESLOP-HARRISON, J. S. (1991). In situ

    hybridization to plant telomeres using synthetic oligomers. Genome34, 317-323.

    SCHWEIZER, D. (1981) Counterstain-enhanced chromosome banding.Hum. Genet 57, 1-14.

    SNAPS, J. W., FLAVELL, R. B., O'DELL, M., HUGHES, W. G. AND PAYNE,P. I. (1985). Intrachromosomal mapping of the nucleolar organizerregion relative to three marker loci on chromosome IB of wheat(Triticum aestiuum). Theor. appl. Genet. 69, 263-270.

    STANFORD, A C, NORTHCOTE, D. H AND BEVAN, M. W. (1990). Spatialand temporal patterns of transcription of a wound-induced gene inpotato. EMBO J. 9, 593-603.

    STIBF, A , WINTER, D. M., STRALJNG, W. H. AND SIPPEL, A. E. (1989). Anuclear DNA attachment element mediates elevated and position-independent gene activity. Nature 341, 343—345.

    TRASK, B. J. (1991). Fluorescence in situ hybridization- applications incytogenetics and gene mapping Trends Genet. 7, 149-154.

    TSUJIMOTO, H AND NODA, K. (1990). Deletion mapping by gametocidalgenes in common wheat. Position of speltoid suppression (Q) and ft-amylase (Ji-Amy-A2) genes on chromosome 5A. Genome 33, 850-853.

    WANG, M L., ATKINSON, M. D , CHINOY, C. N., DEVOS, K M.,HARCOURT, R. L., LIU, C. J., ROGERS, W. J. AND GALE, M D. (1991).RFLP-based genetic map of rye (Secale cereale L.) chromosome 1R.Theor. appl Genet, (in press).

    WARD, E. R. AND JEN, G. C. (1990). Isolation of single-copy-sequenceclones from a yeast artificial chromosome library of randomly-shearedArabidopsis thaliana DNA. PI. molec Biol. 14, 561-568.

    ZAKIAN, V A. (1989). Structure and function of telomeres. A Rev.Genet. 23, 579-604.

    Molecular cytogenetics of plants 21