8
GENOMICS 25,674-681 (1995) Thermal Asymmetric Interlaced PCR: Automatable Amplification and Sequencing of Insert End Fragments from PI and YAC Clones for Chromosome Walking YAO-GUANG LIU AND ROBERTF. WHITTIER’ Mitsui Plant Biotechnology Research Institute, RlTf Tsukuba Laboratory 1, TCI-D21, Sengen 2-l-6, Tsukuba305, Japan ReceivedFebruary 22, 1994; revised September 21, 1994 Isolation of DNA segments adjacent to known se- quences is a tedious task in genome-related research. We have developed an efficient PCR strategy that over- comes the shortcomings of existing methods and can be automated. This strategy, thermal asymmetric in- terlaced (TAIL)-PCR, utilizes nested sequence-specific primers together with a shorter arbitrary degenerate primer so that the relative amplification efficiencies of specific and nonspecific products can be thermally controlled. One low-stringency PCR cycle is carried out to create annealing site(s) adapted for the arbi- trary primer within the unknown target sequence bor- dering the known segment. This sequence is then pref- erentially and geometrically amplified over nontarget ones by interspersion of high-stringency PCR cycles with reduced-stringency PCR cycles. We have ex- ploited the efficiency of this method to expedite ampli- fication and sequencing of insert end segments from Pl and YAC clones for chromosome walking. In this study we present protocols that are amenable to auto- mation of amplification and sequencing of insert end sequences directly from cells of Pl and YAC clones. Q 1996 Academic Press, Inc. INTRODUCTION Genome-related research frequently requires isola- tion of DNA from an unsequenced segment bordering a known sequence. This need arises when isolating insert end fragments of large clones such as Pl and yeast artificial chromosomes, cloning insertion tagged genes, obtaining regulatory sequences corresponding to cloned cDNAs, or studying oncogenic retroviral inser- tions. A number of PCR methods have been described for this purpose, including inverse PCR (Ochman et al., 1988; Triglia et al., 1988; Silver and Keerikatte, 1989) and hemispecific or one-sided PCR methods (Frohman 1 To whom correspondence should be addressed at the Mitsui Plant Biotechnology Research Institute, TCI D21, Sengen 2-l-6, Tsukuba 305, Japan. Telephone: 81-298-58-6235. Fax: 81-298-58-6234. E- mail: [email protected]. et al., 1988; Loh et al., 1989; Ohara et al., 1989; Riley et al., 1990; Mueller et al., 1989; Parker et al., 1991; Isegawa et al., 1992). Many of these methods require special steps before PCR such as Southern analysis to determine suitable restriction sites, followed by manip- ulations such as restriction cutting and ligation or tail- ing. Targeted gene walking PCR (Parker et al., 1991) and single primer PCR (Parks et al., 1991) do not re- quire such manipulations prior to PCR. They rely upon specific priming within the known sequence together with arbitrary priming within the flanking sequence (hence, hemispecific). However, arbitrary priming also creates nontarget molecules, and these constitute the bulk of the final product, even when the starting tem- plate sample is quite simple. The desired product must then be identified by hybridization or elongation from an end-labeled internal primer. Thus, methods that omit manipulations before PCR require more laborious screening afterward. We present here a hemispecific PCR method that overcomes these drawbacks. It requires no DNA ma- nipulation before PCR, yet efficiently amplifies tar- geted segments, usually without visible background. The basis for this strategy is thermal asymmetric PCR, which was described for producing single-stranded DNA templates for sequencing (Mazars et al., 1991). Using two primers differing in length and hence ther- mal annealing stability, PCR cycles carried out with high annealing temperatures favor the longer primer, while annealing at lower temperatures allows both primers to function with near equal efficiency. We have developed a strategy interspersing asymmetric and symmetric PCR cycles so as to geometrically favor am- plification of target molecules over nonspecific prod- ucts. This strategy, thermal asymmetric interlaced (TAIL)-PCR, entails consecutive reactions with nested sequence-specific primers and a shorter arbitrary de- generate primer. Generation of insert end-specific probes from YAC or Pl clones is an essential step in genome mapping and map-based cloning programs. In this paper we describe the methodology of TAIL-PCR and its application to Pl 674 0888-7543195 $6.00 Copyright 8 1995 by Academic Press, Inc. All rights of reproduction in any form reserved.

Thermal Asymmetric Interlaced PCR- Automatable Amplificatio......Chromosome Walking

Embed Size (px)

Citation preview

Page 1: Thermal Asymmetric Interlaced PCR- Automatable Amplificatio......Chromosome Walking

GENOMICS 25,674-681 (1995)

Thermal Asymmetric Interlaced PCR: Automatable Amplification and Sequencing of Insert End Fragments from PI and YAC

Clones for Chromosome Walking

YAO-GUANG LIU AND ROBERT F. WHITTIER’

Mitsui Plant Biotechnology Research Institute, RlTf Tsukuba Laboratory 1, TCI-D21, Sengen 2-l-6, Tsukuba 305, Japan

Received February 22, 1994; revised September 21, 1994

Isolation of DNA segments adjacent to known se- quences is a tedious task in genome-related research. We have developed an efficient PCR strategy that over- comes the shortcomings of existing methods and can be automated. This strategy, thermal asymmetric in- terlaced (TAIL)-PCR, utilizes nested sequence-specific primers together with a shorter arbitrary degenerate primer so that the relative amplification efficiencies of specific and nonspecific products can be thermally controlled. One low-stringency PCR cycle is carried out to create annealing site(s) adapted for the arbi- trary primer within the unknown target sequence bor- dering the known segment. This sequence is then pref- erentially and geometrically amplified over nontarget ones by interspersion of high-stringency PCR cycles with reduced-stringency PCR cycles. We have ex- ploited the efficiency of this method to expedite ampli- fication and sequencing of insert end segments from Pl and YAC clones for chromosome walking. In this study we present protocols that are amenable to auto- mation of amplification and sequencing of insert end sequences directly from cells of Pl and YAC clones. Q 1996 Academic Press, Inc.

INTRODUCTION

Genome-related research frequently requires isola- tion of DNA from an unsequenced segment bordering a known sequence. This need arises when isolating insert end fragments of large clones such as Pl and yeast artificial chromosomes, cloning insertion tagged genes, obtaining regulatory sequences corresponding to cloned cDNAs, or studying oncogenic retroviral inser- tions. A number of PCR methods have been described for this purpose, including inverse PCR (Ochman et al., 1988; Triglia et al., 1988; Silver and Keerikatte, 1989) and hemispecific or one-sided PCR methods (Frohman

1 To whom correspondence should be addressed at the Mitsui Plant Biotechnology Research Institute, TCI D21, Sengen 2-l-6, Tsukuba 305, Japan. Telephone: 81-298-58-6235. Fax: 81-298-58-6234. E- mail: [email protected].

et al., 1988; Loh et al., 1989; Ohara et al., 1989; Riley et al., 1990; Mueller et al., 1989; Parker et al., 1991; Isegawa et al., 1992). Many of these methods require special steps before PCR such as Southern analysis to determine suitable restriction sites, followed by manip- ulations such as restriction cutting and ligation or tail- ing. Targeted gene walking PCR (Parker et al., 1991) and single primer PCR (Parks et al., 1991) do not re- quire such manipulations prior to PCR. They rely upon specific priming within the known sequence together with arbitrary priming within the flanking sequence (hence, hemispecific). However, arbitrary priming also creates nontarget molecules, and these constitute the bulk of the final product, even when the starting tem- plate sample is quite simple. The desired product must then be identified by hybridization or elongation from an end-labeled internal primer. Thus, methods that omit manipulations before PCR require more laborious screening afterward.

We present here a hemispecific PCR method that overcomes these drawbacks. It requires no DNA ma- nipulation before PCR, yet efficiently amplifies tar- geted segments, usually without visible background. The basis for this strategy is thermal asymmetric PCR, which was described for producing single-stranded DNA templates for sequencing (Mazars et al., 1991). Using two primers differing in length and hence ther- mal annealing stability, PCR cycles carried out with high annealing temperatures favor the longer primer, while annealing at lower temperatures allows both primers to function with near equal efficiency. We have developed a strategy interspersing asymmetric and symmetric PCR cycles so as to geometrically favor am- plification of target molecules over nonspecific prod- ucts. This strategy, thermal asymmetric interlaced (TAIL)-PCR, entails consecutive reactions with nested sequence-specific primers and a shorter arbitrary de- generate primer.

Generation of insert end-specific probes from YAC or Pl clones is an essential step in genome mapping and map-based cloning programs. In this paper we describe the methodology of TAIL-PCR and its application to Pl

674 0888-7543195 $6.00

Copyright 8 1995 by Academic Press, Inc. All rights of reproduction in any form reserved.

Page 2: Thermal Asymmetric Interlaced PCR- Automatable Amplificatio......Chromosome Walking

INSERT END AMPLIFICATION BY TAIL-PCR 675

and YAC systems. Detailed protocols are presented for efficient amplification of insert end sequences directly from cells harboring Pl or YAC clones and subsequent direct-sequencing of unpurified TAIL-PCR products us- ing an automated DNA sequencer. Since no manipula- tion apart from PCR is involved, the process for PCR amplification and sequencing can be automated using an automated laboratory workstation or expedited us- ing multiple-channel pipets.

MATERIALS AND METHODS

Pl and YAC clones. Pl and YAC clones containing Arabidopsis DNA were selected for TAIL-PCR from a Pl library (Liu et al., 1995) and the EG YAC library of Grill and Somerville (1991), respectively.

Oligonucleotide primers. Specific primers that are complemen- tary to the Pl and YAC vector sequences, respectively, were synthe- sized (Fig. 3). In addition, four arbitrary degenerate (AD) primers were used: TG(Afl’~GNAG&“I’)ANCA(G/C)AGA-3’ (ADl). AGfAl T,GNAG&T,ANCA&T,AGG-3’ (ADS), CA(A/T)CGICNGAIA;G/ C)GAA-3’ (AD3, I indicates inosine), and TC(G/C)TICGNACIT(A/ T)GGA-3’ (AD4). These AD primers have average T,,,‘s of 47-48°C as calculated with the formula 69.3 + 0.41 (%GC) -650/L (Mazars et al., 1991), where L is primer length.

PCR procedure. This procedure was designed so that all pi- pettings after removal of culture medium from cell pellets can be automated in 96-well microtiter plates and MicroAmp reaction tubes (Perkin-Elmer) by a Biomek 1000 laboratory workstation (Beck- mann). Alternatively, pipettings can be carried out efficiently with multiple-channel manual pipets. Thermocycling was carried out us- ing a GeneAmp System 9600 (Perkin-Elmer). Cells harboring Pl or YAC clones were cultured overnight with gentle shaking in 96well microtiter plates (round bottom type) in 75 ~1 LB (containing 25 pg/ ml kanamycin and 1 m&f IPTG) or 100 ~1 YPD, respectively. After pelleting by centrifugation, Escherichia coli cells were resuspended in 2 vol (150 ~1) of 0.5~ PCR buffer (see below). Yeast cells were resuspended in 10 ,~l of spheroplasting solution (2 mM EDTA, pH 7.5, 1 mg/ml zymolyase) without sorbitol and incubated at 37°C for 1 h. Without removing the spheroplasting solution, 1 vol (100 ~1) of 0.5~ PCR buffer was added to each well. E. coli cells or yeast spheroplasts were incubated in an air oven at 92°C for 15 min.

Aliquots (1 ~1) of the cell lysate ware added to MicroAmp reaction tubes containing 15 ~1 of primary TAIL-PCR mixture. The PCR mix- ture consisted of lx PCR buffer (10 m&f Tris-HCI, pH 8.3, 50 m&f KCl, 1.5 or 2.0 m&f MgClz, 0.001% gelatin), 0.2 mM each dNTPs, 0.15 fi specific primer (PSl, PTl, YLl, or YRl) and an AD primer (5 $kf for AD1 and AD2 or 2.5 @f for AD3 and AD4, which contain inosine residues), and 0.8 units of AmpliTaq polymerase (Perkin- Elmer). The thermal cycling conditions are summarized in Table 1. For secondary reactions, l-p1 aliquots of the primary PCR products were transferred to microtiter plate(s) containing 100 ~1 Hz0 in each well and mixed. Note that the Biomek 1000 workstation can pipet from MicroAmp reaction tubes containing as little as 10 ~1 solution, but can only pipet from microtiter plates containing about 100 ~1 or more solution. Dilution aliquots (2 ~1) were added to 18 ~1 secondary PCR mixtures containing 1.1~ PCR buffer, 25 fl each dNTPs, 0.8 units of AmpliTaq polymerase, 0.2 fl internal specific primer (PS2, PT2, YL2, or YR21, and the same arbitrary primer used in the pri- mary reaction (3 @f for AD1 or AD2, 1.5 /.&f for AD3 or AD4). After amplification, l-,~l aliquots of the secondary PCR products were diluted in 100 ~1 HzO, and 10 ~1 of dilutions were added to 90 ~1 tertiary PCR mixtures containing 1.1~ PCR buffer, 25 p,V each dNTPs, 3.5 units of AmpliTaq polymerase, and 0.2 &the innermost specific primer (PS3, PT3, YL3, or YR3) and AD primer as in the preceding reaction. The PCR products (7 ~1) were transferred to mi- crotiter plates containing 3 ~1 of 3~ loading buffer in each well and run on 1.5% agarose gels.

TABLE 1

Cycling Conditions Used for TAIL-PCR on the GeneAmp System 9600

File Cycle Reaction no. no. Thermal condition

Primary 1 1 92°C (2 min), 95°C (1 min) 2 5 94°C (15 s), 63°C (1 min), 72°C (2 min) 3 1 94°C (15 s), 30 “C (3 mm), ramping to

72°C over 3 min, 72°C (2 min) 4 10 94°C (5 s), 44°C (1 min), 72°C (2 min) 5 12” 94°C (5 s), 63°C (1 min), 72°C (2 min)

94°C (5 s), 63°C (1 mini, 72°C (2 mini 94°C (5 s), 44°C (1 min), 72°C (2 min)

6 1 72°C (5 min)

Secondary 7 10” 94°C (5 s), 63°C (1 min), 72°C (2 min) 94°C (5 s), 63°C (1 mini, 72°C (2 mini 94°C (5 81, 44°C (1 min), 72°C (2 min)

6 1 72°C (5 min)

Tertiary 8 20 94°C (10 s), 44°C (1 mini, 72°C (2 min) 6 1 72°C (5 min)

Note. The program files in each reaction were linked automatically. n These are nine-segment super cycles each consisting of two high-

stringency and one reduced-stringency cycle (see Fig. 1).

Direct sequencing. For sequencing of TAIL-PCR products with the PRISM Ready Reaction DyeDeoxy Terminator Cycle sequencing kit (ABI), 5 ~1 of unpurified secondary (or tertiary) reaction products were added directly to a 17-~1 volume of sequencing mixtures con- taining 7 ~1 (1 pmol/pl) of the same specific primer used for the PCR amplification and 10 ~1 of sequencing mix from the kit. Cycle sequencing was carried out on a GeneAmp System 9600 thermocycler using 25 cycles of 96°C for 15 s, 60°C for 5 s, and 65°C for 4 min. Sequencing using an automated DNA sequencer 373A (ABI) was carried out according to the manufacturer’s protocol.

DNA hybridization. Conditions of dot and colony hybridizations for Pl and YAC library screening were as described (Liu et al., 1992). Hybridization signals were detected using a BAS-2000 image ana- lyzer (Fuji Film).

RESULTS

Principle of TAIL-PCR

PCR methods using a specific primer and an arbi- trary primer or a primer that pairs with a binding site attached by ligation or tailing are known as “hemispe- cific” or “one-sided” PCR. In a hemispecific PCR, three types of products may form: those primed by both prim- ers (type I), those primed by the specific primer alone (type II), and those primed by the nonspecific primer alone (type III). Type II products as well as any nonspe- cific type I products can be eliminated simply by car- rying out successive reactions with nested specific primer(s). The type III nonspecific products, which are the major source of background, however, cannot be eliminated with nested specific primers using normal PCR cycling (see Fig. 50. The TAIL-PCR strategy is designed to favor amplification of the desired type I specific products and suppress amplification of the type

Page 3: Thermal Asymmetric Interlaced PCR- Automatable Amplificatio......Chromosome Walking

676 LIU AND WHITTIER

short arbitrary degenerate (AD) primer

Primary PCR with SPI and AD

5 high stringency cycles

+ 1 low stringency cycle

10 reduced stringency cycles

(A->

1 reduced stringency cycle 2 high stringency cycles (thermal symmetric) (thermal asymmetric)

v

specific product nonspecific product nonspecific product (type I) (type 11) (type III)

-====q --- D--_ _-- ---

product high or middle high (detectable) low (undetectable) yield: (detectable or undetectable)

1 lo-fold dilution

Secondary PCR with SP2 and AD (10 super cycles)

specific product ’ nonspecific product ------; c)_---

--- product yield: high (detectable) very low (undetectable)

t lWO.fold dilution

Tertiary PCR with SP3 ad AD (20 normal cycles)

t Agarose gel analysis

i

Direct sequencing

FIG. 1. Schematic diagram of TAIL-PCR contrasting the ampli- fication of target with nontarget products. Boldface segments denote the specific primer (SP), and small open rectangles denote the arbi- trary degenerate primer (AD). Diluted type II nonspecific product is not shown after the secondary reaction. In this study standard annealing temperature settings were 30°C in the low-stringency cy- cle, 63°C in the high-stringency cycles, and 44°C in the reduced- stringency cycles (see Table 1). The 10 cycles of reduced-stringency normal PCR between the single low-stringency cycle and the TAIL- cycling are optional (when omitting these cycles the supercycle num- ber of the TAIL-cycling is increased to 15). Carrying out these cycles prior to TAIL-cycling is helpful in reducing the amplification competi- tion between specific and type II nonspecific products during the initial cycles. As an alternate procedure to speed processing, two secondary reactions can be carried out simultaneously using SP2 and SP3, respectively.

III nonspecific products. As shown in Fig. 1, the key points of this strategy are the use of a set of nested long specific primers and a relatively short arbitrary degenerate (AD) primer having a lower T,,, (melting temperature). One low-stringency PCR cycle is carried

the AD primer within the unknown target sequence to create annealing site(s) adapted for the AD primer. Amplification is then carried out by interlacing high- stringency with reduced-stringency PCR cycles. Since only the long specific primer can efficiently anneal to DNA template during high-stringency cycles, target se- quence (type I product) is amplified linearly, and little or no amplification occurs for nontarget sequences (type III products) that are primed at both ends by the AD primer. In the following reduced-stringency cycle both primers can anneal to the template. The single- stranded target DNA produced during high-stringency cycles is replicated to double-stranded form, providing a severalfold increase of target template for the next round of linear amplification. By repeating this process (TAIL-cycling), it is possible to amplify target preferen- tially over nontarget sequences. Type II products primed at both ends by the long specific primer can also arise through mispriming, and these are amplified with even higher efficiency (see Fig. 2). Such undesired products are diluted out, however, in subsequent sec-

10’2

” + Type I product primed by both the long and short primers

; 10’0 +- Type II product primed by the long primer only

E + Type III product primed by the short primer on

‘s 108

!z106

i

1

104

T*IL-cycling

FIG. 2. Theoretical amplification rates of type I, type II, and type III products during the primary and secondary TAIL-PCRs. For clarity and simplicity, all priming reactions are depicted as occurring with either 100 or 0% efficiency, according to annealing stringency. Calculations indicate that for a fixed number of basic cycles during TAIL-cycling, interspersing two high-stringency cycles between each reduced-stringency cycle yields the highest relative amplification of target sequences. During TAIL-cycling the theoretical amplification rates of type I, type II, and type III products are 2”“, 2”“, and 2”,

out to facilitate the initial base-mismatch annealing of respectively, where n is the number of supercycles.

Page 4: Thermal Asymmetric Interlaced PCR- Automatable Amplificatio......Chromosome Walking

INSERT END AMPLIFICATION BY TAIL-PCR 677

A PSl(Tm=61W ~ 5'-CTGTATGTACTGTTTTTTGCGATCTGCCGTTTCGATCCT

~xxx?.K.......... GAG&&TCACTCAGCA

WTCGAGCTGGCTACGGGAACTCGGAAGTTGGGkAGT-5' mlf.Tm=63W

B YLl(Tm=63W 5'-cACTCTGAACCATCTTGGAGGA?CGGTAATTATTTC~

ATCTCTTTTTCAATTGTATATGTGTTATGTTATGTAGTATAC YL2(Tm=60°C)

TCTTTCTTCAACAATTAAATACTCTCGGTAGCCAAGTTGGTT

~CGCAAGATGTAATTTATCACTACGGAATTCGCGGCE 3fT 58 Cl

cAATT~CC&TA~~ _EamHIcloning site

AAATCACTCCCAATTA - YR3 (Tm=58’C)

&&$GG@CTTAAGGC~CCCGCAAGCTGA YR2(Tm=59'C) - YRI(Tm=63'C)

GCGGGGGCCCTCTAAAAAAAC AAAAAAATACAGAGGTAA-5'

FIG. 3. Specific primers used for TAIL-PCR that are complemen- tary to the Pl pAdlOsacBI1 (Pierce et al., 1992) and YAC pYAC41 (Grill and Somerville, 1991) vectors, respectively. The primer sets for the Pl vector are designated PSl, PS2, and PS3 on the SP6 promoter side and PTl, PT2, and PT3 on the T7 promoter side (A); for the YAC vector, primer set YLl, YL2, and YL3 is specific to the left side and primer set YRl, YR2, and YR3 is specific to the right side (B). The calculated T,,, of each primer is indicated.

ondary and tertiary reactions using internally nested specific primers. TAIL-cycling is performed in the sec- ondary PCR as well to lower background further. TAIL- cycling is usually unnecessary in the tertiary PCR, but may be applied if type III products still emerge.

Amplification of Insert End Sequences from PI and YAC Clones

For amplifying insert end sequences from Pl and YAC clones by TAIL-PCR, three nested specific primers were synthesized to each bordering vector arm (Fig. 3). For amplifying Pl insert ends, we used E. coli cells directly as the template source. Although YAC insert ends could also be amplified directly from yeast cells without spheroplasting treatment (data not shown), this simple treatment facilitated release of DNA from cells and improved the reliability of amplification.

Figure 4 shows some examples of amplification of insert end sequences from Pl and YAC clones by the TAIL-PCR strategy. In most cases product bands from the primary TAIL-PCR were visible. Many such bands disappeared after the secondary TAIL-PCR, indicating that these bands were nonspecific type II and/or type I products. Specific products were not always seen in the primary reactions due to their low concentration. However, these specific products became visible after the subsequent secondary reaction. In a few cases the secondary reaction produced type II products. Such products could be identified by their failure to undergo reamplification in the tertiary reaction (e.g., the upper bands in lane II of the last clone shown in Fig. 4A). We

have not observed any type II products in the tertiary PCR. By the outset of this reaction the original source DNA has been diluted 106-fold relative to the primary reaction, leaving less than 1 original cell genome equiv- alent per reaction. The 20 amplification cycles in the tertiary reaction are insufficient for any newly arising type II nontarget products to reach visually detectable levels (see control reactions shown in lanes designated “C2” in Fig. 5). The product specificity was verified sim- ply by comparing the sizes of the secondary and tertiary PCR products; target products in the tertiary reactions were slightly smaller than those in the secondary reac- tions in agreement with the nested positions of the primers. Since specific products were not always ampli- fied to detectable levels in the primary TAIL-PCR, we routinely omitted the agarose gel analysis of the pri-

B

C

D

FIG. 4.

M 1 II 111 I II III I II III I II III I II III

Agarose gel analysis of TAIL-PCR products for amplifi- cation of insert end sequences from the SP6 side (A) and T7 side (B) of Pl clones and the left side (C) and right side (D) of YAC clones. Each set of 3 lanes contains products from consecutive primary (I), secondary (II), and tertiary (III) reactions for a given clone. The arbitrary primers AD4, AD3, AD2, and AD1 were used for reactions shown in A, B, C, and D, respectively. Size marker (M) is a mixture of X-Hind111 and 4X174-Hue111 digests.

Page 5: Thermal Asymmetric Interlaced PCR- Automatable Amplificatio......Chromosome Walking

678 LIU AND WHITTIER

mary reactions and checked only the secondary and tertiary reaction products.

Since very short products have limited usefulness, reactions producing no products over 250 bp were con- sidered failures and repeated using a different AD primer. With any one AD primer, successful amplifica- tion of specific products were obtained in about 70- 80% of cases for Pl clones and 60-70% for YACs. Most of the amplified fragments ranged from ca. 300 bp to ca. 1 kb in size, and some even exceeded 2 kb. In some cases more than one specific fragment was produced (see Figs. 4 and 5). Despite multiple product bands, sequencing of the total product using the specific primer yielded clear sequencing profiles. This observa- tion confirmed our supposition that these multiple product bands constituted nested fragments derived from annealing of the AD primer at more than one site along the target sequence molecules.

Effects of TAIL-Cycling on Selection of Specific Products

To assess the effect of TAIL-cycling in suppressing amplification of type III nonspecific products, we com- pared reactions for two Pl clones with altered anneal- ing temperatures in the high-stringency cycles (Fig. 5). Reactions under standard thermal conditions produced no type III products in any reactions, including the control tertiary reactions using the AD primer alone (lanes designated “Cl” in Fig. 5A). This indicates that TAIL-cycling in the primary and secondary reactions had sufficiently suppressed type III product amplifica- tion. In the reactions with decreased annealing temper- ature (55°C) in the high-stringency cycles, type III products were amplified to detectable levels for one sample (No. 2) but not for another (No. 1) (Fig. 5B). The annealing temperature was set for all cycles to 44°C so that both specific and arbitrary primers would function with more nearly equal efficiency, and type III products were amplified to high levels in both samples (Fig. 50. These results clearly demonstrate that TAIL- cycling is very effective in suppressing nonspecific tar- get amplification.

Sequencing TAIZ-PCR Products Directly

Unpurified PCR products can be directly sequenced by the thermal asymmetric cycle sequencing strategy using unlabeled sequencing primers (Liu et al., 1993). In this study we adapted this method to sequence rap- idly the amplified clone insert end fragments on an automated DNA sequencer using the DyeDeoxy Termi- nator Cycle sequencing kit (ABI). With these kits an unlabeled sequencing primer is used, and the fluores- cent label is attached to the chain terminating dideoxy nucleotides. A small amount (5 ~1) of unpurified second- ary (or tertiary) PCR product was applied directly to sequencing reactions. Even when using the same spe- cific primer for PCR as for sequencing, unpurified tem- plate yielded clear sequencing profiles, reflecting the

#I #2

I 11 III CI C2 M I II III Cl C2

FIG. 5. Comparison of PCRs for two Pl clones with high (A), decreased (B), and no (C) thermal asymmetric priming during cy- cling. The T7 side primer set was used in combination with AD2 for clone 1 and with AD4 for clone 2. Control tertiary reactions using AD primer (Cl) or PT3 (C2) alone are shown. (A) TAIL-PCR with standard thermal conditions as shown in Table 1. (B) TAIL-PCR with annealing temperature in all high-stringency cycles decreased from 63 to 55°C. The supercycle number in file 5 (see Table 1) in the primary reaction was reduced to 10 and that in file 7 in the secondary reaction reduced to 8 to compensate for higher annealing efficiency and consequent amplification rate. (Cl PCR with an annealing tem- perature of 44°C in all cycles (with no change in file 3). The cycle number in file 4 in the primary reaction was increased to 38, and file 5 was not executed. For the secondary reaction, file 8 was used. All of the tertiary reactions in A, B, and C were carried out with normal cycling using file 8. The type III nonspecific products ampli- fied in B and C are identified as bands of identical size in lanes I, II, III, and Cl.

high specificity of the products. Figure 6 shows an ex- ample of direct sequencing. Using a high annealing temperature (SO”C>, interference by the carried-over AD primer was avoided. A very low concentration (22.5 /..&I of dNTPs used in the secondary and tertiary PCR reactions was also designed so that carried-over dNTPs would not unduly affect sequencing reactions.

Automation of the PCR Process

One of the advantages of the TAIL-PCR method is that no special manipulations apart from PCR are re- quired to obtain specific products, and even template DNA isolation can be omitted when amplifying clone insert ends. This simplicity enabled us to design assem- bly-line style protocols suitable for large-scale amplifi- cation and sequencing using the GeneAmp System

Page 6: Thermal Asymmetric Interlaced PCR- Automatable Amplificatio......Chromosome Walking

INSERT END AMPLIFICATION BY TAIL-PCR 679

FIG. 6. Direct sequencing of secondary TAIL-PCR product amplified from the SP6 side of a Pl clone insert. The vector border sequence is underlined.

9600 or other thermocyclers that are able to process 96 samples simultaneously. With these protocols the manipulations can be automated using a Biomek 1000 laboratory workstation (Beckmann) or expedited using multiple-channel pipets when doing manually.

Use of Insert End Probes for Chromosome Walking

Amplified insert end sequences from Pl and YAC clones were used to screen Pl and YAC libraries by hybridization to find overlapping clones for chromo- some walking. Figure 7 shows that the hybridization backgrounds were very low, demonstrating the high specificity of the PCR products.

DISCUSSION

We have developed a novel PCR strategy to amplify specifically segments of unknown sequence that flank known sequences. Compared to other PCR methods for the same purpose, TAIL-PCR has the following advan- tages. (1) Simplicity: TAIL-PCR entails neither special DNA manipulations before PCR nor laborious screen- ing afterward. Product specificity is effectively con- firmed by simple agarose gel analysis. This makes TAIL-PCR especially suitable for assembly-line style amplification allowing simultaneous manipulation of a

large number of samples. In contrast, all other PCR methods require numerous manipulations prior to PCR (restriction digestion, ligation, tailing, etc.) or manipu- lations afterward (Southern hybridization, primer la- beling and extension, autoradiograms and gel excision, etc.). These additional steps are cumbersome, labori- ous, and time-consuming. In addition, TAIL-PCR’s re- quirement for template DNA quantity (-ng) and purity (cell lysate or crude DNA) are extremely modest. In contrast, ligation- or tailing-dependent PCR methods require more DNA ( - pg> of high purity for various ma- nipulations. (2) High specificity: the proportion of coamplified nonspecific products is so low that TAIL- PCR products can be used directly as either hybridiza- tion probes or sequencing templates. This is its major advantage over targeted gene walking PCR (Parker et al., 1991) and single primer PCR (Parks et al., 19911, which normally produce high backgrounds of nonspe- cific products. (3) High efficiency: 60430% of reactions yielded specific products with any given AD primer. In ligation-dependent PCR methods, finding suitable restriction sites and ligating them is often a problem. For targeted gene walking PCR or single primer PCR, there is a tradeoff between success rate and low back- ground; carrying out cycling at lower annealing tem- peratures raises the success rate but results in higher

Page 7: Thermal Asymmetric Interlaced PCR- Automatable Amplificatio......Chromosome Walking

680 LIU AND WHITTIER

.

*

-_.- ---

FIG. 7. Library screening using TAIL-PCR products as probes. Before labeling, the residual dNTPs in the PCR reactions were re- moved by spin dialysis using Suprec-02 spin-filters (Takara). (A) Dot hybridization to an Arubidopsis Pl clone filter using an insert end probe amplified from the SP6 side of Pl clone 17A6. Each spot on the filter contained DNA pooled from 96 clones (Liu et al., 19951. (B) Colony hybridization to a filter prepared from one plate (EGlO) of the Arubidopsis EG YAC library using an insert end probe amplified from the right side of YAC clone EGlOF3. The colony position third from the bottom and third from the left is EGlOF3.

backgrounds. (4) Speed: the successive amplification reactions can all be completed in 1 day. (5) TAIL-PCR involves no ligation step, a process that risks chimeric artifacts. (6) Direct sequencing: aliquots of the reaction products are added directly to the sequencing reaction, streamlining the process from amplification to se- quence determination. Based on the sequencing data, specific primers can be prepared for PCR-based library screening. (7) High sensitivity: although we presented only the application of this method to Pl and YAC sys- tems in this paper, this method has been successfully used to recover genomic sequences flanking T-DNA and transposon insertions from Arabidopsis and rice ge- nomes (Liu et al., unpublished work; N. Fedoroff, Washington, pers. comm., 1994; S. Ishiguro, Okazaki, Japan, pers. comm., 1994; K. Shimamoto, Nara, Japan, pers. comm., 1994). Reconstruction experiments indi- cated that single-copy sequences can be amplified from hexaploid wheat (data not shown), whose genome size

(- 1.5 X 10” bp) is about fivefold larger than humans’. Therefore, this method should be readily applicable to complex genomes, including those of mammals. Taken together, these advantages make TAIL-PCR a powerful tool for chromosome walking, genome physical map- ping, development of sequence-tagged sites (STS), se- rial gene-walking, and analysis of genomic sequences flanking T-DNA, transposon, or retrovirus insertions.

TAIL-PCR requires a disparity in T, between the specific primers and the AD primer. To achieve ade- quate thermal asymmetric priming, the T,‘s of the spe- cific primers should be at least 10°C higher than the average T,‘s of the AD primers, and the annealing tem- perature in the high-stringency cycles should be set as high as possible (usually l-5°C higher than the calcu- lated T, of the specific primer). Other rules in selecting specific primers for TAIL-PCR are generally the same as those for normal PCR. Selection of an optimal spe- cific primer for the primary reaction is important for successful amplification. Therefore, if no satisfactory results are obtained, another specific primer for the primary reaction should be tested. For example, in a preliminary experiment we used YL2 (see Fig. 3B) as the specific primer for the primary reaction of TAIL- PCR but the success rate was low (data not shown). We then selected another primer (YLl) for the primary reaction. Mispriming by the specific primer generates unwanted type II products (Fig. 2). Although these are diluted out in subsequent reactions with nested prim- ers, these products compete for enzyme and substrates and can interfere with target product amplification if too abundant. Mispriming is more common for primers with very GC-rich 3 ’ ends (e.g., Crameri and Stemmer, 19931, and so we attempt to avoid these. As an addi- tional precaution, specific primers should be used at low concentration (about 0.15-0.2 ,uLM) to reduce mis- priming.

To increase the probability of annealing between AD primers and target sequences, we utilized degenerate oligonucleotides. Functional priming may require, in most cases, at least a 3-base perfect match at the prim- er’s 3 ’ end (Parker et al., 1991; Parks et al., 1991), and this requirement should be met on average every 64 (43) bases. After meeting this requirement, a 256-fold degenerate 16-mer population will contain a match to any given sequence with an average homology of 57.8% C(3 + 4 + 9 x 25%)/16], assuming random base distribu- tion. This value is much higher than the 39.1% 1(3 + 13 x 25%)/16] expected for a simple 16-mer, and thus less base-mismatch is required for adaptation priming. Examining adapted sites in their targeted gene walk- ing strategy, Parker et al. (1991) observed priming with an average 45% homology between primers and tem- plate sequences. Therefore, the combination of both low stringency and primer degeneracy in TAIL-PCR should further increase the probability for successful adapta- tion priming. This high level of promiscuous priming, however, does not result in high backgrounds of non- specific products in our TAIL-PCR strategy. The factors

Page 8: Thermal Asymmetric Interlaced PCR- Automatable Amplificatio......Chromosome Walking

INSERT END AMPLIFICATION BY TAIL-PCR 681

determining the suitability of an AD primer may in- clude its degeneracy level, length, and nucleotide se- quence. In this study we used 128- and 256-fold degen- erate arbitrary primers, and satisfactory results were obtained. The degeneracy of the arbitrary primers can be created either through inclusion of multiple bases at one position or through inosine incorporation. For example, AD3 and AD4 used in this study were pre- pared with inosine for a portion of the degenerate posi- tions, and they can be used at half the concentration required for AD1 and AD2. Overly high degeneracy levels in AD primers may lead to problems in control of priming efficiency, production of undesirably short products, and generation of primer-dimer artifacts. It is important to design AD and specific primers so that no primer dimers are formed even during low-strin- gency annealing.

The TAIL-PCR strategy described here will greatly facilitate isolation of DNA segments flanking known sequences and increase the efficiency of gene isolation by positional cloning. Using this method for rapid gen- eration of insert end probes, we have generated a geno- mic contig of 37 Pl clones in Arabidopsis (Liu et al., 1995). We have also isolated genomic sequences flank- ing T-DNA and transposon insertions from transgenic Arabidopsis lines (Liu et al., unpublished). This method is robust and has been successfully used with various plant genomes in several laboratories. The basic utility, simplicity, and power of this method adapts it for wide application in various fields of molecular biology re- search.

ACKNOWLEDGMENTS

We thank N. Fedoroff for helpful comments on the manuscript, H. Kishida of the RIKEN Institute for setting and use of the automated Biomek 1000 laboratory workstation, and N. Hayashida for use of YAC clones. This research is conducted as a part of the Industrial Technology Development Promotion Program of the Research Insti- tute of Innovative Technology for the Earth (RITE) for global environ- mental problems, supported by the Ministry of International Trade and Industry of Japan.

REFERENCES

Cramer-i, A., and Stemmer, W. P. C. (1993). 10”-Fold aptamer library amplification without gel purification. Nucleic Acids Res. 21: 4410.

Frohman, M. A., Dush, M. K., and Martin, G. R. (19881. Rapid produc- tion of full-length cDNAs from rare transcripts: Amplification us- ing a single gene-specific oligonucleotide primer. Proc. Nutl. Acad. Sci. USA 85: 8998-9002.

Grill, E., and Somerville, C. (1991). Construction and characteriza-

tion of a yeast artificial chromosomes library of Arabidopsis which is suitable for chromosome walking. Mol. Gen. Genet. 226: 484- 490.

Isegawa, Y., Sheng, J., Sokawa, Y., Yamanishi, K., Nakagami, O., and Ueda, S. (1992). Selective amplification of cDNA sequence from total RNA by cassette-ligation mediated polymerase chain reaction (PCR): Application to sequencing 6.5 kb genome segment of hantavirus strain B-l. Mol. Cell. Probes 6: 467-475.

Liu, Y.-G., Ikeda, T. M., and Tsunewaki, K. (1992). Moderately re- peated, dispersed, and highly variable (MRDHV) genomic se- quences of common wheat usable for cultivar identification. Theor. Appl. Genet. 84: 535-543.

Liu, Y.-G., Mitsukawa, N., and Whittier, R. F. (1993). Rapid sequenc- ing of unpurified PCR products by thermal asymmetric PCR cycle sequencing using unlabeled sequencing primers. Nucleic Acids Res. 21: 3333-3334.

Liu, Y.-G., Mitsukawa, N., Vazquez-Tello, A., and Whittier, R. F. (1995). Generation of a high quality Pl library OfArabidopsis suit- able for chromosome walking. Plant J., in press.

Loh, E. Y., Elliot, J. F., Cwirla, S., Lanier, L., and Davis, M. M. (19891. Polymerase chain reaction with single-sided specificity: Analysis of T cell receptor 6 chain. Science 243: 217-220.

Mazars, G.-R., Moyret, C., Jeanteur, P., and Theillet, C.-G. (1991). Direct sequencing by thermal asymmetric PCR. Nucleic Acids Res. 19: 4783.

Mueller, P. R., and Wold, B. (1989). In viuo footprinting of a muscle specific enhancer by ligation mediated PCR. Science 246: 780-786.

Ochman, H., Gerber, A. S., and Hart& D. L. (1988). Genetic applica- tions of an inverse polymerase chain reaction. Genetics 120: 621- 623.

Ohara, O., Dorit, R. L., and Gilbert, W. (1989). One-sided polymerase chain reaction: The amplification of cDNA. Proc. Natl. Acad. Sci. USA 86: 5673-5677.

Parker, J. D., Rabinovitch, P. S., and Burmer, G. C. (19911. Targeted gene walking polymerase chain reaction. Nucleic Acids Res. 19: 3055-3060.

Parks, C. L., Chang, L.-S., and Shenk, T. (1991). A polymerase chain reaction mediated by a single primer: Cloning of genomic se- quences adjacent to a serotonin receptor protein coding region. Nucleic Acids Res. 19: 7155-7160.

Pierce, J. C., Sauer, B., and Sternberg, N. (1992). A positive selection vector for cloning high molecular weight DNA by the bacteriophage Pl system: Improved cloning efficacy. Proc. Natl. Acad. Sci. USA 89: 2056-2060.

Riley, J., Butler, R., Ogilvie, D., Finniear, R., Jenner, D., Powell, S., Anand, R., Smith, J. C., and Markham, A. F. (1990). A novel, rapid method for the isolation of terminal sequences from yeast artificial chromosome (YAC) clones. Nucleic Acids Res. 18: 2887-2890.

Rosenthal, A., and Jones, D. S. (1990). Genomic walking and se- quencing by oligo-cassette mediated polymerase chain reaction. Nucleic Acids Res. 18: 3095-3096.

Silver, J., and Keerikatte, V. (1989). Novel use of polymerase chain reaction to amplify cellular DNA adjacent to an integrated provi- rus. J. Viral. 63: 1924-1928.

Triglia, T., Peterson, M. G., and Kemp, D. J. (1988). A procedure for in vitro amplification of DNA segments that lie outside the boundaries of known sequences. Nucleic Acids Res. 16: 8186.