10
Learning by Selection in the Trion Model of Cortical Organization Krishna V. Shenoy, Jeffrey Kaufman, John V. McGrann, and Gordon L. Shaw Center for the Neurobiology of Learning and Memory and Department of Physics, University of California, Irvine, California 92717 The basic issue of whether mammalian learning in cortex proceeds via a selection principle, as stressed by Ed- elman, versus an instructional one is of major impor- tance. We present here a realization of selection learning in the trion model, which is based on the Mountcastte columnar organizational principle of cortex. We suggest that mammalian cortex starts out with an a priori con- nectivity between minicolumns that is highly structured in time and in space, competing between excitation and inhibition. This provides a "naive" repertoire of spatial- temporal firing patterns that stimuli and internal pro- cessing map onto. These patterns can be learned with small modifications to the connectivity strengths de- termined by a Hebbian learning rule. As various patterns are learned, the repertoire changes somewhat in order to respond properly to various stimuli, but the majority of all possible stimuli still map onto spatial-temporal firing patterns of the original repertoire. In order to show that the example presented here is showing true selec- tivity and is not an artifact of more stimuli evolving into the learned pattern, we develop a selectivity measure. We suggest that some form of instructional learning (in which connectivities are finely tuned) is present for difficult tasks requiring many trials, whereas very rapid learning involves selectional learning. Both types of learning must be considered to understand behavior. It is clear that the basic conceptual issue of whether mammalian learning proceeds via a selection prin- ciple (Fig. 1) as stressed by Edelman (1978, 1987) versus an instructional one (Rumelhart et al., 1986) is of large importance. For example, we suggest that selectional learning can proceed much more rapidly than instructional learning, and with comparatively small changes in connectivity. One-trial learning would be very hard to understand in an instructional neuronal network model. Clearly there are very dif- ficult learning tasks that require hundreds or thousands of trials to perfect, whereas some tasks can be learned in just a few trials (see Table 12.1 of Weinberger et al., 1984). It seems reasonable that some form of in- structional learning (in which connectivities are fine- ly tuned) is present for these difficult tasks requiring many trials, whereas the very rapid learning involves selectional learning. In contrast,finelycontrolled mo- tor movements requiring a high precision of accuracy need many trials of learning to accomplish this, and may use an instructive learning method that involves fine tuning the connnectivity. We do not enter the debate (Crick, 1989, 1990; Michod, 1990; Reeke, 1990) concerning Edelman's proposed neuronal group realization of the selection principle of learning, but we present here a realiza- tion of selectional learning in the trion model of the cortex, which is highly structured (Shaw et al., 1985, 1988; Silverman et al., 1986; Leng and Shaw, 1991; McGrann et al., 1991; Leng et al., 1992, 1993). With a selection principle, the various responses of the network exist prior to any stimuli and constitute the naive repertoire. The stimuli "selects" out a particular response from the naive repertoire. This selection process takes place, in the trion model, with small changes in the connectivity through a Hebbian (Hebb, 1949) learning algorithm. This learning should pro- ceed very rapidly since the response is already built in and doesn't have to be created from repeated ex- posure to the stimuli as in an instructional principle. In general, different stimuli should select out differ- ent responses, although some may select out the same response, indicating some similarity between the two stimuli. Also, the response should not be overlearned to the point that all stimuli select out the same re- sponse. We will discuss the qualitative differences between a highly structured connectivity, which we use in the trion model, versus an initial set of random connections. Cerebral Cortex May/Jun 1993;3:239-248; 1047-3211/93/M.OO

Learning by Selection in the Trion Model of Cortical Organizationweb.stanford.edu/~shenoy/GroupPublicationsPreStanford/... · 2006. 4. 9. · trion (Fig. 3) represents an idealized

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Learning by Selection in the Trion Model of Cortical Organizationweb.stanford.edu/~shenoy/GroupPublicationsPreStanford/... · 2006. 4. 9. · trion (Fig. 3) represents an idealized

Learning by Selection in the TrionModel of Cortical Organization

Krishna V. Shenoy, Jeffrey Kaufman, John V.McGrann, and Gordon L. Shaw

Center for the Neurobiology of Learning andMemory and Department of Physics, University ofCalifornia, Irvine, California 92717

The basic issue of whether mammalian learning in cortexproceeds via a selection principle, as stressed by Ed-elman, versus an instructional one is of major impor-tance. We present here a realization of selection learningin the trion model, which is based on the Mountcasttecolumnar organizational principle of cortex. We suggestthat mammalian cortex starts out with an a priori con-nectivity between minicolumns that is highly structuredin time and in space, competing between excitation andinhibition. This provides a "naive" repertoire of spatial-temporal firing patterns that stimuli and internal pro-cessing map onto. These patterns can be learned withsmall modifications to the connectivity strengths de-termined by a Hebbian learning rule. As various patternsare learned, the repertoire changes somewhat in orderto respond properly to various stimuli, but the majorityof all possible stimuli still map onto spatial-temporalfiring patterns of the original repertoire. In order to showthat the example presented here is showing true selec-tivity and is not an artifact of more stimuli evolving intothe learned pattern, we develop a selectivity measure.We suggest that some form of instructional learning (inwhich connectivities are finely tuned) is present fordifficult tasks requiring many trials, whereas very rapidlearning involves selectional learning. Both types oflearning must be considered to understand behavior.

It is clear that the basic conceptual issue of whethermammalian learning proceeds via a selection prin-ciple (Fig. 1) as stressed by Edelman (1978, 1987)versus an instructional one (Rumelhart et al., 1986)is of large importance. For example, we suggest thatselectional learning can proceed much more rapidlythan instructional learning, and with comparativelysmall changes in connectivity. One-trial learningwould be very hard to understand in an instructionalneuronal network model. Clearly there are very dif-ficult learning tasks that require hundreds or thousandsof trials to perfect, whereas some tasks can be learnedin just a few trials (see Table 12.1 of Weinberger etal., 1984). It seems reasonable that some form of in-structional learning (in which connectivities are fine-ly tuned) is present for these difficult tasks requiringmany trials, whereas the very rapid learning involvesselectional learning. In contrast, finely controlled mo-tor movements requiring a high precision of accuracyneed many trials of learning to accomplish this, andmay use an instructive learning method that involvesfine tuning the connnectivity.

We do not enter the debate (Crick, 1989, 1990;Michod, 1990; Reeke, 1990) concerning Edelman'sproposed neuronal group realization of the selectionprinciple of learning, but we present here a realiza-tion of selectional learning in the trion model of thecortex, which is highly structured (Shaw et al., 1985,1988; Silverman et al., 1986; Leng and Shaw, 1991;McGrann et al., 1991; Leng et al., 1992, 1993). Witha selection principle, the various responses of thenetwork exist prior to any stimuli and constitute thenaive repertoire. The stimuli "selects" out a particularresponse from the naive repertoire. This selectionprocess takes place, in the trion model, with smallchanges in the connectivity through a Hebbian (Hebb,1949) learning algorithm. This learning should pro-ceed very rapidly since the response is already builtin and doesn't have to be created from repeated ex-posure to the stimuli as in an instructional principle.In general, different stimuli should select out differ-ent responses, although some may select out the sameresponse, indicating some similarity between the twostimuli. Also, the response should not be overlearnedto the point that all stimuli select out the same re-sponse. We will discuss the qualitative differencesbetween a highly structured connectivity, which weuse in the trion model, versus an initial set of randomconnections.

Cerebral Cortex May/Jun 1993;3:239-248; 1047-3211/93/M.OO

Page 2: Learning by Selection in the Trion Model of Cortical Organizationweb.stanford.edu/~shenoy/GroupPublicationsPreStanford/... · 2006. 4. 9. · trion (Fig. 3) represents an idealized

Hgnra 1 . A schematic description of Edelman's basic idea of learning by selectionImtialty, there is a repertoire of cortical responses { f l , , R,. /?,,. .} present at birthso that a given stimuli S, mrght excite a number ol these inherent responses. Afterlearning, or through experience. S, will lead mainly to just one of these responses,

Mountcastle (1978) proposed that the cortical col-umn (Goldman-Rakic, 1984) is the basic network inthe cortex and is composed of small irreducible pro-cessing units called minicolumns (see Fig. 2; we notethat the beautiful optical recording results by Bon-hoeffer and Grinvald, 1991, in secondary visual cortexshow a truly striking similarity to the cartoon-ideal-ized primary visual cortex in Fig. 2). Such a columnhas the capability of being excited into complex spa-tial-temporal firing patterns (for further discussion ofthe cooperativity involved in the functioning of smallgroups of neurons, see Shaw et al., 1982). The as-sumption is that higher mammalian processes involvethe creation and transformation of such complex spa-tial-temporal firing patterns (in contrast to a "code"that involves sets of neurons firing with high fre-quency). Evidence is accumulating that this spatial-temporal code describes the "internal language" ofthe cortex (Eckhorn et al., 1988; Gray and Singer,1989; Gray et al., 1989; Dinse et al., 1990; Richmondet al., 1990; Singer, 1990; Leng et al., 1993; Shaw etal., 1993).

The trion model of the cortical column (Shaw etal., 1985,1988; Silverman et al., 1986; Leng and Shaw,1991; McGrann et al., 1991) is a mathematical real-ization of Mountcastle's organizational principle. Atrion (Fig. 3) represents an idealized minicolumn orroughly 100 neurons, and has three levels of firingactivity: above average, average, and below average.The "naive" (before learning) interactions among thetrions in the column are taken to be localized, highlystructured, and competing (excitatory and inhibito-ry). The firing state of the network (cortical column)of trions is updated in a manner related to the states

Hgara 2. Highly schematic representation of the Moumcastle (1978) principle ofcortical organzaiton. Each hexagon, following von Seefen (1970) and Bratenberg andBrartenberg (1979), represents a conical column, while each mangle is a mincolumnthat encodes the relevant parameter in the stimuli such as Ime orientation in thevisual cortex (Huts) and Wiesel. 1977), shown here by the (rented bars. We notethat the beautiful optcal recording results by Bonhoeffer and Grinvald (1991) insecondary visual cortex show a truly suiting similarity to the cartoon-idealaed primaryvisual cortex shown here. In order to understand the cdleaive dynamic behavior ofa cortical area, it is necessary to study the basic processing urm: the conical column

of the previous two time steps (Fig. 4). A column witha small number of trions having structured connec-tions yields a large repertoireo{quasi-stable, periodicspatial-temporal firing patterns, defined as magic pat-terns, or MPs, which can be excited. These inherentpatterns are called "magic patterns" because of theirability to be learned or enhanced via a Hebb learningrule to a large cycling probability. The repertoire of(periodic) MPs is found by evolving all possible ini-tial states (of the first two time steps) by followingthe most probable or deterministic path. In a fullprobabilistic (or Monte Carlo) evolution, the MPsevolve in natural sequences from one to another. Ithas been shown that the probability of each MPremaining in that pattern can be enhanced by only asmall change in connection strengths using a Heb-bian learning rule (it is not our purpose here to ex-amine fully the assumptions of the trion; see Leng etal., 1992, for a suggested test of the trion model withlarge clinical significance). We demonstrate here theselectional learning properties of the trion model ina (typical) simple example. In order to show that theexample presented here is showing true selectivityand is not an artifact of more stimuli evolving intothe learned pattern, we develop a selectivity measure.It is the repertoire of MPs that will be the repertoireof responses in our realization of the Edelman selec-tion principle shown in Figure 1. This repertoire aris-es from the structured nature of the cortex.

Materials and Methods

Basic ModelIn brief, the trion model was developed starting fromLittle's (1974; Little and Shaw, 1978) neural network

140 Learning by Selection • Shenoy et al

Page 3: Learning by Selection in the Trion Model of Cortical Organizationweb.stanford.edu/~shenoy/GroupPublicationsPreStanford/... · 2006. 4. 9. · trion (Fig. 3) represents an idealized

analogy to the Ising spin system, and modified in adirection inspired by Mountcastle's (1978) organi-zational principle (Fig. 2) and by the striking physicalspin model results of Fisher and Selke (1980). Thetrion represents the minicolumn and has three levelsof firing activity. Following Fisher and Selke, the in-teractions among the trions are taken to be localized,competing (between excitation and inhibition), andhighly structured. The firing state of the network (cor-tical column) of A'trions is updated in a probabilisticway related to the states of the two previous time stepsas in Figure 4. The probability P(S<) of the »th trionhaving a firing level or state 5, = 1, 0, —1 at time nris given by

g(S,)exp[BMtS,]

(1)

- v"

where S'j and 5/ are the states of yth trion at the twoearlier times (n — 1)T and (n — 2)T, respectively. VtJ

and Wv are the interactions between trion i at timem and trion j at times (n — 1)T and (n - 2)T, re-spectively. V,is an effective firing threshold. The threepossible firing states (of each trion) denoted by +, 0,— for S = 1, 0, — 1 represent, respectively, a large"burst" of above average firing, an average burst, andbelow average firing. The term g(S) with g(0) >•#(+) = g(~) takes into account the number of equiv-alent firing configurations of the trion's internal neu-ronal constituents. [E.g., in a trion representing a groupof 90 neurons, firing levels of +, 0, — could corre-spond to 90-61, 60-31, 30-0 neurons firing, respec-tively. There are many more combinatorial ways ofgenerating the 60-31 level (Roney and Shaw, 1980;Shawet al., 1982). This feature, g(0) » « ( + ) = g ( - ) ,gives stability to the trion model firing patterns.] Thefluctuation parameter B is inversely proportional tothe noise, and results (Shaw and Vasudevan, 1974)from the statistical nature of neurotransmitter releasefrom the synapses (Katz, 1969)- Studies of the trionmodel for memory (Shaw et al., 1988) and for patternrecognition (McGrann et al., 1989; McGrann, 1992)have been reported. Basically, the success of thesestudies is due to the fact that the localized, competing(between excitation and inhibition), and highly sym-metric connections yield a huge repertoire of inher-ent quasi-stable, periodic firing patterns, MPs. Fur-ther, these MPs evolve in certain natural sequencesfrom one to another (see, e.g., Fig. 1 of Shaw et al.,1985) in Monte Carlo simulations (described below).Any of the MPs can be readily learned or enhancedwith only small changes in the interaction strengthsusing the Hebb learning algorithm:

(2)

< > / ( « - 2 ) r ) , t > 0 ,M

where n runs over the time steps in one cycle of the

A Trion

Above Average BelowAverage Average

Figure 3. We identify a mincokimr with the idealized tron. and the basic networkof wore is the conical column. As shown, the trion hat three levels of firing activity.

MP. Simply extending this learning rule to a thirdtime step [using the correlation S,{m)Sj{{n - 3)r)]significantly enhances the effects of learning with asmaller change in the total connectivity (Leng, 1990;McGrann, 1992). We use this three-time-step versionof the Hebb learning rule (Eq. 2) in the calculationspresented in this article. One possible physiologicalorigin for a dependence on the previous (three) timesteps involves the circulation of activity through theconical layers (Krone et al., 1986; Shaw et al., 1993).

Let us define the cycling probability PC(MP) thatthe firing pattern for the columnar trion network re-mains in the MP for one cycle of the repeating MP.The PC(MP) is calculated by multiplying the proba-bilities PiS,) (Eq. 1) of each trion /being in the state5 at time nr, given by that MP for its whole cyclelength:

FC(MP) (3)

where / runs over the N trions in the columnar net-work. As a result of learning an MP using the Hebbalgorithm (Eq. 2), the cycling probability PC(MP) (Eq.3) is increased. Further, after learning, many moreinitial states, as noted below, will go to the learnedMP (and some related MPs). Note that these MPsevolve in natural sequences (see below) from one toanother in a Monte Carlo probabilistic calculation de-scribed below.

Monte Carlo SimulationsAs described in the introductory remarks, the trionmodel represents a mathematical realization of theMountcastle organizational principle for the corticalcolumn as the basic neuronal network in the cortex.Here we explicitly describe the simulations givingthe (probabilistic) Monte Carlo evolutions of spatial-temporal firing patterns of a trion network.

(1) We specify the parameters of the trion network:the number of trions N, the degeneracy factors

the connectivities Vtj and Wip the firing

Cerebral Cortex May/Jun 1993, V 3 N 3 241

Page 4: Learning by Selection in the Trion Model of Cortical Organizationweb.stanford.edu/~shenoy/GroupPublicationsPreStanford/... · 2006. 4. 9. · trion (Fig. 3) represents an idealized

(n -2 )T 5, . 2

( n - l ) r

r iT S.-3 S,-2

'1 + 3

Figure 4 . One network of tnons at three time steps, showing the firing states 5 and ths comscuons V and W. For N trions in the columnar network, we have ths nng-ltoconnections trion / — tnon / + N, as in Rojre 2.

thresholds V,, and the fluctuation parameter B.For this article, we will consider N = 6, g(O)/g(+) = 500, B - 6.3, and V, = 0.

(2) A choice for the firing states for the initial twotimes steps is made. Since each of the six trionsin each time step has three possible firing levels5, there are 31J = 531,441 possible choices. Notethat each trion is distinguishable.

(3) Given the firing states for each trion at the twoearlier times (n — 1)T and (n — 2)T, the prob-ability ^(5,) for the fth trion being in state S, attime nr is calculated from Equation 1.

Figure 5. Roulette analogy for a Moraa Carts evtAruon. The prooatnlruss P\S] forthe three firing tev&b for each trion st a grvsn urns step 8rs dsiBi'iinsd by thedynsrracs of the tnon model and ore represented bi ths roulette wheel by the shs ofthe rsspearvs box. Ths shadings whim, pdf, a t bhck represent the firing tsvats SH + , 0 , —, or above averags, average, end below average, 8s in Rrjure 3. Then,the actual choice is decided by which box the rotating roulette ball faUs into. Thelarger ths size of the box, ths more Irkely the ball will fall into n

(4) The three probabilities P(0), P(+), and /»(-)calculated in item 3 add up to 1.0 and can berepresented by three segments or bins of totallength 1.0 with the individual lengths being thesespecific values P(S<) arranged in a specific order.For example, let P ( - ) = 0.2, P(0) = 0.5, andP(+) = 0.3 so that we have probability segments0.0-0.2 for S, = - 1 , 0.2-0.7 for 5, = 0, and 0.7-1.0 for 5, = 1. Then, a random number between0.0 and 1.0 is chosen by the computer and thusthe segment that the random number falls intodetermines the 5, to be chosen. This is a standardMonte Carlo simulation technique as illustratedin Figure 5. This procedure is done for each ofthe N - 6 trions to determine the pattern at thattime step. Then continue to repeat items 3 and4.

Repertoire ofMPs and Learning of an MPHaving made the choice of parameters in item 1 above,the repertoire of MPs or inherent, quasi-stable, peri-odic firing patterns is found as follows. For a giveninitial firing state (item 2), follow the procedure ofalways choosing the 5 for each trion that has the larg-est probability P(S) instead of using the Monte Carlorule (item 4). [In the above example, the S, would be0.] The time evolution rapidly goes into a repeatingspatial-temporal pattern or MP. (This type of calcu-lation allows one easily to test whether an arbitraryspatial-temporal pattern is an MP.) Going through allpossible initial states (item 2) gives all the MPs (therepertoire of MPs). An MP has the property of beingreadily learned or enhanced using the three-time-stepversion (described above) of the Hebbian learningrule (Eq. 2) with only a small change in the connec-tions; that is, only a small value of t is needed. Thesesmall changes in the connections give only a smallchange in the repertoire of MPs. After learning, when

242 Learning by Sclealon • Shenoy et al.

Page 5: Learning by Selection in the Trion Model of Cortical Organizationweb.stanford.edu/~shenoy/GroupPublicationsPreStanford/... · 2006. 4. 9. · trion (Fig. 3) represents an idealized

{1} 1 {6} 3 {6} 4

{6} 7 {6} 8 {6} 9 {6} 10 {6} 11

12 {6} 13 {6} 14 {6} 15 {6} 16 {6} 17 {6}

18 {6} 19 {6} 20 {6} 21 {6} 22 {6} 23 {6}

{6} 25 {3} 26 {3} 27 {1} 28 {1} 29

{1} 31 {1} 32 {1} 33 {1}

Figure 8 . The initial repertoire of MPs for ds connscuvny given by Equator 4 is found by deterministicaSy (see Materials and Methods) evoking all possible 3 " initial statesirrrd repealing patterns, the MPs. are obtained. Each square represems a trim with three levels of firing activity: above background \ white), background (grsy). and below background{Htck). Each horizontal row represents a ring of interconnected (as in fig. 3) tnons (so that the scan square wraps anund to the first) and time evolves downward. There are atotal of 155 MPs that can be completely classified by thei distinct spatial rotations into 34 groups of MPs shown here The group number is listed on the top left coma, whilethe number of MPs in each group is given in braces (cyclically rotate the MP so that the first column a the second, etc.; if die MP is rat a temporal rotation of any of the otherelements in the group h is crjraifered distinct). In a Monte Carlo calculation (see Materials and Methods) these MPs would flow from one to another.

Cerebral Cortex May/Jun 1993, V 3 N 3 243

Page 6: Learning by Selection in the Trion Model of Cortical Organizationweb.stanford.edu/~shenoy/GroupPublicationsPreStanford/... · 2006. 4. 9. · trion (Fig. 3) represents an idealized

1.0

0.0 40.010 0.020

Figure 7 . The probability Pc to remain in MP 1(0) versus the learning coefficientt The probability is greatly enhancsd (roughly 0.9) for a small change in the connectivity|< = 0.025).

a Monte Carlo calculation is done, the evolution willboth go to the learned MP (and some related MPs)more often, and remain in it longer. Furthermore, anarbitrary spatial-temporal pattern cannot be readilylearned; only an MP can be learned to a high Pc (Leng,1990; McGrann, 1992).

Learning for Structured versus RandomConnectionsLet us examine a specific example of the repertoireof MPs for structured connections in an N= 6 network.Consider the connectivities and other parameters inEquation 1 to be as follows:

O V «n V =1 Iff n - l /^) vi,t+l vi.i-\ 1i w ij vyy

(4)

with thresholds V, = 0, all other Vy equal to 0, g(0)/g(±) - 500, and B - 6.3 (the connections, Eq. 4, falloff spatially; temporally we know that postsynapticinhibition has a slower time course, leading to a neg-ative W). Then, following the calculations in the sec-tion above, each of the 312 = 531,441 possible choicesfor the initial states is followed to find the 155 MPsshown in Figure 6 with an average recall time rn =•3.5T (McGrann et al., 1991) (note that all the MPs inFig. 6 have cycle length 6 except for MP 33, whichhas cycle length 1; this is related to the specific choiceof connections, Eq. 4, and not the number of trions;for other examples, see Shaw et al., 1985; Silvermanet al., 1986). These 155 MPs can be placed into 34sets as shown in Figure 6, where the MPs in a set arerelated to each other by a spatial rotation among trions,R, among the (distinguishable) trions (Silverman etal., 1986). Other symmetries among these MPs canbe used to categorize groups of MPs (McGrann, 1992).

Consider now learning an MP from Figure 6 usingthe Hebb learning algorithm, Equation 2. The ex-ample of learning MP 1(0) [where the (0) denotesthe MP obtained from the MP 1 specifically shown inFig. 6, by operating 0 times with spatial rotation op-erator R] is shown in Figure 7, where we plot theprobability of cycling Pc (as defined in Eq. 3) versus

e. Note that as a result of learning this MP 1(0), theconnections in Equation 4 will be modified accordingto Equation 2. For example, here, VijS = 1 and VM =1 + It, so we say that the precise symmetry, Vu+X =Kt,_, = 1, is slightly broken by learning for small t.

To contrast these results for our highly structuredconnections (Eq. 4), we choose several sets of randomconnections. In general, three qualitative featuresstand out in comparing structured connections versusrandom connections (McGrann, 1992): (1) the rep-ertoire for the random connections has a much small-er repertoire of MPs than for the structured connec-tions, (2) the average recall time rlv is much largerfor the random connections than for the structuredconnections, and (3) the learning properties of an MP(of comparable cycle length) are largely reduced forthe random connections as compared to the struc-tured connections.

ResultsA selection theory of learning inherently contains a"naive" or preexisting repertoire of responses to stim-uli as in Figure 1. For our trion model, this repertoireof responses is the repertoire of MPs determined bythe procedure given in Materials and Methods. Whenexternal stimuli are internally processed, these eventsmap onto MPs based on the properties of the stimuli.The number and type of MPs in the naive repertoiredepend on the connections. A crucial feature asstressed in the previous section is that structured con-nectivity is necessary for the existence of a large rep-ertoire of MPs. We have suggested that there is in-creasing evidence for such structure (Leng and Shaw,1991; Leng et al., 1993). These interactions (Fig. 4)among the trions in a column are taken to be local-ized, highly structured, and competing between ex-citation and inhibition. For our example, we choosethese "naive" (nonzero) connections to be the simplevalues given in Equation 4 that fall off spatially; tem-porally we expect that postsynaptic inhibition has aslower time course. (These connections are not uniqueand the brain probably utilizes different sets depend-ing upon the processing tasks of the area of cortexwhere it is located.)

The repertoire is found from deterministicallyevolving all 312 possible initial firing states as de-scribed in Materials and Methods. There are 155 dis-tinct MPs (temporal rotations of an MP are not dis-tinct). If the 155 MPs are grouped according to theirspatial rotations, there are 34 groups that result, ascan be seen in Figure 6. These are the inherent naturalfiring patterns of the network that would comprise thetypes of responses of the network before learning.

For our simple, typical example, consider a partic-ular external stimulus that then results in the exci-tation of a trion network. In particular, we imaginethat our stimulus excites part of the first two time stepsas in Figure 8. The three "unspecified" trions in Fig-ure 8 (giving 27 possibilities) denoted by diagonalstripes might represent various backgrounds. For ex-ample, the stimulus might be a rhino while the back-ground might be a zoo, a plain in Africa, or shopping

244 Learning by Selection • Shenoy et al

Page 7: Learning by Selection in the Trion Model of Cortical Organizationweb.stanford.edu/~shenoy/GroupPublicationsPreStanford/... · 2006. 4. 9. · trion (Fig. 3) represents an idealized

• ABOVE AVERAGE

• AVERAGE

• BELOW AVERAGE

S VARIED

NO LEARNING

MP 1(0)e =.025

5(0,3) 0(0) 4(0)

6 ( 5 ) 1(0,1,5) 12(3)

7(3) 2 (0 , I)

15(5) 3(0,4,5)

Figure 8 . The snmutus plus background states chosen to i t e r a t e the selection principle in the trial model is given si the rep. where the tires dkgonsBy wiped squaws arevaried fa the 27 possible combinations. Each stimulus plus background state is dnffminmialty evolved (see Materials and Methods) unti it reaches an MP. tnrttaily the "naive"27 stimulus-backflround pairs evolve into a subrepertoire of 16 different MPs, wbch are shown here on the left as 10 distinct groups in a format similar to Figure 6. Tha finsr/water is the group number, white the different spatial rotations (0 - unrotated MP, 1 - one rotation to the right....) are listed in the psemhesss. After learmng MP 1(0)with a learning coefficient of e - 0.Q25 given in Equation 2. the stimutus-oadground pairs dstermhsticaHy evolve into a subrepertoire ol eight MPs shown on the rigtn. Beforeleaning four stimufus-bsctground pairs evolve into MP 1(0), while after learning 12 pairs evolve into it

Cerebral Cortex May/Jun 1993, V 3 N 3 246

Page 8: Learning by Selection in the Trion Model of Cortical Organizationweb.stanford.edu/~shenoy/GroupPublicationsPreStanford/... · 2006. 4. 9. · trion (Fig. 3) represents an idealized

0.000 0.010 0.020

R g o r * 9 . Percentage recall versus t for MP 1(0). A Monte Carlo calculation foreach of the 27 stimulus-background pairs was performed 500 tries per t . Thepercentage of Monte Carlo catcuiatnns (or percentage recall) that evotvBd rnto MP1 (0) was determined for dierent amounts of learning, rang'rrj from £ = 0 (no learning)to e - 0.025. Notice that even though there was only a small change m connectivity,the percentage recall changes from less than I X to about 30%.

mall. The 27 different initial stimulus-backgroundstates at the top of the figure were evolved determin-istically until an MP was found. Before learning, thislead to a subrepertoire of 16 MPs that are shown atthe left of Figure 8.

Consider that the rhino in its background sur-roundings initially excited MP 1(0). This choice isarbitrary. The assumption is that once this particularMP has been excited (by the rhino plus a particularbackground) it is enhanced through the Hebb learn-ing rule. Then, using our three-time-step version(Leng, 1990; McGrann, 1992) of the Hebbian learningrule (Eq. 2), the "naive" connections (Eq. 4) weremodified. In Figure 7, Pc versus the learning coeffi-cient is graphed showing the increase in probabilityas the MP 1(0) as a function of the learning coeffi-cient i.

The probabilistic evolution of the 27 initial stim-ulus-background states was performed 500 times each.Without learning, only 1% of these 13,500 trials evolvedinto MP 1(0). As the learning coefficient increased,the number of stimulus-background pairs that re-called or evolved into MP 1(0) increased to roughly30% (see Fig. 9). The results of the Monte Carlo cal-culations (with MPs grouped along with their spatialrotations as in Fig. 6) for no learning and for learningwith a learning coefficient t of 0.025 are shown inFigure 10. Only those MP groups recalled by greaterthan 1% of the stimulus-background pairs are shownin Figure 10. Notice that the MP groups 2, 3, and 12also had an increase in the percentage of stimulus-background pairs evolving into them, thereby creat-ing together with MP 1(0) a category of selected MPs,with MP 1(0) being the dominant one.

In order to show that the example presented hereis showing true selectivity and is not an artifact ofmore stimuli recalling or evolving into MP 1(0), wedevelop a general selectivity measure. Let

NO LEARNING€ = .025MP 1(0)

3 4 5 12 22 27 32GROUP

Figure 10 . The percentage recall (as described m Fig. 8) a shown here before andafter learning (t = 0.025] MP 1(0] for each group given in Figure 8 that had apercentage recall greater than 1K. [Note that the MPs 6(5), 7(3). and 15(5) in Rg.B do not fufSP this criterion.] Before learning, the stimulus-background pairs proba-bilistically evolve into MP 0 the most, while after learning, MP 1(0) becomes thedominant MP. Groups 2, 3, and 12 correspondingly increase their respective recallpercentages after teaming MP 1(0), thereby formtng a category of MPs that 8reIrkewoe enhanced.

sb = stimulus-backgroundn^ = number of sb pairs,n^ = number of sb pairs evolving into learned MP

after learning,«b, - number of sb pairs evolving into learned MP

before learning,A = total number ofall initial states = 212 = 531,441,N^ = all initial states that evolve into learned MP

after learning, andNbl = all initial states that evolve into learned MP

before learning.

Then, our selectivity ratio SK is defined as

SK = [(«., - Wbi)/^]/[(Ai, - N^/N,]. (5)

SR > 1 then is required for selectivity. In contrast, SR

< 1 would indicate a general increase in the numberof initial states evolving into the learned MP.

Before learning, there were 155 MPs in the rep-ertoire found from the deterministic evolution of allpossible 531,441 initial states of which A^/ty = 3.0%that recalled or evolved into MP 1(0). After learningMP 1(0) with a learning coefficient of 0.025, therewere 88 MPs and NjN, - 9.3% of the initial statesevolved into MP 1(0). The deterministic evolution ofthe n^ = 27 stimulus-background pairs into MP 1(0)changed from n^, = 4 {n^Jn^ = 14.8%) before learn-ing to «., •• 12 (n,,/n^, " 44.4%) after learning. FromEquation 5, then, Sg = 4.7. Selectivity is thus dem-onstrated. (This was also true for smaller t.)

The example of selective learning shown here isjust one of several such cases that were examined(McGrann, 1992).

DiscussionWe have demonstrated (by presenting a simple, typ-ical example) that the trion model of cortical orga-

248 Learning by Selection • Shenoy et al

Page 9: Learning by Selection in the Trion Model of Cortical Organizationweb.stanford.edu/~shenoy/GroupPublicationsPreStanford/... · 2006. 4. 9. · trion (Fig. 3) represents an idealized

nization gives a realization of the Edelman selectionprinciple of learning (Fig. 1). However, in contrastto the Edelman realization via the neuronal groupselection, the trion model has a very large naive rep-ertoire of quasi-stable spatial temporal firing patterns,MPs, any of which can be selected out through theHebbian learning rule (Eq. 2). As we saw in Figure10, only a small modification of synaptic connectionsamong the trions, e = 0.025, was necessary to selectout an MP and yield it as the dominant response.Further, we defined, above, the selectivity ratio SK (Eq.5), which gives a quantitative measure of the truenature of the selective change after learning, andshowed that our example clearly satisfied this crite-rion.

After learning an MP with a relatively small t(<O.O25), the exact symmetry in the "naive" connec-tions (Eq. 4) is broken, leading to a loss in the numberof MPs from the naive repertoire (before learning) tothe new repertoire (after learning) as found in thedeterministic (most probable) calculations. However,when the Monte Carlo (full probabilistic) calcula-tions are done, these "lost" MPs will reappear andcan be learned with the Hebb rule, bringing themback to the (deterministic) repertoire. Thus, we mightintroduce the concept of the actual or dynamic rep-ertoire (McGrann, 1992).

To appreciate the possible huge size of the trionmodel repertoires for even these small six trion net-works, we note that the (nonzero) connections Vt,_,= 1.0, Vu+l = 1.0, Wu_2 = -1.0, Wu+I = -1.0 gave1804 MPs (see Table 1 of Shaw et al., 1985). As thenumber of trions in the cortical column increases, thesize of the repertoire increases (nonlinearly). We ex-pect that a more realistic cortical column would haveperhaps as many as 20 minicolumns or trions. Fur-thermore, as we couple several columns together(Leng and Shaw, 1991; Leng et al., 1993), again therepertoire increases. The importance of the symmetryof the connections and of the competition betweenexcitation and inhibition is illustrated in McGrann etal. (1991): 20% changes from the connections inEquation 3 decrease the number of MPs by only afactor of two, illustrating the robustness of the model.However, a random set of connections can decreasethe number of MPs in a repertoire by two orders ofmagnitude! Furthermore, the recall time (the numberof time steps, on the average, for an initial state toprojeCT onto an MP) for an MP is considerably fasterwhen the connections are structured versus random.

Clearly, we have presented only a simple realiza-tion of the Edelman selection theory of learning inthe example in this article. Any memory trace of ahigher level response would involve many corticalcolumns and several cortical areas. This, in fact, shouldconsiderably enhance the quantitative nature of ourresults in the manner emphasized by von Neumann(1956) in his classic report on building reliable or-ganisms from components with errors in their per-formance. The high symmetry of the Mountcastle co-lumnar organizational principle, and the additionalassumptions in the trion model based on it are clearly

huge simplifications. However, we believe the powerand robustness of the trion model make it useful toconsider advantages of some generalized, more re-alistic versions of it. Thus, for example, we suggestthat the arguments (Swindale, 1990) over whether allminicolumns or all columns need to have the samenumber of neurons are irrelevant for understandingthe dynamics involved in the Mountcastle columnarorganizational principle. We propose that there willbe these more biological realizations having struc-tured {in space and time), competing connectionsamong groups of neurons leading to large repertoiresof inherent spatial temporal patterns that can be readi-ly learned. These repertoires then will serve as thebasis of implementing the Edelman selection prin-ciple. It is apparent that as neuroanatomical and neu-rophysiological techniques have improved in the pastdecade, more and more structure has been found inthe cortex. We expect this trend to continue (Lengand Shaw, 1991; Lengetal., 1993). This in turn makesour example presented here more relevant in under-standing the existence of a selective-type learningcapability in the cortex.

In conclusion, we stress that the conceptual dis-tinction between a selectional theory of learning incortex and an instructional one is of major impor-tance. Selectional learning can proceed much morerapidly than instructional learning, and with compar-atively small changes in connectivity. One-trial learn-ing would be very hard to understand in an instruc-tional neuronal network model. We suggest that someform of instructional learning (in which connectivi-ties are finely tuned) is present for difficult tasks re-quiring many trials, whereas very rapid learning in-volves selectional learning. Both types of learning arecrucial in understanding behavior.

NotesThis research was supported in part by the REU program ofthe National Science Foundation and by the National As-sociation of Music Merchants. K.V.S. was supported in partby a University of California Presidential Undergraduate Fel-lowship. We thank one of the anonymous referees for veryuseful comments

Correspondence should be addressed to Gordon Shaw,Center for the Neurobiology of Learning and Memory, Uni-versity of California, Irvine, CA 92717.

Krishna V. Shenoy is currently at Department of ElectricalEngineering and Computer Science, MIT, Cambridge, MA02139.

ReferencesBonhoeffer T, Grinvald A (1991) Iso-orientation domains

in cat visual cortex arranged In pinwheel-like patterns.Nature 353:429-431.

Braitenberg V, Braitenberg C (1979) Geometry of orien-tation columns in the visual cortex. Biol Cybern 33:179-186.

Crick F (1989) Neural Edelmanism. Trends Neurosci 12240-248.

Crick F (1990) Neural Edelmanism—reply. Trends Neu-rosci 13:13-14.

Dinse HR, Kruger K, BestJ (1990) A temporal structure ofcortical information processing. Concepts Neurosci 1:199-238.

Cerebral Cortex May/Jun 1993, V 3 N 3 247

Page 10: Learning by Selection in the Trion Model of Cortical Organizationweb.stanford.edu/~shenoy/GroupPublicationsPreStanford/... · 2006. 4. 9. · trion (Fig. 3) represents an idealized

Eckhorn R, Bauer R, Jordan W, Brosch M, Kruse W, MunkM, Reitboeck HJ (1988) Coherent oscillations: a mech-anism of feature linking in the visual cortex: multipleelectrode and correlation analysis in the cat. Biol Cybern60:121-130.

EdelmanGM (1978) Group selection and phasic reentrantsignaling: a theory of higher brain function. In: Themindful brain (Edelman GM, Mountcastle VB, eds), pp51-100. Cambridge, MA: MIT Press.

EdelmanGM (1987) Neural Darwinism: the theory of neu-ronal group selection. New York: Basic

Fisher ME, Selke W (1980) Infinitely many commensuratephases in a simple Ising model. Phys Rev Lett 44:1502-1505.

Goldman-Rakic PS (1984) Introduction: the frontal lobes-uncharted provinces of the brain. Trends Neurosci 7-425-429

Gray CM, Singer W (1989) Stimulus-specific neuronal os-cillations in orientation columns of cat visual cortex. ProcNatl Acad Scl USA 86:1698-1702.

Gray CM, Konig P, Engel AK, Singer W (1989) Oscillatoryresponses in cat visual cortex exhibit intercolumnar syn-chronization which reflects global stimulus properties.Nature 338:334-337.

Hebb DO (1949) Organization of behavior. New York.Wiley.

Hubel DH, Wiesel TN (1977) Functional architecture ofmacaque monkey visual cortex. Proc R Soc Lond [Biol]198:1-59.

KatzB (1969) The release of neural transmitter substances.Springfield: Thomas.

Krone G.MallotH, Palm G.Schuz A (1986) Spatiotemporalreceptive fields: a dynamical model derived from corticalarchitectonics. Proc R Soc Lond [Biol] 226:421^44.

Leng X (1990) Investigation of higher brain functions inmusic composition using models of the cortex based onphysical system analogies. PhD thesis, University of Cal-ifornia, Irvine.

Leng X, Shaw GL (1991) Toward a neural theory of higherbrain function using music as a window. Concepts Neu-rosci 2:229-258.

LengX, McGrannJV, ShawGL (1992) Reversal of epilepticstate by patterned electrical stimulation suggested by trionmodel calculations. Neurol Res 14:57-61.

Leng X, McGrann JV, Quillfeldt JA, Shaw GL, Shenoy KV(1993) Learning and memory processes and the mod-ularity of the brain. In. Neural bases of learning andmemory (Delacour J, ed), in press. Singapore. WorldScientific

Little WA (1974) Existence of persistent states in the brain.Math Bioscl 19101-120.

Little WA, Shaw GL (1978) Analytic study of the storagecapacity of a neural network. Math Biosci 39-281-290

McGrann JV (1992) Further theoretical investigations ofthe trion model of cortical organization. PhD thesis, Uni-versity of California, Irvine.

McGrannJV, Shaw GL, Shenoy KV (1989) Recognition ofrotated objects in the trion model of cortical organiza-tion. Abstract presented at the annual meeting of theSociety for Mathematical Psychology.

McGrann JV, Shaw GL, Silverman DJ, Pearson JC (1991)Higher temperature phases of a structured neuronal mod-el of cortex. Phys Rev A43:5678-5682.

MichodR (1990) Neural Edelmanism Trends Neurosci 13-12-13.

Mountcastle VB (1978) An organizing principle for cere-bral function: the unit module and the distributed sys-tem. In: The mindful brain (Edelman GM, MountcastleVB, eds), pp 1-50. Cambridge, MA: MIT Press.

ReekeGNJr (1990) Neural Edelmanism Trends Neurosci13:11-12.

Richmond BJ, Optican LM, Spitzer H (1990) Temporalencoding of two-dimensional patterns by single units inprimate primary visual cortex. I. Stimulus-response relations. J Neurophysiol 64-351-369.

Roney KJ, Shaw GL (1980) Analytic study of assemblies ofneurons in memory storage. Math Biosci 51:25-41.

Rumelhart DE, McClelland, and PDP Research Group, eds(1986) Parallel distributed processing. Cambridge, MA-MIT Press.

SeelenWv (1970) Zur informationsverarbeitung im visuel-len system der wirbetiere. Kybernetik 7:89-109.

Shaw GL, Vasudevan R (1974) Persistent states of neuralnetworks and the random nature of synaptic transmission.Math Biosci 21:207-218.

Shaw GL, Harth E, Scheibel AB (1982) Cooperativity inbrain function: assemblies of approximately 30 neurons.Exp Neurol 77:324-358.

Shaw GL, Silverman DJ, Pearson JC (1985) Model of cor-tical organization embodying a basis for a theory of in-formation processing and memory recall. Proc Natl AcadSci USA 82:2364-2368.

ShawGL, Silverman DJ, Pearson JC (1988) Trion model ofcortical organization and the search for the code of short-term memory and information processing. In: Systemswith learning and memory abilities (LevyJ, Delacour JCS,eds), pp 411-435. New York. Elsevier.

Shaw GL, Kruger J, Silverman DJ, Aertsen AMHJ, Aiple F,Liu H-C (1993) Rhythmic and patterned neuronal firingin visual cortex. Neurol Res 15-46-50

Silverman DJ, Shaw GL, Pearson JC (1986) Associative re-call properties of the trion model of cortical organization.Biol Cybern 53:259-271.

Singer W (1990) Search for coherence: a basic principleof cortical self-organization. Concepts Neurosci 1:1-26.

SwindaleNV (1990) Is the cerebral cortex modular? TrendsNeurosci 13:487-492.

von Neumann J (1956) Probabilistic logics and the syn-thesis of reliable organisms form unreliable components.In: Autonama studies (Shannon CE, McCarthy J, eds), pp43-98. Princeton: Princeton University.

Weinberger NM, Diamond DM, McKennaTM (1984) Initialevents in conditioning: plasticity in the pupillomotor andauditory systems. In: Neurobiology of learning and mem-ory (Lynch G, McGaugh JL, Weinberger NM, eds), pp197-227. New York: Guilford.

146 Learning by Selection • Shenoy et al.