Click here to load reader
Upload
lamthien
View
212
Download
0
Embed Size (px)
Citation preview
Support Vector Machine Regression for Volatile StockMarket Prediction
HaiqinYang,LaiwanChan,andIrwin King
Departmentof ComputerScienceandEngineeringTheChineseUniversityof HongKong
Shatin,N.T. HongKong{ hqyang,lwchan,king} @cse.cuhk.edu.hk
Abstract. Recently, SupportVector Regression(SVR) hasbeenintroducedtosolveregressionandpredictionproblems.In thispaper, weapplySVRtofinancialpredictiontasks.In particular, thefinancialdataareusuallynoisyandtheassociatedrisk is time-varying.Therefore,our SVR model is an extensionof the standardSVRwhichincorporatesmarginsadaptation.By varyingthemarginsof theSVR,wecouldreflectthechangein volatility of thefinancialdata.Furthermore,wehaveanalyzedtheeffectof asymmetricalmarginssoasto allow for thereductionof thedownsiderisk. Our experimentalresultsshow that theuseof standarddeviationto calculatea variablemargin gives a goodpredictive resultin thepredictionofHangSengIndex.
1 Introduction
SupportVectorMachine(SVM), basedon StatisticalLearningTheory, wasfirst devel-opedby Vapnik[4,6]. It hasbecomea hot topic of intensive studydueto its successfulapplicationin classificationtasks[7,8] andregressiontasks[5,3],speciallyontimeseriesprediction[1] andfinancialrelatedapplications[2].
WhenusingSVM in regressiontasks,theSupportVectorRegressormustuseacostfunctiontomeasuretheempiricalrisk in ordertominimizetheregressionerror.Althoughtherearemany choicesof thelossfunctionsto calculatethecost,e.g.,leastmoduluslossfunction,quadraticlossfunction,etc.,the -insensitive lossfunctionis sucha functionthat exhibits the sparsityof the solution[4]. Typically, this -insensitive lossfunctioncontainsafixed andsymmetricalmargin(FASM)term.Whenthemargin is zeroor verysmall,onerunsinto therisk of overfitting thedatawith poorgeneralizationwhile whenthemargin is large,oneobtainsabettergeneralizationattheriskof havinghighertestingerror. For financialdata,dueto theembeddednoise,onemustseta suitablemargin inorderto obtainagoodprediction.Thispaperfocuseson two waysto setthemarginsinSVR.
When applyingSVR to time seriesprediction,the practitionersusually overlookthechoicesof themargin setting.For example,in [2], they simply setthemargin to 0.This amountsto the leastmoduluslossfunction.Othershave just setthe margin to avery small value[5,9,10]. In [1], they appliedadditionalcalculations,e.g.,validationtechniques,to determineasuitablemargin empirically.
H.Yin et al. (Eds.):IDEAL 2002,LNCS2412,pp.391–396,2002.c Springer-VerlagBerlin Heidelberg 2002
392 H.Yang,L. Chan,andI. King
Oneof theshortcomingsof theabovemethodsis thatthemargin is symmetricalandfixed.Consequently, thistechniqueis insensitiveandnon-adaptiveto theinputdata.Thismayresultin less-than-optimalperformancein thetestingdatawhile it obtainsa goodresulton thetrainingdata.
In this paper, we proposeto useanadaptive margin in SVR for financialpredictionto minimize the downsiderisk, which is an essentialpart in financialpredictionwithvolatilefinancialdata.Morespecifically, wepresenttwo approaches:oneusesthefixedandasymmetricalmargins(FAAM), whereastheotherusesnon-fixedandsymmetricalmargins(NASM).
A key differencebetweenFAAM andFASM is that thereexist an up anda downmargin thatareasymmetrical.In thecaseof FAAM whentheupmargin is greaterthanthe down margin, the predictive resultstend to be smallerthan the predictive resultswhichareproducedby usingFASM.
In NASM, themargin isadaptivetotheinputdata.Therearemany possiblechoicestosetthemargin.For example,onemayusethen-thorderstatisticstocalculatethemargin.More specifically, we choosethesecondorderstatistics,thestandarddeviation,asourmethodto calculatetheadaptive margin. This is becausethat thestandarddeviation isfrequentlyusedasameasureof thevolatility of stockpricesin financialdata.Whenthestockpriceis highly volatile,it hasahighstandarddeviation.In financialtimeseriesthenoiseis oftenvery large,andwetry to tolerateourpredictionby having a largermarginwhenthestockpriceis highly volatile.Ontheotherhand,asmallermargin maybemoresuitablefor lessvolatilestockactivities.Hence,ourapproachavoidsthefixedmargin inorderto obtainabetterpredictionresult.
Thepaperis organizedasfollows.We introduceageneraltypeof -insensitive lossfunctionandgive theinferentialresultin Section2. We reportexperimentsandresultsin Section3. Lastly, we concludethepaperwith a brief discussionandfinal remarksinSection4.
2 Support Vector Regression
Given a trainingdataset,(x1, y1), . . . , (xN , yN ), wherex i X, yi R, N is thesizeof trainingdata,andX denotesthespaceof theinputsamples–forinstance,Rn . Theaimis to find a functionwhich canestimateall thesedatawell. SVR is oneof themethodsto performtheabove regressiontask[4,3].
In general,theestimationfunctionin SVRtakesthefollowing form,
f (x) = (w · (x)) + b, (1)
where(·) denotesthe innerproductin , a featurespaceof possiblydifferentdimen-sionalitysuchthat : X andb R.
Now thequestionis to determinew andb from thetrainingdataby minimizing theregressionrisk, Rreg(f ), basedon theempiricalrisk,
Rreg(f ) = CN
i = 1
�(f (x i ) − yi ) +
12
(w · w), (2)
SupportVectorMachineRegressionfor VolatileStockMarketPrediction 393
whereC is a pre-specifiedvalue,�
(·) is a cost function that measuresthe empiricalrisk. In general,the -insensitive lossfunctionis usedasthecostfunction[4]. For thisfunction,whenthedatapointsarein therangeof ± , they donotcontributeto theoutputerror. Thefunctionis definedas,
�(f (x) − y) =
0, if |y − f (x)| �|y − f (x)| − � otherwise
. (3)
In this paper, we introducea generaltype of -insensitive lossfunction,which isgiven as,
�(f (x i ) − yi ) =
0, if − downi < y i − f (x i ) � up
iyi − f (x i ) − up
i , if yi − f (x i ) � upi
f (x i ) − yi − downi , if f (x i ) − yi � down
i
, (4)
where upi and down
i correspondto the i -th up margin anddown margin respectively.When up
i and downi are both equal to a constant,for all i, i = 1, . . . , N , Eq. (4)
amountsto the -insensitive lossfunction in Eq. (3) andit is labeledasFASM (FixedandSymmetricalMargin). When up
i = up , for all i = 1, . . . , N and downj = down ,
for all j = 1, . . . , N with up = down , this caseis labeledas FAAM (Fixed andAsymmetricalMargin). In thecaseof NASM (Non-fixedandSymmetricalMargin), weusean adaptive margin for which the up margin equalsto the down margin. The lastcaseis with an adaptive andasymmetricalmargin. In this paper, we just considerthefirst threecases,i.e.,FASM, FAAM, andNASM.
UsingtheLagrangefunctionmethodtofindthesolutionwhichminimizestheregres-sionrisk of Eq.(2) with thecostfunctionin Eq.(4), weobtainthefollowing QuadraticProgramming(QP)problem:
arg min� 12
N
i = 1
N
j = 1
( i − i )( j − j )( (x i ) · (x j )) +N
i = 1
( upi − yi ) i
+N
i = 1
( downi + yi ) i (5)
subjecttoN
i = 1
( i − i ) = 0 � i � i [0, C], (6)
where and arecorrespondingLagrangemultipliers usedto pushandpull f (x i )towardstheoutcomeof yi respectively.
Solvingtheabove QPproblemof Eq. (5) with constraintsof Eq. (6), we determine
theLagrangemultipliers and andobtainw =N
i = 1( i − i ) (x i ). Thereforethe
estimationfunctionin Eq.(1) becomes
f (x) =N
i = 1
( i − i )( (x) · (x i )) + b. (7)
394 H.Yang,L. Chan,andI. King
Sofar, we have not consideredthecomputationof b. In fact,this canbesolvedbyexploiting theKarush-Kuhn-Tucker(KKT) conditions.Theseconditionsstatethatat theoptimalsolution,theproductbetweentheLagrangemultipliersandtheconstraintshasto equalto zero.In this case,it meansthat
i (upi + i − yi + (w · (x i )) + b) = 0 (8)
i ( downi + i + yi − (w · (x i )) − b) = 0
and
(C − i ) i = 0
(C − i ) i = 0.
where i and i areslackvariablesusedto measuretheerrorof upsideanddown side.
Since i · i = 0 and ( )i = 0 for ( )
i (0, C), bcanbecomputedasfollows:
b =yi − (w · (x i )) − up
i , for i (0, C)yi − (w · (x i )) + down
i , for i (0, C). (9)
Using the trick of kernel function, Eq. (7) can be written as, f (x) =N
i = 1( i −
i )K (x, x i ) + b, wherethe kernel function, K (x, x i ) = ( (x) · (x i )), which is asymmetricfunctionandsatisfiestheMercer’scondition.In thispaper, weselectacom-monkernelfunction,e.g.,RBF function,K (x, x i ) = exp(− |x − x i |2), as thekernelfunction.
In thenext section,weapplyour inferentialresultof SVRbasedonthegeneraltypeof -insensitive lossfunction to the regressionof financialdata,for example,indicesandstockprices.By applyingregressionto thedata,we canbuild a dynamicsystemtomodelthedataandhenceusethesystemfor predictingfutureprices.
3 Experiments
In thissection,weconducttwoexperimentsto illustratetheeffectof FASM, FAAM, andNASM. Thefirst experimentillustratestheSVM financialpredictionwith fixedmargin,includingFASM andFAAM. ThesecondexperimentteststheSVM financialpredictionwith NASM undershift windows.
In ourexperiment,weusethedaily closingpriceof HongKong’sHangSengIndex(HSI) from January15, 2001to June19, 2001,a total of 104days’of datapoints,outof which 100datapointsfor trainingandtesting.We setthelengthof theshift windowto 80. The dynamicsystemis modeledas I t = f (I t − 4, I t − 3, I t − 2, I t − 1), whereI t istherealstockpriceat time t, andI t is thepredictivevalueat time t. Therefore,thefirsttrainingdatasetis from January15,2001to May 22,2001,a total of 84 days’of HSI.We usethemto predictthenext day’s HSI. This window is thenshiftedandanentiretrainingis performedagainto predictthefollowing day’sHSI for theremainingtestingdata.
SupportVectorMachineRegressionfor VolatileStockMarketPrediction 395
TheSVR algorithmusedin our experimentis modifiedfrom LibSVM [10]. Beforerunning the algorithm,we needto determinesomeparameters.They areC, the costof error; , parameterof kernel function, andthe margins.After performinga cross-validationin thefirst trainingdata,wesetC = 6000, = 2− 24. Sincedifferentmarginswill affect the resultsof prediction,we usedifferentvaluesin our tests.Furthermore,we usethe following threeerror definitionsto measurethe testingerrors,error1
MMt = 1 |I t − I t |, error pos
1M
Mt = 1 , I t � I t
(I t − I t ), error neg1
MMt = 1 , I t < I t
(I t −I t ), whereM is thesizeof the testingdataanderror reflectsthe total risk, error pos
reflectstheupsiderisk anderror neg reflectsthedownsiderisk respectively.The experimentsareconductedon a Pentium4, with 1.4 GHZ, 512M RAM and
Windows2000.With theseconfigurations,thepredictiveresultsareobtainedwithin sec-onds.
In thefirst experimentweusedifferentvaluesfor upmargin anddown margin to testtheeffect of FASM andFAAM. We show thesettingof themargin in thesecondandthird columnsof Table1, andreportthecorrespondingerrorsin thelastthreecolumns.In all but thefirst andthelastmargin setting,their overall margin widthsarethesame,i.e., up + down = 150. This allows us to have a fair comparisonof the four cases.From theTable1, we canseethat the error pos graduallyincreaseswith the increaseof up . At thesametime,with the increaseof up , we allow for moreerrorsabove thepredictivevalues.Thustheerror neg decreases.In termsof theoverallerror, it increasesandthendecreasesagain.This indicatesthatneitheranarrow margin for theupsidenorthedownsidewouldbedesirablein termsof theoverall error.
In thesecondexperiment,afterconsideringthevolatility of thefinancialdata,wesettheupmargin anddown margin bothequalto thestandarddeviationof theinputvectorx to performtheprediction.Thepredictive errorof theexperimentwith theNASM isreportedin thelastrow of Table1 andtheresultshowsthatthetotalerroris significantlydecreasedcomparingwith thefixedones.
Table 1. ExperimentResults
Case up down error error pos error neg
1 0 0 134.59 56.46 78.132 50 100 131.96 49.44 82.523 75 75 129.03 60.47 68.564 100 50 129.96 73.44 56.525 150 0 135.64 101.28 34.366 116.19 53.29 62.90
4 Discussion and Conclusion
In thispaper, wepresentageneraltypeof -insensitivelossfunctionin SVRandoutlinethevariousmarginsused,i.e., FASM, FAAM andNASM. UsingHongKong’s HSI asthedatasetfor SVRwith differenttypesof margins,wehavethefollowing conclusions:
396 H.Yang,L. Chan,andI. King
1. Oneinterestingobservationis thatneithertheupmarginnorthedownmarginwouldaffect theerror unilaterally. This canbeseenfrom theresultsof Case2 to Case5in Table1.
2. Anotherinterestingobservationis thatfrom thepointof view of thedownsiderisk,Case5 inTable1 isagoodresultsinceitserror neg , whichis relatedto thedownsiderisk, is minimum.In practice,wecanreducethedownsiderisk by increasingtheupmargin while decreasingthedown margin.
3. In the NASM case,we find that usingstandarddeviation to calculatethe margin,which canreflect the changein volatility of the financialdata,resultsin the bestpredictionin ourexperimentsincethis resulthasaminimal error .
Acknowledgement. Theworkdescribedin thispaperwaspartiallysupportedbyagrantfrom the ResearchGrantsCouncil of the Hong Kong SpecialAdministrationRegion,China.TheauthorsthankLin, Chih-Jenfor helpful suggestionsonusingtheLibSVM.
References
1. K. R.Muller,A. Smola,G.Ratsch,B. Scholkopf,J.KohlmorgenandV. N.Vapnik.Predictingtimeserieswith supportvectormachines.ICANN, 999-1004,1997.
2. T. B. TrafalisandH. Ince.Supportvectormachinefor regressionandapplicationsto financialforecasting.IJCNN2000,348-353.
3. A. SmolaandB. Scholkopf.A TutorialonSupportVectorRegression.1998,TechnicalReportNeuroCOLT NC-TR-98-030.
4. V. N. Vapnik.TheNatureof StatisticalLearningTheory. Springer, New York, 1995.5. V. N. Vapnik,S.Golowich andA. Smola.Supportvectormethodfor functionapproximation,
regressionestimationandsignalprocessing.6. V. N. Vapnik.StatisticalLearningTheory. Wiley, New York, 1998.7. EdgarOsunaandRobertFreundandFedericoGirosi.SupportVectorMachines:Trainingand
Applications.AIM-1602, MIT, 38,1997.8. ChristopherJ. C. Burges.A Tutorial on SupportVectorMachinesfor PatternRecognition.
DataMining andKnowledgeDiscovery, 2(2):121-167,1998.9. S. Mukherjee,E. Osunaand F. Girosi. Nonlinearpredictionof chaotic time seriesusing
supportvectormachines.IEEE Workshopon NeuralNetworks for SignalProcessingVII,IEEEPress,J.PrincipeandL. GilesandN. MorganandE.Wilson,511,1997.
10. Chih-Chung,ChangandChih-Jen,Lin. LIBSVM: a Library for SupportVectorMachines(Version2.31),2001.
11. FrankA. SortinoandStephenE. Satchell.Managingdownsiderisk in financialmarkets :theory, practice,andimplementation.Oxford,Boston:Butterworth-Heinemann,2001.