ALGORITHMIC TRADING: Hidden Markov Models

http://en.wikipedia.org/wiki/File:HiddenMarkovModel.png

Overview

a) Introduction

b) Methodology

c) Programming codes

d) Conclusion

a) Introduction

Markov Chain

• Named after Andrey Markov, a Russian mathematician.

• A mathematical system that undergoes transitions from one state to another.

• The states come from a finite or countable set, and the system moves between them in a chain-like manner.

• The next state depends ONLY on the current state, not on the past.


Simple Two-State Markov Chain Model

http://en.wikipedia.org/wiki/File:Markovkate_01.svg


• The system is in a certain state at each step.

• The state changes randomly between steps.

• Steps can be time or some other measurement.

• The random process is simply a mapping from steps to states.

http://en.wikipedia.org/wiki/File:MarkovChain1.png


1. Markov Property: the distribution of the state at the next step (and at all future steps), given the current state, depends only on the current state and not on earlier states.

2. Transition Probabilities: the probabilities of moving from the current state to each of the possible states.


The Drunkard's Walk

• A random walk on the number line.

• At each step, the position changes by +1, -1, or 0.

• The change depends only on the current position.
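A minimal sketch of the walk described above. The three moves are assumed to be equally likely (the slide does not give their probabilities), and the function name `drunkards_walk` is made up for this illustration.

```python
import random

def drunkards_walk(n_steps, start=0, seed=None):
    """Simulate a random walk on the integers with moves of -1, 0, or +1.

    Assumption: the three moves are equally likely.
    """
    rng = random.Random(seed)
    position = start
    path = [position]
    for _ in range(n_steps):
        # The move is drawn without looking at any earlier position,
        # so the next position depends only on the current one.
        position += rng.choice([-1, 0, 1])
        path.append(position)
    return path

path = drunkards_walk(10, seed=42)
print(path)
```

The returned list holds the starting position followed by one position per step, so consecutive entries never differ by more than 1.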



Transition Probability Matrix

• A simple matrix holding all the state-transition probabilities.

• Example: imagine that the sequence of rainy and sunny days is such that each day's weather depends only on the previous day's, and the transition probabilities are given by the following table.


              Day t+1
Day t      Rainy   Sunny
Rainy      0.9     0.1
Sunny      0.6     0.4

If today is rainy, the probability that tomorrow will be rainy is 0.9; if today is sunny, that probability is 0.6. The weather is then a two-state Markov chain, with t.p.m. Γ given by:

$$\Gamma = \begin{pmatrix} 0.9 & 0.1 \\ 0.6 & 0.4 \end{pmatrix}$$
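To see how the t.p.m. is used, here is a small sketch that pushes a probability vector through the matrix one day at a time. The numbers come from the table above; the helper names `step` and `forecast` are assumptions made for this example.

```python
STATES = ["Rainy", "Sunny"]
GAMMA = [[0.9, 0.1],   # row: today Rainy -> tomorrow (Rainy, Sunny)
         [0.6, 0.4]]   # row: today Sunny -> tomorrow (Rainy, Sunny)

def step(dist, tpm):
    """Advance a row vector of state probabilities by one day (dist times tpm)."""
    n = len(tpm)
    return [sum(dist[i] * tpm[i][j] for i in range(n)) for j in range(n)]

def forecast(today, days, tpm=GAMMA):
    """Distribution of the weather `days` days ahead, given today's weather."""
    dist = [1.0 if s == today else 0.0 for s in STATES]
    for _ in range(days):
        dist = step(dist, tpm)
    return dict(zip(STATES, dist))

two_days = forecast("Rainy", 2)    # Rainy: 0.9*0.9 + 0.1*0.6 = 0.87
long_run = forecast("Sunny", 200)  # approaches the stationary distribution (6/7, 1/7)
print(two_days, long_run)
```

After many days the starting weather no longer matters: the chain settles at roughly 6/7 rainy and 1/7 sunny, which is the solution of π = πΓ.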


a) Introduction to HMM

Hidden Markov Model

• It is a Markov chain whose states are hidden, that is, not directly observable.

• In an ordinary Markov chain, the transition probabilities (t.p.m.) are the only parameters. In an HMM, the state is not directly visible, but the output, which depends on the state, is visible.

• Thus the sequence of outputs reveals something about the sequence of states.

Therefore, what is available to the observer is another stochastic process linked to the Markov chain.

The underlying Markov chain, a.k.a. the regime or state, directly affects the distribution of the observed process.

The Urn Problem

• A genie is in a room that is not visible to you. It draws balls labeled y1, y2, y3, ... from the urns X1, X2, X3, ... in that room and puts them on a conveyor belt, where you can observe the sequence of balls but not the sequence of urns from which they were drawn.


• The genie has some procedure for choosing urns; the choice of urn for the n-th ball depends only on a random number and on the urn chosen for the (n − 1)-th ball.

• Because the choice of urn does not depend directly on the urns chosen before the previous one, this is called a Markov process with hidden states.


Probabilistic parameters of a hidden Markov model:

x: states
y: possible observations
a: state transition probabilities
b: output probabilities

http://en.wikipedia.org/wiki/File:HiddenMarkovModel.png
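The urn story can be sketched as a sampler: the genie moves between urns according to a, and each urn emits a ball according to b. The two-urn, three-ball numbers and the name `sample_hmm` are illustrative assumptions, not values from the slides.

```python
import random

def sample_hmm(a, b, pi, length, seed=None):
    """Draw (states, observations) from a discrete HMM.

    a[i][j]: probability of moving from urn i to urn j
    b[i][k]: probability that urn i emits ball label k
    pi[i]:   probability of starting with urn i
    """
    rng = random.Random(seed)
    urns = list(range(len(a)))
    balls = list(range(len(b[0])))
    states, observations = [], []
    state = rng.choices(urns, weights=pi)[0]
    for _ in range(length):
        states.append(state)
        # The ball depends only on the current urn.
        observations.append(rng.choices(balls, weights=b[state])[0])
        # The next urn depends only on the current urn.
        state = rng.choices(urns, weights=a[state])[0]
    return states, observations

# Hypothetical parameters: 2 urns (hidden), 3 ball labels (visible).
A = [[0.7, 0.3], [0.4, 0.6]]
B = [[0.5, 0.4, 0.1], [0.1, 0.3, 0.6]]
PI = [0.6, 0.4]

hidden, visible = sample_hmm(A, B, PI, length=20, seed=1)
print("observer sees:", visible)
```

Only `visible` reaches the observer; `hidden` is what the learning and decoding steps later try to recover.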

END OF INTRODUCTION!

METHODOLOGY

Methodology Main Idea

Total number of data points: 1400 = 1000 + 400

Markov Chain

States: s1: Sunny, s2: Cloudy, s3: Rainy

[Figure: three-state transition diagram. All nine transition probabilities a11, a12, a13, a21, a22, a23, a31, a32, a33 are marked as unknown (= ?).]

Model: $\lambda = \{A, B, \Pi\}$

$A = \{a_{11}, a_{12}, \ldots, a_{NN}\}$: a transition probability matrix, where each $a_{ij}$ is the probability of moving from state $s_i$ to state $s_j$.

$B = b_i(o_t)$: a sequence of observation likelihoods, also called emission probabilities, each giving the probability of observation $o_t$ being generated from state $s_i$ at time $t$.

$\Pi = \{\pi_1, \pi_2, \ldots, \pi_N\}$: an initial probability distribution, where $\pi_i$ is the probability of starting in state $s_i$.

Model: Other Parameters

$S = \{s_1, s_2, \ldots, s_N\}$: a set of $N$ hidden states.

$Q = \{q_1, q_2, \ldots, q_T\}$: a state sequence of length $T$ taking values from $S$.

$O = \{o_1, o_2, \ldots, o_T\}$: an observation sequence of $T$ observations, taking values from the discrete alphabet $V = \{v_1, v_2, \ldots, v_M\}$.

Model: Likelihood $P(O \mid \lambda)$

Forward: $\alpha_t(i) = P(o_1, o_2, \ldots, o_t, q_t = s_i \mid \lambda)$

Backward: $\beta_t(i) = P(o_{t+1}, o_{t+2}, \ldots, o_T \mid q_t = s_i, \lambda)$

Forward—L2

Backward—L2
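The forward and backward quantities can be computed with the standard textbook recursions, sketched below. This is not the presenters' code: the parameter values are hypothetical, observations are coded as integers 0..M-1, and no scaling is applied, so it is only suitable for short sequences.

```python
def forward(obs, a, b, pi):
    """alpha[t][i] = P(o_1..o_t, q_t = s_i | lambda)."""
    n = len(a)
    alpha = [[pi[i] * b[i][obs[0]] for i in range(n)]]
    for o in obs[1:]:
        prev = alpha[-1]
        alpha.append([sum(prev[i] * a[i][j] for i in range(n)) * b[j][o]
                      for j in range(n)])
    return alpha

def backward(obs, a, b):
    """beta[t][i] = P(o_{t+1}..o_T | q_t = s_i, lambda), with beta at time T set to 1."""
    n = len(a)
    beta = [[1.0] * n]
    for o in reversed(obs[1:]):
        nxt = beta[0]
        beta.insert(0, [sum(a[i][j] * b[j][o] * nxt[j] for j in range(n))
                        for i in range(n)])
    return beta

# Hypothetical model: 2 hidden states, 3 observation symbols.
A = [[0.7, 0.3], [0.4, 0.6]]
B = [[0.5, 0.4, 0.1], [0.1, 0.3, 0.6]]
PI = [0.6, 0.4]
OBS = [0, 1, 2]

alpha = forward(OBS, A, B, PI)
beta = backward(OBS, A, B)

# P(O | lambda) can be read off either table.
likelihood_forward = sum(alpha[-1])
likelihood_backward = sum(PI[i] * B[i][OBS[0]] * beta[0][i] for i in range(2))
print(likelihood_forward, likelihood_backward)
```

Both expressions give the same likelihood, which is a handy consistency check when implementing the two passes.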

Model: Decoding (Viterbi algorithm)
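A compact sketch of Viterbi decoding for a discrete HMM, using the textbook recursion. The parameters are the same hypothetical two-state example used for illustration elsewhere, and the code works with raw probabilities (no logarithms), so it assumes a short observation sequence.

```python
def viterbi(obs, a, b, pi):
    """Return the most likely state sequence and its joint probability with obs."""
    n = len(a)
    # delta[i]: probability of the best path that ends in state i at the current time
    delta = [pi[i] * b[i][obs[0]] for i in range(n)]
    backpointers = []
    for o in obs[1:]:
        best_prev = [max(range(n), key=lambda i: delta[i] * a[i][j])
                     for j in range(n)]
        delta = [delta[best_prev[j]] * a[best_prev[j]][j] * b[j][o]
                 for j in range(n)]
        backpointers.append(best_prev)
    # Trace the best path backwards from the best final state.
    state = max(range(n), key=lambda i: delta[i])
    path = [state]
    for best_prev in reversed(backpointers):
        state = best_prev[state]
        path.append(state)
    path.reverse()
    return path, max(delta)

# Hypothetical model: 2 hidden states, 3 observation symbols.
A = [[0.7, 0.3], [0.4, 0.6]]
B = [[0.5, 0.4, 0.1], [0.1, 0.3, 0.6]]
PI = [0.6, 0.4]

best_path, best_prob = viterbi([0, 1, 2], A, B, PI)
print(best_path, best_prob)  # path [0, 0, 1], probability 0.3 * 0.28 * 0.18 = 0.01512
```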

Model: Learning (re-estimation)

$\xi_t(i, j) = P(q_t = s_i, q_{t+1} = s_j \mid O, \lambda)$

$\gamma_t(i) = P(q_t = s_i \mid O, \lambda)$
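The re-estimation step can be sketched with the standard Baum-Welch updates built from ξ and γ. This is a textbook version under stated assumptions, not the presenters' implementation: a single short observation sequence, no scaling, strictly positive starting parameters, and a stopping rule on the log-likelihood with a hypothetical tolerance `eps`.

```python
import math

def forward(obs, a, b, pi):
    n = len(a)
    alpha = [[pi[i] * b[i][obs[0]] for i in range(n)]]
    for o in obs[1:]:
        prev = alpha[-1]
        alpha.append([sum(prev[i] * a[i][j] for i in range(n)) * b[j][o]
                      for j in range(n)])
    return alpha

def backward(obs, a, b):
    n = len(a)
    beta = [[1.0] * n]
    for o in reversed(obs[1:]):
        nxt = beta[0]
        beta.insert(0, [sum(a[i][j] * b[j][o] * nxt[j] for j in range(n))
                        for i in range(n)])
    return beta

def reestimate(obs, a, b, pi):
    """One Baum-Welch step: updated (a, b, pi) plus P(O | lambda) of the INPUT model."""
    n, m, t_len = len(a), len(b[0]), len(obs)
    alpha, beta = forward(obs, a, b, pi), backward(obs, a, b)
    likelihood = sum(alpha[-1])
    # gamma[t][i] = P(q_t = s_i | O, lambda)
    gamma = [[alpha[t][i] * beta[t][i] / likelihood for i in range(n)]
             for t in range(t_len)]
    # xi[t][i][j] = P(q_t = s_i, q_{t+1} = s_j | O, lambda)
    xi = [[[alpha[t][i] * a[i][j] * b[j][obs[t + 1]] * beta[t + 1][j] / likelihood
            for j in range(n)] for i in range(n)] for t in range(t_len - 1)]
    new_pi = gamma[0][:]
    new_a = [[sum(xi[t][i][j] for t in range(t_len - 1)) /
              sum(gamma[t][i] for t in range(t_len - 1))
              for j in range(n)] for i in range(n)]
    new_b = [[sum(gamma[t][i] for t in range(t_len) if obs[t] == k) /
              sum(gamma[t][i] for t in range(t_len))
              for k in range(m)] for i in range(n)]
    return new_a, new_b, new_pi, likelihood

def fit(obs, a, b, pi, eps=1e-8, max_iter=50):
    """Repeat re-estimation until the log-likelihood changes by less than eps."""
    previous = -math.inf
    for _ in range(max_iter):
        a, b, pi, likelihood = reestimate(obs, a, b, pi)
        if abs(math.log(likelihood) - previous) < eps:
            break
        previous = math.log(likelihood)
    return a, b, pi

# Hypothetical starting model and data.
A = [[0.7, 0.3], [0.4, 0.6]]
B = [[0.5, 0.4, 0.1], [0.1, 0.3, 0.6]]
PI = [0.6, 0.4]
OBS = [0, 1, 2, 2, 1, 0, 0, 2, 1, 1]

a_hat, b_hat, pi_hat = fit(OBS, A, B, PI)
print(a_hat)
```

Each step can only keep or raise P(O | λ), so the loop climbs to a local maximum that depends on the starting parameters.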

Applying HMM (Discrete)

Model: window length = 100; N = 3; M = ;

Initial parameter estimation: random λ = {A, B, Π}

Final model

Test: out of sample

Generating trading signals

Trading strategy: based on O(t+1) and O(t)

Content

• A Ideas

• B Code Structure

• C Results

• D Limitations and TODOs

Section A - Ideas

• Parameter Estimation

• Prediction

• Trading Strategy

Parameter Estimation: HMM

• A, B, Π (pi)

• 3 steps: likelihood, decoding, learning

• Greeks

• Updating, Iteration

The Choice of B

• Discrete: not smooth

• Time-homogeneous. Why?

• a. Memory cost is N * M * T (3 * 1000 * 100) in one iteration step

• b. Difficult to update

• Why can a continuous distribution handle this?

• a. Fewer parameters, lower cost

• b. The time-varying B can be updated directly

Parameter Updating

• How do we compute the updated B?

• Function get_index and its inverse function

• How do we choose the size of B? Extend the min and max to prepare for extreme values.

• Set a max and a min, then an interval length, and define a function that maps each value to its interval
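One way the discretization above might look. `get_index` is the name used on the slide; `make_grid`, `get_value`, the 10% margin, and the bin-midpoint inverse are assumptions made for this sketch.

```python
def make_grid(lo, hi, n_bins, margin=0.1):
    """Extend [lo, hi] on both sides to leave room for extreme values,
    then split the extended range into n_bins equal intervals."""
    pad = margin * (hi - lo)
    lo, hi = lo - pad, hi + pad
    width = (hi - lo) / n_bins
    return lo, width, n_bins

def get_index(value, grid):
    """Map an observed value to the index of its interval (a column of B)."""
    lo, width, n_bins = grid
    index = int((value - lo) // width)
    # Values outside the extended range fall into the first or last interval.
    return min(max(index, 0), n_bins - 1)

def get_value(index, grid):
    """Inverse of get_index: map an interval index back to the interval midpoint."""
    lo, width, _ = grid
    return lo + (index + 0.5) * width

grid = make_grid(100.0, 110.0, n_bins=12)  # extended range 99..111, width 1
print(get_index(100.0, grid), get_value(1, grid))  # index 1, midpoint 100.5
```

The inverse is only approximate in one direction: every value in an interval maps back to the same midpoint, which is part of why the discrete B is "not smooth".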

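Convergence Condition

• Iterate until P converges: stop when the change from the previous value Po is below a tolerance eps

• The result is a local maximum, not necessarily the global one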

Prediction

• Take q from the last window -> the most likely state at t + 1

• Take B from the last window -> the most likely observation given that state

• Combine them to get the most likely o(t + 1)

• Drawback: a previous B is used to generate the new observation
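The three prediction steps can be sketched as two argmax lookups. The function name and the example parameters are hypothetical; `last_state` stands for the final entry of the decoded state sequence q from the last window.

```python
def predict_next_observation(last_state, a, b):
    """Most likely next state given the last decoded state, then the most
    likely observation symbol given that next state."""
    next_state = max(range(len(a)), key=lambda j: a[last_state][j])
    next_obs = max(range(len(b[next_state])), key=lambda k: b[next_state][k])
    return next_state, next_obs

# Hypothetical fitted parameters from the last window.
A = [[0.7, 0.3], [0.4, 0.6]]
B = [[0.5, 0.4, 0.1], [0.1, 0.3, 0.6]]

print(predict_next_observation(0, A, B))  # (0, 0)
print(predict_next_observation(1, A, B))  # (1, 2)
```

The returned observation is an interval index; mapping it back to a price requires the inverse of the discretization used to build B.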

Trading Strategy

• If o(t + 1) > o(t), buy; otherwise sell

• Define an indicator I, whose value is +1 for buy and -1 for sell

• Valuation: daily P&L = I * (true value(t + 1) - true value(t))

• Sum the daily P&L to get the accumulated P&L, then plot it
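A small sketch of the valuation rule above, assuming one unit is traded per signal. The price and prediction numbers are invented for illustration, and `backtest` is a hypothetical name.

```python
from itertools import accumulate

def backtest(prices, predictions):
    """prices[t] is the true value at time t; predictions[t] is the predicted
    value for time t + 1, made at time t. Returns (signals, daily, cumulative)."""
    signals = [1 if predictions[t] > prices[t] else -1
               for t in range(len(predictions))]
    daily = [signals[t] * (prices[t + 1] - prices[t])
             for t in range(len(predictions))]
    return signals, daily, list(accumulate(daily))

prices = [100, 102, 101, 103]   # true values o(0..3)
predictions = [101, 103, 100]   # predicted o(1), o(2), o(3)

signals, daily, cumulative = backtest(prices, predictions)
print(signals)     # [1, 1, -1]
print(daily)       # [2, -1, -2]
print(cumulative)  # [2, 1, -1]
```

The `cumulative` list is what would be plotted as the accumulated P&L curve.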

Section C - Results

• P&L, window length = 30, state number = 3

Section B - Code Structure

double update_data {
    input  = data of one window (window length)
    output = a, b, pi after one iteration step, using HMM
}

int main {
    generate a, b, pi (window length, number of states) from a uniform distribution
    update_data(1 ~ window length), compute p, q
    update until p converges
    get best a, b, pi @ time 1 ~ window length
    use a, b, pi @ time period 1 as initialization, run update_data iteration until p converges
    iterate until end of data

    prediction:
        q at last window
        b at last window
        to get most likely price @ time t + 1, say o(t + 1)
    our strategy
    backtesting and draw P&L (in Matlab for convenience)
}

Section C - Results

• P&L, window length = 50, state number = 3

• P&L, window length = 70, state number = 3

• P&L, window length = 100, state number = 3

Section D - Drawbacks and TODOs

• Drawbacks:

• a. Iteration: why can we use the parameters from the last window to initialize the parameters of this window?

• b. Prediction: why can we use the previous B to get the next observation?

• c. Matrix B: time-homogeneous, which does not make sense

• d. Discrete distribution: not smooth, not continuous...

• e. How many shares should we buy or sell?

TODOs

• a. Continuous distribution and GMM

• b. More trading strategies: compute the expected price rather than the most likely one

• because there are too many probabilities in a vector and their differences are slight -> more errors

• c. Transaction cost: when o(t + 1) - o(t) > (<) c, buy (sell)
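TODO items b and c could be prototyped along these lines. This is only a sketch: the parameters and bin values are hypothetical, and the symmetric threshold (sell only when the predicted change is below -c, otherwise stay flat) is one possible reading of the rule above.

```python
def expected_next_observation(last_state, a, b, values):
    """Expected o(t + 1) given the last decoded state:
    sum over next states j of a[last][j] * (mean of state j's emission distribution)."""
    return sum(a[last_state][j] * sum(p * v for p, v in zip(b[j], values))
               for j in range(len(a)))

def signal_with_cost(forecast, current, cost):
    """+1 (buy) if the predicted gain exceeds the cost, -1 (sell) if the
    predicted loss exceeds it, 0 (no trade) otherwise. Assumed interpretation."""
    diff = forecast - current
    if diff > cost:
        return 1
    if diff < -cost:
        return -1
    return 0

# Hypothetical fitted parameters and the price represented by each symbol.
A = [[0.7, 0.3], [0.4, 0.6]]
B = [[0.5, 0.4, 0.1], [0.1, 0.3, 0.6]]
VALUES = [99.0, 100.0, 101.0]

expected = expected_next_observation(0, A, B, VALUES)  # 0.7 * 99.6 + 0.3 * 100.5 = 99.87
print(expected, signal_with_cost(expected, 100.0, cost=0.1))
```

Averaging over the whole distribution uses every entry of B rather than only the largest one, which addresses the concern that many probabilities in a row differ only slightly.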
