Essays on Large Panel Data Models - uni-bonn.dehss.ulb.uni-bonn.de/2015/3976/3976.pdfEssays on Large...

Essays on Large Panel Data Models

Inaugural-Dissertation

zur Erlangung des Grades eines Doktors

der Wirtschafts- und Gesellschaftswissenschaften

durch die

Rechts- und Staatswissenschaftliche Fakultatder Rheinischen Friedrich-Wilhelms-Universitat Bonn

vorgelegt von

Oualid Badaaus Monastir (Tunesien)

Bonn 2015

Dekan: Prof. Dr. Rainer Huttemann

Erstreferent: Prof. Dr. Alois Kneip

Zweitreferent: Prof. Dr. Robin Sickles

Tag der mundlichen Prufung: 13.03.2015

UNIVERSITY OF BONN

Abstract

Department of Economics

Institute for Financial Economics and Statistics

Doctoral Thesis

Essays on Large Panel Data Models

by Oualid Bada

The standard panel data literature is moving from micro panels, where the cross-section

dimension is large and the intertemporal sample size is small, to large panels, where

both, the cross-section and the time dimension, are large. This thesis contributes to

this new and growing area of panel data treatments called “large panel data analysis”.

My dissertation consists of three essays: In the first essay, a large panel data model

with an omitted factor structure is considered. The role of the factors is to control for

the issue of the unobserved time-varying heterogeneity effects. A parameter cascading

strategy is proposed to enable efficient estimation of all model parameters when the

number of factors is unknown a priori. In the second essay, further models that combine

large panel data models with different versions of unobserved latent factors are discussed.

Computation-related issues are solved and new specification tests are introduced to check

whether or not these factors can be interpreted as classical additive fixed effects. In the

third essay, a novel method for estimating panel models with multiple structural changes

is proposed. The breaks are allowed to occur at unknown points in time and may affect

the multivariate slope parameters individually. Asymptotic results are derived, Monte

Carlo experiments are performed, and applications for highlighting these new methods

are discussed.

Acknowledgements

I would like to express my heartfelt gratitude to the many people who have supported me

in writing this thesis. First, I am deeply indebted and thankful to my supervisor Prof.

Dr. Alois Kneip for his continuous guidance and vital advice during the writing of this

thesis. His insightful comments helped me to enhance my understanding of advanced and

state-of-the-art statistical techniques. He has been an enormous source of inspiration

for my research and motivated me to keep improving my papers and the way they might

contribute to my research area.

I would like to sincerely thank my co-author Dr. Robin Sickles, Professor of Econometrics

at Rice University, for the extraordinary collaboration on the project that constitutes

the core of Chapter 3. I am grateful for his time, his enthusiasm, and his generosity

during my stay as a visiting researcher in the Department of Econometrics at Rice

University. I also want to thank our co-author, James Gualtieri, for investing so much

time in collecting data and contributing to the development of the application part.

I would also like to express my gratitude to my co-author and my friend JProf. Dr.

Dominik Liebl, who contributed to the joint paper presented in Chapter 2. I benefited

much from his numerous helpful remarks and creative ideas. I will miss his extraordinary

talent to identify and simplify substantially important and complex statistical problems.

In addition, I would like to give my gratitude to all my colleagues in the department

of statistics in the Institute for Financial Economics and Statistics of the University of

Bonn: Prof. Dr. Lorens Imhof, Heiko Wagner, Dominik Poß, Prof. Dr. Hans-Joachim

Werner, Dr. Klaus Utikal, Fabian Walders, Martin Arndt, and Hildegard Grober. They

not only provided me technical help, but also helped me to improve my German language

skills and taught me a lot of unconventional idioms.

I would like to express my deepest gratitude to my parents Emna and Romdhan, my

uncle Khaled, my aunt Metira, my sister Wihed, and my brother Wissem. I could

not have had the opportunity to study in Bonn and to write this dissertation without

them. I also would like to thank all my family members, in particular, Naima Moussa,

Ali Abbes, Fatma Boughtass, Rami Abbes, Ameur Abbes, Safouane Jguirim, Safa Bac-

couche Abbes, and Ghada Abada as well as my friends Kacem Kassraoui, Walid Hamdi,

and Nadhmi Nefzi. They have played an important moral-support role.

Last but not least, to my wonderful wife Abir, who has supported me in good and bad

times during all phases of this thesis: I love you.

Contents

Abstract v

Acknowledgements vii

Contents ix

List of Figures xi

List of Tables xiii

Introduction 1

1 Panel Models with Unknown Number of Unobserved Factors: An Ap-plication to the Credit Spread Puzzle 5

1.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5

1.2 Model and Estimation Algorithm . . . . . . . . . . . . . . . . . . . . . . . 8

1.2.1 Model Identification and Estimation . . . . . . . . . . . . . . . . . 8

1.2.2 Starting Values and Iteration Scheme . . . . . . . . . . . . . . . . 12

1.3 Model Extension and Theoretical Results . . . . . . . . . . . . . . . . . . 15

1.3.1 Presence of Additional Categorical Variables . . . . . . . . . . . . 15

1.3.2 Assumptions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17

1.3.3 Asymptotic Distribution and Bias Correction . . . . . . . . . . . . 19

1.4 Monte Carlo Simulations . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21

1.5 Application: The Unobserved Risk Premia of Corporate Bonds . . . . . . 28

1.5.1 The Empirical Model . . . . . . . . . . . . . . . . . . . . . . . . . 28

1.5.2 Data Description . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30

1.5.3 Empirical Results and Interpretations . . . . . . . . . . . . . . . . 32

1.6 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35

2 The R-package phtt: Panel Data Analysis with Heterogeneous TimeTrends 37

2.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37

2.2 Panel Models for Heterogeneity in Time Trends . . . . . . . . . . . . . . . 41

2.2.1 Computational Details . . . . . . . . . . . . . . . . . . . . . . . . . 44

2.2.2 Application . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45

2.3 Panel Criteria for Selecting the Number of Factors . . . . . . . . . . . . . 49

Contents

2.3.1 Application . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52

2.4 Panel Models with Stochastically Bounded Factors . . . . . . . . . . . . . 56

2.4.1 Model with Known Number of Factors . . . . . . . . . . . . . . . . 56

2.4.2 Model with Unknown Number of Factors . . . . . . . . . . . . . . 57

2.4.3 Application . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 60

2.5 Models with Additive and Interactive Unobserved Effects . . . . . . . . . 63

2.5.1 Specification Tests . . . . . . . . . . . . . . . . . . . . . . . . . . . 67

2.5.1.1 Testing the Sufficiency of Classical Additive Effects . . . 67

2.5.1.2 Testing the Existence of Common Factors . . . . . . . . . 69

2.6 Interpretation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71

2.7 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 73

3 Panel Models with Multiple Jumps in the Parameters 75

3.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75

3.2 Preliminaries . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 78

3.3 Two-way Panel Models with Multiple Jumps . . . . . . . . . . . . . . . . 83

3.3.1 Estimation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83

3.3.2 Assumptions and Main Asymptotic Results . . . . . . . . . . . . . 86

3.4 Post-SAW Procedures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89

3.4.1 Tree-Structured Representation . . . . . . . . . . . . . . . . . . . . 89

3.4.2 Detecting the Jump Locations . . . . . . . . . . . . . . . . . . . . 92

3.4.3 Post-SAW Estimation . . . . . . . . . . . . . . . . . . . . . . . . . 93

3.5 SAW with Unobserved Multifactor Effects . . . . . . . . . . . . . . . . . . 95

3.6 Monte Carlo Simulations . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99

3.7 Application: Algorithmic Trading and Market Quality . . . . . . . . . . . 103

3.7.1 Liquidity and Asset Pricing . . . . . . . . . . . . . . . . . . . . . . 104

3.7.2 Data . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 106

3.7.2.1 The Algorithmic Trading Proxy . . . . . . . . . . . . . . 106

3.7.2.2 Market Quality Measures . . . . . . . . . . . . . . . . . . 107

3.7.3 Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 109

3.8 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 115

A Appendix of Chapter 1 117

A.1 Theoretical Results and Proofs . . . . . . . . . . . . . . . . . . . . . . . . 117

B Appendix of Chapter 3 123

B.1 Proofs of Section 3.2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 123

Bibliography 143

List of Figures

1.1 Credit spread curves before and after transformation . . . . . . . . . . . . 31

1.2 Screeplots for static and dynamic factors . . . . . . . . . . . . . . . . . . . 33

1.3 Estimates of the time varying rating effects and the systematic factors . . 33

1.4 Estimated first and second risk components . . . . . . . . . . . . . . . . . 35

2.1 Plots of the dependent and independent variables . . . . . . . . . . . . . . 41

2.2 Estimated factors and estimated time-varying individual effects . . . . . . 48

2.3 Scree plot produced by the plot()-method for OptDim-objects . . . . . . 54

2.4 Estimated factors and estimated time-varying individual effects . . . . . . 64

2.5 Estimated additive and interactive heterogeneity effects . . . . . . . . . . 67

2.6 Visualization of the differences of the time-varying individual effects . . . 73

3.1 Tree-structured representation of the wavelet coefficients . . . . . . . . . . 90

3.2 Tree-structured representation of the shifted and non-shifted coefficients . 90

3.3 Time varying effect of algorithmic trading on the proportional quotedspread . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 112

3.4 Time varying effect of algorithmic trading on the proportional effectivespread . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 113

3.5 Effect of algorithmic trading on the daily high-low price range . . . . . . . 114

3.6 Time varying effect of algorithmic trading on the realized variance . . . . 115

List of Tables

1.1 Simulation results of the Monte Carlo experiments for DGP1 - DGP3 . . 26

1.2 Simulation results of the Monte Carlo experiments for DGP4 - DGP6 . . 27

1.3 Number of corporate bonds by rating class . . . . . . . . . . . . . . . . . . 31

1.4 Estimation results for the empirical Models (M.1)-(M.4) . . . . . . . . . . 34

2.1 List of the variance shares of the estimated common factors . . . . . . . . 72

3.1 Simulation results of the Monte Carlo experiments for DGP1-DGP2 . . . 101

3.4 Simulation results of the Monte Carlo experiments for DGP7 . . . . . . . 103

3.5 Instrumental variable panel data model with constant parameters . . . . . 110

3.6 Post-wavelet estimates for the proportional quoted spread . . . . . . . . . 112

3.7 Post-wavelet estimates for the proportional effective spread . . . . . . . . 113

3.8 Post-wavelet estimates for the daily high-low price range . . . . . . . . . . 114

3.9 Post-wavelet estimates for the realized variance . . . . . . . . . . . . . . . 115

I would like to dedicate this thesis to my beloved parents, mywonderful wife, and my admirable daughter.

Introduction