31
Genetics of recombination rate variation in the pig 1 2 Martin Johnsson 1,2 *, Andrew Whalen 1 *, Roger Ros-Freixedes 1,3 , Gregor Gorjanc 1 , Ching-Yi 3 Chen 4 , William O. Herring 4 , Dirk-Jan de Koning 2 , John M. Hickey 1 4 5 1 The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of 6 Edinburgh, Midlothian, EH25 9RG, Scotland, United Kingdom. 7 2 Department of Animal Breeding and Genetics, Swedish University of Agricultural 8 Sciences, Box 7023, 750 07 Uppsala, Sweden. 9 3 Departament de Ciència Animal, Universitat de Lleida-Agrotecnio Center, Lleida,Spain. 10 4 Genus plc, 100 Bluegrass Commons Blvd., Ste122200, Hendersonville, TN 37075, USA. 11 12 * Equal contributions 13 † Corresponding author 14 [email protected] 15 16 Abstract 17 Background 18 In this paper, we estimated recombination rate variation within the genome and between 19 individuals in the pig using multiocus iterative peeling for 150,000 pigs across nine 20 genotyped pedigrees. We used this to estimate the heritability of recombination and perform 21 a genome-wide association study of recombination in the pig. 22 23 Results 24 Our results confirmed known features of the pig recombination landscape, including 25 differences in chromosome length, and marked sex differences. The recombination landscape 26 was repeatable between lines, but at the same time, the lines also showed differences in 27 average genome-wide recombination rate. The heritability of genome-wide recombination 28 was low but non-zero (on average 0.07 for females and 0.05 for males). We found three 29 genomic regions associated with recombination rate, one of them harbouring the RNF212 30 gene, previously associated with recombination rate in several other species. 31 32 Conclusion 33 . CC-BY-NC-ND 4.0 International license author/funder. It is made available under a The copyright holder for this preprint (which was not peer-reviewed) is the . https://doi.org/10.1101/2020.03.17.995969 doi: bioRxiv preprint

Genetics of recombination rate variation in the pig · 8 2 Department of Animal Breeding and Genetics, Swedish University of Agricultural 9 Sciences, Box 7023, 750 07 Uppsala, Sweden

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Genetics of recombination rate variation in the pig · 8 2 Department of Animal Breeding and Genetics, Swedish University of Agricultural 9 Sciences, Box 7023, 750 07 Uppsala, Sweden

Genetics of recombination rate variation in the pig 1 2

Martin Johnsson1,2*, Andrew Whalen1*, Roger Ros-Freixedes1,3, Gregor Gorjanc1, Ching-Yi 3

Chen4, William O. Herring4, Dirk-Jan de Koning2, John M. Hickey1† 4

5

1 The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of 6

Edinburgh, Midlothian, EH25 9RG, Scotland, United Kingdom. 7

2 Department of Animal Breeding and Genetics, Swedish University of Agricultural 8

Sciences, Box 7023, 750 07 Uppsala, Sweden. 9

3 Departament de Ciència Animal, Universitat de Lleida-Agrotecnio Center, Lleida,Spain. 10

4 Genus plc, 100 Bluegrass Commons Blvd., Ste122200, Hendersonville, TN 37075, USA. 11

12

* Equal contributions 13

† Corresponding author 14

[email protected] 15

16

Abstract 17

Background 18

In this paper, we estimated recombination rate variation within the genome and between 19

individuals in the pig using multiocus iterative peeling for 150,000 pigs across nine 20

genotyped pedigrees. We used this to estimate the heritability of recombination and perform 21

a genome-wide association study of recombination in the pig. 22

23

Results 24

Our results confirmed known features of the pig recombination landscape, including 25

differences in chromosome length, and marked sex differences. The recombination landscape 26

was repeatable between lines, but at the same time, the lines also showed differences in 27

average genome-wide recombination rate. The heritability of genome-wide recombination 28

was low but non-zero (on average 0.07 for females and 0.05 for males). We found three 29

genomic regions associated with recombination rate, one of them harbouring the RNF212 30

gene, previously associated with recombination rate in several other species. 31

32

Conclusion 33

.CC-BY-NC-ND 4.0 International licenseauthor/funder. It is made available under aThe copyright holder for this preprint (which was not peer-reviewed) is the. https://doi.org/10.1101/2020.03.17.995969doi: bioRxiv preprint

Page 2: Genetics of recombination rate variation in the pig · 8 2 Department of Animal Breeding and Genetics, Swedish University of Agricultural 9 Sciences, Box 7023, 750 07 Uppsala, Sweden

Our results from the pig agree with the picture of recombination rate variation in vertebrates, 34

with low but nonzero heritability, and a major locus that is homologous to one detected in 35

several other species. This work also highlights the utility of using large-scale livestock data 36

to understand biological processes. 37

38

Background 39

40

This paper shows that recombination rate in the pig (Sus scrofa) is genetically variable and 41

associated with alleles at the RNF212 gene. 42

43

Recombination causes exchange of genetic material between homologous chromosomes. At 44

meiosis, after chromosomes have been paired up and duplicated, they break and exchange 45

pieces of chromosome arms. These recombinations are not evenly distributed along 46

chromosomes. This gives rise to a variable recombination rate landscape with peaks and 47

troughs. 48

49

The recombination rate landscape of the pig has been estimated previously [1]. It shows 50

broadly the same features as in other mammals: low recombination rate in the centre of 51

chromosomes, local hotspots of high recombination rate, a correlation between 52

recombination rate and the fraction of guanine and cytosine bases (GC content), and sex 53

difference in recombination rate [2–4]. In this paper, we investigated how recombination rate 54

varied between individuals and populations in the pig. 55

56

Recombination rate is genetically variable in several other species. Studies in humans [5, 6], 57

cattle [7–10], deer [11], sheep [12, 13] and chickens [14] have observed genetic influence on 58

recombination rate, and genetic associations with alleles at a handful of genes involved in 59

meiosis, including RNF212, REC8 and PRDM9 (reviewed by [15, 16]). 60

61

To be able to analyse the genetic basis of recombination, we need recombination estimates 62

from a large number related pigs. Recombination rate can be estimated by phasing genotypes 63

in pedigrees [17–20], by direct counting in gametes [21, 22], or by measuring linkage 64

disequilibrium in population samples [2]. Counting methods require specific experiments to 65

.CC-BY-NC-ND 4.0 International licenseauthor/funder. It is made available under aThe copyright holder for this preprint (which was not peer-reviewed) is the. https://doi.org/10.1101/2020.03.17.995969doi: bioRxiv preprint

Page 3: Genetics of recombination rate variation in the pig · 8 2 Department of Animal Breeding and Genetics, Swedish University of Agricultural 9 Sciences, Box 7023, 750 07 Uppsala, Sweden

gather data. Linkage disequilibrium methods only provide averages for a population. In this 66

paper, we used a new pedigree method based on multilocus iterative peeling [23, 24] to 67

estimate recombination simultaneously with genotype imputation. This allowed us to use data 68

from a pig breeding programme, where variable density genotype data has been gathered for 69

genomic selection. 70

71

Our results confirmed known features of the pig recombination landscape, including 72

differences in chromosome length, and marked sex difference. The recombination landscape 73

was repeatable between lines, but at the same time, the lines showed differences in average 74

genome-wide recombination rate. The heritability of genome-wide recombination was low 75

but non-zero. We found three genomic regions associated with recombination rate, one of 76

them harbouring the RNF212 gene, previously associated with recombination rate in several 77

other species. 78

Methods 79

80

We estimated the recombination rate landscape in nine lines of pigs from a commercial 81

breeding programme. We performed six analyses: 82

(1) We estimated the average number of recombinations on each chromosome (the 83

genetic length of chromosomes), and analysed between-sex and between-line 84

differences in genetic length. We compared these estimates to previously published 85

estimates. 86

(2) We estimated the distribution of recombinations along chromosomes (recombination 87

rate landscapes), and analysed between-line and between-sex differences. 88

(3) We estimated the correlation between recombination rate and DNA sequence features 89

previously known to correlate with recombination rate. 90

(4) We estimated pedigree heritability and genomic heritability of recombination rate. 91

(5) We ran a genome-wide association study to detect markers associated with 92

recombination rate. 93

(6) We ran a simulation to test the performance of the method. 94

95

.CC-BY-NC-ND 4.0 International licenseauthor/funder. It is made available under aThe copyright holder for this preprint (which was not peer-reviewed) is the. https://doi.org/10.1101/2020.03.17.995969doi: bioRxiv preprint

Page 4: Genetics of recombination rate variation in the pig · 8 2 Department of Animal Breeding and Genetics, Swedish University of Agricultural 9 Sciences, Box 7023, 750 07 Uppsala, Sweden

Data 96

97

We used SNP chip genotype data from nine lines of pigs from the Pig Improvement 98

Company (PIC) breeding programme. This programme contains a diverse collection of 99

genetics, which represent broadly used populations, including animals of Large White, 100

Landrace, Duroc, Hampshire and Pietrain heritage. The pigs were genotyped at a mix of 101

densities; either at low density (15K markers) using GGP-Porcine LD BeadChips (GeneSeek, 102

Lincoln, NE) or at high density (60K or 75K markers) using GGP-Porcine HD BeadChips 103

(GeneSeek, Lincoln, NE). In total, genotype data was available on 390,758 pigs. 104

105

Recombination rate estimation using multilocus iterative peeling 106

107

We used multilocus iterative peeling to estimate the number and location of the 108

recombination events in each individual [23, 24]. Multilocus iterative peeling uses pedigree 109

data to calculate the phased genotype of each individual as a combination of information 110

from the individual’s own genetic data, and that of their parents (anterior probabilities) and 111

offspring (posterior probabilities) [25]. Multilocus iterative peeling builds on previous 112

peeling algorithms by tracking which parental haplotype an individual inherits at each locus 113

(segregation probabilities). This information can be used to determine which allele an 114

individual inherits, particularly from parents who are heterozygous for that allele. 115

116

The segregation probabilities can be used to determine the number and location of likely 117

recombination events. When a recombination happens, the offspring will inherit from a 118

different parental haplotype. This will cause one, or both of the segregation probabilities to 119

change, i.e. the segregation probability will change from a value close to 0 (likely to inherit 120

the maternal haplotype) to 1 (likely to inherit the paternal haplotype). By analysing the joint 121

distribution of neighbouring segregation probabilities, we are able to calculate the expected 122

number of recombinations between two loci, and the expected number of recombinations 123

across an entire chromosome. 124

125

To aid recombination rate estimation, we introduced two simplifications to the multilocus 126

peeling method: 127

.CC-BY-NC-ND 4.0 International licenseauthor/funder. It is made available under aThe copyright holder for this preprint (which was not peer-reviewed) is the. https://doi.org/10.1101/2020.03.17.995969doi: bioRxiv preprint

Page 5: Genetics of recombination rate variation in the pig · 8 2 Department of Animal Breeding and Genetics, Swedish University of Agricultural 9 Sciences, Box 7023, 750 07 Uppsala, Sweden

1. The segregation probabilities and the anterior probabilities were calculated separately 128

for each parent in lieu of modelling their full joint distribution. 129

2. The segregation and genotype probabilities of the offspring were called when 130

estimating the posterior term for each parent. 131

132

These simplifications were introduced to reduce runtime and memory requirements. In 133

particular, by calling the segregation and genotype values, we are able to store many of the 134

calculations in lookup tables instead of re-computing them for each locus, and each 135

individual. In addition, the calling of segregation values reduced the chance that feedback 136

loops occurred between offspring with fractional segregation values at multiple nearby loci. 137

138

A calling threshold of 0.99 was used for the segregation probabilities, and a calling threshold 139

of 0.9 was used for the genotype probabilities. Segregation probabilities that did not reach the 140

threshold were set to missing (equally likely to inherit either parental haplotype). Genotype 141

probabilities that did not meet the threshold were also set to missing (all genotype states 142

equally likely). 143

144

The joint distribution of segregation values depends on the chromosome length (in cM). To 145

estimate chromosome length, we initialized the length to 100cM (on average 1 recombination 146

per chromosome), and then refined this estimate in a series of steps. At each step we 147

calculated the expected number of recombination for each individual at each locus, and set 148

the chromosome length based on the average population recombination rate. This step was 149

repeated four times. Preliminary simulations found that chromosome length estimates 150

converged after four iterations, and that the recombination estimates for target individuals 151

were insensitive to the assumed chromosome length. 152

153

Filtering of individuals 154

155

After recombination estimation, we filtered the data by removing individuals without 156

genotyped parents and grandparents in order to focus on those with high-quality 157

recombination estimates. Filtering reduced the number of pigs to 145,763. Table 1 shows the 158

resulting number of individuals per line post-filtering, and the total number of dams and sires 159

for those individuals. 160

.CC-BY-NC-ND 4.0 International licenseauthor/funder. It is made available under aThe copyright holder for this preprint (which was not peer-reviewed) is the. https://doi.org/10.1101/2020.03.17.995969doi: bioRxiv preprint

Page 6: Genetics of recombination rate variation in the pig · 8 2 Department of Animal Breeding and Genetics, Swedish University of Agricultural 9 Sciences, Box 7023, 750 07 Uppsala, Sweden

161

Table 1. Number of individuals that passed filtering in each line, and the unique number of 162

their dams and sires. By necessity, we inferred recombination rates from an equal number of 163

maternal and paternal chromosomes, but they derive from a much larger number of dams 164

than sires. 165

Line Individuals kept Dams Sires 1 23273 2651 437 2 16661 2255 368 3 14278 2169 215 4 7153 1239 163 5 33566 4349 293 6 11666 1971 162 7 263 76 20 8 4177 727 78 9 34726 5171 492

166

167

Comparison between lines and to published maps 168

169

To compare the recombination landscapes of the nine lines we calculated pairwise 170

correlations between lines of the estimated recombination rates at each marker interval, 171

within each sex. To compare the recombination landscapes of the sexes, we calculated the 172

correlation between sexes within each line. 173

174

We compared map length between lines using a linear model, fitting the number of 175

recombinations observed on a chromosome as response, and fixed effects for each line and 176

chromosome. 177

178

To compare the estimated landscapes to published landscapes, we also compared our results 179

to the results of [1] by plotting our map length of each chromosome against published map 180

lengths. 181

182

Correlation with genome features 183

184

To investigate the relationship between local recombination rate and genomic features, we 185

divided the autosomal part of the Sscrofa11.1 genome [26] into 2272 windows of 1 Mbp. We 186

.CC-BY-NC-ND 4.0 International licenseauthor/funder. It is made available under aThe copyright holder for this preprint (which was not peer-reviewed) is the. https://doi.org/10.1101/2020.03.17.995969doi: bioRxiv preprint

Page 7: Genetics of recombination rate variation in the pig · 8 2 Department of Animal Breeding and Genetics, Swedish University of Agricultural 9 Sciences, Box 7023, 750 07 Uppsala, Sweden

used Biostrings version 2.52.0 in the R statistical environment to estimate three features of 187

sequence composition: 188

• fraction of guanine and cytosine bases (GC content); 189

• the PDRM9 consensus motif CCNCCNTNNCCNC [27]; 190

• the CCCCACCCC motif, which was the most strongly associated with recombination 191

in the pig in [1]. 192

193

We used repeat data from RepeatMasker (http://www.repeatmasker.org) [28] from the pig 194

genome to estimate the density of repeats in the same windows. We subdivided the total 195

content of repeats into three broad categories: 196

• Fraction of LTR elements 197

• Fraction of DNA repeats elements 198

• Fraction of low complexity repeats 199

200

We calculated the correlation between the recombination rate and the sequence features 201

within each window. 202

203

To find putative pericentromeric regions, we used the inferred centromere positions from 204

[26]. On chromosomes 8, 11 and 15, where there were more than one inferred location far 205

apart, we picked the most likely location based on karyotypes from [29]. 206

207

Heritability of genome-wide recombination rate 208

209

We estimated the narrow-sense heritability of genome-wide recombination rate using animal 210

models in MCMCglmm [30] version 2.29. We estimated the heritability of recombination 211

using genome-wide recombination rates per megabasepair. We fitted a pedigree animal 212

model with an additive genetic effect and a permanent environmental effect for each parent 213

as random effects. Because we measured recombination rate in parents of genotyped 214

offspring, who have varying numbers of offspring (see Table 1), we used a model with 215

repeated records and a permanent environmental effect for each parent. We analysed sexes 216

and lines separately. We used parameter expanded priors [31] for the individual variance 217

component and for the additive genetic variance component, using V = 1, ν = 1, αµ = 0, αV = 218

1000, which corresponds to a half-Cauchy prior with scale 100, and an inverse-Wishart prior 219

.CC-BY-NC-ND 4.0 International licenseauthor/funder. It is made available under aThe copyright holder for this preprint (which was not peer-reviewed) is the. https://doi.org/10.1101/2020.03.17.995969doi: bioRxiv preprint

Page 8: Genetics of recombination rate variation in the pig · 8 2 Department of Animal Breeding and Genetics, Swedish University of Agricultural 9 Sciences, Box 7023, 750 07 Uppsala, Sweden

(V = 1, ν = 1) for the residual variance. Because of the low number of dams and sires, we 220

excluded the smallest line (line 7) from the quantitative genetic analysis. We also excluded 221

parents with an extremely high average recombination rate (> 5 cM/Mbp). 222

223

Genome-wide association 224

225

We performed genome-wide association studies of genome-wide recombination rates using 226

hierarchical linear mixed models in RepeatABEL [32] version 1.1. The linear mixed model 227

uses a genomic relationship matrix to account for relatedness while including a random 228

permanent environmental effect for each parent. We analysed sexes and lines separately. We 229

used imputed best-guess genotypes from the same run of AlphaPeel. Because of the low 230

number of dams and sires, we again excluded line 7 from the analysis, and parents with 231

average recombination rate > 5 cM/Mbp. We report significant markers below a conventional 232

threshold of p < 5 · 10-8. We used the most significant marker in each region to report 233

variance explained and the frequency of the allele associated with higher recombination. 234

When there were more than one marker with the same p-value, we selected the marker 235

closest to the middle of the interval. 236

237

Simulations 238

239

To demonstrate that the method works, we tested it on a synthetic dataset with features 240

similar to real data. We simulated genotype data with AlphaSimR 0.10.0. We simulated one 241

chromosome, using the same pedigree and same number of genotyped markers as the largest 242

line. The simulated recombination landscape had a constant recombination rate in the middle 243

of the chromosome, and two regions of high recombination rate at the ends, described by 244

second degree polynomials (the figure shows the resulting true recombination rate). We 245

assessed accuracy of the inferred recombination landscape by calculating the correlation 246

between the estimated number of recombination at each marker interval and the true number 247

of recombination. We also calculated the correlation between the estimated number of 248

recombinations and a smoothed recombination landscape, using a window of 50 markers. 249

250

251

.CC-BY-NC-ND 4.0 International licenseauthor/funder. It is made available under aThe copyright holder for this preprint (which was not peer-reviewed) is the. https://doi.org/10.1101/2020.03.17.995969doi: bioRxiv preprint

Page 9: Genetics of recombination rate variation in the pig · 8 2 Department of Animal Breeding and Genetics, Swedish University of Agricultural 9 Sciences, Box 7023, 750 07 Uppsala, Sweden

Results 252

253

Our results showed that: 254

(1) There was variation in the genetic length of chromosomes between sexes and lines. 255

(2) The recombination rate landscape was similar between lines but different between 256

sexes. 257

(3) We confirmed previous findings that local recombination rate is correlated with GC 258

content, repeat content, the CCCCACCCC sequence motif, but not the previously 259

described correlation with the PRDM9 consensus motif. 260

(4) The heritability of recombination rate was on average 0.07 for females and 0.05 for 261

males. 262

(5) Three regions of the genome were associated with recombination rate, one of them 263

containing the candidate gene RNF212. 264

(6) In simulation, we found that multilocus iterative peeling could estimate the number of 265

recombinations per individual with an accuracy of 0.7 for dams and 0.5 for sires, and 266

the average recombination landscape along a chromosome, but with a tendency to 267

overestimate the genetic length. 268

269

Variation in genetic map length between lines and sexes 270

271

The genetic length of chromosomes was different between lines and sexes. Figure 1 shows 272

the estimated map length of each chromosome, along with previously published estimates 273

[1]. Table 2 gives the estimated of total map length in each sex and line, with confidence 274

intervals derived from a linear model. On average, we estimated a sex-averaged map of 21.5 275

Morgan (0.95 cM/Mbp), a female map of 23.6 Morgan (1.04 cM/Mbp), and a male map of 276

19.5 Morgan (0.86 cM/Mbp). Supplementary tables 1-3 contain male, female, and sex-277

averaged consensus maps of the pig recombination landscape. 278

279

Our estimated genetic lengths of chromosomes were comparable to previous estimates, but 280

tended to be higher. We found that females have higher recombination rate, except on 281

chromosome 1, where male recombination rate was higher, and chromosome 13, where the 282

recombination rate is similar in both sexes. This confirms previous results [1].283

.CC-BY-NC-ND 4.0 International licenseauthor/funder. It is made available under aThe copyright holder for this preprint (which was not peer-reviewed) is the. https://doi.org/10.1101/2020.03.17.995969doi: bioRxiv preprint

Page 10: Genetics of recombination rate variation in the pig · 8 2 Department of Animal Breeding and Genetics, Swedish University of Agricultural 9 Sciences, Box 7023, 750 07 Uppsala, Sweden

Table 2. Estimates from linear model of total map length. Intervals are 95% confidence 284

intervals. 285

Line Sex Map length (Morgan) Lower Upper

Rate (cM/Mbp)

1 female 23.6 23.5 23.6 1.04 1 male 19.4 19.4 19.5 0.86 2 female 24.1 24.1 24.2 1.06 2 male 20.0 20.0 20.0 0.88 3 female 22.3 22.2 22.3 0.98 3 male 18.2 18.1 18.2 0.80 4 female 23.5 23.4 23.5 1.04 4 male 19.3 19.3 19.4 0.85 5 female 22.8 22.7 22.8 1.01 5 male 18.7 18.6 18.7 0.82 6 female 23.7 23.6 23.7 1.04 6 male 19.5 19.5 19.6 0.86 7 female 25.9 25.5 26.2 1.14 7 male 21.7 21.4 22.1 0.96 8 female 24.1 24.0 24.2 1.06 8 male 20.0 19.9 20.1 0.88 9 female 22.6 22.6 22.6 1.00 9 male 18.5 18.4 18.5 0.82 Average female 23.6 1.04

male 19.5 0.86

sex-average 21.5 0.95 286

.CC-BY-NC-ND 4.0 International licenseauthor/funder. It is made available under aThe copyright holder for this preprint (which was not peer-reviewed) is the. https://doi.org/10.1101/2020.03.17.995969doi: bioRxiv preprint

Page 11: Genetics of recombination rate variation in the pig · 8 2 Department of Animal Breeding and Genetics, Swedish University of Agricultural 9 Sciences, Box 7023, 750 07 Uppsala, Sweden

287

288 Figure 1. Genetic length of each pig autosome, as estimated by multilocus iterative peeling. 289

The horizontal axis corresponds to chromosomes 1-18. Red dots and lines show female 290

estimates, while blue dots and lines show male estimates. Panel A compares estimates from 291

multilocus iterative peeling (filled dots) to estimates from [1] (open circles). Panel B shows 292

estimates from the same line of pigs connected by lines. 293

0.5

1.0

1.5

2.0

Gen

etic

leng

th (M

orga

n)

0.5

1.0

1.5

2.0

Gen

etic

leng

th (M

orga

n)

Estimated genetic length

●●

●●

●●●●●

●●

●●

●●

●●

●●●

●●

●●

●●

●●

●●

●●

●●●●●●

●●

●●

●●

●●●

●●●

●●

●●●

●●

●●●

●●

●●

●●

●●

●●

●●●

●●

●●

●●

●●●

●●

●●

●●

●●

●●●

●●

●●

●●●

●●●

●●

●●

●●

●●●

●●

●●

●●

●●●●●●

●●

● ●

●●

● ● ●

●●

● ●

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18

Sex male female

A

B

.CC-BY-NC-ND 4.0 International licenseauthor/funder. It is made available under aThe copyright holder for this preprint (which was not peer-reviewed) is the. https://doi.org/10.1101/2020.03.17.995969doi: bioRxiv preprint

Page 12: Genetics of recombination rate variation in the pig · 8 2 Department of Animal Breeding and Genetics, Swedish University of Agricultural 9 Sciences, Box 7023, 750 07 Uppsala, Sweden

Difference in recombination landscape between sexes 294

295

The shape of the recombination landscape was similar between lines but different between 296

sexes. Figure 2 presents the recombination rate landscape for each chromosome, and Figure 3 297

shows the correlation between the per-marker interval recombination rate estimates, between 298

lines and between sexes. Both sexes had higher recombination rate near chromosome ends 299

and lower recombination rate in the middle of the chromosomes. However, there were several 300

broad regions of elevated female recombination rate which was not present in the males. 301

These regions were repeatable between lines. The mean between-line correlation was 0.83 in 302

females and 0.70 in males, whereas the mean correlation between sexes was 0.40 across 303

lines. 304

305

Correlation between genomic features and recombination rate 306

307

Local recombination rate had moderate to low correlation (absolute correlation coefficients 308

less than 0.33) with GC content, repeats and particular sequence motifs. Figure 4 shows the 309

correlations between recombination rate and genomic features in 1 Mbp windows, separated 310

by sex. There were positive correlations with GC content, and negative correlation with 311

sequence repeats when all repeat classes were combined. The correlation between 312

recombination rate and different types of repeats was variable. Recombination rate was only 313

weakly correlated with counts of the PRDM9 consensus motif CCNCCNTNNCCNC, but 314

moderately correlated with counts of the CCCCACCCC motif, previously found to be 315

enriched in high recombination regions in the pig genome [1]. 316

.CC-BY-NC-ND 4.0 International licenseauthor/funder. It is made available under aThe copyright holder for this preprint (which was not peer-reviewed) is the. https://doi.org/10.1101/2020.03.17.995969doi: bioRxiv preprint

Page 13: Genetics of recombination rate variation in the pig · 8 2 Department of Animal Breeding and Genetics, Swedish University of Agricultural 9 Sciences, Box 7023, 750 07 Uppsala, Sweden

317

16 17 18

13 14 15

10 11 12

7 8 9

4 5 6

1 2 3

0 20 40 60 80 0 20 40 60 0 20 40

0 50 100 150 200 0 50 100 0 50 100

0 20 40 60 0 20 40 60 80 0 20 40 60

0 25 50 75 100 125 0 50 100 0 50 100

0 50 100 0 25 50 75 100 0 50 100 150

0 100 200 0 50 100 150 0 50 1000.000

0.025

0.050

0.000

0.025

0.050

0.000

0.025

0.050

0.000

0.025

0.050

0.000

0.025

0.050

0.000

0.025

0.050

Position (Mbp)

Rec

ombi

natio

n ra

te (p

er M

bp)

Sex Female Male

Recombination landscape

.CC-BY-NC-ND 4.0 International licenseauthor/funder. It is made available under aThe copyright holder for this preprint (which was not peer-reviewed) is the. https://doi.org/10.1101/2020.03.17.995969doi: bioRxiv preprint

Page 14: Genetics of recombination rate variation in the pig · 8 2 Department of Animal Breeding and Genetics, Swedish University of Agricultural 9 Sciences, Box 7023, 750 07 Uppsala, Sweden

Figure 2. Recombination landscape in the pig. The lines show recombination rate in windows 318

of 1 Mbp along the pig genome (Sscrofa11.1). Red lines show female estimates and blue lines 319

show male estimates. Each line shows one of the nine breeding lines. The black vertical lines 320

are predicted centromere locations in the reference genome, for chromosomes where they are 321

available. 322

323

324

325

326 Figure 3. Correlation heatmap of recombination landscapes between lines and sexes. 327

Heatmaps show pairwise correlations between lines of the estimated recombination rates at 328

each marker interval, within each sex, and the correlation between sexes within each line. 329

330

1

2

3

4

5

6

7

8

9

Line

1

2

3

4

5

6

7

8

9

1

2

3

4

5

6

7

8

9

Between lines, female Between lines, male Between sexes

1

0.89

0.87

0.88

0.9

0.86

0.64

0.87

0.93

0.89

1

0.84

0.87

0.89

0.85

0.63

0.85

0.91

0.87

0.84

1

0.9

0.89

0.87

0.62

0.83

0.91

0.88

0.87

0.9

1

0.91

0.88

0.65

0.85

0.93

0.9

0.89

0.89

0.91

1

0.87

0.65

0.86

0.96

0.86

0.85

0.87

0.88

0.87

1

0.64

0.85

0.9

0.64

0.63

0.62

0.65

0.65

0.64

1

0.67

0.67

0.87

0.85

0.83

0.85

0.86

0.85

0.67

1

0.89

0.93

0.91

0.91

0.93

0.96

0.9

0.67

0.89

1

1

0.71

0.74

0.76

0.82

0.76

0.47

0.72

0.84

0.71

1

0.63

0.69

0.81

0.7

0.44

0.68

0.77

0.74

0.63

1

0.78

0.8

0.75

0.4

0.6

0.8

0.76

0.69

0.78

1

0.84

0.79

0.45

0.68

0.85

0.82

0.81

0.8

0.84

1

0.84

0.5

0.76

0.95

0.76

0.7

0.75

0.79

0.84

1

0.49

0.73

0.85

0.47

0.44

0.4

0.45

0.5

0.49

1

0.51

0.51

0.72

0.68

0.6

0.68

0.76

0.73

0.51

1

0.79

0.84

0.77

0.8

0.85

0.95

0.85

0.51

0.79

1

0.42

0.47

0.4

0.41

0.47

0.39

0.19

0.42

0.45

1 2 3 4 5 6 7 8 9Line

1 2 3 4 5 6 7 8 9Line

0.00

0.25

0.50

0.75

1.00cor

.CC-BY-NC-ND 4.0 International licenseauthor/funder. It is made available under aThe copyright holder for this preprint (which was not peer-reviewed) is the. https://doi.org/10.1101/2020.03.17.995969doi: bioRxiv preprint

Page 15: Genetics of recombination rate variation in the pig · 8 2 Department of Animal Breeding and Genetics, Swedish University of Agricultural 9 Sciences, Box 7023, 750 07 Uppsala, Sweden

331 Figure 4. Heatmap of correlation between genome features and recombination rate in 332

windows of 1 Mbp. The heatmap shows correlation between recombination rate sequence 333

features within 2272 windows of the autosomal part of the pig genome (Sscrofa11.1). 334

0.33

−0.16

−0.33

0.23

−0.069

−0.021

0.16

0.024

0.28

0.21

−0.17

−0.25

0.09

−0.0028

0.053

0.072

0.019

0.16

Simple repeats

DNA repeats

LTR repeats

LINE repeats

SINE repeats

All repeats

CCCCACCCC motif

PRDM9 consensus motif

GC content

Female MaleSex

−1.0

−0.5

0.0

0.5

1.0cor

Correlation with genomic features

.CC-BY-NC-ND 4.0 International licenseauthor/funder. It is made available under aThe copyright holder for this preprint (which was not peer-reviewed) is the. https://doi.org/10.1101/2020.03.17.995969doi: bioRxiv preprint

Page 16: Genetics of recombination rate variation in the pig · 8 2 Department of Animal Breeding and Genetics, Swedish University of Agricultural 9 Sciences, Box 7023, 750 07 Uppsala, Sweden

Heritability of recombination rate 335

336

Genome-wide recombination rate had low but nonzero heritability (h2 on average 0.07 for 337

females and 0.05 for males). Figure 5 shows the heritability and ratio of permanent 338

environmental variance, broken down by sex and line. There was little evidence of 339

differences in heritability between lines. The open circles in Figure 5 show genomic 340

heritability estimates from genome-wide association analyses. The genomic heritabilities 341

suggest that the SNP chip captured most (on average 83%) of the additive genetic variance in 342

recombination. 343

344

Genome-wide association of recombination rate 345

346

Genome-wide association revealed three regions of the genome containing markers 347

associated with genome-wide recombination rate. Figure 6 shows the results of genome-wide 348

association scans within each line, broken down by sex. Table 3 shows the location of the 349

most significant marker for each region with variance explained and allele frequency. There 350

was a region associated with female recombination rate at the start of chromosome 8 in six of 351

the lines, as well as a region on chromosome 17 in line 1, and one on chromosome 1 in line 6. 352

The chromosome 8 region was also associated with male recombination rate in two lines. 353

Figure 7 shows a zoomed-in view of each of these regions, with the location of known 354

candidate genes involved in recombination. 355

.CC-BY-NC-ND 4.0 International licenseauthor/funder. It is made available under aThe copyright holder for this preprint (which was not peer-reviewed) is the. https://doi.org/10.1101/2020.03.17.995969doi: bioRxiv preprint

Page 17: Genetics of recombination rate variation in the pig · 8 2 Department of Animal Breeding and Genetics, Swedish University of Agricultural 9 Sciences, Box 7023, 750 07 Uppsala, Sweden

356 Figure 5. Heritability of average recombination. The dots show estimates of narrow-sense 357

heritability and the permanent environmental effect for average genome-wide recombination 358

estimated with an animal model. The lines show 95% credible intervals. Red dots and lines 359

show female estimates, while blue dots and lines show male estimates. Open circles show the 360

genomic heritability estimated from genome-wide association. Because of the low number of 361

dams and sires, we excluded the smallest line (line 7) from the analysis. 362

0.00

0.05

0.10

0.15

Her

itabi

lity

0.00

0.05

0.10

0.15

Varia

nce

ratio

Narrow−sense heritability

Permanent environment

●●

●●

●●

●●

●●

●●

1 2 3 4 5 6 8 9Line

1 2 3 4 5 6 8 9Line

Sex ● ●female male

.CC-BY-NC-ND 4.0 International licenseauthor/funder. It is made available under aThe copyright holder for this preprint (which was not peer-reviewed) is the. https://doi.org/10.1101/2020.03.17.995969doi: bioRxiv preprint

Page 18: Genetics of recombination rate variation in the pig · 8 2 Department of Animal Breeding and Genetics, Swedish University of Agricultural 9 Sciences, Box 7023, 750 07 Uppsala, Sweden

363 Figure 6. Genome-wide association of average recombination. The subplots are Manhattan 364

plots of the negative logarithm of the p-value of association against genomic position, broken 365

down by line and sex. Alternating colours correspond to chromosomes 1-18. Because of the 366

low number of dams and sires, we excluded the smallest line (line 7) from the analysis. The 367

dashed red line shows a conventional genome-wide significance threshold of 5 · 10-8. 368

.CC-BY-NC-ND 4.0 International licenseauthor/funder. It is made available under aThe copyright holder for this preprint (which was not peer-reviewed) is the. https://doi.org/10.1101/2020.03.17.995969doi: bioRxiv preprint

Page 19: Genetics of recombination rate variation in the pig · 8 2 Department of Animal Breeding and Genetics, Swedish University of Agricultural 9 Sciences, Box 7023, 750 07 Uppsala, Sweden

369 Figure 7. Regions associated with recombination rate and location of recombination-370

associated candidate genes. The subplots are Manhattan plots of the negative logarithm of 371

the p-value of association against genomic position, zoomed in to show the region around the 372

significant markers. The red triangles show location of RNF212 on chromosome 8, SLOC1 373

on chromosome 1, and SPO11 on chromosome 17. 374

●●●

●●●●●

●●

●●●

●●

●●

●●

●●

●●

●●●●●

●●●

●●

●●●

●●

●●

●●

●●●●

●●●

●●

●●

●●●

●●

●●

●●

●●

●●

●●

●●●●

●●●●●

●●●●●●●

●●

●●

●●●

●●●●

●●●●●

●●

●●

●●●●●

●●

●●●

●●

●●

●●●●●

●●●●

●●

●●

●●●●●

●●

●●

●●●●

●●●●

●●●●

●●●●●●●●●●●●●●●●●●● ●

●●●●

●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●● ●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●● ●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●

●●

●●●●●●●●

●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●

●●●●●●●●

●●

●●●●

●●

●●●

●●●

●●

●●●

●●

●●

●●

●●

●●●●

●●●

●●●

●●●●

●●

●●●

●●●

●●

● ●●●●●

●●

●●●

●●

●●●●●●●●●

●●

●●

●●

●●

●●●●

●●●●●●●●●●●●

●●

●●●●●

●●●●●●●●●

●●

●●

●●●●

●●●●●●

●●●●●●

●●

●●

●●

●●

●●●

●●●

●●●●●●

●●●●●

●●●

●●●●

●●

●●●

●●

●●

●●●●●

●●

●●

●●●●

●●●●●●●

●● ●

●●

●●

●●

●●●●●●●

●●●●●●●●●●●●

●●●●●●●●●●●

●●●●●●●

●●●

●●●●

●●

●●●●

●●●●

●●

●●●

●●●●●●●●●●

●●

●●●●●●●●●

●●●●●●

●●●●

●●

●●●●

●●

●●●●

●●●

●●

●●●

●●

●●

●●●

●●●●

●●●

●●●

●●

●●

●●●● ●

●●●●●

●●

●●●●

●●

●●●●

●●●

●●

●●●●

●●●●●

●●●●●

●●

●●

●●●

●●●●●●●●

●●●●●

●●●●●●

●●

●●●●

●●●●

●●●●●

●●●

●●●●●●

●●●●● ●

●●●●●●

●●●●●●●●●●●●●●●●

●●

●●●●●●●●●●●

●●●

●●

●●●●●●●●●●●●

●●

●●●●●●●●●●●●●●●●●●●●●●●●

●●●

●●●●

●●●

●●●●●●●●●●●●● ●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●

●●●●●●●●●●●●●●

●●●●●●●

●●

●●●●●

●●●●●●●●●●●●●

●●●●●●●

●●

●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●

●●●●●●

●●●●●●●●●

●●

●●●●●

●●

●●●

●●

●●

●●

●●

●●

●●●●

●●

●●

●●

●●

●●

●●

●●

●●

●●●●●

●●●●●●

●●●

●●●

●●

●●●

●●

●●●

●●

●●

●●

●●

●●

●●●

●●●

●●

●●

●●

●●●●

●●●

●●●●●●●●●●●

●●

●●●●●●

●●●●●●

●●●●●●●●●

●●

●●●●●●●●●●●●●●● ●

●●●●●

●●●●

●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●● ●●

●●●●●●●●●●

●●●●●●●

●●●●●●●●●●

●●●●●●●●●●●●●●●

●●●●●●●●●

●●●●●●●●●●●●●●●●

●●

●●

●●

●●

●●●●●●●●●●●●●●

●●●●●

●●●●●●●●●●●●●

●●●●●

●●●●●●●●●

●●●

●●

●●●●● ●

●●●●●

●●

●●

●●

●●

●●

●●

●●

●●●

●●●

●●●●

●●●

●●

●●

●●●

●●

●●

●●●

●●●●●

●●●

●●●●●●●●●

●●

●●

●●●●

●●●●●

●●●

●●

●●

●●

●●

●●●●

●●

●●

●●●●

●●●●●

●●

●●●

●●●●●●●●

●●●●●

●●

●●●●●●●●

●●●●●●●●●●

●●●●●

●●●

●●●

●●●●●●●●

●● ●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●

●●

●●●●●●●●

●●●●

●●●

●●●●

●●●●●●●●●

●●●●

●●

●●●●●●●●●●●●

●●●●●●●●●

●●●

●●

●●●●●●●●●●●●●●●

●●●●●●●●●●

●●

●●●●●

●●●●●●●●●●●●●●●●●●●

●●

●●

●●●●●●●●●●

●●●●●

●●

●●

●●

●●●

●●●●

●●●●

●●

●●●●●●

●●●●

●●

●●

●●●●●

●●

●●●●●●●●●●●●

●●

●●●●●●●

●●●●●●●

●●● ●●

●●●

●●

●●●●

●●●●●

●●●●●●●●●●●●●

●●●●

●●●●●

●●●

●●●●

●●●●●●

●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●

●●●●●●●●

●●●●●●●●●●●●●●●

●●●●●●●●●●●●●

●●●

●●●●●●●● ●●●●●●●●●●

●●

●●●●●●●●●●●●●●●●●●●●●●●●●●● ●●●●●●●●●●●

●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●

●●●●●

●●●●●●

●●●●●●

●●●●●

●●●●●●●●●●●●●

●●

●●●●

●●

●●

●●

●●

●●●●●●

●●●

●●●

●●

●●

●●●●●●

●●●●

●●●●

●●●●●

●●●●

●●●●

●●

●●

●●●●●●

●●

●●

●●●●●

●●●●

●●

●●

●●

●●●

●●●

●●●

●●●●

●●

●●●●●

●●●●

●●

●●●●●

●●●

●●

●●●

●●●●●

●●●●●

●●●●●● ●●●●●●●●

●●●●●●●●●●●●●●●●●●●●

●●●●

●●

●●●●●●●●●●

●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●

●●●●●●●● ●●

●●●●●●●●●●●●●●●●●●●

●●

●●●●●●●

●●

●●

●●

●●●●●

●●●●●●●●●●●●●●●●

●●

●●●

●●●●

●●●●●●●●●●●●●●●●●●

●●●●●●

●●●

●●●●●●●●●●●●●

●●●●●●●●●●

●●

●●●●

●●●●●●

●●

●●●

●●●●

●●

●●●●

●●

●●●●

●●

●●●●

●●

●●

●●

●●●

●●●●●●●

●●

●●

●●●●●●●●

●●●●●●●●

●●●●●●●●●●

●●●

●●●

●●

●●●

●●

●●●

●●●

●●

●●

●●

●●

●●

●●●●

●●●●

●●●

●●●

●●●

●●

●●●●

●●

●●●

●●●●●●●●●●●

●●●

●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●

●●●●●●●

●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●

●●

●● ●●●

●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●

●●●●

Line 9, female Line 9, male

Line 5, female Line 6, female

Line 3, female Line 3, male

Line 1, female Line 2, female

0.0 2.5 5.0 7.5 10.0 0.0 2.5 5.0 7.5 10.0

05

1015

05

1015

05

1015

05

1015

Chromosome 8 locus

●●●●

●●●●●●●●●●●●●●●●●●●●●

●●●●●●●

●●●●●●●●●●

●●●●●●●●●●●●●●

●●●●●●● ●●●● ●●

●●●●●●●

●●

●●●●●

●●

●●●●●

●●

●●

●●●●

●●●●●●●●●●

●●●●●●●●●

●●●●

●●

●●●●●

●●●

●●

●●

●●●

●●●

●●●

●●

●●●●

●●●●

●●●●●●

●●●●●●●

●●

●●

●●

●●

●●●●●

●●

●●

●●

●●●●●

●●

●●

●●●

●●

●●●●●●

●●●

●●●●

●●

●●●●

●●●●●●

●●●●●●●

●●●●●●●●

●●●●

●●●

●●

●●●●●●●●

●●●

●●●

●●●

●●●

●●●●●●●

●●●●

●●●●●●●●●●●●●

●●●●●●

●●●●●●●

●●●

●●●●

●●●●●●

●●●●

●●●

●●

●●●●●

●●●

●●●●

●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●

●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●

●●●●

●●

●●

●●

●●●●●●●●●●●●●●●

●●●●●

●●●●●●●●●●●●●●●●●●●●●●●●●●●●

●●●●●●●●●●●●●●

●●●●●

●●●

●●●

Line 6, female

240 245 250 255 26005

1015

Chromosome 1 locus

●●●●●●●●●●

●●

●●●●●●●●●●●●●●●●●

●●●●●●●●●●

●●●●●

●●●●

●●●●●●●●●●●●●●●●

●●●●●●●●●●●

●●

●●●●●●

●●

●●●●●●●●●●●●●●

●●●●

●●●●●●●●●●●●●●

●●●●●●●

●●●●●●●

●●

●●●●●●●

●●●●●●●●●●

●●●

●●●●

●●

●●●

●●●●

●●●●●●●●●●●●

●●●●●●●●

●●

●●

●●●●●

●●●

●●●●●

●●

●●

●●

●●●

●●

●●●●

●●●

●●●

●●●

●●●

●●●●●

●●

●●

●●

●●

● ●●●

Line 1, female

56 58 60 62

Chromosome 17 locus

Position (Mbp)

Neg

ative

loga

rithm

of p−v

alue

.CC-BY-NC-ND 4.0 International licenseauthor/funder. It is made available under aThe copyright holder for this preprint (which was not peer-reviewed) is the. https://doi.org/10.1101/2020.03.17.995969doi: bioRxiv preprint

Page 20: Genetics of recombination rate variation in the pig · 8 2 Department of Animal Breeding and Genetics, Swedish University of Agricultural 9 Sciences, Box 7023, 750 07 Uppsala, Sweden

Table 3. Genome-wide association study hits for average recombination, with position of the 375

lead SNP, additive genetic variance explained by the locus, and allele frequency of the allele 376

associated with higher recombination rate. 377

378

Chromosome Sex Line Lead SNP position Genetic variance explained

Allele frequency

1 female 6 252,547,401 0.10 0.57 8 female 1 2,253,270 0.08 0.90 8 female 2 75,256 0.60 0.53 8 female 3 226,298 0.41 0.70 8 male 3 226,298 0.44 0.74 8 female 5 259,617 0.07 0.27 8 female 6 259,617 0.12 0.74 8 female 9 75,256 0.14 0.12 8 male 9 1,283,621 0.22 0.41

17 female 1 59,968,884 0.16 0.78 379

Algorithm performance on synthetic data 380

381

We tested the accuracy of the estimated recombination by analysing a synthetic dataset. 382

Figure 8 shows the simulated and estimated map length, recombination landscape, and a 383

scatterplot of simulated and estimated numbers of recombinations per individual. Our method 384

slightly overestimated recombination rate when there was variable recombination along the 385

chromosome. Because of uncertainty in the location of recombinations, the estimated 386

recombination landscape did not track per-marker recombination rate variation very well (r = 387

0.59), but better captured the smoothed recombination landscape using a window of 50 388

markers (r = 0.86). The accuracy of individual-level estimates of recombination was higher 389

for dams (r = 0.72) than for sires (r = 0.55). 390

391

.CC-BY-NC-ND 4.0 International licenseauthor/funder. It is made available under aThe copyright holder for this preprint (which was not peer-reviewed) is the. https://doi.org/10.1101/2020.03.17.995969doi: bioRxiv preprint

Page 21: Genetics of recombination rate variation in the pig · 8 2 Department of Animal Breeding and Genetics, Swedish University of Agricultural 9 Sciences, Box 7023, 750 07 Uppsala, Sweden

392 Figure 8. Recombination rate estimation on simulated data. Cumulative number of 393

recombinations, recombination landscape along the simulated chromosome and the 394

correlation between true and estimated numbers of recombination in sires and dams. The 395

smoothed values are rolling averages of 50 markers. The red dashed line is the regression 396

line between true and estimated values. 397

.CC-BY-NC-ND 4.0 International licenseauthor/funder. It is made available under aThe copyright holder for this preprint (which was not peer-reviewed) is the. https://doi.org/10.1101/2020.03.17.995969doi: bioRxiv preprint

Page 22: Genetics of recombination rate variation in the pig · 8 2 Department of Animal Breeding and Genetics, Swedish University of Agricultural 9 Sciences, Box 7023, 750 07 Uppsala, Sweden

Discussion 398

399

In this paper, we estimated recombination rate variation within the genome and between 400

individuals in the pig using multiocus iterative peeling in nine genotyped pedigrees. 401

402

In this section, we discuss three main results: 403

(1) We confirm the known features of the pig recombination landscape, but not the 404

previously described correlation with the PRDM9 consensus motif. 405

(2) We show that recombination rate in the pig is genetically variable and associated with 406

alleles at the RNF212 gene. 407

(3) Multilocus iterative peeling is a compelling method for estimating recombination 408

landscapes from large genotyped pedigrees, but tends to overestimate genetic map 409

length. 410

411

Features of the pig recombination landscape 412

413

Our results recover known features of recombination in the pig, including the relative 414

chromosome lengths, and the marked sexual dimorphism. There are two notable exceptions, 415

where our estimates disagreed with previous results: we estimate overall longer genetic 416

lengths of chromosomes, and the correlations between recombination rate, density of the 417

PRDM9 consensus binding motif, and the density of some repeat classes are different than 418

estimated previously. 419

420

We estimated longer genetic maps than previous estimates for the pig. The total genetic map 421

lengths ranged from 18.5 to 21.7 Morgan for males and 22.3 to 25.9 Morgan for females. In 422

comparison, [1] found sex-specific map lengths of 17.8 and 17.5 Morgan for males, and 22.4 423

and 25.5 Morgan for females. This may be due to overestimation (see below), but also a 424

higher marker density and more complete use of the pedigree allowing us to detect more 425

recombinations. 426

427

The correlation between recombination and density of the PRDM9 consensus binding motif, 428

was lower than previous estimates. Because the PRDM9 protein determines the locations of a 429

subset of recombination hotspots, a positive correlation was expected. We detected only a 430

.CC-BY-NC-ND 4.0 International licenseauthor/funder. It is made available under aThe copyright holder for this preprint (which was not peer-reviewed) is the. https://doi.org/10.1101/2020.03.17.995969doi: bioRxiv preprint

Page 23: Genetics of recombination rate variation in the pig · 8 2 Department of Animal Breeding and Genetics, Swedish University of Agricultural 9 Sciences, Box 7023, 750 07 Uppsala, Sweden

weak positive correlation with PRDM9 consensus motif density and recombination, which 431

suggests that we lack the genomic resolution to detect variation at this scale. The 432

recombination rate landscape is the outcome of processes operating at a much smaller scale, 433

with hotspots of a few kilobasepairs (as estimated by population sequencing [3] or by high-434

density gamete genotyping [33]). There is more subtle local variation in recombination rate 435

that we cannot detect. 436

437

The associations between recombination and transposable element density were mixed, and 438

different for different types of transposable elements. The overall correlation between 439

recombination rate and repeats was negative, in line with estimates from other species [34]. 440

The negative correlation with LINEs was stronger than previously reported and the positive 441

correlation with simple repeats was weaker. One reason for these differences might be that 442

we used the more complete Sscrofa11.1 reference genome [26], which likely better resolves 443

the repeat landscape of the pig genome than the previous version. 444

445

446

Genetic variation in genome-wide recombination rate 447

448

Our results from the pig agree with the general picture of recombination rate variation in 449

vertebrates. The chromosome 8 locus is homologous to regions identified in humans [35–37], 450

cattle [7, 8, 10] , sheep [12, 13], and chickens [14]. It contains the RNF212 gene, a paralog of 451

which is also associated with recombination in deer [11]. The RNF212 protein binds to 452

recombination complexes, and is essential for crossover formation [38]. 453

454

While RNF212 is an obvious candidate gene, it is harder to find candidates for the other two 455

regions. We searched for the locations of candidate regions from other vertebrates, and 456

rapidly evolving recombination genes in mammals [39]. The chromosome 1 locus overlaps 457

SHOC1, one of the rapidly evolving recombination genes in mammals [39]. The closest 458

candidate recombination gene from the chromosome 17 locus is SPO11, associated with 459

recombination in chickens [14]. However, it is about two megabasepairs away from the most 460

significant marker. 461

462

.CC-BY-NC-ND 4.0 International licenseauthor/funder. It is made available under aThe copyright holder for this preprint (which was not peer-reviewed) is the. https://doi.org/10.1101/2020.03.17.995969doi: bioRxiv preprint

Page 24: Genetics of recombination rate variation in the pig · 8 2 Department of Animal Breeding and Genetics, Swedish University of Agricultural 9 Sciences, Box 7023, 750 07 Uppsala, Sweden

There are differences in recombination rate between lines, which may be due to fixed genetic 463

differences. Given that livestock populations have relatively small effective population sizes, 464

and assuming that recombination rate variation has a rather simple genetic architecture, line 465

differences in recombination rate might very well be due to genetic differences that have 466

fixed by chance. At the same times, all the lines showed evidence of comparable genetic 467

variation in recombination rate, and there was evidence that the major locus on chromosome 468

8 segregates in most lines. 469

470

A higher recombination rate could be beneficial for breeding, because it would reduce 471

linkage disequilibrium between causative variants and release genetic variance. Simulations 472

suggest that substantial increases in genome-wide recombination rate could increase genetic 473

gain [40]. We can approximate how much breeding could increase recombination rate based 474

on our results. 475

476

First, we can use the Breeder’s equation to predict the response to selection, treating genome-477

wide recombination as a quantitative trait. The response is the heritability multiplied by the 478

selection differential S, which is the difference between population mean 𝜇 and mean of the 479

selected individuals 𝜇#$%$&'$( . 480

481

𝑅 = ℎ,𝑆 = ℎ,(𝜇#$%$&'$( − 𝜇) 482

483

Using distribution of genome-wide recombination rates from the males of the largest line, the 484

mean were 0.904 cM/Mbp. If we were to select the 10%, 20% or 30% highest recombination 485

individuals, the mean of the selected individuals would be 1.22 cM/Mbp, 1.15 cM/Mbp, and 486

1.11 cM/Mbp respectively. Assuming a heritability of 0.05, comparable to our estimated 487

genomic heritability, this would result in responses of: 488

489

𝑅23% = 0.05 ∙ (1.22 − 0.904) = 0.016cM/Mbp 490

𝑅,3% = 0.012cM/Mbp 491

𝑅C3% = 0.010cM/Mbp 492

493

Relative to the average recombination rate, that would mean increases of 1.7%, 1.3% and 494

1.1%, respectively. 495

.CC-BY-NC-ND 4.0 International licenseauthor/funder. It is made available under aThe copyright holder for this preprint (which was not peer-reviewed) is the. https://doi.org/10.1101/2020.03.17.995969doi: bioRxiv preprint

Page 25: Genetics of recombination rate variation in the pig · 8 2 Department of Animal Breeding and Genetics, Swedish University of Agricultural 9 Sciences, Box 7023, 750 07 Uppsala, Sweden

496

Second, we concentrate on the major locus on chromosome 8 that we detected in most of the 497

lines, and approximate the increase in recombination rate that could be achieved if this locus 498

was fixed for the high recombination allele. Again, using estimates from the largest line, the 499

additive effect a of the chromosome 8 locus was estimated to be 0.0271 cM/Mbp (averaging 500

the male and female estimates), and the frequency f of the high recombination allele was 501

0.332 (weighted average of males and females). The increase in the mean of the population 502

by fixing the chromosome 8 locus would be: 503

504

𝑑 = 𝑎(1 − 𝑓) = 0.0271 ∙ (1 − 0.332) = 0.018cM/Mbp 505

506

That is, it would increase genome-wide recombination rate by about 2%. 507

508

Compared to the simulation results of [40], which suggest that a doubling or more of 509

genome-wide recombination rate would lead to substantial genetic gains, these results 510

suggest that breeding for higher genome-wide recombination rate is not a practical alternative 511

for improving genetic gain. There may be other potential avenues, such as introducing 512

targeted recombinations in favourable locations [41] by biotechnological means. 513

514

515

Recombination rate inference by multilocus peeling 516

517

Throughout this paper we have used multilocus iterative peeling to estimate recombination 518

rate. In our simulation study, we found that multilocus iterative peeling could estimate the 519

number of recombinations per individual with an accuracy of 0.7 for dams and 0.5 for sires, 520

and the average recombination landscape along a chromosome. This is consistent with our 521

analysis of the pig genome, where we confirm previously known features of the pig 522

recombination landscape. However, the simulation results also show that we overestimated 523

the total genetic map length, consistent with our comparisons between the estimate 524

recombination rate and previously published estimates [1]. 525

526

Multilocus iterative peeling presents a compelling technique for estimating recombination 527

rate in large pedigree populations: it scales well to massive livestock pedigrees (more than 528

.CC-BY-NC-ND 4.0 International licenseauthor/funder. It is made available under aThe copyright holder for this preprint (which was not peer-reviewed) is the. https://doi.org/10.1101/2020.03.17.995969doi: bioRxiv preprint

Page 26: Genetics of recombination rate variation in the pig · 8 2 Department of Animal Breeding and Genetics, Swedish University of Agricultural 9 Sciences, Box 7023, 750 07 Uppsala, Sweden

150,000 individuals), does not require pre-phasing of the data, and handles individuals 529

genotyped on range of platforms without requiring non-overlapping variants to be imputed 530

beforehand. 531

532

The primary downside is that multilocus iterative peeling requires multiple generations of 533

genotyped individuals to be available to accurately phase, impute, and estimate the 534

recombination rate. Although this information may be available in pig or chicken breeding 535

programmes [23, 42], and some wild populations [12] it may not be available in all 536

populations. In addition to this the overestimation of genetic map length suggests that the 537

exact genetic map lengths and counts of recombination for a specific individual may not be 538

accurate, but it is able to recover broad patterns in recombination between chromosomes and 539

between individuals. 540

541

Conclusion 542

543

In this paper we analyse 150,000 individuals from nine pig pedigrees. We find that we are 544

able to recover broad-scale patterns in the total genetic map length, recombination landscape, 545

and sex differences in recombination rates. In addition to this, we found that recombination 546

rate had low, but non-zero heritability, and a genome-wide association study detected three 547

regions associated with recombination rate. This paper highlights the ability to use large scale 548

pedigree and genomic data, as is routinely collected in many closely managed populations to 549

infer and understand recombination and recombination rate variation. 550

551

Declarations 552

553

Ethics approval and consent to participate 554

The samples used in this study were derived from the routine breeding activities of PIC. 555

Consent for publication 556

Not applicable. 557

Availability of data and materials 558

The datasets generated and analysed in this study are derived from the PIC breeding 559

programme and not publicly available. 560

.CC-BY-NC-ND 4.0 International licenseauthor/funder. It is made available under aThe copyright holder for this preprint (which was not peer-reviewed) is the. https://doi.org/10.1101/2020.03.17.995969doi: bioRxiv preprint

Page 27: Genetics of recombination rate variation in the pig · 8 2 Department of Animal Breeding and Genetics, Swedish University of Agricultural 9 Sciences, Box 7023, 750 07 Uppsala, Sweden

Competing interests 561

The authors declare that they have no competing interests. 562

Funding 563

The authors acknowledge the financial support from the BBSRC ISPG to The Roslin Institute 564

BBS/E/D/30002275, from Grant Nos. BB/N015339/1, BB/L020467/1, BB/M009254/1, from 565

Genus PLC, Innovate UK, and from the Swedish Research Council Formas Dnr 2016-01386. 566

Author’s contributions 567

JMH, MJ, AW and GG conceived the study. MJ, AW, RRF and CC analysed data. WH and 568

DdK helped interpret the results. MJ, AW and JMH wrote the paper. All authors read and 569

approved the final manuscript. 570

Acknowledgements 571

This work has made use of the resources provided by the Edinburgh Compute and Data 572

Facility (ECDF) (http://www.ecdf.ed.ac.uk). 573

574

References 575

576

1. Tortereau F, Servin B, Frantz L, Megens H-J, Milan D, Rohrer G, et al. A high density 577

recombination map of the pig reveals a correlation between sex-specific recombination and 578

GC content. BMC genomics. 2012;13:586. 579

2. McVean GA, Myers SR, Hunt S, Deloukas P, Bentley DR, Donnelly P. The fine-scale 580

structure of recombination rate variation in the human genome. Science. 2004;304:581–4. 581

3. Myers S, Bottolo L, Freeman C, McVean G, Donnelly P. A fine-scale map of 582

recombination rates and hotspots across the human genome. Science. 2005;310:321–4. 583

4. Kong A, Gudbjartsson DF, Sainz J, Jonsdottir GM, Gudjonsson SA, Richardsson B, et al. 584

A high-resolution recombination map of the human genome. Nature genetics. 2002;31:241. 585

5. Fledel-Alon A, Leffler EM, Guan Y, Stephens M, Coop G, Przeworski M. Variation in 586

human recombination rates and its genetic determinants. PloS one. 2011;6:e20321. 587

6. Kong A, Barnard J, Gudbjartsson DF, Thorleifsson G, Jonsdottir G, Sigurdardottir S, et al. 588

Recombination rate and reproductive success in humans. Nature genetics. 2004;36:1203. 589

.CC-BY-NC-ND 4.0 International licenseauthor/funder. It is made available under aThe copyright holder for this preprint (which was not peer-reviewed) is the. https://doi.org/10.1101/2020.03.17.995969doi: bioRxiv preprint

Page 28: Genetics of recombination rate variation in the pig · 8 2 Department of Animal Breeding and Genetics, Swedish University of Agricultural 9 Sciences, Box 7023, 750 07 Uppsala, Sweden

7. Kadri NK, Harland C, Faux P, Cambisano N, Karim L, Coppieters W, et al. Coding and 590

noncoding variants in HFM1, MLH3, MSH4, MSH5, RNF212, and RNF212B affect 591

recombination rate in cattle. Genome research. 2016;26:1323–1332. 592

8. Sandor C, Li W, Coppieters W, Druet T, Charlier C, Georges M. Genetic variants in 593

REC8, RNF212, and PRDM9 influence male recombination in cattle. PLoS genetics. 594

2012;8:e1002854. 595

9. Simianer H, Szyda J, Ramon G, Lien S. Evidence for individual and between-family 596

variability of the recombination rate in cattle. Mammalian genome. 1997;8:830–835. 597

10. Ma L, O’Connell JR, VanRaden PM, Shen B, Padhi A, Sun C, et al. Cattle sex-specific 598

recombination and genetic control from a large pedigree analysis. PLoS genetics. 599

2015;11:e1005387. 600

11. Johnston SE, Huisman J, Pemberton JM. A genomic region containing REC8 and 601

RNF212B is associated with individual recombination rate variation in a wild population of 602

red deer (Cervus elaphus). G3: Genes, Genomes, Genetics. 2018;8:2265–76. 603

12. Johnston SE, Bérénos C, Slate J, Pemberton JM. Conserved genetic architecture 604

underlying individual recombination rate variation in a wild population of Soay sheep (Ovis 605

aries). Genetics. 2016;203:583–98. 606

13. Petit M, Astruc J-M, Sarry J, Drouilhet L, Fabre S, Moreno CR, et al. Variation in 607

recombination rate and its genetic determinism in sheep populations. Genetics. 608

2017;207:767–84. 609

14. Weng Z, Wolc A, Su H, Fernando RL, Dekkers JC, Arango J, et al. Identification of 610

recombination hotspots and quantitative trait loci for recombination rate in layer chickens. 611

Journal of animal science and biotechnology. 2019;10:20. 612

15. Dapper AL, Payseur BA. Connecting theory and data to understand recombination rate 613

evolution. Philosophical Transactions of the Royal Society B: Biological Sciences. 614

2017;372:20160469. 615

.CC-BY-NC-ND 4.0 International licenseauthor/funder. It is made available under aThe copyright holder for this preprint (which was not peer-reviewed) is the. https://doi.org/10.1101/2020.03.17.995969doi: bioRxiv preprint

Page 29: Genetics of recombination rate variation in the pig · 8 2 Department of Animal Breeding and Genetics, Swedish University of Agricultural 9 Sciences, Box 7023, 750 07 Uppsala, Sweden

16. Stapley J, Feulner PG, Johnston SE, Santure AW, Smadja CM. Variation in 616

recombination frequency and distribution across eukaryotes: patterns and processes. 617

Philosophical Transactions of the Royal Society B: Biological Sciences. 2017;372:20160455. 618

17. Broman KW, Murray JC, Sheffield VC, White RL, Weber JL. Comprehensive human 619

genetic maps: individual and sex-specific variation in recombination. The American Journal 620

of Human Genetics. 1998;63:861–9. 621

18. Sturtevant AH. The linear arrangement of six sex-linked factors in Drosophila, as shown 622

by their mode of association. Journal of experimental zoology. 1913;14:43–59. 623

19. Coop G, Wen X, Ober C, Pritchard JK, Przeworski M. High-resolution mapping of 624

crossovers reveals extensive variation in fine-scale recombination patterns among humans. 625

science. 2008;319:1395–8. 626

20. Weng Z-Q, Saatchi M, Schnabel RD, Taylor JF, Garrick DJ. Recombination locations 627

and rates in beef cattle assessed from parent-offspring pairs. Genetics Selection Evolution. 628

2014;46:34. 629

21. Segura J, Ferretti L, Ramos-Onsins S, Capilla L, Farré M, Reis F, et al. Evolution of 630

recombination in eutherian mammals: insights into mechanisms that affect recombination 631

rates and crossover interference. Proceedings of the Royal Society B: Biological Sciences. 632

2013;280:20131945. 633

22. Mary N, Barasc H, Ferchaud S, Billon Y, Meslier F, Robelin D, et al. Meiotic 634

recombination analyses of individual chromosomes in male domestic pigs (Sus scrofa 635

domestica). PLoS One. 2014;9:e99123. 636

23. Whalen A, Ros-Freixedes R, Wilson DL, Gorjanc G, Hickey JM. Hybrid peeling for fast 637

and accurate calling, phasing, and imputation with sequence data of any coverage in 638

pedigrees. Genetics Selection Evolution. 2018;50:67. 639

24. Meuwissen T, Goddard M. The use of family relationships and linkage disequilibrium to 640

impute phase and missing genotypes in up to whole-genome sequence density genotypic data. 641

Genetics. 2010;185:1441–9. 642

.CC-BY-NC-ND 4.0 International licenseauthor/funder. It is made available under aThe copyright holder for this preprint (which was not peer-reviewed) is the. https://doi.org/10.1101/2020.03.17.995969doi: bioRxiv preprint

Page 30: Genetics of recombination rate variation in the pig · 8 2 Department of Animal Breeding and Genetics, Swedish University of Agricultural 9 Sciences, Box 7023, 750 07 Uppsala, Sweden

25. Elston RC, Stewart J. A general model for the genetic analysis of pedigree data. Human 643

heredity. 1971;21:523–42. 644

26. Warr A, Affara N, Aken B, Beiki H, Bickhart DM, Billis K, et al. An improved pig 645

reference genome sequence to enable pig genetics and genomics research. bioRxiv. 646

2019;:668921. 647

27. Myers S, Freeman C, Auton A, Donnelly P, McVean G. A common sequence motif 648

associated with recombination hot spots and genome instability in humans. Nature genetics. 649

2008;40:1124. 650

28. Bao W, Kojima KK, Kohany O. Repbase Update, a database of repetitive elements in 651

eukaryotic genomes. Mobile DNA. 2015;6:11. 652

29. Hansen-Melander E, Melander Y. The karyotype of the pig. Hereditas. 1974;77:149–158. 653

30. Hadfield JD. MCMC methods for multi-response generalized linear mixed models: the 654

MCMCglmm R package. Journal of Statistical Software. 2010;33:1–22. 655

31. Gelman A. Prior distributions for variance parameters in hierarchical models (comment 656

on article by Browne and Draper). Bayesian analysis. 2006;1:515–34. 657

32. Rönnegård L, McFarlane SE, Husby A, Kawakami T, Ellegren H, Qvarnström A. 658

Increasing the power of genome wide association studies in natural populations using 659

repeated measures–evaluation and implementation. Methods in ecology and evolution. 660

2016;7:792–9. 661

33. Jeffreys AJ, Holloway JK, Kauppi L, May CA, Neumann R, Slingsby MT, et al. Meiotic 662

recombination hot spots and human DNA diversity. Philosophical Transactions of the Royal 663

Society of London Series B: Biological Sciences. 2004;359:141–52. 664

34. Kent TV, Uzunović J, Wright SI. Coevolution between transposable elements and 665

recombination. Philosophical Transactions of the Royal Society B: Biological Sciences. 666

2017;372:20160458. 667

35. Chowdhury R, Bois PR, Feingold E, Sherman SL, Cheung VG. Genetic analysis of 668

variation in human meiotic recombination. PLoS genetics. 2009;5:e1000648. 669

.CC-BY-NC-ND 4.0 International licenseauthor/funder. It is made available under aThe copyright holder for this preprint (which was not peer-reviewed) is the. https://doi.org/10.1101/2020.03.17.995969doi: bioRxiv preprint

Page 31: Genetics of recombination rate variation in the pig · 8 2 Department of Animal Breeding and Genetics, Swedish University of Agricultural 9 Sciences, Box 7023, 750 07 Uppsala, Sweden

36. Kong A, Thorleifsson G, Stefansson H, Masson G, Helgason A, Gudbjartsson DF, et al. 670

Sequence variants in the RNF212 gene associate with genome-wide recombination rate. 671

Science. 2008;319:1398–401. 672

37. Kong A, Thorleifsson G, Frigge ML, Masson G, Gudbjartsson DF, Villemoes R, et al. 673

Common and low-frequency variants associated with genome-wide recombination rate. 674

Nature genetics. 2014;46:11. 675

38. Reynolds A, Qiao H, Yang Y, Chen JK, Jackson N, Biswas K, et al. RNF212 is a dosage-676

sensitive regulator of crossing-over during mammalian meiosis. Nature genetics. 677

2013;45:269. 678

39. Dapper AL, Payseur BA. Molecular evolution of the meiotic recombination pathway in 679

mammals. Evolution. 2019. 680

40. Battagin M, Gorjanc G, Faux A-M, Johnston SE, Hickey JM. Effect of manipulating 681

recombination rates on response to selection in livestock breeding programs. Genetics 682

Selection Evolution. 2016;48:44. 683

41. Bernardo R. Prospective targeted recombination and genetic gains for quantitative traits 684

in maize. The plant genome. 2017;10. 685

42. Hickey JM, Kranis A. Extending long-range phasing and haplotype library imputation 686

methods to impute genotypes on sex chromosomes. Genetics Selection Evolution. 687

2013;45:10. 688

689

.CC-BY-NC-ND 4.0 International licenseauthor/funder. It is made available under aThe copyright holder for this preprint (which was not peer-reviewed) is the. https://doi.org/10.1101/2020.03.17.995969doi: bioRxiv preprint