Generalized Additive Models

Overview

What is a GAM?
What is smoothing?
How do GAMs work?
Fitting GAMs using dsm

What is a GAM?

"gam"
1. Collective noun used to refer to a group of whales, or rarely also of porpoises; a pod.
2. (by extension) A social gathering of whalers (whaling ships).

(via Natalie Kelly, AAD. Seen in Moby Dick.)

Generalized Additive Models

Generalized: many response distributions
Additive: terms add together
Models: well, it's a model…

What does a model look like?

Count $n_j$ distributed according to some count distribution
Model as sum of terms

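Mathematically...

Taking the previous example…

$$n_j = A_j \hat{p}_j \exp\left[\beta_0 + s(y_j) + s(\text{Depth}_j)\right] + \epsilon_j$$

where $\epsilon_j \sim N(0, \sigma^2)$, $n_j \sim$ count distribution

$A_j$: area of segment (offset)
$\hat{p}_j$: probability of detection in segment
$\exp$: link function (inverse of the log link)
$s(y_j)$, $s(\text{Depth}_j)$: model terms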

Response

$$n_j = A_j \hat{p}_j \exp\left[\beta_0 + s(y_j) + s(\text{Depth}_j)\right] + \epsilon_j$$

where $\epsilon_j \sim N(0, \sigma^2)$, $n_j \sim$ count distribution

Count distributions

Response is a count (though not always an integer)
Often, it's mostly zero (that's complicated)
Want response distribution that deals with that
Flexible mean-variance relationship

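Tweedie distribution

$\text{Var}(\text{count}) = \phi \text{E}(\text{count})^q$
Common distributions are sub-cases:
  $q = 1 \Rightarrow$ Poisson
  $q = 2 \Rightarrow$ gamma
  $q = 3 \Rightarrow$ inverse Gaussian
  (the normal distribution is the $q = 0$ case)
We are interested in $1 < q < 2$ (here $q = 1.2, 1.3, \ldots, 1.9$)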
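A minimal sketch of the Tweedie mean-variance relationship, tabulating the variance for a few expected counts over the range of $q$ we care about. The function name `tweedie_var` and the value `phi = 1` are illustrative assumptions, not taken from the slides.

```r
# Mean-variance relationship: Var(count) = phi * E(count)^q
# phi = 1 is an arbitrary illustrative value, not an estimate.
tweedie_var <- function(mu, q, phi = 1) {
  phi * mu^q
}

mu <- c(0.5, 1, 2, 5, 10)            # a few expected counts
q_values <- seq(1.2, 1.9, by = 0.1)  # the powers of interest, 1 < q < 2

# One row per expected count, one column per power parameter q
var_table <- sapply(q_values, function(q) tweedie_var(mu, q))
dimnames(var_table) <- list(paste0("mu=", mu), paste0("q=", q_values))
print(round(var_table, 2))
```

For expected counts above 1 the variance grows with $q$; for expected counts below 1 it shrinks, which is part of what makes the relationship flexible.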

Negative binomial distribution

$\text{Var}(\text{count}) = \text{E}(\text{count}) + \kappa \text{E}(\text{count})^2$
Estimate $\kappa$
Is quadratic relationship a "strong" assumption?
Similar to Poisson: $\text{Var}(\text{count}) = \text{E}(\text{count})$
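To see the quadratic relationship in action, here is a small simulation sketch comparing negative binomial and Poisson counts with the same mean. The values of `mu` and `kappa` are made up for illustration; base R's `rnbinom()` uses a `size` parameter, which corresponds to $1/\kappa$ when the mean is given via `mu`.

```r
set.seed(1)

mu <- 3        # expected count (illustrative)
kappa <- 0.5   # hypothetical overdispersion parameter

# rnbinom with mu/size has Var = mu + mu^2 / size, so size = 1 / kappa
n_nb   <- rnbinom(1e5, mu = mu, size = 1 / kappa)
n_pois <- rpois(1e5, lambda = mu)

# Theoretical variances next to what the simulated counts give
print(c(theory_nb   = mu + kappa * mu^2,
        sample_nb   = var(n_nb),
        theory_pois = mu,
        sample_pois = var(n_pois)))
```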

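Smooth terms

$$n_j = A_j \hat{p}_j \exp\left[\beta_0 + s(y_j) + s(\text{Depth}_j)\right] + \epsilon_j$$

where $\epsilon_j \sim N(0, \sigma^2)$, $n_j \sim$ count distribution

Okay, but what about these "s" things? Think $s$ = smooth
Want to model the covariates flexibly
Covariates and response not necessarily linearly related!
Want some wiggles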

What is smoothing?

Straight lines vs. interpolation

Want a line that is "close" to all the data
Don't want interpolation; we know there is "error"
Balance between interpolation and "fit"

Splines

Functions made of other, simpler functions
Basis functions $b_k$, estimate $\beta_k$
$s(x) = \sum_{k=1}^K \beta_k b_k(x)$
Makes the maths much easier
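A sketch of "functions made of simpler functions", using the B-spline basis from the `splines` package that ships with R. This is a swapped-in basis for illustration (mgcv's default is thin plate regression splines), and the coefficients `beta` are invented rather than estimated.

```r
library(splines)   # part of the standard R distribution

x <- seq(0, 1, length.out = 200)
K <- 6

# 200 x K matrix: column k holds b_k(x) evaluated at every x
B <- bs(x, df = K)

# Hypothetical coefficients beta_k; in a real fit these are estimated
beta <- c(0.5, -1, 2, 0.3, -0.7, 1.2)

# s(x) = sum_k beta_k b_k(x), computed as a matrix-vector product
s_x <- as.vector(B %*% beta)

# Dashed: the individual basis functions; solid: the resulting smooth
matplot(x, B, type = "l", lty = 2, ylab = "basis functions and s(x)")
lines(x, s_x, lwd = 2)
```

Because $s(x)$ is linear in the $\beta_k$, fitting a smooth looks like fitting a (generalized) linear model, which is why the maths gets easier.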

Measuring wigglyness

Visually:
  Lots of wiggles == NOT SMOOTH
  Straight line == VERY SMOOTH
How do we do this mathematically?
  Derivatives!
  (Calculus was a useful class after all)

Wigglyness by derivatives

Making wigglyness matter

Integration of derivative (squared) gives wigglyness
Fit needs to be penalised
Penalty matrix gives the wigglyness
Estimate the $\beta_k$ terms but penalise objective
  "closeness to data" + penalty

Penalty matrix

For each $b_k$ calculate the penalty
Penalty is a function of $\beta$: $\lambda \beta^\text{T} S \beta$
$S$ calculated once
Smoothing parameter ($\lambda$) dictates influence
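A rough sketch of "closeness to data + penalty" for a Gaussian response, using a second-order difference penalty on B-spline coefficients (the P-spline idea) as a stand-in for the integrated squared derivative. The data are simulated and the names `fit_pen`, `X`, `S` are illustrative; mgcv's own bases and penalties are built differently.

```r
library(splines)

set.seed(2)
x <- seq(0, 1, length.out = 100)
y <- sin(2 * pi * x) + rnorm(100, sd = 0.3)   # simulated data

K <- 20
# Basis matrix; unclass() leaves a plain numeric matrix
X <- unclass(bs(x, df = K, intercept = TRUE))

# Penalty matrix S = D'D from second differences; calculated once
D <- diff(diag(K), differences = 2)
S <- crossprod(D)

# Minimise ||y - X beta||^2 + lambda * beta' S beta
fit_pen <- function(lambda) {
  beta <- solve(crossprod(X) + lambda * S, crossprod(X, y))
  list(beta = beta,
       fitted = as.vector(X %*% beta),
       wigglyness = as.numeric(t(beta) %*% S %*% beta))
}

for (lambda in c(0.001, 1, 1000)) {
  f <- fit_pen(lambda)
  cat(sprintf("lambda = %g: beta' S beta = %.4f\n", lambda, f$wigglyness))
}
```

Larger `lambda` gives the penalty more influence, so the fitted coefficients become less wiggly; `S` itself never changes.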

Smoothing parameter

How wiggly are things?

We can set basis complexity or "size" ($k$)
  Maximum wigglyness
Smooths have effective degrees of freedom (EDF)
Set $k$ "large enough"
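One way to see how basis size and EDF relate: for a penalised least-squares fit, the EDF can be taken as the trace of the hat matrix. The sketch below assumes a B-spline basis with a second-difference penalty (not mgcv's default basis) and a Gaussian response; the function name `edf` is illustrative.

```r
library(splines)

x <- seq(0, 1, length.out = 100)
K <- 20                                        # basis size: the maximum EDF
X <- unclass(bs(x, df = K, intercept = TRUE))  # plain numeric basis matrix
S <- crossprod(diff(diag(K), differences = 2)) # second-difference penalty

# EDF = trace of the hat matrix X (X'X + lambda S)^{-1} X',
# computed as trace of (X'X + lambda S)^{-1} X'X
edf <- function(lambda) {
  sum(diag(solve(crossprod(X) + lambda * S, crossprod(X))))
}

print(sapply(c(0, 0.01, 1, 100, 1e6), edf))
```

With no penalty the EDF equals the basis size; as the smoothing parameter grows the EDF falls towards 2, the straight line this particular penalty leaves unpenalised. So the basis size only needs to be large enough not to cap the wigglyness the data ask for.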

Why GAMs are cool...

Fancy smooths (cyclic, boundaries, …)
Fancy responses (exp family and beyond!)
Random effects (by equivalence)
Markov random fields
Correlation structures
See Wood (2006/2017) for a handy intro

Okay, that was a lot of theory...

Example data

Sperm whales off the US east coast

Hang out near canyons, eat squid
Surveys in 2004, US east coast
Combination of data from 2 NOAA cruises

Thanks to Debi Palka (NOAA NEFSC), Lance Garrison (NOAA SEFSC) for data. Jason Roberts (Duke University) for data prep.

Model formulation

Pure spatial, pure environmental, mixed?
May have some prior knowledge
  Biology/ecology
What are drivers of distribution?
Inferential aim
  Abundance
  Ecology

Fitting GAMs using dsm

Translating maths into R

$$n_j = A_j \hat{p}_j \exp\left[\beta_0 + s(y_j)\right] + \epsilon_j$$

where $\epsilon_j \sim N(0, \sigma^2)$, $n_j \sim$ count distribution

inside the link: formula=count ~ s(y)
response distribution: family=nb() or family=tw()
detectability: ddf.obj=df_hr
offset, data: segment.data=segs, observation.data=obs
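To make the role of the offset concrete, here is a sketch of the same model structure written directly in mgcv on simulated data. Everything in it (`segs_sim`, the constant `p_hat`, the true smooth, the negative binomial `size`) is invented for illustration; it shows the idea of putting $\log(A_j \hat{p}_j)$ into the linear predictor, not what dsm does internally.

```r
library(mgcv)

set.seed(3)
n_seg <- 300
segs_sim <- data.frame(
  y     = runif(n_seg),         # covariate, e.g. a scaled northing
  area  = runif(n_seg, 5, 15),  # A_j, segment area
  p_hat = 0.6                   # detection probability, held constant here
)

# True linear predictor on the link (log) scale
eta <- -1 + sin(2 * pi * segs_sim$y)
segs_sim$count <- rnbinom(n_seg,
                          mu = segs_sim$area * segs_sim$p_hat * exp(eta),
                          size = 2)

# A_j * p_hat_j multiplies exp(...), so it enters as a log offset
m <- gam(count ~ s(y) + offset(log(area * p_hat)),
         family = nb(), method = "REML", data = segs_sim)
print(summary(m))
```

The offset has no coefficient to estimate: it simply rescales the expected count by the effectively surveyed area of each segment.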

Your first DSM

    library(dsm)
    dsm_x_tw <- dsm(count~s(x), ddf.obj=df,
                    segment.data=segs, observation.data=obs,
                    family=tw())

dsm is based on mgcv by Simon Wood

What did that do?

    summary(dsm_x_tw)

    Family: Tweedie(p=1.326)
    Link function: log

    Formula:
    count ~ s(x) + offset(off.set)

    Parametric coefficients:
                Estimate Std. Error t value Pr(>|t|)
    (Intercept) -19.8115     0.2277  -87.01   <2e-16 ***
    ---
    Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

    Approximate significance of smooth terms:
           edf Ref.df     F  p-value
    s(x) 4.962  6.047 6.403 1.07e-06 ***
    ---
    Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

    R-sq.(adj) =  0.0283   Deviance explained = 17.7%
    -REML = 409.94  Scale est. = 6.0413    n = 949

Plotting

    plot(dsm_x_tw)

Dashed lines indicate +/- 2 standard errors
Rug plot
On the link scale
EDF on y axis

Adding a term

Just use +

    dsm_xy_tw <- dsm(count ~ s(x) + s(y),
                     ddf.obj=df,
                     segment.data=segs, observation.data=obs,
                     family=tw())

Summary

    summary(dsm_xy_tw)

    Formula:
    count ~ s(x) + s(y) + offset(off.set)

    Approximate significance of smooth terms:
           edf Ref.df     F  p-value
    s(x) 4.943  6.057 3.224 0.004239 **
    s(y) 5.293  6.420 4.034 0.000322 ***
    ---
    Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Plotting

    plot(dsm_xy_tw, pages=1)

scale=0: each plot on different scale
pages=1: plot together

Bivariate terms

Assumed an additive structure
No interaction
We can specify s(x,y) (and s(x,y,z,...))

Thin plate regression splines

Default basis
One basis function per data point
Reduce # basis functions (eigendecomposition)
Fitting on reduced problem
Multidimensional

Thin plate splines (2-D)

Bivariate spatial term

    dsm_xyb_tw <- dsm(count ~ s(x, y),
                      ddf.obj=df,
                      segment.data=segs, observation.data=obs,
                      family=tw())

Summary

    summary(dsm_xyb_tw)

    Formula:
    count ~ s(x, y) + offset(off.set)

    Approximate significance of smooth terms:
             edf Ref.df     F  p-value
    s(x,y) 16.89  21.12 4.333 3.73e-10 ***
    ---
    Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Plotting... erm...

    plot(dsm_xyb_tw)

Let's try something different

    plot(dsm_xyb_tw, select=1, scheme=2, asp=1)

Still on link scale
too.far excludes points far from data

Comparing bivariate and additive models

Let's have a go...
