16
Real life for Big Data: what is data science about? Irina Muhina , PhD in AI with 25 years practical experience, Big Data and STEM Expert, Founder of iECARUS, President of ERUDITE school iECARUS is your concierge for educational intelligence. www.iecarus.com February, 2016

Data science York_University _2016

  • Upload
    iecarus

  • View
    474

  • Download
    2

Embed Size (px)

Citation preview

Page 1: Data science York_University _2016

Real life for Big Data: what is data science about?

Irina Muhina,

PhD in AI with 25 years practical experience,

Big Data and STEM Expert, Founder of iECARUS,

President of ERUDITE school

iECARUS is your concierge for educational intelligence.

www.iecarus.com

February, 2016

Page 2: Data science York_University _2016

Agenda

• What is the Big Data ? • What are the real life dimensions for Big Data ?

- return on investment (ROI)

- amount of real-time data

- demand for data scientists job and average compensation packages

- expectations for the data scientist

• Case studies and tools using Big Data examples from industries: – Trading strategy analysis – Parametric and distribution analysis – Two-regimes risk model – Correlation analysis with different cut-off

– Optimization models with re-sampling •How to use Big Data for STEM ?

Page 3: Data science York_University _2016

What is Big Data ? Gartner model

Page 4: Data science York_University _2016
Page 5: Data science York_University _2016
Page 6: Data science York_University _2016
Page 7: Data science York_University _2016

Who is a Data Scientist ? • Works more closely with multiple teams when compared to

statisticians • always expected to work with types of big data — operational

technology, text, streaming • Combinations of mathematics, statistics, machine learning and

algorithmic processing • Demand for communication skills much more frequently than BI

or statistics roles • Have to be able to code, write and present well

Current roles:

• Solution architect • Business analyst • Requirements analyst • Data modeler •Data integration lead •Data integration developer •Report writer •BI platform lead •Database administrator •User trainer •Data steward

Page 8: Data science York_University _2016

Real projects using Big Data

case studies and tools from industries

• Trading strategy case study • Parametric and distribution case study • Two-regimes risk model case study • Correlation analysis with different cut-off • Optimization models with re-sampling

Analytical Tools

Excel, SAS, SPSS, R , SQL, Tableau, MatLab, Watson , Hadoop

Page 9: Data science York_University _2016
Page 10: Data science York_University _2016

Daily price crossing 50D EMA of ACWI seems to be a good strategy

Price crosses EMA from below, go overweight Price crosses EMA from above, go underweight

Different trading strategies analysis

Trade benefit VS Trade length Bad trades tend to be very short, i.e. occur when the model is switching between overweight and underweight rapidly

Page 11: Data science York_University _2016
Page 12: Data science York_University _2016
Page 13: Data science York_University _2016
Page 14: Data science York_University _2016
Page 15: Data science York_University _2016

Expand your analytic capabilities Gartner

Page 16: Data science York_University _2016

• How to use Big Data for STEM ? Emerging Role of the Data Scientist the Art of Data Science for IT,

business The Birth of Infonomics, the New Economics of Information