ThinkFast: Scaling Machine Learning to Modern Demands

Hristo Paskov

The Genomic Data Deluge

• Precision Medicine Initiative: sequence 1,000,000 genomes– $215 Million in 2015 – Pilot study– Outputs 10-50 GB/person

How do we analyze all of this data to drive progress?

Massive Data Sources

NewseCommerce

Bioinformatics

100K Genomes

Social Media

The Analysis Refinement Cycle

Data12‖𝑦−𝑋𝑤‖2

2+𝜆2‖𝑤‖2

Model𝑥+¿=𝑥−𝛼𝑀𝛻 𝑓 (𝑥 ) ¿

SolverModel captures data

nuance?

Solver exists, is fast

enough?

More Than Just Training Models

• Regularization paths• Model risk assessment• Interpretability

el Coefficien

Regularization Parameter

Brief History of Statistical Learning

Interpretability & Statistical Guarantees

ScalabilityEase of Use

Simple Models

Kernel Methods

Trees & Ensembles

Structured Regularization

Losses

RegressionClassification

RankingMotif Finding

Matrix FactorizationFeature EmbeddingData Imputation

Regularizers

SparsitySpatial/ Temporal / Manifold StructureGroup Structure

Hierarchical StructureStructured & Unstructured

Multitask Learning…

min𝛽∈ℝ𝑑

𝐿 ( 𝑋 𝛽 )+𝜆𝑅 (𝛽 )

The Lasso’s Combinatorial Side

el Coe

fficien

The Database Perspective

Feature & label storage

Data access operations

ML “Query Language” min𝛽∈ℝ𝑑

𝐿 ( 𝑦− 𝑋 𝛽 )+𝜆‖𝛽‖1

min𝛽1 ,𝛽2 ,𝛽3∈ℝ

𝑑∑𝑡=1

[𝐿𝑡 (𝑦𝑡−𝑋 𝑡 𝛽𝑡 )+𝜆𝑡𝑅𝑡 (𝛽𝑡 ) ]+𝜔‖[ 𝛽1 𝛽2 𝛽3 ]‖∗

Feature, label and model storage

ML “Query Language” min𝛽∈ℝ𝑑

𝐿 ( 𝑦− 𝑋 𝛽 )+𝜆‖𝛽‖1

𝑀 1

𝑀 2

𝑀 1

𝑀 2

𝑀 3

𝑀 1

𝑀 2

min𝛽∈ℝ𝑑

𝐿 ( 𝑦− 𝑋 𝛽 )+𝜆‖𝛽‖1

𝑀 1

𝑀 2

𝑀 1

𝑀 2

𝑀 3

𝑀 1

𝑀 2

Processing Memory

Mathematical Structure

Efficient Feature Storage

“Query Language” Optimization

• Static analysis

‖𝑦−𝑋𝑤‖22+‖𝑤‖2

‖𝑦−𝑋𝑤‖22+‖𝑤‖1

‖𝑦−𝑋𝑤‖22+12 (‖𝑤‖2

2+‖𝑤‖1)

• Static analysis

‖𝑦−𝑋𝑤‖22+‖𝑤‖2

‖𝑦−𝑋𝑤‖22+‖𝑤‖1

‖𝑦−𝑋𝑤‖22+12 (‖𝑤‖2

2+‖𝑤‖1)

𝜀 ( 𝑦−𝑋𝑤 )+ 12 (‖𝑤‖22+‖𝑤‖1 )

• Static analysis• Runtime analysis

Some Bioinformatics Applications

• Personalized medicine, Memorial Sloan Kettering Cancer Center– 35% accuracy improvement over state-of-the-art

• Metagenomic binning and DNA quality assessment, Stanford School of Medicine– Previously unsolved problem

• Toxicogenomic analysis, Stanford University– Improved on state-of-the-art results

Upcoming

• Massive scale character level sentiment and text analysis on Amazon data– Billions of features, hours to solve a model– Efficient multitask learning

• Characterize the global limitations of learning word structure– Devise provably more efficient regularizers for uncovering structure

ThinkFast: Scaling Machine Learning to Modern Demands

Technology

When Modern Technologies Meet Ageing …...When Modern Technologies Meet Ageing Workforces: Older Workers are more affected by Demands from Mobile Interruptions than their Younger

Intuit Modern SaaS Platform - DeveloperMarch · Take care of AWS SG, EC2, Auto scaling, R53, Ingress, Egress . ... Scaling up or down the containers on demand ... and Proprietary

Global Fundraising Stage - wbaforum.org › upload › 07GFRS_2020_745.pdfJun 15, 2020 · current, highly competitive economic environment means that scaling up businesses demands

In-Memory Computing based Machine Learning Accelerators ... Roy.pdfKAUSHIK ROY PURDUE UNIVERSITY KAUSHIK@PURDUE.EDU 1 Challenge: Computational Demands and Technology Scaling Machine

Cuckoo: Scaling Microblogging Services with Divergent ...cseweb.ucsd.edu/~tixu/papers/cuckoo_tr.pdf · Cuckoo: Scaling Microblogging Services with Divergent Trafﬁc Demands. Univ

Scaling Your Tools for Your Modern Application

Challenges of Scaling Algebraic Multigrid across Modern ... · Challenges of Scaling Algebraic Multigrid across Modern Multicore Architectures Allison H. Baker, ... machine at a limited

VISUAL DEMANDS OF MODERN PRIMARY … · REFRACTIVE ANOMALIES ON ACADEMIC PERFORMANCE Sumithira Narayanasamy ... reading-related eye movements, simulation, visual acuity, visual demands,

JMB: Scaling Wireless Capacity with User Demands - DGISTcsi.dgist.ac.kr/uploads/Seminar/JMB.pdf · 1. INTRODUCTION Wireless spectrum is limited; wireless demands can, however, grow

SCALING UP THE FISHERIES ACT - Transforming the legal ...€¦ · SCALING UP THE FISHERIES ACT: RESTORING LOST PROTECTIONS AND INCORPORATING MODERN SAFEGUARDS 3 RECOMMENDATIONS 1

Modern Multidimensional Scaling 8. A Majorization Algorithm for Solving MDS

Existing and Future Demands on the Turbocharging of Modern

Modern Solutions for Today's Demands

Scaling the Field Organization in Modern Political …...Scaling the Field Program in Modern Political Campaigns Investigating Determinants of Capacity in Mobilizing and Organizing

Strategies for managing the evolving demands of the … for managing the evolving demands of the modern workforce. ... 7 Tips for managing a technical, ... Global Offerings & CGMA

Scaling Microblogging Services with Divergent Traffic Demands

DIGITAL ENGINEERING DIGITAL-ERA DESIGN DEMANDS MODERN ... · Digital-Era Design Demands Modern Workflows. 2. M. any things in the world of engineering remain a constant: The creative

Understanding the demands on modern qualitative research - Hurriyet

Supporting Media & Information Literacy · 6 Introduction Media and Information Literacy education: modern competencies meeting modern demands Access to news and information has never

Managing the demands of your modern organization...Managing the demands of your organization Page: 2 2013 12 4 managing the demands of modern org.pptx Welcome! Important Web Seminar