33
Scaling to Millions of Devices and Billions of Events Oleg Gusak, Salesforce.com, LMTS, Performance Engineering

Scaling to Millions of Devices and Billions of Events

Embed Size (px)

DESCRIPTION

Join us to learn how Salesforce Platform R&D is exploring new ways to scale data processing and aggregation from millions of devices. Learn about new frameworks, the challenges with large number of data sources and volumes of information, and making it all work. This session covers internal projects under incubation.

Citation preview

Page 1: Scaling to Millions of Devices and Billions of Events

Scaling to Millions of Devices and Billions of Events

Oleg Gusak, Salesforce.com, LMTS, Performance Engineering

Page 2: Scaling to Millions of Devices and Billions of Events

Safe harborSafe harbor statement under the Private Securities Litigation Reform Act of 1995: This presentation may contain forward-looking statements that involve risks, uncertainties, and assumptions. If any such uncertainties materialize or if any of the assumptions proves incorrect, the results of salesforce.com, inc. could differ materially from the results expressed or implied by the forward-looking statements we make. All statements other than statements of historical fact could be deemed forward-looking, including any projections of product or service availability, subscriber growth, earnings, revenues, or other financial items and any statements regarding strategies or plans of management for future operations, statements of belief, any statements concerning new, planned, or upgraded services or technology developments and customer contracts or use of our services. The risks and uncertainties referred to above include – but are not limited to – risks associated with developing and delivering new functionality for our service, new products and services, our new business model, our past operating losses, possible fluctuations in our operating results and rate of growth, interruptions or delays in our Web hosting, breach of our security measures, the outcome of any litigation, risks associated with completed and any possible mergers and acquisitions, the immature market in which we operate, our relatively limited operating history, our ability to expand, retain, and motivate our employees and manage our growth, new releases of our service and successful customer deployment, our limited history reselling non-salesforce.com products, and utilization and selling to larger enterprise customers. Further information on potential factors that could affect the financial results of salesforce.com, inc. is included in our annual report on Form 10-K for the most recent fiscal year and in our quarterly report on Form 10-Q for the most recent fiscal quarter. These documents and others containing important disclosures are available on the SEC Filings section of the Investor Information section of our Web site. Any unreleased services or features referenced in this or other presentations, press releases or public statements are not currently available and may not be delivered on time or at all. Customers who purchase our services should make the purchase decisions based upon features that are currently available. Salesforce.com, inc. assumes no obligation and does not intend to update these forward-looking statements.

Page 3: Scaling to Millions of Devices and Billions of Events

Oleg GusakLead Member of Technical StaffPerformance Engineering

Page 4: Scaling to Millions of Devices and Billions of Events

Outline

▪ Introduction: Internet of devices▪ Architecture of the devices framework▪ Challenges for Performance Testing▪ Architecture of the Test System▪ Results and Q&A

Page 5: Scaling to Millions of Devices and Billions of Events

Internet of Devices

Page 6: Scaling to Millions of Devices and Billions of Events

Internet of Devices

Billions devices produce massive amount of data

Page 7: Scaling to Millions of Devices and Billions of Events

Internet of Devices

Not every message carries important information

Page 8: Scaling to Millions of Devices and Billions of Events

Internet of Devices

Transform data to actionable information

Page 9: Scaling to Millions of Devices and Billions of Events

Dev

ices

AP

I

You

r org

in

Sal

esfo

rce.

com

Data Actionable information

Page 10: Scaling to Millions of Devices and Billions of Events

Architecture

Page 11: Scaling to Millions of Devices and Billions of Events
Page 12: Scaling to Millions of Devices and Billions of Events
Page 13: Scaling to Millions of Devices and Billions of Events

Challenges of Performance Testing

• 1 Million devices• 1 Million events per second• Ad-hoc setup

Page 14: Scaling to Millions of Devices and Billions of Events

Key components of the solution

• Automate performance test end to end

Page 15: Scaling to Millions of Devices and Billions of Events

Key components of the solution

• Reuse orchestration developed for internal performance testing

Page 16: Scaling to Millions of Devices and Billions of Events

Key components of the solution

• Non-blocking IO http client

Page 17: Scaling to Millions of Devices and Billions of Events

Key components of the solution

• Amazon elastic cloud and its API

Page 18: Scaling to Millions of Devices and Billions of Events

Key components of the solution

• Sumo Logic for log analysis

Page 19: Scaling to Millions of Devices and Billions of Events

Architecture of the test system

Controller

Results processor

Load VM

Load VM

Load VM

Res

t AP

IR

est A

PI

ELB

Kaf

kaK

afka

Sto

rmS

torm

Sumo LogicREST API of cluster manager

Page 20: Scaling to Millions of Devices and Billions of Events

Architecture of the test system

• Adopted internally developed automation tool Suzuki for testing in EC2

Page 21: Scaling to Millions of Devices and Billions of Events

Suzuki - load test orchestrator

Par

ts

Stages

Setup Load Collect resultsLoad generatorLog manager

Page 22: Scaling to Millions of Devices and Billions of Events

Architecture of the test system

• Common image for load VMs tuned to support large number of concurrent connections

Page 23: Scaling to Millions of Devices and Billions of Events

Architecture of the test system

• Load VMs are managed via Amazon API

Page 24: Scaling to Millions of Devices and Billions of Events

Architecture of the test system

• Metadata of the target system under test is retrieved via REST API of the cluster manager

Page 25: Scaling to Millions of Devices and Billions of Events

Workflow of a performance test

Controller

Amazon API

Load VM

Load VM

Load VM

Provision load instances

Page 26: Scaling to Millions of Devices and Billions of Events

Workflow of a performance test

Controller

Amazon API

Load VM

Load VM

Load VM

Configure load instances

REST API

Page 27: Scaling to Millions of Devices and Billions of Events

Workflow of a performance test

Controller

Load VM

Load VM

Load VM

Generate load

Res

t AP

IR

est A

PI

ELB

Kaf

kaK

afka

Sto

rmS

torm

Page 28: Scaling to Millions of Devices and Billions of Events

Workflow of a performance test

Controller

Load VM

Load VM

Load VM

Collect logs and process results

Res

t AP

IR

est A

PI

ELB

Kaf

kaK

afka

Sto

rmS

torm

Sumo Logic

Page 29: Scaling to Millions of Devices and Billions of Events

Single REST API instance

Page 30: Scaling to Millions of Devices and Billions of Events

80,000 requests per second, 10 c1.xlarge load instances

Page 31: Scaling to Millions of Devices and Billions of Events

80,000 requests per second, 10 c1.xlarge load instances

Page 32: Scaling to Millions of Devices and Billions of Events

Oleg Gusak

LMTS, Performance Engineering

Page 33: Scaling to Millions of Devices and Billions of Events