How Spotify Scales Apache Storm Pipelines


[Architecture diagram: log events flow from Apache Kafka into the real-time personalization pipeline on Apache Storm, which writes to a user profile store and an entity metadata store, both backed by Apache Cassandra.]
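To make that data flow concrete, here is a minimal sketch of how such a topology could be wired up in Java. It is not Spotify's actual code: the spout uses the storm-kafka-client KafkaSpout as a stand-in, and EntityMetadataBolt, UserProfileWriterBolt, the broker address, topic, and topology names are all invented for illustration.

```java
import org.apache.storm.Config;
import org.apache.storm.StormSubmitter;
import org.apache.storm.kafka.spout.KafkaSpout;
import org.apache.storm.kafka.spout.KafkaSpoutConfig;
import org.apache.storm.topology.BasicOutputCollector;
import org.apache.storm.topology.OutputFieldsDeclarer;
import org.apache.storm.topology.TopologyBuilder;
import org.apache.storm.topology.base.BaseBasicBolt;
import org.apache.storm.tuple.Fields;
import org.apache.storm.tuple.Tuple;
import org.apache.storm.tuple.Values;

public class PersonalizationTopology {

    /** Hypothetical bolt: enriches raw log events with entity metadata read from Cassandra. */
    public static class EntityMetadataBolt extends BaseBasicBolt {
        @Override
        public void execute(Tuple tuple, BasicOutputCollector collector) {
            String rawEvent = tuple.getStringByField("value"); // KafkaSpout's default output field
            // Event parsing and the Cassandra metadata lookup are elided.
            String userId = "user-" + Math.abs(rawEvent.hashCode() % 1000); // placeholder extraction
            collector.emit(new Values(userId, rawEvent));
        }

        @Override
        public void declareOutputFields(OutputFieldsDeclarer declarer) {
            declarer.declare(new Fields("userId", "event"));
        }
    }

    /** Hypothetical terminal bolt: upserts updated user profiles into Cassandra. */
    public static class UserProfileWriterBolt extends BaseBasicBolt {
        @Override
        public void execute(Tuple tuple, BasicOutputCollector collector) {
            // A real implementation would write to the user profile table via the Cassandra driver.
        }

        @Override
        public void declareOutputFields(OutputFieldsDeclarer declarer) {
            // Terminal bolt: nothing emitted downstream.
        }
    }

    public static void main(String[] args) throws Exception {
        TopologyBuilder builder = new TopologyBuilder();

        // Log events arrive on a Kafka topic; broker and topic names are placeholders.
        builder.setSpout("log-events",
                new KafkaSpout<>(KafkaSpoutConfig.builder("kafka-broker:9092", "log-events").build()), 4);

        builder.setBolt("entity-metadata", new EntityMetadataBolt(), 8)
               .shuffleGrouping("log-events");
        builder.setBolt("user-profile-writer", new UserProfileWriterBolt(), 8)
               .fieldsGrouping("entity-metadata", new Fields("userId"));

        Config conf = new Config();
        conf.setNumWorkers(4); // 1 worker per node, per the tuning notes later in the deck
        StormSubmitter.submitTopology("personalization-v1", conf, builder.createTopology());
    }
}
```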

[Deployment timeline (t1-t8): v1 runs on the Storm cluster while v2 is built and submitted; both versions run side by side while v2's graphs are checked; v1 is then deactivated and finally killed, leaving only v2 running.]
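The version swap shown in the timeline is usually driven with the storm CLI (submit v2 with `storm jar`, then `storm deactivate` and `storm kill -w <secs>` for v1), but it can also be done programmatically against Nimbus. Below is a rough sketch of the deactivate-then-kill step, assuming the org.apache.storm package layout of Storm 1.x+; the topology name and wait time are placeholders.

```java
import java.util.Map;

import org.apache.storm.generated.KillOptions;
import org.apache.storm.generated.Nimbus;
import org.apache.storm.utils.NimbusClient;
import org.apache.storm.utils.Utils;

public class RetireOldTopology {
    public static void main(String[] args) throws Exception {
        Map<String, Object> conf = Utils.readStormConfig();
        NimbusClient nimbus = NimbusClient.getConfiguredClient(conf);
        Nimbus.Client client = nimbus.getClient();

        // v2 is already running alongside v1 and its graphs look healthy,
        // so stop v1 from consuming new tuples...
        client.deactivate("personalization-v1");

        // ...then remove it, giving in-flight tuples time to drain first.
        KillOptions opts = new KillOptions();
        opts.set_wait_secs(120);
        client.killTopologyWithOpts("personalization-v1", opts);

        nimbus.close();
    }
}
```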

Alerting: PagerDuty and an in-house solution.


● Use different tables for data with different TTLs; set gc_grace_seconds=0 and disable read repairs.
● Use DateTieredCompactionStrategy for short-lived data.
● Control the number of open connections from the Storm topology to Cassandra.
● Configure the snitch to ensure proper call routing (see the sketch after this list).
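A sketch of what those settings could look like from the client side, assuming the DataStax Java driver 3.x and pre-4.0 Cassandra table options; keyspace, table, host, and datacenter names are invented, and the snitch itself is configured server-side (endpoint_snitch in cassandra.yaml), with the client complementing it via a DC-aware load-balancing policy.

```java
import com.datastax.driver.core.Cluster;
import com.datastax.driver.core.HostDistance;
import com.datastax.driver.core.PoolingOptions;
import com.datastax.driver.core.Session;
import com.datastax.driver.core.policies.DCAwareRoundRobinPolicy;

public class ShortTtlTableSetup {
    public static void main(String[] args) {
        // Cap connections per Cassandra host so a large Storm topology does not overwhelm the cluster.
        PoolingOptions pooling = new PoolingOptions()
                .setConnectionsPerHost(HostDistance.LOCAL, 2, 4);

        // Keep requests in the local datacenter; server-side, the snitch decides replica routing.
        Cluster cluster = Cluster.builder()
                .addContactPoint("cassandra-host")
                .withPoolingOptions(pooling)
                .withLoadBalancingPolicy(
                        DCAwareRoundRobinPolicy.builder().withLocalDc("local-dc").build())
                .build();
        Session session = cluster.connect("personalization"); // keyspace assumed to exist

        // One table per TTL class: tombstones purged immediately, read repair off,
        // and DateTieredCompactionStrategy for short-lived, time-ordered data.
        session.execute(
            "CREATE TABLE IF NOT EXISTS user_profile_1h (" +
            "   user_id text PRIMARY KEY," +
            "   profile blob" +
            ") WITH default_time_to_live = 3600" +
            "  AND gc_grace_seconds = 0" +
            "  AND read_repair_chance = 0" +
            "  AND dclocal_read_repair_chance = 0" +
            "  AND compaction = {'class': 'DateTieredCompactionStrategy'}");

        cluster.close();
    }
}
```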

● 1 worker per node per topology.
● 1 executor per core for CPU-bound tasks.
● 1-10 executors per core for IO-bound tasks.
● Compute the total parallelism possible and distribute it amongst slow and fast tasks: high parallelism for slow tasks, low for fast tasks (see the sketch after the footnote below).

* Parallelism tuning inspired by P Taylor Goetz’s Strata 2014 talk
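As a rough illustration of that arithmetic (all numbers invented), the rules of thumb turn into worker and executor counts roughly like this:

```java
import org.apache.storm.Config;

public class ParallelismBudget {
    public static void main(String[] args) {
        int nodes = 4;          // nodes assigned to this topology (illustrative)
        int coresPerNode = 24;  // cores per node (illustrative)

        // 1 worker per node per topology.
        Config conf = new Config();
        conf.setNumWorkers(nodes);

        // ~1 executor per core for CPU-bound tasks, 1-10 per core for IO-bound (slow) tasks.
        int cpuBoundExecutors = nodes * coresPerNode;      // e.g. a parsing bolt
        int ioBoundExecutors  = nodes * coresPerNode * 4;  // e.g. a Cassandra-writing bolt

        // These totals become the parallelism hints passed to TopologyBuilder.setBolt(...).
        System.out.printf("workers=%d cpuBoundExecutors=%d ioBoundExecutors=%d%n",
                nodes, cpuBoundExecutors, ioBoundExecutors);
    }
}
```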

● Think about constraints in external vs in-process caching (see the sketch after this list):
  ○ External caching
    ■ Network IO
    ■ Latency
    ■ Another point of failure
  ○ In-process caching
    ■ Limited memory
    ■ No persistence
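For the in-process option, a bounded, expiring cache inside the bolt makes the limited-memory trade-off explicit. The sketch below uses Guava's Cache purely as an example (the deck does not say which library was used), and the size and TTL bounds are invented.

```java
import java.util.Optional;
import java.util.concurrent.TimeUnit;

import com.google.common.cache.Cache;
import com.google.common.cache.CacheBuilder;

/** Hypothetical in-process metadata cache held inside a bolt: bounded memory, no persistence. */
public class EntityMetadataCache {
    // Size and TTL caps keep the cache within the worker's heap; entries are lost on worker restart.
    private final Cache<String, String> cache = CacheBuilder.newBuilder()
            .maximumSize(100_000)
            .expireAfterWrite(30, TimeUnit.MINUTES)
            .build();

    public Optional<String> get(String entityId) {
        return Optional.ofNullable(cache.getIfPresent(entityId));
    }

    public void put(String entityId, String metadataJson) {
        cache.put(entityId, metadataJson);
    }
}
```

An external cache would lift the memory and persistence limits but adds network IO, latency, and another point of failure, as the list above notes.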
