106
Hands On: Multimedia Methods for Large Scale Video Analysis (Project Meeting) Dr. Gerald Friedland, [email protected] 1

Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Hands On: Multimedia Methods for Large Scale Video Analysis(Project Meeting)

Dr. Gerald Friedland, [email protected]

1

Page 2: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Today

Page 3: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Today

• Amazon EC2

Page 4: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Today

• Amazon EC2– What is it?

Page 5: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Today

• Amazon EC2– What is it?– Concepts to understand before using

it

Page 6: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Today

• Amazon EC2– What is it?– Concepts to understand before using

it– Some tutorials

Page 7: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Today

• Amazon EC2– What is it?– Concepts to understand before using

it– Some tutorials– Issues: Data, cost, and other

considerations

Page 8: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Today

• Amazon EC2– What is it?– Concepts to understand before using

it– Some tutorials– Issues: Data, cost, and other

considerations• More on Project Ideas

Page 9: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

3

Page 10: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Amazon EC2

4

Page 11: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Amazon EC2

• EC2 = Elastic Compute Cluster

4

Page 12: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Amazon EC2

• EC2 = Elastic Compute Cluster– Configurable set of Virtual Machines

(=Instance) running on real machines

4

Page 13: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Amazon EC2

• EC2 = Elastic Compute Cluster– Configurable set of Virtual Machines

(=Instance) running on real machines– Storage is virtualized as well

4

Page 14: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Amazon EC2

• EC2 = Elastic Compute Cluster– Configurable set of Virtual Machines

(=Instance) running on real machines– Storage is virtualized as well

• Originally designed for scalable web shops (like Amazon.com) - Software as a Service (SaaS)

4

Page 15: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Amazon EC2

• EC2 = Elastic Compute Cluster– Configurable set of Virtual Machines

(=Instance) running on real machines– Storage is virtualized as well

• Originally designed for scalable web shops (like Amazon.com) - Software as a Service (SaaS)

• Now: IaaS (Infrastructure as a Service)4

Page 16: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Infrastructure on Demand

5

Page 17: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Infrastructure on Demand

5

• Hardware On Demand

Page 18: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Infrastructure on Demand

5

• Hardware On Demand • Pay for what you use

Page 19: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Infrastructure on Demand

5

• Hardware On Demand • Pay for what you use • Full root access – you control the

OS and Software Stack

Page 20: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Infrastructure on Demand

5

• Hardware On Demand • Pay for what you use • Full root access – you control the

OS and Software Stack • Ability to scale computing

resources up and down

Page 21: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Infrastructure on Demand

5

• Hardware On Demand • Pay for what you use • Full root access – you control the

OS and Software Stack • Ability to scale computing

resources up and down • No dealing with racks, networks,

power, cooling, housing, etc.

Page 22: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Amazon EC2

6

Page 23: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Amazon EC2• Resizable Compute is controlled

via Instances either through

6

Page 24: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Amazon EC2• Resizable Compute is controlled

via Instances either through – Web Interface (Amazon Web Services)

6

Page 25: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Amazon EC2• Resizable Compute is controlled

via Instances either through – Web Interface (Amazon Web Services) – API

6

Page 26: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Amazon EC2• Resizable Compute is controlled

via Instances either through – Web Interface (Amazon Web Services) – API

• Variety of Instance Sizes: CPU Power, Cores, RAM, Disk.

6

Page 27: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Amazon EC2• Resizable Compute is controlled

via Instances either through – Web Interface (Amazon Web Services) – API

• Variety of Instance Sizes: CPU Power, Cores, RAM, Disk.

• Wide Variety of Pre-built AMIs (Amazon Machine Images)

6

Page 28: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Amazon EC2• Resizable Compute is controlled

via Instances either through – Web Interface (Amazon Web Services) – API

• Variety of Instance Sizes: CPU Power, Cores, RAM, Disk.

• Wide Variety of Pre-built AMIs (Amazon Machine Images)

• Access for each running instance using keypair required for SSH access. 6

Page 29: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

7

EC2: Instance Examples

Page 30: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

EC2: Special Instances

8

Page 31: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

EC2: Special Instances• “Spot” Instances

8

Page 32: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

EC2: Special Instances• “Spot” Instances

– Bid for unused AWS capacity

8

Page 33: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

EC2: Special Instances• “Spot” Instances

– Bid for unused AWS capacity – Prices controlled by AWS based on

supply and demand

8

Page 34: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

EC2: Special Instances• “Spot” Instances

– Bid for unused AWS capacity – Prices controlled by AWS based on

supply and demand • AWS can terminate Spot Instances

without notice

8

Page 35: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

EC2: Special Instances• “Spot” Instances

– Bid for unused AWS capacity – Prices controlled by AWS based on

supply and demand • AWS can terminate Spot Instances

without notice • Best approach to temporary

requests for large numbers of servers

8

Page 36: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

EC2: Special Instances• “Spot” Instances

– Bid for unused AWS capacity – Prices controlled by AWS based on

supply and demand • AWS can terminate Spot Instances

without notice • Best approach to temporary

requests for large numbers of servers

• Default maximum = 100 servers (instead of 20 on-demand) 8

Page 37: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

EC2: More Concepts

9

Page 38: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

EC2: More Concepts

9

• Regions: A region is a geographical area that contains one or more Availability ZonesData transfer: Cheap(er)

Page 39: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

EC2: More Concepts

9

• Regions: A region is a geographical area that contains one or more Availability ZonesData transfer: Cheap(er)

• Availability Zone: Some services only available in the same AZ.Data transfer: Free

Page 40: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

EC2: Even More Concepts

10

Page 41: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

EC2: Even More Concepts

10

• Amazon Machine Image (AMI)

Page 42: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

EC2: Even More Concepts

10

• Amazon Machine Image (AMI)• Contains an entire operating system and software stack that can be

loaded onto one or more virtual machines

Page 43: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

EC2: Even More Concepts

10

• Amazon Machine Image (AMI)• Contains an entire operating system and software stack that can be

loaded onto one or more virtual machines

• Amazon Elastic Block Storage (EBS)

Page 44: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

EC2: Even More Concepts

10

• Amazon Machine Image (AMI)• Contains an entire operating system and software stack that can be

loaded onto one or more virtual machines

• Amazon Elastic Block Storage (EBS)• Persistent storage: Volume lifetime is independent of any particular

EC2 instance.

Page 45: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

EC2: Even More Concepts

10

• Amazon Machine Image (AMI)• Contains an entire operating system and software stack that can be

loaded onto one or more virtual machines

• Amazon Elastic Block Storage (EBS)• Persistent storage: Volume lifetime is independent of any particular

EC2 instance. • Raw, unformatted, block device.

Page 46: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

EC2: Even More Concepts

10

• Amazon Machine Image (AMI)• Contains an entire operating system and software stack that can be

loaded onto one or more virtual machines

• Amazon Elastic Block Storage (EBS)• Persistent storage: Volume lifetime is independent of any particular

EC2 instance. • Raw, unformatted, block device.• Performance equal to or better than local EC2 drive.

Page 47: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

EC2: Even More Concepts

10

• Amazon Machine Image (AMI)• Contains an entire operating system and software stack that can be

loaded onto one or more virtual machines

• Amazon Elastic Block Storage (EBS)• Persistent storage: Volume lifetime is independent of any particular

EC2 instance. • Raw, unformatted, block device.• Performance equal to or better than local EC2 drive. • Built-in redundancy within availability zone. AFR (Annual Failure

Rate) between 0.1% and 1%.

Page 48: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

EC2: Even More Concepts

10

• Amazon Machine Image (AMI)• Contains an entire operating system and software stack that can be

loaded onto one or more virtual machines

• Amazon Elastic Block Storage (EBS)• Persistent storage: Volume lifetime is independent of any particular

EC2 instance. • Raw, unformatted, block device.• Performance equal to or better than local EC2 drive. • Built-in redundancy within availability zone. AFR (Annual Failure

Rate) between 0.1% and 1%. • Sizes range from 1 GB to 1 TB.

Page 49: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

EC2: Even More Concepts

10

• Amazon Machine Image (AMI)• Contains an entire operating system and software stack that can be

loaded onto one or more virtual machines

• Amazon Elastic Block Storage (EBS)• Persistent storage: Volume lifetime is independent of any particular

EC2 instance. • Raw, unformatted, block device.• Performance equal to or better than local EC2 drive. • Built-in redundancy within availability zone. AFR (Annual Failure

Rate) between 0.1% and 1%. • Sizes range from 1 GB to 1 TB.

• Easy to create, attach, back up, restore, and delete volumes.

Page 50: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

EC2: More on Storage

11

Page 51: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

EC2: More on Storage

11

• Amazon Simple Storage Solution (S3)

Page 52: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

EC2: More on Storage

11

• Amazon Simple Storage Solution (S3)• “... a simple web service interface that can be used

to store and retrieve any amount of data, at any time, from anywhere on the web”

Page 53: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

EC2: More on Storage

11

• Amazon Simple Storage Solution (S3)• “... a simple web service interface that can be used

to store and retrieve any amount of data, at any time, from anywhere on the web”

• Read, write, and delete binary obejcts containing from 1 byte to 5 TB of data each using API.

Page 54: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

EC2: More on Storage

11

• Amazon Simple Storage Solution (S3)• “... a simple web service interface that can be used

to store and retrieve any amount of data, at any time, from anywhere on the web”

• Read, write, and delete binary obejcts containing from 1 byte to 5 TB of data each using API.

• Number of objects you can store is unlimited.

Page 55: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

EC2: More on Storage

11

• Amazon Simple Storage Solution (S3)• “... a simple web service interface that can be used

to store and retrieve any amount of data, at any time, from anywhere on the web”

• Read, write, and delete binary obejcts containing from 1 byte to 5 TB of data each using API.

• Number of objects you can store is unlimited. • Each object stored in a 'bucket'and retrieved via a

unique, user-assigned key

Page 56: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

EC2: More on Storage

11

• Amazon Simple Storage Solution (S3)• “... a simple web service interface that can be used

to store and retrieve any amount of data, at any time, from anywhere on the web”

• Read, write, and delete binary obejcts containing from 1 byte to 5 TB of data each using API.

• Number of objects you can store is unlimited. • Each object stored in a 'bucket'and retrieved via a

unique, user-assigned key• Different levels of reliability.

Page 57: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

EC2: More on Storage

11

• Amazon Simple Storage Solution (S3)• “... a simple web service interface that can be used

to store and retrieve any amount of data, at any time, from anywhere on the web”

• Read, write, and delete binary obejcts containing from 1 byte to 5 TB of data each using API.

• Number of objects you can store is unlimited. • Each object stored in a 'bucket'and retrieved via a

unique, user-assigned key• Different levels of reliability. • Generally cheaper than EBS

Page 58: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

EBS vs S3

12

Page 59: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

EBS vs S3

12

• EBS only mountable to one instance

Page 60: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

EBS vs S3

12

• EBS only mountable to one instance

• EBS can only be used with instances in same AZ

Page 61: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

EBS vs S3

12

• EBS only mountable to one instance

• EBS can only be used with instances in same AZ

• EBS <-> S3 can be converted,but $$$

Page 62: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Available Datasets

Page 63: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Available Datasets

• Amazon makes Public Datasets available as EBS

Page 64: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Available Datasets

• Amazon makes Public Datasets available as EBS

• 1M Song Dataset and 10k subset is available.

Page 65: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Available Datasets

• Amazon makes Public Datasets available as EBS

• 1M Song Dataset and 10k subset is available.

• More info on: http://aws.amazon.com/publicdatasets/

Page 66: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Using EC2 for HPC: Intro

http://www.youtube.com/embed/YfCgK1bmCjw

Page 67: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Using EC2 for HPC: Intro

http://www.youtube.com/embed/YfCgK1bmCjwWatch video at:

Page 68: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Problem

Page 69: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Problem

EC2 provides raw compute power. There’s work to be done to create a usable cluster:

Page 70: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Problem

EC2 provides raw compute power. There’s work to be done to create a usable cluster:

•Software installation

Page 71: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Problem

EC2 provides raw compute power. There’s work to be done to create a usable cluster:

•Software installation •AMI creation

Page 72: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Problem

EC2 provides raw compute power. There’s work to be done to create a usable cluster:

•Software installation •AMI creation •AWS / SSH key management and distribution

Page 73: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Problem

EC2 provides raw compute power. There’s work to be done to create a usable cluster:

•Software installation •AMI creation •AWS / SSH key management and distribution •Persistent Disk Storage and File Sharing

Page 74: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Problem

EC2 provides raw compute power. There’s work to be done to create a usable cluster:

•Software installation •AMI creation •AWS / SSH key management and distribution •Persistent Disk Storage and File Sharing •Configuration management

Page 75: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Problem

EC2 provides raw compute power. There’s work to be done to create a usable cluster:

•Software installation •AMI creation •AWS / SSH key management and distribution •Persistent Disk Storage and File Sharing •Configuration management •Higher-level management (cluster vs. instance)

Page 76: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Alternative: Elastic MapReduce

Page 77: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Alternative: Elastic MapReduce

• Launch MapReduce jobs on EC2 using Hadoop

Page 78: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Alternative: Elastic MapReduce

• Launch MapReduce jobs on EC2 using Hadoop

• Workflows defined either on console or using Webinterface

Page 79: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Alternative: Elastic MapReduce

• Launch MapReduce jobs on EC2 using Hadoop

• Workflows defined either on console or using Webinterface

• Map/Reduce code in S3

Page 80: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Alternative: Elastic MapReduce

• Launch MapReduce jobs on EC2 using Hadoop

• Workflows defined either on console or using Webinterface

• Map/Reduce code in S3• Input/Output data stored in S3

Page 81: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Alternative: Elastic MapReduce

• Launch MapReduce jobs on EC2 using Hadoop

• Workflows defined either on console or using Webinterface

• Map/Reduce code in S3• Input/Output data stored in S3• MapReduce covered in future lecture

Page 82: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Alternative: Elastic MapReduce

• Launch MapReduce jobs on EC2 using Hadoop

• Workflows defined either on console or using Webinterface

• Map/Reduce code in S3• Input/Output data stored in S3• MapReduce covered in future lecture• More info on Amazon:

http://aws.amazon.com/elasticmapreduce/

Page 83: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Alternative: MIT Starcluster

Page 84: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Alternative: MIT Starcluster

Watch video at:

Page 85: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Alternative: MIT Starcluster

Watch video at: http://www.youtube.com/watch?v=vC3lJcPq1FY

Page 86: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

EC2 Cost: Free Services

• 750 hours of EC2 running Micro instance usage• 30 GB of Amazon EBS Standard volume storage

plus 2 million IOs and 1 GB snapshot storage• 15 GB of bandwidth out aggregated across all AWS

services• 1 GB of Regional Data Transfer

Page 87: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Data Transfer Cost: Between Instances

Page 88: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Data Transfer Cost: Between Instances

• Inside AZ: Free

Page 89: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Data Transfer Cost: Between Instances

• Inside AZ: Free• Inside Region: $0.01/GB

Page 90: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Data Transfer Cost: Between Instances

• Inside AZ: Free• Inside Region: $0.01/GB• Public IP (even in AZ): $0.01/GB

Page 91: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Data Storage Cost

Page 92: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

CPU Cost

Page 93: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

The Tradeoffs

Page 94: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

The Tradeoffs

• More CPUs vs more power per CPU

Page 95: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

The Tradeoffs

• More CPUs vs more power per CPU• What data to transfer, what data to

process locally

Page 96: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

The Tradeoffs

• More CPUs vs more power per CPU• What data to transfer, what data to

process locally• Self-configured vs automatic

(Starcluster) vs predefined (MapReduce)

Page 97: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

The Tradeoffs

• More CPUs vs more power per CPU• What data to transfer, what data to

process locally• Self-configured vs automatic

(Starcluster) vs predefined (MapReduce)

• Console vs Web Interface vs API

Page 98: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

The Tradeoffs

• More CPUs vs more power per CPU• What data to transfer, what data to

process locally• Self-configured vs automatic

(Starcluster) vs predefined (MapReduce)

• Console vs Web Interface vs API• S3 vs EBS

Page 99: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

The Tradeoffs

• More CPUs vs more power per CPU• What data to transfer, what data to

process locally• Self-configured vs automatic

(Starcluster) vs predefined (MapReduce)

• Console vs Web Interface vs API• S3 vs EBS• Amazon vs. stay local (ICSI)

Page 100: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

More on Project Ideas

Page 105: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

This Week (Lecture)

24

• More on Audio

Page 106: Hands On: Multimedia Methods for Large Scale Video ...fractor/fall2012/cs294-4-2012.pdf · – Bid for unused AWS capacity – Prices controlled by AWS based on supply and demand

Next Week (Project Meeting)

25

• Project Teams