Infinite interns

Embed Size (px)

Citation preview

Infinite Interns

Use VirtualBox to build a machine inside your machine

Infinite Interns

Each machine or 'intern' has the software required to run each stage of your data driven investigation

Python 'intern'

MySQL 'intern'

R 'intern'

Infinite Interns

Create an 'intern' for each stage of your investigation, transforming the data and then destroying the 'intern'

Vagrant up

Vagrant ssh

Write scraper on the intern to transform data to a structured database

Vagrant destroy

Structured data saved onto computer

Infinite Interns

This means:

Each environment is virgin and uncontaminated

The computer is sandboxed (protected) from the development environment

The whole process is completely reproducible so the environment as well as the work can be shared