Scaling PHP in The Real World


DESCRIPTION

PHP is used by the likes of Facebook, Yahoo, Zynga, Tumblr, Etsy, and Wikipedia. How do the largest internet companies scale PHP to meet their demand? Join this session and find out how to use the latest tools in PHP for developing high performance applications. We’ll take a look at common techniques for scaling PHP applications and best practices for profiling and optimizing performance. After this session, you’ll leave prepared to tackle your next enterprise PHP project.


SCALING PHP IN THE REAL WORLD!

Dustin Whittle

AGENDA

• Why performance matters

• The problems with PHP

• Best practice designs

• Distributed data caches with Redis and Memcached

• Doing work in the background with queues

• HTTP caching and a reverse proxy with Varnish

• Using the right tool for the job

• Tools of the trade

• Xdebug + WebGrind

• XHProf + XHProf GUI

• AppDynamics

• Google PageSpeed

• Architecture not applications

DUSTIN WHITTLE

• dustinwhittle.com

• @dustinwhittle

• Technologist, Pilot, Skier, Diver, Sailor, Golfer

WHAT I HAVE WORKED ON

• Developer Evangelist @

• Consultant & Trainer @

• Developer Evangelist @

DID YOU KNOW FACEBOOK, YAHOO, ZYNGA, TUMBLR, ETSY, AND WIKIPEDIA WERE ALL BUILT ON PHP?

WHY DOES PERFORMANCE MATTER?

WHEN MOZILLA SHAVED 2.2 SECONDS OFF THEIR LANDING PAGE, FIREFOX DOWNLOADS INCREASED 15.4%

MAKING BARACK OBAMA’S WEBSITE 60% FASTER INCREASED DONATION CONVERSIONS BY 14%

AMAZON AND WALMART INCREASED REVENUE 1% FOR EVERY 100MS OF IMPROVEMENT

PERFORMANCE DIRECTLY IMPACTS THE BOTTOM LINE

PHP IS SLOWER THAN JAVA, C++, ERLANG, SCALA, AND GO!

http://phpsadness.com/

...AND PHP HAS SOME SERIOUS DESIGN ISSUES AND INCONSISTENCIES!

...BUT THERE ARE WAYS TO SCALE TO HANDLE HIGH TRAFFIC APPLICATIONS

PHP IS NOT YOUR PROBLEM!

WHAT VERSION OF PHP DO YOU RUN?

UPGRADE YOUR PHP ENVIRONMENT TO 2014!

NGINX + PHP-FPM

USE AN OPCODE CACHE!

PHP 5.5 HAS ZEND OPCACHE

APC
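Whether an opcode cache is actually active is easy to get wrong, because the extension can be installed but disabled. The following is a minimal sketch, assuming PHP 5.5+ with Zend OPcache, that reports the cache status at runtime. The function name `opcode_cache_summary` is hypothetical; `opcache_get_status()` is the extension's own function and is only defined when OPcache is loaded.

```php
<?php
// Hypothetical helper: summarizes whether Zend OPcache is loaded and enabled.
function opcode_cache_summary()
{
    // opcache_get_status() only exists when the OPcache extension is loaded.
    if (!function_exists('opcache_get_status')) {
        return array('available' => false, 'enabled' => false, 'hit_rate' => null);
    }

    // Passing false skips per-script details, which keeps the call cheap.
    $status = @opcache_get_status(false);
    if (!is_array($status)) {
        // The call returns false when OPcache is loaded but switched off.
        return array('available' => true, 'enabled' => false, 'hit_rate' => null);
    }

    $stats = isset($status['opcache_statistics']) ? $status['opcache_statistics'] : array();

    return array(
        'available' => true,
        'enabled'   => !empty($status['opcache_enabled']),
        'hit_rate'  => isset($stats['opcache_hit_rate']) ? (float) $stats['opcache_hit_rate'] : null,
    );
}

$summary = opcode_cache_summary();
printf(
    "OPcache available: %s, enabled: %s\n",
    $summary['available'] ? 'yes' : 'no',
    $summary['enabled'] ? 'yes' : 'no'
);
```

On the command line OPcache is usually disabled by default, so a check like this is most useful when run through the web server or PHP-FPM.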

USE AUTOLOADING AND PSR-0

SYMFONY2 CLASSLOADER COMPONENT WITH APC
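To make the PSR-0 convention concrete, here is a minimal sketch of how a class name maps to a file path. It is not the Symfony2 ClassLoader implementation; the function name `psr0_path` and the `src` base directory are assumptions chosen for illustration.

```php
<?php
// Hypothetical helper: resolves a class name to a file path using PSR-0 rules.
function psr0_path($className, $baseDir)
{
    $className = ltrim($className, '\\');
    $fileName = '';

    // Namespace separators become directory separators.
    $lastNsPos = strrpos($className, '\\');
    if ($lastNsPos !== false) {
        $namespace = substr($className, 0, $lastNsPos);
        $className = substr($className, $lastNsPos + 1);
        $fileName = str_replace('\\', DIRECTORY_SEPARATOR, $namespace) . DIRECTORY_SEPARATOR;
    }

    // Underscores are converted only in the class name, not in the namespace.
    $fileName .= str_replace('_', DIRECTORY_SEPARATOR, $className) . '.php';

    return rtrim($baseDir, '/\\') . DIRECTORY_SEPARATOR . $fileName;
}

// Register an autoloader that loads classes from an assumed "src" directory.
spl_autoload_register(function ($className) {
    $file = psr0_path($className, __DIR__ . '/src');
    if (is_file($file)) {
        require $file;
    }
});
```

A production setup would normally rely on Composer's generated autoloader or a caching class loader, so that the file lookup is not repeated on every request.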

CACHING

SCALING BEYOND A SINGLE SERVER IN PHP

OPTIMIZE YOUR SESSIONS!

THE DEFAULT IN PHP IS TO PERSIST SESSIONS TO DISK

IT IS BETTER TO STORE SESSIONS IN A DATABASE

EVEN BETTER IS TO STORE IN A DATABASE WITH A SHARED CACHE IN FRONT

pecl install memcached

session.save_handler = memcached
session.save_path = "10.0.0.10:11211,10.0.0.11:11211,10.0.0.12:11211"
memcached.sess_prefix = "session."
memcached.sess_consistent_hash = On
memcached.sess_remove_failed = 1
memcached.sess_number_of_replicas = 2
memcached.sess_binary = On
memcached.sess_randomize_replica_read = On
memcached.sess_locking = On
memcached.sess_connect_timeout = 200
memcached.serializer = "igbinary"

THE BEST SOLUTION IS TO LIMIT SESSION SIZE AND STORE ALL DATA IN A SIGNED OR ENCRYPTED COOKIE
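A signed cookie lets the server detect tampering without keeping any session state. The sketch below shows one way to do it with an HMAC; the function names and the payload format are assumptions, and `hash_equals()` requires PHP 5.6 or later. Signing does not hide the data from the user, so anything confidential would need encryption as well.

```php
<?php
// Hypothetical helper: serializes session data and appends an HMAC signature.
function sign_cookie_value(array $data, $secret)
{
    $payload = base64_encode(json_encode($data));
    $signature = hash_hmac('sha256', $payload, $secret);

    return $payload . '.' . $signature;
}

// Hypothetical helper: returns the data if the signature is valid, null otherwise.
function verify_cookie_value($cookie, $secret)
{
    $parts = explode('.', $cookie, 2);
    if (count($parts) !== 2) {
        return null;
    }
    list($payload, $signature) = $parts;

    // hash_equals() compares in constant time to avoid leaking timing information.
    $expected = hash_hmac('sha256', $payload, $secret);
    if (!hash_equals($expected, $signature)) {
        return null;
    }

    $json = base64_decode($payload, true);
    if ($json === false) {
        return null;
    }
    $data = json_decode($json, true);

    return is_array($data) ? $data : null;
}

// Usage: the secret must stay on the server and be identical on every app server.
$cookie = sign_cookie_value(array('user_id' => 42), 'replace-with-a-long-random-secret');
$session = verify_cookie_value($cookie, 'replace-with-a-long-random-secret');
```

Because browsers limit cookies to roughly 4 KB, this approach only works when the session payload is kept small, which is the point of the slide.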

LEVERAGE AN IN-MEMORY DATA CACHE

MEMCACHED.ORG

REDIS.IO

• Any data that is expensive to generate/query and long lived should be cached

• Web Service Responses

• HTTP Responses

• Database Result Sets

• Configuration Data
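The items above all follow the same cache-aside pattern: look in the cache first, and only compute and store the value on a miss. Here is a minimal sketch of that pattern. The `ArrayTtlCache` class is an in-process stand-in invented for this example so the code runs without a server; in practice the same two methods would be backed by Memcached or Redis.

```php
<?php
// Hypothetical in-memory stand-in for a shared cache such as Memcached or Redis.
class ArrayTtlCache
{
    private $items = array();

    public function get($key)
    {
        if (!isset($this->items[$key])) {
            return null;
        }
        list($value, $expiresAt) = $this->items[$key];
        if ($expiresAt < time()) {
            // Expired entries are dropped and reported as a miss.
            unset($this->items[$key]);
            return null;
        }

        return $value;
    }

    public function set($key, $value, $ttl)
    {
        $this->items[$key] = array($value, time() + $ttl);
    }
}

// Cache-aside: return the cached value, or compute it once and store it.
// Note that a producer returning null is never cached in this sketch.
function cache_remember(ArrayTtlCache $cache, $key, $ttl, $producer)
{
    $value = $cache->get($key);
    if ($value !== null) {
        return $value;
    }

    $value = call_user_func($producer);
    $cache->set($key, $value, $ttl);

    return $value;
}

// Usage: the closure stands in for an expensive query or web service call.
$cache = new ArrayTtlCache();
$config = cache_remember($cache, 'app.config', 300, function () {
    return array('feature_x' => true);
});
```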

GUZZLE HTTP CLIENT HAS BUILT-IN SUPPORT FOR CACHING WEB SERVICE REQUESTS

// Connect to Memcached and wrap the connection in a Doctrine cache driver.
$memcache = new Memcache();
$memcache->connect('localhost', 11211);

$memcacheDriver = new Doctrine\Common\Cache\MemcacheCache();
$memcacheDriver->setMemcache($memcache);

$client = new Guzzle\Http\Client('http://www.test.com/');

// Register the cache plugin so cacheable responses are stored in Memcached.
$cachePlugin = new Guzzle\Plugin\Cache\CachePlugin(array(
    'storage' => new Guzzle\Plugin\Cache\DefaultCacheStorage(
        new Guzzle\Cache\DoctrineCacheAdapter($memcacheDriver)
    )
));
$client->addSubscriber($cachePlugin);

// The first request goes over the network; the second can be served from
// the cache if the response is cacheable.
$response = $client->get('http://www.wikipedia.org/')->send();
$response = $client->get('http://www.wikipedia.org/')->send();

DOCTRINE ORM FOR PHP HAS BUILT-IN CACHING SUPPORT FOR MEMCACHED AND REDIS

// Connect to Memcached and wrap the connection in a Doctrine cache driver.
$memcache = new Memcache();
$memcache->connect('localhost', 11211);

$memcacheDriver = new Doctrine\Common\Cache\MemcacheCache();
$memcacheDriver->setMemcache($memcache);

// Use the same driver for the query, metadata, and result caches.
$config = new Doctrine\ORM\Configuration();
$config->setQueryCacheImpl($memcacheDriver);
$config->setMetadataCacheImpl($memcacheDriver);
$config->setResultCacheImpl($memcacheDriver);

$entityManager = Doctrine\ORM\EntityManager::create(array('driver' => 'pdo_sqlite', 'path' => __DIR__ . '/db.sqlite'), $config);

// Cache the result set of this query for 60 seconds.
$query = $entityManager->createQuery('select u from EntitiesUser u');
$query->useResultCache(true, 60);
$users = $query->getResult();

DO BLOCKING WORK IN BACKGROUND TASKS VIA QUEUES

• Resque

• Gearman

• RabbitMQ

• Kafka

• Beanstalkd

• ZeroMQ

• ActiveMQ

RESQUE

• Any process that is slow and not important for the HTTP response should be queued

• Sending notifications + posting to social accounts

• Analytics + Instrumentation

• Updating profiles and discovering friends from social accounts

• Consuming web services like Twitter Streaming API
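The idea behind all of these is that the request handler only records what needs to happen, and a separate worker does the slow part later. The sketch below illustrates that split with an in-process queue built on `SplQueue`. It is not the Resque API: the `InMemoryJobQueue` class and the job names are invented for this example, and a real deployment would push jobs to Redis, Gearman, or another broker so that a different process can consume them.

```php
<?php
// Hypothetical in-process stand-in for a job queue such as Resque or Gearman.
class InMemoryJobQueue
{
    private $jobs;

    public function __construct()
    {
        $this->jobs = new SplQueue();
    }

    // Called from the request handler: cheap, returns immediately.
    public function enqueue($jobName, array $args)
    {
        $this->jobs->enqueue(array($jobName, $args));
    }

    // Called from a worker: runs every queued job that has a handler
    // and returns the number of jobs processed.
    public function work(array $handlers)
    {
        $processed = 0;
        while (!$this->jobs->isEmpty()) {
            list($jobName, $args) = $this->jobs->dequeue();
            if (isset($handlers[$jobName])) {
                call_user_func($handlers[$jobName], $args);
                $processed++;
            }
        }

        return $processed;
    }
}

// Usage: the web request enqueues, the worker sends the notification later.
$queue = new InMemoryJobQueue();
$queue->enqueue('send_notification', array('user_id' => 42));

$sent = array();
$queue->work(array(
    'send_notification' => function (array $args) use (&$sent) {
        $sent[] = $args['user_id'];
    },
));
```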

LEVERAGE HTTP CACHING

EXPIRES OR INVALIDATION

EXPIRATION

VALIDATION

EXPIRATION AND INVALIDATION
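The two models can be combined: expiration tells caches how long a response may be reused without asking, and validation lets a client with a stale copy ask whether it is still good. The following is a minimal, framework-free sketch of that combination. The function name `conditional_response` is hypothetical, and the `If-None-Match` handling is simplified: it compares a single strong ETag and ignores weak validators and comma-separated lists.

```php
<?php
// Hypothetical helper: returns array(status, headers, body) for a cacheable resource.
function conditional_response(array $requestHeaders, $body, $maxAge)
{
    // Validation: the ETag identifies this exact version of the body.
    $etag = '"' . md5($body) . '"';

    // Expiration: caches may reuse the response for $maxAge seconds.
    $headers = array(
        'ETag'          => $etag,
        'Cache-Control' => 'public, max-age=' . (int) $maxAge,
    );

    $ifNoneMatch = isset($requestHeaders['If-None-Match']) ? $requestHeaders['If-None-Match'] : null;
    if ($ifNoneMatch === $etag) {
        // The client's copy is still valid, so no body is sent.
        return array(304, $headers, '');
    }

    return array(200, $headers, $body);
}

// Usage: the first request gets the body, the revalidation gets a 304.
list($status, $headers, $content) = conditional_response(array(), 'Hello World!', 600);
list($revalidatedStatus) = conditional_response(
    array('If-None-Match' => $headers['ETag']),
    'Hello World!',
    600
);
```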

SYMFONY2 HTTPFOUNDATION COMPONENT WITH HTTP CACHING

use Symfony\Component\HttpFoundation\Response;

$response = new Response('Hello World!', 200, array('content-type' => 'text/html'));

// Set the validation (ETag, Last-Modified) and expiration (max-age) headers in one call.
$response->setCache(array(
    'etag'          => 'a_unique_id_for_this_resource',
    'last_modified' => new DateTime(),
    'max_age'       => 600,
    's_maxage'      => 600,
    'private'       => false,
    'public'        => true,
));

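use Symfony\Component\HttpFoundation\Request;
use Symfony\Component\HttpFoundation\Response;

$request = Request::createFromGlobals();

$response = new Response('Hello World!', 200, array('content-type' => 'text/html'));

// A validator must be set on the response, otherwise isNotModified() never matches.
$response->setETag(md5($response->getContent()));

// When the request validators match, the response becomes a 304 with no body.
// Otherwise the application would continue and send the full response.
if ($response->isNotModified($request)) {
    $response->send();
}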

• Varnish

• Squid

• Nginx Proxy Cache

• Apache Proxy Cache

USE VARNISH AS A REVERSE PROXY CACHE TO ALLEVIATE LOAD ON YOUR APP SERVERS

OPTIMIZE YOUR FRAMEWORK!

• Stay up-to-date with the latest stable version of your favorite framework

• Disable features you are not using (I18N, Security, etc)

• Always use a data cache like Memcached/Redis

• Enable caching features for views and database result sets

• Always use an HTTP cache like Varnish

SHARDING

TAKING A LARGE PROBLEM AND MAKING IT INTO MANAGEABLE SMALLER PROBLEMS
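In practice, sharding means choosing a rule that sends each record to one of several databases. The sketch below shows the simplest such rule, hashing a key and taking the remainder. The function name `shard_for_key` and the shard names are hypothetical, and the talk does not prescribe this scheme. Plain modulo hashing moves most keys when the number of shards changes, which is why larger systems often use consistent hashing or a lookup table instead.

```php
<?php
// Hypothetical helper: picks a shard for a key using a hash and a modulo.
function shard_for_key($key, array $shards)
{
    $shards = array_values($shards);
    if (count($shards) === 0) {
        throw new InvalidArgumentException('At least one shard is required.');
    }

    // Mask to 31 bits so the hash is non-negative on 32-bit and 64-bit builds.
    $hash = crc32((string) $key) & 0x7fffffff;

    return $shards[$hash % count($shards)];
}

// Usage: the same user ID always maps to the same database.
$shards = array('users_db_0', 'users_db_1', 'users_db_2');
$shard = shard_for_key('user:42', $shards);
```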

SERVICE ORIENTED ARCHITECTURE

JAVA/SCALA/ERLANG/GO/NODE BACKEND

PHP OR PURE JAVASCRIPT FRONTEND

PHP AS GLUE

COMPANIES OF GREAT SCALE MOVE AWAY FROM PHP OR CREATE THEIR OWN VARIANT

YAHOO! & YPHP

FACEBOOK & HIPHOP

LEARN HOW TO PROFILE CODE FOR PHP PERFORMANCE

XDEBUG + WEBGRIND

XHPROF + XHPROF GUI
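Both tools work by recording what happens between a start and a stop point. As a minimal sketch of that workflow, the wrapper below times a callable and, if the XHProf extension happens to be loaded, also collects its raw profile data. The function name `profile_call` is hypothetical; `xhprof_enable()`, `xhprof_disable()`, and the `XHPROF_FLAGS_*` constants come from the XHProf extension and are only touched when it is present.

```php
<?php
// Hypothetical helper: measures wall time and, when available, XHProf data.
function profile_call($callable)
{
    $useXhprof = function_exists('xhprof_enable');
    if ($useXhprof) {
        // Collect CPU and memory figures in addition to wall time.
        xhprof_enable(XHPROF_FLAGS_CPU + XHPROF_FLAGS_MEMORY);
    }

    $start = microtime(true);
    $result = call_user_func($callable);
    $wallMs = (microtime(true) - $start) * 1000;

    // xhprof_disable() stops profiling and returns the raw call data.
    $xhprofData = $useXhprof ? xhprof_disable() : null;

    return array(
        'result'  => $result,
        'wall_ms' => $wallMs,
        'xhprof'  => $xhprofData,
    );
}

// Usage: profile a deliberately wasteful piece of code.
$report = profile_call(function () {
    $sum = 0;
    for ($i = 0; $i < 100000; $i++) {
        $sum += $i;
    }
    return $sum;
});
printf("Took %.2f ms\n", $report['wall_ms']);
```

The raw XHProf array is what a viewer such as XHProf GUI renders as a call graph; the wall-time figure alone is only a coarse first check.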

• Upgrade to PHP 5.5 with Zend OPcache using PHP-FPM + Nginx

• Stay up to date with your framework + dependencies (using Composer)

• Optimize your session store to use signed cookies or database with caching

• Cache your database and web service access with Memcached or Redis

• Do blocking work in the background with queues and tasks using Resque

• Use HTTP caching and a reverse proxy cache like Varnish

• Profile code with Xdebug + Webgrind and monitor production performance

DON’T FORGET TO OPTIMIZE THE CLIENT SIDE

IN MODERN WEB APPLICATIONS MOST OF THE LATENCY COMES FROM THE CLIENT SIDE

USE ASSETIC TO OPTIMIZE CLIENT-SIDE ASSETS

GOOGLE PAGESPEED

GOOGLE PAGESPEED INSIGHTS

GOOGLE PAGESPEED API

curl "https://www.googleapis.com/pagespeedonline/v1/runPagespeed?url=http://dustinwhittle.com/&key=XXX"

SCALABILITY IS ABOUT THE ENTIRE ARCHITECTURE, NOT SOME MINOR CODE OPTIMIZATIONS.

QUESTIONS?

FIND THESE SLIDES ON SPEAKERDECK

https://speakerdeck.com/dustinwhittle
