README


home
@j4cob

It’s 2013, time to unshard your hostnames

17 Jan 2013

For a long time, conventional web wisdom has been to shard your resources across multiple hostnames for better page load performance. The HTTP 1.1 RFC says: “A single-user client SHOULD NOT maintain more than 2 connections with any server or proxy,” and for years all browsers followed this guideline. Steve Souders  wrote this excellent post on why two-connection limit hurts performance. In short, since each request on a given connection has to wait for the previous request to complete, you pay an extra round-trip time per resource, plus the backend server’s processing or loading time. With more connections, more resources are downloaded in parallel, saving time.

When Chrome launched  in 2008, it challenged the default. Chrome 1 created six connections per hostname in order to provide faster load times for the many web sites that didn’t implement hostname sharding. This was a big success, and now in 2013 all major desktop browser create six connections per hostname, with the exception of IE10, which creates eight. Similar is true of mobile browsers.

So if you shard your resources across four domains, as is common, you now create twenty-four connections where you originally intended to create eight. Actually, Yahoo published a report indicating two aliases (four connections) were optimal for a sample load. That’s without even considering the impact of DNS and SSL connection setup time.

The biggest disadvantage of hostname sharding is that each additional domain requires an additional DNS lookup. Each additional lookup costs a round-trip time to the user’s DNS server, and possibly longer if that server needs to recurse. Worse, since DNS uses UDP, when a packet drops the client has to wait for a timeout (typically one second) before retrying.

Now that more sites are using SSL for everything, there is the additional cost of setting up the SSL session for each of those connections. Depending on implementation and timing, those could be full handshakes (3.5 RTT + certificate, which can easily be 1.5kB+) or resumptions (2.5 RTT). You also need to get a separate certificate for each alias domain, or get a wildcard or SAN cert.

Sharding across multiple hostnames also adds code complexity. In order to maximize cacheability, you want every asset to map to exactly one hostname at all times. This is often done wrong, and doing it right can be a hassle. Best practice is typically hashing the path name and taking a modulus, but it can be annoying to keep the same logic between your frontend and backend code. It also means that all image paths need to be passed through a function to determine the correct hostname, which can complicate your templates.

In short, simplify your code and your DNS/SSL setup by using one hostname for all your assets, or two if you need a separate domain for your CDN. This will give you six or twelve concurrent connections, which is just about ideal.