Mega Bytes

1. How Much Bandwidth Do You Need?

Server bandwidth is the single most important factor in the performance of your web site. The math to determine what bandwidth you need is, in essence, very simple:

hits/second * average size of a hit in bits = bits/second

That is, you need some estimate of the number of hits per second you want to be able to serve. Then you need to know the average size of one of these hits. From this, you know what sort of network bandwidth you need.

1.1. Latencies Are More Important than Bandwidth

It has become clear that the number of packets is a more significant determinant of web performance than raw bandwidth once users are beyond ordinary dial-up modems. This is because each packet must be acknowledged, and the speed of light fixed, while bandwidth is increasing. It may take 20 milliseconds to send a 1500- byte packet to a PC on a DSL line, but only 12 milliseconds to get it from the network into the PC. It will take another 20 milliseconds for the acknowledgment to get back to the sender. So the 40 milliseconds latency is more than three times as important as bandwidth in this case, and it will only get more important later.

This is why it is so important to keep the number of individual items on a page to a minimum. Still, because most browsers are multithreaded, some latencies can happen in parallel. It turns out through experimentation that the best number of embedded images on a page is about the same as the number of threads the browser uses. For example, Netscape uses four threads, and you may get best performance by breaking a single large image into four smaller ones, in which acknowledgments can proceed in parallel rather than being strictly serial. But this holds only where the browser uses HTTP persistent connections ("keepalives") to avoid the overhead of setting up a TCP connection for each of the four smaller images.

Here are some numbers to help think about latency. While the latency from CPU to memory is on the order of 100 nanoseconds, LAN latency is usually about 1 millisecond, or 10,000 times slower. Going across a campus with its own network, latencies are about 5 milliseconds. Going across the Internet, latencies range from 10 to 500 milliseconds. Satellite links can take a whole second or more.

1.2. Thinking About Bandwidth

You can get some perspective when thinking about bandwidth from the Table Note that this chart uses the decimal million (1000000) and not "mega," which is 2²⁰ = 1048576.

Bandwidth comparison
Mode of data transfer	Million bits/second	Comments
Fast typist	0.000035	70 words/min × 5 chars/word × 6 bits/char × M/10⁶ × min/60 seconds.
4800 bps modem	0.004800	4800 bits/sec × M/10⁶. This is also about maximum human reading speed.
POTS sampling rate	0.064000	Voice over plain old telephone service is sampled at 64 kbps.
ISDN, two channels bonded	0.128000
One million 10000-byte web hits in one day, evenly distributed	0.925925	10⁶ hits/day × 80Kbits/hit × day/86400 seconds × M/10⁶.
Audio CD	1.411200	44100 samples/second × 16 bits/sample × 2 (stereo) × M/10⁶.
T-1 (DS-1 or primary ISDN)	1.544000	Carries 24 POTS channels with 8 kbps overhead.
Ethernet	10.00000
Token Ring	16.00000
IDE Hard disk	16.00000	Sustained throughput.
T-3 (DS-3)	44.60000	672 DS-0's, 28 DS-1's, or 7 DS-2's.
FDDI and Fast Ethernet	100.0000
ISA Bus	128.0000	16 bits @ 8 MHz.
Broadband ISDN	135.0000
ATM	154.0000
250 million 10000-byte web hits in one day, evenly distributed	231.4812	About one hit for every U.S. citizen.
EISA bus	264.0000
Wide Ultra SCSI disk controller	320.0000
100-nanosecond RAM	320.0000	32 bits / (100 × 10^-9) seconds × M/10⁶.
Gigabit Ethernet	1000.000
PCI Bus	2112.000	64 bits @ 33 MHz.
AT&T Sonet	2400.000	Long-distance fiber trunk.
CPU	3200.000	Hypothetical CPU processing 32-bit instructions at 100Mhz, one per clock cycle.
Human eye-to-brain rate	5600.000	Estimated.
Highest achieved fiber optics	16000.00	Bell Labs.
Theoretical fiber capacity	64000.00

This chart ignores latency, which varies even from bit to bit, and can be huge, especially upon startup of any component. If you're a bit romantic, you can imagine a blurry picture of virtual reality coming into focus over the years in this chart, from short symbol transmissions twenty years ago, to the limit of human audio perception today, to the limit of visual perception in the coming years.

1.3. Estimating Web Server Network Bandwidth

The following chart displays an estimate of the number of hits per second of a given size (y-axis) a given amount of bandwidth (x-axis) can handle with a 30 percent deduction for TCP/IP and other network overhead. Numbers are truncated to integers, so 0 means "less than one per second" rather than truly zero .

Web server bandwidth requirements
Hit Size	28.8K	56k	ISDN(2)	T1	10bT	T3	100bT Ethernet
1 KB	2	4	10	132	854	3845	8544
2 KB	1	2	5	66	427	1922	4272
4 KB	0	1	2	33	213	961	2136
8 KB	0	0	1	16	106	480	1068
16 KB	0	0	0	8	53	240	534
32 KB	0	0	0	4	26	120	267
64 KB	0	0	0	2	13	60	133
132 KB	0	0	0	1	6	30	66
264 KB	0	0	0	0	3	15	33
512 KB	0	0	0	0	1	7	16
1 MB	0	0	0	0	0	3	8
2 MB	0	0	0	0	0	1	4

You can use the table to estimate, for example, how many 4K files per second your T1 line can handle. The answer is 33. Keep in mind that the table refers just to the network capacity and does not say whether the load was generated by static HTML or CGIs. That is, network capacity says nothing about the capacity of your disk or server CPU, or of a database behind the web server.

In fact, the capacity of a server to fill a network connection is distinctly nonlinear. Smaller packets require a larger overhead in terms of interrupts and packet header processing. This means that sending two packets will require more of the server than combining them into one larger packet.

The table is also a bit deceptive in that you will rarely see a smooth distribution of hits filling your network capacity. Rather, there will be peaks of three to four times the average rate per second and long troughs of no load at all.

To scale any of these connection types, you can add more network cards or modems until you reach the maximum number of cards or modems the server has room for. Then you can move up to the next connection type or to a bigger server. This is the easy way to do things, throwing more hardware into a single server. Scaling across multiple servers is typically more complicated, requiring load balancing strategies.

Mega Bytes

Monday, 3 May 2010

1. How Much Bandwidth Do You Need?

1.1. Latencies Are More Important than Bandwidth

1.2. Thinking About Bandwidth

Bandwidth comparison

1.3. Estimating Web Server Network Bandwidth

Web server bandwidth requirements

No comments:

Post a Comment

Followers

Blog Archive

About Me