Dit artikel is ook in het Nederlands beschikbaar.
On November 14 Intel introduced its first quad-core server chips in the form of the Xeon 5300 series, also known under the code name Clovertown. Although the chip is technically pretty much identical to the desktop version which we looked at earlier, it runs on a different platform and is meant for other tasks. In this article we look at the influence of the step from dual-core to quad-core on Intel's position and how Clovertown holds up in our database test.Since the Xeon 5300 'Clovertown' is, with the exception of a faster bus, identical to the Core 2 Extreme QX6700 'Kentsfield' - whose architecture we looked at extensively in this review - this article has a somewhat more light-hearted introduction. In the table below, the new quad-core is compared to Intel's very first microprocessor, the 4004. This processor was in the spotlight again recently, because it was 35 years ago on November 13 that the chip hit the market.
| 4004 | Clovertown | Difference | |||
|---|---|---|---|---|---|
| Introduction | 1971 | 2006 | 35 years later | ||
| Price | ~$935* | $1172 | 25% more expensive | ||
| Cores | 1 | 4 | 4x as much | ||
| Instructions per clock tick | 1 | 16 | 4x as much per core | ||
| Clock speed | 108kHz | 2.66GHz | 24.691x higher | ||
| Bus frequency | 108kHz | 1333MHz | 12.346x higher | ||
| Bandwidth | 54KB/s | 10.6GB/s | 197.530x more | ||
| Production technique | 10000nm | 65nm | 154x smaller | ||
| Wafer size | 50mm | 300mm | 36x larger surface | ||
| Transistors | 2300 | 582 million | 63.261x more per core | ||
| Physical size | 12mm² | 286mm² | 6x larger per core | ||
| Contact points | 16 | 771 | 48x more | ||
| Registers | 4 bit | 64 bit | 16x wider | ||
| Address space | 640KB | 16EB | Unimaginable | ||
| Instructions | 46 | ~700 | 15x as many | ||
| Consumption | 1W | 120W | 30x higher per core | ||
| Voltage | 15V | 1.34V | >90% lower | ||
| Power | 0,07A | 90A | 1286x stronger | ||
* Corrected for inflation - original price: $200. Collectors are paying $1000 for it.
The difference is humongous: with its clock of a tenth of a megahertz, the 4004 could perform a little over 100,000 instructions per second, while Clovertown can manage almost 43 billion in the same time span. To equal the theoretical computing power of a quad-core Xeon, 395.061 4004's would be needed, and then we are not even recognising the fact that one does its math with 4 bits while the other uses 64 bits. We'll save Intel's first-born from a crushing comparison in terms of performance per Watt and per dollar, but it should be clear that the small steps that we have seen over the years, exhibit extreme differences taken across a few decades.Summer of servers, SPEC CPU
Last summer was the 'summer of servers' according to Intel, since it renewed its complete offering of server processors. It all started on May 23 with Dempsey, a 65nm dual-core Netburst Xeon that was launched along with a new chipset, with more than twice the amount of bandwidth as the previous generation. A little over a month later – on June 26 to be precise - Woodcrest was released, based on the new Core architecture and busses that were 25% quicker. July 18 saw the launch of Itanium, with the introduction of the dual-core Montecito, a chip with 24MB of L3 cache and over 1,7 billion transistors. Another month went by and on August 29 a new Xeon MP came out under the code name Tulsa, with 16MB of L3 cache. Clovertown – the fist quad-core Xeon – was delivered on November 14 as an afterbirth.

Intel has put in extra hours in the second half of the past year to overtake the competition. All that AMD managed to do to counter that was the introduction of the Socket F models (which, except for the 2.8GHz flavour did little to increase performance), and a price cut. Although that does not mean that AMD did badly in terms of contracts it won and market share – quite the contrary – it does mean that Intel has a lot more weight than it had for a long time. Although the quad-core Opteron Barcelona might tip the balance again in six months, Intel is ahead in many benchmarks at the moment.
Before we turn to our own comparative database test, we shall show the current state of affairs in database country. For completeness: although the 'non-tweakers.net'-scores that have been collected below are practical scores, they have been measured independently This means that there may be differences in a number of parameters of the various system configurations. It also means that better results are not always fully due to hardware improvements: after all, the software that comes with it also tends to get improved. But it is safe to assume that each system builder does his best to achieve the best results, and with this in mind the scores may be considered indicative.
SPEC CPU
We start with SPECint_rate and SPECfp_rate, two benchmarks designed to measure raw processor performance. The 'INT' (integer) suite consists of a compiler, chess programs, compression and text processing, while the 'FP' (floating point) suite contains, among others, face recognition, neural networks and physics as well as chemistry simulations. All subtests are based on software that is also used 'in real life', but the code has been altered in certain spots to minimize, among other things, hard disk load and improve portability to other platforms.
The addition 'rate' indicates that we are not dealing with a test of a single chip, but of all cores in a system simultaneously, which makes bandwidth a significant factor. 'Peak' indicates that the compiler may be tuned to the maximum extent. A standard run demands that everything is built with the same setting, but for a 'peak' run every individual test may have its own parameters. Since Clovertown-servers have a unique configuration of eight cores in two socket, we compare it to systems that have eight cores in four sockets, as well as configurations with four cores in two sockets. In the integer benchmark we see that the new Xeon beats the competition: the Core architecture already proved itself to be good at these sort of tasks, but improving the best scores of four socket systems is an impressive achievement.
In the FP benchmark it turns out that the quad-core doesn't do so well, which is presumably due to its limited bandwidth – which is something that SPECfp_rate can't get enough of. But we do see that Intel has made steady improvements over the course of the year, from a meagre 40.3 with the Paxville to a respectable score of 104. That is still lower than the Opteron (although it would be a narrow victory without compiler magic) but the problem is that AMD still has the step to quad-core architecture and 128 bit computation units ahead, while Clovertown has already been there. The Itanium can keep up, but this test remains one of AMD's favourites.

Intel has put in extra hours in the second half of the past year to overtake the competition. All that AMD managed to do to counter that was the introduction of the Socket F models (which, except for the 2.8GHz flavour did little to increase performance), and a price cut. Although that does not mean that AMD did badly in terms of contracts it won and market share – quite the contrary – it does mean that Intel has a lot more weight than it had for a long time. Although the quad-core Opteron Barcelona might tip the balance again in six months, Intel is ahead in many benchmarks at the moment.
Before we turn to our own comparative database test, we shall show the current state of affairs in database country. For completeness: although the 'non-tweakers.net'-scores that have been collected below are practical scores, they have been measured independently This means that there may be differences in a number of parameters of the various system configurations. It also means that better results are not always fully due to hardware improvements: after all, the software that comes with it also tends to get improved. But it is safe to assume that each system builder does his best to achieve the best results, and with this in mind the scores may be considered indicative.
We start with SPECint_rate and SPECfp_rate, two benchmarks designed to measure raw processor performance. The 'INT' (integer) suite consists of a compiler, chess programs, compression and text processing, while the 'FP' (floating point) suite contains, among others, face recognition, neural networks and physics as well as chemistry simulations. All subtests are based on software that is also used 'in real life', but the code has been altered in certain spots to minimize, among other things, hard disk load and improve portability to other platforms.
The addition 'rate' indicates that we are not dealing with a test of a single chip, but of all cores in a system simultaneously, which makes bandwidth a significant factor. 'Peak' indicates that the compiler may be tuned to the maximum extent. A standard run demands that everything is built with the same setting, but for a 'peak' run every individual test may have its own parameters. Since Clovertown-servers have a unique configuration of eight cores in two socket, we compare it to systems that have eight cores in four sockets, as well as configurations with four cores in two sockets. In the integer benchmark we see that the new Xeon beats the competition: the Core architecture already proved itself to be good at these sort of tasks, but improving the best scores of four socket systems is an impressive achievement.
| Opteron 8220 | 4 | 2,8GHz | Santa Rosa | |||||||
| Xeon MP 7140 | 4 | 3,4GHz | Tulsa | |||||||
| Power5 | 4 | 1,9GHz | ||||||||
| Itanium 2 9050 | 4 | 1,6GHz | Montecito | |||||||
| Xeon MP 7041 | 4 | 3,0GHz | Paxville | |||||||
| 2 | 2,66GHz | Clovertown | ||||||||
| Xeon 5160 | 2 | 3,0GHz | Woodcrest | |||||||
| Opteron 2280 | 2 | 2,8GHz | Santa Rosa | |||||||
| Power5+ | 2 | 2,1GHz | ||||||||
| Xeon 5080 | 2 | 3,73GHz | Dempsey | |||||||
| Xeon DC | 2 | 2,8GHz | Paxville | |||||||
In the FP benchmark it turns out that the quad-core doesn't do so well, which is presumably due to its limited bandwidth – which is something that SPECfp_rate can't get enough of. But we do see that Intel has made steady improvements over the course of the year, from a meagre 40.3 with the Paxville to a respectable score of 104. That is still lower than the Opteron (although it would be a narrow victory without compiler magic) but the problem is that AMD still has the step to quad-core architecture and 128 bit computation units ahead, while Clovertown has already been there. The Itanium can keep up, but this test remains one of AMD's favourites.
| Power5 | 4 | 1,9GHz | ||||||||
| Itanium 2 9050 | 4 | 1,6GHz | Montecito | |||||||
| Opteron 8220 | 4 | 2,8GHz | Santa Rosa | |||||||
| Xeon MP 7140 | 4 | 3,4GHz | Tulsa | |||||||
| Xeon MP 7041 | 4 | 3,0GHz | Paxville | |||||||
| Power5+ | 2 | 2,1GHz | ||||||||
| Itanium 2 9050 | 2 | 1,6GHz | Montecito | |||||||
| Opteron 2220 | 2 | 2,8GHz | Santa Rosa | |||||||
| 2 | 2,66GHz | Clovertown | ||||||||
| Xeon 5160 | 2 | 3,0GHz | Woodcrest | |||||||
| Xeon 5080 | 2 | 3,73GHz | Dempsey | |||||||
| Xeon DC | 2 | 2,8GHz | Paxville | |||||||
TPC-C, SAP-SD, and SPECjbb2005
SPEC CPU is an interesting test for all types of processor tasks, but there's more to life than pure maths. A common server task is running some type of database, and TPC-C is a benchmark that is often used as a measure for database performance. The test simulates the business processes of a distributor with multiple offices and hundreds of thousands of customers and products. Performance is measured in the number of transaction per minute. Since the test is sensitive to differences in the speed of the storage systems used as well as memory, a price/performance rating is given to make clear when a manufacturer ups his score artificially by investing a couple of million in hard disks. What isn't reflected in the score is the operating systems and database packages used; these may be chosen freely. Itanium and Power get their best scores running Oracle under HP-UX and running DB2 under AIX, while the x86 chips get tested pretty much exclusively running SQL Server under Windows.
It turns out that Clovertown has the best performance as well as the best price/performance ratio of all two socket servers, but it does not measure up against the big boys with four. The Opteron does not convince at first sight, but whoever takes the absolute prices into consideration instead of performance per dollar, will note that the AMD machines are the cheapest. Incidentally, the same version of SQL Server (64 bit) was used to obtain the Opteron and Clovertown scores. Woodcrest, Dempsey, and Tulsa lack SP1, which might put them at a slight disadvantage.
SAP-SD (Sales & Distribution) is a similar benchmark in that it also simulates business processes, but rather than transactions per minute, it is the number of users that can work on the system simultaneously, that is given as the final score. As the name indicates, this test is aimed more at the well known ERP package than on the underlying database, but that is not to say that it is not sensitive to factors outside the processor.
Unfortunately, prices may not be listed along with the test results. What we can do instead is check the CPU loads to see whether the system was pushed to its limit. In virtually all cases the load is above 95%, with the exception of the four-way Paxville. This means that the processor was a limiting factor in the rest of the results, and there would be little point in adding extra hard disks or memory. Here, too, we see Intel's come-back: the performance for two sockets has been more than doubled and for four has increased by more than half - enough to overtake the Opteron in both cases.
The last business benchmark is SPECjbb2005, that simulates a triple layer architecture which emphasizes the middle one, the so-called 'business logic', in which all XML processing and such takes place. The test was written entirely in Java, so the performance of the virtual machine is at least as important as that of the software itself. The score is expressed in BOPS - business operations per second. And again, Clovertown dominates with a score that even four-socket systems cannot get close to.
It turns out that Clovertown has the best performance as well as the best price/performance ratio of all two socket servers, but it does not measure up against the big boys with four. The Opteron does not convince at first sight, but whoever takes the absolute prices into consideration instead of performance per dollar, will note that the AMD machines are the cheapest. Incidentally, the same version of SQL Server (64 bit) was used to obtain the Opteron and Clovertown scores. Woodcrest, Dempsey, and Tulsa lack SP1, which might put them at a slight disadvantage.
| Power5 | 4 | 1,9GHz | ||||||||
| Itanium 2 9050 | 4 | 1,6GHz | Montecito | |||||||
| Xeon MP 7140 | 4 | 3,4GHz | Tulsa | |||||||
| Opteron 8220 | 4 | 2,8GHz | Santa Rosa | |||||||
| Xeon MP 7041 | 4 | 3,0GHz | Paxville | |||||||
| 2 | 2,66GHz | Clovertown | ||||||||
| Itanium 2 9050 | 2 | 1,6GHz | Montecito | |||||||
| Power5 | 2 | 1,9GHz | ||||||||
| Xeon 5160 | 2 | 3,0GHz | Woodcrest | |||||||
| Opteron 2220 | 2 | 2,8GHz | Santa Rosa | |||||||
| Xeon 5080 | 2 | 3,73GHz | Demspey | |||||||
SAP-SD (Sales & Distribution) is a similar benchmark in that it also simulates business processes, but rather than transactions per minute, it is the number of users that can work on the system simultaneously, that is given as the final score. As the name indicates, this test is aimed more at the well known ERP package than on the underlying database, but that is not to say that it is not sensitive to factors outside the processor.
Unfortunately, prices may not be listed along with the test results. What we can do instead is check the CPU loads to see whether the system was pushed to its limit. In virtually all cases the load is above 95%, with the exception of the four-way Paxville. This means that the processor was a limiting factor in the rest of the results, and there would be little point in adding extra hard disks or memory. Here, too, we see Intel's come-back: the performance for two sockets has been more than doubled and for four has increased by more than half - enough to overtake the Opteron in both cases.
| Itanium 2 9050 | 4 | 1,6GHz | Montecito | |||||||
| Xeon MP 7140 | 4 | 3,4GHz | Tulsa | |||||||
| Opteron 2280 | 4 | 2,8GHz | Santa Rosa | |||||||
| Xeon MP 7041 | 4 | 3,0GHz | Paxville | |||||||
| 2 | 2,66GHz | Clovertown | ||||||||
| Xeon 5160 | 2 | 3,0GHz | Woodcrest | |||||||
| Opteron 2218 | 2 | 2,6GHz | Santa Rosa | |||||||
| Xeon 5080 | 2 | 3,73GHz | Demspey | |||||||
| Xeon DC | 2 | 2,8GHz | Paxville | |||||||
The last business benchmark is SPECjbb2005, that simulates a triple layer architecture which emphasizes the middle one, the so-called 'business logic', in which all XML processing and such takes place. The test was written entirely in Java, so the performance of the virtual machine is at least as important as that of the software itself. The score is expressed in BOPS - business operations per second. And again, Clovertown dominates with a score that even four-socket systems cannot get close to.
| Xeon MP 7140 | 4 | 3,4GHz | Tulsa | |||||||
| Opteron 8220 | 4 | 2,8GHz | Santa Rosa | |||||||
| Itanium 2 9050 | 4 | 1,6GHz | Montecito | |||||||
| Power5+ | 4 | 1,65GHz | ||||||||
| 2 | 2,66GHz | Clovertown | ||||||||
| Xeon 5160 | 2 | 3,0GHz | Woodcrest | |||||||
| Opteron 2220 | 2 | 2,8GHz | Santa Rosa | |||||||
| Xeon 5080 | 2 | 3,73GHz | Dempsey | |||||||
| Power5+ | 2 | 1,65GHz | ||||||||
| Xeon DC | 2 | 2,8GHz | Paxville | |||||||
| UltraSparc T1 | 1 | 1,2GHz | Niagara | |||||||
SPECweb2005 and summery
The last benchmark that we look at here is SPECweb2005, a test that looks at the performance of machines when put to use as web servers, and examines dynamical page creation (in PHP or JSP), encryption (SSL connection) and web services. The result is a weighed average of the number of simultaneous sessions, in three scenarios: banking, shopping and support. The good performance of Niagara is noteworthy here: its performance is comparable to that of a dual Woodcrest or even a quad Tulsa. It cannot keep up with a double Clovertown or quadruple Opteron though.
In summary, we can conclude that Intel has made some significant gains during the past year. In the two socket segment we have seen average gains of 24% over the Opteron for the Woodcrest. The only test that Intel doesn't win is SPECfp_rate, but it has managed to bring down the difference considerably. Following Clovertown's introduction, the average performance difference between Xeon and Opteron went up to as much as 79%. Although AMD is certain to reduce that gap with the introduction of its own quad-core, it is at least half a year before that is released. For the time being, Intel can continue to improve its once-tarnished reputation and increase its market share, while the best AMD can do is reduce the damage with lower prices.
In the four-way server segment the story is less clear. Intel has gone from being hopelessly behind to a more or less equal performance, winning some tests and losing certain others. AMD is still ahead in terms of price and power consumption. Although Tulsa may be just about good enough to prevent further erosion of the market share, Intel will have to come up with something more convincing to turn the tide, especially since Opteron is also getting four cores next year. Intel has put its hope on Tigerton in order to fight off the AMD's new generation.
| Opteron 8220 | 4 | 2,8GHz | Santa Rosa | |||||||
| Xeon MP 7140 | 4 | 3,4GHz | Tulsa | |||||||
| 2 | 2,66GHz | Clovertown | ||||||||
| Xeon 5160 | 2 | 3,0GHz | Woodcrest | |||||||
| Opteron 285 | 2 | 2,6GHz | Italy | |||||||
| Power5+ | 2 | 1,9GHz | ||||||||
| Xeon 5080 | 2 | 3,73GHz | Dempsey | |||||||
| Xeon DC | 2 | 2,8GHz | Paxville | |||||||
| UltraSparc T1 | 1 | 1,2GHz | Niagara | |||||||
In summary, we can conclude that Intel has made some significant gains during the past year. In the two socket segment we have seen average gains of 24% over the Opteron for the Woodcrest. The only test that Intel doesn't win is SPECfp_rate, but it has managed to bring down the difference considerably. Following Clovertown's introduction, the average performance difference between Xeon and Opteron went up to as much as 79%. Although AMD is certain to reduce that gap with the introduction of its own quad-core, it is at least half a year before that is released. For the time being, Intel can continue to improve its once-tarnished reputation and increase its market share, while the best AMD can do is reduce the damage with lower prices.
In the four-way server segment the story is less clear. Intel has gone from being hopelessly behind to a more or less equal performance, winning some tests and losing certain others. AMD is still ahead in terms of price and power consumption. Although Tulsa may be just about good enough to prevent further erosion of the market share, Intel will have to come up with something more convincing to turn the tide, especially since Opteron is also getting four cores next year. Intel has put its hope on Tigerton in order to fight off the AMD's new generation.
| 4-way | Opteron | Tulsa | Difference |
|---|---|---|---|
| SPECint_rate_peak2000 | 175 | 164 | -6% ![]() |
| SPECfp_rate_peak2000 | 178 | 110 | -38% ![]() |
| TPC-C | 262989 | 318407 | 21% ![]() |

