motherboards
Arctic
Arctic Feezer 7x
Results 1 to 4 of 4

Thread: Interesting find: AMD’s Bulldozer CMT Scaling

  1. #1
    Banned

    Status
    Drdeath is offline

    Last Online
    02-07-2013 @ 01:04
    Join Date
    May 2010
    Location
    Chicago Burbs
    Posts
    8,549
    CPU: Intel Core i7 3930K @ 5GHz
    M/B: ASROCK X79 Extreme 6
    RAM: 16G G.Skill Trident 2400MHz
    GPU: 2 x Radeon 2G 7970 Liquid cooled
    • Drdeath's Full Spec's
      • Case:
      • Phobya CooliT TestBennch
      • PSU:
      • Nexus 1100W
      • Cooling:
      • Custom water. Apogee XT Rev 1, Thermachill TA120.3 plus 120.2, Swiftec Micro res, MCP655
      • Sound:
      • OB
      • Monitor:
      • Viewsonic 27"
      • OS:
      • Win7 64bit
    Thanks
    275
    Thanked 1,022 Times in 880 Posts
    Points: 28,339, Level: 51
    Points: 28,339, Level: 51
    Level completed: 72%,
    Points required for next Level: 311
    Level completed: 72%, Points required for next Level: 311
    Overall activity: 99.9%
    Overall activity: 99.9%

    Default Interesting find: AMD’s Bulldozer CMT Scaling


  2. #2
    Banned

    Status
    Drdeath is offline

    Last Online
    02-07-2013 @ 01:04
    Join Date
    May 2010
    Location
    Chicago Burbs
    Posts
    8,549
    CPU: Intel Core i7 3930K @ 5GHz
    M/B: ASROCK X79 Extreme 6
    RAM: 16G G.Skill Trident 2400MHz
    GPU: 2 x Radeon 2G 7970 Liquid cooled
    • Drdeath's Full Spec's
      • Case:
      • Phobya CooliT TestBennch
      • PSU:
      • Nexus 1100W
      • Cooling:
      • Custom water. Apogee XT Rev 1, Thermachill TA120.3 plus 120.2, Swiftec Micro res, MCP655
      • Sound:
      • OB
      • Monitor:
      • Viewsonic 27"
      • OS:
      • Win7 64bit
    Thanks
    275
    Thanked 1,022 Times in 880 Posts
    Points: 28,339, Level: 51
    Points: 28,339, Level: 51
    Level completed: 72%,
    Points required for next Level: 311
    Level completed: 72%, Points required for next Level: 311
    Overall activity: 99.9%
    Overall activity: 99.9%

    Default Re: Interesting find: AMD’s Bulldozer CMT Scaling

    AMD’s Bulldozer CMT Scaling

    Posted by Aten-Ra in Feb 01, 2012, under Reviews/Articles
    Introduction
    Lately it’s becoming more and more obvious that single thread performance cannot scale higher that easily than it did a few years ago. It takes enormous amount of resources and time to just raise the IPC (Instructions Per Cycle) a few percentage higher. To raise the performance even higher than that, it will require raising the size of the core substantially resulting in unwanted enormous higher power consumption for the performance gain.
    .
    Today’s IT needs are focused more on power reduction and higher performance per power usage than absolute brute force that was the norm in the past. From servers to workstations and a few years now down to desktops, the need for more parallelization is becoming a necessity. Microprocessors have changed from a single core to multi core designs as engineers are trying to find ways to raise performance with fewer diminishing returns.
    .
    CMP, SMT and CMT
    The easy way for higher thread parallelism is to install more cores in the same die, which is called CMP (Chip Multi Processor). An example of the CMP is AMDs Hexa-core Phenom II X6 processors with 6 cores in a single die, sharing the same L3 cache. The down side is a bigger die and higher power usage depending of the number of cores incorporated in to the same die.

    .
    Although Intel has quad and hexa-core CMP microprocessors they have also implemented the SMT (Simultaneous Multi-Threading) known by the name Hyper-Treading in to their microprocessor core design. In the SMT, each core can process (fetch, decode, execute and retire) two threads simultaneously by sharing all the resources of the single core by the two threads. By doing that, we have higher parallelization and at the same time keeping the die size and power levels down. Because of the sharing nature of the SMT, the second thread can only access the resources of the core that the first thread cannot use, resulting in a smaller performance scaling than the CMP but with much smaller die size and much less power usage.
    .
    AMD has recently incorporated a new method called CMT (Cluster Multi-Threading) in to its new Bulldozer microprocessors. In CMT, each module can process two threads simultaneously using shared and dedicated resources of the module. The fundamental difference between SMT and CMT is that the later has more dedicated recourses to support the processing of two threads simultaneously. Because of that, multithreading performance scaling is higher in the CMT design than in SMT but lower than CMP. AMD claims that its CMT architecture can have 80% of the CMP performance with a smaller die size and less power usage.

    .
    Testing procedure
    In order to measure the Thread scaling performance we used the following three microprocessors from AMD and Intel.
    AMD Phenom II X6 1100T, AMD FX 8150 and Intel Core i7 2600K.
    .
    Processor frequencies remained constant at base level by disabling Turbo in all three processors.
    .
    Phenom II X6 was used as a reference for the CMP processor. We measured with a single core, dual core, quad core and 6-core. Base frequency 3.3GHz
    .
    Intel Core i7 2600K was used as the SMT processor since it has Hyper-Threading. We measured with a single core, single core with HT (dual threads SMT), dual core(CMP), dual core + HT (4 threads SMT), quad core(CMP) and quad core + HT (8 Threads SMT). Base frequency 3.4GHz
    .
    AMD FX8150 was used for the CMT processor. It was measured with single core, single Module, two cores, two modules, four cores and four modules. Base frequency 3.6GHz
    .
    Mod x1 Core 1 = Single thread (Only a single core in a single Module)
    Mod x1 Threads x2 = Dual Threads CMT (both cores in a single Module)
    Mod x2 Threads x2 = Dual Threads CMP (Only one core per two Modules used)
    Mod x2 Threads x4 = Quad Threads CMT (dual Modules with four cores)
    Mod x4 Threads x4 = Quad Threads CMP (Four Modules with only one core per Module)
    Mod 4 Threads x8 = 8 Threads CMT (Four Modules with 2 cores each)
    .
    The rest of the hardware
    Motherboard for the AMD Processors: ASUS Crosshair V Formula
    Motherboard for the Intel Processor: GIGABYTE Z68XP-UD3P
    Memory: 2x 4GB Kingston DDR-3 1600MHz 9-9-9-12
    VGA: ASUS HD6950 1GB at 889MHz core and 1300MHz for the memory.
    HDD: Seagate 500GB SATA II 7200rpm
    PSU: ThermalTake SP-730P 730W 80+
    Windows 7 Ultimate 64bit
    .
    Software Used
    POV-Ray 3,7 RC (Balcony Project at 1024×768, AA 0,3)
    Cinebench 11,5 (Multithread)
    7-zip (32M
    x264 HD v4.0
    TrueCrypt (500MB AES)
    POV-Ray 3,7 RC
    We start with POV-Ray, AMD’s FX single thread performance is 21% lower than last generation Phenom II and 74% lower than Intel’s Core i7.
    Scaling from single core to dual Tread in CMT is 77,14% when SMT scaling is at 31,9%. With 8 Threads the FX8150 scales higher than 8 thread SMT.

    .
    Cinebench 11,5 (MultiThread)

    The single core performance of the FX8150 is very low in this benchmark, but CMT scaling is very high. SMT scales at 24,63% at dual thread and CMP scaling of four threads is at 399,25 for the Intel processor. FX exhibits the higher scaling both in CMP and CMT modes against the Core i7 in this test .


    .
    7-zip
    The 7zip score is the compination of both Compress and Decompress scores of a 32MB file. FX 8150 single thread performance is on par with Phenom II but still lags 32% of the Core i7 2600K. Again FX-1850 has the higher scaling both in CMP and CMT modes and it manages to catch the performance of an 8 thread SMT Core i7 2600K.


    .
    x264 HD v4.0 (Second Pass)

    This is the first benchmark that FX8150 single thread performance is faster than Phenom II but Intel Core i7 is still 26,5% faster. Because FX has a strong single thread performance and higher thread scaling it manages to catche Intels Core i7 performance at the Dual thread SMT mode and continues to be in front of it even in quad and 8 thread SMT mode.


    .
    TrueCrypt (AES)

    Because Phenom II doesn’t have AES, we only tested FX and Core I processors in this benchmark. Again FXs single thread performance is lower by 35% than Intels Core i7 but due to higher scaling it catches the Intel processor at the end.

    Conclusion
    AMDs CMT design scales much higher than Intels SMT and it is close to the 80% claims AMD have made. It seams that if FXs single thread performance is close to or better than 35% in relationship to Core i7 single thread performance, then both CPUs perform the same at a higher thread level. But if the single core performance is lower than 35% then the higher CMT scaling cannot help the AMD FX processor to catch Intel’s stronger single thread performance.
    .
    It will be very interesting to see what will happen if AMD could increase its single thread performance keeping the higher scaling at the same time. CMT is an interesting and new technology for the desktop platform and windows as well software is not yet optimized for it. When software can take advantage of FX SIMD’s instructions and CMT scaling as shown in AES TrueCrypt, the performance is in par with the Intel Core i7.
    The AMD CMT architecture in Bulldozer has a lot of potential and performance will only go up in the future with Piledriver and more optimized software.

  3. The following user thanks Drdeath For this above post:

    SamCR3 (07-02-2012)

  4. #3
    Core Member

    Status
    SamCR3 is offline

    Last Online
    21-06-2020 @ 14:04
    Join Date
    Jul 2011
    Location
    Seattle, Washington USA
    Posts
    4,222
    CPU: AMD FX-8350 @ 4.6Ghz
    M/B: Asus Crosshair V Formula
    RAM: 16GB Corsair Dominator GT 1866MHz (9-10-9-27)
    GPU: XFX HD 7970 Black Edition 3GB and Diamond HD 7970 3GB both @ 1125/1575MHz
    • SamCR3's Full Spec's
      • Case:
      • NZXT Switch 810
      • PSU:
      • OCZ Fatal1ty 1000 Watt
      • Cooling:
      • XSPC RayStorm CPU Block, XSPC X20 750 Pump/Res, CoolIt 360MM Rad. w/6 x Noctua NF-F12 PWM Fans (Push/Pull) and PrimoFlex Pro LRT UV Red Tubing; 5 x 140 MM Bitfenx Spectre Pro Red LED Fans on NZXT Sentry Mix Fan Controller
      • Sound:
      • SupremeFX X-Fi 2
      • Monitor:
      • 1 x Samsung SyncMaster SA350H 27 in.
      • OS:
      • Windows 7 Home Premium 64-bit
      • Misc:
      • CM Storm Trigger Mechanical Keyboard; CM Storm Sentinel Zero-G Gaming Mouse; Logitech z523 2.1 Speaker System; Logitech 600 Desktop Microphone
    Thanks
    867
    Thanked 430 Times in 386 Posts
    Points: 26,844, Level: 50
    Points: 26,844, Level: 50
    Level completed: 30%,
    Points required for next Level: 706
    Level completed: 30%, Points required for next Level: 706
    Overall activity: 0%
    Overall activity: 0%

    Default Re: Interesting find: AMD’s Bulldozer CMT Scaling

    Very interesting indeed. Maybe AMD's claims that software just isn't available yet to take full advantage of FX's architecture is true. Only time will tell, but will be interesting to watch. Piledriver should be very interesting if AMD can do as the reviewer suggests and optimize the single thread performance.
    2nd Rig: AMD FX-8120; Corsair H105 ; Gigabyte 990FXA-UD3; 16GB G.Skill Ripjaws X Series 1600MHz; 2 x PowerColor HD 7870 Myst. Ed. (Tahiti LE); Fractal Design Arc XL; SeaSonic X Series 850W
    3rd Rig: AMD A6-3650; Asetek 550LC; Asus F1A75-M Pro; 8GB Kingston HyperX; XFX HD 6670 1GB GDDR5; Antec 300; Thermaltake TR2 650W

  5. #4
    OC Jedi Master

    Status
    Deton is offline

    Last Online
    27-02-2014 @ 22:13
    Join Date
    Dec 2006
    Location
    Canada
    Posts
    8,493
    CPU: i7 920, 4.2Ghz HT
    M/B: P6X58D Premium
    RAM: OCZ Flex EX PC3-17000 12GB
    GPU: Asus EAH5870
    • Deton's Full Spec's
      • Case:
      • Lian Li A70F
      • PSU:
      • Enermax Galaxy Evo 1250W
      • Cooling:
      • Custom H20 setup.
      • Sound:
      • Onboard
      • Monitor:
      • ViewSonic VX2435
      • OS:
      • Win 7 - 64bit
    Thanks
    45
    Thanked 1,179 Times in 1,026 Posts
    Points: 50,246, Level: 69
    Points: 50,246, Level: 69
    Level completed: 36%,
    Points required for next Level: 904
    Level completed: 36%, Points required for next Level: 904
    Overall activity: 7.0%
    Overall activity: 7.0%

    Default Re: Interesting find: AMD’s Bulldozer CMT Scaling

    it will be very interesting to see what will happen if amd could increase its single thread performance keeping the higher scaling at the same time. Cmt is an interesting and new technology for the desktop platform and windows as well software is not yet optimized for it. When software can take advantage of fx simd’s instructions and cmt scaling as shown in aes truecrypt, the performance is in par with the intel core i7. The amd cmt architecture in bulldozer has a lot of potential and performance will only go up in the future with piledriver and more optimized software.
    The thing is software always behind the hardware for few years and whenever software catch up to the current hardware tech. Isn't that become an old tech then?

Remove Ads

Sponsored Links

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •  

Similar Threads

  1. Very Interesting!
    By Bill Gates in forum Drives & Storage
    Replies: 4
    Last Post: 26-09-2011, 00:03
  2. Micro-Stuttering And GPU Scaling In CrossFire And SLI
    By Lil' ½ Dead in forum Graphics Cards
    Replies: 2
    Last Post: 22-08-2011, 20:14
  3. Interesting....
    By rickss69 in forum General Discussion
    Replies: 9
    Last Post: 02-03-2011, 13:31
  4. Starcraft 2 CPU scaling benchmarks
    By Lil' ½ Dead in forum PC Games
    Replies: 1
    Last Post: 01-03-2010, 14:25
  5. HD 4850 CrossFire scaling put to the test
    By News Team in forum Hardware News
    Replies: 7
    Last Post: 25-06-2008, 14:59

Search tags for this page

3930k x264 benchmark hd v4.0
,
amd 6 core cmt
,

amd cmt

,
amd cmt vs micro system cmt
,

amd fx memory scaling

,
amd's cmt
,

bulldozer cmt

,
chip design cmt vs smt
,
cmp vs cmt vs smt
,
cmt smt cpu
,
cmt vs ht processor
,
cmt vs smt
,
ht vs cmt
,
x264 benchmark hd v4.0 amd fx
Click on a term to search for related topics.

Tags for this Thread