Nvidia GeForce GTX 980 User Manual

32 pages
GeForce GTX 980 Whitepaper
GM204 HARDWARE ARCHITECTURE IN-DEPTH
and power that had to be spent to manage data transfer in the more complex datapath organization
used by Kepler.
The SMM's memory hierarchy has also changed compared to Kepler. Rather than implementing a
combined shared memory/L1 cache block as in the Kepler SMX, each Maxwell SMM features 96 KB of
dedicated shared memory, while the L1 caching function has been moved to be shared with the texture
caching function.
As a result of these changes, each Maxwell CUDA core is able to deliver roughly 1.4x the performance
of a Kepler CUDA core, and 2x the performance per watt. At the SM level, with 33% fewer cores per SM
but 1.4x the performance per core, each Maxwell SMM can deliver total per-SM performance similar to
Kepler's SMX. The area savings from this more efficient architecture enabled us to double the total
SM count compared to GK104.
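The per-SM claim above can be checked with a few lines of arithmetic. The following is a minimal sketch, not a benchmark: the per-SM core counts (192 for Kepler SMX, 128 for Maxwell SMM) are assumptions taken from publicly quoted specifications rather than from the text above, and the final chip-level figure ignores clock speed, memory bandwidth, and every other factor.

```python
# Assumed core counts per SM (not stated in the text above).
KEPLER_CORES_PER_SMX = 192
MAXWELL_CORES_PER_SMM = 128

# Figures quoted in the text.
PER_CORE_SPEEDUP = 1.4   # "roughly 1.4x" per-core performance
SM_COUNT_RATIO = 2.0     # SM count doubled compared to GK104

# Fraction of Kepler's per-SM core count that a Maxwell SMM keeps.
core_ratio = MAXWELL_CORES_PER_SMM / KEPLER_CORES_PER_SMX
fewer_cores_pct = (1 - core_ratio) * 100

# Per-SM throughput relative to a Kepler SMX (1.0 means parity).
per_sm_relative = core_ratio * PER_CORE_SPEEDUP

# Rough whole-chip estimate from core throughput alone.
chip_relative = per_sm_relative * SM_COUNT_RATIO

print(f"Cores per SM reduced by {fewer_cores_pct:.0f}%")
print(f"Per-SM throughput vs. Kepler SMX: {per_sm_relative:.2f}x")
print(f"Core throughput vs. GK104 with 2x SMs: {chip_relative:.2f}x")
```

Under these assumptions the per-SM figure lands a little below 1.0, which is consistent with the text's "similar" rather than "equal" per-SM performance.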
PolyMorph Engine 3.0
Tessellation was one of DirectX 11's key features and will play a bigger role in the future as the next
generation of games is designed to use more tessellation. With the addition of more SMs in GM204,
GTX 980 also benefits from 2x the PolyMorph Engines compared to GTX 680. As a result, performance
on geometry-heavy workloads is roughly doubled, and due to architectural improvements within the PE,
GTX 980 can achieve up to a 3x performance improvement with high tessellation expansion factors.

Nvidia GeForce GTX 980 Specifications

General
GPU Architecture: Maxwell
CUDA Cores: 2048
Base Clock: 1126 MHz
Boost Clock: 1216 MHz
Memory Speed: 7 Gbps
Memory Clock: 1750 MHz
Memory Interface: 256-bit
Memory Bandwidth: 224 GB/s
VRAM: 4 GB GDDR5
TDP: 165 W
Recommended System Power: 500 W
Power Connectors: 2 x 6-pin
DirectX: 12 API
OpenGL: 4.5
GPU: GM204
Process Size: 28 nm
Transistors: 5.2 billion
Die Size: 398 mm²
Texture Units: 128
ROP Units: 64
OpenCL: 1.2
PCI Express Version: 3.0
Maximum Digital Resolution: 5120x3200
Maximum Display Outputs: 4
HDCP Support: Yes
SLI Support: Yes
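
The memory figures in the specifications above are related by simple arithmetic, sketched below. The factor of four transfers per memory clock is an assumption about how the listed 1750 MHz GDDR5 clock maps to the 7 Gbps per-pin data rate; the other inputs come straight from the list.

```python
# Values from the specification list.
MEMORY_CLOCK_MHZ = 1750
BUS_WIDTH_BITS = 256

# Assumption: GDDR5 moves 4 bits per pin per listed memory clock cycle.
TRANSFERS_PER_CLOCK = 4

# Per-pin data rate in Gbps.
data_rate_gbps = MEMORY_CLOCK_MHZ * TRANSFERS_PER_CLOCK / 1000

# Total bandwidth: per-pin rate times bus width, converted from bits to bytes.
bandwidth_gb_s = data_rate_gbps * BUS_WIDTH_BITS / 8

print(f"Data rate: {data_rate_gbps} Gbps per pin")
print(f"Memory bandwidth: {bandwidth_gb_s} GB/s")
```

Both results match the listed Memory Speed and Memory Bandwidth entries.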
