EasyManuals Logo

Intel ARCHITECTURE IA-32 User Manual

Intel ARCHITECTURE IA-32
568 pages
To Next Page IconTo Next Page
To Next Page IconTo Next Page
To Previous Page IconTo Previous Page
To Previous Page IconTo Previous Page
Page #291 background imageLoading...
Page #291 background image
6-1
6
Optimizing Cache Usage
Over the past decade, processor speed has increased more than ten
times. Memory access speed has increased at a slower pace. The
resulting disparity has made it important to tune applications in one of
two ways: either (a) a majority of the data accesses are fulfilled from
processor caches, or (b) effectively masking memory latency to utilize
peak memory bandwidth as much as possible.
Hardware prefetching mechanisms are enhancements in
microarchitecture to facilitate the latter aspect, and will be most
effective when combined with software tuning. The performance of
most applications can be considerably improved if the data required can
be fetched from the processor caches or if memory traffic can take
advantage of hardware prefetching effectively.
Standard techniques to bring data into the processor before it is needed
involves additional programming which can be difficult to implement
and may require special steps to prevent performance degradation.
Streaming SIMD Extensions addressed this issue by providing the
various prefetch instructions.
Streaming SIMD Extensions also introduced the various non-temporal
store instructions. SSE2 extend this support to new data types and also
introduce non-temporal store support for the 32-bit integer registers.
This chapter focuses on three subjects:
Hardware Prefetching Mechanism, Software Prefetch and
Cacheability Instructions: discusses microarchitectural feature and
instructions that allow you to affect data caching in an application.

Table of Contents

Questions and Answers:

Question and Answer IconNeed help?

Do you have a question about the Intel ARCHITECTURE IA-32 and is the answer not in the manual?

Intel ARCHITECTURE IA-32 Specifications

General IconGeneral
Instruction Setx86
Instruction Set TypeCISC
Memory SegmentationSupported
Operating ModesReal mode, Protected mode, Virtual 8086 mode
Max Physical Address Size36 bits (with PAE)
Max Virtual Address Size32 bits
ArchitectureIA-32 (Intel Architecture 32-bit)
Addressable Memory4 GB (with Physical Address Extension up to 64 GB)
Floating Point Registers8 x 80-bit
MMX Registers8 x 64-bit
SSE Registers8 x 128-bit
RegistersGeneral-purpose registers (EAX, EBX, ECX, EDX, ESI, EDI, ESP, EBP), Segment registers (CS, DS, SS, ES, FS, GS), Instruction pointer (EIP), Flags register (EFLAGS)
Floating Point UnitYes (x87)

Related product manuals