EasyManuals Logo

Intel ARCHITECTURE IA-32 User Manual

Intel ARCHITECTURE IA-32
568 pages
To Next Page IconTo Next Page
To Next Page IconTo Next Page
To Previous Page IconTo Previous Page
To Previous Page IconTo Previous Page
Page #268 background imageLoading...
Page #268 background image
IA-32 Intel® Architecture Optimization
5-6
simultaneously referred to as an xyz data representation, see the
diagram below) are computed in parallel, and the array is updated one
vertex at a time.
When data structures are organized for the horizontal computation
model, sometimes the availability of homogeneous arithmetic
operations in SSE and SSE2 may cause inefficiency or require
additional intermediate movement between data elements.
Alternatively, the data structure can be organized in the SoA format.
The SoA data structure enables a vertical computation technique, and is
recommended over horizontal computation for many applications, for
the following reasons:
When computing on a single vector (xyz), it is common to use only
a subset of the vector components; for example, in 3D graphics the
W
component is sometimes ignored. This means that for single-vector
operations, 1 of 4 computation slots is not being utilized. This
typically results in a 25% reduction of peak efficiency.
It may become difficult to hide long latency operations. For
instance, another common function in 3D graphics is normalization,
which requires the computation of a reciprocal square root (that is,
1/sqrt). Both the division and square root are long latency
operations. With vertical computation (SoA), each of the 4
computation slots in a SIMD operation is producing a unique result,
so the net latency per slot is L/4 where L is the overall latency of the
operation. However, for horizontal computation, the 4 computation
slots each produce the same result, hence to produce 4 separate
results requires a net latency per slot of L.
XYZW

Table of Contents

Questions and Answers:

Question and Answer IconNeed help?

Do you have a question about the Intel ARCHITECTURE IA-32 and is the answer not in the manual?

Intel ARCHITECTURE IA-32 Specifications

General IconGeneral
Instruction Setx86
Instruction Set TypeCISC
Memory SegmentationSupported
Operating ModesReal mode, Protected mode, Virtual 8086 mode
Max Physical Address Size36 bits (with PAE)
Max Virtual Address Size32 bits
ArchitectureIA-32 (Intel Architecture 32-bit)
Addressable Memory4 GB (with Physical Address Extension up to 64 GB)
Floating Point Registers8 x 80-bit
MMX Registers8 x 64-bit
SSE Registers8 x 128-bit
RegistersGeneral-purpose registers (EAX, EBX, ECX, EDX, ESI, EDI, ESP, EBP), Segment registers (CS, DS, SS, ES, FS, GS), Instruction pointer (EIP), Flags register (EFLAGS)
Floating Point UnitYes (x87)

Related product manuals