EasyManuals Logo

Intel ARCHITECTURE IA-32 User Manual

Intel ARCHITECTURE IA-32
568 pages
To Next Page IconTo Next Page
To Next Page IconTo Next Page
To Previous Page IconTo Previous Page
To Previous Page IconTo Previous Page
Page #350 background imageLoading...
Page #350 background image
IA-32 Intel® Architecture Optimization
7-4
When optimizing application performance in a multithreaded
environment, control flow parallelism is likely to have the largest
impact on performance scaling with respect to the number of physical
processors and to the number of logical processors per physical
processor.
If the control flow of a multi-threaded application contains a workload
in which only 50% can be executed in parallel, the maximum
performance gain using two physical processors is only 33%, compared
to using a single processor. Using four processors can deliver no more
than a 60% speed-up over a single processor! Thus, it is critical to
maximize the portion of control flow that can take advantage of
parallelism. Improper implementation of thread synchronization can
significantly increase the proportion of serial control flow and further
reduce the application’s performance scaling.
In addition to maximizing the parallelism of control flows, interaction
between threads in the form of thread synchronization and imbalance of
task scheduling can also impact overall processor scaling significantly.
Excessive cache misses are one cause of poor performance scaling. In a
multithreaded execution environment, they can occur from:
aliased stack accesses by different threads in the same process
thread contentions resulting in cache line evictions
false-sharing of cache lines between different processors
Techniques that address each of these situations (and many other areas)
are described in sections in this chapter.
Multitasking Environment
Hardware multi-threading capabilities in IA-32 processors can exploit
task-level parallelism when a workload consists of several
single-threaded applications and these applications are scheduled to run
concurrently under an MP-aware operating system. In this environment,
hardware multi-threading capabilities can deliver higher throughput for
the workload, although the relative performance of a single task (in

Table of Contents

Questions and Answers:

Question and Answer IconNeed help?

Do you have a question about the Intel ARCHITECTURE IA-32 and is the answer not in the manual?

Intel ARCHITECTURE IA-32 Specifications

General IconGeneral
Instruction Setx86
Instruction Set TypeCISC
Memory SegmentationSupported
Operating ModesReal mode, Protected mode, Virtual 8086 mode
Max Physical Address Size36 bits (with PAE)
Max Virtual Address Size32 bits
ArchitectureIA-32 (Intel Architecture 32-bit)
Addressable Memory4 GB (with Physical Address Extension up to 64 GB)
Floating Point Registers8 x 80-bit
MMX Registers8 x 64-bit
SSE Registers8 x 128-bit
RegistersGeneral-purpose registers (EAX, EBX, ECX, EDX, ESI, EDI, ESP, EBP), Segment registers (CS, DS, SS, ES, FS, GS), Instruction pointer (EIP), Flags register (EFLAGS)
Floating Point UnitYes (x87)

Related product manuals