Uploaded by Kamal Acharya

Pipelining and vector processing

The document discusses parallel processing techniques, categorizing computers based on Flynn's classification, which includes SISD, SIMD, MISD, and MIMD structures. It explains pipelining as a method to improve instruction and data processing efficiency through simultaneous operations, and delves into instruction pipelines and vector processing for complex computations. Additionally, the document highlights array processors and their function in enhancing computer performance through parallelism.

Pipelining and Vector Processing
- Parallel Processing, Flynn's Classification of Computers
- Pipelining
- Instruction Pipeline
- Pipeline Hazards and their Solution
- Array and Vector Processing
Parallel Processing It refers to techniques that are used to providesimultaneous data processing. The system may have two or more ALUs to be able toexecute two or more instruction at the same time. The system may have two or more processorsoperating concurrently. It can be achieved by having multiple functionalunits that perform same or different operationsimultaneously.
Classification There are variety of ways in which the parallelprocessing can be classified Internal Organization of Processor Interconnection structure between processors Flow of information through system
 M.J. Flynn classify the computer on the basis ofnumber of instruction and data items processedsimultaneously. Single Instruction Stream, Single Data Stream(SISD) Single Instruction Stream, Multiple Data Stream(SIMD) Multiple Instruction Stream, Single Data Stream(MISD) Multiple Instruction Stream, Multiple Data Stream(MIMD)
 SISD represents the organization containing singlecontrol unit, a processor unit and a memory unit.Instruction are executed sequentially and systemmay or may not have internal parallel processingcapabilities. SIMD represents an organization that includes manyprocessing units under the supervision of a commoncontrol unit.
 MISD structure is of only theoretical interest sinceno practical system has been constructed using thisorganization. MIMD organization refers to a computer systemcapable of processing several programs at the sametime.
 Flynn’s classification emphasize on the behavioralcharacteristics of the computer system rather thanits operational and structural interconnections. Onetype of parallel processing that does not fit in theFlynn’s classification is Pipelining. Parallel Processing can be discussed under followingtopics: Pipeline Processing Vector Processing Array Processors
Pipelining It is a technique of decomposing a sequential processinto sub operations, with each sub process beingexecuted in a special dedicated segments thatoperates concurrently with all other segments. Each segment performs partial processing dictatedby the way task is partitioned. The result obtained from each segment is transferredto next segment. The final result is obtained when data have passedthrough all segments.
Example Suppose we have to perform the following task: Each sub operation is to be performed in a segmentwithin a pipeline. Each segment has one or tworegisters and a combinational circuit.
 The sub operations in each segment of the pipelineare as follows:
General Consideration
- Consider the case where a k-segment pipeline with a clock cycle time tp is used to execute n tasks.
- The first task T1 requires time k·tp to complete, since there are k segments.
- The remaining (n-1) tasks emerge from the pipe at the rate of one task per clock cycle, so they complete after a further time (n-1)·tp.
- So the total time required is k+(n-1) clock cycles, that is, (k+n-1)·tp.
- Calculate the total cycles in the previous example.
- Now consider a non-pipelined unit that performs the same operation and takes time tn to complete each task. The total time required is n·tn.
- The speedup ratio is given as: S = n·tn / ((k+n-1)·tp)
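The two totals above are easy to check numerically. The following Python sketch uses the slide's symbols k, n, tp and tn; the function names and the sample values (4 segments, 100 tasks, tp = 20, tn = 80) are illustrative assumptions only.

```python
def pipeline_cycles(k: int, n: int) -> int:
    """Clock cycles for n tasks on a k-segment pipeline: k + (n - 1)."""
    if k < 1 or n < 1:
        raise ValueError("k and n must both be at least 1")
    return k + (n - 1)


def speedup(k: int, n: int, tp: float, tn: float) -> float:
    """Ratio of non-pipelined time n*tn to pipelined time (k + n - 1)*tp."""
    return (n * tn) / (pipeline_cycles(k, n) * tp)


# Assumed sample values, not taken from the slides.
print(pipeline_cycles(4, 100))            # 103
print(round(speedup(4, 100, 20, 80), 2))  # 3.88
```

If one additionally assumes that tn = k·tp, the ratio becomes n·k / (k+n-1), which approaches k as n grows large.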
Arithmetic Pipeline
- Pipelined arithmetic units are usually found in very high speed computers.
- They are used to implement floating-point operations.
- We will now discuss the pipeline unit for floating-point addition and subtraction.
- The inputs to the floating-point adder pipeline are two normalized floating-point numbers.
- A and B are the mantissas, and a and b are the exponents.
- Floating-point addition and subtraction can be performed in four segments. The sub-operations performed in the segments are:
  1. Compare the exponents
  2. Align the mantissas
  3. Add or subtract the mantissas
  4. Normalize the result
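The four sub-operations can be traced with a small Python sketch. Several things here are assumptions rather than content from the slides: the radix is taken to be 10, a normalized mantissa is taken to satisfy 0.1 <= |m| < 1, and the function name `fp_add` and the sample operands are hypothetical. Exact fractions are used so that the alignment step is easy to follow.

```python
from fractions import Fraction
from typing import Tuple

BASE = 10  # assumed radix; the slides do not state one


def fp_add(A: Fraction, a: int, B: Fraction, b: int,
           subtract: bool = False) -> Tuple[Fraction, int]:
    """Add (or subtract) A*BASE**a and B*BASE**b in four steps."""
    # Segment 1: compare the exponents and keep the larger one.
    diff = a - b
    exponent = max(a, b)

    # Segment 2: align the mantissa that belongs to the smaller exponent.
    if diff > 0:
        B = B / BASE ** diff
    elif diff < 0:
        A = A / BASE ** (-diff)

    # Segment 3: add or subtract the aligned mantissas.
    mantissa = A - B if subtract else A + B

    # Segment 4: normalize so that 1/BASE <= |mantissa| < 1.
    if mantissa == 0:
        return Fraction(0), 0
    while abs(mantissa) >= 1:
        mantissa /= BASE
        exponent += 1
    while abs(mantissa) < Fraction(1, BASE):
        mantissa *= BASE
        exponent -= 1
    return mantissa, exponent


# 0.9504 * 10**3 + 0.8200 * 10**2
m, e = fp_add(Fraction(9504, 10000), 3, Fraction(8200, 10000), 2)
print(float(m), e)  # 0.10324 4
```

In a hardware pipeline each of the four steps would sit in its own segment with registers in between, so that four different additions can be in progress at once; the sketch runs them one after another for a single pair of operands.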
Instruction Pipeline
- Pipeline processing can occur not only in the data stream but in the instruction stream as well.
- An instruction pipeline reads consecutive instructions from memory while previous instructions are being executed in other segments.
- This causes the instruction fetch and execute segments to overlap and operate simultaneously.

Four-Segment CPU Pipeline
- FI segment fetches the instruction.
- DA segment decodes the instruction and calculates the effective address.
- FO segment fetches the operand.
- EX segment executes the instruction.
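The overlap between instructions can be made visible with a space-time table. The Python sketch below is a minimal illustration that assumes an ideal pipeline with no branches and no conflicts; the function names `space_time` and `render` are hypothetical.

```python
from typing import List, Sequence

SEGMENTS = ("FI", "DA", "FO", "EX")


def space_time(n_instructions: int,
               segments: Sequence[str] = SEGMENTS) -> List[List[str]]:
    """Row i lists the segment occupied by instruction i in each clock cycle."""
    total_cycles = len(segments) + n_instructions - 1
    rows = []
    for i in range(n_instructions):
        row = [""] * total_cycles
        for s, name in enumerate(segments):
            row[i + s] = name  # instruction i enters segment s at cycle i + s
        rows.append(row)
    return rows


def render(rows: List[List[str]]) -> str:
    """Format the table with one line per instruction."""
    lines = []
    for number, row in enumerate(rows, start=1):
        cells = " ".join(f"{cell:<2}" for cell in row).rstrip()
        lines.append(f"I{number}: {cells}")
    return "\n".join(lines)


print(render(space_time(3)))
```

Each row is shifted one cycle to the right of the row above it, so from the fourth cycle onward all four segments are busy with different instructions.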
Handling Data Dependency
This problem can be solved in the following ways:
- Hardware interlocks: An interlock is a circuit that detects the conflict situation and delays the instruction by enough clock cycles to resolve the conflict.
- Operand forwarding: Special hardware detects the conflict and avoids it by routing the data through a special path between pipeline segments.
- Delayed loads: The compiler detects the data conflict and reorders the instructions as necessary to delay the loading of the conflicting data, inserting no-operation instructions.
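The delayed-load idea can be illustrated with a small compiler-style pass in Python. This is a sketch under explicit assumptions: the instruction format (`Instr` with `op`, `dest`, `srcs`), the register names and the single-cycle load delay are all hypothetical, and the pass only inserts no-operation instructions; it does not attempt the reordering that a real compiler might try first.

```python
from typing import List, NamedTuple, Optional, Tuple


class Instr(NamedTuple):
    op: str                      # e.g. "LOAD", "ADD", "STORE", "NOP"
    dest: Optional[str]          # register written, if any
    srcs: Tuple[str, ...] = ()   # registers read


NOP = Instr("NOP", None)


def insert_load_delays(program: List[Instr]) -> List[Instr]:
    """Insert a NOP wherever an instruction reads a register that the
    immediately preceding LOAD is still fetching (assumed one-cycle delay)."""
    scheduled: List[Instr] = []
    for instr in program:
        previous = scheduled[-1] if scheduled else None
        if previous is not None and previous.op == "LOAD" \
                and previous.dest in instr.srcs:
            scheduled.append(NOP)
        scheduled.append(instr)
    return scheduled


program = [
    Instr("LOAD", "R1"),
    Instr("LOAD", "R2"),
    Instr("ADD", "R3", ("R1", "R2")),
    Instr("STORE", None, ("R3",)),
]
print([i.op for i in insert_load_delays(program)])
# ['LOAD', 'LOAD', 'NOP', 'ADD', 'STORE']
```

Only the ADD needs a delay here, because it reads R2 directly after the LOAD that produces it; R1 was loaded one instruction earlier and is assumed to be available.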
Handling of Branch Instructions
- Prefetch the target instruction.
- Branch target buffer (BTB) included in the fetch segment of the pipeline
- Branch prediction
- Delayed branch
RISC Pipeline
- The simplicity of the instruction set is utilized to implement an instruction pipeline using a small number of sub-operations, with each being executed in a single clock cycle.
- Since all operations are performed in registers, there is no need for effective address calculation.

Three-Segment Instruction Pipeline
- I: Instruction fetch
- A: ALU operation
- E: Execute instruction
Delayed Load
Delayed Branch
- Let us consider a program having the following 5 instructions:
Vector Processing
- There is a class of computational problems that are beyond the capabilities of a conventional computer.
- These problems require a vast number of computations, and it would take a conventional computer days or even weeks to complete them.
- Computers with vector processing are able to handle such problems, and they have applications in the following fields:
  - Long-range weather forecasting
  - Petroleum exploration
  - Seismic data analysis
  - Medical diagnosis
  - Aerodynamics and space simulation
  - Artificial intelligence and expert systems
  - Mapping the human genome
  - Image processing
Vector Operation
- A vector V of length n is represented as a row vector by V = [V1 V2 V3 ... Vn].
- The element Vi of vector V is written as V(I), and the index I refers to a memory address or register where the number is stored.
- Let us consider a program in assembly language that adds two vectors A and B of length 100 and puts the result in vector C.
- A computer capable of vector processing eliminates the overhead associated with the time it takes to fetch and execute the instructions in the program loop.
- It allows operations to be specified with a single vector instruction of the form:
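As a rough illustration of the contrast described above, the Python sketch below writes the same 100-element addition twice: once as an explicit loop with its own index update and end test, and once as a single whole-vector operation. Python itself does not run on vector hardware, so this only shows the difference in how the work is specified; the function names are hypothetical.

```python
from typing import List, Sequence, Tuple


def scalar_add(a: Sequence[float], b: Sequence[float]) -> Tuple[List[float], int]:
    """Element-by-element loop; returns the result and the number of
    loop iterations, each of which also pays for an index update and a test."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    c = [0.0] * len(a)
    iterations = 0
    i = 0
    while i < len(a):          # end-of-loop test, repeated every iteration
        c[i] = a[i] + b[i]
        i += 1                 # index update, repeated every iteration
        iterations += 1
    return c, iterations


def vector_add(a: Sequence[float], b: Sequence[float]) -> List[float]:
    """The whole operation stated once, with no visible loop control."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    return [x + y for x, y in zip(a, b)]


A = [float(i) for i in range(100)]
B = [2.0 * i for i in range(100)]
C_loop, iterations = scalar_add(A, B)
C_vec = vector_add(A, B)
print(iterations, C_loop == C_vec)  # 100 True
```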
Matrix Multiplication
- Let us consider the multiplication of two 3×3 matrices A and B.
- Computing one element of the product, such as c11, requires three multiplications and (after initializing c11 to 0) three additions.
- The total number of multiplications or additions required is 3×9 = 27.
- In general, the inner product consists of the sum of k product terms of the form: C = A1·B1 + A2·B2 + A3·B3 + ... + Ak·Bk
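The operation count above can be confirmed with a short Python sketch that multiplies two matrices and tallies the multiply and add steps. The function name and the sample matrices are illustrative assumptions; every element of the product starts at 0, as the slide describes for c11.

```python
from typing import List, Tuple

Matrix = List[List[float]]


def matmul_counting(a: Matrix, b: Matrix) -> Tuple[Matrix, int, int]:
    """Return the product a*b together with the number of multiplications
    and additions performed."""
    rows, inner, cols = len(a), len(b), len(b[0])
    if any(len(row) != inner for row in a):
        raise ValueError("inner dimensions do not match")
    c = [[0.0] * cols for _ in range(rows)]
    multiplications = additions = 0
    for i in range(rows):
        for j in range(cols):
            for t in range(inner):
                c[i][j] += a[i][t] * b[t][j]  # one multiply and one add
                multiplications += 1
                additions += 1
    return c, multiplications, additions


A = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
I = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
product, mults, adds = matmul_counting(A, I)
print(mults, adds)  # 27 27
```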
- In a typical application the value of k may be 100 or even 1000.
- The inner product calculation on a pipelined vector processor is shown below.
- The floating-point adder and multiplier are assumed to have four segments each.
- The four partial sums are added to form the final sum.
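One way to picture the four partial sums is the Python sketch below. It assumes, as an interpretation rather than something stated on the slides, that with a four-segment adder each new product is accumulated with the sum that entered the adder four steps earlier, so the products fall into four interleaved groups. The function name and sample data are hypothetical.

```python
from typing import List, Sequence, Tuple


def inner_product_partial_sums(a: Sequence[float], b: Sequence[float],
                               adder_segments: int = 4) -> Tuple[float, List[float]]:
    """Accumulate the products into `adder_segments` interleaved partial sums,
    then add the partial sums to obtain the inner product."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    partial = [0.0] * adder_segments
    for i, (x, y) in enumerate(zip(a, b)):
        # Product i joins the partial sum that product i - adder_segments fed.
        partial[i % adder_segments] += x * y
    return sum(partial), partial


total, partial = inner_product_partial_sums([1, 2, 3, 4, 5, 6, 7, 8], [1] * 8)
print(partial, total)  # [6.0, 8.0, 10.0, 12.0] 36.0
```

The first partial sum collects products 1 and 5, the second products 2 and 6, and so on; adding the four partial sums gives the same value as a plain sum of all the products.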
Memory Interleaving
Array Processor
- An array processor is a processor that performs computations on large arrays of data.
- There are two different types of array processor:
  - Attached array processor
  - SIMD array processor

Attached Array Processor
- It is designed as a peripheral for a conventional host computer.
- Its purpose is to enhance the performance of the computer by providing vector processing.
- It achieves high performance by means of parallel processing with multiple functional units.

SIMD Array Processor
- It is a processor that consists of multiple processing units operating in parallel.
- The processing units are synchronized to perform the same task under the control of a common control unit.
- Each processing element (PE) includes an ALU, a floating-point arithmetic unit and working registers.
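The SIMD organization described above can be mimicked with a toy Python model in which one broadcast instruction is applied by every processing element to its own data. The class and function names, the register names A, B and C, and the two supported operations are assumptions made for illustration; the Python loop is a sequential stand-in for what the hardware would do in lockstep.

```python
import operator
from typing import Dict, List

OPERATIONS = {"ADD": operator.add, "MUL": operator.mul}


class ProcessingElement:
    """One PE with its own working registers."""

    def __init__(self, a: float, b: float) -> None:
        self.registers: Dict[str, float] = {"A": a, "B": b, "C": 0}

    def execute(self, op: str, dest: str, src1: str, src2: str) -> None:
        self.registers[dest] = OPERATIONS[op](self.registers[src1],
                                              self.registers[src2])


def broadcast(pes: List[ProcessingElement], op: str,
              dest: str, src1: str, src2: str) -> None:
    """Common control unit: send the same instruction to every PE."""
    for pe in pes:  # sequential stand-in for simultaneous execution
        pe.execute(op, dest, src1, src2)


pes = [ProcessingElement(a, b) for a, b in zip([1, 2, 3, 4], [10, 20, 30, 40])]
broadcast(pes, "ADD", "C", "A", "B")
print([pe.registers["C"] for pe in pes])  # [11, 22, 33, 44]
```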