| Tensor Processing Unit 3.0 | |
|---|---|
| Designer | Google |
| Introduced | 2015[1] |
| Type | Neural network machine learning ASIC |
Tensor Processing Unit (TPU) is an AI accelerator application-specific integrated circuit (ASIC) developed by Google for neural network machine learning, using Google's own TensorFlow software.[2] Google began using TPUs internally in 2015, and in 2018 made them available for third-party use, both as part of its cloud infrastructure and by offering a smaller version of the chip for sale.
Compared to a graphics processing unit, TPUs are designed for a high volume of low-precision computation (e.g. as little as 8-bit precision)[3] with more input/output operations per joule, without hardware for rasterisation/texture mapping.[4] The TPU ASICs are mounted in a heatsink assembly, which can fit in a hard drive slot within a data center rack, according to Norman Jouppi.[5]
Different types of processors are suited for different types of machine learning models. TPUs are well suited for CNNs, while GPUs have benefits for some fully connected neural networks, and CPUs can have advantages for RNNs.[6]
According to Jonathan Ross, one of the original TPU engineers[1] and later the founder of Groq, three separate groups at Google were developing AI accelerators, with the TPU being the design that was ultimately selected. He was not aware of systolic arrays at the time, and upon learning the term thought, "Oh, that's called a systolic array? It just seemed to make sense."[7]
The tensor processing unit was announced in May 2016 at Google I/O, when the company said that the TPU had already been used inside their data centers for over a year.[5][4] Google's 2017 paper describing its creation cites previous systolic matrix multipliers of similar architecture built in the 1990s.[8] The chip has been specifically designed for Google's TensorFlow framework, a symbolic math library which is used for machine learning applications such as neural networks.[9] However, as of 2017 Google still used CPUs and GPUs for other types of machine learning.[5] Other AI accelerator designs are appearing from other vendors as well, aimed at the embedded and robotics markets.
Google's TPUs are proprietary. Some models are commercially available, and on February 12, 2018, The New York Times reported that Google "would allow other companies to buy access to those chips through its cloud-computing service."[10] Google has said that they were used in the AlphaGo versus Lee Sedol series of human-versus-machine Go games,[4] as well as in the AlphaZero system, which produced Chess, Shogi and Go playing programs from the game rules alone and went on to beat the leading programs in those games.[11] Google has also used TPUs for Google Street View text processing, and was able to find all the text in the Street View database in less than five days. In Google Photos, an individual TPU can process over 100 million photos a day.[5] It is also used in RankBrain, which Google uses to provide search results.[12]
Google provides third parties access to TPUs through its Cloud TPU service as part of the Google Cloud Platform[13] and through its notebook-based services Kaggle and Colaboratory.[14][15]
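For instance, in a Colab or Kaggle notebook with a TPU runtime selected, the attached TPU cores can be listed from Python. A minimal check, assuming JAX with TPU support is installed (as it is in Colab's TPU runtime):

```python
import jax

# Lists the accelerator cores visible to the runtime; on a TPU runtime
# this prints TPU device entries, on a plain VM it falls back to CPU.
print(jax.devices())
```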
Broadcom is a co-developer of TPUs, translating Google's architecture and specifications into manufacturable silicon. It provides proprietary technologies such as SerDes high-speed interfaces, oversees ASIC design, and manages chip fabrication and packaging through third-party foundries like TSMC, covering all generations since the program's inception.[16][17][18]
| | v1 | v2 | v3 | v4[20][22][23] | v5e[24] | v5p[25][26] | v6e (Trillium)[27][28] | v7 (Ironwood)[29] |
|---|---|---|---|---|---|---|---|---|
| Date introduced | 2015 | 2017 | 2018 | 2021 | 2023 | 2023 | 2024 | 2025 |
| Process node | 28 nm | 16 nm | 16 nm | 7 nm | Not listed | Not listed | Not listed | Not listed |
| Die size (mm²) | 331 | < 625 | < 700 | < 400 | 300–350 | Not listed | Not listed | Not listed |
| On-chip memory (MiB) | 28 | 32 | 32 (VMEM) + 5 (spMEM) | 128 (CMEM) + 32 (VMEM) + 10 (spMEM) | Not listed | Not listed | Not listed | Not listed |
| Clock speed (MHz) | 700 | 700 | 940 | 1050 | Not listed | 1750 | Not listed | Not listed |
| Memory | 8 GiB DDR3 | 16 GiB HBM | 32 GiB HBM | 32 GiB HBM | 16 GB HBM | 95 GB HBM | 32 GB HBM | 192 GB HBM |
| Memory bandwidth | 34 GB/s | 600 GB/s | 900 GB/s | 1200 GB/s | 819 GB/s | 2765 GB/s | 1640 GB/s | 7.37 TB/s |
| Thermal design power (W) | 75 | 280 | 220 | 170 | Not listed | Not listed | Not listed | Not listed |
| Computational performance (trillion operations per second) | 23 | 45 | 123 | 275 | 197 (bf16) 393 (int8) | 459 (bf16) 918 (int8) | 918 (bf16) 1836 (int8) | 4614 (fp8) |
| Energy efficiency (teraOPS/W) | 0.31 | 0.16 | 0.56 | 1.62 | Not listed | Not listed | Not listed | 4.7 |
The first-generation TPU is an 8-bit matrix multiplication engine, driven with CISC instructions by the host processor across a PCIe 3.0 bus. It is manufactured on a 28 nm process with a die size ≤ 331 mm². The clock speed is 700 MHz and it has a thermal design power of 28–40 W. It has 28 MiB of on-chip memory, and 4 MiB of 32-bit accumulators taking the results of a 256×256 systolic array of 8-bit multipliers.[8] Within the TPU package is 8 GiB of dual-channel 2133 MHz DDR3 SDRAM offering 34 GB/s of bandwidth.[21] Instructions transfer data to or from the host, perform matrix multiplications or convolutions, and apply activation functions.[8]
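To make that dataflow concrete, the following is a toy, time-stepped NumPy simulation of an output-stationary systolic array multiplying 8-bit matrices into 32-bit accumulators. It is a sketch of the general technique only, not Google's hardware design; the function name and the small matrix sizes are invented for the example.

```python
import numpy as np

def systolic_matmul(a: np.ndarray, b: np.ndarray) -> np.ndarray:
    """Toy simulation of an output-stationary systolic array computing A @ B.

    Row i of A streams in from the left and column j of B from the top,
    each skewed by one time step per position; the processing element (PE)
    at (i, j) multiplies the int8 pair passing through it and adds the
    product to its local int32 accumulator.
    """
    n, m = a.shape
    m2, p = b.shape
    assert m == m2 and a.dtype == np.int8 and b.dtype == np.int8
    acc = np.zeros((n, p), dtype=np.int32)  # 32-bit accumulators
    # At time step t, PE (i, j) processes a[i, k] and b[k, j] with k = t - i - j,
    # so each PE performs at most one multiply-accumulate per step.
    for t in range(n + p + m - 2):
        for i in range(n):
            for j in range(p):
                k = t - i - j
                if 0 <= k < m:
                    acc[i, j] += np.int32(a[i, k]) * np.int32(b[k, j])
    return acc

rng = np.random.default_rng(0)
x = rng.integers(-128, 128, size=(8, 8), dtype=np.int8)  # activations
w = rng.integers(-128, 128, size=(8, 8), dtype=np.int8)  # weights
assert np.array_equal(systolic_matmul(x, w), x.astype(np.int32) @ w.astype(np.int32))
```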
The second-generation TPU was announced in May 2017.[30] Google stated the first-generation TPU design was limited by memory bandwidth, and that using 16 GB of High Bandwidth Memory in the second-generation design increased bandwidth to 600 GB/s and performance to 45 teraFLOPS.[21] The TPUs are then arranged into four-chip modules with a performance of 180 teraFLOPS.[30] Then 64 of these modules are assembled into 256-chip pods with 11.5 petaFLOPS of performance.[30] Notably, while the first-generation TPUs were limited to integers, the second-generation TPUs can also calculate in floating point, introducing the bfloat16 format invented by Google Brain. This makes the second-generation TPUs useful for both training and inference of machine learning models. Google has stated these second-generation TPUs will be available on the Google Compute Engine for use in TensorFlow applications.[31]
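bfloat16 keeps float32's 8 exponent bits, and therefore its dynamic range, but truncates the mantissa to 7 bits, so a bfloat16 value can be obtained from a float32 by keeping only its upper 16 bits. Below is a minimal NumPy sketch of that conversion, assuming simple truncation rather than the round-to-nearest behavior real hardware typically applies:

```python
import numpy as np

def to_bfloat16_bits(x: float) -> np.uint16:
    """Truncate a float32 to its upper 16 bits: the bfloat16 bit pattern."""
    return np.uint16(np.float32(x).view(np.uint32) >> np.uint32(16))

def from_bfloat16_bits(b: np.uint16) -> np.float32:
    """Re-expand a bfloat16 bit pattern into a float32 with zeroed low bits."""
    return (np.uint32(b) << np.uint32(16)).view(np.float32)

x = np.float32(3.14159265)
print(from_bfloat16_bits(to_bfloat16_bits(x)))  # ~3.140625: range kept, precision cut
```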
The third-generation TPU was announced on May 8, 2018.[32] Google announced that the processors themselves were twice as powerful as the second-generation TPUs, and would be deployed in pods with four times as many chips as the preceding generation.[33][34] This results in an 8-fold increase in performance per pod (with up to 1,024 chips per pod) compared to the second-generation TPU deployment.
On May 18, 2021, Google CEO Sundar Pichai spoke about TPU v4 Tensor Processing Units during his keynote at the Google I/O virtual conference. TPU v4 improved performance by more than 2x over TPU v3 chips. Pichai said, "A single v4 pod contains 4,096 v4 chips, and each pod has 10x the interconnect bandwidth per chip at scale, compared to any other networking technology."[35] An April 2023 paper by Google claims TPU v4 is 5–87% faster than an Nvidia A100 at machine learning benchmarks.[36]
There is also an "inference" version, called v4i,[37] that does not require liquid cooling.[38]
In 2021, Google revealed that the physical layout of TPU v5 was being designed with the assistance of a novel application of deep reinforcement learning.[39] Google claims TPU v5 is nearly twice as fast as TPU v4,[40] and based on that and the relative performance of TPU v4 over the A100, some speculate that TPU v5 is as fast as or faster than an H100.[41]
Similar to the v4i being a lighter-weight version of the v4, the fifth generation has a "cost-efficient"[42] version called v5e.[24] In December 2023, Google announced TPU v5p, which is claimed to be competitive with the H100.[43]
In May 2024, at the Google I/O conference, Google announced TPU v6, which became available in preview in October 2024.[44] Google claimed a 4.7 times performance increase relative to TPU v5e,[45] via larger matrix multiplication units and an increased clock speed. High Bandwidth Memory (HBM) capacity and bandwidth have also doubled. A pod can contain up to 256 Trillium units.[46]
In April 2025, at the Google Cloud Next conference, Google unveiled TPU v7. The new chip, called Ironwood, will come in two configurations: a 256-chip cluster and a 9,216-chip cluster. Ironwood will have a peak computational performance of 4,614 TFLOP/s.[47]
In July 2018, Google announced the Edge TPU. The Edge TPU is Google's purpose-built ASIC designed to run machine learning (ML) models for edge computing, meaning it is much smaller and consumes far less power than the TPUs hosted in Google data centers (also known as Cloud TPUs[48]). In January 2019, Google made the Edge TPU available to developers with a line of products under the Coral brand. The Edge TPU is capable of 4 trillion operations per second with 2 W of electrical power.[49]
The product offerings include a single-board computer (SBC), a system on module (SoM), a USB accessory, a mini PCI-e card, and an M.2 card. The SBC Coral Dev Board and Coral SoM both run Mendel Linux OS, a derivative of Debian.[50][51] The USB, PCI-e, and M.2 products function as add-ons to existing computer systems, and support Debian-based Linux systems on x86-64 and ARM64 hosts (including Raspberry Pi).
The machine learning runtime used to execute models on the Edge TPU is based on TensorFlow Lite.[52] The Edge TPU is only capable of accelerating forward-pass operations, which means it is primarily useful for performing inference (although it is possible to perform lightweight transfer learning on the Edge TPU[53]). The Edge TPU also only supports 8-bit math, so for a network to be compatible with the Edge TPU, it needs to either be trained using TensorFlow's quantization-aware training technique or, since late 2019, be converted using post-training quantization.
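As a rough sketch of the second path, the following shows TensorFlow Lite's post-training full-integer quantization flow. The model directory and the representative dataset generator are hypothetical placeholders, and exact options may vary across TensorFlow versions:

```python
import numpy as np
import tensorflow as tf

# Hypothetical representative dataset: a few input samples the converter
# runs through the model to calibrate the int8 quantization ranges.
def representative_data_gen():
    for _ in range(100):
        yield [np.random.rand(1, 224, 224, 3).astype(np.float32)]

converter = tf.lite.TFLiteConverter.from_saved_model("my_model_dir")  # placeholder path
converter.optimizations = [tf.lite.Optimize.DEFAULT]
converter.representative_dataset = representative_data_gen
# Require full-integer ops so the resulting model can run on the Edge TPU.
converter.target_spec.supported_ops = [tf.lite.OpsSet.TFLITE_BUILTINS_INT8]
converter.inference_input_type = tf.int8
converter.inference_output_type = tf.int8

tflite_model = converter.convert()
with open("model_int8.tflite", "wb") as f:
    f.write(tflite_model)
```

The quantized .tflite file is then compiled for the device with Google's Edge TPU Compiler before deployment.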
On November 12, 2019, Asus announced a pair of single-board computers (SBCs) featuring the Edge TPU. The Asus Tinker Edge T and Tinker Edge R boards are designed for IoT and edge AI. The SBCs officially support the Android and Debian operating systems.[54][55] ASUS has also demonstrated a mini PC called the Asus PN60T featuring the Edge TPU.[56]
On January 2, 2020, Google announced the Coral Accelerator Module and Coral Dev Board Mini, to be demonstrated at CES 2020 later the same month. The Coral Accelerator Module is a multi-chip module featuring the Edge TPU, with PCIe and USB interfaces for easier integration. The Coral Dev Board Mini is a smaller SBC featuring the Coral Accelerator Module and a MediaTek 8167s SoC.[57][58]
On October 15, 2019, Google announced the Pixel 4 smartphone, which contains an Edge TPU called the Pixel Neural Core. Google describes it as "customized to meet the requirements of key camera features in Pixel 4", using a neural network search that sacrifices some accuracy in favor of minimizing latency and power use.[59]
Google followed the Pixel Neural Core by integrating an Edge TPU into a custom system-on-chip named Google Tensor, which was released in 2021 with the Pixel 6 line of smartphones.[60] The Google Tensor SoC demonstrated "extremely large performance advantages over the competition" in machine learning-focused benchmarks; although instantaneous power consumption was also relatively high, the improved performance meant less energy was consumed due to shorter periods requiring peak performance.[61]
In 2019, Singular Computing, founded in 2009 by Joseph Bates, a visiting professor at MIT,[62] filed suit against Google alleging patent infringement in TPU chips.[63] By 2020, Google had successfully lowered the number of claims the court would consider to just two: claim 53 of US 8407273, filed in 2012, and claim 7 of US 9218156, filed in 2013, both of which claim a dynamic range of 10⁻⁶ to 10⁶ for floating point numbers, which the standard float16 cannot achieve (without resorting to subnormal numbers), as it only has five bits for the exponent. In a 2023 court filing, Singular Computing specifically called out Google's use of bfloat16, as that exceeds the dynamic range of float16.[64] Singular claims non-standard floating point formats were non-obvious in 2009, but Google retorts that the VFLOAT[65] format, with a configurable number of exponent bits, existed as prior art in 2002.[66] By January 2024, subsequent lawsuits by Singular had brought the number of patents being litigated up to eight. Towards the end of the trial later that month, Google agreed to a settlement with undisclosed terms.[67][68]
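The range claim is easy to verify numerically; a quick NumPy illustration (float16 here is the IEEE half-precision format the filing contrasts against):

```python
import numpy as np

# float16 has 5 exponent bits: its normal values span roughly 6.1e-5 to 65504,
# so 1e-6 only survives as a subnormal and 1e6 overflows to infinity.
print(np.float16(1e-6))           # ~1.0133e-06, a subnormal with few significant bits
print(np.float16(1e6))            # inf: beyond float16's maximum
print(np.finfo(np.float16).tiny)  # 6.104e-05, smallest normal float16
print(np.finfo(np.float16).max)   # 65504.0, largest float16
# bfloat16's 8 exponent bits match float32's, covering roughly 1e-38 to 3e38.
```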