Google TPU
E660962
Google TPU is a custom-designed application-specific integrated circuit (ASIC) developed by Google to accelerate machine learning workloads, particularly deep learning inference and training in its data centers.
Observed surface forms (2)
| Surface form | Occurrences |
|---|---|
| Cloud TPU | 1 |
| Google custom processors | 1 |
Statements (46)
| Predicate | Object |
|---|---|
| instanceOf | application-specific integrated circuit; tensor processing unit |
| announcedAt | Google I/O 2016 |
| architecture | matrix-multiply optimized architecture |
| availableVia | Google Cloud |
| category | data center hardware; machine learning hardware |
| competesWith | AMD GPU; NVIDIA GPU; custom AI accelerators |
| designedFor | high throughput matrix operations; low-precision arithmetic |
| developer | Google |
| firstAnnouncement | 2016 |
| firstPublicUse | 2015 |
| hasVersion | TPU v1; TPU v2; TPU v3; TPU v4; TPU v5e; TPU v5lite; TPU v5p |
| integratedWith | Google Cloud AI Platform |
| manufacturer | Google |
| notableFeature | high performance per watt for ML workloads; systolic array matrix unit |
| offeredAs | Cloud TPU |
| optimizedFor | large-scale data center deployment |
| purpose | accelerate deep learning inference; accelerate deep learning training; accelerate machine learning workloads |
| region | global deployment in Google data centers |
| supports | TensorFlow; neural network inference; neural network training |
| supportsDataType | 32-bit floating point; 8-bit integer; bfloat16 |
| technologyNode | advanced CMOS process |
| usedBy | external Google Cloud customers; internal Google services |
| usedFor | Google Assistant; Google Photos; Google Search; Google Translate |
| usedIn | Google data centers |
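
The supportsDataType row lists bfloat16 alongside 32-bit floating point. As an illustration only, the sketch below shows how the two formats relate: bfloat16 keeps the sign bit, the 8 exponent bits, and the top 7 mantissa bits of an IEEE 754 float32. The helper names are hypothetical, and the conversion uses simple truncation as a simplifying assumption; this is not a description of how TPU hardware or any Google library performs the conversion, and real implementations commonly round to nearest instead.

```python
import struct


def float32_to_bfloat16_bits(x: float) -> int:
    """Return a 16-bit bfloat16 pattern for x (hypothetical helper).

    Assumption: the low 16 bits of the float32 encoding are simply
    dropped (truncation), which is the simplest possible conversion.
    """
    (bits32,) = struct.unpack(">I", struct.pack(">f", x))
    return bits32 >> 16  # sign + 8 exponent bits + top 7 mantissa bits


def bfloat16_bits_to_float(bits16: int) -> float:
    """Expand a 16-bit bfloat16 pattern back to a Python float."""
    (value,) = struct.unpack(">f", struct.pack(">I", (bits16 & 0xFFFF) << 16))
    return value


def round_trip(x: float) -> float:
    """Show the precision lost by storing x as bfloat16."""
    return bfloat16_bits_to_float(float32_to_bfloat16_bits(x))


if __name__ == "__main__":
    for sample in (1.0, 3.14159, 1000.5):
        print(sample, "->", round_trip(sample))
```

Because the exponent field is the same width as in float32, the representable range is unchanged and only precision is reduced, which is consistent with the low-precision arithmetic entry under designedFor.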
Referenced by (3)
Full triples — surface form annotated when it differs from this entity's canonical label.
this entity surface form: Cloud TPU
this entity surface form: Google custom processors