T5 (language model)

From Wikipedia, the free encyclopedia
Series of large language models developed by Google AI
Text-to-Text Transfer Transformer (T5)
Original author(s): Google AI
Initial release: 23 October 2019
Repository: https://github.com/google-research/text-to-text-transfer-transformer
License: Apache-2.0
Website: blog.research.google/2020/02/exploring-transfer-learning-with-t5.html

T5 (Text-to-Text Transfer Transformer) is a series of large language models developed by Google AI, introduced in 2019.[1][2] Like the original Transformer model,[3] T5 models are encoder-decoder Transformers, where the encoder processes the input text, and the decoder generates the output text.

T5 models are usually pretrained on a massive dataset of text and code, after which they can perform text-based tasks similar to the tasks they were pretrained on. They can also be finetuned to perform other tasks.

T5 models have been employed in various applications, including chatbots, machine translation systems, text summarization tools, code generation, and robotics.[4]

Training


The original T5 models are pre-trained on the Colossal Clean Crawled Corpus (C4), containing text and code scraped from the internet. This pre-training process enables the models to learn general language understanding and generation abilities. T5 models can then be fine-tuned on specific downstream tasks, adapting their knowledge to perform well in various applications.

The T5 models were pretrained on many tasks, all in the format of <input text> -> <output text>; a brief usage sketch of this format follows the examples below.

How a T5 can be finetuned for a summarization task.[5]

Some examples are:

  • restoring corrupted text: Thank you <X> me to your party <Y> week. -> <X> for inviting <Y> last <Z>, where <Z> means "end of output", and <X> and <Y> denote blanks to be filled, called "sentinels" in the original report.
  • translation: translate English to German: That is good. -> Das ist gut.
  • judging the grammatical acceptability of a sentence (CoLA sentence): The course is jumping well. -> not acceptable.
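The text-to-text format can be exercised directly with the released checkpoints. The following is a minimal sketch of the translation example above, assuming the Hugging Face Transformers library and the "google-t5/t5-small" checkpoint (both are assumptions for illustration, not part of the original report):

    # Minimal sketch of T5's text-to-text usage (assumes Hugging Face Transformers
    # and the "google-t5/t5-small" checkpoint).
    from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

    tokenizer = AutoTokenizer.from_pretrained("google-t5/t5-small")
    model = AutoModelForSeq2SeqLM.from_pretrained("google-t5/t5-small")

    # Every task is phrased as plain text; the task is selected by a prefix in the input.
    inputs = tokenizer("translate English to German: That is good.", return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=20)
    print(tokenizer.decode(output_ids[0], skip_special_tokens=True))  # expected: "Das ist gut."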

Architecture

T5 encoder-decoder structure, showing the attention structure. In the encoder self-attention (lower square), all input tokens attend to each other; in the encoder–decoder cross-attention (upper rectangle), each target token attends to all input tokens; in the decoder self-attention (upper triangle), each target token attends only to present and past target tokens (causal).[5]

The T5 series encompasses several models with varying sizes and capabilities, all encoder-decoder Transformers, where the encoder processes the input text, and the decoder generates the output text.
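The three attention patterns described in the figure caption above can be written down as boolean masks. The following is a small illustrative sketch in PyTorch (the sizes and variable names are chosen for illustration, not taken from any T5 implementation):

    import torch

    n_src, n_tgt = 4, 3  # number of input (encoder) and output (decoder) tokens

    # Encoder self-attention: every input token attends to every input token.
    enc_self = torch.ones(n_src, n_src).bool()

    # Decoder self-attention: each output token attends only to itself and to
    # earlier output tokens (causal, lower-triangular mask).
    dec_self = torch.tril(torch.ones(n_tgt, n_tgt)).bool()

    # Encoder-decoder cross-attention: every output token attends to every input token.
    cross = torch.ones(n_tgt, n_src).bool()

    print(enc_self, dec_self, cross, sep="\n\n")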

These models are often distinguished by their parameter count, which indicates the complexity and potential capacity of the model. The original paper[1] reported the following 5 models:

T5 properties[note 1]
Name  | Total parameters | Encoder parameters | Decoder parameters | n_layer | d_model | d_ff  | d_kv | n_head
Small | 76,956,160       | 35,330,816         | 41,625,344         | 6       | 512     | 2048  | 64   | 8
Base  | 247,577,856      | 109,628,544        | 137,949,312        | 12      | 768     | 3072  | 64   | 12
Large | 770,567,168      | 334,939,648        | 435,627,520        | 24      | 1024    | 4096  | 64   | 16
3B    | 2,884,497,408    | 1,240,909,824      | 1,643,587,584      | 24      | 1024    | 16384 | 128  | 32
11B   | 11,340,220,416   | 4,864,791,552      | 6,475,428,864      | 24      | 1024    | 65536 | 128  | 128

* The encoder and the decoder have the same shape. So for example, the T5-small has 6 layers in the encoder and 6 layers in the decoder.

In the above table, n_layer is the number of layers in the encoder (the decoder has the same number), d_model is the dimension of the embedding vectors, d_ff is the inner dimension of the feedforward layers, d_kv is the dimension of the key and value vectors in each attention head, and n_head is the number of attention heads.

Note that, unlike typical Transformers, the 3B and 11B models do not satisfy d_model = d_kv × n_head.[6] For example, the 11B model has d_kv × n_head = 128 × 128 = 16384, which is far larger than its d_model of 1024.

Compared to the original Transformer, T5 uses a few minor modifications: layer normalization with no additive bias, layer normalization placed outside the residual path, and relative positional embedding.[7]

For all experiments, the authors used a WordPiece tokenizer with a vocabulary size of 32,000. The tokenizer is shared across both the input and output of each model. It was trained on a mixture of English, German, French, and Romanian data from the C4 dataset, at a ratio of 10:1:1:1.
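The shared tokenizer, including the sentinel tokens used in the span-corruption examples above, can be inspected from the released checkpoints. A brief sketch, assuming the Hugging Face Transformers library and the "google-t5/t5-small" checkpoint (an assumption for illustration, not part of the original experiments):

    # Sketch of inspecting the shared T5 tokenizer (assumes Hugging Face
    # Transformers and the "google-t5/t5-small" checkpoint).
    from transformers import AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained("google-t5/t5-small")

    # One tokenizer is shared between input and output text.
    print(tokenizer.vocab_size)  # on the order of 32,000 entries
    print(tokenizer.tokenize("translate English to German: That is good."))

    # Sentinel tokens such as <extra_id_0>, <extra_id_1>, ... play the role of the
    # <X>, <Y>, <Z> blanks in the span-corruption objective.
    print(tokenizer.convert_tokens_to_ids(["<extra_id_0>", "<extra_id_1>"]))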

Variants


Several subsequent models used the T5 architecture, with non-standardized naming conventions used to differentiate them. This section attempts to collect the main ones. An exhaustive list of the variants released by Google Brain is on the GitHub repo for T5X.[8]

Some models are trained from scratch, while others start from a previously trained model. Unless otherwise noted, each model listed below is trained from scratch.

  • T5 small, base, large, 3B, 11B (2019): The original models.[1]
  • T5 1.1 small, base, large, XL, XXL: Improved versions of the original T5 series, with roughly the same parameter counts. The activation function is GEGLU[9] instead of ReLU. The 3B and 11B models were renamed "XL" and "XXL", and their shapes were changed:[8][10][11]
T5 v1.1 properties[note 2]
Name  | Total parameters | Encoder parameters | Decoder parameters | n_layer | d_model | d_ff  | d_kv | n_head
Small | 76,961,152       | 35,332,800         | 41,628,352         | 8       | 512     | 1024  | 64   | 6
Base  | 247,577,856      | 109,628,544        | 137,949,312        | 12      | 768     | 2048  | 64   | 12
Large | 783,150,080      | 341,231,104        | 441,918,976        | 24      | 1024    | 2816  | 64   | 16
XL    | 2,849,757,184    | 1,223,527,424      | 1,626,229,760      | 24      | 2048    | 5120  | 64   | 32
XXL   | 11,135,332,352   | 4,762,310,656      | 6,373,021,696      | 24      | 4096    | 10240 | 64   | 64
  • LM-adapted T5 (2021): a series of models (from small to XXL) that started from checkpoints of the T5 series, but were trained further on 100B additional tokens from C4.[12]
  • Switch Transformer (2021): a mixture-of-experts variant of T5, obtained by replacing the feedforward layers in the encoder and decoder blocks with mixture-of-experts feedforward layers.[13][14]
  • T0 3B, 11B (2021): a series of models that started from checkpoints of LM-adapted T5, and were further trained to perform tasks based only on a task instruction (zero-shot).[15] Different entries in the series use different finetuning data.[16]
  • ByT5 (2021): a byte-level version of T5, trained on the mC4 (multilingual C4) dataset.[17] It operates on text encoded as UTF-8 bytes, without tokenizers.
  • Flan-T5-XL (2022): a model that started with a checkpoint of T5 XL, then instruction-tuned on the FLAN dataset (a usage sketch follows this list).[18][19][20][21]
  • T5X (2022): a JAX-based re-implementation of the original T5 codebase. It is not a model.[22] The original T5 codebase was implemented in TensorFlow with MeshTF.[2]
  • UL2 20B (2022): a model with the same architecture as the T5 series, but scaled up to 20B parameters and trained with a "mixture of denoisers" objective on C4.[23] It was trained by accident, when a training run was left running on a TPU cluster for a month.[24]
  • Flan-UL2 20B (2022): UL2 20B instruction-finetuned on the FLAN dataset.[23][20]
  • Pile-T5 (2024): has the same architecture as T5, except it uses the Llama tokenizer. It was trained on The Pile, and comes in base, large, XL, and XXL sizes.[25]
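As referenced in the Flan-T5 entry above, instruction-tuned variants can follow free-form instructions without task-specific prefixes. The following is a hedged sketch, assuming the Hugging Face Transformers library and the "google/flan-t5-small" checkpoint (the small checkpoint is an assumption chosen for illustration; the variant discussed above is the larger Flan-T5-XL):

    # Zero-shot instruction following with an instruction-tuned T5 variant
    # (assumes Hugging Face Transformers and the "google/flan-t5-small" checkpoint).
    from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

    tokenizer = AutoTokenizer.from_pretrained("google/flan-t5-small")
    model = AutoModelForSeq2SeqLM.from_pretrained("google/flan-t5-small")

    # Unlike the original T5, the instruction-tuned model is prompted with a
    # natural-language instruction rather than a fixed task prefix.
    prompt = "Answer the following question. What language is spoken in Germany?"
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=20)
    print(tokenizer.decode(output_ids[0], skip_special_tokens=True))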

Applications


The T5 model itself is an encoder-decoder model, allowing it to be used for instruction following. The encoder encodes the instruction, and the decoder autoregressively generates the reply.

The T5 encoder can be used as a text encoder, much like BERT. It encodes text into a sequence of real-valued vectors, which can be used for downstream applications. For example, Google Imagen[26] uses T5-XXL as its text encoder, and the encoded text vectors are used as conditioning for a diffusion model. As another example, the AuraFlow diffusion model[27] uses Pile-T5-XL.
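The encoder-only usage described above can be sketched as follows, assuming the Hugging Face Transformers library and a small checkpoint for illustration (Imagen itself uses the much larger T5-XXL):

    # Using only the T5 encoder as a text encoder (assumes Hugging Face
    # Transformers and the "google-t5/t5-small" checkpoint for illustration).
    import torch
    from transformers import AutoTokenizer, T5EncoderModel

    tokenizer = AutoTokenizer.from_pretrained("google-t5/t5-small")
    encoder = T5EncoderModel.from_pretrained("google-t5/t5-small")

    inputs = tokenizer("A photograph of a corgi riding a bicycle.", return_tensors="pt")
    with torch.no_grad():
        outputs = encoder(**inputs)

    # One real-valued vector per input token; a downstream model (e.g. a diffusion
    # model) can use these vectors as conditioning.
    print(outputs.last_hidden_state.shape)  # (batch_size, sequence_length, d_model)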

References

  1. Raffel, Colin; Shazeer, Noam; Roberts, Adam; Lee, Katherine; Narang, Sharan; Matena, Michael; Zhou, Yanqi; Li, Wei; Liu, Peter J. (2020). "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer". Journal of Machine Learning Research. 21 (140): 1–67. arXiv:1910.10683. ISSN 1533-7928.
  2. google-research/text-to-text-transfer-transformer, Google Research, 2024-08-21. Retrieved 2024-08-21.
  3. Vaswani, Ashish; Shazeer, Noam; Parmar, Niki; Uszkoreit, Jakob; Jones, Llion; Gomez, Aidan N.; Kaiser, Łukasz; Polosukhin, Illia (2017). "Attention is All you Need". Advances in Neural Information Processing Systems. 30. Curran Associates, Inc.
  4. Jiang, Yunfan; Gupta, Agrim; Zhang, Zichen; Wang, Guanzhi; Dou, Yongqiang; Chen, Yanjun; Fei-Fei, Li; Anandkumar, Anima; Zhu, Yuke (2022-10-06). "VIMA: General Robot Manipulation with Multimodal Prompts". arXiv:2210.03094 [cs.RO].
  5. Zhang, Aston; Lipton, Zachary; Li, Mu; Smola, Alexander J. (2024). "11.9. Large-Scale Pretraining with Transformers". Dive into Deep Learning. Cambridge University Press. ISBN 978-1-009-38943-3.
  6. "config.json · google-t5/t5-11b at main". huggingface.co. 2020-04-24. Retrieved 2024-09-17.
  7. Shaw, Peter; Uszkoreit, Jakob; Vaswani, Ashish (2018-04-12). Self-Attention with Relative Position Representations. arXiv:1803.02155.
  8. "t5x/docs/models.md at main · google-research/t5x". GitHub. Retrieved 2024-08-05.
  9. Shazeer, Noam (2020-02-12). GLU Variants Improve Transformer. arXiv:2002.05202.
  10. "config.json · google/t5-v1_1-xl at main". huggingface.co. 2020-11-19. Retrieved 2024-09-17.
  11. "config.json · google/t5-v1_1-xxl at main". huggingface.co. 2020-11-19. Retrieved 2024-09-17.
  12. Lester, Brian; Al-Rfou, Rami; Constant, Noah (2021-09-02). The Power of Scale for Parameter-Efficient Prompt Tuning. arXiv:2104.08691.
  13. Fedus, William; Zoph, Barret; Shazeer, Noam (2022-06-16). Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity. arXiv:2101.03961.
  14. "SwitchTransformers". huggingface.co. Retrieved 2024-08-05.
  15. Sanh, Victor; Webson, Albert; Raffel, Colin; Bach, Stephen H.; Sutawika, Lintang; Alyafeai, Zaid; Chaffin, Antoine; Stiegler, Arnaud; Scao, Teven Le (2022-03-17). Multitask Prompted Training Enables Zero-Shot Task Generalization. arXiv:2110.08207.
  16. "bigscience/T0 · Hugging Face". huggingface.co. 2024-03-04. Retrieved 2024-08-21.
  17. Xue, Linting; Barua, Aditya; Constant, Noah; Al-Rfou, Rami; Narang, Sharan; Kale, Mihir; Roberts, Adam; Raffel, Colin (2022-03-25). "ByT5: Towards a Token-Free Future with Pre-trained Byte-to-Byte Models". Transactions of the Association for Computational Linguistics. 10: 291–306. arXiv:2105.13626. doi:10.1162/tacl_a_00461. ISSN 2307-387X.
  18. Chung, Hyung Won; Hou, Le; Longpre, Shayne; Zoph, Barret; Tay, Yi; Fedus, William; Li, Yunxuan; Wang, Xuezhi; Dehghani, Mostafa; Brahma, Siddhartha; Webson, Albert; Gu, Shixiang Shane; Dai, Zhuyun; Suzgun, Mirac; Chen, Xinyun (2024). "Scaling Instruction-Finetuned Language Models". Journal of Machine Learning Research. 25 (70): 1–53. arXiv:2210.11416. ISSN 1533-7928.
  19. Longpre, Shayne; Hou, Le; Vu, Tu; Webson, Albert; Chung, Hyung Won; Tay, Yi; Zhou, Denny; Le, Quoc V.; Zoph, Barret; Wei, Jason; Roberts, Adam (2023-07-03). "The Flan Collection: Designing Data and Methods for Effective Instruction Tuning". Proceedings of the 40th International Conference on Machine Learning. PMLR: 22631–22648. arXiv:2301.13688.
  20. google-research/FLAN, Google Research, 2024-08-03. Retrieved 2024-08-05.
  21. "google/flan-t5-xl · Hugging Face". huggingface.co. 2024-01-04. Retrieved 2024-08-05.
  22. Roberts, Adam; Chung, Hyung Won; Mishra, Gaurav; Levskaya, Anselm; Bradbury, James; Andor, Daniel; Narang, Sharan; Lester, Brian; Gaffney, Colin; Mohiuddin, Afroz; Hawthorne, Curtis; Lewkowycz, Aitor; Salcianu, Alex; Zee, Marc van; Austin, Jacob (2023). "Scaling Up Models and Data with t5x and seqio". Journal of Machine Learning Research. 24 (377): 1–8. ISSN 1533-7928.
  23. Tay, Yi; Dehghani, Mostafa; Tran, Vinh Q.; Garcia, Xavier; Wei, Jason; Wang, Xuezhi; Chung, Hyung Won; Shakeri, Siamak; Bahri, Dara (2023-02-28). UL2: Unifying Language Learning Paradigms. arXiv:2205.05131.
  24. "Training great LLMs entirely from ground up in the wilderness as a startup". Yi Tay. Retrieved 2024-10-18.
  25. Sutawika, Lintang; Komatsuzaki, Aran; Raffel, Colin (2024-04-15). "Pile-T5". EleutherAI Blog. Retrieved 2024-05-05.
  26. "Imagen: Text-to-Image Diffusion Models". imagen.research.google. Retrieved 2024-08-23.
  27. "AuraFlow". huggingface.co. Retrieved 2024-08-23.

Notes

  1. ^
     # Count total, encoder, and decoder parameters for the original T5 checkpoints
     # (uses the Hugging Face Transformers library).
     import torch
     from transformers import AutoConfig, AutoModelForSeq2SeqLM

     def count_parameters(model):
         enc = sum(p.numel() for p in model.encoder.parameters())
         dec = sum(p.numel() for p in model.decoder.parameters())
         total = enc + dec
         return total, enc, dec

     for name in ["t5-small", "t5-base", "t5-large", "t5-3b", "t5-11b"]:
         print(f"Model: {name}")
         config = AutoConfig.from_pretrained(f"google-t5/{name}")
         torch_dtype = torch.float16
         model = AutoModelForSeq2SeqLM.from_config(config, torch_dtype=torch_dtype)
         total, enc, dec = count_parameters(model)
         print(f"Total number of parameters in {name}: {total}")
         print(f"Total number of parameters in encoder: {enc}")
         print(f"Total number of parameters in decoder: {dec}")
         del model
  2. ^
     # Count total, encoder, and decoder parameters for the T5 v1.1 checkpoints
     # (uses the Hugging Face Transformers library).
     import torch
     from transformers import AutoConfig, AutoModelForSeq2SeqLM

     def count_parameters(model):
         enc = sum(p.numel() for p in model.encoder.parameters())
         dec = sum(p.numel() for p in model.decoder.parameters())
         total = enc + dec
         return total, enc, dec

     for name in ["small", "base", "large", "xl", "xxl"]:
         print(f"Model: {name}")
         config = AutoConfig.from_pretrained(f"google/t5-v1_1-{name}")
         torch_dtype = torch.float16
         model = AutoModelForSeq2SeqLM.from_config(config, torch_dtype=torch_dtype)
         total, enc, dec = count_parameters(model)
         print(f"Total number of parameters in {name}: {total}")
         print(f"Total number of parameters in encoder: {enc}")
         print(f"Total number of parameters in decoder: {dec}")
         del model