Supported Files

File Types

Document AI supports the following image types.

For information about file size and page limits, refer to theQuotas andLimits page.

Note: Document AI includes some supported file types inPreview.These will be charged when they are released to General Availability (GA).
NameFile Extension(s)MIME Type
Portable Document Format (PDF).pdfapplication/pdf
Graphics Interchange Format (GIF).gifimage/gif
Tag Image File Format (TIFF).tiff,.tifimage/tiff
Joint Photographic Experts Group (JPEG).jpg,.jpegimage/jpeg
Portable Network Graphics (PNG).pngimage/png
Bitmap (BMP).bmpimage/bmp
WebP.webpimage/webp
HyperText Markup Language (HTML).htmltext/html
Microsoft Word Office Open XML (OOXML).docxapplication/vnd.openxmlformats-officedocument.wordprocessingml.document
Microsoft PowerPoint OOXML.pptxapplication/vnd.openxmlformats-officedocument.presentationml.presentation
Microsoft Excel OOXML.xlsxapplication/vnd.openxmlformats-officedocument.spreadsheetml.sheet

Note that some of these image formats are "lossy" (for example, JPEG). Reducingfile sizes for lossy formats may result in a degradation of image quality and accuracy of results from Document AI.

Note: Prior JPEG compressions for TIFF are unsupported. Type of JPEG encapsulationdefined by the TIFFversion 6.0specification.Note: HTML and OOXML support are only available withlayoutparser.Customsplitter only supports PDF, TIFF, TIF, andGIF file types.

Document scan resolution

For most accurate OCR results from Document AI, document scans should bea minimum of 200 dpi(dots per inch).300 dpi and higher generally produce the best results. OCR accuracy is dependenton both the resolution and the minimum font size, along with other factors likedocument (and if handwritten, handwriting) quality, so testing is recommended.Theimage quality analysisfeature can help evaluate resolution concerns.

NOTE: 2k x 3k pixels are required for the US driver's license back side image inorder to read the barcode.

Except as otherwise noted, the content of this page is licensed under theCreative Commons Attribution 4.0 License, and code samples are licensed under theApache 2.0 License. For details, see theGoogle Developers Site Policies. Java is a registered trademark of Oracle and/or its affiliates.

Last updated 2026-02-19 UTC.