uv. However,uv andvenv are not required.To work with allsupported file types, run:uv add "unstructured[all-docs]"uv add unstructured.txt), HTML files (.html), XML files (.xml), and emails (.eml,.msg, and.p7s) by default.To further conserve disk space and reduce code dependencies, you can run the following command instead, replacing<extra> with the appropriate extra for the target file type:uv add "unstructured[<extra>]"all-docs (for all supported file types in this list)csv (for.csv files only)docx (for.doc and.docx files only)epub (for.epub files only)image (for all supported image file types:.bmp,.heic,.jpeg,.png, and.tiff)md (for.md files only)odt (for.odt files only)org (for.org files only)pdf (for.pdf files only)pptx (for.ppt and.pptx files only)rst (for.rst files only)rtf (for.rtf files only)tsv (for.tsv files only)xlsx (for.xls and.xlsx files only)uv add "unstructured[pdf,docx]"tesseract-lang (for additional language support).epub,.odt, and.rtf files. For.rtf files, you must have version 2.14.2 or newer. Runningthis script will install the correct version for you.)Was this page helpful?