Insights: microsoft/markitdown
Overview
- 0 Merged pull requests
- 21 Open pull requests
- 2 Closed issues
- 14 New issues
There hasn’t been any commit activity on microsoft/markitdown in the last week.
21 Pull requests opened by 21 people
- convert sup in the word to ^ in the markdown
  #1322 opened Jul 12, 2025
- Fix #53: Preserve Excel cell formatting (currency, percentage, etc.) …
  #1325 opened Jul 14, 2025
- heyyy
  #1327 opened Jul 15, 2025
- added a new file
  #1328 opened Jul 15, 2025
- changes made
  #1334 opened Jul 16, 2025
- Update README.md
  #1335 opened Jul 16, 2025
- Incorporated options to embed or save images off of PDF
  #1336 opened Jul 16, 2025
- docs: clarify supported input formats for Markdown conversion
  #1337 opened Jul 16, 2025
- fix : typo in security.md
  #1338 opened Jul 16, 2025
- Fix: Preserve hard breaks and new lines in PPTX
  #1345 opened Jul 17, 2025
- docs(readme): fix typo 'instrutions' → 'instructions'
  #1348 opened Jul 17, 2025
- Add MseeP.ai badge
  #1349 opened Jul 17, 2025
- Update README.md
  #1350 opened Jul 17, 2025
- Fix: Add image description support for PDF converter
  #1351 opened Jul 17, 2025
- HTML| Update document intelligence file type handling
  #1352 opened Jul 17, 2025
- Ulugbek: added test file for pull request practice
  #1354 opened Jul 17, 2025
- feat(magika): make magika an optional dependency
  #1355 opened Jul 17, 2025
- feat: enhance CLI with logging, file checks, and quiet mode
  #1357 opened Jul 17, 2025
- Create Airdrop
  #1358 opened Jul 17, 2025
- Add DETAILS.md for improved documentation
  #1359 opened Jul 18, 2025
- Add files via upload
  #1360 opened Jul 18, 2025
2 Issues closed by 2 people
- Could get FontBBox from font descriptor because None cannot be parsed as 4 floats
  #1347 closed Jul 17, 2025
- How to install it
  #1330 closed Jul 15, 2025
14 Issues opened by 14 people
- Got this error while doing hatch test via hatch shell
  #1364 opened Jul 18, 2025
- Docker build to fail
  #1363 opened Jul 18, 2025
- markdown -> office document
  #1356 opened Jul 17, 2025
- MCP server should support converting without returning content to preserve agent context
  #1353 opened Jul 17, 2025
- Describing Images Inline in PDFs for Better RAG
  #1346 opened Jul 17, 2025
- OCR Fallback Not Working
  #1344 opened Jul 16, 2025
- MARKITUP - Revert back from markdown to original document
  #1341 opened Jul 16, 2025
- README.md orthography
  #1339 opened Jul 16, 2025
- MCP has no option to write the markdown to a file
  #1332 opened Jul 15, 2025
- .pptx conversion doesn't add two spaces at the end of a line to preserve new lines
  #1331 opened Jul 15, 2025
- Does it support the text recognization?
  #1324 opened Jul 14, 2025
- PDF conversion fault
  #1323 opened Jul 13, 2025
16 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
- Added DOC file support to MarkItDown
  #1316 commented on Jul 14, 2025 • 2 new comments
- Unable to extract currency from excel formatted cells
  #53 commented on Jul 14, 2025 • 0 new comments
- No log output when deploying with Docker
  #1269 commented on Jul 15, 2025 • 0 new comments
- Word Document table conversion issue
  #20 commented on Jul 15, 2025 • 0 new comments
- PptxConverter threw TypeError with message: '<' not supported between instances of 'NoneType' and 'Emu'
  #1293 commented on Jul 15, 2025 • 0 new comments
- math formula ocr
  #17 commented on Jul 15, 2025 • 0 new comments
- How to keep page number
  #1317 commented on Jul 15, 2025 • 0 new comments
- Is there anyone practice fill Word tables with this tool and AI models?
  #1297 commented on Jul 15, 2025 • 0 new comments
- PDF performance (PDFMiner)
  #1276 commented on Jul 15, 2025 • 0 new comments
- Extract one markdown per page or force a separator
  #1304 commented on Jul 15, 2025 • 0 new comments
- Arabic Text is Mirrored
  #1242 commented on Jul 16, 2025 • 0 new comments
- DocxConverter threw KeyError with message: 'w:ilvl'
  #1282 commented on Jul 17, 2025 • 0 new comments
- No images after converting PDF to Markdown
  #1298 commented on Jul 18, 2025 • 0 new comments
- File support: chm support
  #14 commented on Jul 18, 2025 • 0 new comments
- arg: fill_merged_cells. Optional to fill merged cells instead of NAN
  #1165 commented on Jul 14, 2025 • 0 new comments
- Add page-level text extraction for PDF/PPTX/DOCX documents
  #1263 commented on Jul 15, 2025 • 0 new comments