- Notifications
You must be signed in to change notification settings - Fork1.5k
Insights: py-pdf/pypdf
Overview
Could not load contribution data
Please try again later
2 Releases published by1 person
- 5.6.1 Version 5.6.1, 2025-06-22
published
Jun 22, 2025 - 5.7.0 Version 5.7.0, 2025-06-29
published
Jun 29, 2025
27 Pull requests merged by7 people
- ENH: Implement flattening for writer
#3312 merged
Jul 11, 2025 - MAINT: Reduce max-complexity of tool.ruff.lint.mccabe
#3365 merged
Jul 11, 2025 - ROB: Resolve some image extraction edge cases
#3371 merged
Jul 10, 2025 - DEV: Rename extension to yaml
#3366 merged
Jul 9, 2025 - MAINT: Use dispatch table for TextExtraction.process_operation
#3360 merged
Jul 9, 2025 - STY: Use math.isclose() to determine if values are close
#3364 merged
Jul 8, 2025 - ROB: ignore faulty trailing newline during RLE decoding
#3355 merged
Jul 4, 2025 - MAINT: Move text extraction state to TextExtraction class
#3350 merged
Jul 4, 2025 - TST: Improve text extraction coverage
#3353 merged
Jul 2, 2025 - BUG: Unterminated object when using PdfWriter with incremental=True
#3345 merged
Jul 1, 2025 - MAINT: Move code from _page.py to _text_extraction
#3343 merged
Jul 1, 2025 - ROB: Gracefully handle odd-length strings in parse_bfchar
#3348 merged
Jul 1, 2025 - DEV: Modernize license specifiers
#3338 merged
Jun 30, 2025 - ROB: Flate decoding for streams with faulty tail bytes
#3332 merged
Jun 27, 2025 - STY: Remove an immediate update when creating a DictionaryObject
#3334 merged
Jun 27, 2025 - ROB: dc_creator could be a Bag as well
#3333 merged
Jun 26, 2025 - PI: Performance optimization for LZW decoding
#3329 merged
Jun 26, 2025 - ROB: Handle tree being NullObject when retrieving named destinations
#3331 merged
Jun 24, 2025 - MAINT: Move inline-image mappings to constants
#3328 merged
Jun 23, 2025 - DOC: Add comments to page module
#3319 merged
Jun 22, 2025 - MAINT: Update urllib3 to 2.5.0
#3326 merged
Jun 22, 2025 - ROB: Deal with annotations not being lists on merge
#3321 merged
Jun 19, 2025 - ROB: Handle NullObject for cmap encoding Differences entry
#3317 merged
Jun 18, 2025 - DEV: Update ruff to 0.12.0
#3316 merged
Jun 18, 2025 - ENH: Add PDF/A XMP metadata support
#3314 merged
Jun 18, 2025 - MAINT: Improve benchmarking code
#3315 merged
Jun 17, 2025 - MAINT: Update comment
#3311 merged
Jun 17, 2025
6 Pull requests opened by4 people
- STY: Tweak PdfWriter
#3337 opened
Jun 28, 2025 - MAINT: Refactor _page.py
#3339 opened
Jun 29, 2025 - BUG: check whether self._info is not None before accessing its indirect_reference property
#3349 opened
Jul 1, 2025 - ENH: Add all font metrics for base 14 Type 1 PDF fonts.
#3363 opened
Jul 7, 2025 - STY: Tweak _page.py
#3370 opened
Jul 10, 2025 - MAINT: Remove intermediate list
#3373 opened
Jul 12, 2025
18 Issues closed by3 people
- ENH: Flatten PDF forms
#232 closed
Jul 11, 2025 - Cannot parse CreationDate
#3362 closed
Jul 7, 2025 - Radnom space added inside words
#3358 closed
Jul 4, 2025 - PdfWriter crashes on KeyError '/Subtype' when merging pdfs
#3356 closed
Jul 4, 2025 - Implementation of get_contents
#3351 closed
Jul 2, 2025 - Unterminated object when using PdfWriter with incremental=True
#3118 closed
Jul 1, 2025 - parse_bfchar: binascii.Error: Odd-length string
#3347 closed
Jul 1, 2025 - invalid pdf header: b'\n%PDF'
#3340 closed
Jun 30, 2025 - Missing whitespaces between words
#3341 closed
Jun 30, 2025 - Image object not recognized
#3335 closed
Jun 27, 2025 - dc_creator could be read as bag and not as sequence (it could be both!)
#3324 closed
Jun 26, 2025 - TypeError: NullObject not iterable when appending PDF with null /Dests to PdfWriter
#3330 closed
Jun 24, 2025 - pypdf makes larger output files than gs
#3322 closed
Jun 20, 2025 - Unable to merge PDF with annotations
#3320 closed
Jun 19, 2025 - Null encoding causes parse failure.
#3295 closed
Jun 18, 2025 - "extract_text" doesn't output the same transformation matrix in version 3.17 as in 3.16.
#2353 closed
Jun 18, 2025 - Assert fails when getting the mediabox property for certain PDFs
#2991 closed
Jun 18, 2025 - Extend XMP metadata support with PDF/A pdfaid namespace
#3313 closed
Jun 18, 2025
8 Issues opened by3 people
- Reduce maximum complexity to 30
#3372 opened
Jul 11, 2025 - Review image extraction issues
#3369 opened
Jul 9, 2025 - Allow writing/updating all properties of an embedded file
#3368 opened
Jul 9, 2025 - Image handling: Mode from RGB path will always be overwritten
#3367 opened
Jul 9, 2025 - Corrupted unicode characters in form field
#3361 opened
Jul 5, 2025 - Cleanup ruff ignores
#3327 opened
Jun 22, 2025 - Modify/add XMP metadata
#3325 opened
Jun 21, 2025 - Form rendering issues with special characters in input text
#3318 opened
Jun 18, 2025
9 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
- ENH: Automatically preserve links in added pages
#3298 commented on
Jul 1, 2025 • 11 new comments - ENH: Add support for BrotliDecode filter (PDF 2.0) #3223
#3254 commented on
Jun 24, 2025 • 2 new comments - Refactor regular text extraction into dedicated module
#3010 commented on
Jun 19, 2025 • 0 new comments - Only last %%EOF is considered, possibly not detecting valid startxref
#3238 commented on
Jun 29, 2025 • 0 new comments - Memory leaks
#3296 commented on
Jul 6, 2025 • 0 new comments - MAINT: Simplify file identifiers generation
#2003 commented on
Jun 30, 2025 • 0 new comments - ENH: consider images inside PDF made with onlyoffice
#2637 commented on
Jun 29, 2025 • 0 new comments - STY: Add two abbreviations for an inline image object
#3048 commented on
Jul 9, 2025 • 0 new comments - MAINT: Increase readability of _merge_page
#3291 commented on
Jul 8, 2025 • 0 new comments