Parse font properties also from the encrypted part of the file, and reimplement the parsing so it understands more of PostScript's syntax. This fixes a bug whereType1Font.transform would not remove the UniqueID key but break some PostScript code referring to UniqueID instead.

Incidentally, fix the bug where every font had aweight property with value'Normal' - the correct property is spelledWeight with a capital letter.

This is a prerequisite for subsetting Type-1 fonts (#127).

PR Checklist

Has pytest style unit tests (andpytest passes).
IsFlake 8 compliant (runflake8 on changed files to check).
New features are documented, with examples if plot related.
Documentation is sphinx and numpydoc compliant (the docs shouldbuild without error).
Conforms to Matplotlib style conventions (installflake8-docstrings and runflake8 --docstring-convention=all).
New features have an entry indoc/users/next_whats_new/ (follow instructions in README.rst there).
API changes documented indoc/api/next_api_changes/ (follow instructions in README.rst there).

jkseppan force-pushed thetype1-improved-parsing branch 4 times, most recently from981ef67 to90b5889Compare

July 22, 2021 12:46

jkseppan added a commit to jkseppan/matplotlib that referenced this pull request

Jul 22, 2021

Type-1 subsetting

b66579d

With this I can produce smaller pdf files with usetex in some smalltests, but this obviously needs more extensive testing, thus markingas draft.On top ofmatplotlib#20634 andmatplotlib#20715.Closesmatplotlib#127.

jkseppan mentioned this pull request

Jul 22, 2021

Type-1 font subsetting#20716

Open

7 tasks

jklymak added topic: text/fonts status: waiting for other PR labels

Jul 22, 2021

jkseppan added a commit to jkseppan/matplotlib that referenced this pull request

Jul 22, 2021

Type-1 subsetting

6546417

With this I can produce smaller pdf files with usetex in some smalltests, but this obviously needs more extensive testing, thus markingas draft.On top ofmatplotlib#20634 andmatplotlib#20715.Closesmatplotlib#127.

jkseppan force-pushed thetype1-improved-parsing branch 2 times, most recently frome35728b to9418b35Compare

July 22, 2021 15:59

jklymak removed the status: waiting for other PR label

Jul 22, 2021

jkseppan added a commit to jkseppan/matplotlib that referenced this pull request

Jul 22, 2021

Type-1 subsetting

3aaf4c9

With this I can produce smaller pdf files with usetex in some smalltests, but this obviously needs more extensive testing, thus markingas draft.On top ofmatplotlib#20715.Closesmatplotlib#127.

anntzer reviewed

Aug 22, 2021

View reviewed changes

lib/matplotlib/type1font.py OutdatedShow resolvedHide resolved

anntzer reviewed

Aug 22, 2021

View reviewed changes

lib/matplotlib/type1font.py OutdatedShow resolvedHide resolved

anntzer reviewed

Aug 22, 2021

View reviewed changes

lib/matplotlib/type1font.pyShow resolvedHide resolved

anntzer reviewed

Aug 22, 2021

View reviewed changes

lib/matplotlib/type1font.py OutdatedShow resolvedHide resolved

anntzer reviewed

Aug 22, 2021

View reviewed changes

lib/matplotlib/type1font.py OutdatedShow resolvedHide resolved

anntzer reviewed

Aug 22, 2021

View reviewed changes

lib/matplotlib/type1font.py OutdatedShow resolvedHide resolved

anntzer reviewed

Aug 22, 2021

View reviewed changes

lib/matplotlib/type1font.py OutdatedShow resolvedHide resolved

jkseppan force-pushed thetype1-improved-parsing branch 2 times, most recently from19119d8 to25613b9Compare

August 28, 2021 18:15

Copy link

MemberAuthor

jkseppan commentedAug 28, 2021

Thanks for the review@anntzer!

anntzer reviewed

Aug 28, 2021

View reviewed changes

lib/matplotlib/type1font.py OutdatedShow resolvedHide resolved

anntzer reviewed

Aug 28, 2021

View reviewed changes

lib/matplotlib/type1font.py OutdatedShow resolvedHide resolved

anntzer reviewed

Aug 28, 2021

View reviewed changes

lib/matplotlib/type1font.py OutdatedShow resolvedHide resolved

anntzer reviewed

Aug 28, 2021

View reviewed changes

lib/matplotlib/type1font.py OutdatedShow resolvedHide resolved

jkseppan added2 commits

August 29, 2021 09:51

Improve the Type-1 font parsing

3d443e5

Move Type1Font._tokens into a top-level function _tokenize that is acoroutine. The parsing stage consuming the tokens can instruct thetokenizer to return a binary token - this is necessary when decryptingthe CharStrings and Subrs arrays, since the preceding context determineswhich parts of the data need to be decrypted.The function now also parses the encrypted portion of the font file.To support usage as a coroutine, move the whitespace filtering into thefunction, since passing the information about binary tokens would noteasily work through a filter.The function now returns tokens as subclasses of a new _Token class,which carry the position and value of the token and can havetoken-specific helper methods. The position data will be needed whenmodifying the file, as the font is transformed or subsetted.A new helper function _expression can be used to consume tokens thatform a balanced subexpression delimited by [] or {}. This helps fix abug in UniqueID removal: if the font includes PostScript code thatchecks if the UniqueID is set in the current dictionary, the previouscode broke that code instead of removing the UniqueID definition. Fontscan include UniqueID in the encrypted portion as well as the cleartextone, and removal is now done in both portions.Fix a bug related to font weight: the key is title-cased and notlower-cased, so font.prop['weight'] should not exist.

Recognize abbreviations of PostScript code

e98bb83

Type-1 fonts are required to have subroutines with specific contentsbut their names may vary. They are usually ND, NP and RD but nameslike | and |- appear too.

jkseppan force-pushed thetype1-improved-parsing branch from25613b9 toe98bb83Compare

August 29, 2021 06:55

Copy link

MemberAuthor

jkseppan commentedAug 29, 2021

I added some tests and realized that the string escaping was slightly wrong. Now the code also parses string values, although it is unlikely that font properties would include escaped whitespace characters.

anntzer approved these changes

Aug 29, 2021

View reviewed changes

Copy link

Contributor

anntzer left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

I admit I still haven't followedall the logic, but we can always check again later :)

jkseppan added a commit to jkseppan/matplotlib that referenced this pull request

Aug 29, 2021

Type-1 subsetting

f6861ad

With this I can produce smaller pdf files with usetex in some smalltests, but this obviously needs more extensive testing, thus markingas draft.Give dviread.DviFont a fake filename attribute for character tracking.On top ofmatplotlib#20715.Closesmatplotlib#127.

jkseppan added a commit to jkseppan/matplotlib that referenced this pull request

Aug 30, 2021

Type-1 subsetting

d8ae364

With this I can produce smaller pdf files with usetex in some smalltests, but this obviously needs more extensive testing, thus markingas draft.Give dviread.DviFont a fake filename attribute for character tracking.On top ofmatplotlib#20715.Closesmatplotlib#127.

anntzer reviewed

Aug 30, 2021

View reviewed changes

lib/matplotlib/type1font.py

		depth += 1
		elif match.group() == ')':
		depth -= 1
		else: # a backslash

Copy link

Contributor

anntzerAug 30, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

I guess that here you don't really care about handling the backslash escapes, and all you want to do is simply to match the (unescaped) parentheses, so you could perhaps just replace instring_re by something like(?<!\\)[()] (parentheses not preceded by a backslash)?

Copy link

Contributor

anntzerOct 4, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Ah, I gave this a try but it wouldn't work because the parenthesiscan be preceded by backslashes if the backslash is itself escaped (i.e. one would really need to search for "parenthesis not preceded by an even number of backslashes"). So let's forget about this for now.

jklymak approved these changes

Oct 4, 2021

View reviewed changes

Copy link

Member

jklymak left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

I'll merge based on@anntzer review. This shouldnot go into 3.5 so that it has time to be used on master...

jklymak added this to thev3.6.0 milestone

Oct 4, 2021

jklymak merged commite9bd017 intomatplotlib:master

Oct 4, 2021

tacaswell pushed a commit to tacaswell/matplotlib that referenced this pull request

Oct 12, 2021

Merge pull requestmatplotlib#20715from jkseppan/type1-improved-parsing

42e8e12

Improve Type-1 font parsing

tacaswell pushed a commit that referenced this pull request

Oct 20, 2021

Merge pull request#20715from jkseppan/type1-improved-parsing

0bb36e8

Improve Type-1 font parsing

oscargus mentioned this pull request

Jan 6, 2022

Deprecatedafm,fontconfig_pattern, andtype1font#22133

Merged

3 tasks

jkseppan added a commit to jkseppan/matplotlib that referenced this pull request

May 4, 2025

Type-1 subsetting

4dcd073

With this I can produce smaller pdf files with usetex in some smalltests, but this obviously needs more extensive testing, thus markingas draft.Give dviread.DviFont a fake filename attribute for character tracking.On top ofmatplotlib#20715.Closesmatplotlib#127.

Labels

topic: text/fonts

3 participants

Movatterモバイル変換

Uh oh!

Improve Type-1 font parsing#20715

Improve Type-1 font parsing#20715

Uh oh!

Conversation

jkseppan commentedJul 22, 2021• editedLoading Uh oh!There was an error while loading.Please reload this page.

Uh oh!

PR Summary

PR Checklist

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jkseppan commentedAug 28, 2021

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jkseppan commentedAug 29, 2021

Uh oh!

anntzer left a comment

Choose a reason for hiding this comment

Uh oh!

anntzerAug 30, 2021

Choose a reason for hiding this comment

Uh oh!

anntzerOct 4, 2021

Choose a reason for hiding this comment

Uh oh!

jklymak left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

jkseppan commentedJul 22, 2021•
edited
Loading