feat: add caching to GapicCallable #527


Merged: daniel-sanche merged 28 commits into main from optimize_gapic_callable on May 3, 2024

Conversation

@daniel-sanche (Contributor) commented Sep 6, 2023 (edited by parthea)

_GapicCallable currently does a lot of work on each call, rebuilding each wrapped function through many calls to helper functions. This cost is added to every single RPC, so it can really add up.

This PR makes the following optimizations:

  • Removes the _apply_decorators and _is_not_none_or_false helpers and builds the wrapped call more directly.
    • This seems to make it ~10% faster.
  • Adds a new helper that builds a wrapped call from a timeout and retry object, then caches the result with @lru_cache (see the sketch below).
    • This seems to make it 50% faster.
    • In practice, I think it's safe to assume most calls will be reusing the same timeout and retry values.
    • The cache size is currently 4, but this can be changed.
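
For illustration, a minimal sketch of the caching idea (the helper name _build_wrapped_call is hypothetical, not the identifier used in this PR, and it assumes retry and timeout are api_core-style decorator objects):

import functools

@functools.lru_cache(maxsize=4)  # small cache; most RPCs reuse the same values
def _build_wrapped_call(target, retry, timeout):
    # Compose the retry/timeout wrappers once per (retry, timeout) pair.
    # lru_cache keys on the arguments, so a repeat call with the same
    # retry and timeout objects returns the cached wrapper instead of
    # rebuilding it.
    wrapped = target
    if timeout is not None:
        wrapped = timeout(wrapped)  # api_core timeout objects act as decorators
    if retry is not None:
        wrapped = retry(wrapped)    # api_core Retry objects act as decorators
    return wrapped

Each RPC then looks up its wrapper through a helper like this instead of rebuilding the chain, which is where the roughly 2x speedup in the benchmark below comes from.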

Benchmark:

from google.api_core.gapic_v1.method import _GapicCallable
from google.api_core.retry import Retry

callable = _GapicCallable(
    lambda *a, **k: 1, retry=Retry(), timeout=1010, compression=False
)

from timeit import timeit
timeit(lambda: callable())

Before: 20.43s
After: 9.48s
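
(By default, timeit runs the statement 1,000,000 times, so these totals work out to roughly 20µs per call before the change and 9.5µs after.)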

BEGIN_COMMIT_OVERRIDE
chore: add caching to GapicCallable
END_COMMIT_OVERRIDE

product-auto-label bot added the size: s (Pull request size is small.) label on Sep 6, 2023
product-auto-label bot added size: m (Pull request size is medium.) and removed size: s (Pull request size is small.) labels on Feb 10, 2024
daniel-sanche marked this pull request as ready for review on February 10, 2024 01:35
daniel-sanche changed the title from "[DRAFT] feat: optimize GapicCallable" to "feat: add caching to GapicCallable" on Feb 10, 2024
vchudnov-g self-assigned this on Feb 12, 2024
@parthea (Collaborator) left a comment

> This seems to make it 50% faster

Thanks for fixing this!

Please could you add a simple benchmarking presubmit, similar to the test that you ran manually, to avoid future regressions in performance?

@parthea (Collaborator) commented

Assigning back to @daniel-sanche to resolve the presubmit failure.

@daniel-sanche (Contributor, Author) commented

Hmm, good point: the benchmark result will be machine-specific, and I was doing my tests locally rather than on the CI workers.

I guess I'll have to find an assertion value that works well for the CI nodes, and I'll add a comment explaining that it may flake on slower hardware. Or let me know if you have other suggestions for how to approach this.

@parthea (Collaborator) commented Feb 14, 2024 (edited)

Can you set it high enough that we don't get flaky results, but low enough that we can still detect performance regressions?

Perhaps set the threshold to 0.4 for now and create an issue in https://github.com/googleapis/python-api-core/issues to add a proper benchmarking test? I believe @ohmayr started looking into a benchmarking presubmit, so please tag him on the issue.

@daniel-sanche (Contributor, Author) commented

Sure, I opened an issue to track this here: #616

I adjusted the value to 0.4. Feel free to merge it with that number, but I suspect we can find a lower value that still avoids flakiness. Let me know if you want me to do some investigation.

@parthea (Collaborator) commented

@vchudnov-g Please could you review?

@vchudnov-g (Contributor) left a comment

Minor code comment, and an idea about tightening benchmarks.

Referenced snippet from the test:

# Note: The threshold has been tuned for the CI workers. Test may flake on
# slower hardware
# https://github.com/googleapis/python-api-core/pull/527
@vchudnov-g (Contributor)

Do you mean to self-reference this PR?

@daniel-sanche (Contributor, Author)

It was intentional, to give context on this test. But on second thought, git blame should be enough. Removed.

Referenced snippet from the test:

gapic_callable = _GapicCallable(
    lambda *a, **k: 1, retry=Retry(), timeout=1010, compression=False
)
avg_time = timeit(lambda: gapic_callable(), number=10_000)
assert avg_time < 0.4
@vchudnov-g (Contributor) commented Mar 29, 2024 (edited)

Idea: If the assertion fails, print both the actual time it took and enough platform information so that in the future we can add the right threshold for the platform. The latter would be something like this:

platform_threshold = {"foo": 0.2, "bar": 0.6}
current_platform = ...
...
assert avg_time < platform_threshold.get(current_platform, 0.4)

In fact, you could implement platform_threshold now, and start with whatever your current machine is.
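
For concreteness, one way current_platform could be computed with only the standard library (a sketch; the key format and the threshold entries are placeholders, not values agreed on in this thread):

import os
import platform

def current_platform_key():
    # Fingerprint the CI worker by OS, CPU architecture, and core count,
    # the attributes most likely to shift the benchmark timings.
    return f"{platform.system()}-{platform.machine()}-{os.cpu_count()}cpu"

platform_threshold = {"Linux-x86_64-8cpu": 0.2}  # placeholder entry
threshold = platform_threshold.get(current_platform_key(), 0.4)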

@daniel-sanche (Contributor, Author)

That's an interesting idea, but it's not completely clear to me what we'd need to capture for the platform. Number of CPUs? Architecture? OS? Let me know if you have thoughts.

We already have #616 to track improving this, though, so if it's alright with you, I'll merge this as-is and we can discuss follow-ups there.

daniel-sanche merged commit d96eb5c into main on May 3, 2024
27 checks passed
daniel-sanche deleted the optimize_gapic_callable branch on May 3, 2024 20:05
release-please bot mentioned this pull request on May 3, 2024
parthea restored the optimize_gapic_callable branch on June 4, 2024 11:48
parthea added a commit that referenced this pull request on Jun 4, 2024
parthea added the release-please:force-run (To run release-please) label on Jun 4, 2024
release-please bot removed the release-please:force-run (To run release-please) label on Jun 4, 2024
Reviewers

parthea (approved these changes)
vchudnov-g (approved these changes)

Assignees

vchudnov-g

Labels

size: m (Pull request size is medium.)

Projects

None yet

Milestone

No milestone

3 participants: @daniel-sanche, @parthea, @vchudnov-g
