[WIP] binary bloat analysis #1223
Conversation
jkeiser commented Oct 12, 2020
It's definitely a good idea to keep track of size over time ... that affects the ability to distribute. Not part of this at ALL, but someday it'll be nice to do a size comparison -- instead of just comparing simdjson against other libraries for speed on certain tasks, make a separate executable for each library+task combo and compare size as well.
pauldreik commented Oct 13, 2020
Would a reasonably small example be one that takes the twitter json and outputs the first tweet found that contains "cats"? I think there are several metrics that would be interesting to keep track of over time.
There was discussion of tracking performance earlier; perhaps it would be possible to store all these metrics somewhere? The curl project tracks all sorts of things over time, and we might get inspiration there.
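Something as simple as appending one row per CI run to a CSV could be a starting point for tracking metrics over time. A minimal sketch, where the file name, column layout, and the `demo.bin` artifact are invented purely for illustration:

```shell
# Hypothetical sketch: append one row per CI run to a CSV so metrics
# such as binary size can be plotted over time. File and column names
# here are made up; nothing in the repo defines them yet.
record_size() {
  artifact=$1
  csv=$2
  # Byte size of the artifact; tr strips the padding some wc builds emit.
  size=$(wc -c < "$artifact" | tr -d ' ')
  echo "$(date -u +%Y-%m-%dT%H:%M:%SZ),${artifact},${size}" >> "$csv"
}

printf 'hello' > demo.bin   # stand-in for a real artifact like simdjson.so
record_size demo.bin sizes.csv
cat sizes.csv
```

Each run appends one timestamped row, so plotting size over time is just a matter of reading the CSV back.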
jkeiser commented Dec 22, 2020 • edited
For some reason I didn't see your response: I think it's reasonable to run this on simdjson.so at a minimum, and perhaps the parse executable. For ondemand, perhaps the partial_tweets benchmarks? Perhaps benchmark_ondemand as a whole, which would potentially give us interesting comparisons.
jkeiser commented Mar 20, 2021
@pauldreik I'm fine leaving this PR around if you plan to get back to it; otherwise let's file an issue and get back to it when we have time :)
pauldreik commented Mar 23, 2021
I pinged the author of the bloaty action job; let's see if I get a response, and if not, let's do as you suggest!
pauldreik commented Apr 4, 2021
@jkeiser I fixed the bloaty job and it seems to work, but what should we do with the results? Should we reject the pull request if the binary size increases more than X%?
jkeiser commented Jun 24, 2021
@pauldreik do you have any idea whether the results of this are generally stable? If so, I think it'd be reasonable to reject changes with a 20% size change (or at least flag the crap out of them). @lemire thoughts? At the very, very least, we should run this in CI so we can go look at the results when we're worried. If it's expensive we can restrict it to just master pushes.
lemire commented Jun 24, 2021
@jkeiser Sure, sure.
pauldreik commented Jun 27, 2021 via email
I do not know if the results are stable; I do not understand why they wouldn't be. 20% seems like a pretty big margin, let's start with that!
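A minimal sketch of what a 20% gate might look like in CI; the sizes below are made-up stand-ins for artifacts built from master and from the PR branch, and the function name is invented:

```shell
# Hypothetical CI gate: fail when the PR artifact is more than 20% larger
# than the baseline. In a real workflow the two sizes would come from the
# built binaries (e.g. wc -c on simdjson.so from master and from the PR).
check_size() {
  baseline=$1   # bytes, artifact built from master
  current=$2    # bytes, artifact built from the PR
  limit=$((baseline + baseline * 20 / 100))
  if [ "$current" -gt "$limit" ]; then
    echo "FAIL: $current bytes exceeds limit $limit (baseline $baseline + 20%)"
    return 1
  fi
  echo "OK: $current bytes within limit $limit (baseline $baseline + 20%)"
}

check_size 1000000 1250000 || true   # 25% growth: would be rejected
check_size 1000000 1150000           # 15% growth: passes
```

Returning a nonzero status from the failing branch is what would actually mark the CI job red; the threshold stays a single tunable number if 20% turns out to be too loose or too strict.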
This is for measuring binary size. Not that anyone asked about it that I know of, but it may be interesting to keep track of over time.
It is really quick to run, less than a minute.
However, I do not really know how to present the result and/or possibly act on it; see djarek/bloaty-analyze#1.