IntelPython/scikit-learn_bench


Remove epsilon dataset usage for ml_benchmarks #197


Draft

ethanglaser wants to merge 2 commits into main from dev/eglaser-rm-epsilon

Conversation

Contributor

ethanglaser commented Nov 12, 2025 (edited)

Description

Avoid ChunkedEncodingError / IncompleteRead failures in CI by disabling use of the epsilon dataset.
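
As an illustration only (this is not the PR's diff), disabling a dataset amounts to dropping every benchmark case that references it. The sketch below assumes a hypothetical config layout, a flat list of cases with `algorithm` and `dataset` keys; the real ml_benchmarks config schema is not shown in this thread, and the `mnist` and `higgs` entries and the `drop_dataset` helper are placeholders.

```python
import json

# Hypothetical config layout: a flat list of benchmark cases, each naming
# the dataset it runs on. The actual ml_benchmarks schema may differ.
EXAMPLE_CONFIG = json.loads("""
[
  {"algorithm": "dbscan",  "dataset": "epsilon"},
  {"algorithm": "dbscan",  "dataset": "mnist"},
  {"algorithm": "xgboost", "dataset": "epsilon"},
  {"algorithm": "xgboost", "dataset": "higgs"}
]
""")


def drop_dataset(cases, name):
    """Return the cases that do not reference the dataset called `name`."""
    return [case for case in cases if case.get("dataset") != name]


# Cases that would still run once epsilon is disabled.
remaining = drop_dataset(EXAMPLE_CONFIG, "epsilon")
print([(case["algorithm"], case["dataset"]) for case in remaining])
```

Because the dataset is never requested, the download that raises the errors never starts; the trade-off is losing whatever workload shape that dataset covered.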

Checklist:

Completeness and readability

I have commented my code, particularly in hard-to-understand areas.
I have updated the documentation to reflect the changes or created a separate PR with updates and provided its number in the description, if necessary.
Git commit message contains an appropriate signed-off-by string (see CONTRIBUTING.md for details).
I have resolved any merge conflicts that might occur with the base branch.

Testing

I have run it locally and tested the changes extensively.
All CI jobs are green or I have provided justification why they aren't.
I have extended testing suite if new functionality was introduced in this PR.

Remove epsilon dataset usage for ml_benchmarks

becce41

Contributor, Author

ethanglaser commented Nov 12, 2025

http://intel-ci.intel.com/f0bf6601-ce1e-f178-89fb-a4bf010d0e2d

remove sensit from dbscan

098b21b

Contributor

david-cortes-intel commented Nov 20, 2025

@razdoburdin Would it be a problem to remove this dataset?

Collaborator

razdoburdin commented Nov 20, 2025

> @razdoburdin Would it be a problem to remove this dataset?

It represents the XGBoost cases where the histogram is too large to fit in cache, but we can replace it with synthetic data or use preloaded data.

david-cortes-intel approved these changes on Nov 20, 2025

Contributor

david-cortes-intel commented Nov 26, 2025

@ethanglaser Any blockers for merging this PR?

Contributor, Author

ethanglaser commented Nov 26, 2025 (edited)

> @ethanglaser Any blockers for merging this PR?

Last I checked there were still some issues with the job. Let's see how http://intel-ci.intel.com/f0caf58d-06da-f13b-8936-a4bf010d0e2d goes. Also, I think we may need some help from Aleksei: the filters parameter does not work as intended (from what I can tell, it always falls back to the default).

Contributor

david-cortes-intel commented Nov 27, 2025

>> @ethanglaser Any blockers for merging this PR?

> Last I checked there were still some issues with the job. Let's see how http://intel-ci.intel.com/f0caf58d-06da-f13b-8936-a4bf010d0e2d goes. Also I think we may need some help from Aleksei, the filters parameter does not work as intended (it always resorts to default from what I can tell)

Why would the filters matter if this is removing it from the configs?

Labels

None yet
