Remove epsilon dataset usage for ml_benchmarks #197
base: main
Conversation
ethanglaser commented Nov 12, 2025
david-cortes-intel commented Nov 20, 2025
@razdoburdin Would it be a problem to remove this dataset?
razdoburdin commented Nov 20, 2025
It represents the XGBoost cases where the histogram is too large to fit in cache, but we can replace it with synthetic data or use preloaded data.
david-cortes-intel commented Nov 26, 2025
@ethanglaser Any blockers for merging this PR?
ethanglaser commented Nov 26, 2025 • edited
Last I checked there were still some issues with the job. Let's see how http://intel-ci.intel.com/f0caf58d-06da-f13b-8936-a4bf010d0e2d goes. Also, I think we may need some help from Aleksei: the filters parameter does not work as intended (from what I can tell, it always falls back to the default).
david-cortes-intel commented Nov 27, 2025
Why would the filters matter if this is removing it from the configs?
Description
Avoid ChunkedEncodingError / IncompleteRead issues in CI by disabling usage of the epsilon dataset.
Checklist:
Completeness and readability
Testing