Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

Expand PCA sizes#191

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to ourterms of service andprivacy statement. We’ll occasionally send you account related emails.

Already on GitHub?Sign in to your account

Open
david-cortes-intel wants to merge2 commits intoIntelPython:main
base:main
Choose a base branch
Loading
fromdavid-cortes-intel:larger_pca

Conversation

@david-cortes-intel
Copy link
Contributor

Description

This PR expands the sizes of the synthetic datasets used to benchmark PCA.

Currently, these cases involve 3 components, in many cases out of thousands of features, which is not a representative application and thus not a good candidate for benchmarking. The PR expands those to 20 which is more reasonable.

It also makes the synthetic datasets wider (=more columns) and shorter (=fewer rows) as large-scale PCA is for the most part meant to be applied to wide datasets, and substantially increases the sizes of the inputs for.transform() as the benchmarks for those cases are very short.

Note that this PR might increase the time it takes to execute a benchmark run, especially from the data generation step. I do not know how much the timings will change if this is merged.


Checklist:

Completeness and readability

  • Git commit message contains an appropriate signed-off-by string(seeCONTRIBUTING.md for details).
  • I have resolved any merge conflicts that might occur with the base branch.

Testing

  • I have run it locally and tested the changes extensively.
  • All CI jobs are green or I have provided justification why they aren't.

@david-cortes-intel
Copy link
ContributorAuthor

CI errors should be fixed once this PR is merged in sklearnex:uxlfoundation/scikit-learn-intelex#2741

Sign up for freeto join this conversation on GitHub. Already have an account?Sign in to comment

Reviewers

@Vika-FVika-FAwaiting requested review from Vika-F

@avolkov-intelavolkov-intelAwaiting requested review from avolkov-intel

@AlexsandrussAlexsandrussAwaiting requested review from AlexsandrussAlexsandruss is a code owner

At least 1 approving review is required to merge this pull request.

Assignees

No one assigned

Labels

datasetsExtension or fix load dataset

Projects

None yet

Milestone

No milestone

Development

Successfully merging this pull request may close these issues.

1 participant

@david-cortes-intel

[8]ページ先頭

©2009-2025 Movatter.jp