Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

New spectral clustering algorithm#31648

Malik-Hacini started this conversation inIdeas
Discussion options

Hey everyone,

As part of a paper in Machine Learning Research Journal, our research team is trying to submit a new clustering algorithm to scikit learn.

It is a "generalization" of Spectral Clustering to directed graphs. Spectral clustering is already present in sklearn, along with two variations.
This implementation is random walk based, and in practice, the main difference with standard SC is the computation of new (so called generalized for now) laplacian matrices.
The rest of the pipeline is pretty much the same as standard SC (kneighbors graph of the data, adjacency matrix, eigenvectors of the laplacian (spectral embedding), assigning labels).

The research is still ongoing, especially the task of finding optimal base parameters, but the results are promising. They are at least as good as classical spectral clustering (being a subcase) and often outperform it.

We are unsure on how to submit this. Based on the other spectral algorithms implemented, we think the best would be building a new estimator (probably called DiSpectral for directed spectral clustering).
The thing is this estimator would be very similar to the spectral clustering one, however it would source the laplacian from an external utils file.
Thus, we are thinking of adding it directly to the SpectralClustering estimator.

We are thus looking for help on the development of this new method, whether it be sklearn contribution rules help or help for performance optimization.

Thanks in advance !

You must be logged in to vote

Replies: 1 comment

Comment options

Hi, I'm working on something similar. I'm developing an algorithm for anomaly detection. I can't tell you exactly how to submit your algorithm to scikit-learn (I think your paper probably needs to be published first), but I can tell you that if you plan to work on the algorithm, you need to follow this guide to make it compatible with the scikit-learn environment. There are a number of conventions and validations you need to follow.
A friendly piece of advice from someone who already made this mistake: read the documentation first, then start writing the code.https://scikit-learn.org/stable/developers/index.html

You must be logged in to vote
0 replies
Sign up for freeto join this conversation on GitHub. Already have an account?Sign in to comment
Category
Ideas
Labels
None yet
2 participants
@Malik-Hacini@GiulioSurya

[8]ページ先頭

©2009-2025 Movatter.jp