scikit-learn/scikit-learnPublic

NotificationsYou must be signed in to change notification settings
Fork26k
Star62.7k

New spectral clustering algorithm#31648

Malik-Hacini started this conversation inIdeas

Malik-Hacini

Jun 24, 2025

· 1 comment

Return to top

Discussion options

Malik-Hacini
Jun 24, 2025

Hey everyone,

As part of a paper in Machine Learning Research Journal, our research team is trying to submit a new clustering algorithm to scikit learn.

It is a "generalization" of Spectral Clustering to directed graphs. Spectral clustering is already present in sklearn, along with two variations.
This implementation is random walk based, and in practice, the main difference with standard SC is the computation of new (so called generalized for now) laplacian matrices.
The rest of the pipeline is pretty much the same as standard SC (kneighbors graph of the data, adjacency matrix, eigenvectors of the laplacian (spectral embedding), assigning labels).

The research is still ongoing, especially the task of finding optimal base parameters, but the results are promising. They are at least as good as classical spectral clustering (being a subcase) and often outperform it.

We are unsure on how to submit this. Based on the other spectral algorithms implemented, we think the best would be building a new estimator (probably called DiSpectral for directed spectral clustering).
The thing is this estimator would be very similar to the spectral clustering one, however it would source the laplacian from an external utils file.
Thus, we are thinking of adding it directly to the SpectralClustering estimator.

We are thus looking for help on the development of this new method, whether it be sklearn contribution rules help or help for performance optimization.

Thanks in advance !

You must be logged in to vote

Replies: 1 comment

Comment options

GiulioSurya
Jun 25, 2025

Hi, I'm working on something similar. I'm developing an algorithm for anomaly detection. I can't tell you exactly how to submit your algorithm to scikit-learn (I think your paper probably needs to be published first), but I can tell you that if you plan to work on the algorithm, you need to follow this guide to make it compatible with the scikit-learn environment. There are a number of conventions and validations you need to follow.
A friendly piece of advice from someone who already made this mistake: read the documentation first, then start writing the code.https://scikit-learn.org/stable/developers/index.html

You must be logged in to vote

0 replies

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

New spectral clustering algorithm#31648

Uh oh!

{{title}}

Uh oh!

Malik-Hacini
Jun 24, 2025

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

GiulioSurya
Jun 25, 2025

Select a reply

Uh oh!

Movatterモバイル変換

Uh oh!

New spectral clustering algorithm#31648

Uh oh!

Malik-HaciniJun 24, 2025

Replies: 1 comment

Uh oh!

GiulioSuryaJun 25, 2025

Uh oh!

Malik-Hacini
Jun 24, 2025

GiulioSurya
Jun 25, 2025