Distance Metric Recommendation for k-Means Clustering: A Meta-Learning Approach

@memgonzales released this 30 Apr 11:09
· 1 commit to master since this release
40e0ec6

This release bundles the code and dataset for the paper "Distance Metric Recommendation for k-Means Clustering: A Meta-Learning Approach," which was accepted for presentation at the 2022 IEEE Region 10 Conference (TENCON 2022). The project page is https://github.com/memgonzales/meta-learning-clustering. The paper is available on IEEE Xplore: https://ieeexplore.ieee.org/abstract/document/9978037.

The choice of distance metric impacts the clustering quality of centroid-based algorithms, such as k-means. Theoretical attempts to select the optimal metric entail deep domain knowledge, while experimental approaches are resource-intensive. This paper presents a meta-learning approach to automatically recommend a distance metric for k-means clustering that optimizes the Davies-Bouldin score. Three distance measures were considered: Chebyshev, Euclidean, and Manhattan. General, statistical, information-theoretic, structural, and complexity meta-features were extracted, and random forest was used to construct the meta-learning model; borderline SMOTE was applied to address class imbalance. The model registered an accuracy of 70.59%. Employing Shapley additive explanations, it was found that the mean of the sparsity of the attributes has the highest meta-feature importance. Feeding only the top 25 most important meta-features increased the accuracy to 71.57%. The main contribution of this paper is twofold: the construction of a meta-learning model for distance metric recommendation and a fine-grained analysis of the importance and effects of the meta-features on the model's output.
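
To make the setup concrete, the following is a minimal sketch of the exhaustive comparison that the abstract describes as resource-intensive: cluster a dataset once per candidate metric and keep the metric with the lowest Davies-Bouldin score. It is not the code bundled in this release. The names `kmeans`, `davies_bouldin`, and `recommend_metric` are hypothetical, and the sketch makes two assumptions that the abstract does not confirm: centroids are updated as plain means regardless of the metric, and the Davies-Bouldin score is always computed with Euclidean distances so that scores are comparable across metrics. A meta-learning model would presumably be trained to predict the name this procedure returns without running it.

```python
import math
from typing import Callable, Dict, List, Sequence, Tuple

Point = Sequence[float]
Metric = Callable[[Point, Point], float]


def chebyshev(a: Point, b: Point) -> float:
    """Largest absolute coordinate difference."""
    return max(abs(x - y) for x, y in zip(a, b))


def euclidean(a: Point, b: Point) -> float:
    """Straight-line distance."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def manhattan(a: Point, b: Point) -> float:
    """Sum of absolute coordinate differences."""
    return sum(abs(x - y) for x, y in zip(a, b))


METRICS: Dict[str, Metric] = {
    "chebyshev": chebyshev,
    "euclidean": euclidean,
    "manhattan": manhattan,
}


def mean_point(points: List[Point]) -> List[float]:
    """Coordinate-wise mean of a non-empty list of points."""
    return [sum(col) / len(points) for col in zip(*points)]


def kmeans(points: List[Point], init: List[Point], metric: Metric,
           iters: int = 50) -> List[int]:
    """Lloyd-style loop: assign under `metric`, update centroids as means.

    Assumption: mean updates are used for every metric.
    """
    centroids = [list(c) for c in init]
    labels = [-1] * len(points)
    for _ in range(iters):
        new_labels = [
            min(range(len(centroids)), key=lambda j: metric(p, centroids[j]))
            for p in points
        ]
        if new_labels == labels:  # assignments stable: converged
            break
        labels = new_labels
        for j in range(len(centroids)):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # keep the previous centroid if a cluster empties
                centroids[j] = mean_point(members)
    return labels


def davies_bouldin(points: List[Point], labels: List[int]) -> float:
    """Davies-Bouldin index (Euclidean distances); lower is better."""
    clusters = sorted(set(labels))
    if len(clusters) < 2:
        raise ValueError("Davies-Bouldin needs at least two clusters")
    groups = {c: [p for p, lab in zip(points, labels) if lab == c]
              for c in clusters}
    cents = {c: mean_point(g) for c, g in groups.items()}
    # Scatter: mean distance from each member to its cluster centroid.
    scatter = {c: sum(euclidean(p, cents[c]) for p in g) / len(g)
               for c, g in groups.items()}
    worst_ratios = [
        max((scatter[i] + scatter[j]) / euclidean(cents[i], cents[j])
            for j in clusters if j != i)
        for i in clusters
    ]
    return sum(worst_ratios) / len(clusters)


def recommend_metric(points: List[Point],
                     init: List[Point]) -> Tuple[str, Dict[str, float]]:
    """Cluster once per metric; return the lowest-scoring metric and all scores."""
    scores = {name: davies_bouldin(points, kmeans(points, init, metric))
              for name, metric in METRICS.items()}
    return min(scores, key=scores.get), scores
```

Calling `recommend_metric(points, init)` with a list of points and one initial centroid per cluster returns the name of the best-scoring metric together with the score of each candidate. Ties are broken by the order of `METRICS`, which is an arbitrary choice of this sketch.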
