Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Issue on the clustering algorithm #14

Open
dyxstat opened this issue Sep 6, 2022 · 1 comment
Open

Issue on the clustering algorithm #14

dyxstat opened this issue Sep 6, 2022 · 1 comment

Comments

@dyxstat
Copy link

dyxstat commented Sep 6, 2022

Hi developers,

Thanks for developing this good and user-friendly software. I am curious about the clustering algorithm you used in vRhyme.

You mentioned 'Weighted networks, representing unrefined bins, are created where each node is a scaffold and each edge is a weighted connection between paired scaffolds. Networks are refined using MiniBatchKMeans implemented in Scikit-Learn' in the vRhyme paper. However, the input of KMeans is usually the feature vectors. I wonder whether you use some tricks, such as kernel, to generalize the KMeans algorithm?

I would appreciate it if you could explain more about how to refine networks using KMeans.

Thanks,
Yancey

@KrisKieft
Copy link
Member

In an update I moved from kmeans to label propagation for bin refinement. The downside of software publications is that it's a snapshot of a previous version. LP seemed to be more accurate and less based on estimated parameters. Let me know if you have questions on the update.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants