Jump to content
Main menu
Main menu
move to sidebar
hide
Navigation
Main page
Recent changes
Random page
Help about MediaWiki
Special pages
Niidae Wiki
Search
Search
Appearance
Create account
Log in
Personal tools
Create account
Log in
Pages for logged out editors
learn more
Contributions
Talk
Editing
Computational biology
(section)
Page
Discussion
English
Read
Edit
View history
Tools
Tools
move to sidebar
hide
Actions
Read
Edit
View history
General
What links here
Related changes
Page information
Appearance
move to sidebar
hide
Warning:
You are not logged in. Your IP address will be publicly visible if you make any edits. If you
log in
or
create an account
, your edits will be attributed to your username, along with other benefits.
Anti-spam check. Do
not
fill this in!
===Unsupervised Learning=== [[Unsupervised learning]] is a type of algorithm that finds patterns in unlabeled data. One example is [[k-means clustering]], which aims to partition ''n'' data points into ''k'' clusters, in which each data point belongs to the cluster with the nearest mean. Another version is the [[k-medoids]] algorithm, which, when selecting a cluster center or cluster centroid, will pick one of its data points in the set, and not just an average of the cluster. [[File:Jmatrix.png|thumb|A heat-map of the Jaccard distances of nuclear profiles]] The algorithm follows these steps: # Randomly select ''k'' distinct data points. These are the initial clusters. # Measure the distance between each point and each of the 'k' clusters. (This is the distance of the points from each point ''k''). # Assign each point to the nearest cluster. # Find the center of each cluster (medoid). # Repeat until the clusters no longer change. # Assess the quality of the clustering by adding up the variation within each cluster. # Repeat the processes with different values of k. # Pick the best value for 'k' by finding the "elbow" in the plot of which k value has the lowest variance. One example of this in biology is used in the 3D mapping of a genome. Information of a mouse's HIST1 region of chromosome 13 is gathered from [[Gene Expression Omnibus]].<ref>{{cite web | url=https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE64881 | title=GEO Accession viewer }}</ref> This information contains data on which nuclear profiles show up in certain genomic regions. With this information, the [[Jaccard distance]] can be used to find a normalized distance between all the loci.
Summary:
Please note that all contributions to Niidae Wiki may be edited, altered, or removed by other contributors. If you do not want your writing to be edited mercilessly, then do not submit it here.
You are also promising us that you wrote this yourself, or copied it from a public domain or similar free resource (see
Encyclopedia:Copyrights
for details).
Do not submit copyrighted work without permission!
Cancel
Editing help
(opens in new window)
Search
Search
Editing
Computational biology
(section)
Add topic