Ria is a bioinformatician who works in the Health sector. Riatook several PCR samples to determine the DNA abnormality of anunknown disease. The results of the data retrieval are thenextracted and stored into “DNA.csv”. Ria's supervisor asked her tocreate a program based on the following algorithm:
Dataset :https://www.mediafire.com/file/rny7i4t8 ... A.csv/file
Algorithm
1. Determine the value of k
2. Choose a centroid randomly
3. Repeat:
Expectation: Match each point with the nearestcentroid
Maximization : Calculate the new centroid(mean) of each cluster
4. Until the centroid position doesn'tchange
Note: Only libraries that can be used for this problem arenumpy, pandas, seaborn, matplotlib and scipy
Ria is a bioinformatician who works in the Health sector. Ria took several PCR samples to determine the DNA abnormality
-
- Site Admin
- Posts: 899603
- Joined: Mon Aug 02, 2021 8:13 am