An algorithm for clustering cDNA fingerprints. Hartuv, E., Schmitt, A. O., Lange, J., Meier-Ewert, S., Lehrach, H., & Shamir, R. Genomics, 66(3):249–256, 2000.
doi  abstract   bibtex   
Clustering large data sets is a central challenge in gene expression analysis. The hybridization of synthetic oligonucleotides to arrayed cDNAs yields a fingerprint for each cDNA clone. Cluster analysis of these fingerprints can identify clones corresponding to the same gene. We have developed a novel algorithm for cluster analysis that is based on graph theoretic techniques. Unlike other methods, it does not assume that the clusters are hierarchically structured and does not require prior knowledge on the number of clusters. In tests with simulated libraries the algorithm outperformed the Greedy method and demonstrated high speed and robustness to high error rate. Good solution quality was also obtained in a blind test on real cDNA fingerprints.
@Article{hartuv00algorithm,
  author    = {E. Hartuv and A. O. Schmitt and J. Lange and S. Meier-Ewert and H. Lehrach and R. Shamir},
  title     = {An algorithm for clustering {cDNA} fingerprints.},
  journal   = {Genomics},
  year      = {2000},
  volume    = {66},
  number    = {3},
  pages     = {249--256},
  abstract  = {Clustering large data sets is a central challenge in gene expression analysis. The hybridization of synthetic oligonucleotides to arrayed cDNAs yields a fingerprint for each cDNA clone. Cluster analysis of these fingerprints can identify clones corresponding to the same gene. We have developed a novel algorithm for cluster analysis that is based on graph theoretic techniques. Unlike other methods, it does not assume that the clusters are hierarchically structured and does not require prior knowledge on the number of clusters. In tests with simulated libraries the algorithm outperformed the Greedy method and demonstrated high speed and robustness to high error rate. Good solution quality was also obtained in a blind test on real cDNA fingerprints.},
  doi       = {10.1006/geno.2000.6187},
  keywords  = {Cluster Analysis; DNA Fingerprinting; DNA, Complementary; Evaluation Studies; Humans; Reproducibility of Results; Software Validation},
  owner     = {Sebastian},
  pmid      = {10873379},
  timestamp = {2007.04.20},
}

Downloads: 0