skip to main content
Ngôn ngữ:
Giới hạn tìm kiếm: Giới hạn tìm kiếm: Dạng tài nguyên Hiển thị kết quả với: Hiển thị kết quả với: Chỉ mục

SHIPS: Spectral Hierarchical Clustering for the Inference of Population Structure in Genetic Studies (SHIPS: A New Method to Infer Population Structure)

Bouaziz, Matthieu ; Paccard, Caroline ; Guedj, Mickael ; Ambroise, Christophe; Mailund, Thomas (Editor)

2012, Vol.7(10), p.e45685 [Tạp chí có phản biện]

E-ISSN: 1932-6203 ; DOI: 10.1371/journal.pone.0045685

Toàn văn sẵn có

Trích dẫn Trích dẫn bởi
  • Nhan đề:
    SHIPS: Spectral Hierarchical Clustering for the Inference of Population Structure in Genetic Studies (SHIPS: A New Method to Infer Population Structure)
  • Tác giả: Bouaziz, Matthieu ; Paccard, Caroline ; Guedj, Mickael ; Ambroise, Christophe
  • Mailund, Thomas (Editor)
  • Chủ đề: Research Article ; Biology ; Computer Science ; Mathematics ; Genetics And Genomics ; Computer Science ; Mathematics
  • Là 1 phần của: 2012, Vol.7(10), p.e45685
  • Mô tả: Inferring the structure of populations has many applications for genetic research. In addition to providing information for evolutionary studies, it can be used to account for the bias induced by population stratification in association studies. To this end, many algorithms have been proposed to cluster individuals into genetically homogeneous sub-populations. The parametric algorithms, such as Structure, are very popular but their underlying complexity and their high computational cost led to the development of faster parametric alternatives such as Admixture. Alternatives to these methods are the non-parametric approaches. Among this category, AWclust has proven efficient but fails to properly identify population structure for complex datasets. We present in this article a new clustering algorithm called Spectral Hierarchical clustering for the Inference of Population Structure (SHIPS), based on a divisive hierarchical clustering strategy, allowing a progressive investigation of population structure. This method takes genetic data as input to cluster individuals into homogeneous sub-populations and with the use of the gap statistic estimates the optimal number of such sub-populations. SHIPS was applied to a set of simulated discrete and admixed datasets and to real SNP datasets, that are data from the HapMap and Pan-Asian SNP consortium. The programs Structure, Admixture, AWclust and PCAclust were also investigated in a comparison study. SHIPS and the parametric approach Structure were the most accurate when applied to simulated datasets both in terms of individual assignments and estimation of the correct number of clusters. The analysis of the results on the real datasets highlighted that the clusterings of SHIPS were the more consistent with the population labels or those produced by the Admixture program. The performances of SHIPS when applied to SNP data, along with its relatively low computational cost and its ease of use make this method a promising solution to infer fine-scale genetic patterns.
  • Ngôn ngữ: English
  • Số nhận dạng: E-ISSN: 1932-6203 ; DOI: 10.1371/journal.pone.0045685

Đang tìm Cơ sở dữ liệu bên ngoài...