| | |
| | |
Stat |
Members: 3645 Articles: 2'500'096 Articles rated: 2609
17 April 2024 |
|
| | | |
|
Article overview
| |
|
Statistically-Consistent k-mer Methods for Phylogenetic Tree Reconstruction | Elizabeth S. Allman
; John A. Rhodes
; Seth Sullivant
; | Date: |
6 Nov 2015 | Abstract: | Frequencies of $k$-mers in sequences are sometimes used as a basis for
inferring phylogenetic trees without first obtaining a multiple sequence
alignment. We show that a standard approach of using the squared-Euclidean
distance between $k$-mer vectors to approximate a tree metric can be
statistically inconsistent. To remedy this, we derive model-based distance
corrections for orthologous sequences without gaps, which lead to consistent
tree inference. The identifiability of model parameters from $k$-mer
frequencies is also studied. Finally, we report simulations showing the
corrected distance out-performs many other $k$-mer methods, even when sequences
are generated with an insertion and deletion process. These results have
implications for multiple sequence alignment as well, since $k$-mer methods are
usually the first step in constructing a guide tree for such algorithms. | Source: | arXiv, 1511.1956 | Services: | Forum | Review | PDF | Favorites |
|
|
No review found.
Did you like this article?
Note: answers to reviews or questions about the article must be posted in the forum section.
Authors are not allowed to review their own article. They can use the forum section.
browser claudebot
|
| |
|
|
|
| News, job offers and information for researchers and scientists:
| |