Any simplified instructions for a unified genealogy? #596
-
I need help going from a tree sequence to a unified genealogy. My use case is a small number of individuals (n = 38, with plans to expand, but will still be a relatively small data set). I think I'm following enough to create a tree sequence because my results include 9593 trees and 11030 nodes. Output: "Sample file created for 76 samples (38 individuals) with 37281 variable sites." The tskit tutorials that I have seen don't cover a unified genealogy and the code at https://github.com/awohns/unified_genealogy_paper/tree/master/all-data offers so much that I can't see what I need for my simple case. Is there documentation or a tutorial that presents a simple use case? From what I understand from the unified genealogy paper, the steps are:
I can do close to what I want on a single tree but wondered what was already available for the tree sequence as a unified genealogy:
|
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
Hi @lakishadavid, thanks for your message and my apologies for the delayed response! A few points:
By the way, I plan to write a tutorial which will help guide users seeking to infer a tree sequence from modern and ancient genomes, please keep an eye out at https://tsdate.readthedocs.io/en/latest/ for this! Code to get the pairwise (log) average TMRCA for each pair of haplotypes and plot the result with hierarchical clustering on the log-average times:
|
Beta Was this translation helpful? Give feedback.
Hi @lakishadavid, thanks for your message and my apologies for the delayed response!
A few points:
tsdate
?tsinfer
with a recombination map?print_sample_edge_stats()
function here. If any samples have many more edges than average, …