cluster subpackage¶
Sub-package Documentation¶
The cluster sub-package is responsible for merging variants coming from different inputs (i.e. different tools).
Types of Output Files¶
expected name/suffix |
file type/format |
content |
---|---|---|
|
text/tabbed |
|
|
text |
list of cluster ids that were dropped by annotation proximity filter |
|
cluster positions |
|
|
text/tabbed |
computed clusters |
Algorithm Overview¶
Collapse any duplicate breakpoint pairs
Split breakpoint pairs by type
Cluster breakpoint pairs by distance (within a type)
Create a graph representation of the distances between pairs
Find cliques up to a given input size (cluster_clique_size)
Hierarchically cluster the cliques (allows redundant participation)
For each input node/pair pick the best cluster(s)
Output the clusters and the mapping to the input pairs
modules