TY - GEN
T1 - Flow decomposition with subpath constraints
AU - Williams, Lucia
AU - Tomescu, Alexandru I.
AU - Mumey, Brendan
N1 - Publisher Copyright:
© Lucia Williams, Alexandru I. Tomescu, and Brendan Mumey; licensed under Creative Commons License CC-BY 4.0 21st International Workshop on Algorithms in Bioinformatics (WABI 2021).
PY - 2021/7/1
Y1 - 2021/7/1
N2 - Flow network decomposition is a natural model for problems where we are given a flow network arising from superimposing a set of weighted paths and would like to recover the underlying data, i.e., decompose the flow into the original paths and their weights. Thus, variations on flow decomposition are often used as subroutines in multiassembly problems such as RNA transcript assembly. In practice, we frequently have access to information beyond flow values in the form of subpaths, and many tools incorporate these heuristically. But despite acknowledging their utility in practice, previous work has not formally addressed the effect of subpath constraints on the accuracy of flow network decomposition approaches. We formalize the flow decomposition with subpath constraints problem, give the first algorithms for it, and study its usefulness for recovering ground truth decompositions. For finding a minimum decomposition, we propose both a heuristic and an FPT algorithm. Experiments on RNA transcript datasets show that for instances with larger solution path sets, the addition of subpath constraints finds 13% more ground truth solutions when minimal decompositions are found exactly, and 30% more ground truth solutions when minimal decompositions are found heuristically.
AB - Flow network decomposition is a natural model for problems where we are given a flow network arising from superimposing a set of weighted paths and would like to recover the underlying data, i.e., decompose the flow into the original paths and their weights. Thus, variations on flow decomposition are often used as subroutines in multiassembly problems such as RNA transcript assembly. In practice, we frequently have access to information beyond flow values in the form of subpaths, and many tools incorporate these heuristically. But despite acknowledging their utility in practice, previous work has not formally addressed the effect of subpath constraints on the accuracy of flow network decomposition approaches. We formalize the flow decomposition with subpath constraints problem, give the first algorithms for it, and study its usefulness for recovering ground truth decompositions. For finding a minimum decomposition, we propose both a heuristic and an FPT algorithm. Experiments on RNA transcript datasets show that for instances with larger solution path sets, the addition of subpath constraints finds 13% more ground truth solutions when minimal decompositions are found exactly, and 30% more ground truth solutions when minimal decompositions are found heuristically.
KW - Flow decomposition
KW - RNA-Seq
KW - Subpath constraints
UR - https://www.scopus.com/pages/publications/85115337994
U2 - 10.4230/LIPIcs.WABI.2021.16
DO - 10.4230/LIPIcs.WABI.2021.16
M3 - Conference contribution
AN - SCOPUS:85115337994
T3 - Leibniz International Proceedings in Informatics, LIPIcs
BT - 21st International Workshop on Algorithms in Bioinformatics, WABI 2021
A2 - Carbone, Alessandra
A2 - El-Kebir, Mohammed
PB - Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing
T2 - 21st International Workshop on Algorithms in Bioinformatics, WABI 2021
Y2 - 2 August 2021 through 4 August 2021
ER -