Tersect: a set theoretical utility for exploring sequence variant data
dc.contributor.author | Kurowski, Tomasz J. | |
dc.contributor.author | Mohareb, Fady | |
dc.date.accessioned | 2020-03-20T15:28:16Z | |
dc.date.available | 2020-03-20T15:28:16Z | |
dc.date.issued | 2019-08-08 | |
dc.description.abstract | Comparing genomic features among a large panel of individuals across the same species is considered nowadays a core part of the bioinformatics analyses. This typically involves a series of complex theoretical expressions to compare, intersect, extract symmetric differences between individuals within a large set of genotypes. Several publically available tools are capable of performing such tasks; however, due to the sheer size of variants being queried, such tasks can be computationally expensive with a runtime ranging from few minutes up to several hours depending on the dataset size. This makes existing tools unsuitable for interactive data query or as part of genomic data visualization platforms such as genome browsers. Tersect is a lightweight, high-performance command-line utility which interprets and applies flexible set theoretical expressions to sets of sequence variant data. It can be used both for interactive data exploration and as part of a larger pipeline thanks to its highly optimized storage and indexing algorithms for variant data. | en_UK |
dc.identifier.citation | Kurowski TJ, Mohareb T. (2020) Tersect: a set theoretical utility for exploring sequence variant data. Bioinformatics, Volume 36, Issue 3, February 2020, pp. 934-935 | en_UK |
dc.identifier.cris | 24181533 | |
dc.identifier.issn | 1367-4803 | |
dc.identifier.uri | https://doi.org/10.1093/bioinformatics/btz634 | |
dc.identifier.uri | http://dspace.lib.cranfield.ac.uk/handle/1826/15315 | |
dc.language.iso | en | en_UK |
dc.publisher | Oxford University Press | en_UK |
dc.rights | Attribution-NonCommercial 4.0 International | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc/4.0/ | * |
dc.title | Tersect: a set theoretical utility for exploring sequence variant data | en_UK |
dc.type | Article | en_UK |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Tersect-exploring_sequence_variant_data-2020.pdf
- Size:
- 1.52 MB
- Format:
- Adobe Portable Document Format
- Description:
License bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- license.txt
- Size:
- 1.63 KB
- Format:
- Item-specific license agreed upon to submission
- Description: