Developing novel bioinformatics tools and pipelines for working with reference genomes and large sets of resequenced genomes.
dc.contributor.advisor | Mohareb, Fady R. | |
dc.contributor.author | Kurowski, Tomasz Janusz | |
dc.date.accessioned | 2024-04-30T11:23:57Z | |
dc.date.available | 2024-04-30T11:23:57Z | |
dc.date.issued | 2022-01 | |
dc.description.abstract | Both reference genomes assembled for individual species and large, publicly maintained sets of resequenced genomes are of immense value to researchers. The former represent important milestones for research involving the species of interest and serve as ostensibly static points of reference for other data, while the latter serve as catalogues of genetic variation, enabling researchers to place their own data in a wider context. However, maintaining sets of resequenced genomes, and ensuring their integrity as they are updated to match new releases of their reference genome, poses computational challenges, as does manipulating and comparing such large sets of genomes in general. This work reports on the detection and correction of significant errors that were introduced into resequenced tomato data in the course of updating them to a new version of the reference genome. It also introduces Tersect, a low-level utility optimised for manipulating and comparing large sets of resequenced genomic data, and Tersect Browser, a Web application that couples the high performance of Tersect with a higher-level indexing and precomputation scheme to allow interactive comparison of large sets of resequenced genomes. This gives biologists a tool capable of generating visualisations of genetic distance and phylogenetic relationships based on whole-genome sequence data from hundreds of genomes in seconds rather than hours. | en_UK |
dc.description.coursename | PhD in Environment and Agrifood | en_UK |
dc.identifier.uri | https://dspace.lib.cranfield.ac.uk/handle/1826/21286 | |
dc.language.iso | en_UK | en_UK |
dc.publisher | Cranfield University | en_UK |
dc.publisher.department | SWEE | en_UK |
dc.rights | © Cranfield University, 2022. All rights reserved. No part of this publication may be reproduced without the written permission of the copyright holder. | en_UK |
dc.subject | Comparative genomics | en_UK |
dc.subject | genotyping | en_UK |
dc.subject | SNP | en_UK |
dc.subject | SNV | en_UK |
dc.subject | Variant Call Format | en_UK |
dc.subject | Introgression | en_UK |
dc.subject | Tomato | en_UK |
dc.title | Developing novel bioinformatics tools and pipelines for working with reference genomes and large sets of resequenced genomes. | en_UK |
dc.type | Thesis or dissertation | en_UK |
dc.type.qualificationlevel | Doctoral | en_UK |
dc.type.qualificationname | PhD | en_UK |