Identification, organisation and visualisation of complete proteomes in UniProt throughout all taxonomic ranks :|barchaea, bacteria, eukatyote and virus

dc.contributor.advisorMorareb, Fady
dc.contributor.advisorJesus Martin, Maria
dc.contributor.authorStanley, Eleanor Juliet
dc.date.accessioned2012-07-27T10:35:12Z
dc.date.available2012-07-27T10:35:12Z
dc.date.issued2012-04
dc.description.abstractUsers of uniprot.org want to be able to query, retrieve and download proteome sets for an organism of their choice. They expect the data to be easily accessed, complete and up to date based on current available knowledge. UniProt release 2012_01 (25th Jan 2012) contains the proteomes of 2,923 organisms; 50% of which are bacteria, 38% viruses, 8% eukaryota and 4% archaea. Note that the term 'organism' is used in a broad sense to include subspecies, strains and isolates. Each completely sequenced organism is processed as an independent organism, hence the availability of 38 strain-specific proteomes Escherichia coli that are accessible for download. There is a project within UniProt dedicated to the mammoth task of maintaining the “Proteomes database”. This active resource is essential for UniProt to continually provide high quality proteome sets to the users. Accurate identification and incorporation of new, publically available, proteomes as well as the maintenance of existing proteomes permits sustained growth of the proteomes project. This is a huge, complicated and vital task accomplished by the activities of both curators and programmers. This thesis explains the data input and output of the proteomes database: the flow of genome project data from the nucleotide database into the proteomes database, then from each genome how a proteome is identified, augmented and made visible to uniprot.org users. Along this journey of discovery many issues arose, puzzles concerning data gathering, data integrity and also data visualisation. All were resolved and the outcome is a well-documented, actively maintained database that strives to provide optimal proteome information to its users.en_UK
dc.identifier.urihttp://dspace.lib.cranfield.ac.uk/handle/1826/7441
dc.language.isoenen_UK
dc.publisherCranfield Universityen_UK
dc.rights© Cranfield University, 2012. All rights reserved. No part of this publication may be reproduced without the written permission of the copyright owner.en_UK
dc.titleIdentification, organisation and visualisation of complete proteomes in UniProt throughout all taxonomic ranks :|barchaea, bacteria, eukatyote and virusen_UK
dc.typeThesis or dissertationen_UK
dc.type.qualificationlevelMastersen_UK
dc.type.qualificationnameMSc by Researchen_UK

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Eleanor_Juliet_Stanley_Thesis_2012.pdf
Size:
2.88 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.79 KB
Format:
Item-specific license agreed upon to submission
Description: