A data management system for structural genomics

S. Raymond, Nicholas O'Toole, M. Cygler

Research output: Contribution to journal › Article

3 Citations (Scopus)

Abstract

Background: Structural genomics (SG) projects aim to determine thousands of protein structures by the development of high-throughput techniques for all steps of the experimental structure determination pipeline. Crucial to the success of such endeavours is the careful tracking and archiving of experimental and external data on protein targets.

Results: We have developed a sophisticated data management system for structural genomics. Central to the system is an Oracle-based, SQL-interfaced database. The database schema deals with all facets of the structure determination process, from target selection to data deposition. Users access the database via any web browser. Experimental data is input by users with pre-defined web forms. Data can be displayed according to numerous criteria. A list of all current target proteins can be viewed, with links for each target to associated entries in external databases. To avoid unnecessary work on targets, our data management system matches protein sequences weekly using BLAST to entries in the Protein Data Bank and to targets of other SG centers worldwide.

Conclusion: Our system is a working, effective and user-friendly data management tool for structural genomics projects. In this report we present a detailed summary of the various capabilities of the system, using real target data as examples, and indicate our plans for future enhancements.
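
As an illustration only, the weekly sequence-matching step described in the abstract could be sketched as follows. This is not the authors' implementation: it uses an in-memory SQLite database as a stand-in for the Oracle database, and the table names, column names, and identity/e-value thresholds are all hypothetical. It assumes BLAST results are already available in the standard 12-column tab-separated tabular format (query, subject, percent identity, ..., e-value, bit score) and does not run BLAST itself.

```python
import sqlite3

# Hypothetical two-table schema; the system described in the abstract uses
# Oracle and covers far more of the structure determination pipeline.
SCHEMA = """
CREATE TABLE targets (
    target_id TEXT PRIMARY KEY,
    status    TEXT NOT NULL DEFAULT 'selected'
);
CREATE TABLE pdb_hits (
    target_id        TEXT NOT NULL REFERENCES targets(target_id),
    subject_id       TEXT NOT NULL,
    percent_identity REAL NOT NULL,
    evalue           REAL NOT NULL
);
"""


def make_db():
    """Create an in-memory database with the illustrative schema."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    return conn


def parse_blast_tabular(text):
    """Yield (query, subject, percent identity, e-value) per tabular hit line."""
    for line in text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and comment lines
        fields = line.split("\t")
        yield fields[0], fields[1], float(fields[2]), float(fields[10])


def record_hits(conn, blast_text, min_identity=90.0, max_evalue=1e-10):
    """Store hits passing the (assumed) thresholds; return how many were stored."""
    rows = [
        hit for hit in parse_blast_tabular(blast_text)
        if hit[2] >= min_identity and hit[3] <= max_evalue
    ]
    conn.executemany("INSERT INTO pdb_hits VALUES (?, ?, ?, ?)", rows)
    conn.commit()
    return len(rows)


def targets_with_hits(conn):
    """List targets that already have a close match, sorted by identifier."""
    cur = conn.execute(
        "SELECT DISTINCT target_id FROM pdb_hits ORDER BY target_id"
    )
    return [row[0] for row in cur]
```

In a sketch like this, a scheduled weekly job would call `record_hits` on fresh BLAST output and then use `targets_with_hits` to flag targets whose structures may already have been solved elsewhere.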
Original language: English
Pages (from-to): online - approx 5-20pp
Journal: Proteome Science
Volume: 2
Issue number: 4
DOIs: https://doi.org/10.1186/1477-5956-2-4
Publication status: Published - 2004

Cite this

Raymond, S., O'Toole, N., & Cygler, M. (2004). A data management system for structural genomics. Proteome Science, 2(4), online - approx 5-20pp. https://doi.org/10.1186/1477-5956-2-4