Using representative gene sets to validate gene models in legume annotations (Fabaceae)

Cassandria Tay Fernandez

Research output: Thesis › Doctoral Thesis


Abstract

As more genomes are assembled, accurate annotation tools become increasingly important. Here, we describe a method to support gene model prediction based on evolutionary conservation and use it to identify potentially erroneous annotations across 12 legume species. We first compared the gene models of each genome with the whole-genome sequences of the other assemblies and with the NCBI-NR database to identify potentially unique gene models. We then removed non-conserved gene models and applied a graph-based approach to identify a representative set. This yielded 15,345 representative gene models from the 12 legume assemblies, which can support genome annotation in other legumes.
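The abstract does not say which graph-based approach was used, so the following is only an illustrative sketch under stated assumptions, not the thesis method. It assumes that conservation has already been decided for each gene model (for example from sequence search hits), that pairwise similarity between gene models is given as a list of ID pairs, and that one representative per connected component is chosen by longest protein length. The function name `representative_set` and all inputs are hypothetical.

```python
from collections import defaultdict


def representative_set(genes, similar_pairs, conserved):
    """Pick one representative gene model per similarity cluster.

    genes:         dict mapping gene ID -> protein length (hypothetical input)
    similar_pairs: iterable of (gene_a, gene_b) pairs judged similar
    conserved:     set of gene IDs with support in other genomes or databases

    Assumption: clusters are connected components of the similarity graph,
    and the longest member is taken as the representative.
    """
    # Step 1: drop non-conserved gene models.
    kept = {g for g in genes if g in conserved}

    # Step 2: build an undirected similarity graph over the kept gene models.
    adjacency = defaultdict(set)
    for a, b in similar_pairs:
        if a in kept and b in kept:
            adjacency[a].add(b)
            adjacency[b].add(a)

    # Step 3: walk each connected component and keep its longest member.
    seen = set()
    representatives = []
    for start in sorted(kept):
        if start in seen:
            continue
        seen.add(start)
        stack = [start]
        component = []
        while stack:
            node = stack.pop()
            component.append(node)
            for neighbour in adjacency[node]:
                if neighbour not in seen:
                    seen.add(neighbour)
                    stack.append(neighbour)
        # Ties on length are broken by gene ID so the result is deterministic.
        representatives.append(max(component, key=lambda g: (genes[g], g)))
    return sorted(representatives)


if __name__ == "__main__":
    # Toy data: a3 has no conservation support and is removed before clustering.
    genes = {"a1": 300, "b1": 320, "c1": 310, "a2": 150, "b2": 150, "a3": 200}
    pairs = [("a1", "b1"), ("b1", "c1"), ("a2", "b2"), ("a3", "b2")]
    conserved = {"a1", "b1", "c1", "a2", "b2"}
    print(representative_set(genes, pairs, conserved))
```

Filtering before clustering means a non-conserved gene model cannot link two otherwise separate clusters. Real pipelines often use weighted edges and a dedicated clustering algorithm rather than plain connected components.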
Original language: English
Qualification: Doctor of Philosophy
Awarding Institution
  • The University of Western Australia
Supervisors/Advisors
  • Ranathunge, Kosala, Supervisor
  • Bayer, Philipp, Supervisor
  • Edwards, Dave, Supervisor
  • Batley, Jacqueline, Supervisor
Thesis sponsors
Award date: 7 Jun 2023
DOIs
Publication status: Unpublished - 2023
