Skip to main content

Main menu

  • Home
  • Content
    • Current Issue
    • Archive
    • Preview Papers
  • About
    • Editorial Board and Staff
    • About the Journal
    • Terms & Privacy
  • More
    • Alerts
    • Contact Us
  • Submit a Manuscript
    • Instructions for Authors
    • Submit a Manuscript
  • Other Publications
    • Plant Physiology
    • The Plant Cell
    • Plant Direct
    • The Arabidopsis Book
    • Teaching Tools in Plant Biology
    • ASPB
    • Plantae

User menu

  • My alerts
  • Log in

Search

  • Advanced search
Plant Cell
  • Other Publications
    • Plant Physiology
    • The Plant Cell
    • Plant Direct
    • The Arabidopsis Book
    • Teaching Tools in Plant Biology
    • ASPB
    • Plantae
  • My alerts
  • Log in
Plant Cell

Advanced Search

  • Home
  • Content
    • Current Issue
    • Archive
    • Preview Papers
  • About
    • Editorial Board and Staff
    • About the Journal
    • Terms & Privacy
  • More
    • Alerts
    • Contact Us
  • Submit a Manuscript
    • Instructions for Authors
    • Submit a Manuscript
  • Follow PlantCell on Twitter
  • Visit PlantCell on Facebook
  • Visit Plantae
Article CommentaryCOMMENTARY
Open Access

Are We There Yet? Reliably Estimating the Completeness of Plant Genome Sequences

Elisabeth Veeckman, Tom Ruttink, Klaas Vandepoele
Elisabeth Veeckman
aInstitute for Agricultural and Fisheries Research, Plant Sciences Unit, Growth and Development, B-9090 Melle, Belgium
bBioinformatics Institute Ghent, Ghent University, B-9052 Ghent, Belgium
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Elisabeth Veeckman
Tom Ruttink
aInstitute for Agricultural and Fisheries Research, Plant Sciences Unit, Growth and Development, B-9090 Melle, Belgium
bBioinformatics Institute Ghent, Ghent University, B-9052 Ghent, Belgium
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Tom Ruttink
Klaas Vandepoele
bBioinformatics Institute Ghent, Ghent University, B-9052 Ghent, Belgium
cDepartment of Plant Systems Biology, VIB, Technologiepark 927, B-9052 Ghent, Belgium
dDepartment of Plant Biotechnology and Bioinformatics, Ghent University, B-9052 Ghent, Belgium
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • ORCID record for Klaas Vandepoele
  • For correspondence: klaas.vandepoele@psb.vib-ugent.be

Published August 2016. DOI: https://doi.org/10.1105/tpc.16.00349

  • Article
  • Figures & Data
  • Info & Metrics
  • PDF
Loading
  • © 2016 American Society of Plant Biologists. All rights reserved.

Abstract

Genome sequencing is becoming cheaper and faster thanks to the introduction of next-generation sequencing techniques. Dozens of new plant genome sequences have been released in recent years, ranging from small to gigantic repeat-rich or polyploid genomes. Most genome projects have a dual purpose: delivering a contiguous, complete genome assembly and creating a full catalog of correctly predicted genes. Frequently, the completeness of a species’ gene catalog is measured using a set of marker genes that are expected to be present. This expectation can be defined along an evolutionary gradient, ranging from highly conserved genes to species-specific genes. Large-scale population resequencing studies have revealed that gene space is fairly variable even between closely related individuals, which limits the definition of the expected gene space, and, consequently, the accuracy of estimates used to assess genome and gene space completeness. We argue that, based on the desired applications of a genome sequencing project, different completeness scores for the genome assembly and/or gene space should be determined. Using examples from several dicot and monocot genomes, we outline some pitfalls and recommendations regarding methods to estimate completeness during different steps of genome assembly and annotation.

  • Received May 3, 2016.
  • Revised July 13, 2016.
  • Accepted August 9, 2016.
  • Published August 10, 2016.

View Full Text
PreviousNext
Back to top

Table of Contents

Print
Download PDF
Email Article

Thank you for your interest in spreading the word on Plant Cell.

NOTE: We only request your email address so that the person you are recommending the page to knows that you wanted them to see it, and that it is not junk mail. We do not capture any email address.

Enter multiple addresses on separate lines or separate them with commas.
Are We There Yet? Reliably Estimating the Completeness of Plant Genome Sequences
(Your Name) has sent you a message from Plant Cell
(Your Name) thought you would like to see the Plant Cell web site.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Citation Tools
Are We There Yet? Reliably Estimating the Completeness of Plant Genome Sequences
Elisabeth Veeckman, Tom Ruttink, Klaas Vandepoele
The Plant Cell Aug 2016, 28 (8) 1759-1768; DOI: 10.1105/tpc.16.00349

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
Request Permissions
Share
Are We There Yet? Reliably Estimating the Completeness of Plant Genome Sequences
Elisabeth Veeckman, Tom Ruttink, Klaas Vandepoele
The Plant Cell Aug 2016, 28 (8) 1759-1768; DOI: 10.1105/tpc.16.00349
del.icio.us logo Digg logo Reddit logo Twitter logo CiteULike logo Facebook logo Google logo Mendeley logo
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Jump to section

  • Article
    • Abstract
    • INTRODUCTION
    • DEFINING THE EXPECTED GENOME SIZE AND GENE SPACE
    • ESTIMATING THE COMPLETENESS OF A GENOME ASSEMBLY
    • ESTIMATING THE COMPLETENESS OF THE ANNOTATED GENE SPACE
    • DEFINING THE EXPECTED GENE SPACE ON A GLIDING EVOLUTIONARY SCALE
    • INFLUENCE OF TRANSCRIPT MAPPING PARAMETERS ON GENE SPACE COMPLETENESS
    • COMPARISON OF FOUR GENE SPACE COMPLETENESS METHODS
    • CONCLUSIONS AND GUIDELINES
    • Acknowledgments
    • AUTHOR CONTRIBUTIONS
    • Footnotes
    • References
  • Figures & Data
  • Info & Metrics
  • PDF

In this issue

The Plant Cell: 28 (8)
The Plant Cell
Vol. 28, Issue 8
Aug 2016
  • Table of Contents
  • Table of Contents (PDF)
  • Cover (PDF)
  • About the Cover
  • Index by author
View this article with LENS

More in this TOC Section

  • Revisiting Criteria for Plant MicroRNA Annotation in the Era of Big Data
  • Is It Ordered Correctly? Validating Genome Assemblies by Optical Mapping
  • Widespread Contamination of Arabidopsis Embryo and Endosperm Transcriptome Data Sets
Show more COMMENTARY

Similar Articles

Our Content

  • Home
  • Current Issue
  • Plant Cell Preview
  • Archive
  • Teaching Tools in Plant Biology
  • Plant Physiology
  • Plant Direct
  • Plantae
  • ASPB

For Authors

  • Instructions
  • Submit a Manuscript
  • Editorial Board and Staff
  • Policies
  • Recognizing our Authors

For Reviewers

  • Instructions
  • Peer Review Reports
  • Journal Miles
  • Transfer of reviews to Plant Direct
  • Policies

Other Services

  • Permissions
  • Librarian resources
  • Advertise in our journals
  • Alerts
  • RSS Feeds
  • Contact Us

Copyright © 2021 by The American Society of Plant Biologists

Powered by HighWire