Skip to main content

Main menu

  • Home
  • Content
    • Current Issue
    • Archive
    • Preview Papers
  • About
    • Editorial Board and Staff
    • About the Journal
    • Terms & Privacy
  • More
    • Alerts
    • Contact Us
  • Submit a Manuscript
    • Instructions for Authors
    • Submit a Manuscript
  • Other Publications
    • Plant Physiology
    • The Plant Cell
    • Plant Direct
    • The Arabidopsis Book
    • Teaching Tools in Plant Biology
    • ASPB
    • Plantae

User menu

  • My alerts
  • Log in

Search

  • Advanced search
Plant Cell
  • Other Publications
    • Plant Physiology
    • The Plant Cell
    • Plant Direct
    • The Arabidopsis Book
    • Teaching Tools in Plant Biology
    • ASPB
    • Plantae
  • My alerts
  • Log in
Plant Cell

Advanced Search

  • Home
  • Content
    • Current Issue
    • Archive
    • Preview Papers
  • About
    • Editorial Board and Staff
    • About the Journal
    • Terms & Privacy
  • More
    • Alerts
    • Contact Us
  • Submit a Manuscript
    • Instructions for Authors
    • Submit a Manuscript
  • Follow PlantCell on Twitter
  • Visit PlantCell on Facebook
  • Visit Plantae
Research ArticleLARGE-SCALE BIOLOGY ARTICLES
Open Access

Functional Network Construction in Arabidopsis Using Rule-Based Machine Learning on Large-Scale Data Sets

George W. Bassel, Enrico Glaab, Julietta Marquez, Michael J. Holdsworth, Jaume Bacardit
George W. Bassel
aDivision of Plant and Crop Sciences, University of Nottingham, Loughborough, Leicestershire LE12 5RD, United Kingdom
bCentre for Plant Integrative Biology, University of Nottingham, Loughborough, Leicestershire LE12 5RD, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • For correspondence: george.bassel@nottingham.ac.uk
Enrico Glaab
cSchool of Computer Science, University of Nottingham, Nottingham, Nottinghamshire NG8 1BB, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Julietta Marquez
aDivision of Plant and Crop Sciences, University of Nottingham, Loughborough, Leicestershire LE12 5RD, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Michael J. Holdsworth
aDivision of Plant and Crop Sciences, University of Nottingham, Loughborough, Leicestershire LE12 5RD, United Kingdom
bCentre for Plant Integrative Biology, University of Nottingham, Loughborough, Leicestershire LE12 5RD, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Jaume Bacardit
dASAP Research Group, School of Computer Science, Nottingham NG8 1BB, United Kingdom
eMultidisciplinary Centre for Integrative Biology, School of Biosciences, University of Nottingham, Sutton Bonington LE12 5RD, United Kingdom
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site

Published September 2011. DOI: https://doi.org/10.1105/tpc.111.088153

  • Article
  • Figures & Data
  • Info & Metrics
  • PDF
Loading

Article Figures & Data

Figures

  • Tables
  • Additional Files
  • Figure 1.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure 1.

    Generation of a Rule-Based ML Coprediction Network Based on Arabidopsis Seed Microarray Data.

    (A) An example rule and two example rule sets predicting the germination and nongermination developmental outcomes in Arabidopsis seeds. The example rule represents the first rule within the example germination rule set. Within each rule is an Arabidopsis gene identifier followed by the > operator followed by a number, representing a gene expression level.

    (B) Pipeline used to generate the coprediction functional gene network based on rules produced through rule-based ML. The associated software can be downloaded at www.vseed.nottingham.ac.uk.

  • Figure 2.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure 2.

    Properties and Topologies of SCoPNet and Comparison with SeedNet.

    (A) Organic network topology of SCoPNet. Node color is based on gene lists of significantly differentially regulated transcripts in nongeminating (SAM NG, red nodes) and germinating (SAM G, blue nodes) seeds. Gray nodes represent genes not statistically associated with either germination or nongermination. Node sizes in (A), (B), (C), and (E) correspond to node degree.

    (B) Distribution of nodes and edges appearing with an increased frequency in nongermination predicting rule sets within SCoPNet. Nodes with increasing nongermination node strength are colored with darker shades of red and edges representing an increasing frequency of co-occurrence between gene pairs in nongermination rule sets with a darker shade of blue.

    (C) Distribution of nodes and edges appearing with an increased frequency in germination predicting rule sets within SCoPNet. Nodes with increasing germination node strength are colored with darker shades of red and edges representing an increasing frequency of co-occurrence between gene pairs in germination rule sets with a darker shade of blue.

    (D) Plot of nongermination and germination node scores along a linear ordering of genes starting from the highest to lowest node score for each set of predictions. The highest 100 node scoring genes for each developmental state are plotted on the graph.

    (E) Distribution of nodes with the greatest degree within SCoPNet. The darker the shade of red, the higher the degree of the node.

    (F) Intersection between SCoPNet and the coexpression network SeedNet. Only clusters with at least two common edges between networks are shown. Red nodes are genes associated with the nongerminating state (SAM NG), blue nodes are associated with the germinating state (SAM G), and gray nodes are not associated with either state.

    (G) Distribution of the top 100 nongermination node and germination node scoring genes in the gene coexpression network SeedNet. Nongermination predicted nodes are colored red and germination predicted nodes blue.

  • Figure 3.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure 3.

    Significantly Represented GO Biological Process Categories within the Nongermination and Germination Domains of SCoPNet.

    (A) Significant GO categories within the nongermination domain of SCoPNet.

    (B) Significant GO categories within the germination domain of SCoPNet.

    A greater node size indicates more genes within a given GO category. Node color indicates the P value significance using the scale from yellow to orange in the bottom left of (A) and (B). A threshold of P < 0.05 was used to identify significant GO categories.

  • Figure 4.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure 4.

    Phenotypic Characterization of Newly Identified Regulators of Seed Germination.

    (A) asg5-1 and asg5-2 mutant seeds on increasing concentrations of the germination inhibiting hormone ABA relative to their wild-type equivalent Columbia-0.

    (B) asg5-1 and asg5-2 mutant seeds on increasing concentrations of the GA synthesis inhibiting compound PAC.

    (C) Same as (A) with asg6-1 mutant seeds.

    (D) Same as (B) with asg6-1 mutant seeds.

    (E) Same as (A) with asg7-1 mutant seeds.

    (F) Same as (B) with asg7-1 mutant seeds.

    (G) Same as (A) with asg8-1 mutant seeds.

    (H) Same as (B) with asg8-1 mutant seeds.

    All seeds were stratified at 4°C for 2 d, and graphs indicate the final percentage of following 7 d of incubation at 22°C.

    [See online article for color version of this figure.]

  • Figure 5.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure 5.

    Associations between Known and Newly Identified Regulators in the Rule-Based ML Network.

    (A) Associations between newly uncovered and previously identified regulators of seed developmental fate within the nongermination domain of SCoPNet. Nodes colored yellow are newly indentified regulators of seed germination, red nodes are classified by the SAM NG gene list (transcriptionally upregulated in nongerminating seeds), and gray nodes are genes whose transcripts are not significantly regulated by germination. Node size corresponds to degree and increasing edge thickness corresponds to increasing confidence for the predicted association based on point-wise mutual information.

    (B) Transcript abundance of ASG5, ASG6, and ASG7 in the abi3-4 mutant and the corresponding Landsberg erecta control seeds at 24 h after imbibition (Carrera et al., 2008).

    (C) Transcript abundance of ASG6 and ASG7 in GA-deficient ga1-3 mutant seeds in the absence and presence of exogenously applied GA (Ogawa et al., 2003).

    (D) eFP output indicating the transcript abundance of ASG6 in the embryo and endosperm of germinated and PAC-inhibited seeds (Penfield et al., 2006; Bassel et al., 2008).

    (E) Associations between previously identified and newly characterized regulators of seed developmental fate within the germination domain of SCoPNet. ASG8 is a newly identified regulator and colored yellow, SAM G (germination upregulated) genes are colored blue, and gray nodes indicate genes whose transcripts are not significantly regulated by germination. Node size corresponds to degree and increasing edge thickness corresponds to increasing confidence for the predicted association based on pointwise mutual information.

    (F) eFP output indicating the transcript abundance of ASG8 in the embryo and endosperm of PAC-inhibited and germinated seeds.

  • Figure 6.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure 6.

    Expression Patterns of Genes Connected in SCoPNet over a Time Course of Seed Germination.

    In each case relative transcript abundance during a time course of seed germination is indicated (Nakabayashi et al., 2005).

    (A) ABA3 and ABI4.

    (B) RGL3 and EIN3.

    (C) ASG5 and ASG7.

    (D) SAD1 and SOMNUS.

    (E) β-HYDROXYLASE1 and PYL4.

    (F) ABI3 and PYL9.

    (G) MYB33 and MYB101.

    (H) ABI3 and ABI4.

  • Figure 7.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure 7.

    Screenshot of the Online Network Query Tool Generated in This Study to Query SCoPNet.

    The seed germination regulatory gene RGL2 was queried using the gene name in the query box and is highlighted within the network view window. SCoPNet is available at http://www.vseed.nottingham.ac.uk/.

Tables

  • Figures
  • Additional Files
    • View popup
    Table 1.

    Predictability of Germination Outcome Based on Gene Expression for Various ML Methods

    MethodAccuracy
    BioHEL-germination93.5 ± 1.0
    BioHEL-nongermination92.4 ± 1.5
    Naïve Bayes88.0 ± 2.4
    C4.579.8 ± 3.6
    SVM82.4 ± 0.4
    • View popup
    Table 2.

    Previously Characterized Regulatory Genes with High Node Scores Occurring within Each of the Nongermination and Germination Rule Sets

    AGIAnnotationNode ScoreDegree
    Known Regulators in Nongermination RulesNongermination
     At2g28350ARF1020622
     At3g24220NCED61591
     At2g04240XERICO1128
     At3g62090PIL21066
     At5g07200Gibberellin 20-oxidase310412
     At1g33060ANAC01410019
     At1g03790SOMNUS8113
     At2g26300G Protein Alpha Subunit18014
     At1g30040AtGA2ox2803
     At3g45640AtMPK3767
     At3g24650ABI36814
     At1g09570PHY A6711
     At1g55255HUB25316
     At5g25900GA35314
     At4g25420GA55036
     At2g18790PHYB4876
     At1g50420SCL34772
     At1g01360PYL94667
    Known Regulators in Germination RulesGermination
     At2g46340SPA114129
     At5g11260HY57117
     At2g40220ABI46719
     At5g56860GNC4576
    • Regulatory genes displayed are present within the top 2.5% of node scores for each nongermination and germination. AGI, Arabidopsis Genome Initiative.

Additional Files

  • Figures
  • Tables
  • Supplemental Data

    Files in this Data Supplement:

    • Supplemental Figures and Tables
    • Supplemental Dataset 1
    • Supplemental Dataset 2
PreviousNext
Back to top

Table of Contents

Print
Download PDF
Email Article

Thank you for your interest in spreading the word on Plant Cell.

NOTE: We only request your email address so that the person you are recommending the page to knows that you wanted them to see it, and that it is not junk mail. We do not capture any email address.

Enter multiple addresses on separate lines or separate them with commas.
Functional Network Construction in Arabidopsis Using Rule-Based Machine Learning on Large-Scale Data Sets
(Your Name) has sent you a message from Plant Cell
(Your Name) thought you would like to see the Plant Cell web site.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Citation Tools
Functional Network Construction in Arabidopsis Using Rule-Based Machine Learning on Large-Scale Data Sets
George W. Bassel, Enrico Glaab, Julietta Marquez, Michael J. Holdsworth, Jaume Bacardit
The Plant Cell Sep 2011, 23 (9) 3101-3116; DOI: 10.1105/tpc.111.088153

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
Request Permissions
Share
Functional Network Construction in Arabidopsis Using Rule-Based Machine Learning on Large-Scale Data Sets
George W. Bassel, Enrico Glaab, Julietta Marquez, Michael J. Holdsworth, Jaume Bacardit
The Plant Cell Sep 2011, 23 (9) 3101-3116; DOI: 10.1105/tpc.111.088153
del.icio.us logo Digg logo Reddit logo Twitter logo CiteULike logo Facebook logo Google logo Mendeley logo
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Jump to section

  • Article
    • Abstract
    • INTRODUCTION
    • RESULTS AND DISCUSSION
    • METHODS
    • Acknowledgments
    • AUTHOR CONTRIBUTIONS
    • Footnotes
    • References
  • Figures & Data
  • Info & Metrics
  • PDF

In this issue

The Plant Cell Online: 23 (9)
The Plant Cell
Vol. 23, Issue 9
Sep 2011
  • Table of Contents
  • Table of Contents (PDF)
  • About the Cover
  • Index by author
  • Advertising (PDF)
  • Front Matter (PDF)
View this article with LENS

More in this TOC Section

  • Alternative Crassulacean Acid Metabolism Modes Provide Environment-Specific Water-Saving Benefits in a Leaf Metabolic Model
  • ARADEEPOPSIS, an Automated Workflow for Top-View Plant Phenomics using Semantic Segmentation of Leaf States
  • Nonsense-Mediated RNA Decay Factor UPF1 Is Critical for Posttranscriptional and Translational Gene Regulation in Arabidopsis
Show more LARGE-SCALE BIOLOGY ARTICLES

Similar Articles

Our Content

  • Home
  • Current Issue
  • Plant Cell Preview
  • Archive
  • Teaching Tools in Plant Biology
  • Plant Physiology
  • Plant Direct
  • Plantae
  • ASPB

For Authors

  • Instructions
  • Submit a Manuscript
  • Editorial Board and Staff
  • Policies
  • Recognizing our Authors

For Reviewers

  • Instructions
  • Peer Review Reports
  • Journal Miles
  • Transfer of reviews to Plant Direct
  • Policies

Other Services

  • Permissions
  • Librarian resources
  • Advertise in our journals
  • Alerts
  • RSS Feeds
  • Contact Us

Copyright © 2021 by The American Society of Plant Biologists

Powered by HighWire