Top-down machine learning approach for high-throughput single-molecule analysis

  1. David S White
  2. Marcel P Goldschen-Ohm
  3. Randall H Goldsmith  Is a corresponding author
  4. Baron Chanda  Is a corresponding author
  1. University of Wisconsin-Madison, United States
  2. University of Texas at Austin, United States

Abstract

Single-molecule approaches provide enormous insight into the dynamics of biomolecules, but adequately sampling distributions of states and events often requires extensive sampling. Although emerging experimental techniques can generate such large datasets, existing analysis tools are not suitable to process the large volume of data obtained in high-throughput paradigms. Here, we present a new analysis platform (DISC) that accelerates unsupervised analysis of single-molecule trajectories. By merging model-free statistical learning with the Viterbi algorithm, DISC idealizes single-molecule trajectories up to three orders of magnitude faster with improved accuracy compared to other commonly used algorithms. Further, we demonstrate the utility of DISC algorithm to probe cooperativity between multiple binding events in the cyclic nucleotide binding domains of HCN pacemaker channel. Given the flexible and efficient nature of DISC, we anticipate it will be a powerful tool for unsupervised processing of high-throughput data across a range of single-molecule experiments.

Data availability

Simulated and raw data in addition to analysis scripts are available at https://zenodo.org/record/3727917#.Xn0Fw9NKjq0DOI: 10.5281/zenodo.3727917

The following data sets were generated

Article and author information

Author details

  1. David S White

    Neuroscience, University of Wisconsin-Madison, Madison, United States
    Competing interests
    No competing interests declared.
  2. Marcel P Goldschen-Ohm

    Neuroscience, University of Texas at Austin, Austin, United States
    Competing interests
    No competing interests declared.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0003-1466-9808
  3. Randall H Goldsmith

    Chemistry, University of Wisconsin-Madison, Madison, United States
    For correspondence
    rhg@chem.wisc.edu
    Competing interests
    No competing interests declared.
  4. Baron Chanda

    Department of Neuroscience, University of Wisconsin-Madison, Madison, United States
    For correspondence
    chanda@wisc.edu
    Competing interests
    Baron Chanda, Reviewing editor, eLife.
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0003-4954-7034

Funding

National Institute of Neurological Disorders and Stroke (NS-101723)

  • Baron Chanda

National Institute of Neurological Disorders and Stroke (NS-081320)

  • Baron Chanda

National Institute of Neurological Disorders and Stroke (NS-081293)

  • Baron Chanda

National Institute of General Medical Sciences (GM007507)

  • David S White

National Institute of General Medical Sciences (GM127957)

  • Randall H Goldsmith

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Reviewing Editor

  1. Leon D Islas, Universidad Nacional Autónoma de México, Mexico

Version history

  1. Received: November 6, 2019
  2. Accepted: April 8, 2020
  3. Accepted Manuscript published: April 8, 2020 (version 1)
  4. Version of Record published: May 7, 2020 (version 2)

Copyright

© 2020, White et al.

This article is distributed under the terms of the Creative Commons Attribution License permitting unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 3,858
    views
  • 497
    downloads
  • 30
    citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. David S White
  2. Marcel P Goldschen-Ohm
  3. Randall H Goldsmith
  4. Baron Chanda
(2020)
Top-down machine learning approach for high-throughput single-molecule analysis
eLife 9:e53357.
https://doi.org/10.7554/eLife.53357

Share this article

https://doi.org/10.7554/eLife.53357

Further reading

    1. Biochemistry and Chemical Biology
    2. Structural Biology and Molecular Biophysics
    Damien M Rasmussen, Manny M Semonis ... Nicholas M Levinson
    Research Article

    The type II class of RAF inhibitors currently in clinical trials paradoxically activate BRAF at subsaturating concentrations. Activation is mediated by induction of BRAF dimers, but why activation rather than inhibition occurs remains unclear. Using biophysical methods tracking BRAF dimerization and conformation, we built an allosteric model of inhibitor-induced dimerization that resolves the allosteric contributions of inhibitor binding to the two active sites of the dimer, revealing key differences between type I and type II RAF inhibitors. For type II inhibitors the allosteric coupling between inhibitor binding and BRAF dimerization is distributed asymmetrically across the two dimer binding sites, with binding to the first site dominating the allostery. This asymmetry results in efficient and selective induction of dimers with one inhibited and one catalytically active subunit. Our allosteric models quantitatively account for paradoxical activation data measured for 11 RAF inhibitors. Unlike type II inhibitors, type I inhibitors lack allosteric asymmetry and do not activate BRAF homodimers. Finally, NMR data reveal that BRAF homodimers are dynamically asymmetric with only one of the subunits locked in the active αC-in state. This provides a structural mechanism for how binding of only a single αC-in inhibitor molecule can induce potent BRAF dimerization and activation.

    1. Structural Biology and Molecular Biophysics
    Nicholas James Ose, Paul Campitelli ... Sefika Banu Ozkan
    Research Article

    We integrate evolutionary predictions based on the neutral theory of molecular evolution with protein dynamics to generate mechanistic insight into the molecular adaptations of the SARS-COV-2 spike (S) protein. With this approach, we first identified candidate adaptive polymorphisms (CAPs) of the SARS-CoV-2 S protein and assessed the impact of these CAPs through dynamics analysis. Not only have we found that CAPs frequently overlap with well-known functional sites, but also, using several different dynamics-based metrics, we reveal the critical allosteric interplay between SARS-CoV-2 CAPs and the S protein binding sites with the human ACE2 (hACE2) protein. CAPs interact far differently with the hACE2 binding site residues in the open conformation of the S protein compared to the closed form. In particular, the CAP sites control the dynamics of binding residues in the open state, suggesting an allosteric control of hACE2 binding. We also explored the characteristic mutations of different SARS-CoV-2 strains to find dynamic hallmarks and potential effects of future mutations. Our analyses reveal that Delta strain-specific variants have non-additive (i.e., epistatic) interactions with CAP sites, whereas the less pathogenic Omicron strains have mostly additive mutations. Finally, our dynamics-based analysis suggests that the novel mutations observed in the Omicron strain epistatically interact with the CAP sites to help escape antibody binding.