High-throughput profiling of sequence recognition by tyrosine kinases and SH2 domains using bacterial peptide display

  1. Allyson Li
  2. Rashmi Voleti
  3. Minhee Lee
  4. Dejan Gagoski
  5. Neel H Shah (corresponding author)
  1. Columbia University, United States

Abstract

Tyrosine kinases and SH2 (phosphotyrosine recognition) domains have binding specificities that depend on the amino acid sequence surrounding the target (phospho)tyrosine residue. Although the preferred recognition motifs of many kinases and SH2 domains are known, we lack a quantitative description of sequence specificity that could guide predictions about signaling pathways or be used to design sequences for biomedical applications. Here, we present a platform that combines genetically-encoded peptide libraries and deep sequencing to profile sequence recognition by tyrosine kinases and SH2 domains. We screened several tyrosine kinases against a million-peptide random library and used the resulting profiles to design high-activity sequences. We also screened several kinases against a library containing thousands of human proteome-derived peptides and their naturally-occurring variants. These screens recapitulated independently measured phosphorylation rates and revealed hundreds of phosphosite-proximal mutations that impact phosphosite recognition by tyrosine kinases. We extended this platform to the analysis of SH2 domains and showed that screens could predict relative binding affinities. Finally, we expanded our method to assess the impact of non-canonical and post-translationally modified amino acids on sequence recognition. This specificity profiling platform will shed new light on phosphotyrosine signaling and could readily be adapted to other protein modification/recognition domains.

Data availability

All of the processed data from the high-throughput specificity screens are provided as source data files. The raw fastq and fasta sequencing files are available as a Dryad repository (DOI: 10.5061/dryad.0zpc86727). Custom code used to process/analyze screening data can be found in a GitHub repository, as specified in the manuscript.

Article and author information

Author details

  1. Allyson Li

    Department of Chemistry, Columbia University, New York, United States
    Competing interests
    The authors declare that no competing interests exist.
    ORCID: 0000-0003-2359-7703
  2. Rashmi Voleti

    Department of Chemistry, Columbia University, New York, United States
    Competing interests
    The authors declare that no competing interests exist.
  3. Minhee Lee

    Department of Chemistry, Columbia University, New York, United States
    Competing interests
    The authors declare that no competing interests exist.
  4. Dejan Gagoski

    Department of Chemistry, Columbia University, New York, United States
    Competing interests
    The authors declare that no competing interests exist.
  5. Neel H Shah

    Department of Chemistry, Columbia University, New York, United States
    For correspondence
    neel.shah@columbia.edu
    Competing interests
    The authors declare that no competing interests exist.
    ORCID: 0000-0002-1186-0626

Funding

National Institute of General Medical Sciences (R35GM138014)

  • Neel H Shah

Damon Runyon Cancer Research Foundation (DFS 31-18)

  • Neel H Shah

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Reviewing Editor

  1. Tony Hunter, Salk Institute for Biological Studies, United States

Version history

  1. Preprint posted: August 1, 2022
  2. Received: August 1, 2022
  3. Accepted: March 15, 2023
  4. Accepted Manuscript published: March 16, 2023 (version 1)
  5. Version of Record published: March 31, 2023 (version 2)

Copyright

© 2023, Li et al.

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 1,845 views
  • 229 downloads
  • 6 citations

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Cite this article

Allyson Li, Rashmi Voleti, Minhee Lee, Dejan Gagoski, Neel H Shah (2023) High-throughput profiling of sequence recognition by tyrosine kinases and SH2 domains using bacterial peptide display. eLife 12:e82345. https://doi.org/10.7554/eLife.82345
