Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

To predict cryptic cleavage sites in proteins with non-canonical signal peptides

License

NotificationsYou must be signed in to change notification settings

russelllab/spc

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

22 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DOI

SPC as a quality control enzyme

Workflow


Introduction

We performed a computational analysis on the first 70 aa atthe N-t (N-terminal analysis) of thewhole proteome and on 70 aa windows starting from allinternal TMDs (Internal TMDs analysis) toidentifypotential SPC cleavage sites not associated with signal peptides (cryptic cleavage sites) byusing a pre-existing a signal peptide prediction softwareSignalP and comparing the output of twonetworks (no-TM andTM modes). Our hits represent membrane protein with type-II oriented TMDpredicted to be cleaved inno-TM network mode but not inTM network mode. Moreover, we rank the hits based on the numberof disease-linked mutations.

webApp

Access the Shiny webApphere


Computational method

We used the command-line version of theSignalP4.1 program. The program takes a proteins sequence (FASTA formatted) of 70 amino acids length as input and predicts thecleavage site and reports the Y-score (combined cleavage site score) in its output. To extractproteins with cryptic cleavage sites, we created two types of peptide sequences by considering the first 70 amino acids of all:

  • proteins that lack a canonical signal peptide (N-terminal analysis)
  • transmembrane domains of proteins that lack a canonical signal peptide (Internal TMDs analysis)

We predicted the Y-scores of both types of peptide sequences using the SignalP-TM (TM network mode) and SignalP-noTM (no-TM network mode) version (see Methods for details).A protein/transmembrane domain was considered to have a non-canonical cleavage site ifits corresponding peptide’s Y-score in the SignalP-TM output was less than 0.6 and in SignalP-noTM output was more than 0.5. We retrieved allthe proteins and their associated feature information fromUniProt/Swiss-Prot.We assigned mutational information provided in (a)UniProt,COSMIC, andClinVAR databases.


Validation experiments

Validation of the hits (proteins predicted to have non-canonical cleavage sites) were then performed by ectopically expressing the WT (Wild Type) and mutant constructs inWT HEK293T cells or lacking the regulatory SPC subunit SPCS1. Cell lysates were then analyzedvia Western blot to detect possible SPCS1-dependent cleavage fragments (see Methods for details).


Contents

N-terminal analysis

LocationDescription
pyScripts/fetch_candidates.pyScript to extract candidates from UniProt/SwissProt's human.dat.gz file and run modified SignalP4.1.
pyScripts/plot_scatter.pyScript to generate and plot the data.
data/part_a_fastadirectory with peptide FASTA sequences given as input to SignalP.
data/part_a_signalpoutput of SignalP peptide sequence prediction.
data/part_a_data.tsvoutput of SignalP and protein information in TSV format.

Internal TMDs analysis

LocationDescription
pyScripts/fetch_candidates_part_C.pyScript to extract candidates from UniProt/SwissProt's human.dat.gz file and run modified SignalP4.1.
pyScripts/plot_scatter_part_C.pyScript to generate and plot the data.
data/part_c_annotations.tsvAnnotations of TM domains in proteins.
data/part_c_fastadirectory with peptide FASTA sequences given as input to SignalP.
data/part_c_signalpoutput of SignalP peptide sequence prediction.
data/part_c_data.tsvoutput of SignalP and protein information in TSV format.

signalp-4.1

LocationDescription
signalp-4.1/signalpOriginal signalp-4.1 script.
signalp-4.1/signalp_TMCustomized signalp-4.1 script in which the TM_TRESHOLD is set to -1 so that it is always less than TMCount and thus forces the program to run in the TM network mode.
signalp-4.1/signalp_noTMCustomized signalp-4.1 script in which the TM_TRESHOLD is set to 100 so that it is always greater than TMCount and thus forces the program to run in the no-TM network mode.

Note: The information about the data downloaded from UniProt, COSMIC and ClinVar can be retrieved from Materials and Methods of the accompanying manuscript.

LocationDescription
data/server.RScript to run the Shiny webApp.
data/ui.RScript to run the Shiny webApp.
data/workflow*Workflow figure shown in the REAMDE.

Contacts

Marius Lemberg:m.lemberg@uni-koeln.de (Lemberg lab, Heidelberg/Cologne)

Matthias Feige:matthias.feige@tum.de (CPB lab, Munich)

Gurdeep Singh:gurdeep.singh@bioquant.uni-heidelberg.de (Russell lab, Heidelberg)

Cite

Zanotti A, Coelho JPL, Kaylani D, Singh G, Tauber M, Hitzenberger M, Avci D, Zacharias M, Russell RB, Lemberg MK, Feige MJThe human signal peptidase complex acts as a quality control enzyme for membrane proteins.Science (2022).doi: 10.1126/science.abo5672PMID:36454823


[8]ページ先頭

©2009-2025 Movatter.jp