The Eukaryotic Linear Motif resource for
Functional Sites in Proteins
Accession:
Functional site class:
N-degron
Functional site description:
The N-end rule pathway regulates protein stability by targeting proteins for ubiquitin-dependent proteasomal degradation. Polyubiquitylation of N-end rule substrates depends on their recognition by N-recognins, specific E3 ubiquitin ligases that use their conserved UBR-box and N-box domains to bind specific N-terminal protein motifs, called N-degrons, in their target proteins. N-degrons are defined by a destabilizing N-terminal residue. Type I destabilizing residues can either occur as primary destabilizing residues, which are positively charged amino acids directly recognized by N-recognins, or as secondary and tertiary destabilizing amino acids, which can be conjugated to a primary destabilizing residue. N-degrons containing type I destabilizing residues are specifically bound by the UBR-box of N-recognins. In contrast, type II destabilizing residues, which comprise bulky hydrophobic amino acids, initiate protein degradation by binding to the N-box of N-recognins.
ELMs with same func. site: DEG_Nend_Nbox_1  DEG_Nend_UBRbox_1  DEG_Nend_UBRbox_2  DEG_Nend_UBRbox_3  DEG_Nend_UBRbox_4 
ELM Description:
This class of N-degrons is defined by a type I tertiary destabilizing Cys residue in the N-terminal position that must be oxidized and subsequently arginylated before it can be recognized by the UBR-box of N-recognins (Varshavsky,2011). This class of N-degrons is absent in Fungi. Cys-containing pre-N-degrons can be generated by excision of the N-terminal Met of nascent proteins or by internal cleavage of a protein. Excision of the N-terminal Met is catalyzed by N-terminal Met-aminopeptidases, which cleave only when the residue following Met is small. Currently, Cys is the only destabilizing residue known to be generated by Met excision (Xiao,2010; Varshavsky,2011). Note that the ELM prediction tool will only return internal N-degrons if the sequence of the cleavage product is entered for analysis.
Once the tertiary destabilizing Cys residue is exposed at the N-terminus of the protein, it is non-enzymatically oxidized to Cys sulfinic acid or Cys sulfonic acid in the presence of oxygen and nitric oxide. Hence, this subset of the N-end rule pathway might act as a sensor for oxygen and nitric oxide (Hu,2005; Gibbs,2011). Oxidized Cys, a structural Asp mimic, is recognized by ATE1-encoded arginyl transferases (R-transferases) that arginylate the protein. In Mammals, six isoforms of R-transferase have been detected, differing in cellular location, tissue distribution and activity (Tasaki,2007); plants encode two (ATE1 and ATE2) (Sriram,2011). Conjugation of the primary destabilizing Arg creates a functional N-degron that is recognized by the UBR-box of N-recognins (Varshavsky,2011). The UBR-box is a highly conserved region whose tertiary structure is stabilized by two zinc fingers, which form a negatively charged binding pocket that rigidly binds the positively charged N-terminal residue. In addition, the UBR-box forms interactions with the free alpha amino group, the side chain of the second residue and the backbone of the first three residues (Choi,2010; Tasaki,2012).
Pattern: ^M{0,1}(C).
Pattern Probability: 0.0000177
Present in taxon: Eukaryota
Not represented in taxon: Fungi
Interaction Domain:
zf-UBR (PF02207) Putative zinc finger in N-recognin (UBR box) (Stoichiometry: 1 : 1)
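As a rough illustration of how the pattern listed above can be applied to a sequence, the following Python sketch reports the match as 1-based start and end positions. The function name `find_cys_n_degron` is hypothetical, and the sketch assumes that Python's `re` module reads this simple expression the same way the ELM resource does; it is not the ELM prediction tool itself.

```python
import re

# Pattern copied from this entry. The trailing "." is part of the expression
# and matches the residue that follows the N-terminal Cys.
PATTERN = re.compile(r"^M{0,1}(C).")


def find_cys_n_degron(sequence):
    """Return (start, end) as 1-based inclusive positions, or None if no match."""
    match = PATTERN.search(sequence)
    if match is None:
        return None
    # match.start() is 0-based; match.end() is exclusive, so it already equals
    # the 1-based inclusive end position.
    return match.start() + 1, match.end()


if __name__ == "__main__":
    # N-terminal subsequence of HRE1 (F4IDA7), taken from the instances of this class.
    print(find_cys_n_degron("MCGGAVISDYIAPEKIARSS"))
    # Hypothetical sequences: Met already excised, and no Cys after Met.
    print(find_cys_n_degron("CKGLAALPHS"))
    print(find_cys_n_degron("MAKGLAALPHS"))
```

Because the pattern is anchored at the first residue, an internal N-degron is only found when the sequence of the cleavage product, rather than the full-length protein, is passed in.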
Abstract:
The half-life of proteins, which can vary from a few seconds to several days and even years (Varshavsky,2011), is determined by regulated proteolytic degradation. A degradation system common to Eukaryotes is the ubiquitin proteasome system (UPS). Proteins exhibiting degradation signals are recognized and polyubiquitylated by ubiquitin ligases, leading to their subsequent proteasomal degradation. A subset of these degradation signals can be found at the N-terminus of proteins or at the neo-N-terminus of protein cleavage products. These N-terminal motifs, called N-degrons, are recognized by E3 ubiquitin ligases known as N-recognins, which mark their substrates for proteasomal degradation by polyubiquitylation of Lys residues (Bachmair,1986). Studies suggest that this highly conserved N-end rule pathway is the most frequent way of controlled protein degradation, being involved in regulation of G-protein signalling, control of peptide import, regulation of apoptosis, fidelity of chromosome segregation and maintenance of amino acid pools during starvation of cells (Hu,2005).
Currently, 12 destabilizing N-terminal residues are known and classified into two groups of N-degrons according to their physicochemical properties. Type I destabilizing residues comprise positively charged Arg and Lys, whereas type II residues are amino acids with large hydrophobic side chains, including Tyr, Trp, Phe, Leu and Ile (Tasaki,2012). These primary destabilizing residues are directly recognized by N-recognins. In addition, the N-end rule pathway also contains secondary and tertiary destabilizing amino acids, which are destabilizing because they can be conjugated to a primary destabilizing residue. Glu and Asp are secondary destabilizing residues, as they can be conjugated to Arg by ATE1-encoded arginyl transferases (R-transferases) (Kwon,1999). These R-transferases are specific for Glu and Asp and cannot arginylate Asn and Gln. Before being arginylated, the tertiary destabilizing residues Asn and Gln are first deamidated by N-terminal amidohydrolases to create the secondary destabilizing residues Asp and Glu. In addition, Cys was found to function as a tertiary destabilizing residue in higher Eukaryotes but not in Fungi. In the presence of oxygen and nitric oxide, N-terminal Cys is oxidized to sulfinic (CysO2(H)) or sulfonic (CysO3(H)) acid. As structural mimics of Asp, these can be recognized and arginylated by R-transferases. Because cellular oxygen and nitric oxide are required, this subset of N-end rule pathway substrates is thought to mediate oxygen sensing in both Mammalia and Viridiplantae systems (Hu,2005; Licausi,2011).
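The hierarchy of primary, secondary and tertiary destabilizing residues described above can be sketched as a small lookup. This is a simplified model built only from the residues named in this abstract; the names `processing_steps` and `fungal` are hypothetical, and the step labels are shorthand for the modifications, not enzyme or database identifiers.

```python
from typing import List

# Residue sets as listed in the abstract (one-letter amino acid codes).
PRIMARY_TYPE_I = {"R", "K"}
PRIMARY_TYPE_II = {"Y", "W", "F", "L", "I"}
SECONDARY = {"D", "E"}
# Tertiary residues that are deamidated into a secondary residue.
DEAMIDATION = {"N": "D", "Q": "E"}
TERTIARY_OXIDATION = {"C"}

ALL_DESTABILIZING = (
    PRIMARY_TYPE_I | PRIMARY_TYPE_II | SECONDARY | set(DEAMIDATION) | TERTIARY_OXIDATION
)


def processing_steps(residue: str, fungal: bool = False) -> List[str]:
    """List the modifications needed before an N-recognin can bind.

    An empty list means the residue is a primary destabilizing residue.
    Raises ValueError if the residue is not destabilizing in this model.
    """
    steps: List[str] = []
    if residue in TERTIARY_OXIDATION:
        if fungal:
            # The abstract states that Cys is not a tertiary residue in Fungi.
            raise ValueError("Cys is not destabilizing in Fungi")
        return ["oxidation", "arginylation"]
    if residue in DEAMIDATION:
        steps.append("deamidation")
        residue = DEAMIDATION[residue]
    if residue in SECONDARY:
        steps.append("arginylation")
        return steps
    if residue in PRIMARY_TYPE_I or residue in PRIMARY_TYPE_II:
        return steps
    raise ValueError(f"{residue!r} is not a known destabilizing residue")


if __name__ == "__main__":
    for aa in sorted(ALL_DESTABILIZING):
        print(aa, processing_steps(aa))
```

The five sets together contain the 12 destabilizing residues counted in the text.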
N-degrons can be generated either by excision of the N-terminal Met or by internal proteolytic cleavage, which is carried out by proteases such as separases and caspases (Varshavsky,2011; Tasaki,2007). Met excision is catalyzed by N-terminal methionine aminopeptidases. Cleavage by these peptidases requires a small residue at the position C-terminal to the scissile bond. Among the known destabilizing amino acids, only Cys fulfills this requirement, suggesting that most N-degrons are created by internal cleavage (Hu,2005). Once an N-degron is generated, the motif is bound by N-recognins, which initiate the degradation process. N-recognins, also referred to as UBR proteins, are members of the E3 ubiquitin ligase family. The yeast S. cerevisiae expresses one known UBR protein (Ubr1), whereas mammalian genomes encode at least 7 UBR proteins. In plants, PRT1 (Stary,2003) and PRT6 (Garzon,2007) have been identified as N-recognins (Graciet,2010). Canonical UBR proteins contain two motif-binding domains (Tasaki,2009). The UBR-box is composed of two zinc fingers forming a negatively charged binding pocket that specifically interacts with primary type I destabilizing residues (PDB: 3NY3). The N-box recognizes type II destabilizing residues via hydrophobic interactions (Tasaki,2012; Choi,2010). Additional domains found in N-recognins are the RING finger domain and the auto-inhibitory domain, which catalyze and regulate protein ubiquitylation, respectively. Under physiological conditions, UBR proteins form complexes with E2 conjugating enzymes, such as the human HR6A and HR6B proteins (Sriram,2011).
To date, only a few physiological N-end rule substrates have been experimentally characterized, most of which are endoproteolytic protein cleavage products. They can be categorized into five classes according to their destabilizing residue. DEG_Nend_UBRbox_1 contains type I primary destabilizing residues, whereas DEG_Nend_UBRbox_2 comprises the secondary destabilizing residues Glu and Asp. N-degrons depending on tertiary destabilizing residues are described in DEG_Nend_UBRbox_3 (Asn or Gln) and DEG_Nend_UBRbox_4 (Cys), the latter of which is not functional in Fungi. N-degrons containing type II destabilizing residues are captured in DEG_Nend_Nbox_1.
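The assignment of destabilizing residues to the five ELM classes can be summarized in a lookup table. The sketch below keys only on the exposed N-terminal residue as described in this abstract; it does not reproduce the actual regular expressions of the other four classes, which are not given in this entry, and the function name `elm_class_for_residue` is hypothetical.

```python
# Residues per class, taken from the description in this abstract.
CLASS_RESIDUES = {
    "DEG_Nend_UBRbox_1": "RK",     # type I primary residues
    "DEG_Nend_UBRbox_2": "ED",     # secondary residues
    "DEG_Nend_UBRbox_3": "NQ",     # tertiary residues (deamidation)
    "DEG_Nend_UBRbox_4": "C",      # tertiary residue (oxidation), not in Fungi
    "DEG_Nend_Nbox_1": "YWFLI",    # type II primary residues
}

# Invert the table so that each residue points to its class.
RESIDUE_TO_CLASS = {
    residue: elm_class
    for elm_class, residues in CLASS_RESIDUES.items()
    for residue in residues
}


def elm_class_for_residue(residue):
    """Return the ELM class name for an exposed N-terminal residue, or None."""
    return RESIDUE_TO_CLASS.get(residue)


if __name__ == "__main__":
    for aa in "RDNCLG":
        print(aa, elm_class_for_residue(aa))
```

A residue such as Gly, which is not destabilizing in this scheme, maps to `None`.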
8 Instances for DEG_Nend_UBRbox_4:
Acc. | Gene | Name | Start | End | Subsequence | Logic | #Ev. | Organism | Notes
F4IDA7 | HRE1 | F4IDA7_ARATH | 1 | 3 | MCGGAVISDYIAPEKIARSS | TP | 3 | Arabidopsis thaliana (Thale cress) |
Q9LUM4 | RAP2-2 | RAP22_ARATH | 1 | 3 | MCGGAIISDFIPPPRSLRVT | TP | 3 | Arabidopsis thaliana (Thale cress) |
Q9SSA8 | RAP2-12 | RA212_ARATH | 1 | 3 | MCGGAIISDFIPPPRSRRVT | TP | 9 | Arabidopsis thaliana (Thale cress) |
P42736 | RAP2-3 | RAP23_ARATH | 1 | 3 | MCGGAIISDYAPLVTKAKGR | TP | 3 | Arabidopsis thaliana (Thale cress) |
O22259 | ERF071 | ERF71_ARATH | 1 | 3 | MCGGAIISDFIWSKSESEPS | TP | 3 | Arabidopsis thaliana (Thale cress) |
P97428 | Rgs16 | RGS16_MOUSE | 1 | 3 | MCRTLATFPNTCLERAKEFK | TP | 5 | Mus musculus (House mouse) | 2
O08850 | Rgs5 | RGS5_MOUSE | 1 | 3 | MCKGLAALPHSCLERAKEIK | TP | 8 | Mus musculus (House mouse) | 2
O08899 | Rgs4 | RGS4_MOUSE | 1 | 3 | MCKGLAGLPASCLRSAKDMK | TP | 19 | Mus musculus (House mouse) | 2
Please cite: The Eukaryotic Linear Motif resource: 2022 release. (PMID:34718738)

ELM data can be downloaded & distributed for non-commercial use according to the ELM Software License Agreement