This page contains ASEs represented as a graph of the mRNA exonic structure mapped to the protein primary sequence. Exons are shown in white with black outline. The reference form is represented above the alternative form. Coding regions are shown with blue horizontal bars. These can be interrupted by red blocks for absent regions. Below are the domains in the protein obtained from NCBI's Conserved Domain Database. Domains that are affected by the ASE are outlined in red.
Further information can be obtained in the help page
FAS
113302 TNFRSF6-6 162 409
NCBIGene 36.3 355
Multiple exon skipping, size difference: 247
Exclusion in the protein (no frameshift), Exclusion in the protein causing a frameshift
Reference transcript: NM_000043
Changed!
cd TNFR 95aa 5e-16
in ref transcript
Tumor necrosis factor receptor (TNFR) domain; superfamily of TNF-like receptor domains. When bound to TNF-like cytokines, TNFRs trigger multiple signal transduction pathways, they are involved in inflammation response, apoptosis, autoimmunity and organogenesis. TNFRs domains are elongated with generally three tandem repeats of cysteine-rich domains (CRDs). They fit in the grooves between protomers within the ligand trimer. Some TNFRs, such as NGFR and HveA, bind ligands with no structural similarity to TNF and do not bind ligand trimers.
Changed!
smart DEATH 88aa 2e-16
in ref transcript
DEATH domain, found in proteins involved in cell death (apoptosis). Alpha-helical domain present in a variety of proteins with apoptotic functions. Some (but not all) of these domains form homotypic and heterotypic dimers.
Changed!
pfam TNFR_c6 37aa 4e-05
in ref transcript
TNFR/NGFR cysteine-rich region.
BCL2L11
113317 BCL2L11-4 130 220
NCBIGene 36.2 10018
Single exon skipping, size difference: 90
Exclusion in the protein (no frameshift)
Reference transcript: NM_006538
Changed!
pfam Bclx_interact 36aa 3e-12
in ref transcript
Bcl-x interacting. This domain is a long alpha helix, required for interaction with Bcl-x. It is found in BAM, Bim and Bcl2-like protein 11.
pfam Bim_N 37aa 9e-05
in ref transcript
Bim protein N-terminus. This family represents the N-terminal region of several mammal specific Bim proteins. The Bim protein is one of the BH3-only proteins, members of the Bcl-2 family that have only one of the Bcl-2 homology regions, BH3. BH3-only proteins are essential initiators of apoptotic cell death.
Changed!
pfam Bclx_interact 35aa 9e-11
in modified transcript
ACACA
ACACA.F4 rs.ACACA.R1 208 319
NCBIGene 36.3 31
Single exon skipping, size difference: 111
Inclusion in the protein causing a new stop codon
Reference transcript: NM_198837
Changed!
cd biotinyl_domain 64aa 5e-11
in ref transcript
The biotinyl-domain or biotin carboxyl carrier protein (BCCP) domain is present in all biotin-dependent enzymes, such as acetyl-CoA carboxylase, pyruvate carboxylase, propionyl-CoA carboxylase, methylcrotonyl-CoA carboxylase, geranyl-CoA carboxylase, oxaloacetate decarboxylase, methylmalonyl-CoA decarboxylase, transcarboxylase and urea amidolyase. This domain functions in transferring CO2 from one subsite to another, allowing carboxylation, decarboxylation, or transcarboxylation. During this process, biotin is covalently attached to a specific lysine.
Changed!
pfam ACC_central 751aa 0.0
in ref transcript
Acetyl-CoA carboxylase, central region. The region featured in this family is found in various eukaryotic acetyl-CoA carboxylases, N-terminal to the catalytic domain (pfam01039). This enzyme (EC:6.4.1.2) is involved in the synthesis of long-chain fatty acids, as it catalyses the rate-limiting step in this process.
Changed!
pfam Carboxyl_trans 560aa 1e-163
in ref transcript
Carboxyl transferase domain. All of the members in this family are biotin dependent carboxylases. The carboxyl transferase domain carries out the following reaction; transcarboxylation from biotin to an acceptor molecule. There are two recognised types of carboxyl transferase. One of them uses acyl-CoA and the other uses 2-oxoacid as the acceptor molecule of carbon dioxide. All of the members in this family utilise acyl-CoA as the acceptor molecule.
Changed!
TIGR accC 502aa 2e-89
in ref transcript
This model represents the biotin carboxylase subunit found usually as a component of acetyl-CoA carboxylase. Acetyl-CoA carboxylase is designated EC 6.4.1.2 and this component, biotin carboxylase, has its own designation, EC 6.3.4.14. Homologous domains are found in eukaryotic forms of acetyl-CoA carboxylase and in a number of other carboxylases (e.g. pyruvate carboxylase), but seed members and trusted cutoff are selected so as to exclude these. In some systems, the biotin carboxyl carrier protein and this protein (biotin carboxylase) may be shared by different carboxyltransferases. However, this model is not intended to identify the biotin carboxylase domain of propionyl-coA carboxylase. The model should hit the full length of proteins, except for chloroplast transit peptides in plants. If it hits a domain only of a longer protein, there may be a problem with the identification.
Changed!
pfam Biotin_lipoyl 66aa 3e-13
in ref transcript
Biotin-requiring enzyme. This family covers two Prosite entries, the conserved lysine residue binds biotin in one group and lipoic acid in the other. May not recognise the Glycine cleavage system H proteins.
Changed!
COG AccC 503aa 1e-112
in ref transcript
Biotin carboxylase [Lipid metabolism].
Changed!
COG COG4799 561aa 2e-88
in ref transcript
Acetyl-CoA carboxylase, carboxyltransferase component (subunits alpha and beta) [Lipid metabolism].
Changed!
COG AccB 75aa 1e-07
in ref transcript
Biotin carboxyl carrier protein [Lipid metabolism].
AFF3
AFF3.F2 AFF3.R31 380 455
NCBIGene 36.3 3899
Single exon skipping, size difference: 75
Exclusion in the protein (no frameshift)
Reference transcript: NM_001025108
pfam AF-4 1204aa 0.0
in ref transcript
AF-4 proto-oncoprotein. This family consists of AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental retardation syndrome) nuclear proteins. These proteins have been linked to human diseases such as acute lymphoblastic leukaemia and mental retardation. The family also contains a Drosophila AF4 protein homologue Lilliputian which contains an AT-hook domain. Lilliputian represents a novel pair-rule gene that acts in cytoskeleton regulation, segmentation and morphogenesis in Drosophila.
AKAP13
AKAP13.u.f.21 AKAP13.u.r.24 286 352
AceView 36.Apr07 AKAP13
Single exon skipping, size difference: 66
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: AKAP13.bApr07
cd RhoGEF 195aa 5e-37
in ref transcript
Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases; Also called Dbl-homologous (DH) domain. It appears that PH domains invariably occur C-terminal to RhoGEF/DH domains.
cd PH 91aa 2e-05
in ref transcript
Pleckstrin homology (PH) domain. PH domains are only found in eukaryotes. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.
smart RhoGEF 192aa 5e-40
in ref transcript
Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that PH domains invariably occur C-terminal to RhoGEF/DH domains. Improved coverage.
smart PH 102aa 5e-07
in ref transcript
Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
cd WD40 274aa 5e-47
in ref transcript
pfam NB-ARC 286aa 5e-73
in ref transcript
NB-ARC domain.
pfam CARD 85aa 4e-14
in ref transcript
Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold.
smart WD40 40aa 8e-08
in ref transcript
WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
smart WD40 40aa 2e-06
in ref transcript
smart WD40 40aa 8e-05
in ref transcript
smart WD40 42aa 1e-04
in ref transcript
smart WD40 37aa 1e-04
in ref transcript
Changed!
pfam eIF2A 114aa 0.003
in ref transcript
Eukaryotic translation initiation factor eIF2A. This is a family of eukaryotic translation initiation factors.
smart WD40 28aa 0.006
in ref transcript
smart WD40 34aa 0.008
in ref transcript
Changed!
COG COG2319 398aa 4e-33
in ref transcript
FOG: WD40 repeat [General function prediction only].
Changed!
COG COG2319 408aa 5e-25
in ref transcript
Changed!
cd WD40 264aa 1e-55
in modified transcript
Changed!
COG COG2319 390aa 1e-33
in modified transcript
APCandSRP19andZRSR1
APC.F16 APC.R14 232 344
AceView 36.Apr07 APCandSRP19andZRSR1
Single exon skipping, size difference: 112
Inclusion in the protein causing a frameshift
Reference transcript: APCandSRP19andZRSR1.eApr07
Changed!
cd RRM 61aa 5e-08
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
Changed!
smart RRM_1 72aa 7e-16
in ref transcript
RNA recognition motif.
Changed!
pfam zf-CCCH 25aa 7e-04
in ref transcript
Zinc finger C-x8-C-x5-C-x3-H type (and similar).
ATG5
APG5L.u.f.9 APG5L.u.r.5 148 276
AceView 36.Apr07 ATG5
Single exon skipping, size difference: 128
Exclusion in the protein causing a frameshift
Reference transcript: ATG5.bApr07
Changed!
pfam APG5 194aa 9e-71
in ref transcript
Autophagy protein Apg5. Apg5 is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway.
APP
APP.F9 APP.R1 118 286
NCBIGene 36.3 351
Single exon skipping, size difference: 168
Exclusion in the protein (no frameshift)
Reference transcript: NM_201413
Changed!
cd KU 52aa 1e-17
in ref transcript
BPTI/Kunitz family of serine protease inhibitors; Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure.
pfam A4_EXTRA 165aa 1e-88
in ref transcript
Amyloid A4 extracellular domain.
Changed!
pfam Kunitz_BPTI 52aa 8e-19
in ref transcript
Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure. Certain family members are similar to the tick anticoagulant peptide (TAP). This is a highly selective inhibitor of factor Xa in the blood coagulation pathways. TAP molecules are highly dipolar, and are arranged to form a twisted two- stranded antiparallel beta-sheet followed by an alpha helix.
pfam APP_amyloid 43aa 4e-18
in ref transcript
beta-amyloid precursor protein C-terminus. This is the amyloid, C-terminal, protein of the beta-Amyloid precursor protein (APP) which is a conserved and ubiquitous transmembrane glycoprotein strongly implicated in the pathogenesis of Alzheimer's disease but whose normal biological function is unknown. The C-terminal 100 residues are released and aggregate into amyloid deposits which are strongly implicated in the pathology of Alzheimer's disease plaque-formation. The domain is associated with family A4_EXTRA, pfam02177, further towards the N-terminus.
pfam Beta-APP 34aa 1e-06
in ref transcript
Beta-amyloid peptide (beta-APP).
pfam OmpH 81aa 0.004
in ref transcript
Outer membrane protein (OmpH-like). This family includes outer membrane proteins such as OmpH among others. Skp (OmpH) has been characterised as a molecular chaperone that interacts with unfolded proteins as they emerge in the periplasm from the Sec translocation machinery.
Changed!
PRK xseA 173aa 0.001
in ref transcript
exodeoxyribonuclease VII large subunit; Reviewed.
AXIN1
AXIN1.u.f.11 AXIN1.R5 312 420
NCBIGene 36.3 8312
Single exon skipping, size difference: 108
Exclusion in the protein (no frameshift)
Reference transcript: NM_003502
pfam RGS 123aa 2e-31
in ref transcript
Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits.
smart DAX 83aa 4e-27
in ref transcript
Domain present in Dishevelled and axin. Domain of unknown function.
pfam Axin_b-cat_bind 33aa 2e-08
in ref transcript
Axin beta-catenin binding domain. This domain is found on the scaffolding protein Axin which is a component of the beta-catenin destruction complex. It competes with the tumour suppressor adenomatous polyposis coli protein (APC) for binding to beta-catenin.
AXL
AXL_x151_x133_f AXL_x151_x133_r 178 205
NCBIGene 36.3 558
Single exon skipping, size difference: 27
Exclusion in the protein (no frameshift)
Reference transcript: NM_021913
cd PTKc_Axl 272aa 1e-167
in ref transcript
Catalytic Domain of the Protein Tyrosine Kinase, Axl. Protein Tyrosine Kinase (PTK) family; Axl; catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. Axl is a member of the Axl subfamily, which is composed of receptor tyr kinases (RTKs) containing an extracellular ligand-binding region with two immunoglobulin-like domains followed by two fibronectin type III repeats, a transmembrane segment, and an intracellular catalytic domain. Binding to their ligands, Gas6 and protein S, leads to receptor dimerization, autophosphorylation, activation, and intracellular signaling. Axl is widely expressed in a variety of organs and cells including epithelial, mesenchymal, hematopoietic, as well as non-transformed cells. Axl signaling is important in many cellular functions such as survival, anti-apoptosis, proliferation, migration, and adhesion. Axl was originally isolated from patients with chronic myelogenous leukemia and a chronic myeloproliferative disorder. Axl is overexpressed in many human cancers including colon, squamous cell, thyroid, breast, and lung carcinomas.
cd FN3 104aa 2e-06
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd IG 77aa 2e-05
in ref transcript
Immunoglobulin domain family; members are components of immunoglobulins, neuroglia, cell surface glycoproteins, such as, T-cell receptors, CD2, CD4, CD8, and membrane glycoproteins, such as, butyrophilin and chondroitin sulfate proteoglycan core protein. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IGcam 84aa 0.006
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
pfam fn3 97aa 1e-04
in ref transcript
pfam I-set 85aa 0.008
in ref transcript
Immunoglobulin I-set domain.
COG SPS1 280aa 7e-23
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
BCAS1
BCAS1.F9 BCAS1.R4 226 292
AceView 36.Apr07 BCAS1
Single exon skipping, size difference: 66
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: BCAS1.aApr07
BCL2L1
BCL2L1.u.f.5 BCL2L1.b.r.24 169 187
AceView 36.Apr07 BCL2L1
Alternative 3-prime, size difference: 18
Inclusion in 5'UTR
Reference transcript: BCL2L1.cApr07
cd Bcl-2_like 114aa 2e-40
in ref transcript
Apoptosis regulator proteins of the Bcl-2 family, named after B-cell lymphoma 2. This alignment model spans what have been described as Bcl-2 homology regions BH1, BH2, BH3, and BH4. Many members of this family have an additional C-terminal transmembrane segment. Some homologous proteins, which are not included in this model, may miss either the BH4 (Bax, Bak) or the BH2 (Bcl-X(S)) region, and some appear to only share the BH3 region (Bik, Bim, Bad, Bid, Egl-1). This family is involved in the regulation of the outer mitochondrial membrane's permeability and in promoting or preventing the release of apoptogenic factors, which in turn may trigger apoptosis by activating caspases. Bcl-2 and the closely related Bcl-X(L) are anti-apoptotic key regulators of programmed cell death. They are assumed to function via heterodimeric protein-protein interactions, binding pro-apoptotic proteins such as Bad (BCL2-antagonist of cell death), Bid, and Bim, by specifically interacting with their BH3 regions. Interfering with this heterodimeric interaction via small-molecule inhibitors may prove effective in targeting various cancers. This family also includes the Caenorhabditis elegans Bcl-2 homolog CED-9, which binds to CED-4, the C. Elegans homolog of mammalian Apaf-1. Apaf-1, however, does not seem to be inhibited by Bcl-2 directly.
TIGR bcl-2 233aa 1e-89
in ref transcript
in artificial membranes at acidic pH, proapoptotic Bcl-2 family proteins (including Bax and Bak) probably induce the mitochondrial permeability transition and cytochrome c release by interacting with permeability transition pores, the most important component for pore fomation of which is VDAC.
BCL2L12
BCL2L12-3 BCL2L12-4 187 330
AceView 36.Apr07 BCL2L12
Single exon skipping, size difference: 143
Exclusion in the protein causing a frameshift
Reference transcript: BCL2L12.aApr07
AGR3
BCMP11.F2 BCMP11.R2 119 255
AceView 36.Apr07 AGR3
Single exon skipping, size difference: 136
Exclusion of the protein initiation site
Reference transcript: AGR3.aApr07
Changed!
cd AGR 130aa 2e-67
in ref transcript
Anterior Gradient (AGR) family; members of this family are similar to secreted proteins encoded by the cement gland-specific genes XAG-1 and XAG-2, expressed in the anterior region of dorsal ectoderm of Xenopus. They are implicated in the formation of the cement gland and the induction of forebrain fate. The human homologs, hAG-2 and hAG-3, are secreted proteins associated with estrogen-positive breast tumors. Yeast two-hybrid studies identified the metastasis-associated C4.4a protein and dystroglycan as binding partners, indicating possible roles in the development and progression of breast cancer. hAG-2 has also been implicated in prostate cancer. Its gene was cloned as an androgen-inducible gene and it was shown to be overexpressed in prostate cancer cells at the mRNA and protein levels. AGR proteins contain one conserved cysteine corresponding to the first cysteine in the CXXC motif of TRX. They show high sequence similarity to ERp19.
Changed!
cd AGR 104aa 1e-51
in modified transcript
BMP4
BMP4.u.f.4 BMP4.u.r.5 128 337
NCBIGene 36.3 652
Alternative 5-prime, size difference: 209
Exclusion in 5'UTR
Reference transcript: NM_001202
pfam TGFb_propeptide 236aa 9e-62
in ref transcript
TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein.
smart TGFB 101aa 9e-48
in ref transcript
Transforming growth factor-beta (TGF-beta) family. Family members are active as disulphide-linked homo- or heterodimers. TGFB is a multifunctional peptide that controls proliferation, differentiation, and other functions in many cell types.
BTC
BTC.F1 BTC.R1 137 284
AceView 36.Apr07 BTC
Single exon skipping, size difference: 147
Exclusion in the protein (no frameshift)
Reference transcript: BTC.aApr07
C11orf17
C11orf17-F1 C11orf17-R1 252 333
AceView 36.Apr07 C11orf17
Single exon skipping, size difference: 81
Exclusion in the protein (no frameshift)
Reference transcript: C11orf17.bApr07
SMG7
C1orf16.F8 C1orf16.R8 99 249
AceView 36.Apr07 SMG7
Single exon skipping, size difference: 150
Exclusion in the protein (no frameshift)
Reference transcript: SMG7.aApr07
cd TPR 79aa 0.003
in ref transcript
Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
pfam EST1 119aa 1e-24
in ref transcript
Telomerase activating protein Est1. Est1 is a protein which recruits or activates telomerase at the site of polymerisation.
CAPN3
CAPN3.u.f.38 CAPN3.u.r.40 153 297
NCBIGene 36.3 825
Single exon skipping, size difference: 144
Exclusion in the protein (no frameshift)
Reference transcript: NM_000070
Changed!
cd CysPc 353aa 1e-106
in ref transcript
Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. Functions in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction.
cd Calpain_III 158aa 1e-54
in ref transcript
Calpain, subdomain III. Calpains are calcium-activated cytoplasmic cysteine proteinases, participate in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction. Catalytic domain and the two calmodulin-like domains are separated by C2-like domain III. Domain III plays an important role in calcium-induced activation of calpain involving electrostatic interactions with subdomain II. Proposed to mediate calpain's interaction with phospholipids and translocation to cytoplasmic/nuclear membranes. CD includes subdomain III of typical and atypical calpains.
cd EFh 55aa 2e-06
in ref transcript
EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
cd EFh 68aa 0.004
in ref transcript
cd EFh 56aa 0.008
in ref transcript
Changed!
pfam Peptidase_C2 343aa 1e-158
in ref transcript
Calpain family cysteine protease.
pfam Calpain_III 155aa 8e-65
in ref transcript
Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated by calcium ions.
COG FRQ1 99aa 6e-06
in ref transcript
Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].
Changed!
cd CysPc 305aa 1e-111
in modified transcript
Changed!
pfam Peptidase_C2 295aa 1e-165
in modified transcript
CASC4
CASC4.F1 NM_177974-R1 127 295
NCBIGene 36.3 113201
Single exon skipping, size difference: 168
Exclusion in the protein (no frameshift)
Reference transcript: NM_138423
TIGR SMC_prok_B 159aa 3e-06
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
COG Smc 157aa 9e-05
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
COG FlaD 98aa 0.003
in ref transcript
Putative archaeal flagellar protein D/E [Cell motility and secretion].
CCL4
CCL4.F1 CCL4.R1 259 372
AceView 36.Apr07 CCL4
Single exon skipping, size difference: 115
Inclusion in the protein causing a new stop codon
Reference transcript: CCL4.dApr07
Changed!
cd Chemokine_CC 26aa 1e-04
in ref transcript
Chemokine_CC: 1 of 4 subgroup designations based on the arrangement of the two N-terminal cysteine residues; includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity; some members (e.g. 2HCC) contain an additional disulfide bond which is thought to compensate for the highly conserved Trp missing in these; chemotatic for monocytes, macrophages, eosinophils, basophils, and T cells, but not neutrophils; exist as monomers and dimers, but are believed to be functional as monomers; found only in vertebrates and a few viruses; a subgroup of CC, identified by an N-terminal DCCL motif (Exodus-1, Exodus-2, and Exodus-3), has been shown to inhibit specific types of human cancer cell growth in a mouse model. See CDs: Chemokine (cd00169) for the general alignment of chemokines, or Chemokine_CXC (cd00273), Chemokine_C (cd00271), and Chemokine_CX3C (cd00274) for the additional chemokine subgroups, and Chemokine_CC_DCCL for the DCCL subgroup of this CD.
Changed!
pfam IL8 22aa 6e-05
in ref transcript
Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds.
CCNE1
CCNE1.F14 CCNE1.R3 260 395
AceView 36.Apr07 CCNE1
Single exon skipping, size difference: 135
Exclusion in the protein (no frameshift)
Reference transcript: CCNE1.cApr07
cd CYCLIN 91aa 1e-14
in ref transcript
Cyclin box fold. Protein binding domain functioning in cell-cycle and transcription control. Present in cyclins, TFIIB and Retinoblastoma (RB).The cyclins consist of 8 classes of cell cycle regulators that regulate cyclin dependent kinases (CDKs). TFIIB is a transcription factor that binds the TATA box. Cyclins, TFIIB and RB contain 2 copies of the domain.
Changed!
pfam Cyclin_N 128aa 7e-42
in ref transcript
Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-terminal domain.
Changed!
pfam Cyclin_C 126aa 1e-08
in ref transcript
Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). Human UDG2 is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-terminal domain.
Changed!
COG COG5024 213aa 5e-23
in ref transcript
Cyclin [Cell division and chromosome partitioning].
Changed!
pfam Cyclin_N 121aa 7e-39
in modified transcript
Changed!
pfam Cyclin_C 78aa 3e-07
in modified transcript
Changed!
COG COG5024 234aa 1e-21
in modified transcript
CD40
CD40.F10 CD40.R9 133 229
AceView 36.Apr07 CD40
Alternative 3-prime, size difference: 96
Inclusion in the protein causing a new stop codon
Reference transcript: CD40.aApr07
Changed!
cd TNFR 98aa 8e-16
in ref transcript
Tumor necrosis factor receptor (TNFR) domain; superfamily of TNF-like receptor domains. When bound to TNF-like cytokines, TNFRs trigger multiple signal transduction pathways, they are involved in inflammation response, apoptosis, autoimmunity and organogenesis. TNFRs domains are elongated with generally three tandem repeats of cysteine-rich domains (CRDs). They fit in the grooves between protomers within the ligand trimer. Some TNFRs, such as NGFR and HveA, bind ligands with no structural similarity to TNF and do not bind ligand trimers.
Changed!
cd TNFR 63aa 3e-04
in ref transcript
Changed!
smart TNFR 39aa 0.004
in ref transcript
Tumor necrosis factor receptor / nerve growth factor receptor repeats. Repeats in growth factor receptors that are involved in growth factor binding. TNF/TNFR.
CDC2L1
CDC2L1.u.f.9 refseq_CDC2L1.R1 197 334
NCBIGene 36.2 984
Single exon skipping, size difference: 137
Exclusion in the protein causing a frameshift
Reference transcript: NM_001787
Changed!
cd S_TKc 287aa 1e-68
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
Changed!
smart S_TKc 276aa 6e-69
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
Changed!
PTZ PTZ00024 295aa 2e-55
in ref transcript
cyclin-dependent protein kinase; Provisional.
CDC2L2
CDC2L2.u.f.7 refseq_CDC2L1.R6 140 179
NCBIGene 36.2 985
Alternative 5-prime, size difference: 39
Exclusion in the protein (no frameshift)
Reference transcript: NM_033531
cd S_TKc 287aa 1e-68
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
smart S_TKc 276aa 5e-69
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
PTZ PTZ00024 295aa 4e-55
in ref transcript
cyclin-dependent protein kinase; Provisional.
CHEK2
CHEK2.u.f.31 CHEK2.u.r.29 194 256
AceView 36.Apr07 CHEK2
Single exon skipping, size difference: 62
Exclusion in the protein causing a frameshift
Reference transcript: CHEK2.aApr07
Changed!
cd S_TKc 268aa 7e-68
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
cd FHA 89aa 5e-07
in ref transcript
Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain. FHA domains may bind phosphothreonine, phosphoserine and sometimes phosphotyrosine. In eukaryotes, many FHA domain-containing proteins localize to the nucleus, where they participate in establishing or maintaining cell cycle checkpoints, DNA repair, or transcriptional regulation. Members of the FHA family include: Dun1, Rad53, Cds1, Mek1, KAPP(kinase-associated protein phosphatase),and Ki-67 (a human nuclear protein related to cell proliferation).
Changed!
smart S_TKc 257aa 4e-75
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
pfam FHA 79aa 8e-08
in ref transcript
FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif.
Changed!
PTZ PTZ00263 226aa 5e-43
in ref transcript
protein kinase A catalytic subunit; Provisional.
Changed!
cd S_TKc 65aa 7e-09
in modified transcript
Changed!
smart STYKc 64aa 7e-10
in modified transcript
Protein kinase; unclassified specificity. Phosphotransferases. The specificity of this class of kinases can not be predicted. Possible dual-specificity Ser/Thr/Tyr kinase.
NLRP1
DEFCAP.au.11.f NALP1.u.r.25 152 284
NCBIGene 36.3 22861
Single exon skipping, size difference: 132
Exclusion in the protein (no frameshift)
Reference transcript: NM_033004
cd LRR_RI 179aa 8e-30
in ref transcript
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
cd AAA 95aa 0.009
in ref transcript
The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
pfam NACHT 170aa 5e-45
in ref transcript
NACHT domain. This NTPase domain is found in apoptosis proteins as well as those involved in MHC transcription activation. This family is closely related to pfam00931.
pfam CARD 82aa 2e-16
in ref transcript
Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold.
pfam PAAD_DAPIN 58aa 2e-08
in ref transcript
PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain.
smart LRR_RI 28aa 0.002
in ref transcript
Leucine rich repeat, ribonuclease inhibitor type.
COG RNA1 142aa 0.004
in ref transcript
Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Signal transduction mechanisms / RNA processing and modification].
DNMT3B
DNMT3B.u.f.26 DNMT3B.u.r.28 137 326
NCBIGene 36.3 1789
Multiple exon skipping, size difference: 189
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_006892
cd Dnmt3b_related 87aa 2e-35
in ref transcript
The PWWP domain is an essential component of DNA methyltransferase 3 B (Dnmt3b) which is responsible for establishing DNA methylation patterns during embryogenesis and gametogenesis. In tumorigenesis, DNA methylation by Dnmt3b is known to play a role in the inactivation of tumor suppressor genes. In addition, a point mutation in the PWWP domain of Dnmt3b has been identified in patients with ICF syndrome (immunodeficiency, centromeric instability, and facial anomalies), a rare autosomal recessive disorder characterized by hypomethylation of classical satellite DNA. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
Changed!
cd Cyt_C5_DNA_methylase 271aa 4e-13
in ref transcript
Cytosine-C5 specific DNA methylases; Methyl transfer reactions play an important role in many aspects of biology. Cytosine-specific DNA methylases are found both in prokaryotes and eukaryotes. DNA methylation, or the covalent addition of a methyl group to cytosine within the context of the CpG dinucleotide, has profound effects on the mammalian genome. These effects include transcriptional repression via inhibition of transcription factor binding or the recruitment of methyl-binding proteins and their associated chromatin remodeling factors, X chromosome inactivation, imprinting and the suppression of parasitic DNA sequences. DNA methylation is also essential for proper embryonic development and is an important player in both DNA repair and genome stability.
pfam PWWP 73aa 8e-24
in ref transcript
PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown.
pfam DNA_methylase 123aa 8e-08
in ref transcript
C-5 cytosine-specific DNA methylase.
COG Dcm 160aa 5e-13
in ref transcript
Site-specific DNA methylase [DNA replication, recombination, and repair].
Changed!
cd Cyt_C5_DNA_methylase 134aa 9e-09
in modified transcript
DBF4B
DRF1.F1 DRF1.R1 93 158
AceView 36.Apr07 DBF4B
Alternative 3-prime, size difference: 65
Exclusion in the protein causing a frameshift
Reference transcript: DBF4B.aApr07
Changed!
pfam zf-DBF 49aa 1e-16
in ref transcript
DBF zinc finger. This domain is predicted to bind metal ions and is often found associated with pfam00533 and pfam02178.
Changed!
COG DBF4 58aa 3e-04
in ref transcript
Protein kinase essential for the initiation of DNA replication [DNA replication, recombination, and repair / Cell division and chromosome partitioning].
DSC3
DSC3.F1 DSC3.R1 229 272
NCBIGene 36.3 1825
Single exon skipping, size difference: 43
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001941
cd CA 218aa 9e-36
in ref transcript
Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here.
cd CA 209aa 8e-31
in ref transcript
cd CA 210aa 4e-24
in ref transcript
Changed!
cd CA 193aa 1e-19
in ref transcript
pfam Cadherin_pro 87aa 5e-18
in ref transcript
Cadherin prodomain like. Cadherins are a family of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This domain corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions.
pfam Cadherin 99aa 9e-17
in ref transcript
Cadherin domain.
pfam Cadherin 103aa 2e-11
in ref transcript
smart CA 77aa 4e-11
in ref transcript
Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
pfam Cadherin 95aa 1e-10
in ref transcript
Changed!
pfam Cadherin_C 63aa 1e-04
in ref transcript
Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic binding units. A key determinant to the strength of the binding that it is mediated by cadherins is the juxtamembrane region of the cadherin. This region induces clustering and also binds to the protein p120ctn.
Changed!
pfam Cadherin 74aa 0.009
in ref transcript
Changed!
cd CA 192aa 2e-19
in modified transcript
ECT2
ECT2.F12 ECT2.R11 100 193
AceView 36.Apr07 ECT2
Single exon skipping, size difference: 93
Exclusion in the protein (no frameshift)
Reference transcript: ECT2.aApr07
cd PH_etc2 129aa 2e-61
in ref transcript
Epithelial cell transforming 2 (ECT2) pleckstrin homology (PH) domain. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinases, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, a well as cytoskeletal associated molecules and in lipid associated enzymes.
cd RhoGEF 187aa 7e-37
in ref transcript
Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases; Also called Dbl-homologous (DH) domain. It appears that PH domains invariably occur C-terminal to RhoGEF/DH domains.
cd BRCT 71aa 4e-08
in ref transcript
Breast Cancer Suppressor Protein (BRCA1), carboxy-terminal domain. The BRCT domain is found within many DNA damage repair and cell cycle checkpoint proteins. The unique diversity of this domain superfamily allows BRCT modules to interact forming homo/hetero BRCT multimers, BRCT-non-BRCT interactions, and interactions within DNA strand breaks.
smart RhoGEF 185aa 2e-43
in ref transcript
Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that PH domains invariably occur C-terminal to RhoGEF/DH domains. Improved coverage.
smart BRCT 73aa 2e-07
in ref transcript
breast cancer carboxy-terminal domain.
pfam BRCT 72aa 7e-04
in ref transcript
BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not been included in this family. The BRCT domain of XRCC1 forms a homodimer in the crystal structure. This suggests that pairs of BRCT domains associate as homo- or heterodimers.
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift), Exclusion in the protein causing a frameshift
Reference transcript: FANCL.bApr07
Changed!
pfam WD-3 270aa 1e-118
in ref transcript
WD-repeat region. This entry is of a region of approximately 100 residues containing three WD repeats and six cysteine residues possibly as three cystine-bridges. These regions are contained within the Fancl protein in humans which is the putative E3 ubiquitin ligase subunit of the FA complex (Fanconi anaemia). Eight subunits of the Fanconi anaemia gene products form a multisubunit nuclear complex which is required for mono-ubiquitination of a downstream FA protein, FANCD2. The WD repeats are required for interaction with other subunits of the FA complex.
Changed!
COG COG5219 62aa 0.003
in ref transcript
Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only].
FASTK
FAST_HUMAN-5 FAST_HUMAN-6 410 730
AceView 36.Apr07 FASTK
Multiple exon skipping, size difference: 320
Exclusion in the protein (no frameshift), Exclusion in the protein causing a frameshift
Reference transcript: FASTK.aApr07
Changed!
pfam FAST_1 67aa 1e-13
in ref transcript
FAST kinase-like protein, subdomain 1. This family represents a conserved region of eukaryotic Fas-activated serine/threonine (FAST) kinases (EC:2.7.1.-) that contains several conserved leucine residues. FAST kinase is rapidly activated during Fas-mediated apoptosis, when it phosphorylates TIA-1, a nuclear RNA-binding protein that has been implicated as an effector of apoptosis. Note that many family members are hypothetical proteins. This region is often found immediately N-terminal to the FAST kinase-like protein, subdomain 2.
Changed!
pfam FAST_2 92aa 8e-13
in ref transcript
FAST kinase-like protein, subdomain 2. This family represents a conserved region of eukaryotic Fas-activated serine/threonine (FAST) kinases (EC:2.7.1.-) that contains several conserved leucine residues. FAST kinase is rapidly activated during Fas-mediated apoptosis, when it phosphorylates TIA-1, a nuclear RNA-binding protein that has been implicated as an effector of apoptosis. Note that many family members are hypothetical proteins. This subdomain is often found associated with the FAST kinase-like protein, subdomain 2.
Changed!
pfam RAP 58aa 5e-12
in ref transcript
RAP domain. This domain is found in various eukaryotic species, particularly in apicomplexans such as Plasmodium falciparum, where it is found in proteins that are important in various parasite-host cell interactions. It is thought to be an RNA-binding domain.
FGFR1OP
FGFR1OP.u.f.10 FGFR1OP.u.r.12 108 168
NCBIGene 36.3 11116
Single exon skipping, size difference: 60
Exclusion in the protein (no frameshift)
Reference transcript: NM_007045
pfam FOP_dimer 81aa 8e-35
in ref transcript
FOP N terminal dimerisation domain. Fibroblast growth factor receptor 1 (FGFR1) oncogene partner (FOP) is a centrosomal protein that is involved in anchoring microtubules to subcellular structures. This domain includes a Lis-homology motif. It forms an alpha helical bundle and is involved in dimerisation.
FGFR2
FGFR2.F1 FGFR2.R1 205 472
AceView 36.Apr07 FGFR2
Single exon skipping, size difference: 267
Exclusion in the protein (no frameshift)
Reference transcript: FGFR2.aApr07
cd PTKc_FGFR2 304aa 0.0
in ref transcript
Catalytic Domain of the Protein Tyrosine Kinase, Fibroblast Growth Factor Receptor 2. Protein Tyrosine Kinase (PTK) family; Fibroblast Growth Factor Receptor 2 (FGFR2); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. FGFR2 is part of the FGFR subfamily, which are receptor tyr kinases (RTKs) containing an extracellular ligand-binding region with three immunoglobulin-like domains, a transmembrane segment, and an intracellular catalytic domain. The binding of FGFRs to their ligands, the FGFs, results in receptor dimerization and activation, and intracellular signaling. The binding of FGFs to FGFRs is promiscuous, in that a receptor may be activated by several ligands and a ligand may bind to more that one type of receptor. There are many splice variants of FGFR2 which show differential expression and binding to FGF ligands. Disruption of either FGFR2 or FGFR2b is lethal in mice, due to defects in the placenta or severe impairment of tissue development including lung, limb, and thyroid, respectively. Disruption of FGFR2c in mice results in defective bone and skull development. Genetic alterations of FGFR2 are associated with many human skeletal disorders including Apert syndrome, Crouzon syndrome, Jackson-Weiss syndrome, and Pfeiffer syndrome.
cd IGcam 93aa 1e-16
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IGcam 101aa 3e-05
in ref transcript
Changed!
cd IGcam 64aa 8e-05
in ref transcript
pfam Pkinase_Tyr 277aa 1e-115
in ref transcript
Protein tyrosine kinase.
pfam I-set 75aa 2e-12
in ref transcript
Immunoglobulin I-set domain.
Changed!
smart IGc2 57aa 9e-11
in ref transcript
Immunoglobulin C-2 Type.
smart IG_like 96aa 2e-10
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
COG SPS1 252aa 3e-21
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
FGFR3
FGFR3.F4 rs.FGFR3.R1 99 435
NCBIGene 36.3 2261
Multiple exon skipping, size difference: 336
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_000142
cd PTKc_FGFR3 334aa 0.0
in ref transcript
Catalytic Domain of the Protein Tyrosine Kinase, Fibroblast Growth Factor Receptor 3. Protein Tyrosine Kinase (PTK) family; Fibroblast Growth Factor Receptor 3 (FGFR3); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. FGFR3 is part of the FGFR subfamily, which are receptor tyr kinases (RTKs) containing an extracellular ligand-binding region with three immunoglobulin-like domains, a transmembrane segment, and an intracellular catalytic domain. The binding of FGFRs to their ligands, the FGFs, results in receptor dimerization and activation, and intracellular signaling. The binding of FGFs to FGFRs is promiscuous, in that a receptor may be activated by several ligands and a ligand may bind to more that one type of receptor. Many FGFR3 splice variants have been reported with the IIIb and IIIc isoforms being the predominant forms. FGFR3 IIIc is the isoform expressed in chondrocytes, the cells affected in dwarfism, while IIIb is expressed in epithelial cells. FGFR3 ligands include FGF1, FGF2, FGF4, FGF8, FGF9, and FGF23. It is a negative regulator of long bone growth. In the cochlear duct and in the lens, FGFR3 is involved in differentiation while it appears to have a role in cell proliferation in epithelial cells. Germline mutations in FGFR3 are associated with skeletal disorders including several forms of dwarfism. Some missense mutations are associated with multiple myeloma and carcinomas of the bladder and cervix. Overexpression of FGFR3 is found in thyroid carcinoma.
cd IGcam 93aa 4e-16
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
Changed!
cd IGcam 94aa 8e-08
in ref transcript
pfam Pkinase_Tyr 277aa 1e-117
in ref transcript
Protein tyrosine kinase.
pfam I-set 75aa 2e-13
in ref transcript
Immunoglobulin I-set domain.
Changed!
smart IG_like 94aa 2e-10
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
smart IGc2 58aa 6e-06
in ref transcript
Immunoglobulin C-2 Type.
COG SPS1 275aa 6e-22
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
Changed!
smart IG_like 43aa 0.009
in modified transcript
FGFR4
FGFR4.F23 FGFR4.R11 275 469
AceView 36.Apr07 FGFR4
Alternative 3-prime, size difference: 194
Inclusion in the protein causing a frameshift
Reference transcript: FGFR4.bApr07
Changed!
cd PTKc_FGFR4 311aa 0.0
in ref transcript
Catalytic Domain of the Protein Tyrosine Kinase, Fibroblast Growth Factor Receptor 4. Protein Tyrosine Kinase (PTK) family; Fibroblast Growth Factor Receptor 4 (FGFR4); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. FGFR4 is part of the FGFR subfamily, which are receptor tyr kinases (RTKs) containing an extracellular ligand-binding region with three immunoglobulin-like domains, a transmembrane segment, and an intracellular catalytic domain. The binding of FGFRs to their ligands, the FGFs, results in receptor dimerization and activation, and intracellular signaling. The binding of FGFs to FGFRs is promiscuous, in that a receptor may be activated by several ligands and a ligand may bind to more that one type of receptor. Unlike other FGFRs, there is only one splice form of FGFR4. It binds FGF1, FGF2, FGF6, FGF19, and FGF23. FGF19 is a selective ligand for FGFR4. Although disruption of FGFR4 in mice causes no obvious phenotype, in vivo inhibition of FGFR4 in cultured skeletal muscle cells resulted in an arrest of muscle progenitor differentiation. FGF6 and FGFR4 are uniquely expressed in myofibers and satellite cells. FGF6/FGFR4 signaling appears to play a key role in the regulation of muscle regeneration. A polymorphism in FGFR4 is found in head and neck squamous cell carcinoma.
cd IGcam 93aa 6e-17
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IGcam 101aa 2e-07
in ref transcript
cd IGcam 67aa 0.003
in ref transcript
Changed!
pfam Pkinase_Tyr 277aa 1e-114
in ref transcript
Protein tyrosine kinase.
smart IGc2 67aa 4e-15
in ref transcript
Immunoglobulin C-2 Type.
smart IG_like 96aa 6e-12
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
smart IGc2 58aa 8e-09
in ref transcript
Changed!
COG SPS1 251aa 7e-23
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
FN1
FN1.u.f.30 FN1.u.r.34 296 569
NCBIGene 36.3 2335
Single exon skipping, size difference: 273
Exclusion in the protein (no frameshift)
Reference transcript: NM_212482
cd FN2 48aa 2e-16
in ref transcript
Fibronectin Type II domain: FN2 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. Fibronectin is composed of 3 types of modules, FN1,FN2 and FN3. The collagen binding domain contains four FN1 and two FN2 repeats.
cd FN2 48aa 3e-16
in ref transcript
cd FN3 83aa 4e-10
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd FN1 44aa 4e-09
in ref transcript
Fibronectin type 1 domain, approximately 40 residue long with two conserved disulfide bridges. FN1 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. FN1 domains also found in coagulation factor XII, HGF activator, and tissue-type plasminogen activator. In tissue plasminogen activator, FN1 domains may form functional fibrin-binding units with EGF-like domains C-terminal to FN1.
cd FN3 81aa 8e-09
in ref transcript
cd FN3 80aa 1e-08
in ref transcript
cd FN1 40aa 4e-08
in ref transcript
cd FN1 42aa 5e-08
in ref transcript
cd FN1 45aa 8e-08
in ref transcript
cd FN3 88aa 1e-07
in ref transcript
cd FN3 88aa 2e-07
in ref transcript
cd FN1 44aa 2e-07
in ref transcript
cd FN3 87aa 3e-07
in ref transcript
cd FN1 38aa 4e-07
in ref transcript
cd FN3 81aa 9e-07
in ref transcript
cd FN3 82aa 1e-06
in ref transcript
cd FN3 88aa 1e-06
in ref transcript
cd FN1 41aa 1e-06
in ref transcript
Changed!
cd FN3 93aa 2e-06
in ref transcript
cd FN1 45aa 2e-06
in ref transcript
cd FN1 42aa 2e-06
in ref transcript
cd FN3 81aa 6e-06
in ref transcript
cd FN3 90aa 8e-06
in ref transcript
cd FN3 73aa 9e-06
in ref transcript
cd FN1 44aa 1e-05
in ref transcript
cd FN1 43aa 3e-05
in ref transcript
cd FN1 39aa 0.003
in ref transcript
cd FN3 73aa 0.006
in ref transcript
smart FN2 49aa 9e-20
in ref transcript
Fibronectin type 2 domain. One of three types of internal repeat within the plasma protein, fibronectin. Also occurs in coagulation factor XII, 2 type IV collagenases, PDC-109, and cation-independent mannose-6-phosphate and secretory phospholipase A2 receptors. In fibronectin, PDC-109, and the collagenases, this domain contributes to collagen-binding function.
smart FN2 49aa 4e-19
in ref transcript
pfam fn3 82aa 8e-17
in ref transcript
Fibronectin type III domain.
smart FN1 45aa 6e-13
in ref transcript
Fibronectin type 1 domain. One of three types of internal repeat within the plasma protein, fibronectin. Found also in coagulation factor XII, HGF activator and tissue-type plasminogen activator. In t-PA and fibronectin, this domain type contributes to fibrin-binding.
pfam fn1 39aa 1e-12
in ref transcript
Fibronectin type I domain.
pfam fn3 81aa 3e-12
in ref transcript
pfam fn3 81aa 3e-12
in ref transcript
smart FN1 45aa 3e-12
in ref transcript
smart FN1 42aa 3e-12
in ref transcript
smart FN1 45aa 5e-12
in ref transcript
smart FN1 39aa 2e-11
in ref transcript
pfam fn3 81aa 4e-11
in ref transcript
Changed!
pfam fn3 83aa 5e-11
in ref transcript
pfam fn3 67aa 8e-11
in ref transcript
pfam fn1 39aa 8e-11
in ref transcript
pfam fn3 81aa 1e-10
in ref transcript
pfam fn3 80aa 1e-10
in ref transcript
smart FN1 43aa 2e-10
in ref transcript
pfam fn3 80aa 5e-10
in ref transcript
smart FN1 44aa 7e-10
in ref transcript
pfam fn3 81aa 8e-09
in ref transcript
pfam fn3 85aa 9e-09
in ref transcript
pfam fn3 70aa 1e-08
in ref transcript
pfam fn3 65aa 2e-08
in ref transcript
smart FN1 41aa 3e-08
in ref transcript
pfam fn3 81aa 3e-07
in ref transcript
smart FN1 41aa 5e-07
in ref transcript
pfam fn3 54aa 5e-05
in ref transcript
pfam fn1 31aa 9e-05
in ref transcript
FN1
FN1.u.f.40 FN1.u.r.45 126 396
NCBIGene 36.3 2335
Single exon skipping, size difference: 270
Exclusion in the protein (no frameshift)
Reference transcript: NM_212482
cd FN2 48aa 2e-16
in ref transcript
Fibronectin Type II domain: FN2 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. Fibronectin is composed of 3 types of modules, FN1,FN2 and FN3. The collagen binding domain contains four FN1 and two FN2 repeats.
cd FN2 48aa 3e-16
in ref transcript
cd FN3 83aa 4e-10
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd FN1 44aa 4e-09
in ref transcript
Fibronectin type 1 domain, approximately 40 residue long with two conserved disulfide bridges. FN1 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. FN1 domains also found in coagulation factor XII, HGF activator, and tissue-type plasminogen activator. In tissue plasminogen activator, FN1 domains may form functional fibrin-binding units with EGF-like domains C-terminal to FN1.
cd FN3 81aa 8e-09
in ref transcript
cd FN3 80aa 1e-08
in ref transcript
cd FN1 40aa 4e-08
in ref transcript
cd FN1 42aa 5e-08
in ref transcript
cd FN1 45aa 8e-08
in ref transcript
cd FN3 88aa 1e-07
in ref transcript
cd FN3 88aa 2e-07
in ref transcript
cd FN1 44aa 2e-07
in ref transcript
cd FN3 87aa 3e-07
in ref transcript
cd FN1 38aa 4e-07
in ref transcript
Changed!
cd FN3 81aa 9e-07
in ref transcript
cd FN3 82aa 1e-06
in ref transcript
Changed!
cd FN3 88aa 1e-06
in ref transcript
cd FN1 41aa 1e-06
in ref transcript
cd FN3 93aa 2e-06
in ref transcript
cd FN1 45aa 2e-06
in ref transcript
cd FN1 42aa 2e-06
in ref transcript
cd FN3 81aa 6e-06
in ref transcript
cd FN3 90aa 8e-06
in ref transcript
cd FN3 73aa 9e-06
in ref transcript
cd FN1 44aa 1e-05
in ref transcript
cd FN1 43aa 3e-05
in ref transcript
cd FN1 39aa 0.003
in ref transcript
cd FN3 73aa 0.006
in ref transcript
smart FN2 49aa 9e-20
in ref transcript
Fibronectin type 2 domain. One of three types of internal repeat within the plasma protein, fibronectin. Also occurs in coagulation factor XII, 2 type IV collagenases, PDC-109, and cation-independent mannose-6-phosphate and secretory phospholipase A2 receptors. In fibronectin, PDC-109, and the collagenases, this domain contributes to collagen-binding function.
smart FN2 49aa 4e-19
in ref transcript
pfam fn3 82aa 8e-17
in ref transcript
Fibronectin type III domain.
smart FN1 45aa 6e-13
in ref transcript
Fibronectin type 1 domain. One of three types of internal repeat within the plasma protein, fibronectin. Found also in coagulation factor XII, HGF activator and tissue-type plasminogen activator. In t-PA and fibronectin, this domain type contributes to fibrin-binding.
Changed!
pfam fn1 39aa 1e-12
in ref transcript
Fibronectin type I domain.
pfam fn3 81aa 3e-12
in ref transcript
pfam fn3 81aa 3e-12
in ref transcript
smart FN1 45aa 3e-12
in ref transcript
smart FN1 42aa 3e-12
in ref transcript
smart FN1 45aa 5e-12
in ref transcript
smart FN1 39aa 2e-11
in ref transcript
pfam fn3 81aa 4e-11
in ref transcript
pfam fn3 83aa 5e-11
in ref transcript
pfam fn3 67aa 8e-11
in ref transcript
pfam fn1 39aa 8e-11
in ref transcript
pfam fn3 81aa 1e-10
in ref transcript
pfam fn3 80aa 1e-10
in ref transcript
smart FN1 43aa 2e-10
in ref transcript
pfam fn3 80aa 5e-10
in ref transcript
smart FN1 44aa 7e-10
in ref transcript
pfam fn3 81aa 8e-09
in ref transcript
pfam fn3 85aa 9e-09
in ref transcript
pfam fn3 70aa 1e-08
in ref transcript
pfam fn3 65aa 2e-08
in ref transcript
smart FN1 41aa 3e-08
in ref transcript
Changed!
pfam fn3 81aa 3e-07
in ref transcript
smart FN1 41aa 5e-07
in ref transcript
pfam fn3 54aa 5e-05
in ref transcript
pfam fn1 31aa 9e-05
in ref transcript
Changed!
cd FN3 89aa 3e-07
in modified transcript
Changed!
smart FN1 44aa 2e-12
in modified transcript
FN1
FN1.u.f.55 FN1.u.r.62 129 204
NCBIGene 36.3 2335
Alternative 3-prime, size difference: 75
Exclusion in the protein (no frameshift)
Reference transcript: NM_212482
cd FN2 48aa 2e-16
in ref transcript
Fibronectin Type II domain: FN2 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. Fibronectin is composed of 3 types of modules, FN1,FN2 and FN3. The collagen binding domain contains four FN1 and two FN2 repeats.
cd FN2 48aa 3e-16
in ref transcript
cd FN3 83aa 4e-10
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd FN1 44aa 4e-09
in ref transcript
Fibronectin type 1 domain, approximately 40 residue long with two conserved disulfide bridges. FN1 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. FN1 domains also found in coagulation factor XII, HGF activator, and tissue-type plasminogen activator. In tissue plasminogen activator, FN1 domains may form functional fibrin-binding units with EGF-like domains C-terminal to FN1.
cd FN3 81aa 8e-09
in ref transcript
cd FN3 80aa 1e-08
in ref transcript
cd FN1 40aa 4e-08
in ref transcript
cd FN1 42aa 5e-08
in ref transcript
cd FN1 45aa 8e-08
in ref transcript
cd FN3 88aa 1e-07
in ref transcript
cd FN3 88aa 2e-07
in ref transcript
cd FN1 44aa 2e-07
in ref transcript
cd FN3 87aa 3e-07
in ref transcript
cd FN1 38aa 4e-07
in ref transcript
cd FN3 81aa 9e-07
in ref transcript
cd FN3 82aa 1e-06
in ref transcript
cd FN3 88aa 1e-06
in ref transcript
cd FN1 41aa 1e-06
in ref transcript
cd FN3 93aa 2e-06
in ref transcript
cd FN1 45aa 2e-06
in ref transcript
cd FN1 42aa 2e-06
in ref transcript
cd FN3 81aa 6e-06
in ref transcript
cd FN3 90aa 8e-06
in ref transcript
cd FN3 73aa 9e-06
in ref transcript
cd FN1 44aa 1e-05
in ref transcript
cd FN1 43aa 3e-05
in ref transcript
cd FN1 39aa 0.003
in ref transcript
cd FN3 73aa 0.006
in ref transcript
smart FN2 49aa 9e-20
in ref transcript
Fibronectin type 2 domain. One of three types of internal repeat within the plasma protein, fibronectin. Also occurs in coagulation factor XII, 2 type IV collagenases, PDC-109, and cation-independent mannose-6-phosphate and secretory phospholipase A2 receptors. In fibronectin, PDC-109, and the collagenases, this domain contributes to collagen-binding function.
smart FN2 49aa 4e-19
in ref transcript
pfam fn3 82aa 8e-17
in ref transcript
Fibronectin type III domain.
smart FN1 45aa 6e-13
in ref transcript
Fibronectin type 1 domain. One of three types of internal repeat within the plasma protein, fibronectin. Found also in coagulation factor XII, HGF activator and tissue-type plasminogen activator. In t-PA and fibronectin, this domain type contributes to fibrin-binding.
pfam fn1 39aa 1e-12
in ref transcript
Fibronectin type I domain.
pfam fn3 81aa 3e-12
in ref transcript
pfam fn3 81aa 3e-12
in ref transcript
smart FN1 45aa 3e-12
in ref transcript
smart FN1 42aa 3e-12
in ref transcript
smart FN1 45aa 5e-12
in ref transcript
smart FN1 39aa 2e-11
in ref transcript
pfam fn3 81aa 4e-11
in ref transcript
pfam fn3 83aa 5e-11
in ref transcript
pfam fn3 67aa 8e-11
in ref transcript
pfam fn1 39aa 8e-11
in ref transcript
pfam fn3 81aa 1e-10
in ref transcript
pfam fn3 80aa 1e-10
in ref transcript
smart FN1 43aa 2e-10
in ref transcript
pfam fn3 80aa 5e-10
in ref transcript
smart FN1 44aa 7e-10
in ref transcript
pfam fn3 81aa 8e-09
in ref transcript
pfam fn3 85aa 9e-09
in ref transcript
pfam fn3 70aa 1e-08
in ref transcript
pfam fn3 65aa 2e-08
in ref transcript
smart FN1 41aa 3e-08
in ref transcript
pfam fn3 81aa 3e-07
in ref transcript
smart FN1 41aa 5e-07
in ref transcript
pfam fn3 54aa 5e-05
in ref transcript
pfam fn1 31aa 9e-05
in ref transcript
FN1
FN1.u.f.56 FN1.u.r.67 231 319
NCBIGene 36.3 2335
Intron retention, size difference: 93
Exclusion in the protein (no frameshift)
Reference transcript: NM_212482
cd FN2 48aa 2e-16
in ref transcript
Fibronectin Type II domain: FN2 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. Fibronectin is composed of 3 types of modules, FN1,FN2 and FN3. The collagen binding domain contains four FN1 and two FN2 repeats.
cd FN2 48aa 3e-16
in ref transcript
cd FN3 83aa 4e-10
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd FN1 44aa 4e-09
in ref transcript
Fibronectin type 1 domain, approximately 40 residue long with two conserved disulfide bridges. FN1 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. FN1 domains also found in coagulation factor XII, HGF activator, and tissue-type plasminogen activator. In tissue plasminogen activator, FN1 domains may form functional fibrin-binding units with EGF-like domains C-terminal to FN1.
cd FN3 81aa 8e-09
in ref transcript
cd FN3 80aa 1e-08
in ref transcript
cd FN1 40aa 4e-08
in ref transcript
cd FN1 42aa 5e-08
in ref transcript
cd FN1 45aa 8e-08
in ref transcript
cd FN3 88aa 1e-07
in ref transcript
cd FN3 88aa 2e-07
in ref transcript
cd FN1 44aa 2e-07
in ref transcript
cd FN3 87aa 3e-07
in ref transcript
cd FN1 38aa 4e-07
in ref transcript
cd FN3 81aa 9e-07
in ref transcript
cd FN3 82aa 1e-06
in ref transcript
cd FN3 88aa 1e-06
in ref transcript
cd FN1 41aa 1e-06
in ref transcript
cd FN3 93aa 2e-06
in ref transcript
cd FN1 45aa 2e-06
in ref transcript
cd FN1 42aa 2e-06
in ref transcript
cd FN3 81aa 6e-06
in ref transcript
cd FN3 90aa 8e-06
in ref transcript
cd FN3 73aa 9e-06
in ref transcript
cd FN1 44aa 1e-05
in ref transcript
cd FN1 43aa 3e-05
in ref transcript
cd FN1 39aa 0.003
in ref transcript
cd FN3 73aa 0.006
in ref transcript
smart FN2 49aa 9e-20
in ref transcript
Fibronectin type 2 domain. One of three types of internal repeat within the plasma protein, fibronectin. Also occurs in coagulation factor XII, 2 type IV collagenases, PDC-109, and cation-independent mannose-6-phosphate and secretory phospholipase A2 receptors. In fibronectin, PDC-109, and the collagenases, this domain contributes to collagen-binding function.
smart FN2 49aa 4e-19
in ref transcript
pfam fn3 82aa 8e-17
in ref transcript
Fibronectin type III domain.
smart FN1 45aa 6e-13
in ref transcript
Fibronectin type 1 domain. One of three types of internal repeat within the plasma protein, fibronectin. Found also in coagulation factor XII, HGF activator and tissue-type plasminogen activator. In t-PA and fibronectin, this domain type contributes to fibrin-binding.
ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain.
pfam RBB1NT 93aa 3e-15
in ref transcript
RBB1NT (NUC162) domain. This domain is found N terminal to the ARID/BRIGHT domain in DNA-binding proteins of the Retinoblastoma-binding protein 1 family.
smart TUDOR 56aa 1e-05
in ref transcript
Tudor domain. Domain of unknown function present in several RNA-binding proteins. 10 copies in the Drosophila Tudor protein. Initial proposal that the survival motor neuron gene product contain a Tudor domain are corroborated by more recent database search techniques such as PSI-BLAST (unpublished).
FN1
FOX.FN1.F1 FOX.FN1.R1 124 397
NCBIGene 36.3 2335
Single exon skipping, size difference: 273
Exclusion in the protein (no frameshift)
Reference transcript: NM_212482
cd FN2 48aa 2e-16
in ref transcript
Fibronectin Type II domain: FN2 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. Fibronectin is composed of 3 types of modules, FN1,FN2 and FN3. The collagen binding domain contains four FN1 and two FN2 repeats.
cd FN2 48aa 3e-16
in ref transcript
cd FN3 83aa 4e-10
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd FN1 44aa 4e-09
in ref transcript
Fibronectin type 1 domain, approximately 40 residue long with two conserved disulfide bridges. FN1 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. FN1 domains also found in coagulation factor XII, HGF activator, and tissue-type plasminogen activator. In tissue plasminogen activator, FN1 domains may form functional fibrin-binding units with EGF-like domains C-terminal to FN1.
cd FN3 81aa 8e-09
in ref transcript
cd FN3 80aa 1e-08
in ref transcript
cd FN1 40aa 4e-08
in ref transcript
cd FN1 42aa 5e-08
in ref transcript
cd FN1 45aa 8e-08
in ref transcript
cd FN3 88aa 1e-07
in ref transcript
cd FN3 88aa 2e-07
in ref transcript
cd FN1 44aa 2e-07
in ref transcript
cd FN3 87aa 3e-07
in ref transcript
cd FN1 38aa 4e-07
in ref transcript
cd FN3 81aa 9e-07
in ref transcript
cd FN3 82aa 1e-06
in ref transcript
cd FN3 88aa 1e-06
in ref transcript
cd FN1 41aa 1e-06
in ref transcript
Changed!
cd FN3 93aa 2e-06
in ref transcript
cd FN1 45aa 2e-06
in ref transcript
cd FN1 42aa 2e-06
in ref transcript
cd FN3 81aa 6e-06
in ref transcript
cd FN3 90aa 8e-06
in ref transcript
cd FN3 73aa 9e-06
in ref transcript
cd FN1 44aa 1e-05
in ref transcript
cd FN1 43aa 3e-05
in ref transcript
cd FN1 39aa 0.003
in ref transcript
cd FN3 73aa 0.006
in ref transcript
smart FN2 49aa 9e-20
in ref transcript
Fibronectin type 2 domain. One of three types of internal repeat within the plasma protein, fibronectin. Also occurs in coagulation factor XII, 2 type IV collagenases, PDC-109, and cation-independent mannose-6-phosphate and secretory phospholipase A2 receptors. In fibronectin, PDC-109, and the collagenases, this domain contributes to collagen-binding function.
smart FN2 49aa 4e-19
in ref transcript
pfam fn3 82aa 8e-17
in ref transcript
Fibronectin type III domain.
smart FN1 45aa 6e-13
in ref transcript
Fibronectin type 1 domain. One of three types of internal repeat within the plasma protein, fibronectin. Found also in coagulation factor XII, HGF activator and tissue-type plasminogen activator. In t-PA and fibronectin, this domain type contributes to fibrin-binding.
pfam fn1 39aa 1e-12
in ref transcript
Fibronectin type I domain.
pfam fn3 81aa 3e-12
in ref transcript
pfam fn3 81aa 3e-12
in ref transcript
smart FN1 45aa 3e-12
in ref transcript
smart FN1 42aa 3e-12
in ref transcript
smart FN1 45aa 5e-12
in ref transcript
smart FN1 39aa 2e-11
in ref transcript
pfam fn3 81aa 4e-11
in ref transcript
Changed!
pfam fn3 83aa 5e-11
in ref transcript
pfam fn3 67aa 8e-11
in ref transcript
pfam fn1 39aa 8e-11
in ref transcript
pfam fn3 81aa 1e-10
in ref transcript
pfam fn3 80aa 1e-10
in ref transcript
smart FN1 43aa 2e-10
in ref transcript
pfam fn3 80aa 5e-10
in ref transcript
smart FN1 44aa 7e-10
in ref transcript
pfam fn3 81aa 8e-09
in ref transcript
pfam fn3 85aa 9e-09
in ref transcript
pfam fn3 70aa 1e-08
in ref transcript
pfam fn3 65aa 2e-08
in ref transcript
smart FN1 41aa 3e-08
in ref transcript
pfam fn3 81aa 3e-07
in ref transcript
smart FN1 41aa 5e-07
in ref transcript
pfam fn3 54aa 5e-05
in ref transcript
pfam fn1 31aa 9e-05
in ref transcript
INSR
FOX.INSR.F1 FOX.INSR.R1 194 230
NCBIGene 36.3 3643
Single exon skipping, size difference: 36
Exclusion in the protein (no frameshift)
Reference transcript: NM_000208
cd PTKc_InsR 288aa 1e-179
in ref transcript
Catalytic Domain of the Protein Tyrosine Kinase, Insulin Receptor. Protein Tyrosine Kinase (PTK) family; Insulin Receptor (InsR); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. InsR is a receptor tyr kinase (RTK) that is composed of two alphabeta heterodimers. Binding of the insulin ligand to the extracellular alpha subunit activates the intracellular tyr kinase domain of the transmembrane beta subunit. Receptor activation leads to autophosphorylation, stimulating downstream kinase activities, which initiate signaling cascades and biological function. InsR signaling plays an important role in many cellular processes including glucose homeostasis, glycogen synthesis, lipid and protein metabolism, ion and amino acid transport, cell cycle and proliferation, cell differentiation, gene transcription, and nitric oxide synthesis. Insulin resistance, caused by abnormalities in InsR signaling, has been described in diabetes, hypertension, cardiovascular disease, metabolic syndrome, heart failure, and female infertility.
cd FN3 78aa 3e-07
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd FU 48aa 2e-06
in ref transcript
Furin-like repeats. Cysteine rich region. Exact function of the domain is not known. Furin is a serine-kinase dependent proprotein processor. Other members of this family include endoproteases and cell surface receptors.
cd FN3 45aa 7e-04
in ref transcript
pfam Pkinase_Tyr 268aa 1e-118
in ref transcript
Protein tyrosine kinase.
pfam Furin-like 162aa 5e-57
in ref transcript
Furin-like cysteine rich region.
pfam Recep_L_domain 114aa 1e-25
in ref transcript
Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain.
pfam Recep_L_domain 113aa 2e-25
in ref transcript
smart FN3 71aa 5e-05
in ref transcript
Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
COG SPS1 265aa 2e-18
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
KIF2A
FOX.KIF2AandIPO11.F1 rs.KIF2A.R1 157 271
NCBIGene 36.3 3796
Single exon skipping, size difference: 114
Exclusion in the protein (no frameshift)
Reference transcript: NM_001098511
Changed!
cd KISc_KIF2_like 330aa 1e-142
in ref transcript
Kinesin motor domain, KIF2-like group. KIF2 is a protein expressed in neurons, which has been associated with axonal transport and neuron development; alternative splice forms have been implicated in lysosomal translocation. This catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Kinesins are microtubule-dependent molecular motors that play important roles in intracellular transport and in cell division. In this subgroup the motor domain is found in the middle (M-type) of the protein chain. M-type kinesins are (+) end-directed motors, i.e. they transport cargo towards the (+) end of the microtubule. Kinesin motor domains hydrolyze ATP at a rate of about 80 per second, and move along the microtubule at a speed of about 6400 Angstroms per second (KIF2 may be slower). To achieve that, kinesin head groups work in pairs. Upon replacing ADP with ATP, a kinesin motor domain increases its affinity for microtubule binding and locks in place. Also, the neck linker binds to the motor domain, which repositions the other head domain through the coiled-coil domain close to a second tubulin dimer, about 80 Angstroms along the microtubule. Meanwhile, ATP hydrolysis takes place, and when the second head domain binds to the microtubule, the first domain again replaces ADP with ATP, triggering a conformational change that pulls the first domain forward.
Changed!
smart KISc 337aa 1e-109
in ref transcript
Kinesin motor, catalytic domain. ATPase. Microtubule-dependent molecular motors that play important roles in intracellular transport of organelles and in cell division.
Changed!
COG KIP1 327aa 7e-51
in ref transcript
Kinesin-like protein [Cytoskeleton].
Changed!
cd KISc_KIF2_like 330aa 1e-142
in modified transcript
Changed!
smart KISc 336aa 1e-109
in modified transcript
Changed!
COG KIP1 344aa 5e-51
in modified transcript
SYNE2
FOX.SYNE2.F1 FOX.SYNE2.R1 266 335
NCBIGene 36.3 23224
Single exon skipping, size difference: 69
Exclusion in the protein (no frameshift)
Reference transcript: NM_182914
cd SPEC 214aa 1e-12
in ref transcript
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here.
cd CH 103aa 5e-12
in ref transcript
Calponin homology domain; actin-binding domain which may be present as a single copy or in tandem repeats (which increases binding affinity). The CH domain is found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).
cd SPEC 215aa 1e-10
in ref transcript
cd CH 103aa 6e-10
in ref transcript
cd SPEC 214aa 4e-09
in ref transcript
cd SPEC 221aa 8e-08
in ref transcript
cd SPEC 216aa 1e-07
in ref transcript
cd SPEC 207aa 2e-05
in ref transcript
pfam KASH 60aa 7e-14
in ref transcript
Nuclear envelope localisation domain. The KASH (for Klarsicht/ANC-1/Syne-1 homology) or KLS domain is a highly hydrophobic nuclear envelope localisation domain of approximately 60 amino acids comprising an 20-amino-acid transmembrane region and a 30-35-residue C-terminal region that lies between the inner and the outer nuclear membranes.
pfam CH 99aa 2e-13
in ref transcript
Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is not involved in its actin binding activity. Most proteins have two copies of the CH domain, however some proteins such as calponin have only a single copy.
smart CH 102aa 2e-13
in ref transcript
Calponin homology domain. Actin binding domains present in duplicate at the N-termini of spectrin-like proteins (including dystrophin, alpha-actinin). These domains cross-link actin filaments into bundles and networks. A calponin homology domain is predicted in yeasst Cdc24p.
pfam SMC_N 302aa 1e-06
in ref transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
TIGR SMC_prok_B 305aa 4e-05
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
smart SPEC 102aa 9e-05
in ref transcript
Spectrin repeats.
smart SPEC 104aa 3e-04
in ref transcript
TIGR SMC_prok_A 303aa 3e-04
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
TIGR SMC_prok_A 375aa 4e-04
in ref transcript
pfam SMC_N 281aa 0.002
in ref transcript
COG SAC6 255aa 3e-22
in ref transcript
Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton].
COG Smc 266aa 3e-04
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
GDNF
GDNF.u.f.4 refseq_GDNF.R2 190 268
NCBIGene 36.3 2668
Alternative 5-prime, size difference: 78
Exclusion in the protein (no frameshift)
Reference transcript: NM_000514
pfam TGF_beta 94aa 8e-15
in ref transcript
Transforming growth factor beta like domain.
GNB3
GNB3.F12 GNB3.R2 177 418
AceView 36.Apr07 GNB3
Intron retention, size difference: 241
Inclusion in the protein causing a new stop codon
Reference transcript: GNB3.aApr07
Changed!
cd WD40 293aa 1e-65
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Changed!
smart WD40 39aa 3e-05
in ref transcript
WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Changed!
smart WD40 35aa 2e-04
in ref transcript
Changed!
pfam WD40 39aa 3e-04
in ref transcript
WD domain, G-beta repeat.
Changed!
pfam WD40 39aa 0.002
in ref transcript
Changed!
COG COG2319 297aa 1e-33
in ref transcript
FOG: WD40 repeat [General function prediction only].
HMGA1
HMGA1.au.0.f HMGA1.u.r.18 268 334
AceView 36.Apr07 HMGA1
Exon skipping and alternative 3-prime or 5-prime, size difference: 71
Inclusion in the protein causing a frameshift, Inclusion in the protein (no stop codon or frameshift)
Reference transcript: HMGA1.aApr07
HMMR
HMMR.F1 HMMR.R1 144 189
NCBIGene 36.3 3161
Single exon skipping, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_012484
pfam SMC_N 239aa 3e-05
in ref transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Changed!
TIGR SMC_prok_A 213aa 9e-05
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
COG Smc 236aa 4e-05
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
PRK PRK05771 196aa 0.002
in ref transcript
V-type ATP synthase subunit I; Validated.
Changed!
TIGR SMC_prok_A 223aa 3e-04
in modified transcript
HNRPAB
HNRPAB-F1 HNRPAB-R1 269 416
NCBIGene 36.3 3182
Single exon skipping, size difference: 141
Exclusion in the protein (no frameshift)
Reference transcript: NM_031266
cd RRM 70aa 6e-14
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
cd RRM 48aa 4e-10
in ref transcript
TIGR SF-CC1 137aa 2e-18
in ref transcript
A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.
pfam CBFNT 71aa 6e-06
in ref transcript
CBFNT (NUC161) domain. This N terminal domain is found in proteins of CARG-binding factor A-like proteins.
COG COG0724 142aa 6e-08
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
CADM1
IGSF4.F6 IGSF4.R1 132 165
AceView 36.Apr07 CADM1
Single exon skipping, size difference: 33
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: CADM1.bApr07
cd IGcam 71aa 1e-07
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IG 72aa 0.004
in ref transcript
Immunoglobulin domain family; members are components of immunoglobulins, neuroglia, cell surface glycoproteins, such as, T-cell receptors, CD2, CD4, CD8, and membrane glycoproteins, such as, butyrophilin and chondroitin sulfate proteoglycan core protein. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
smart IG_like 80aa 3e-10
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
pfam C2-set_2 70aa 3e-08
in ref transcript
CD80-like C2-set immunoglobulin domain. These domains belong to the immunoglobulin superfamily.
smart IG_like 92aa 9e-08
in ref transcript
INSR
INSR.u.f.15 INSR.u.r.18 130 166
NCBIGene 36.3 3643
Single exon skipping, size difference: 36
Exclusion in the protein (no frameshift)
Reference transcript: NM_000208
cd PTKc_InsR 288aa 1e-179
in ref transcript
Catalytic Domain of the Protein Tyrosine Kinase, Insulin Receptor. Protein Tyrosine Kinase (PTK) family; Insulin Receptor (InsR); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. InsR is a receptor tyr kinase (RTK) that is composed of two alphabeta heterodimers. Binding of the insulin ligand to the extracellular alpha subunit activates the intracellular tyr kinase domain of the transmembrane beta subunit. Receptor activation leads to autophosphorylation, stimulating downstream kinase activities, which initiate signaling cascades and biological function. InsR signaling plays an important role in many cellular processes including glucose homeostasis, glycogen synthesis, lipid and protein metabolism, ion and amino acid transport, cell cycle and proliferation, cell differentiation, gene transcription, and nitric oxide synthesis. Insulin resistance, caused by abnormalities in InsR signaling, has been described in diabetes, hypertension, cardiovascular disease, metabolic syndrome, heart failure, and female infertility.
cd FN3 78aa 3e-07
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd FU 48aa 2e-06
in ref transcript
Furin-like repeats. Cysteine rich region. Exact function of the domain is not known. Furin is a serine-kinase dependent proprotein processor. Other members of this family include endoproteases and cell surface receptors.
cd FN3 45aa 7e-04
in ref transcript
pfam Pkinase_Tyr 268aa 1e-118
in ref transcript
Protein tyrosine kinase.
pfam Furin-like 162aa 5e-57
in ref transcript
Furin-like cysteine rich region.
pfam Recep_L_domain 114aa 1e-25
in ref transcript
Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain.
pfam Recep_L_domain 113aa 2e-25
in ref transcript
smart FN3 71aa 5e-05
in ref transcript
Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
COG SPS1 265aa 2e-18
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
KITLG
KITLG.u.f.9 KITLG.u.r.10 215 299
NCBIGene 36.3 4254
Single exon skipping, size difference: 84
Exclusion in the protein (no frameshift)
Reference transcript: NM_000899
Changed!
pfam SCF 273aa 1e-134
in ref transcript
Stem cell factor. Stem cell factor (SCF) is a homodimer involved in hematopoiesis. SCF binds to and activates the SCF receptor (SCFR), a receptor tyrosine kinase. The crystal structure of human SCF has been resolved and a potential receptor-binding site identified.
Changed!
pfam SCF 245aa 1e-121
in modified transcript
LGALS9
LGALS9.F10 LGALS9.R6 135 231
NCBIGene 36.3 3965
Single exon skipping, size difference: 96
Exclusion in the protein (no frameshift)
Reference transcript: NM_009587
cd GLECT 131aa 2e-36
in ref transcript
Galectin/galactose-binding lectin. This domain exclusively binds beta-galactosides, such as lactose, and does not require metal ions for activity. GLECT domains occur as homodimers or tandemly repeated domains. They are developmentally regulated and may be involved in differentiation, cell-cell interaction and cellular regulation.
cd GLECT 128aa 5e-33
in ref transcript
smart GLECT 131aa 4e-40
in ref transcript
Galectin. Galectin - galactose-binding lectin.
pfam Gal-bind_lectin 128aa 3e-35
in ref transcript
Galactoside-binding lectin. This family contains galactoside binding lectins. The family also includes enzymes such as human eosinophil lysophospholipase (EC:3.1.1.5).
LIG3
LIG3.u.f.1 LIG3.u.r.6 148 378
AceView 36.Apr07 LIG3
Alternative 3-prime, size difference: 230
Exclusion of the protein initiation site
Reference transcript: LIG3.aApr07
TIGR dnl1 513aa 0.0
in ref transcript
All proteins in this family with known functions are ATP-dependent DNA ligases. Functions include DNA repair, DNA replication, and DNA recombination (or any process requiring ligation of two single-stranded DNA sections). This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
pfam zf-PARP 57aa 4e-16
in ref transcript
Poly(ADP-ribose) polymerase and DNA-Ligase Zn-finger region. Poly(ADP-ribose) polymerase is an important regulatory component of the cellular response to DNA damage. The amino-terminal region of Poly(ADP-ribose) polymerase consists of two PARP-type zinc fingers. This region acts as a DNA nick sensor.
smart BRCT 68aa 5e-05
in ref transcript
breast cancer carboxy-terminal domain.
PRK PRK01109 556aa 6e-74
in ref transcript
ATP-dependent DNA ligase; Provisional.
LIG4
LIG4.F1 LIG4.R1 267 340
AceView 36.Apr07 LIG4
Single exon skipping, size difference: 73
Exclusion in 5'UTR
Reference transcript: LIG4.bApr07
cd BRCT 71aa 7e-06
in ref transcript
Breast Cancer Suppressor Protein (BRCA1), carboxy-terminal domain. The BRCT domain is found within many DNA damage repair and cell cycle checkpoint proteins. The unique diversity of this domain superfamily allows BRCT modules to interact forming homo/hetero BRCT multimers, BRCT-non-BRCT interactions, and interactions within DNA strand breaks.
TIGR dnl1 522aa 1e-171
in ref transcript
All proteins in this family with known functions are ATP-dependent DNA ligases. Functions include DNA repair, DNA replication, and DNA recombination (or any process requiring ligation of two single-stranded DNA sections). This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
smart BRCT 77aa 2e-07
in ref transcript
breast cancer carboxy-terminal domain.
pfam BRCT 70aa 0.003
in ref transcript
BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not been included in this family. The BRCT domain of XRCC1 forms a homodimer in the crystal structure. This suggests that pairs of BRCT domains associate as homo- or heterodimers.
PRK PRK01109 574aa 6e-59
in ref transcript
ATP-dependent DNA ligase; Provisional.
LRDD
LRDD.u.f.7 LRDD.u.r.3 121 181
NCBIGene 36.3 55367
Alternative 3-prime, size difference: 60
Inclusion in the protein causing a new stop codon
Reference transcript: NM_145886
Changed!
pfam Death 79aa 3e-09
in ref transcript
Death domain.
Changed!
pfam Peptidase_S68 34aa 2e-08
in ref transcript
Peptidase S68. This family of serine peptidases contains PIDD proteins. PIDD forms a complex with RAIDD and procaspase-2 that is known as the 'PIDDosome'. The PIDDosome forms when DNA damage occurs and either activates NF-kappaB, leading to cell survival, or caspase-2, which leads to apoptosis.
Changed!
smart ZU5 63aa 9e-04
in ref transcript
Domain present in ZO-1 and Unc5-like netrin receptors. Domain of unknown function.
Changed!
COG COG4886 96aa 3e-07
in ref transcript
Leucine-rich repeat (LRR) protein [Function unknown].
MAPT
MAPT_x56_x127_f MAPT_x56_x127_r 162 360
NCBIGene 36.3 4137
Single exon skipping, size difference: 198
Exclusion in the protein (no frameshift)
Reference transcript: NM_016835
pfam Tubulin-binding 31aa 8e-10
in ref transcript
Tau and MAP protein, tubulin-binding repeat. This family includes the vertebrate proteins MAP2, MAP4 and Tau, as well as other animal homologs. MAP4 is present in many tissues but is usually absent from neurons; MAP2 and Tau are mainly neuronal. Members of this family have the ability to bind to and stabilise microtubules. As a result, they are involved in neuronal migration, supporting dendrite elongation, and regulating microtubules during mitotic metaphase. Note that Tau is involved in neurofibrillary tangle formation in Alzheimer's disease and some other dementias. This family features a C-terminal microtubule binding repeat that contains a conserved KXGS motif.
pfam Tubulin-binding 31aa 2e-08
in ref transcript
pfam Tubulin-binding 32aa 2e-06
in ref transcript
pfam Tubulin-binding 32aa 3e-06
in ref transcript
pfam Hid1 123aa 3e-04
in ref transcript
High-temperature-induced dauer-formation protein. Hid1 (high-temperature-induced dauer-formation protein 1) represents proteins of approximately 800 residues long and is conserved from fungi to humans. It contains up to seven potential transmembrane domains separated by regions of low complexity. Functionally it might be involved in vesicle secretion or be an inter-cellular signalling protein or be a novel insulin receptor.
MCL1
MCL1.F1 MCL1.R1 134 382
NCBIGene 36.3 4170
Single exon skipping, size difference: 248
Exclusion in the protein causing a frameshift
Reference transcript: NM_021960
Changed!
cd Bcl-2_like 144aa 7e-37
in ref transcript
Apoptosis regulator proteins of the Bcl-2 family, named after B-cell lymphoma 2. This alignment model spans what have been described as Bcl-2 homology regions BH1, BH2, BH3, and BH4. Many members of this family have an additional C-terminal transmembrane segment. Some homologous proteins, which are not included in this model, may miss either the BH4 (Bax, Bak) or the BH2 (Bcl-X(S)) region, and some appear to only share the BH3 region (Bik, Bim, Bad, Bid, Egl-1). This family is involved in the regulation of the outer mitochondrial membrane's permeability and in promoting or preventing the release of apoptogenic factors, which in turn may trigger apoptosis by activating caspases. Bcl-2 and the closely related Bcl-X(L) are anti-apoptotic key regulators of programmed cell death. They are assumed to function via heterodimeric protein-protein interactions, binding pro-apoptotic proteins such as Bad (BCL2-antagonist of cell death), Bid, and Bim, by specifically interacting with their BH3 regions. Interfering with this heterodimeric interaction via small-molecule inhibitors may prove effective in targeting various cancers. This family also includes the Caenorhabditis elegans Bcl-2 homolog CED-9, which binds to CED-4, the C. Elegans homolog of mammalian Apaf-1. Apaf-1, however, does not seem to be inhibited by Bcl-2 directly.
Changed!
pfam Bcl-2 100aa 5e-32
in ref transcript
Apoptosis regulator proteins, Bcl-2 family.
Changed!
cd Bcl-2_like 56aa 2e-04
in modified transcript
NOTCH3
NOTCH3.F2 NOTCH3.R2 286 442
AceView 36.Apr07 NOTCH3
Single exon skipping, size difference: 156
Exclusion in the protein (no frameshift)
Reference transcript: NOTCH3.aApr07
cd ANK 127aa 4e-24
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
cd ANK 148aa 8e-07
in ref transcript
cd EGF_CA 38aa 1e-06
in ref transcript
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Changed!
cd EGF_CA 38aa 8e-05
in ref transcript
cd EGF_CA 37aa 3e-04
in ref transcript
cd EGF_CA 39aa 4e-04
in ref transcript
Changed!
cd EGF_CA 36aa 6e-04
in ref transcript
cd EGF_CA 33aa 0.001
in ref transcript
cd EGF_CA 37aa 0.002
in ref transcript
cd EGF_CA 38aa 0.002
in ref transcript
cd EGF_CA 37aa 0.003
in ref transcript
cd EGF_CA 32aa 0.003
in ref transcript
cd EGF_CA 37aa 0.005
in ref transcript
cd EGF_CA 36aa 0.008
in ref transcript
pfam NODP 55aa 4e-14
in ref transcript
NOTCH protein. NOTCH signalling plays a fundamental role during a great number of developmental processes in multicellular animals. NOD and NODP represent a region present in many NOTCH proteins and NOTCH homologs in multiple species such as NOTCH2 and NOTCH3, LIN12, SC1 and TAN1. The role of the NOD and NODP domains remains to be elucidated.
pfam NOD 55aa 2e-11
in ref transcript
NOTCH protein. NOTCH signalling plays a fundamental role during a great number of developmental processes in multicellular animals. NOD and NODP represent a region present in many NOTCH proteins and NOTCH homologs in multiple species such as NOTCH2 and NOTCH3, LIN12, SC1 and TAN1. Role of NOD domain remains to be elucidated.
smart EGF_CA 38aa 2e-07
in ref transcript
Calcium-binding EGF-like domain.
pfam Notch 33aa 5e-06
in ref transcript
LNR domain. The LNR (Lin-12/Notch repeat) domain is found in three tandem copies in Notch related proteins. The structure of the domain has been determined by NMR and was shown to contain three disulphide bonds and coordinate a calcium ion. Three repeats are also found in the PAPP-A peptidase.
smart NL 38aa 5e-06
in ref transcript
Domain found in Notch and Lin-12. The Notch protein is essential for the proper differentiation of the Drosophila ectoderm. This protein contains 3 NL domains.
pfam Notch 29aa 2e-05
in ref transcript
Changed!
smart EGF_CA 38aa 8e-05
in ref transcript
smart EGF_CA 39aa 8e-05
in ref transcript
smart EGF_CA 37aa 1e-04
in ref transcript
TIGR trp 139aa 1e-04
in ref transcript
after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
Changed!
smart EGF_CA 36aa 3e-04
in ref transcript
smart EGF_CA 37aa 3e-04
in ref transcript
smart EGF_CA 37aa 0.001
in ref transcript
smart EGF_CA 37aa 0.001
in ref transcript
smart EGF_CA 33aa 0.001
in ref transcript
smart EGF_CA 38aa 0.002
in ref transcript
smart EGF_CA 36aa 0.003
in ref transcript
smart EGF_CA 32aa 0.004
in ref transcript
COG Arp 154aa 5e-17
in ref transcript
FOG: Ankyrin repeat [General function prediction only].
NRG1
NRG1.u.f.21 NRG1.u.r.25 372 396
NCBIGene 36.3 3084
Single exon skipping, size difference: 24
Exclusion in the protein (no frameshift)
Reference transcript: NM_013956
cd IGcam 89aa 2e-05
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
pfam Neuregulin 396aa 1e-109
in ref transcript
Neuregulin family.
smart IG_like 87aa 1e-07
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
pfam EGF 32aa 0.006
in ref transcript
EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.
NUP98
NUP98.F1 NUP98.R1 260 482
NCBIGene 36.3 4928
Single exon skipping, size difference: 222
Exclusion in the protein (no frameshift)
Reference transcript: NM_016320
pfam Nucleoporin2 156aa 2e-63
in ref transcript
Nucleoporin autopeptidase.
OPA1
OPA1.u.f.9 OPA1.u.r.8 143 254
NCBIGene 36.3 4976
Single exon skipping, size difference: 111
Exclusion in the protein (no frameshift)
Reference transcript: NM_130837
pfam Dynamin_N 179aa 4e-38
in ref transcript
Dynamin family.
PAXIP1
PAXIP1.F1 PAXIP1.R1 114 185
AceView 36.Apr07 PAXIP1
Alternative 5-prime, size difference: 71
Inclusion in the protein causing a new stop codon
Reference transcript: PAXIP1.aApr07
cd BRCT 68aa 6e-07
in ref transcript
Breast Cancer Suppressor Protein (BRCA1), carboxy-terminal domain. The BRCT domain is found within many DNA damage repair and cell cycle checkpoint proteins. The unique diversity of this domain superfamily allows BRCT modules to interact forming homo/hetero BRCT multimers, BRCT-non-BRCT interactions, and interactions within DNA strand breaks.
cd BRCT 68aa 7e-07
in ref transcript
cd BRCT 70aa 8e-07
in ref transcript
cd BRCT 75aa 4e-06
in ref transcript
pfam BRCT 73aa 4e-11
in ref transcript
BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not been included in this family. The BRCT domain of XRCC1 forms a homodimer in the crystal structure. This suggests that pairs of BRCT domains associate as homo- or heterodimers.
pfam BRCT 72aa 6e-10
in ref transcript
pfam BRCT 66aa 8e-07
in ref transcript
smart BRCT 81aa 3e-06
in ref transcript
breast cancer carboxy-terminal domain.
PCSK6
PCSK6.F2 PCSK6.R2 116 260
AceView 36.Apr07 PCSK6
Single exon skipping, size difference: 144
Exclusion in the protein (no frameshift)
Reference transcript: PCSK6.aApr07
cd FU 51aa 6e-07
in ref transcript
Furin-like repeats. Cysteine rich region. Exact function of the domain is not known. Furin is a serine-kinase dependent proprotein processor. Other members of this family include endoproteases and cell surface receptors.
cd FU 43aa 2e-06
in ref transcript
cd FU 47aa 3e-06
in ref transcript
cd FU 53aa 1e-05
in ref transcript
Changed!
pfam Peptidase_S8 301aa 2e-94
in ref transcript
Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold containing a 7-stranded parallel beta sheet, order 2314567.
pfam P_proprotein 91aa 4e-34
in ref transcript
Proprotein convertase P-domain. A unique feature of the eukaryotic subtilisin-like proprotein convertases is the presence of an additional highly conserved sequence of approximately 150 residues (P domain) located immediately downstream of the catalytic domain.
smart FU 48aa 2e-08
in ref transcript
Furin-like repeats.
smart FU 44aa 2e-08
in ref transcript
smart FU 44aa 1e-06
in ref transcript
pfam Furin-like 112aa 4e-06
in ref transcript
Furin-like cysteine rich region.
pfam VSP 115aa 5e-04
in ref transcript
Giardia variant-specific surface protein.
Changed!
COG AprE 324aa 1e-20
in ref transcript
Subtilisin-like serine proteases [Posttranslational modification, protein turnover, chaperones].
COG COG4935 91aa 1e-15
in ref transcript
Regulatory P domain of the subtilisin-like proprotein convertases and other proteases [Posttranslational modification, protein turnover, chaperones].
COG NapH 83aa 0.009
in ref transcript
Polyferredoxin [Energy production and conversion].
Changed!
pfam Peptidase_S8 261aa 2e-76
in modified transcript
Changed!
COG AprE 281aa 2e-13
in modified transcript
PLD1
PLD1.F1 PLD1.R1 124 238
AceView 36.Apr07 PLD1
Single exon skipping, size difference: 114
Exclusion in the protein (no frameshift)
Reference transcript: PLD1.aApr07
cd PLDc 142aa 1e-21
in ref transcript
Phospholipase D. Active site motifs; The PLD superfamily includes enzymes involved in signal transduction, lipid biosynthesis, endonucleases and open reading frames in pathogenic viruses and bacteria. PLD hydrolyzes the terminal phosphodiester bond of phospholipids to phosphatidic acid and a hydrophilic constituent. Phosphatidic acid is a compound that is heavily involved in signal transduction. The common features of the family members are that they can bind to a phosphodiester moiety, and that most of these enzymes are active as bi-lobed monomers or dimers.
cd PLDc 221aa 1e-15
in ref transcript
cd PH_PLD 62aa 8e-12
in ref transcript
Phospholipase D (PLD) pleckstrin homology (PH) domain. PLD hydrolyzes phosphatidylcholine to phosphatidic acid (PtdOH), which can bind target proteins. PLD contains a PH domain, a PX domain and four conserved PLD signature domains. The PLD PH domain is specific for bisphosphorylated inositides. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
pfam PLDc 28aa 0.001
in ref transcript
Phospholipase D Active site motif. Phosphatidylcholine-hydrolysing phospholipase D (PLD) isoforms are activated by ADP-ribosylation factors (ARFs). PLD produces phosphatidic acid from phosphatidylcholine, which may be essential for the formation of certain types of transport vesicles or may be constitutive vesicular transport to signal transduction pathways. PC-hydrolysing PLD is a homologue of cardiolipin synthase, phosphatidylserine synthase, bacterial PLDs, and viral proteins. Each of these appears to possess a domain duplication which is apparent by the presence of two motifs containing well-conserved histidine, lysine, and/or asparagine residues which may contribute to the active site. aspartic acid. An Escherichia coli endonuclease (nuc) and similar proteins appear to be PLD homologues but possess only one of these motifs. The profile contained here represents only the putative active site regions, since an accurate multiple alignment of the repeat units has not been achieved.
smart PLDc 27aa 0.002
in ref transcript
Phospholipase D. Active site motifs. Phosphatidylcholine-hydrolyzing phospholipase D (PLD) isoforms are activated by ADP-ribosylation factors (ARFs). PLD produces phosphatidic acid from phosphatidylcholine, which may be essential for the formation of certain types of transport vesicles or may be constitutive vesicular transport to signal transduction pathways. PC-hydrolysing PLD is a homologue of cardiolipin synthase, phosphatidylserine synthase, bacterial PLDs, and viral proteins. Each of these appears to possess a domain duplication which is apparent by the presence of two motifs containing well-conserved histidine, lysine, aspartic acid, and/or asparagine residues which may contribute to the active site. An E. coli endonuclease (nuc) and similar proteins appear to be PLD homologues but possess only one of these motifs. The profile contained here represents only the putative active site regions, since an accurate multiple alignment of the repeat units has not been achieved.
COG Cls 261aa 9e-15
in ref transcript
Phosphatidylserine/phosphatidylglycerophosphate/cardioli pin synthases and related enzymes [Lipid metabolism].
COG Cls 171aa 0.001
in ref transcript
POLB
POLB.F7 POLB.R13 213 271
AceView 36.Apr07 POLB
Single exon skipping, size difference: 58
Exclusion in the protein causing a frameshift
Reference transcript: POLB.bApr07
Changed!
cd POLXc 332aa 1e-104
in ref transcript
DNA polymerase X family; includes vertebrate DNA polymerase beta and terminal deoxynucleotidyltransferase. An N-terminal 8kD domain and a 31kD C-terminal polymerase domain are connected with a protease-sensitive hinge. The activity of the N-terminal domain seems to be variable, in DNA polymerase beta it has metal dependent nuclease activity and metal independent lyase activity.
Changed!
smart POLXc 325aa 1e-103
in ref transcript
DNA polymerase X family. includes vertebrate polymerase beta and terminal deoxynucleotidyltransferases.
Changed!
COG POL4 299aa 5e-28
in ref transcript
DNA polymerase IV (family X) [DNA replication, recombination, and repair].
POLI
POLI.F8 POLI.R14 197 303
AceView 36.Apr07 POLI
Intron retention, size difference: 106
Inclusion in 5'UTR
Reference transcript: POLI.dApr07
cd Pol_iota 33aa 2e-07
in ref transcript
Pol iota is member of the DNA polymerase Y-family, and has also been called Rad30 homolog B. Unlike classic DNA polymerases,Y-family polymerases are induced by DNA damage. They can transverse normal replication-blocking DNA lesions. Unlike Pol eta, Pol iota is unable to replicate through a cis-syn T-T dimer. In human Pol iota, the base-pairing mode in the active site at the replicative end mat bee Hoogsteen instead of Watson-Click. Human Pol iota can incorporate the correct nucleotide opposite a purine much more efficiently than opposite a pyrimidine. Pol iota prefers to insert Guanosine instead of Adenosine opposite Thymidine.
TIGR dmsA_ynfE 133aa 0.002
in ref transcript
Members of this family include known and probable dimethyl sulfoxide reductase (DMSO reductase) A chains. In E. coli, dmsA encodes the canonical anaerobic DMSO reductase A chain. The paralog ynfE, as part of ynfFGH expressed from a multicopy plasmid, could complement a dmsABC deletion, suggesting a similar function and some overlap in specificity, although YnfE could not substitute for DmsA in a mixed complex.
POLM
POLM.u.f.1 POLM.u.r.6 316 586
AceView 36.Apr07 POLM
Multiple exon skipping, size difference: 270
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: POLM.aApr07
cd POLXc 118aa 1e-17
in ref transcript
DNA polymerase X family; includes vertebrate DNA polymerase beta and terminal deoxynucleotidyltransferase. An N-terminal 8kD domain and a 31kD C-terminal polymerase domain are connected with a protease-sensitive hinge. The activity of the N-terminal domain seems to be variable, in DNA polymerase beta it has metal dependent nuclease activity and metal independent lyase activity.
Changed!
cd POLXc 77aa 3e-10
in ref transcript
smart POLXc 105aa 2e-14
in ref transcript
DNA polymerase X family. includes vertebrate polymerase beta and terminal deoxynucleotidyltransferases.
Changed!
smart POLXc 49aa 3e-07
in ref transcript
COG POL4 48aa 1e-05
in ref transcript
DNA polymerase IV (family X) [DNA replication, recombination, and repair].
PPP3CB
PPP3CB.u.f.21 PPP3CB.u.r.22 117 147
AceView 36.Apr07 PPP3CB
Single exon skipping, size difference: 30
Exclusion in the protein (no frameshift)
Reference transcript: PPP3CB.aApr07
cd PP2Ac 278aa 1e-100
in ref transcript
Protein phosphatase 2A homologues, catalytic domain. Large family of serine/threonine phosphatases, including PP1, PP2A and PP2B (calcineurin) family members.
smart PP2Ac 272aa 1e-112
in ref transcript
Protein phosphatase 2A homologues, catalytic domain. Large family of serine/threonine phosphatases, that includes PP1, PP2A and PP2B (calcineurin) family members.
PTZ PTZ00239 278aa 6e-60
in ref transcript
serine/threonine protein phosphatase 2A; Provisional.
PTK2B
PTK2B_x66_x147_f PTK2B_x66_x147_r 170 296
NCBIGene 36.3 2185
Single exon skipping, size difference: 126
Exclusion in the protein (no frameshift)
Reference transcript: NM_173174
cd PTKc_FAK 270aa 1e-146
in ref transcript
Catalytic Domain of the Protein Tyrosine Kinase, Focal Adhesion Kinase. Protein Tyrosine Kinase (PTK) family; Focal Adhesion kinase (FAK); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. FAK is a cytoplasmic (or nonreceptor) tyr kinase that contains an autophosphorylation site and a FERM domain at the N-terminus, a central tyr kinase domain, proline-rich regions, and a C-terminal FAT (focal adhesion targeting) domain. FAK activity is dependent on integrin-mediated cell adhesion, which facilitates N-terminal autophosphorylation. Full activation is achieved by the phosphorylation of its two adjacent A-loop tyrosines. FAK is important in mediating signaling initiated at sites of cell adhesions and at growth factor receptors. Through diverse molecular interactions, FAK functions as a biosensor or integrator to control cell motility. It is a key regulator of cell survival, proliferation, migration and invasion, and thus plays an important role in the development and progression of cancer. Src binds to autophosphorylated FAK forming the FAK-Src dual kinase complex, which is activated in a wide variety of tumor cells and generates signals promoting growth and metastasis. FAK is being developed as a target for cancer therapy.
pfam Pkinase_Tyr 233aa 5e-98
in ref transcript
Protein tyrosine kinase.
pfam Focal_AT 139aa 6e-59
in ref transcript
Focal adhesion targeting region. Focal adhesion kinase (FAK) is a tyrosine kinase found in focal adhesions, intracellular signaling complexes that are formed following engagement of the extracellular matrix by integrins. The C-terminal 'focal adhesion targeting' (FAT) region is necessary and sufficient for localising FAK to focal adhesions. The crystal structure of FAT shows it forms a four-helix bundle that resembles those found in two other proteins involved in cell adhesion, alpha-catenin and vinculin. The binding of FAT to the focal adhesion protein, paxillin, requires the integrity of the helical bundle, whereas binding to another focal adhesion protein, talin, does not.
smart B41 227aa 2e-31
in ref transcript
Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
COG SPS1 229aa 2e-16
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
PTPN13
PTPN13.F7 PTPN13.R7 133 190
NCBIGene 36.3 5783
Single exon skipping, size difference: 57
Exclusion in the protein (no frameshift)
Reference transcript: NM_080685
cd PTPc 228aa 3e-79
in ref transcript
Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
cd PDZ_signaling 85aa 7e-16
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
cd PDZ_signaling 80aa 1e-15
in ref transcript
cd PDZ_signaling 90aa 2e-13
in ref transcript
cd FERM_C 94aa 2e-13
in ref transcript
The FERM_C domain is the third structural domain within the FERM domain. The FERM domain is found in the cytoskeletal-associated proteins such as ezrin, moesin, radixin, 4.1R, and merlin. These proteins provide a link between the membrane and cytoskeleton and are involved in signal transduction pathways. The FERM_C domain is also found in protein tyrosine phosphatases (PTPs) , the tryosine kinases FAKand JAK, in addition to other proteins involved in signaling. This domain is structuraly similar to the PH and PTB domains and consequently is capable of binding to both peptides and phospholipids at different sites.
cd PDZ_signaling 79aa 2e-08
in ref transcript
cd PDZ_signaling 88aa 6e-08
in ref transcript
smart PTPc 253aa 5e-91
in ref transcript
Protein tyrosine phosphatase, catalytic domain.
smart KIND 188aa 2e-42
in ref transcript
kinase non-catalytic C-lobe domain. It is an interaction domain identified as being similar to the C-terminal protein kinase catalytic fold (C lobe). Its presence at the N terminus of signalling proteins and the absence of the active-site residues in the catalytic and activation loops suggest that it folds independently and is likely to be non-catalytic. The occurrence of KIND only in metazoa implies that it has evolved from the catalytic protein kinase domain into an interaction domain possibly by keeping the substrate-binding features.
smart B41 210aa 4e-40
in ref transcript
Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
pfam FERM_C 88aa 2e-17
in ref transcript
FERM C-terminal PH-like domain.
smart PDZ 90aa 1e-16
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
smart PDZ 92aa 2e-15
in ref transcript
smart PDZ 83aa 1e-13
in ref transcript
smart PDZ 92aa 2e-11
in ref transcript
smart PDZ 84aa 9e-08
in ref transcript
COG PTP2 267aa 1e-40
in ref transcript
Protein tyrosine phosphatase [Signal transduction mechanisms].
Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
cd FN3 83aa 2e-05
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd FN3 89aa 7e-05
in ref transcript
cd FN3 87aa 2e-04
in ref transcript
cd FN3 76aa 3e-04
in ref transcript
cd RICIN 73aa 0.003
in ref transcript
Ricin-type beta-trefoil; Carbohydrate-binding domain formed from presumed gene triplication. The domain is found in a variety of molecules serving diverse functions such as enzymatic activity, inhibitory toxicity and signal transduction. Highly specific ligand binding occurs on exposed surfaces of the compact domain sturcture.
smart PTPc 259aa 1e-100
in ref transcript
Protein tyrosine phosphatase, catalytic domain.
pfam fn3 84aa 6e-11
in ref transcript
Fibronectin type III domain.
pfam fn3 78aa 5e-09
in ref transcript
pfam fn3 81aa 1e-08
in ref transcript
pfam fn3 78aa 3e-08
in ref transcript
pfam fn3 75aa 3e-08
in ref transcript
pfam fn3 76aa 3e-07
in ref transcript
pfam fn3 76aa 9e-07
in ref transcript
pfam fn3 75aa 1e-06
in ref transcript
pfam fn3 78aa 4e-06
in ref transcript
pfam fn3 76aa 1e-05
in ref transcript
pfam fn3 76aa 2e-05
in ref transcript
pfam fn3 82aa 3e-05
in ref transcript
Changed!
pfam fn3 63aa 4e-05
in ref transcript
pfam fn3 68aa 0.003
in ref transcript
pfam fn3 65aa 0.004
in ref transcript
pfam Ricin_B_lectin 77aa 0.006
in ref transcript
Ricin-type beta-trefoil lectin domain.
COG PTP2 266aa 5e-48
in ref transcript
Protein tyrosine phosphatase [Signal transduction mechanisms].
RAC1
RAC1.F18 rs.RAC1.R1 328 386
NCBIGene 36.3 5879
Single exon skipping, size difference: 58
Inclusion in the protein causing a new stop codon
Reference transcript: NM_018890
Changed!
cd Rac1_like 192aa 1e-107
in ref transcript
Rac1-like subfamily. The Rac1-like subfamily consists of Rac1, Rac2, and Rac3 proteins, plus the splice variant Rac1b that contains a 19-residue insertion near switch II relative to Rac1. While Rac1 is ubiquitously expressed, Rac2 and Rac3 are largely restricted to hematopoietic and neural tissues respectively. Rac1 stimulates the formation of actin lamellipodia and membrane ruffles. It also plays a role in cell-matrix adhesion and cell anoikis. In intestinal epithelial cells, Rac1 is an important regulator of migration and mediates apoptosis. Rac1 is also essential for RhoA-regulated actin stress fiber and focal adhesion complex formation. In leukocytes, Rac1 and Rac2 have distinct roles in regulating cell morphology, migration, and invasion, but are not essential for macrophage migration or chemotaxis. Rac3 has biochemical properties that are closely related to Rac1, such as effector interaction, nucleotide binding, and hydrolysis; Rac2 has a slower nucleotide association and is more efficiently activated by the RacGEF Tiam1. Both Rac1 and Rac3 have been implicated in the regulation of cell migration and invasion in human metastatic breast cancer. Most Rho proteins contain a lipid modification site at the C-terminus, with a typical sequence motif CaaX, where a = an aliphatic amino acid and X = any amino acid. Lipid binding is essential for membrane attachment, a key feature of most Rho proteins. Due to the presence of truncated sequences in this CD, the lipid modification site is not available for annotation.
Changed!
smart RHO 189aa 1e-98
in ref transcript
Rho (Ras homology) subfamily of Ras-like small GTPases. Members of this subfamily of Ras-like small GTPases include Cdc42 and Rac, as well as Rho isoforms.
Changed!
COG COG1100 193aa 1e-27
in ref transcript
GTPase SAR1 and related small G proteins [General function prediction only].
CLIP1
RSN.F10 RSN.R9 155 500
AceView 36.Apr07 CLIP1
Exon skipping and alternative 3-prime or 5-prime, size difference: 345
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: CLIP1.aApr07
pfam CAP_GLY 66aa 6e-27
in ref transcript
CAP-Gly domain. Cytoskeleton-associated proteins (CAPs) are involved in the organisation of microtubules and transportation of vesicles and organelles along the cytoskeletal network. A conserved motif, CAP-Gly, has been identified in a number of CAPs, including CLIP-170 and dynactins. The crystal structure of Caenorhabditis elegans F53F4.3 protein CAP-Gly domain was recently solved. The domain contains three beta-strands. The most conserved sequence, GKNDG, is located in two consecutive sharp turns on the surface, forming the entrance to a groove.
pfam CAP_GLY 66aa 6e-26
in ref transcript
Changed!
TIGR SMC_prok_A 315aa 1e-13
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
TIGR SMC_prok_B 189aa 2e-07
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
Changed!
pfam SMC_N 386aa 5e-05
in ref transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
COG NIP100 61aa 2e-11
in ref transcript
Dynactin complex subunit involved in mitotic spindle partitioning in anaphase B [Cell division and chromosome partitioning].
Changed!
COG Smc 306aa 4e-11
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
COG NIP100 62aa 2e-10
in ref transcript
Changed!
PRK PRK03918 589aa 1e-06
in ref transcript
chromosome segregation protein; Provisional.
Changed!
TIGR SMC_prok_B 231aa 5e-07
in modified transcript
Changed!
TIGR SMC_prok_B 360aa 6e-06
in modified transcript
Changed!
COG Smc 191aa 9e-04
in modified transcript
Changed!
PRK PRK03918 512aa 0.001
in modified transcript
CLIP1
RSN.F35 RSN.R13 248 281
AceView 36.Apr07 CLIP1
Single exon skipping, size difference: 33
Exclusion in the protein (no frameshift)
Reference transcript: CLIP1.aApr07
pfam CAP_GLY 66aa 6e-27
in ref transcript
CAP-Gly domain. Cytoskeleton-associated proteins (CAPs) are involved in the organisation of microtubules and transportation of vesicles and organelles along the cytoskeletal network. A conserved motif, CAP-Gly, has been identified in a number of CAPs, including CLIP-170 and dynactins. The crystal structure of Caenorhabditis elegans F53F4.3 protein CAP-Gly domain was recently solved. The domain contains three beta-strands. The most conserved sequence, GKNDG, is located in two consecutive sharp turns on the surface, forming the entrance to a groove.
pfam CAP_GLY 66aa 6e-26
in ref transcript
TIGR SMC_prok_A 315aa 1e-13
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
Changed!
TIGR SMC_prok_B 189aa 2e-07
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
Changed!
pfam SMC_N 386aa 5e-05
in ref transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
COG NIP100 61aa 2e-11
in ref transcript
Dynactin complex subunit involved in mitotic spindle partitioning in anaphase B [Cell division and chromosome partitioning].
COG Smc 306aa 4e-11
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
COG NIP100 62aa 2e-10
in ref transcript
Changed!
PRK PRK03918 589aa 1e-06
in ref transcript
chromosome segregation protein; Provisional.
Changed!
TIGR SMC_prok_B 178aa 8e-08
in modified transcript
Changed!
TIGR SMC_prok_B 349aa 2e-07
in modified transcript
Changed!
PRK PRK03918 578aa 5e-07
in modified transcript
RUNX2
RUNX2.F1 RUNX2.R1 169 235
NCBIGene 36.3 860
Single exon skipping, size difference: 66
Exclusion in the protein (no frameshift)
Reference transcript: NM_001024630
pfam Runt 134aa 9e-74
in ref transcript
Runt domain.
pfam RunxI 78aa 2e-32
in ref transcript
Runx inhibition domain. This domain lies to the C-terminus of Runx-related transcription factors and homologous proteins (AML, CBF-alpha, PEBP2). Its function might be to interact with functional cofactors.
SDCCAG8
SDCCAG8-F1 SDCCAG8-R1 180 417
AceView 36.Apr07 SDCCAG8
Multiple exon skipping, size difference: 237
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: SDCCAG8.aApr07
Changed!
TIGR SMC_prok_B 270aa 2e-06
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
Changed!
TIGR SMC_prok_B 306aa 5e-06
in ref transcript
Changed!
COG Smc 382aa 4e-07
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
Changed!
TIGR SMC_prok_B 283aa 2e-10
in modified transcript
Changed!
COG Smc 300aa 6e-07
in modified transcript
SHC1
SHC1.F7 SHC1.R7 111 165
AceView 36.Apr07 SHC1
Single exon skipping, size difference: 54
Exclusion in the protein (no frameshift)
Reference transcript: SHC1.aApr07
Changed!
cd SHC 166aa 3e-73
in ref transcript
SHC phosphotyrosine-binding (PTB) domain. SHC is a substrate for receptor tyrosine kinases, which can interact with phosphoproteins at NPXY motifs. SHC contains an PTB domain followed by an SH2 domain. PTB domains have a PH-like fold and are found in various eukaryotic signaling molecules. They were initially identified based upon their ability to recognize phosphorylated tyrosine residues In contrast to SH2 domains, which recognize phosphotyrosine and adjacent carboxy-terminal residues, PTB-domain binding specificity is conferred by residues amino-terminal to the phosphotyrosine. More recent studies have found that some types of PTB domains can bind to peptides which are not tyrosine phosphorylated or lack tyrosine residues altogether.
cd SH2 92aa 7e-13
in ref transcript
Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
Changed!
pfam PID 156aa 6e-39
in ref transcript
Phosphotyrosine interaction domain (PTB/PID).
pfam SH2 45aa 4e-14
in ref transcript
SH2 domain.
Changed!
cd SHC 148aa 2e-59
in modified transcript
Changed!
pfam PID 138aa 2e-26
in modified transcript
SHMT1
SHMT1.u.f.24 SHMT1.u.r.18 108 225
NCBIGene 36.3 6470
Single exon skipping, size difference: 117
Exclusion in the protein (no frameshift)
Reference transcript: NM_004169
Changed!
cd SHMT 427aa 0.0
in ref transcript
Serine-glycine hydroxymethyltransferase (SHMT). This family belongs to pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I). SHMT carries out interconversion of serine and glycine; it catalyzes the transfer of hydroxymethyl group of N5, N10-methylene tetrahydrofolate to glycine resulting in the formation of serine and tetrahydrofolate. Both eukaryotic and prokaryotic SHMT enzymes form tight obligate homodimers; the mammalian enzyme forms a homotetramer comprising four pyridoxal phosphate-bound active sites.
Changed!
pfam SHMT 400aa 0.0
in ref transcript
Serine hydroxymethyltransferase.
Changed!
PTZ PTZ00094 457aa 0.0
in ref transcript
serine hydroxymethyltransferase; Provisional.
Changed!
cd SHMT 388aa 1e-167
in modified transcript
Changed!
pfam SHMT 361aa 1e-178
in modified transcript
Changed!
PTZ PTZ00094 418aa 0.0
in modified transcript
SLIT2
SLIT2.F36 SLIT2.R4 222 234
AceView 36.Apr07 SLIT2
Single exon skipping, size difference: 12
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: SLIT2.aApr07
cd LamG 154aa 3e-26
in ref transcript
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
cd EGF_CA 37aa 1e-04
in ref transcript
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
cd EGF_CA 37aa 0.002
in ref transcript
smart LamG 137aa 5e-33
in ref transcript
Laminin G domain.
TIGR PCC 77aa 4e-10
in ref transcript
Note: this model is restricted to the amino half because a full-length model is incompatible with the HMM software package.
TIGR PCC 156aa 1e-09
in ref transcript
Changed!
TIGR PCC 63aa 3e-05
in ref transcript
smart EGF_CA 37aa 3e-04
in ref transcript
Calcium-binding EGF-like domain.
TIGR PCC 64aa 4e-04
in ref transcript
smart LRRNT 32aa 8e-04
in ref transcript
Leucine rich repeat N-terminal domain.
smart LRRNT 32aa 0.001
in ref transcript
smart LRRNT 33aa 0.003
in ref transcript
pfam EGF 32aa 0.003
in ref transcript
EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.
COG COG4886 164aa 2e-07
in ref transcript
Leucine-rich repeat (LRR) protein [Function unknown].
COG COG4886 156aa 6e-06
in ref transcript
Changed!
COG COG4886 230aa 3e-04
in ref transcript
COG COG4886 286aa 6e-04
in ref transcript
Changed!
TIGR PCC 79aa 2e-05
in modified transcript
Changed!
COG COG4886 234aa 0.001
in modified transcript
SPP1
SPP1-3 SPP1-4 181 223
NCBIGene 36.3 6696
Single exon skipping, size difference: 42
Exclusion in the protein (no frameshift)
Reference transcript: NM_001040058
Changed!
pfam Osteopontin 314aa 1e-96
in ref transcript
Osteopontin.
Changed!
pfam Osteopontin 300aa 9e-89
in modified transcript
STIM1
STIM1.F1 STIM1.R1 202 295
AceView 36.Apr07 STIM1
Single exon skipping, size difference: 93
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: STIM1.aApr07
cd SAM 69aa 4e-04
in ref transcript
Sterile alpha motif.; Widespread domain in signalling and nuclear proteins. In EPH-related tyrosine kinases, appears to mediate cell-cell initiated signal transduction via the binding of SH2-containing proteins to a conserved tyrosine that is phosphorylated. In many cases mediates homodimerization.
TIGR SMC_prok_A 132aa 3e-06
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
smart SAM 65aa 3e-05
in ref transcript
Sterile alpha motif. Widespread domain in signalling and nuclear proteins. In EPH-related tyrosine kinases, appears to mediate cell-cell initiated signal transduction via the binding of SH2-containing proteins to a conserved tyrosine that is phosphorylated. In many cases mediates homodimerisation.
COG Smc 255aa 7e-05
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
SYK
SYK.F1 SYK.R16 244 313
AceView 36.Apr07 SYK
Single exon skipping, size difference: 69
Exclusion in the protein (no frameshift)
Reference transcript: SYK.bApr07
cd PTKc_Syk 257aa 1e-155
in ref transcript
Catalytic Domain of the Protein Tyrosine Kinase, Spleen tyrosine kinase. Protein Tyrosine Kinase (PTK) family; Spleen tyrosine kinase (Syk); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. Syk, together with Zap-70, form the Syk subfamily of kinases which are cytoplasmic (or nonreceptor) tyr kinases containing two Src homology 2 (SH2) domains N-terminal to the catalytic tyr kinase domain. Syk was first cloned from the spleen, and its function in hematopoietic cells is well-established. Syk is involved in the signaling downstream of activated receptors (including B-cell and Fc receptors) that contain ITAMs (immunoreceptor tyr activation motifs), leading to processes such as cell proliferation, differentiation, survival, adhesion, migration, and phagocytosis. More recently, Syk expression has been detected in other cell types (including epithelial cells, vascular endothelial cells, neurons, hepatocytes, and melanocytes), suggesting a variety of biological functions in non-immune cells. Syk plays a critical role in maintaining vascular integrity and in wound healing during embryogenesis. It also regulates Vav3, which is important in osteoclast function including bone development. In breast epithelial cells, where Syk acts as a negative regulator for epidermal growth factor receptor (EGFR) signaling, loss of Syk expression is associated with abnormal proliferation during cancer development suggesting a potential role as a tumor suppressor. In mice, Syk has been shown to inhibit malignant transformation of mammary epithelial cells induced with murine mammary tumor virus (MMTV).
cd SH2 93aa 4e-18
in ref transcript
Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
cd SH2 93aa 1e-16
in ref transcript
pfam Pkinase_Tyr 253aa 2e-96
in ref transcript
Protein tyrosine kinase.
pfam SH2 77aa 4e-22
in ref transcript
SH2 domain.
smart SH2 85aa 2e-19
in ref transcript
Src homology 2 domains. Src homology 2 domains bind phosphotyrosine-containing polypeptides via 2 surface pockets. Specificity is provided via interaction with residues that are distinct from the phosphotyrosine. Only a single occurrence of a SH2 domain has been found in S. cerevisiae.
COG SPS1 251aa 4e-18
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
SYNE2
SYNE2.u.f.128 SYNE2.u.r.124 119 188
NCBIGene 36.3 23224
Single exon skipping, size difference: 69
Exclusion in the protein (no frameshift)
Reference transcript: NM_182914
cd SPEC 214aa 1e-12
in ref transcript
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here.
cd CH 103aa 5e-12
in ref transcript
Calponin homology domain; actin-binding domain which may be present as a single copy or in tandem repeats (which increases binding affinity). The CH domain is found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).
cd SPEC 215aa 1e-10
in ref transcript
cd CH 103aa 6e-10
in ref transcript
cd SPEC 214aa 4e-09
in ref transcript
cd SPEC 221aa 8e-08
in ref transcript
cd SPEC 216aa 1e-07
in ref transcript
cd SPEC 207aa 2e-05
in ref transcript
pfam KASH 60aa 7e-14
in ref transcript
Nuclear envelope localisation domain. The KASH (for Klarsicht/ANC-1/Syne-1 homology) or KLS domain is a highly hydrophobic nuclear envelope localisation domain of approximately 60 amino acids comprising an 20-amino-acid transmembrane region and a 30-35-residue C-terminal region that lies between the inner and the outer nuclear membranes.
pfam CH 99aa 2e-13
in ref transcript
Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is not involved in its actin binding activity. Most proteins have two copies of the CH domain, however some proteins such as calponin have only a single copy.
smart CH 102aa 2e-13
in ref transcript
Calponin homology domain. Actin binding domains present in duplicate at the N-termini of spectrin-like proteins (including dystrophin, alpha-actinin). These domains cross-link actin filaments into bundles and networks. A calponin homology domain is predicted in yeasst Cdc24p.
pfam SMC_N 302aa 1e-06
in ref transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
TIGR SMC_prok_B 305aa 4e-05
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
smart SPEC 102aa 9e-05
in ref transcript
Spectrin repeats.
smart SPEC 104aa 3e-04
in ref transcript
TIGR SMC_prok_A 303aa 3e-04
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
TIGR SMC_prok_A 375aa 4e-04
in ref transcript
pfam SMC_N 281aa 0.002
in ref transcript
COG SAC6 255aa 3e-22
in ref transcript
Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton].
COG Smc 266aa 3e-04
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
TGFBR2
TGFBR2.F3 rs.TGFBR2.R1 219 294
NCBIGene 36.3 7048
Single exon skipping, size difference: 75
Exclusion in the protein (no frameshift)
Reference transcript: NM_001024847
cd S_TKc 288aa 5e-35
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
pfam ecTbetaR2 104aa 4e-49
in ref transcript
Transforming growth factor beta receptor 2 ectodomain. The Transforming growth factor beta receptor 2 ectodomain is a compact fold consisting of nine beta-strands and a single helix stabilised by a network of six intra strand disulphide bonds. The folding topology includes a central five-stranded antiparallel beta-sheet, eight-residues long at its centre, covered by a second layer consisting of two segments of two-stranded antiparallel beta-sheets (beta1-beta4, beta3-beta9).
pfam Pkinase 291aa 8e-45
in ref transcript
Protein kinase domain.
COG SPS1 287aa 2e-16
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
MYO18A
TIAF1-3 TIAF1-4 279 324
NCBIGene 36.3 399687
Single exon skipping, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_078471
cd MYSc_type_XVIII 779aa 0.0
in ref transcript
Myosin motor domain, type XVIII myosins. This catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle.
cd PDZ_signaling 80aa 6e-10
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
smart MYSc 786aa 1e-105
in ref transcript
Myosin. Large ATPases. ATPase; molecular motor. Muscle contraction consists of a cyclical interaction between myosin and actin. The core of the myosin structure is similar in fold to that of kinesin.
pfam Myosin_tail_1 527aa 1e-36
in ref transcript
Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Changed!
TIGR SMC_prok_B 288aa 2e-11
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
smart PDZ 81aa 5e-08
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
COG COG5022 1200aa 1e-103
in ref transcript
Myosin heavy chain [Cytoskeleton].
Changed!
COG Smc 738aa 5e-21
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
Changed!
TIGR SMC_prok_B 331aa 2e-08
in modified transcript
Changed!
COG Smc 691aa 1e-18
in modified transcript
TNFRSF10B
TNFRSF10B.u.f.10 tnfrsf10b.r.3 303 390
NCBIGene 36.3 8795
Intron retention, size difference: 87
Exclusion in the protein (no frameshift)
Reference transcript: NM_003842
cd TNFR 86aa 2e-16
in ref transcript
Tumor necrosis factor receptor (TNFR) domain; superfamily of TNF-like receptor domains. When bound to TNF-like cytokines, TNFRs trigger multiple signal transduction pathways, they are involved in inflammation response, apoptosis, autoimmunity and organogenesis. TNFRs domains are elongated with generally three tandem repeats of cysteine-rich domains (CRDs). They fit in the grooves between protomers within the ligand trimer. Some TNFRs, such as NGFR and HveA, bind ligands with no structural similarity to TNF and do not bind ligand trimers.
smart DEATH 87aa 2e-17
in ref transcript
DEATH domain, found in proteins involved in cell death (apoptosis). Alpha-helical domain present in a variety of proteins with apoptotic functions. Some (but not all) of these domains form homotypic and heterotypic dimers.
smart TNFR 41aa 3e-06
in ref transcript
Tumor necrosis factor receptor / nerve growth factor receptor repeats. Repeats in growth factor receptors that are involved in growth factor binding. TNF/TNFR.
smart TNFR 40aa 2e-04
in ref transcript
TOPBP1
TOPBP1.u.f.8 TOPBP1.u.r.8 128 143
AceView 36.Apr07 TOPBP1
Alternative 3-prime, size difference: 15
Exclusion in the protein (no frameshift)
Reference transcript: TOPBP1.aApr07
cd BRCT 71aa 4e-09
in ref transcript
Breast Cancer Suppressor Protein (BRCA1), carboxy-terminal domain. The BRCT domain is found within many DNA damage repair and cell cycle checkpoint proteins. The unique diversity of this domain superfamily allows BRCT modules to interact forming homo/hetero BRCT multimers, BRCT-non-BRCT interactions, and interactions within DNA strand breaks.
cd BRCT 70aa 6e-08
in ref transcript
cd BRCT 71aa 2e-06
in ref transcript
cd BRCT 70aa 2e-06
in ref transcript
cd BRCT 58aa 7e-05
in ref transcript
pfam BRCT 73aa 2e-13
in ref transcript
BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not been included in this family. The BRCT domain of XRCC1 forms a homodimer in the crystal structure. This suggests that pairs of BRCT domains associate as homo- or heterodimers.
pfam BRCT 74aa 5e-11
in ref transcript
pfam BRCT 72aa 3e-10
in ref transcript
pfam BRCT 71aa 5e-08
in ref transcript
pfam BRCT 79aa 1e-07
in ref transcript
smart BRCT 53aa 3e-06
in ref transcript
breast cancer carboxy-terminal domain.
TRAF3
TRAF3.u.f.1 TRAF3.u.r.3 161 300
NCBIGene 36.3 7187
Single exon skipping, size difference: 139
Exclusion in 5'UTR
Reference transcript: NM_145725
cd MATH_TRAF3 186aa 1e-106
in ref transcript
Tumor Necrosis Factor Receptor (TNFR)-Associated Factor (TRAF) family, TRAF3 subfamily, TRAF domain; TRAF molecules serve as adapter proteins that link TNFRs and downstream kinase cascades resulting in the activation of transcription factors and the regulation of cell survival, proliferation and stress responses. TRAF3 was first described as a molecule that binds the cytoplasmic tail of CD40. However, it is not required for CD40 signaling. More recently, TRAF3 has been identified as a key regulator of type I interferon (IFN) production and the mammalian innate antiviral immunity. It mediates IFN responses in Toll-like receptor (TLR)-dependent as well as TLR-independent viral recognition pathways. It is also a key element in immunological homeostasis through its regulation of the anti-inflammatory cytokine interleukin-10. TRAF3 contains a RING finger domain, five zinc finger domains, and a TRAF domain. The TRAF domain can be divided into a more divergent N-terminal alpha helical region (TRAF-N), and a highly conserved C-terminal MATH subdomain (TRAF-C) with an eight-stranded beta-sandwich structure. TRAF-N mediates trimerization while TRAF-C interacts with receptors.
pfam MATH 126aa 4e-18
in ref transcript
MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode Caenorhabditis elegans.
pfam zf-TRAF 57aa 1e-11
in ref transcript
TRAF-type zinc finger.
pfam Cast 176aa 2e-05
in ref transcript
RIM-binding protein of the cytomatrix active zone. This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
smart RING 36aa 0.002
in ref transcript
Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
TIGR rad18 203aa 0.005
in ref transcript
This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
COG Smc 161aa 5e-07
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
COG COG5222 59aa 0.005
in ref transcript
Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only].
TSSC4
TSSC4.F10 TSSC4.R7 140 332
AceView 36.Apr07 TSSC4
Intron retention, size difference: 192
Exclusion in the protein (no frameshift)
Reference transcript: TSSC4.dApr07
TUBA4A
TUBA1.F5 TUBA1.R5 101 324
AceView 36.Apr07 TUBA4A
Single exon skipping, size difference: 223
Exclusion in the protein causing a frameshift
Reference transcript: TUBA4A.aApr07
Changed!
cd alpha_tubulin 434aa 0.0
in ref transcript
The tubulin superfamily includes five distinct families, the alpha-, beta-, gamma-, delta-, and epsilon-tubulins and a sixth family (zeta-tubulin) which is present only in kinetoplastid protozoa. The alpha- and beta-tubulins are the major components of microtubules, while gamma-tubulin plays a major role in the nucleation of microtubule assembly. The delta- and epsilon-tubulins are widespread but unlike the alpha, beta, and gamma-tubulins they are not ubiquitous among eukaryotes. The alpha/beta-tubulin heterodimer is the structural subunit of microtubules. The alpha- and beta-tubulins share 40% amino-acid sequence identity, exist in several isotype forms, and undergo a variety of posttranslational modifications. The structures of alpha- and beta-tubulin are basically identical: each monomer is formed by a core of two beta-sheets surrounded by alpha-helices. The monomer structure is very compact, but can be divided into three regions based on function: the amino-terminal nucleotide-binding region, an intermediate taxol-binding region and the carboxy-terminal region which probably constitutes the binding surface for motor proteins.
Changed!
pfam Tubulin 243aa 2e-67
in ref transcript
Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial cell division. It is part of a ring in the middle of the dividing cell that is required for constriction of cell membrane and cell envelope to yield two daughter cells. FtsZ and tubulin are GTPases. FtsZ can polymerise into tubes, sheets, and rings in vitro and is ubiquitous in eubacteria and archaea. Tubulin is the major component of microtubules.
Changed!
pfam Tubulin_C 136aa 3e-52
in ref transcript
Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial cell division. It is part of a ring in the middle of the dividing cell that is required for constriction of cell membrane and cell envelope to yield two daughter cells. FtsZ and tubulin are GTPases. FtsZ can polymerise into tubes, sheets, and rings in vitro and is ubiquitous in eubacteria and archaea. Tubulin is the major component of microtubules.
Changed!
PTZ PTZ00012 447aa 0.0
in ref transcript
alpha-tubulin II; Provisional.
UTRN
UTRN.u.f.81 UTRN.u.r.78 231 270
AceView 36.Apr07 UTRN
Single exon skipping, size difference: 39
Exclusion in the protein (no frameshift)
Reference transcript: UTRN.aApr07
cd ZZ_dystrophin 49aa 2e-23
in ref transcript
Zinc finger, ZZ type. Zinc finger present in dystrophin and dystrobrevin. The ZZ motif coordinates two zinc ions and most likely participates in ligand binding or molecular scaffolding. Dystrophin attaches actin filaments to an integral membrane glycoprotein complex in muscle cells. The ZZ domain in dystrophin has been shown to be essential for binding to the membrane protein beta-dystroglycan.
cd SPEC 219aa 2e-18
in ref transcript
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here.
cd CH 105aa 2e-14
in ref transcript
Calponin homology domain; actin-binding domain which may be present as a single copy or in tandem repeats (which increases binding affinity). The CH domain is found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).
cd SPEC 203aa 6e-14
in ref transcript
cd SPEC 241aa 6e-13
in ref transcript
cd SPEC 211aa 1e-09
in ref transcript
cd SPEC 208aa 4e-09
in ref transcript
cd CH 104aa 3e-08
in ref transcript
cd SPEC 213aa 6e-08
in ref transcript
cd SPEC 117aa 2e-06
in ref transcript
cd SPEC 214aa 2e-05
in ref transcript
cd SPEC 201aa 4e-04
in ref transcript
cd SPEC 205aa 5e-04
in ref transcript
cd WW 29aa 6e-04
in ref transcript
Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other peptide recognition proteins such as antibodies and receptors; WW domains have a single groove formed by a conserved Trp and Tyr which recognizes a pair of residues of the sequence X-Pro; variable loops and neighboring domains confer specificity in this domain; there are five distinct groups based on binding: 1) PPXY motifs 2) the PPLP motif; 3) PGM motifs; 4) PSP or PTP motifs; 5) PR motifs.
pfam efhand_1 121aa 2e-49
in ref transcript
EF hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
pfam efhand_2 92aa 1e-36
in ref transcript
EF-hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
pfam ZZ 46aa 1e-17
in ref transcript
Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin. Putative zinc finger; binding not yet shown. Four to six cysteine residues in its sequence are responsible for coordinating zinc ions, to reinforce the structure.
pfam CH 104aa 5e-17
in ref transcript
Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is not involved in its actin binding activity. Most proteins have two copies of the CH domain, however some proteins such as calponin have only a single copy.
pfam CH 103aa 2e-10
in ref transcript
smart SPEC 102aa 9e-10
in ref transcript
Spectrin repeats.
smart SPEC 102aa 2e-08
in ref transcript
smart SPEC 106aa 7e-07
in ref transcript
smart SPEC 94aa 2e-06
in ref transcript
TIGR SMC_prok_B 742aa 2e-05
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
pfam SMC_N 241aa 2e-04
in ref transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
pfam SMC_N 302aa 2e-04
in ref transcript
pfam WW 30aa 4e-04
in ref transcript
WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro.
TIGR SMC_prok_B 783aa 0.001
in ref transcript
TIGR SMC_prok_B 348aa 0.008
in ref transcript
COG SAC6 238aa 2e-23
in ref transcript
Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton].
COG Smc 764aa 8e-08
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
COG SbcC 623aa 7e-06
in ref transcript
ATPase involved in DNA repair [DNA replication, recombination, and repair].
COG Smc 335aa 8e-06
in ref transcript
COG Smc 817aa 5e-05
in ref transcript
BCL2L1
bcl2l1.f.2 bcl2l1.r.2 193 382
NCBIGene 36.3 598
Alternative 5-prime, size difference: 189
Exclusion in the protein (no frameshift)
Reference transcript: NM_138578
Changed!
cd Bcl-2_like 114aa 2e-40
in ref transcript
Apoptosis regulator proteins of the Bcl-2 family, named after B-cell lymphoma 2. This alignment model spans what have been described as Bcl-2 homology regions BH1, BH2, BH3, and BH4. Many members of this family have an additional C-terminal transmembrane segment. Some homologous proteins, which are not included in this model, may miss either the BH4 (Bax, Bak) or the BH2 (Bcl-X(S)) region, and some appear to only share the BH3 region (Bik, Bim, Bad, Bid, Egl-1). This family is involved in the regulation of the outer mitochondrial membrane's permeability and in promoting or preventing the release of apoptogenic factors, which in turn may trigger apoptosis by activating caspases. Bcl-2 and the closely related Bcl-X(L) are anti-apoptotic key regulators of programmed cell death. They are assumed to function via heterodimeric protein-protein interactions, binding pro-apoptotic proteins such as Bad (BCL2-antagonist of cell death), Bid, and Bim, by specifically interacting with their BH3 regions. Interfering with this heterodimeric interaction via small-molecule inhibitors may prove effective in targeting various cancers. This family also includes the Caenorhabditis elegans Bcl-2 homolog CED-9, which binds to CED-4, the C. Elegans homolog of mammalian Apaf-1. Apaf-1, however, does not seem to be inhibited by Bcl-2 directly.
Changed!
TIGR bcl-2 233aa 1e-89
in ref transcript
in artificial membranes at acidic pH, proapoptotic Bcl-2 family proteins (including Bax and Bak) probably induce the mitochondrial permeability transition and cytochrome c release by interacting with permeability transition pores, the most important component for pore fomation of which is VDAC.
Changed!
cd Bcl-2_like 45aa 1e-08
in modified transcript
Changed!
TIGR bcl-2 125aa 1e-35
in modified transcript
Changed!
TIGR bcl-2 45aa 6e-09
in modified transcript
CASP9
primer-113212 primer-113213 360 670
AceView 36.Apr07 CASP9
Exon skipping and alternative 3-prime or 5-prime, size difference: 310
Inclusion in the protein causing a new stop codon, Inclusion in the protein (no stop codon or frameshift), Inclusion in the protein (no stop codon or frameshift), Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: CASP9.dApr07
Changed!
cd CASc 125aa 2e-26
in ref transcript
Caspase, interleukin-1 beta converting enzyme (ICE) homologues; Cysteine-dependent aspartate-directed proteases that mediate programmed cell death (apoptosis). Caspases are synthesized as inactive zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologs.
Changed!
smart CASc 124aa 3e-26
in ref transcript
Caspase, interleukin-1 beta converting enzyme (ICE) homologues. Cysteine aspartases that mediate programmed cell death (apoptosis). Caspases are synthesised as zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologues.
smart CARD 91aa 1e-15
in ref transcript
Caspase recruitment domain. Motif contained in proteins involved in apoptotic signalling. Mediates homodimerisation. Structure consists of six antiparallel helices arranged in a topology homologue to the DEATH and the DED domain.
A2BP1
refseq_A2BP1.F1 refseq_A2BP1.R1 189 267
NCBIGene 36.3 54715
Alternative 3-prime, size difference: 78
Exclusion in the protein (no frameshift)
Reference transcript: NM_145891
cd RRM 72aa 8e-21
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
smart RRM_2 72aa 1e-20
in ref transcript
RNA recognition motif.
TIGR hnRNP-R-Q 159aa 7e-04
in ref transcript
Sequences in this subfamily include the human heterogeneous nuclear ribonucleoproteins (hnRNP) R, Q and APOBEC-1 complementation factor (aka APOBEC-1 stimulating protein). These proteins contain three RNA recognition domains (rrm: pfam00076) and a somewhat variable C-terminal domain.
pfam PAT1 133aa 0.009
in ref transcript
Topoisomerase II-associated protein PAT1. Members of this family are necessary for accurate chromosome transmission during cell division.
COG COG0724 101aa 1e-12
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
A2BP1
refseq_A2BP1.F3 refseq_A2BP1.R3 112 165
NCBIGene 36.3 54715
Single exon skipping, size difference: 53
Inclusion in the protein causing a frameshift
Reference transcript: NM_145891
cd RRM 72aa 8e-21
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
smart RRM_2 72aa 1e-20
in ref transcript
RNA recognition motif.
TIGR hnRNP-R-Q 159aa 7e-04
in ref transcript
Sequences in this subfamily include the human heterogeneous nuclear ribonucleoproteins (hnRNP) R, Q and APOBEC-1 complementation factor (aka APOBEC-1 stimulating protein). These proteins contain three RNA recognition domains (rrm: pfam00076) and a somewhat variable C-terminal domain.
Changed!
pfam PAT1 133aa 0.009
in ref transcript
Topoisomerase II-associated protein PAT1. Members of this family are necessary for accurate chromosome transmission during cell division.
COG COG0724 101aa 1e-12
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
AAA1
refseq_AAA1.F1 refseq_AAA1.R1 121 271
NCBIGene 36.2 404744
Single exon skipping, size difference: 150
Exclusion of the stop codon
Reference transcript: NM_207288
AAA1
refseq_AAA1.F2 refseq_AAA1.R3 204 324
NCBIGene 36.2 404744
Single exon skipping, size difference: 120
Inclusion in the protein causing a new stop codon
Reference transcript: NM_207288
ABCB4
refseq_ABCB4.F1 refseq_ABCB4.R1 106 127
NCBIGene 36.3 5244
Alternative 3-prime, size difference: 21
Exclusion in the protein (no frameshift)
Reference transcript: NM_018849
cd ABC_MTABC3_MDL1_MDL2 238aa 1e-116
in ref transcript
MTABC3 (also known as ABCB6) is a mitochondrial ATP-binding cassette protein involved in iron homeostasis and one of four ABC transporters expressed in the mitochondrial inner membrane, the other three being MDL1(ABC7), MDL2, and ATM1. In fact, the yeast MDL1 (multidrug resistance-like protein 1) and MDL2 (multidrug resistance-like protein 2) transporters are also included in this CD. MDL1 is an ATP-dependent permease that acts as a high-copy suppressor of ATM1 and is thought to have a role in resistance to oxidative stress. Interestingly, subfamily B is more closely related to the carboxyl-terminal component of subfamily C than the two halves of ABCC molecules are with one another.
Changed!
cd ABC_MTABC3_MDL1_MDL2 247aa 1e-113
in ref transcript
TIGR 3a01208 587aa 1e-115
in ref transcript
Changed!
TIGR 3a01208 495aa 2e-95
in ref transcript
COG MdlB 524aa 1e-111
in ref transcript
ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms].
Changed!
PTZ PTZ00265 773aa 3e-98
in ref transcript
multidrug resistance protein (mdr1); Provisional.
Changed!
cd ABC_MTABC3_MDL1_MDL2 240aa 1e-115
in modified transcript
Changed!
TIGR 3a01208 488aa 1e-97
in modified transcript
Changed!
COG MdlB 498aa 3e-98
in modified transcript
ABCB4
refseq_ABCB4.F3 refseq_ABCB4.R3 179 320
NCBIGene 36.3 5244
Single exon skipping, size difference: 141
Exclusion in the protein (no frameshift)
Reference transcript: NM_018849
cd ABC_MTABC3_MDL1_MDL2 238aa 1e-116
in ref transcript
MTABC3 (also known as ABCB6) is a mitochondrial ATP-binding cassette protein involved in iron homeostasis and one of four ABC transporters expressed in the mitochondrial inner membrane, the other three being MDL1(ABC7), MDL2, and ATM1. In fact, the yeast MDL1 (multidrug resistance-like protein 1) and MDL2 (multidrug resistance-like protein 2) transporters are also included in this CD. MDL1 is an ATP-dependent permease that acts as a high-copy suppressor of ATM1 and is thought to have a role in resistance to oxidative stress. Interestingly, subfamily B is more closely related to the carboxyl-terminal component of subfamily C than the two halves of ABCC molecules are with one another.
cd ABC_MTABC3_MDL1_MDL2 247aa 1e-113
in ref transcript
TIGR 3a01208 587aa 1e-115
in ref transcript
Changed!
TIGR 3a01208 495aa 2e-95
in ref transcript
COG MdlB 524aa 1e-111
in ref transcript
ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms].
Changed!
PTZ PTZ00265 773aa 3e-98
in ref transcript
multidrug resistance protein (mdr1); Provisional.
Changed!
TIGR 3a01208 448aa 2e-91
in modified transcript
Changed!
PTZ PTZ00265 726aa 2e-86
in modified transcript
ABCB9
refseq_ABCB9.F1 refseq_ABCB9.R1 184 373
NCBIGene 36.2 23457
Single exon skipping, size difference: 189
Exclusion in the protein (no frameshift)
Reference transcript: NM_019625
Changed!
cd ABCC_TAP 226aa 1e-110
in ref transcript
TAP, the Transporter Associated with Antigen Processing; TAP is essential for peptide delivery from the cytosol into the lumen of the endoplasmic reticulum (ER), where these peptides are loaded on major histocompatibility complex (MHC) I molecules. Loaded MHC I leave the ER and display their antigenic cargo on the cell surface to cytotoxic T cells. Subsequently, virus-infected or malignantly transformed cells can be eliminated. TAP belongs to the large family of ATP-binding cassette (ABC) transporters, which translocate a vast variety of solutes across membranes.
Changed!
TIGR 3a01208 695aa 0.0
in ref transcript
Changed!
COG MdlB 573aa 1e-127
in ref transcript
ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms].
Changed!
cd ABCC_TAP 197aa 4e-94
in modified transcript
Changed!
TIGR 3a01208 632aa 1e-180
in modified transcript
Changed!
COG SunT 506aa 1e-94
in modified transcript
TAP, the Transporter Associated with Antigen Processing; TAP is essential for peptide delivery from the cytosol into the lumen of the endoplasmic reticulum (ER), where these peptides are loaded on major histocompatibility complex (MHC) I molecules. Loaded MHC I leave the ER and display their antigenic cargo on the cell surface to cytotoxic T cells. Subsequently, virus-infected or malignantly transformed cells can be eliminated. TAP belongs to the large family of ATP-binding cassette (ABC) transporters, which translocate a vast variety of solutes across membranes.
Changed!
TIGR 3a01208 695aa 0.0
in ref transcript
Changed!
COG MdlB 573aa 1e-127
in ref transcript
ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms].
Changed!
TIGR MsbA_rel 525aa 1e-110
in modified transcript
This protein is related to a Proteobacterial ATP transporter that exports lipid A and to eukaryotic P-glycoproteins.
Changed!
COG MdlB 530aa 1e-110
in modified transcript
ABCC11
refseq_ABCC11.F1 refseq_ABCC11.R1 241 355
NCBIGene 36.3 85320
Single exon skipping, size difference: 114
Exclusion in the protein (no frameshift)
Reference transcript: NM_032583
Changed!
cd ABCC_MRP_domain2 221aa 2e-95
in ref transcript
Domain 2 of the ABC subfamily C. This family is also known as MRP (mulrtidrug resisitance-associated protein). Some of the MRP members have five additional transmembrane segments in their N-terminus, but the function of these additional membrane-spanning domains is not clear. The MRP was found in the multidrug-resistance lung cancer cell in which p-glycoprotein was not overexpressed. MRP exports glutathione by drug stimulation, as well as, certain substrates in conjugated forms with anions, such as glutathione, glucuronate, and sulfate.
cd ABCC_MRP_domain1 193aa 7e-80
in ref transcript
Domain 1 of the ABC subfamily C. This family is also known as MRP (mulrtidrug resisitance-associated protein). Some of the MRP members have five additional transmembrane segments in their N-terminas, but the function of these additional membrane-spanning domains is not clear. The MRP was found in the multidrug-resisting lung cancer cell in which p-glycoprotein was not overexpressed. MRP exports glutathione by drug stimulation, as well as, certain substrates in conjugated forms with anions, such as glutathione, glucuronate, and sulfate.
Changed!
TIGR MRP_assoc_pro 1298aa 0.0
in ref transcript
This model describes multi drug resistance-associated protein (MRP) in eukaryotes. The multidrug resistance-associated protein is an integral membrane protein that causes multidrug resistance when overexpressed in mammalian cells. It belongs to ABC transporter superfamily. The protein topology and function was experimentally demonstrated by epitope tagging and immunofluorescence. Insertion of tags in the critical regions associated with drug efflux, abrogated its function. The C-terminal domain seem to highly conserved.
Changed!
PTZ PTZ00243 893aa 1e-153
in ref transcript
ABC transporter; Provisional.
COG MdlB 583aa 2e-51
in ref transcript
ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms].
Changed!
cd ABCC_MRP_domain2 183aa 2e-73
in modified transcript
Changed!
TIGR MRP_assoc_pro 1260aa 0.0
in modified transcript
Changed!
PTZ PTZ00243 855aa 1e-139
in modified transcript
ABCC9
refseq_ABCC9.F2 refseq_ABCC9.R2 236 344
NCBIGene 36.3 10060
Single exon skipping, size difference: 108
Exclusion in the protein (no frameshift)
Reference transcript: NM_020297
cd ABCC_SUR2 240aa 1e-137
in ref transcript
The SUR domain 2. The sulfonylurea receptor SUR is an ATP binding cassette (ABC) protein of the ABCC/MRP family. Unlike other ABC proteins, it has no intrinsic transport function, neither active nor passive, but associates with the potassium channel proteins Kir6.1 or Kir6.2 to form the ATP-sensitive potassium (K(ATP)) channel. Within the channel complex, SUR serves as a regulatory subunit that fine-tunes the gating of Kir6.x in response to alterations in cellular metabolism. It constitutes a major pharmaceutical target as it binds numerous drugs, K(ATP) channel openers and blockers, capable of up- or down-regulating channel activity.
Changed!
cd ABCC_SUR1_N 218aa 1e-125
in ref transcript
The SUR domain 1. The sulfonylurea receptor SUR is an ATP transporter of the ABCC/MRP family with tandem ATPase binding domains. Unlike other ABC proteins, it has no intrinsic transport function, neither active nor passive, but associates with the potassium channel proteins Kir6.1 or Kir6.2 to form the ATP-sensitive potassium (K(ATP)) channel. Within the channel complex, SUR serves as a regulatory subunit that fine-tunes the gating of Kir6.x in response to alterations in cellular metabolism. It constitutes a major pharmaceutical target as it binds numerous drugs, K(ATP) channel openers and blockers, capable of up- or down-regulating channel activity.
Changed!
TIGR MRP_assoc_pro 1325aa 0.0
in ref transcript
This model describes multi drug resistance-associated protein (MRP) in eukaryotes. The multidrug resistance-associated protein is an integral membrane protein that causes multidrug resistance when overexpressed in mammalian cells. It belongs to ABC transporter superfamily. The protein topology and function was experimentally demonstrated by epitope tagging and immunofluorescence. Insertion of tags in the critical regions associated with drug efflux, abrogated its function. The C-terminal domain seem to highly conserved.
Changed!
PTZ PTZ00243 1183aa 1e-170
in ref transcript
ABC transporter; Provisional.
Changed!
cd ABCC_SUR1_N 217aa 1e-124
in modified transcript
Changed!
TIGR MRP_assoc_pro 1289aa 0.0
in modified transcript
Changed!
PTZ PTZ00243 1147aa 1e-170
in modified transcript
ABCD4
refseq_ABCD4.F1 refseq_ABCD4.R1 155 283
NCBIGene 36.3 5826
Single exon skipping, size difference: 128
Exclusion in the protein causing a frameshift
Reference transcript: NM_005050
Changed!
cd ABCD_peroxisomal_ALDP 212aa 2e-55
in ref transcript
Peroxisomal ATP-binding cassette transporter (Pat) is involved in the import of very long-chain fatty acids (VLCFA) into the peroxisome. The peroxisomal membrane forms a permeability barrier for a wide variety of metabolites required for and formed during fatty acid beta-oxidation. To communicate with the cytoplasm and mitochondria, peroxisomes need dedicated proteins to transport such hydrophilic molecules across their membranes. X-linked adrenoleukodystrophy (X-ALD) is caused by mutations in the ALD gene, which encodes ALDP (adrenoleukodystrophy protein ), a peroxisomal integral membrane protein that is a member of the ATP-binding cassette (ABC) transporter protein family. The disease is characterized by a striking and unpredictable variation in phenotypic expression. Phenotypes include the rapidly progressive childhood cerebral form (CCALD), the milder adult form, adrenomyeloneuropathy (AMN), and variants without neurologic involvement (i.e. asymptomatic).
Changed!
TIGR 3a01203 522aa 8e-77
in ref transcript
Changed!
COG COG4178 534aa 1e-75
in ref transcript
ABC-type uncharacterized transport system, permease and ATPase components [General function prediction only].
ABCD4
refseq_ABCD4.F3 refseq_ABCD4.R3 159 299
NCBIGene 36.2 5826
Single exon skipping, size difference: 140
Exclusion in the protein causing a frameshift
Reference transcript: NM_005050
Changed!
cd ABCD_peroxisomal_ALDP 212aa 2e-55
in ref transcript
Peroxisomal ATP-binding cassette transporter (Pat) is involved in the import of very long-chain fatty acids (VLCFA) into the peroxisome. The peroxisomal membrane forms a permeability barrier for a wide variety of metabolites required for and formed during fatty acid beta-oxidation. To communicate with the cytoplasm and mitochondria, peroxisomes need dedicated proteins to transport such hydrophilic molecules across their membranes. X-linked adrenoleukodystrophy (X-ALD) is caused by mutations in the ALD gene, which encodes ALDP (adrenoleukodystrophy protein ), a peroxisomal integral membrane protein that is a member of the ATP-binding cassette (ABC) transporter protein family. The disease is characterized by a striking and unpredictable variation in phenotypic expression. Phenotypes include the rapidly progressive childhood cerebral form (CCALD), the milder adult form, adrenomyeloneuropathy (AMN), and variants without neurologic involvement (i.e. asymptomatic).
Changed!
TIGR 3a01203 522aa 8e-77
in ref transcript
Changed!
COG COG4178 534aa 1e-75
in ref transcript
ABC-type uncharacterized transport system, permease and ATPase components [General function prediction only].
Changed!
pfam ABC_membrane_2 67aa 9e-09
in modified transcript
ABC transporter transmembrane region 2. This domain covers the transmembrane of a small family of ABC transporters and shares sequence similarity with pfam00664. Mutations in this domain in human PMP70 (70 kDa peroxisomal membrane protein) are believed responsible for Zellweger Syndrome-2; mutations in human ALDP (Adrenoleukodystrophy protein) are responsible for recessive X-linked adrenoleukodystrophy. A Saccharomyces cerevisiae homolog is involved in the import of long-chain fatty acids.
ABCF1
refseq_ABCF1.F2 refseq_ABCF1.R2 135 249
NCBIGene 36.3 23
Single exon skipping, size difference: 114
Exclusion in the protein (no frameshift)
Reference transcript: NM_001025091
cd ABCF_EF-3 189aa 3e-32
in ref transcript
ABCF_EF-3 Elongation factor 3 (EF-3) is a cytosolic protein required by fungal ribosomes for in vitro protein synthesis and for in vivo growth. EF-3 stimulates the binding of the EF-1: GTP: aa-tRNA ternary complex to the ribosomal A site by facilitated release of the deacylated tRNA from the E site. The reaction requires ATP hydrolysis. EF-3 contains two ATP nucleotide binding sequence (NBS) motifs. NBSI is sufficient for the intrinsic ATPase activity. NBSII is essential for the ribosome-stimulated functions.
cd ABCF_EF-3 76aa 4e-27
in ref transcript
cd ABC_cobalt_CbiO_domain1 217aa 6e-18
in ref transcript
Domain I of the ABC component of a cobalt transport family found in bacteria, archaea, and eukaryota. The transition metal cobalt is an essential component of many enzymes and must be transported into cells in appropriate amounts when needed. This ABC transport system of the CbiMNQO family is involved in cobalt transport in association with the cobalamin (vitamin B12) biosynthetic pathways. Most of cobalt (Cbi) transport systems possess a separate CbiN component, the cobalt-binding periplasmic protein, and they are encoded by the conserved gene cluster cbiMNQO. Both the CbiM and CbiQ proteins are integral cytoplasmic membrane proteins, and the CbiO protein has the linker peptide and the Walker A and B motifs commonly found in the ATPase components of the ABC-type transport systems.
pfam ABC_tran 196aa 4e-29
in ref transcript
ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporters are composed of two copies of this domain and two copies of a transmembrane domain pfam00664. These four domains may belong to a single polypeptide or belong in different polypeptide chains.
pfam ABC_tran 163aa 9e-25
in ref transcript
TIGR PhnT 49aa 0.004
in ref transcript
This ATP-binding component of an ABC transport system is found in Salmonella and Burkholderia lineages in the vicinity of enzymes for the breakdown of 2-aminoethylphosphonate.
COG Uup 532aa 1e-107
in ref transcript
ATPase components of ABC transporters with duplicated ATPase domains [General function prediction only].
ABCG1
refseq_ABCG1.F1 refseq_ABCG1.R1 125 161
NCBIGene 36.3 9619
Alternative 5-prime, size difference: 36
Exclusion in the protein (no frameshift)
Reference transcript: NM_004915
cd ABCG_EPDR 226aa 4e-74
in ref transcript
ABCG transporters are involved in eye pigment (EP) precursor transport, regulation of lipid-trafficking mechanisms, and pleiotropic drug resistance (DR). DR is a well-described phenomenon occurring in fungi and shares several similarities with processes in bacteria and higher eukaryotes. Compared to other members of the ABC transporter subfamilies, the ABCG transporter family is composed of proteins that have an ATP-binding cassette domain at the N-terminus and a TM (transmembrane) domain at the C-terminus.
Changed!
TIGR 3a01204 619aa 0.0
in ref transcript
COG CcmA 220aa 7e-46
in ref transcript
ABC-type multidrug transport system, ATPase component [Defense mechanisms].
Changed!
TIGR 3a01204 607aa 0.0
in modified transcript
ABCG1
refseq_ABCG1.F3 refseq_ABCG1.R3 204 245
NCBIGene 36.3 9619
Single exon skipping, size difference: 41
Inclusion in the protein causing a new stop codon
Reference transcript: NM_207627
Changed!
cd ABCG_EPDR 226aa 2e-74
in ref transcript
ABCG transporters are involved in eye pigment (EP) precursor transport, regulation of lipid-trafficking mechanisms, and pleiotropic drug resistance (DR). DR is a well-described phenomenon occurring in fungi and shares several similarities with processes in bacteria and higher eukaryotes. Compared to other members of the ABC transporter subfamilies, the ABCG transporter family is composed of proteins that have an ATP-binding cassette domain at the N-terminus and a TM (transmembrane) domain at the C-terminus.
Changed!
TIGR 3a01204 607aa 0.0
in ref transcript
Changed!
COG CcmA 220aa 3e-46
in ref transcript
ABC-type multidrug transport system, ATPase component [Defense mechanisms].
ABHD11
refseq_ABHD11.F1 refseq_ABHD11.R1 114 135
NCBIGene 36.2 83451
Alternative 3-prime, size difference: 21
Exclusion in the protein (no frameshift)
Reference transcript: NM_148912
pfam Abhydrolase_1 216aa 4e-13
in ref transcript
alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes.
PRK PRK10673 246aa 2e-37
in ref transcript
hypothetical protein; Provisional.
ABHD2
refseq_ABHD2.F1 refseq_ABHD2.R1 100 498
NCBIGene 36.3 11057
Multiple exon skipping, size difference: 398
Exclusion in 5'UTR, Exclusion in 5'UTR, Exclusion in 5'UTR, Exclusion in 5'UTR
Reference transcript: NM_007011
pfam Abhydrolase_1 230aa 3e-26
in ref transcript
alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes.
COG COG0429 336aa 4e-44
in ref transcript
Predicted hydrolase of the alpha/beta-hydrolase fold [General function prediction only].
ABI1
refseq_ABI1.F1 refseq_ABI1.R1 135 216
NCBIGene 36.3 10006
Single exon skipping, size difference: 81
Exclusion in the protein (no frameshift)
Reference transcript: NM_005470
cd SH3 52aa 3e-15
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
pfam Abi_HHR 79aa 4e-32
in ref transcript
Abl-interactor HHR. The region featured in this family is found towards the N-terminus of a number of adaptor proteins that interact with Abl-family tyrosine kinases. More specifically, it is termed the homeo-domain homologous region (HHR), as it is similar to the DNA-binding region of homeo-domain proteins. Other homeo-domain proteins have been implicated in specifying positional information during embryonic development, and in the regulation of the expression of cell-type specific genes. The Abl-interactor proteins are thought to coordinate the cytoplasmic and nuclear functions of the Abl-family kinases, and seem to be involved in cytoskeletal reorganisation, but their precise role remains unclear.
smart SH3 56aa 2e-17
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
ABI1
refseq_ABI1.F3 refseq_ABI1.R3 165 252
NCBIGene 36.3 10006
Single exon skipping, size difference: 87
Exclusion in the protein (no frameshift)
Reference transcript: NM_005470
cd SH3 52aa 3e-15
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
pfam Abi_HHR 79aa 4e-32
in ref transcript
Abl-interactor HHR. The region featured in this family is found towards the N-terminus of a number of adaptor proteins that interact with Abl-family tyrosine kinases. More specifically, it is termed the homeo-domain homologous region (HHR), as it is similar to the DNA-binding region of homeo-domain proteins. Other homeo-domain proteins have been implicated in specifying positional information during embryonic development, and in the regulation of the expression of cell-type specific genes. The Abl-interactor proteins are thought to coordinate the cytoplasmic and nuclear functions of the Abl-family kinases, and seem to be involved in cytoskeletal reorganisation, but their precise role remains unclear.
smart SH3 56aa 2e-17
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
ABLIM1
refseq_ABLIM1.F1 refseq_ABLIM1.R1 167 272
NCBIGene 36.3 3983
Single exon skipping, size difference: 105
Exclusion in the protein (no frameshift)
Reference transcript: NM_002313
smart VHP 36aa 2e-12
in ref transcript
Villin headpiece domain.
pfam LIM 56aa 4e-10
in ref transcript
LIM domain. This family represents two copies of the LIM structural domain.
pfam LIM 56aa 1e-08
in ref transcript
pfam LIM 45aa 6e-07
in ref transcript
smart LIM 33aa 5e-05
in ref transcript
Zinc-binding domain present in Lin-11, Isl-1, Mec-3. Zinc-binding domain family. Some LIM domains bind protein partners via tyrosine-containing motifs. LIM domains are found in many key regulators of developmental pathways.
ABLIM1
refseq_ABLIM1.F3 refseq_ABLIM1.R3 119 203
NCBIGene 36.3 3983
Single exon skipping, size difference: 84
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_002313
smart VHP 36aa 2e-12
in ref transcript
Villin headpiece domain.
pfam LIM 56aa 4e-10
in ref transcript
LIM domain. This family represents two copies of the LIM structural domain.
pfam LIM 56aa 1e-08
in ref transcript
pfam LIM 45aa 6e-07
in ref transcript
smart LIM 33aa 5e-05
in ref transcript
Zinc-binding domain present in Lin-11, Isl-1, Mec-3. Zinc-binding domain family. Some LIM domains bind protein partners via tyrosine-containing motifs. LIM domains are found in many key regulators of developmental pathways.
ABTB1
refseq_ABTB1.F1 refseq_ABTB1.R1 167 202
NCBIGene 36.3 80325
Alternative 5-prime, size difference: 35
Exclusion in the protein causing a frameshift
Reference transcript: NM_172027
Changed!
cd ANK 73aa 6e-08
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
Changed!
pfam BTB 76aa 3e-15
in ref transcript
BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of pox virus proteins. The BTB/POZ domain mediates homomeric dimerisation and in some instances heteromeric dimerisation. The structure of the dimerised PLZF BTB/POZ domain has been solved and consists of a tightly intertwined homodimer. The central scaffolding of the protein is made up of a cluster of alpha-helices flanked by short beta-sheets at both the top and bottom of the molecule. POZ domains from several zinc finger proteins have been shown to mediate transcriptional repression and to interact with components of histone deacetylase co-repressor complexes including N-CoR and SMRT. The POZ or BTB domain is also known as BR-C/Ttk or ZiN.
Changed!
smart BTB 96aa 1e-14
in ref transcript
Broad-Complex, Tramtrack and Bric a brac. Domain in Broad-Complex, Tramtrack and Bric a brac. Also known as POZ (poxvirus and zinc finger) domain. Known to be a protein-protein interaction motif found at the N-termini of several C2H2-type transcription factors as well as Shaw-type potassium channels. Known structure reveals a tightly intertwined dimer formed via interactions between N-terminal strand and helix structures. However in a subset of BTB/POZ domains, these two secondary structures appear to be missing. Be aware SMART predicts BTB/POZ domains without the beta1- and alpha1-secondary structures.
ABTB1
refseq_ABTB1.F2 refseq_ABTB1.R2 112 158
NCBIGene 36.3 80325
Alternative 3-prime, size difference: 46
Inclusion in the protein causing a new stop codon
Reference transcript: NM_172027
Changed!
cd ANK 73aa 6e-08
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
Changed!
pfam BTB 76aa 3e-15
in ref transcript
BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of pox virus proteins. The BTB/POZ domain mediates homomeric dimerisation and in some instances heteromeric dimerisation. The structure of the dimerised PLZF BTB/POZ domain has been solved and consists of a tightly intertwined homodimer. The central scaffolding of the protein is made up of a cluster of alpha-helices flanked by short beta-sheets at both the top and bottom of the molecule. POZ domains from several zinc finger proteins have been shown to mediate transcriptional repression and to interact with components of histone deacetylase co-repressor complexes including N-CoR and SMRT. The POZ or BTB domain is also known as BR-C/Ttk or ZiN.
Changed!
smart BTB 96aa 1e-14
in ref transcript
Broad-Complex, Tramtrack and Bric a brac. Domain in Broad-Complex, Tramtrack and Bric a brac. Also known as POZ (poxvirus and zinc finger) domain. Known to be a protein-protein interaction motif found at the N-termini of several C2H2-type transcription factors as well as Shaw-type potassium channels. Known structure reveals a tightly intertwined dimer formed via interactions between N-terminal strand and helix structures. However in a subset of BTB/POZ domains, these two secondary structures appear to be missing. Be aware SMART predicts BTB/POZ domains without the beta1- and alpha1-secondary structures.
ACADVL
refseq_ACADVL.F2 refseq_ACADVL.R2 327 393
NCBIGene 36.3 37
Single exon skipping, size difference: 66
Exclusion in the protein (no frameshift)
Reference transcript: NM_000018
cd VLCAD 410aa 0.0
in ref transcript
Very long chain acyl-CoA dehydrogenase (VLCAD). VLCAD acyl-CoA dehydrogenases (ACAD), which is found in the mitochondria of eukaryotes and in some bacteria. It catalyzes the alpha,beta dehydrogenation of the corresponding trans-enoyl-CoA by FAD, which becomes reduced. The reduced form of ACAD is reoxidized in the oxidative half-reaction by electron-transferring flavoprotein (ETF), from which the electrons are transferred to the mitochondrial respiratory chain coupled with ATP synthesis. VLCAD, which is a homodimer.
pfam Acyl-CoA_dh_1 147aa 2e-35
in ref transcript
Acyl-CoA dehydrogenase, C-terminal domain. C-terminal domain of Acyl-CoA dehydrogenase is an all-alpha, four helical up-and-down bundle.
TIGR cyc_hxne_CoA_dh 380aa 1e-26
in ref transcript
Cyclohex-1-ene-1carboxyl-CoA is an intermediate in the anaerobic degradation of benzoyl-CoA derived from varioius aromatic compounds, in Rhodopseudomonas palustris but not Thauera aromatica. The aliphatic compound cyclohexanecarboxylate, can be converted to the same intermediate in two steps. The first step is its ligation to coenzyme A. The second is the action of this enzyme, cyclohexanecarboxyl-CoA dehydrogenase.
COG CaiA 380aa 2e-79
in ref transcript
Acyl-CoA dehydrogenases [Lipid metabolism].
ACCN2
refseq_ACCN2.F1 refseq_ACCN2.R1 140 278
NCBIGene 36.3 41
Alternative 5-prime, size difference: 138
Exclusion in the protein (no frameshift)
Reference transcript: NM_020039
Changed!
TIGR ENaC 538aa 1e-160
in ref transcript
This Hmm is designed from the vertebrate members of the ENaC family.
Changed!
TIGR ENaC 492aa 1e-168
in modified transcript
ACCN3
refseq_ACCN3.F1 refseq_ACCN3.R1 113 133
NCBIGene 36.3 9311
Alternative 3-prime, size difference: 20
Exclusion in the protein causing a frameshift
Reference transcript: NM_020321
Changed!
pfam ASC 441aa 1e-120
in ref transcript
Amiloride-sensitive sodium channel.
Changed!
TIGR ENaC 395aa 1e-120
in modified transcript
This Hmm is designed from the vertebrate members of the ENaC family.
Changed!
TIGR deg-1 87aa 4e-05
in modified transcript
This Hmm is designed from the invertebrate members of the ENaC family.
ACCN4
refseq_ACCN4.F1 refseq_ACCN4.R1 182 239
NCBIGene 36.3 55515
Single exon skipping, size difference: 57
Exclusion in the protein (no frameshift)
Reference transcript: NM_018674
Changed!
TIGR ENaC 362aa 5e-95
in ref transcript
This Hmm is designed from the vertebrate members of the ENaC family.
TIGR ENaC 103aa 0.002
in ref transcript
Changed!
TIGR ENaC 343aa 2e-92
in modified transcript
A1CF
refseq_ACF.F1 refseq_ACF.R1 113 137
NCBIGene 36.3 29974
Alternative 3-prime, size difference: 24
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_138933
cd RRM 69aa 1e-16
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
cd RRM 68aa 4e-13
in ref transcript
cd RRM 79aa 7e-09
in ref transcript
Changed!
TIGR hnRNP-R-Q 566aa 0.0
in ref transcript
Sequences in this subfamily include the human heterogeneous nuclear ribonucleoproteins (hnRNP) R, Q and APOBEC-1 complementation factor (aka APOBEC-1 stimulating protein). These proteins contain three RNA recognition domains (rrm: pfam00076) and a somewhat variable C-terminal domain.
COG COG0724 76aa 4e-09
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
COG COG0724 160aa 8e-08
in ref transcript
Changed!
TIGR hnRNP-R-Q 574aa 0.0
in modified transcript
A1CF
refseq_ACF.F3 refseq_ACF.R3 193 241
NCBIGene 36.3 29974
Single exon skipping, size difference: 48
Exclusion in 5'UTR
Reference transcript: NM_138932
cd RRM 69aa 1e-16
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
cd RRM 68aa 5e-13
in ref transcript
cd RRM 79aa 9e-09
in ref transcript
TIGR hnRNP-R-Q 593aa 0.0
in ref transcript
Sequences in this subfamily include the human heterogeneous nuclear ribonucleoproteins (hnRNP) R, Q and APOBEC-1 complementation factor (aka APOBEC-1 stimulating protein). These proteins contain three RNA recognition domains (rrm: pfam00076) and a somewhat variable C-terminal domain.
COG COG0724 76aa 7e-09
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
COG COG0724 160aa 2e-07
in ref transcript
A1CF
refseq_ACF.F5 refseq_ACF.R5 200 343
NCBIGene 36.3 29974
Single exon skipping, size difference: 143
Exclusion of the protein initiation site
Reference transcript: NM_138933
cd RRM 69aa 1e-16
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
cd RRM 68aa 4e-13
in ref transcript
cd RRM 79aa 7e-09
in ref transcript
Changed!
TIGR hnRNP-R-Q 566aa 0.0
in ref transcript
Sequences in this subfamily include the human heterogeneous nuclear ribonucleoproteins (hnRNP) R, Q and APOBEC-1 complementation factor (aka APOBEC-1 stimulating protein). These proteins contain three RNA recognition domains (rrm: pfam00076) and a somewhat variable C-terminal domain.
COG COG0724 76aa 4e-09
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
COG COG0724 160aa 8e-08
in ref transcript
Changed!
TIGR hnRNP-R-Q 585aa 0.0
in modified transcript
ACLY
refseq_ACLY.F1 refseq_ACLY.R1 141 171
NCBIGene 36.3 47
Single exon skipping, size difference: 30
Exclusion in the protein (no frameshift)
Reference transcript: NM_001096
cd CCL_ACL-C 238aa 2e-78
in ref transcript
Citryl-CoA lyase (CCL), the C-terminal portion of the single-subunit type ATP-citrate lyase (ACL) and the C-terminal portion of the large subunit of the two-subunit type ACL. CCL cleaves citryl-CoA (CiCoA) to acetyl-CoA (AcCoA) and oxaloacetate (OAA). ACL catalyzes an ATP- and a CoA- dependant cleavage of citrate to form AcCoA and OAA in a multistep reaction, the final step of which is likely to involve the cleavage of CiCoA to generate AcCoA and OAA. In fungi, yeast, plants, and animals ACL is cytosolic and generates AcCoA for lipogenesis. ACL may be required for fruiting body maturation in the filamentous fungus Sordaria macrospore. In several groups of autotrophic prokaryotes and archaea, ACL carries out the citrate-cleavage reaction of the reductive tricarboxylic acid (rTCA) cycle. In the family Aquificaceae this latter reaction in the rTCA cycle is carried out via a two enzyme system the second enzyme of which is CCL; the first enzyme is citryl-CoA synthetase (CCS) which is not included in this group. Chlorobium limicola ACL is an example of a two-subunit type ACL. It is comprised of a large and a small subunit; it has been speculated that the large subunit arose from a fusion of the small subunit of the two subunit CCS with CCL. The small ACL subunit is a homolog of the larger CCS subunit. Mammalian ACL is of the single-subunit type and may have arisen from the two-subunit ACL by another gene fusion. Mammalian ACLs are homotetramers; the ACLs of C. limicola and Arabidopsis are a heterooctomers (alpha4beta4). In cancer cells there is a shift in energy metabolism to aerobic glycolysis, the glycolytic end product pyruvate enters a truncated TCA cycle generating citrate which is cleaved in the cytosol by ACL. Inhibiting ACL limits the in-vitro proliferation and survival of these cancer cells, reduces in vivo tumor growth, and induces differentiation.
TIGR sucCoAalpha 257aa 3e-31
in ref transcript
ATP citrate lyases appear to form an outgroup.
pfam Citrate_synt 188aa 1e-12
in ref transcript
Citrate synthase.
TIGR sucCoAbeta 349aa 3e-12
in ref transcript
This family contains a split seen both in a maximum parsimony tree (which ignores gaps) and in the gap pattern near position 85 of the seed alignment. Eukaryotic and most bacterial sequences are longer and contain a region similar to TXQTXXXG. Sequences from Deinococcus radiodurans, Mycobacterium tuberculosis, Streptomyces coelicolor, and the Archaea are 6 amino acids shorter in that region and contain a motif resembling [KR]G.
COG SucD 256aa 8e-55
in ref transcript
Succinyl-CoA synthetase, alpha subunit [Energy production and conversion].
COG SucC 410aa 5e-46
in ref transcript
Succinyl-CoA synthetase, beta subunit [Energy production and conversion].
COG GltA 235aa 1e-29
in ref transcript
Citrate synthase [Energy production and conversion].
ACOT8
refseq_ACOT8.F1 refseq_ACOT8.R1 146 341
NCBIGene 36.3 10005
Single exon skipping, size difference: 195
Exclusion in the protein (no frameshift)
Reference transcript: NM_005469
Changed!
cd Thioesterase_II_repeat1 104aa 2e-33
in ref transcript
Thioesterase II (TEII) is thought to regenerate misprimed nonribosomal peptide synthetases (NRPSs) as well as modular polyketide synthases (PKSs) by hydrolyzing acetyl groups bound to the peptidyl carrier protein (PCP) and acyl carrier protein (ACP) domains, respectively. TEII has two tandem asymmetric hot dog folds that are structurally similar to one found in PaaI thioesterase, 4-hydroxybenzoyl-CoA thioesterase (4HBT) and beta-hydroxydecanoyl-ACP dehydratase and thus, the TEII monomer is equivalent to the homodimeric form of the latter three enzymes. Human TEII is expressed in T cells and has been shown to bind the product of the HIV-1 Nef gene.
cd Thioesterase_II_repeat2 91aa 3e-30
in ref transcript
Thioesterase II (TEII) is thought to regenerate misprimed nonribosomal peptide synthetases (NRPSs) as well as modular polyketide synthases (PKSs) by hydrolyzing acetyl groups bound to the peptidyl carrier protein (PCP) and acyl carrier protein (ACP) domains, respectively. TEII has two tandem asymmetric hot dog folds that are structurally similar to one found in PaaI thioesterase, 4-hydroxybenzoyl-CoA thioesterase (4HBT) and beta-hydroxydecanoyl-ACP dehydratase and thus, the TEII monomer is equivalent to the homodimeric form of the latter three enzymes. Human TEII is expressed in T cells and has been shown to bind the product of the HIV-1 Nef gene.
Changed!
TIGR tesB 275aa 1e-105
in ref transcript
Subunit: homotetramer.
Changed!
COG TesB 283aa 9e-78
in ref transcript
Acyl-CoA thioesterase [Lipid metabolism].
Changed!
cd Thioesterase_II_repeat1 29aa 9e-06
in modified transcript
Changed!
TIGR tesB 182aa 5e-60
in modified transcript
Changed!
TIGR tesB 29aa 2e-08
in modified transcript
Changed!
COG TesB 184aa 1e-46
in modified transcript
Changed!
COG TesB 33aa 1e-04
in modified transcript
ACOT8
refseq_ACOT8.F3 refseq_ACOT8.R3 234 368
NCBIGene 36.2 10005
Single exon skipping, size difference: 134
Exclusion in the protein causing a frameshift
Reference transcript: NM_005469
Changed!
cd Thioesterase_II_repeat1 104aa 2e-33
in ref transcript
Thioesterase II (TEII) is thought to regenerate misprimed nonribosomal peptide synthetases (NRPSs) as well as modular polyketide synthases (PKSs) by hydrolyzing acetyl groups bound to the peptidyl carrier protein (PCP) and acyl carrier protein (ACP) domains, respectively. TEII has two tandem asymmetric hot dog folds that are structurally similar to one found in PaaI thioesterase, 4-hydroxybenzoyl-CoA thioesterase (4HBT) and beta-hydroxydecanoyl-ACP dehydratase and thus, the TEII monomer is equivalent to the homodimeric form of the latter three enzymes. Human TEII is expressed in T cells and has been shown to bind the product of the HIV-1 Nef gene.
Changed!
cd Thioesterase_II_repeat2 91aa 3e-30
in ref transcript
Thioesterase II (TEII) is thought to regenerate misprimed nonribosomal peptide synthetases (NRPSs) as well as modular polyketide synthases (PKSs) by hydrolyzing acetyl groups bound to the peptidyl carrier protein (PCP) and acyl carrier protein (ACP) domains, respectively. TEII has two tandem asymmetric hot dog folds that are structurally similar to one found in PaaI thioesterase, 4-hydroxybenzoyl-CoA thioesterase (4HBT) and beta-hydroxydecanoyl-ACP dehydratase and thus, the TEII monomer is equivalent to the homodimeric form of the latter three enzymes. Human TEII is expressed in T cells and has been shown to bind the product of the HIV-1 Nef gene.
Changed!
TIGR tesB 275aa 1e-105
in ref transcript
Subunit: homotetramer.
Changed!
COG TesB 283aa 9e-78
in ref transcript
Acyl-CoA thioesterase [Lipid metabolism].
ACOT9
refseq_ACOT9.F2 refseq_ACOT9.R2 103 130
NCBIGene 36.3 23597
Single exon skipping, size difference: 27
Exclusion in the protein (no frameshift)
Reference transcript: NM_001037171
cd BFIT_BACH 121aa 1e-26
in ref transcript
Brown fat-inducible thioesterase (BFIT). Brain acyl-CoA hydrolase (BACH). These enzymes deacylate long-chain fatty acids by hydrolyzing acyl-CoA thioesters to free fatty acids and CoA-SH. Eukaryotic members of this family are expressed in brain, testis, and brown adipose tissues. The archeal and eukaryotic members of this family have two tandem copies of the conserved hot dog fold, while most bacterial members have only one copy.
cd BFIT_BACH 126aa 2e-17
in ref transcript
COG COG1607 134aa 6e-11
in ref transcript
Acyl-CoA hydrolase [Lipid metabolism].
COG COG1607 103aa 5e-05
in ref transcript
ACPT
refseq_ACPT.F1 refseq_ACPT.R1 134 413
NCBIGene 36.2 93650
Exon skipping and alternative 3-prime or 5-prime, size difference: 279
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_033068
Changed!
cd HP_HAP_like 285aa 1e-24
in ref transcript
Histidine phosphatase domain found in histidine acid phosphatases and phytases; contains a His residue which is phosphorylated during the reaction. Catalytic domain of HAP (histidine acid phosphatases) and phytases (myo-inositol hexakisphosphate phosphohydrolases). The conserved catalytic core of this domain contains a His residue which is phosphorylated in the reaction. Functions in this subgroup include roles in metabolism, signaling, or regulation, for example Escherichia coli glucose-1-phosphatase functions to scavenge glucose from glucose-1-phosphate and the signaling molecules inositol 1,3,4,5,6-pentakisphosphate (InsP5) and inositol hexakisphosphate (InsP6) are in vivo substrates for eukaryotic multiple inositol polyphosphate phosphatase 1 (Minpp1). Phytases scavenge phosphate from extracellular sources and are added to animal feed while prostatic acid phosphatase (PAP) has been used for many years as a serum marker for prostate cancer. Recently PAP has been shown in mouse models to suppress pain by functioning as an ecto-5prime-nucleotidase. In vivo it dephosphorylates extracellular adenosine monophosphate (AMP) generating adenosine,and leading to the activation of A1-adenosine receptors in dorsal spinal cord.
Changed!
pfam Acid_phosphat_A 307aa 1e-62
in ref transcript
Histidine acid phosphatase.
Changed!
PRK PRK10172 345aa 3e-06
in ref transcript
phosphoanhydride phosphorylase; Provisional.
Changed!
cd HP_HAP_like 192aa 4e-18
in modified transcript
Changed!
pfam Acid_phosphat_A 170aa 6e-19
in modified transcript
Changed!
pfam Acid_phosphat_A 63aa 6e-16
in modified transcript
Changed!
PRK PRK10172 175aa 0.008
in modified transcript
ACPT
refseq_ACPT.F3 refseq_ACPT.R3 231 364
NCBIGene 36.2 93650
Single exon skipping, size difference: 133
Exclusion in the protein causing a frameshift
Reference transcript: NM_033068
Changed!
cd HP_HAP_like 285aa 1e-24
in ref transcript
Histidine phosphatase domain found in histidine acid phosphatases and phytases; contains a His residue which is phosphorylated during the reaction. Catalytic domain of HAP (histidine acid phosphatases) and phytases (myo-inositol hexakisphosphate phosphohydrolases). The conserved catalytic core of this domain contains a His residue which is phosphorylated in the reaction. Functions in this subgroup include roles in metabolism, signaling, or regulation, for example Escherichia coli glucose-1-phosphatase functions to scavenge glucose from glucose-1-phosphate and the signaling molecules inositol 1,3,4,5,6-pentakisphosphate (InsP5) and inositol hexakisphosphate (InsP6) are in vivo substrates for eukaryotic multiple inositol polyphosphate phosphatase 1 (Minpp1). Phytases scavenge phosphate from extracellular sources and are added to animal feed while prostatic acid phosphatase (PAP) has been used for many years as a serum marker for prostate cancer. Recently PAP has been shown in mouse models to suppress pain by functioning as an ecto-5prime-nucleotidase. In vivo it dephosphorylates extracellular adenosine monophosphate (AMP) generating adenosine,and leading to the activation of A1-adenosine receptors in dorsal spinal cord.
Changed!
pfam Acid_phosphat_A 307aa 1e-62
in ref transcript
Histidine acid phosphatase.
Changed!
PRK PRK10172 345aa 3e-06
in ref transcript
phosphoanhydride phosphorylase; Provisional.
Changed!
cd HP_HAP_like 119aa 9e-23
in modified transcript
Changed!
pfam Acid_phosphat_A 183aa 2e-39
in modified transcript
Changed!
PRK PRK10172 109aa 0.002
in modified transcript
ACRV1
refseq_ACRV1.F2 refseq_ACRV1.R2 267 387
NCBIGene 36.3 56
Single exon skipping, size difference: 120
Exclusion in the protein (no frameshift)
Reference transcript: NM_001612
Changed!
cd LU 77aa 4e-05
in ref transcript
Ly-6 antigen / uPA receptor -like domain; occurs singly in GPI-linked cell-surface glycoproteins (Ly-6 family,CD59, thymocyte B cell antigen, Sgp-2) or as three-fold repeated domain in urokinase-type plasminogen activator receptor. Topology of these domains is similar to that of snake venom neurotoxins.
ACRV1
refseq_ACRV1.F3 refseq_ACRV1.R3 116 173
NCBIGene 36.3 56
Alternative 5-prime, size difference: 57
Exclusion in the protein (no frameshift)
Reference transcript: NM_001612
cd LU 77aa 4e-05
in ref transcript
Ly-6 antigen / uPA receptor -like domain; occurs singly in GPI-linked cell-surface glycoproteins (Ly-6 family,CD59, thymocyte B cell antigen, Sgp-2) or as three-fold repeated domain in urokinase-type plasminogen activator receptor. Topology of these domains is similar to that of snake venom neurotoxins.
ACSL3
refseq_ACSL3.F1 refseq_ACSL3.R1 249 356
NCBIGene 36.3 2181
Single exon skipping, size difference: 107
Exclusion in 5'UTR
Reference transcript: NM_004457
pfam AMP-binding 477aa 2e-82
in ref transcript
AMP-binding enzyme.
PTZ PTZ00216 617aa 1e-121
in ref transcript
acyl-CoA synthetase; Provisional.
ACSL4
refseq_ACSL4.F1 refseq_ACSL4.R1 137 449
NCBIGene 36.3 2182
Exon skipping and alternative 3-prime or 5-prime, size difference: 312
Exclusion in 5'UTR, Exclusion of the protein initiation site
Reference transcript: NM_022977
pfam AMP-binding 476aa 5e-82
in ref transcript
AMP-binding enzyme.
PTZ PTZ00216 585aa 1e-120
in ref transcript
acyl-CoA synthetase; Provisional.
PRK PRK07868 119aa 0.004
in ref transcript
acyl-CoA synthetase; Validated.
ACTL6A
refseq_ACTL6A.F1 refseq_ACTL6A.R1 124 172
NCBIGene 36.3 86
Alternative 5-prime, size difference: 48
Inclusion in the protein causing a new stop codon
Reference transcript: NM_004301
Changed!
cd ACTIN 415aa 1e-111
in ref transcript
Actin; An ubiquitous protein involved in the formation of filaments that are a major component of the cytoskeleton. Interaction with myosin provides the basis of muscular contraction and many aspects of cell motility. Each actin protomer binds one molecule of ATP and either calcium or magnesium ions. Actin exists as a monomer in low salt concentrations, but filaments form rapidly as salt concentration rises, with the consequent hydrolysis of ATP. Polymerization is regulated by so-called capping proteins. The ATPase domain of actin shares similarity with ATPase domains of hexokinase and hsp70 proteins.
Changed!
pfam Actin 421aa 1e-145
in ref transcript
Actin.
Changed!
PTZ PTZ00281 420aa 2e-70
in ref transcript
actin; Provisional.
ACTR3B
refseq_ACTR3B.F1 refseq_ACTR3B.R1 218 428
NCBIGene 36.3 57180
Multiple exon skipping, size difference: 210
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_020445
Changed!
cd ACTIN 405aa 1e-120
in ref transcript
Actin; An ubiquitous protein involved in the formation of filaments that are a major component of the cytoskeleton. Interaction with myosin provides the basis of muscular contraction and many aspects of cell motility. Each actin protomer binds one molecule of ATP and either calcium or magnesium ions. Actin exists as a monomer in low salt concentrations, but filaments form rapidly as salt concentration rises, with the consequent hydrolysis of ATP. Polymerization is regulated by so-called capping proteins. The ATPase domain of actin shares similarity with ATPase domains of hexokinase and hsp70 proteins.
Changed!
smart ACTIN 407aa 1e-129
in ref transcript
Actin. ACTIN subfamily of ACTIN/mreB/sugarkinase/Hsp70 superfamily.
Changed!
PTZ PTZ00280 415aa 0.0
in ref transcript
actin; Provisional.
Changed!
cd ACTIN 311aa 7e-91
in modified transcript
Changed!
smart ACTIN 312aa 4e-97
in modified transcript
Changed!
PTZ PTZ00280 314aa 1e-148
in modified transcript
Changed!
PTZ PTZ00280 31aa 1e-09
in modified transcript
ACYP1
refseq_ACYP1.F1 refseq_ACYP1.R1 189 268
NCBIGene 36.3 97
Single exon skipping, size difference: 79
Inclusion in the protein causing a frameshift
Reference transcript: NM_001107
Changed!
pfam Acylphosphatase 93aa 3e-24
in ref transcript
Acylphosphatase.
Changed!
COG AcyP 92aa 2e-11
in ref transcript
Acylphosphatases [Energy production and conversion].
Changed!
pfam Acylphosphatase 23aa 0.001
in modified transcript
ADAM22
refseq_ADAM22.F2 refseq_ADAM22.R2 156 243
NCBIGene 36.3 53616
Single exon skipping, size difference: 87
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_021723
cd ZnMc_adamalysin_II_like 198aa 4e-48
in ref transcript
Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
pfam Reprolysin 200aa 1e-63
in ref transcript
Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endopeptidases, but there are also some mammalian proteins and fertilin. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes.
smart ACR 141aa 2e-40
in ref transcript
ADAM Cysteine-Rich Domain.
pfam Disintegrin 77aa 4e-25
in ref transcript
Disintegrin.
pfam Pep_M12B_propep 104aa 2e-22
in ref transcript
Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not well aligned.
ADAM22
refseq_ADAM22.F3 refseq_ADAM22.R3 182 290
NCBIGene 36.3 53616
Single exon skipping, size difference: 108
Exclusion in the protein (no frameshift)
Reference transcript: NM_021723
cd ZnMc_adamalysin_II_like 198aa 4e-48
in ref transcript
Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
pfam Reprolysin 200aa 1e-63
in ref transcript
Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endopeptidases, but there are also some mammalian proteins and fertilin. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes.
smart ACR 141aa 2e-40
in ref transcript
ADAM Cysteine-Rich Domain.
pfam Disintegrin 77aa 4e-25
in ref transcript
Disintegrin.
pfam Pep_M12B_propep 104aa 2e-22
in ref transcript
Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not well aligned.
ADAM33
refseq_ADAM33.F1 refseq_ADAM33.R1 105 183
NCBIGene 36.3 80332
Single exon skipping, size difference: 78
Exclusion in the protein (no frameshift)
Reference transcript: NM_025220
cd ZnMc_adamalysin_II_like 198aa 7e-63
in ref transcript
Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
pfam Reprolysin 200aa 8e-68
in ref transcript
Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endopeptidases, but there are also some mammalian proteins and fertilin. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes.
Changed!
smart ACR 142aa 1e-35
in ref transcript
ADAM Cysteine-Rich Domain.
pfam Disintegrin 76aa 2e-29
in ref transcript
Disintegrin.
pfam Pep_M12B_propep 70aa 3e-21
in ref transcript
Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not well aligned.
Changed!
smart ACR 141aa 8e-35
in modified transcript
ADAMTS13
refseq_ADAMTS13.F2 refseq_ADAMTS13.R2 135 303
NCBIGene 36.3 11093
Alternative 5-prime, size difference: 168
Exclusion in the protein (no frameshift)
Reference transcript: NM_139025
cd ZnMc_ADAMTS_like 204aa 8e-68
in ref transcript
Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
pfam Reprolysin 206aa 1e-11
in ref transcript
Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endopeptidases, but there are also some mammalian proteins and fertilin. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes.
smart TSP1 53aa 3e-11
in ref transcript
Thrombospondin type 1 repeats. Type 1 repeats in thrombospondin-1 bind and activate TGF-beta.
ADAMTS13
refseq_ADAMTS13.F3 refseq_ADAMTS13.R3 193 432
NCBIGene 36.2 11093
Alternative 3-prime, size difference: 239
Inclusion in the protein causing a new stop codon
Reference transcript: NM_139025
Changed!
cd ZnMc_ADAMTS_like 204aa 8e-68
in ref transcript
Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Changed!
pfam Reprolysin 206aa 1e-11
in ref transcript
Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endopeptidases, but there are also some mammalian proteins and fertilin. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes.
Changed!
smart TSP1 53aa 3e-11
in ref transcript
Thrombospondin type 1 repeats. Type 1 repeats in thrombospondin-1 bind and activate TGF-beta.
Changed!
cd ZnMc_ADAMTS_like 183aa 4e-59
in modified transcript
Changed!
pfam Reprolysin 171aa 6e-09
in modified transcript
ADAMTS13
refseq_ADAMTS13.F5 refseq_ADAMTS13.R5 127 207
NCBIGene 36.2 11093
Alternative 5-prime, size difference: 80
Inclusion in the protein causing a new stop codon
Reference transcript: NM_139025
cd ZnMc_ADAMTS_like 204aa 8e-68
in ref transcript
Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
pfam Reprolysin 206aa 1e-11
in ref transcript
Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endopeptidases, but there are also some mammalian proteins and fertilin. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes.
Changed!
smart TSP1 53aa 3e-11
in ref transcript
Thrombospondin type 1 repeats. Type 1 repeats in thrombospondin-1 bind and activate TGF-beta.
ADAMTSL1
refseq_ADAMTSL1.F2 refseq_ADAMTSL1.R2 260 311
NCBIGene 36.2 92949
Single exon skipping, size difference: 51
Exclusion in the protein (no frameshift)
Reference transcript: NM_139238
smart TSP1 47aa 5e-05
in ref transcript
Thrombospondin type 1 repeats. Type 1 repeats in thrombospondin-1 bind and activate TGF-beta.
ADAR
refseq_ADAR.F2 refseq_ADAR.R2 138 195
NCBIGene 36.3 103
Alternative 3-prime, size difference: 57
Exclusion in the protein (no frameshift)
Reference transcript: NM_001111
cd DSRM 67aa 5e-13
in ref transcript
Double-stranded RNA binding motif. Binding is not sequence specific but is highly specific for double stranded RNA. Found in a variety of proteins including dsRNA dependent protein kinase PKR, RNA helicases, Drosophila staufen protein, E. coli RNase III, RNases H1, and dsRNA dependent adenosine deaminases.
cd DSRM 67aa 1e-10
in ref transcript
cd DSRM 50aa 0.002
in ref transcript
smart ADEAMc 384aa 1e-149
in ref transcript
tRNA-specific and double-stranded RNA adenosine deaminase (RNA-specific editase).
pfam z-alpha 65aa 2e-19
in ref transcript
Adenosine deaminase z-alpha domain. This family consists of the N-terminus and thus the z-alpha domain of double-stranded RNA-specific adenosine deaminase (ADAR), an RNA- editing enzyme. The z-alpha domain is a Z-DNA binding domain, and binding of this region to B-DNA has been shown to be disfavoured by steric hindrance.
pfam z-alpha 67aa 4e-19
in ref transcript
smart DSRM 66aa 1e-15
in ref transcript
Double-stranded RNA binding motif.
smart DSRM 66aa 4e-11
in ref transcript
smart DSRM 50aa 2e-06
in ref transcript
PRK rnc 59aa 2e-05
in ref transcript
ribonuclease III; Reviewed.
COG Rnc 67aa 5e-05
in ref transcript
dsRNA-specific ribonuclease [Transcription].
ADAR
refseq_ADAR.F3 refseq_ADAR.R3 143 221
NCBIGene 36.3 103
Alternative 5-prime, size difference: 78
Exclusion in the protein (no frameshift)
Reference transcript: NM_001111
cd DSRM 67aa 5e-13
in ref transcript
Double-stranded RNA binding motif. Binding is not sequence specific but is highly specific for double stranded RNA. Found in a variety of proteins including dsRNA dependent protein kinase PKR, RNA helicases, Drosophila staufen protein, E. coli RNase III, RNases H1, and dsRNA dependent adenosine deaminases.
cd DSRM 67aa 1e-10
in ref transcript
cd DSRM 50aa 0.002
in ref transcript
smart ADEAMc 384aa 1e-149
in ref transcript
tRNA-specific and double-stranded RNA adenosine deaminase (RNA-specific editase).
pfam z-alpha 65aa 2e-19
in ref transcript
Adenosine deaminase z-alpha domain. This family consists of the N-terminus and thus the z-alpha domain of double-stranded RNA-specific adenosine deaminase (ADAR), an RNA- editing enzyme. The z-alpha domain is a Z-DNA binding domain, and binding of this region to B-DNA has been shown to be disfavoured by steric hindrance.
pfam z-alpha 67aa 4e-19
in ref transcript
smart DSRM 66aa 1e-15
in ref transcript
Double-stranded RNA binding motif.
smart DSRM 66aa 4e-11
in ref transcript
smart DSRM 50aa 2e-06
in ref transcript
PRK rnc 59aa 2e-05
in ref transcript
ribonuclease III; Reviewed.
Changed!
COG Rnc 67aa 5e-05
in ref transcript
dsRNA-specific ribonuclease [Transcription].
Changed!
COG Rnc 76aa 5e-05
in modified transcript
ADARB1
refseq_ADARB1.F2 refseq_ADARB1.R2 112 232
NCBIGene 36.3 104
Single exon skipping, size difference: 120
Exclusion in the protein (no frameshift)
Reference transcript: NM_015833
cd DSRM 48aa 5e-06
in ref transcript
Double-stranded RNA binding motif. Binding is not sequence specific but is highly specific for double stranded RNA. Found in a variety of proteins including dsRNA dependent protein kinase PKR, RNA helicases, Drosophila staufen protein, E. coli RNase III, RNases H1, and dsRNA dependent adenosine deaminases.
cd DSRM 46aa 3e-05
in ref transcript
Changed!
smart ADEAMc 417aa 1e-133
in ref transcript
tRNA-specific and double-stranded RNA adenosine deaminase (RNA-specific editase).
smart DSRM 48aa 2e-07
in ref transcript
Double-stranded RNA binding motif.
smart DSRM 45aa 4e-06
in ref transcript
PRK rnc 48aa 0.008
in ref transcript
ribonuclease III; Reviewed.
Changed!
smart ADEAMc 377aa 1e-137
in modified transcript
ADARB1
refseq_ADARB1.F4 refseq_ADARB1.R4 180 298
NCBIGene 36.3 104
Single exon skipping, size difference: 118
Inclusion in 5'UTR
Reference transcript: NM_015833
cd DSRM 48aa 5e-06
in ref transcript
Double-stranded RNA binding motif. Binding is not sequence specific but is highly specific for double stranded RNA. Found in a variety of proteins including dsRNA dependent protein kinase PKR, RNA helicases, Drosophila staufen protein, E. coli RNase III, RNases H1, and dsRNA dependent adenosine deaminases.
cd DSRM 46aa 3e-05
in ref transcript
smart ADEAMc 417aa 1e-133
in ref transcript
tRNA-specific and double-stranded RNA adenosine deaminase (RNA-specific editase).
smart DSRM 48aa 2e-07
in ref transcript
Double-stranded RNA binding motif.
smart DSRM 45aa 4e-06
in ref transcript
PRK rnc 48aa 0.008
in ref transcript
ribonuclease III; Reviewed.
ADCY6
refseq_ADCY6.F1 refseq_ADCY6.R1 235 394
NCBIGene 36.3 112
Single exon skipping, size difference: 159
Exclusion in the protein (no frameshift)
Reference transcript: NM_015270
pfam Guanylate_cyc 183aa 6e-59
in ref transcript
Adenylate and Guanylate cyclase catalytic domain.
pfam Guanylate_cyc 194aa 8e-57
in ref transcript
pfam DUF1053 110aa 3e-38
in ref transcript
Domain of Unknown Function (DUF1053). This domain is found in Adenylate cyclases.
Class II Aldolase and Adducin head (N-terminal) domain. Aldolases are ubiquitous enzymes catalyzing central steps of carbohydrate metabolism. Based on enzymatic mechanisms, this superfamily has been divided into two distinct classes (Class I and II). Class II enzymes are further divided into two sub-classes A and B. This family includes class II A aldolases and adducins which has not been ascribed any enzymatic function. Members of this class are primarily bacterial and eukaryotic in origin and include L-fuculose-1-phosphate, L-rhamnulose-1-phosphate aldolases and L-ribulose-5-phosphate 4-epimerases. They all share the ability to promote carbon-carbon bond cleavage and stabilize enolate intermediates using divalent cations.
pfam Aldolase_II 183aa 1e-50
in ref transcript
Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function.
PRK PRK07044 247aa 8e-82
in ref transcript
aldolase_II super-family; Provisional.
ADD1
refseq_ADD1.F3 refseq_ADD1.R3 161 195
NCBIGene 36.3 118
Single exon skipping, size difference: 34
Inclusion in the protein causing a new stop codon
Reference transcript: NM_014189
cd Aldolase_II 211aa 2e-52
in ref transcript
Class II Aldolase and Adducin head (N-terminal) domain. Aldolases are ubiquitous enzymes catalyzing central steps of carbohydrate metabolism. Based on enzymatic mechanisms, this superfamily has been divided into two distinct classes (Class I and II). Class II enzymes are further divided into two sub-classes A and B. This family includes class II A aldolases and adducins which has not been ascribed any enzymatic function. Members of this class are primarily bacterial and eukaryotic in origin and include L-fuculose-1-phosphate, L-rhamnulose-1-phosphate aldolases and L-ribulose-5-phosphate 4-epimerases. They all share the ability to promote carbon-carbon bond cleavage and stabilize enolate intermediates using divalent cations.
pfam Aldolase_II 183aa 1e-50
in ref transcript
Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function.
PRK PRK07044 247aa 8e-82
in ref transcript
aldolase_II super-family; Provisional.
ADD2
refseq_ADD2.F1 refseq_ADD2.R1 226 312
NCBIGene 36.3 119
Single exon skipping, size difference: 86
Inclusion in the protein causing a frameshift
Reference transcript: NM_001617
cd Aldolase_II 211aa 1e-54
in ref transcript
Class II Aldolase and Adducin head (N-terminal) domain. Aldolases are ubiquitous enzymes catalyzing central steps of carbohydrate metabolism. Based on enzymatic mechanisms, this superfamily has been divided into two distinct classes (Class I and II). Class II enzymes are further divided into two sub-classes A and B. This family includes class II A aldolases and adducins which has not been ascribed any enzymatic function. Members of this class are primarily bacterial and eukaryotic in origin and include L-fuculose-1-phosphate, L-rhamnulose-1-phosphate aldolases and L-ribulose-5-phosphate 4-epimerases. They all share the ability to promote carbon-carbon bond cleavage and stabilize enolate intermediates using divalent cations.
pfam Aldolase_II 183aa 9e-45
in ref transcript
Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function.
PRK PRK07044 250aa 2e-61
in ref transcript
aldolase_II super-family; Provisional.
ADD3
refseq_ADD3.F2 refseq_ADD3.R2 284 380
NCBIGene 36.3 120
Single exon skipping, size difference: 96
Exclusion in the protein (no frameshift)
Reference transcript: NM_016824
cd Aldolase_II 210aa 6e-40
in ref transcript
Class II Aldolase and Adducin head (N-terminal) domain. Aldolases are ubiquitous enzymes catalyzing central steps of carbohydrate metabolism. Based on enzymatic mechanisms, this superfamily has been divided into two distinct classes (Class I and II). Class II enzymes are further divided into two sub-classes A and B. This family includes class II A aldolases and adducins which has not been ascribed any enzymatic function. Members of this class are primarily bacterial and eukaryotic in origin and include L-fuculose-1-phosphate, L-rhamnulose-1-phosphate aldolases and L-ribulose-5-phosphate 4-epimerases. They all share the ability to promote carbon-carbon bond cleavage and stabilize enolate intermediates using divalent cations.
pfam Aldolase_II 183aa 1e-43
in ref transcript
Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function.
PRK PRK07044 249aa 5e-60
in ref transcript
aldolase_II super-family; Provisional.
ADNP
refseq_ADNP.F1 refseq_ADNP.R1 116 291
NCBIGene 36.3 23394
Single exon skipping, size difference: 175
Exclusion in 5'UTR
Reference transcript: NM_015339
cd homeodomain 46aa 3e-06
in ref transcript
Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
smart HOX 43aa 5e-08
in ref transcript
Homeodomain. DNA-binding factors that are involved in the transcriptional regulation of key developmental processes.
AFG3L1
refseq_AFG3L1.F1 refseq_AFG3L1.R1 327 393
NCBIGene 36.2 172
Alternative 3-prime, size difference: 66
Inclusion in 3'UTR
Reference transcript: NM_001132
cd AAA 50aa 7e-08
in ref transcript
The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
TIGR FtsH_fam 46aa 5e-20
in ref transcript
HflB(FtsH) is a pleiotropic protein required for correct cell division in bacteria. It has ATP-dependent zinc metalloprotease activity. It was formerly designated cell division protein FtsH.
CHL ftsH 46aa 3e-19
in ref transcript
cell division protein; Validated.
PRK PRK08116 102aa 4e-04
in ref transcript
hypothetical protein; Validated.
AFG3L1
refseq_AFG3L1.F3 refseq_AFG3L1.R3 112 212
NCBIGene 36.2 172
Single exon skipping, size difference: 100
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001132
Changed!
cd AAA 50aa 7e-08
in ref transcript
The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
Changed!
TIGR FtsH_fam 46aa 5e-20
in ref transcript
HflB(FtsH) is a pleiotropic protein required for correct cell division in bacteria. It has ATP-dependent zinc metalloprotease activity. It was formerly designated cell division protein FtsH.
Changed!
CHL ftsH 46aa 3e-19
in ref transcript
cell division protein; Validated.
Changed!
PRK PRK08116 102aa 4e-04
in ref transcript
hypothetical protein; Validated.
AFG3L1
refseq_AFG3L1.F6 refseq_AFG3L1.R6 208 281
NCBIGene 36.2 172
Alternative 3-prime, size difference: 73
Inclusion in 5'UTR
Reference transcript: NM_001132
cd AAA 50aa 7e-08
in ref transcript
The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
TIGR FtsH_fam 46aa 5e-20
in ref transcript
HflB(FtsH) is a pleiotropic protein required for correct cell division in bacteria. It has ATP-dependent zinc metalloprotease activity. It was formerly designated cell division protein FtsH.
CHL ftsH 46aa 3e-19
in ref transcript
cell division protein; Validated.
PRK PRK08116 102aa 4e-04
in ref transcript
hypothetical protein; Validated.
AFTPH
refseq_AFTPH.F1 refseq_AFTPH.R1 147 231
NCBIGene 36.3 54812
Single exon skipping, size difference: 84
Exclusion in the protein (no frameshift)
Reference transcript: NM_203437
ACAN
refseq_AGC1.F1 refseq_AGC1.R1 166 349
NCBIGene 36.3 176
Single exon skipping, size difference: 183
Exclusion in the protein (no frameshift)
Reference transcript: NM_013227
cd CLECT_CSPGs 124aa 9e-79
in ref transcript
CLECT_CSPGs: C-type lectin-like domain (CTLD) of the type found in chondroitin sulfate proteoglycan core proteins (CSPGs) in human and chicken aggrecan, frog brevican, and zebra fish dermacan. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with hyaluronan (HA). These aggregates contribute to the tissue's load bearing properties. Aggregates having other CSPGs substituting for aggrecan may contribute to the structural integrity of many different tissues. Xenopus brevican is expressed in the notochord and the brain during early embryogenesis. Zebra fish dermacan is expressed in dermal bones and may play a role in dermal bone development. CSPGs do contain LINK domain(s) which bind HA. These LINK domains are considered by one classification system to be a variety of CTLD, but are omitted from this hierarchical classification based on insignificant sequence similarity.
cd Link_domain_CSPGs_modules_1_3 95aa 5e-46
in ref transcript
Link_domain_CSPGs_modules_1_3; this extracellular link domain is found in the first and third link modules of the chondroitin sulfate proteoglycan core protein (CSPG) aggrecan. In addition, it is found in the first link module of three other CSPGs: versican, neurocan, and brevican. The link domain is a hyaluronan (HA)-binding domain. CSPGs are characterized by an N-terminal globular domain (G1 domain) containing two contiguous link modules (modules 1 and 2). Both link modules of the G1 domain of aggrecan are involved in interaction with HA. In addition, aggrecan contains a second globular domain (G2) which contains link modules 3 and 4. G2 appears to lack HA-binding activity. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates having other CSPGs substituting for aggrecan may contribute to the structural integrity of many different tissues. Members of the vertebrate HPLN (hyaluronan/HA and proteoglycan binding link) protein family are physically linked adjacent to CSPG genes.
cd Link_domain_CSPGs_modules_2_4 96aa 5e-45
in ref transcript
Link_domain_CSPGs_modules_2_4; this link domain is found in the second and fourth link modules of the chondroitin sulfate proteoglycan core protein (CSPG) aggrecan and, in the second link module of three other CSPGs: versican, neurocan, and brevican. The link domain is a hyaluronan (HA)-binding domain. CSPGs are characterized by an N-terminal globular domain (G1 domain) containing two contiguous link modules (modules 1 and 2). Both link modules of the G1 domain of aggrecan are involved in interaction with HA. Aggrecan in addition contains a second globular domain (G2) having link modules 3 and 4 which lack HA-binding activity. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates having other CSPGs substituting for aggregan may contribute to the structural integrity of many different tissues. Members of the vertebrate HPLN (hyaluronan/HA and proteoglycan binding link) protein family are physically linked adjacent to CSPG genes.
cd Link_domain_CSPGs_modules_2_4 96aa 4e-44
in ref transcript
cd Link_domain_CSPGs_modules_1_3 95aa 4e-40
in ref transcript
Changed!
cd CCP 57aa 6e-08
in ref transcript
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
cd EGF_CA 31aa 3e-04
in ref transcript
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
pfam Xlink 95aa 2e-40
in ref transcript
Extracellular link domain.
pfam Xlink 95aa 9e-40
in ref transcript
pfam Xlink 96aa 1e-36
in ref transcript
smart CLECT 122aa 6e-36
in ref transcript
C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.
smart LINK 99aa 7e-35
in ref transcript
Link (Hyaluronan-binding).
pfam V-set 120aa 9e-09
in ref transcript
Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
Changed!
smart CCP 57aa 3e-08
in ref transcript
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.
pfam EGF 31aa 1e-04
in ref transcript
EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.
ACAN
refseq_AGC1.F3 refseq_AGC1.R3 104 218
NCBIGene 36.3 176
Single exon skipping, size difference: 114
Exclusion in the protein (no frameshift)
Reference transcript: NM_013227
cd CLECT_CSPGs 124aa 9e-79
in ref transcript
CLECT_CSPGs: C-type lectin-like domain (CTLD) of the type found in chondroitin sulfate proteoglycan core proteins (CSPGs) in human and chicken aggrecan, frog brevican, and zebra fish dermacan. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with hyaluronan (HA). These aggregates contribute to the tissue's load bearing properties. Aggregates having other CSPGs substituting for aggrecan may contribute to the structural integrity of many different tissues. Xenopus brevican is expressed in the notochord and the brain during early embryogenesis. Zebra fish dermacan is expressed in dermal bones and may play a role in dermal bone development. CSPGs do contain LINK domain(s) which bind HA. These LINK domains are considered by one classification system to be a variety of CTLD, but are omitted from this hierarchical classification based on insignificant sequence similarity.
cd Link_domain_CSPGs_modules_1_3 95aa 5e-46
in ref transcript
Link_domain_CSPGs_modules_1_3; this extracellular link domain is found in the first and third link modules of the chondroitin sulfate proteoglycan core protein (CSPG) aggrecan. In addition, it is found in the first link module of three other CSPGs: versican, neurocan, and brevican. The link domain is a hyaluronan (HA)-binding domain. CSPGs are characterized by an N-terminal globular domain (G1 domain) containing two contiguous link modules (modules 1 and 2). Both link modules of the G1 domain of aggrecan are involved in interaction with HA. In addition, aggrecan contains a second globular domain (G2) which contains link modules 3 and 4. G2 appears to lack HA-binding activity. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates having other CSPGs substituting for aggrecan may contribute to the structural integrity of many different tissues. Members of the vertebrate HPLN (hyaluronan/HA and proteoglycan binding link) protein family are physically linked adjacent to CSPG genes.
cd Link_domain_CSPGs_modules_2_4 96aa 5e-45
in ref transcript
Link_domain_CSPGs_modules_2_4; this link domain is found in the second and fourth link modules of the chondroitin sulfate proteoglycan core protein (CSPG) aggrecan and, in the second link module of three other CSPGs: versican, neurocan, and brevican. The link domain is a hyaluronan (HA)-binding domain. CSPGs are characterized by an N-terminal globular domain (G1 domain) containing two contiguous link modules (modules 1 and 2). Both link modules of the G1 domain of aggrecan are involved in interaction with HA. Aggrecan in addition contains a second globular domain (G2) having link modules 3 and 4 which lack HA-binding activity. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates having other CSPGs substituting for aggregan may contribute to the structural integrity of many different tissues. Members of the vertebrate HPLN (hyaluronan/HA and proteoglycan binding link) protein family are physically linked adjacent to CSPG genes.
cd Link_domain_CSPGs_modules_2_4 96aa 4e-44
in ref transcript
cd Link_domain_CSPGs_modules_1_3 95aa 4e-40
in ref transcript
cd CCP 57aa 6e-08
in ref transcript
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
Changed!
cd EGF_CA 31aa 3e-04
in ref transcript
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
pfam Xlink 95aa 2e-40
in ref transcript
Extracellular link domain.
pfam Xlink 95aa 9e-40
in ref transcript
pfam Xlink 96aa 1e-36
in ref transcript
smart CLECT 122aa 6e-36
in ref transcript
C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.
smart LINK 99aa 7e-35
in ref transcript
Link (Hyaluronan-binding).
pfam V-set 120aa 9e-09
in ref transcript
Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
smart CCP 57aa 3e-08
in ref transcript
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.
Changed!
pfam EGF 31aa 1e-04
in ref transcript
EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.
AGER
refseq_AGER.F1 refseq_AGER.R1 120 233
NCBIGene 36.3 177
Exon skipping and alternative 3-prime or 5-prime, size difference: 113
Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_001136
Changed!
cd IGcam 66aa 3e-06
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IG 77aa 0.004
in ref transcript
Immunoglobulin domain family; members are components of immunoglobulins, neuroglia, cell surface glycoproteins, such as, T-cell receptors, CD2, CD4, CD8, and membrane glycoproteins, such as, butyrophilin and chondroitin sulfate proteoglycan core protein. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
pfam C2-set_2 96aa 4e-18
in ref transcript
CD80-like C2-set immunoglobulin domain. These domains belong to the immunoglobulin superfamily.
smart IG_like 95aa 1e-07
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
Changed!
smart IGc2 58aa 4e-07
in ref transcript
Immunoglobulin C-2 Type.
Changed!
cd IGcam 27aa 0.008
in modified transcript
Changed!
smart IG_like 26aa 3e-04
in modified transcript
AGER
refseq_AGER.F3 refseq_AGER.R3 100 142
NCBIGene 36.3 177
Alternative 3-prime, size difference: 42
Exclusion in the protein (no frameshift)
Reference transcript: NM_001136
cd IGcam 66aa 3e-06
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
Changed!
cd IG 77aa 0.004
in ref transcript
Immunoglobulin domain family; members are components of immunoglobulins, neuroglia, cell surface glycoproteins, such as, T-cell receptors, CD2, CD4, CD8, and membrane glycoproteins, such as, butyrophilin and chondroitin sulfate proteoglycan core protein. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
pfam C2-set_2 96aa 4e-18
in ref transcript
CD80-like C2-set immunoglobulin domain. These domains belong to the immunoglobulin superfamily.
Changed!
smart IG_like 95aa 1e-07
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
smart IGc2 58aa 4e-07
in ref transcript
Immunoglobulin C-2 Type.
Changed!
cd IG 63aa 6e-04
in modified transcript
Changed!
smart IG_like 81aa 6e-08
in modified transcript
AGL
refseq_AGL.F1 refseq_AGL.R1 117 260
NCBIGene 36.3 178
Single exon skipping, size difference: 143
Inclusion in the protein causing a new stop codon
Reference transcript: NM_000642
Changed!
TIGR glyc_debranch 1511aa 0.0
in ref transcript
glycogen debranching enzyme possesses two different catalytic activities; oligo-1,4-->1,4-glucantransferase (EC 2.4.1.25) and amylo-1,6-glucosidase (EC 3.2.1.33). Site directed mutagenesis studies in S. cerevisiae indicate that the transferase and glucosidase activities are independent and located in different regions of the polypeptide chain. Proteins in this model belong to the larger alpha-amylase family. The model covers eukaryotic proteins with a seed composed of human, nematode and yeast sequences. Yeast seed sequence is well characterized. The model is quite rigorous; either query sequence yields large bit score or it fails to hit the model altogether. There doesn't appear to be any middle ground.
Changed!
COG GDB1 462aa 5e-33
in ref transcript
Glycogen debranching enzyme [Carbohydrate transport and metabolism].
Changed!
COG AmyA 175aa 8e-05
in ref transcript
Glycosidases [Carbohydrate transport and metabolism].
AGL
refseq_AGL.F2 refseq_AGL.R2 112 392
NCBIGene 36.3 178
Alternative 5-prime, size difference: 280
Exclusion in 5'UTR
Reference transcript: NM_000028
TIGR glyc_debranch 1511aa 0.0
in ref transcript
glycogen debranching enzyme possesses two different catalytic activities; oligo-1,4-->1,4-glucantransferase (EC 2.4.1.25) and amylo-1,6-glucosidase (EC 3.2.1.33). Site directed mutagenesis studies in S. cerevisiae indicate that the transferase and glucosidase activities are independent and located in different regions of the polypeptide chain. Proteins in this model belong to the larger alpha-amylase family. The model covers eukaryotic proteins with a seed composed of human, nematode and yeast sequences. Yeast seed sequence is well characterized. The model is quite rigorous; either query sequence yields large bit score or it fails to hit the model altogether. There doesn't appear to be any middle ground.
COG GDB1 462aa 5e-33
in ref transcript
Glycogen debranching enzyme [Carbohydrate transport and metabolism].
COG AmyA 175aa 8e-05
in ref transcript
Glycosidases [Carbohydrate transport and metabolism].
AGPAT4
refseq_AGPAT4.F1 refseq_AGPAT4.R1 102 239
NCBIGene 36.2 56895
Single exon skipping, size difference: 137
Inclusion in the protein causing a new stop codon
Reference transcript: NM_020133
Changed!
smart PlsC 122aa 1e-17
in ref transcript
Phosphate acyltransferases. Function in phospholipid biosynthesis and have either glycerolphosphate, 1-acylglycerolphosphate, or 2-acylglycerolphosphoethanolamine acyltransferase activities. Tafazzin, the product of the gene mutated in patients with Barth syndrome, is a member of this family.
Changed!
pfam AGTRAP 159aa 1e-78
in ref transcript
Angiotensin II, type I receptor-associated protein (AGTRAP). This family consists of several angiotensin II, type I receptor-associated protein (AGTRAP) sequences. AGTRAP is known to interact specifically with the carboxyl-terminal cytoplasmic region of the angiotensin II type 1 (AT(1)) receptor to regulate different aspects of AT(1) receptor physiology. The function of this family is unclear.
Changed!
pfam AGTRAP 152aa 4e-75
in modified transcript
AGTRAP
refseq_AGTRAP.F2 refseq_AGTRAP.R2 140 237
NCBIGene 36.3 57085
Single exon skipping, size difference: 97
Inclusion in the protein causing a frameshift
Reference transcript: NM_020350
Changed!
pfam AGTRAP 159aa 1e-78
in ref transcript
Angiotensin II, type I receptor-associated protein (AGTRAP). This family consists of several angiotensin II, type I receptor-associated protein (AGTRAP) sequences. AGTRAP is known to interact specifically with the carboxyl-terminal cytoplasmic region of the angiotensin II type 1 (AT(1)) receptor to regulate different aspects of AT(1) receptor physiology. The function of this family is unclear.
Changed!
pfam AGTRAP 20aa 2e-06
in modified transcript
AIFM3
refseq_AIFL.F1 refseq_AIFL.R1 110 131
NCBIGene 36.3 150209
Single exon skipping, size difference: 21
Exclusion in the protein (no frameshift)
Reference transcript: NM_144704
cd Rieske_AIFL_N 95aa 2e-41
in ref transcript
AIFL (apoptosis-inducing factor like) family, N-terminal Rieske domain; members of this family show similarity to human AIFL, containing an N-terminal Rieske domain and a C-terminal pyridine nucleotide-disulfide oxidoreductase domain (Pyr_redox). The Rieske domain is a [2Fe-2S] cluster binding domain involved in electron transfer. AIFL shares 35% homology with human AIF (apoptosis-inducing factor), mainly in the Pyr_redox domain. AIFL is predominantly localized to the mitochondria. AIFL induces apoptosis in a caspase-dependent manner.
pfam Pyr_redox_2 278aa 6e-43
in ref transcript
Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain.
pfam Rieske 88aa 7e-14
in ref transcript
Rieske [2Fe-2S] domain. The rieske domain has a [2Fe-2S] centre. Two conserved cysteines that one Fe ion while the other Fe ion is coordinated by two conserved histidines.
Ferredoxin subunits of nitrite reductase and ring-hydroxylating dioxygenases [Inorganic ion transport and metabolism / General function prediction only].
AIPL1
refseq_AIPL1.F2 refseq_AIPL1.R2 136 325
NCBIGene 36.3 23746
Single exon skipping, size difference: 189
Exclusion in the protein (no frameshift)
Reference transcript: NM_014336
cd TPR 117aa 7e-04
in ref transcript
Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
pfam FKBP_C 61aa 0.002
in ref transcript
FKBP-type peptidyl-prolyl cis-trans isomerase.
AIPL1
refseq_AIPL1.F4 refseq_AIPL1.R4 186 366
NCBIGene 36.3 23746
Single exon skipping, size difference: 180
Exclusion in the protein (no frameshift)
Reference transcript: NM_014336
cd TPR 117aa 7e-04
in ref transcript
Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
Changed!
pfam FKBP_C 61aa 0.002
in ref transcript
FKBP-type peptidyl-prolyl cis-trans isomerase.
AKAP14
refseq_AKAP14.F1 refseq_AKAP14.R1 144 324
NCBIGene 36.3 158798
Single exon skipping, size difference: 180
Exclusion in the protein (no frameshift)
Reference transcript: NM_178813
AKAP7
refseq_AKAP7.F1 refseq_AKAP7.R1 267 336
NCBIGene 36.3 9465
Single exon skipping, size difference: 69
Exclusion in the protein (no frameshift)
Reference transcript: NM_138633
pfam AKAP7_RIRII_bdg 61aa 2e-19
in ref transcript
PKA-RI-RII subunit binding domain of A-kinase anchor protein. AKAP7_RIRII_bdg is the C-terminal domain of the cyclic AMP-dependent protein kinase A, PKA, anchor protein AKAP7. This protein anchors PKA, for its role in regulating PKA-mediated gene transcription in both somatic cells and oocytes, by binding to its regulatory subunits, RI and RII, hence being known as a dual-specific AKAP. The 25 crucial amino acids of RII-binding domains in general form structurally conserved amphipathic helices with unrelated sequences; hydrophobic amino acid residues form the backbone of the interaction and hydrogen bond- and salt-bridge-forming amino acid residues increase the affinity of the interaction. The N-terminus, of family AKAP7_NLS, carries the nuclear localisation signal.
AKAP9
refseq_AKAP9.F1 refseq_AKAP9.R1 197 233
NCBIGene 36.3 10142
Single exon skipping, size difference: 36
Exclusion in the protein (no frameshift)
Reference transcript: NM_147171
pfam PACT_coil_coil 83aa 2e-26
in ref transcript
Pericentrin-AKAP-450 domain of centrosomal targeting protein. This domain is a coiled-coil region close to the C-terminus of centrosomal proteins that is directly responsible for recruiting AKAP-450 and pericentrin to the centrosome. Hence the suggested name for this region is a PACT domain (pericentrin-AKAP-450 centrosomal targeting). This domain is also present at the C-terminus of coiled-coil proteins from Drosophila and S. pombe, and that from the Drosophila protein is sufficient for targeting to the centrosome in mammalian cells. The function of these proteins is unknown but they seem good candidates for having a centrosomal or spindle pole body location. The final 22 residues of this domain in AKAP-450 appear specifically to be a calmodulin-binding domain indicating that this member at least is likely to contribute to centrosome assembly.
TIGR SMC_prok_B 404aa 6e-07
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
pfam SMC_N 259aa 6e-05
in ref transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
TIGR SMC_prok_A 252aa 0.001
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
TIGR SMC_prok_B 709aa 0.002
in ref transcript
COG Smc 264aa 3e-04
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
COG Smc 317aa 7e-04
in ref transcript
PRK PRK05771 168aa 0.009
in ref transcript
V-type ATP synthase subunit I; Validated.
AKAP9
refseq_AKAP9.F2 refseq_AKAP9.R2 133 157
NCBIGene 36.3 10142
Exon skipping and alternative 3-prime or 5-prime, size difference: 24
Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_147171
pfam PACT_coil_coil 83aa 2e-26
in ref transcript
Pericentrin-AKAP-450 domain of centrosomal targeting protein. This domain is a coiled-coil region close to the C-terminus of centrosomal proteins that is directly responsible for recruiting AKAP-450 and pericentrin to the centrosome. Hence the suggested name for this region is a PACT domain (pericentrin-AKAP-450 centrosomal targeting). This domain is also present at the C-terminus of coiled-coil proteins from Drosophila and S. pombe, and that from the Drosophila protein is sufficient for targeting to the centrosome in mammalian cells. The function of these proteins is unknown but they seem good candidates for having a centrosomal or spindle pole body location. The final 22 residues of this domain in AKAP-450 appear specifically to be a calmodulin-binding domain indicating that this member at least is likely to contribute to centrosome assembly.
TIGR SMC_prok_B 404aa 6e-07
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
pfam SMC_N 259aa 6e-05
in ref transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
TIGR SMC_prok_A 252aa 0.001
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
TIGR SMC_prok_B 709aa 0.002
in ref transcript
COG Smc 264aa 3e-04
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
COG Smc 317aa 7e-04
in ref transcript
PRK PRK05771 168aa 0.009
in ref transcript
V-type ATP synthase subunit I; Validated.
AKR1A1
refseq_AKR1A1.F2 refseq_AKR1A1.R2 166 294
NCBIGene 36.3 10327
Single exon skipping, size difference: 128
Exclusion in 5'UTR
Reference transcript: NM_006066
cd Aldo_ket_red 284aa 2e-72
in ref transcript
Aldo-keto reductases (AKRs) are a superfamily of soluble NAD(P)(H) oxidoreductases whose chief purpose is to reduce aldehydes and ketones to primary and secondary alcohols. AKRs are present in all phyla and are of importance to both health and industrial applications. Members have very distinct functions and include the prokaryotic 2,5-diketo-D-gluconic acid reductases and beta-keto ester reductases, the eukaryotic aldose reductases, aldehyde reductases, hydroxysteroid dehydrogenases, steroid 5beta-reductases, potassium channel beta-subunits and aflatoxin aldehyde reductases, among others.
pfam Aldo_ket_red 287aa 2e-97
in ref transcript
Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity.
COG ARA1 298aa 4e-81
in ref transcript
Aldo/keto reductases, related to diketogulonate reductase [General function prediction only].
ALAD
refseq_ALAD.F1 refseq_ALAD.R1 188 354
NCBIGene 36.3 210
Alternative 3-prime, size difference: 166
Exclusion in the protein causing a frameshift
Reference transcript: NM_001003945
Changed!
cd eu_ALAD_PBGS_cysteine_rich 313aa 1e-151
in ref transcript
Porphobilinogen synthase (PBGS), which is also called delta-aminolevulinic acid dehydratase (ALAD), catalyzes the condensation of two 5-aminolevulinic acid (ALA) molecules to form the pyrrole porphobilinogen (PBG), which is the second step in the biosynthesis of tetrapyrroles, such as heme, vitamin B12 and chlorophyll. This reaction involves the formation of a Schiff base link between the substrate and the enzyme. PBGSs are metalloenzymes, some of which have a second, allosteric metal binding site, beside the metal ion binding site in their active site. Although PBGS is a family of homologous enzymes, its metal ion utilization at catalytic site varies between zinc and magnesium and/or potassium. PBGS can be classified into two groups based on differences in their active site metal binding site. The eukaryotic PBGSs represented by this model, which contain a cysteine-rich zinc binding motif (DXCXCX(Y/F)X3G(H/Q)CG), require zinc for their activity, they do not contain an additional allosteric metal binding site and do not bind magnesium.
Changed!
pfam ALAD 289aa 1e-138
in ref transcript
Delta-aminolevulinic acid dehydratase.
Changed!
PRK PRK09283 284aa 1e-112
in ref transcript
delta-aminolevulinic acid dehydratase; Validated.
ALAS1
refseq_ALAS1.F2 refseq_ALAS1.R2 186 363
NCBIGene 36.3 211
Single exon skipping, size difference: 177
Exclusion in 5'UTR
Reference transcript: NM_000688
cd KBL_like 356aa 1e-137
in ref transcript
KBL_like; this family belongs to the pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I). The major groups in this CD corresponds to serine palmitoyltransferase (SPT), 5-aminolevulinate synthase (ALAS), 8-amino-7-oxononanoate synthase (AONS), and 2-amino-3-ketobutyrate CoA ligase (KBL). SPT is responsible for the condensation of L-serine with palmitoyl-CoA to produce 3-ketodihydrospingosine, the reaction of the first step in sphingolipid biosynthesis. ALAS is involved in heme biosynthesis; it catalyzes the synthesis of 5-aminolevulinic acid from glycine and succinyl-coenzyme A. AONS catalyses the decarboxylative condensation of l-alanine and pimeloyl-CoA in the first committed step of biotin biosynthesis. KBL catalyzes the second reaction step of the metabolic degradation pathway for threonine converting 2-amino-3-ketobutyrate, to glycine and acetyl-CoA. The members of this CD are widely found in all three forms of life.
TIGR 5aminolev_synth 406aa 1e-178
in ref transcript
This model represents 5-aminolevulinic acid synthase, an enzyme for one of two routes to the heme precursor 5-aminolevulinate. The protein is a pyridoxal phosphate-dependent enzyme related to 2-amino-3-ketobutyrate CoA tranferase and 8-amino-7-oxononanoate synthase. This enzyme appears restricted to the alpha Proteobacteria and mitochondrial derivatives.
PRK PRK09064 405aa 1e-167
in ref transcript
5-aminolevulinate synthase; Validated.
ALAS2
refseq_ALAS2.F1 refseq_ALAS2.R1 178 374
NCBIGene 36.3 212
Single exon skipping, size difference: 196
Exclusion of the protein initiation site
Reference transcript: NM_000032
cd KBL_like 352aa 1e-149
in ref transcript
KBL_like; this family belongs to the pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I). The major groups in this CD corresponds to serine palmitoyltransferase (SPT), 5-aminolevulinate synthase (ALAS), 8-amino-7-oxononanoate synthase (AONS), and 2-amino-3-ketobutyrate CoA ligase (KBL). SPT is responsible for the condensation of L-serine with palmitoyl-CoA to produce 3-ketodihydrospingosine, the reaction of the first step in sphingolipid biosynthesis. ALAS is involved in heme biosynthesis; it catalyzes the synthesis of 5-aminolevulinic acid from glycine and succinyl-coenzyme A. AONS catalyses the decarboxylative condensation of l-alanine and pimeloyl-CoA in the first committed step of biotin biosynthesis. KBL catalyzes the second reaction step of the metabolic degradation pathway for threonine converting 2-amino-3-ketobutyrate, to glycine and acetyl-CoA. The members of this CD are widely found in all three forms of life.
TIGR 5aminolev_synth 406aa 0.0
in ref transcript
This model represents 5-aminolevulinic acid synthase, an enzyme for one of two routes to the heme precursor 5-aminolevulinate. The protein is a pyridoxal phosphate-dependent enzyme related to 2-amino-3-ketobutyrate CoA tranferase and 8-amino-7-oxononanoate synthase. This enzyme appears restricted to the alpha Proteobacteria and mitochondrial derivatives.
Changed!
pfam Preseq_ALAS 101aa 1e-55
in ref transcript
5-aminolevulinate synthase presequence. The N terminal presequence domain found in 5-aminolevulinate synthase exists as an amphipathic helix, with a positively charged surface provided by lysine residues and no stable helix at the N-terminus. The domain is essential for the import process by which ALAS is transported into the mitochondria: translocase of the outer membrane (Tom) and translocase of the inner membrane protein complexes appear responsible for recognition and import through the mitochondrial membrane. The protein Tom20 is anchored to the mitochondrial outer membrane, and its interaction with presequences is thought to be the recognition step which allows subsequent import.
PRK PRK09064 405aa 0.0
in ref transcript
5-aminolevulinate synthase; Validated.
Changed!
pfam Preseq_ALAS 29aa 7e-10
in modified transcript
ALAS2
refseq_ALAS2.F1 refseq_ALAS2.R5 160 267
NCBIGene 36.3 212
Single exon skipping, size difference: 107
Inclusion in 5'UTR
Reference transcript: NM_000032
cd KBL_like 352aa 1e-149
in ref transcript
KBL_like; this family belongs to the pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I). The major groups in this CD corresponds to serine palmitoyltransferase (SPT), 5-aminolevulinate synthase (ALAS), 8-amino-7-oxononanoate synthase (AONS), and 2-amino-3-ketobutyrate CoA ligase (KBL). SPT is responsible for the condensation of L-serine with palmitoyl-CoA to produce 3-ketodihydrospingosine, the reaction of the first step in sphingolipid biosynthesis. ALAS is involved in heme biosynthesis; it catalyzes the synthesis of 5-aminolevulinic acid from glycine and succinyl-coenzyme A. AONS catalyses the decarboxylative condensation of l-alanine and pimeloyl-CoA in the first committed step of biotin biosynthesis. KBL catalyzes the second reaction step of the metabolic degradation pathway for threonine converting 2-amino-3-ketobutyrate, to glycine and acetyl-CoA. The members of this CD are widely found in all three forms of life.
TIGR 5aminolev_synth 406aa 0.0
in ref transcript
This model represents 5-aminolevulinic acid synthase, an enzyme for one of two routes to the heme precursor 5-aminolevulinate. The protein is a pyridoxal phosphate-dependent enzyme related to 2-amino-3-ketobutyrate CoA tranferase and 8-amino-7-oxononanoate synthase. This enzyme appears restricted to the alpha Proteobacteria and mitochondrial derivatives.
pfam Preseq_ALAS 101aa 1e-55
in ref transcript
5-aminolevulinate synthase presequence. The N terminal presequence domain found in 5-aminolevulinate synthase exists as an amphipathic helix, with a positively charged surface provided by lysine residues and no stable helix at the N-terminus. The domain is essential for the import process by which ALAS is transported into the mitochondria: translocase of the outer membrane (Tom) and translocase of the inner membrane protein complexes appear responsible for recognition and import through the mitochondrial membrane. The protein Tom20 is anchored to the mitochondrial outer membrane, and its interaction with presequences is thought to be the recognition step which allows subsequent import.
PRK PRK09064 405aa 0.0
in ref transcript
5-aminolevulinate synthase; Validated.
ALAS2
refseq_ALAS2.F3 refseq_ALAS2.R3 186 297
NCBIGene 36.3 212
Single exon skipping, size difference: 111
Exclusion in the protein (no frameshift)
Reference transcript: NM_000032
cd KBL_like 352aa 1e-149
in ref transcript
KBL_like; this family belongs to the pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I). The major groups in this CD corresponds to serine palmitoyltransferase (SPT), 5-aminolevulinate synthase (ALAS), 8-amino-7-oxononanoate synthase (AONS), and 2-amino-3-ketobutyrate CoA ligase (KBL). SPT is responsible for the condensation of L-serine with palmitoyl-CoA to produce 3-ketodihydrospingosine, the reaction of the first step in sphingolipid biosynthesis. ALAS is involved in heme biosynthesis; it catalyzes the synthesis of 5-aminolevulinic acid from glycine and succinyl-coenzyme A. AONS catalyses the decarboxylative condensation of l-alanine and pimeloyl-CoA in the first committed step of biotin biosynthesis. KBL catalyzes the second reaction step of the metabolic degradation pathway for threonine converting 2-amino-3-ketobutyrate, to glycine and acetyl-CoA. The members of this CD are widely found in all three forms of life.
TIGR 5aminolev_synth 406aa 0.0
in ref transcript
This model represents 5-aminolevulinic acid synthase, an enzyme for one of two routes to the heme precursor 5-aminolevulinate. The protein is a pyridoxal phosphate-dependent enzyme related to 2-amino-3-ketobutyrate CoA tranferase and 8-amino-7-oxononanoate synthase. This enzyme appears restricted to the alpha Proteobacteria and mitochondrial derivatives.
pfam Preseq_ALAS 101aa 1e-55
in ref transcript
5-aminolevulinate synthase presequence. The N terminal presequence domain found in 5-aminolevulinate synthase exists as an amphipathic helix, with a positively charged surface provided by lysine residues and no stable helix at the N-terminus. The domain is essential for the import process by which ALAS is transported into the mitochondria: translocase of the outer membrane (Tom) and translocase of the inner membrane protein complexes appear responsible for recognition and import through the mitochondrial membrane. The protein Tom20 is anchored to the mitochondrial outer membrane, and its interaction with presequences is thought to be the recognition step which allows subsequent import.
PRK PRK09064 405aa 0.0
in ref transcript
5-aminolevulinate synthase; Validated.
ALDH1A2
refseq_ALDH1A2.F1 refseq_ALDH1A2.R1 311 425
NCBIGene 36.3 8854
Single exon skipping, size difference: 114
Exclusion in the protein (no frameshift)
Reference transcript: NM_003888
Changed!
pfam Aldedh 464aa 0.0
in ref transcript
Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyde dehydrogenase EC:1.2.1.16. Lactaldehyde dehydrogenase EC:1.2.1.22. Benzaldehyde dehydrogenase EC:1.2.1.28. Methylmalonate-semialdehyde dehydrogenase EC:1.2.1.27. Glyceraldehyde-3-phosphate dehydrogenase EC:1.2.1.9. Delta-1-pyrroline-5-carboxylate dehydrogenase EC: 1.5.1.12. Acetaldehyde dehydrogenase EC:1.2.1.10. Glutamate-5-semialdehyde dehydrogenase EC:1.2.1.41. This family also includes omega crystallin, an eye lens protein from squid and octopus that has little aldehyde dehydrogenase activity.
Changed!
COG PutA 477aa 1e-155
in ref transcript
NAD-dependent aldehyde dehydrogenases [Energy production and conversion].
Changed!
pfam Aldedh 426aa 1e-174
in modified transcript
Changed!
COG PutA 439aa 1e-136
in modified transcript
ALDH3A2
refseq_ALDH3A2.F1 refseq_ALDH3A2.R1 231 356
NCBIGene 36.3 224
Single exon skipping, size difference: 125
Exclusion of the stop codon
Reference transcript: NM_001031806
pfam Aldedh 416aa 7e-73
in ref transcript
Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyde dehydrogenase EC:1.2.1.16. Lactaldehyde dehydrogenase EC:1.2.1.22. Benzaldehyde dehydrogenase EC:1.2.1.28. Methylmalonate-semialdehyde dehydrogenase EC:1.2.1.27. Glyceraldehyde-3-phosphate dehydrogenase EC:1.2.1.9. Delta-1-pyrroline-5-carboxylate dehydrogenase EC: 1.5.1.12. Acetaldehyde dehydrogenase EC:1.2.1.10. Glutamate-5-semialdehyde dehydrogenase EC:1.2.1.41. This family also includes omega crystallin, an eye lens protein from squid and octopus that has little aldehyde dehydrogenase activity.
COG PutA 423aa 8e-82
in ref transcript
NAD-dependent aldehyde dehydrogenases [Energy production and conversion].
ALDH3B1
refseq_ALDH3B1.F1 refseq_ALDH3B1.R1 101 211
NCBIGene 36.3 221
Single exon skipping, size difference: 110
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001030010
Changed!
pfam Aldedh 250aa 1e-49
in ref transcript
Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyde dehydrogenase EC:1.2.1.16. Lactaldehyde dehydrogenase EC:1.2.1.22. Benzaldehyde dehydrogenase EC:1.2.1.28. Methylmalonate-semialdehyde dehydrogenase EC:1.2.1.27. Glyceraldehyde-3-phosphate dehydrogenase EC:1.2.1.9. Delta-1-pyrroline-5-carboxylate dehydrogenase EC: 1.5.1.12. Acetaldehyde dehydrogenase EC:1.2.1.10. Glutamate-5-semialdehyde dehydrogenase EC:1.2.1.41. This family also includes omega crystallin, an eye lens protein from squid and octopus that has little aldehyde dehydrogenase activity.
Changed!
COG PutA 258aa 1e-50
in ref transcript
NAD-dependent aldehyde dehydrogenases [Energy production and conversion].
Changed!
TIGR arg_catab_astD 75aa 0.009
in modified transcript
Members of this protein family are succinylglutamic semialdehyde dehydrogenase (EC 1.2.1.71), the fourth enzyme in the arginine succinyltransferase (AST) pathway for arginine catabolism.
Changed!
COG PutA 54aa 0.005
in modified transcript
ALDH5A1
refseq_ALDH5A1.F1 refseq_ALDH5A1.R1 125 164
NCBIGene 36.3 7915
Single exon skipping, size difference: 39
Exclusion in the protein (no frameshift)
Reference transcript: NM_170740
Changed!
TIGR SSADH 464aa 0.0
in ref transcript
SSADH enzyme belongs to the aldehyde dehydrogenase family (pfam00171), sharing a common evolutionary origin and enzymatic mechanism with lactaldehyde dehydrogenase. Like in lactaldehyde dehydrogenase and succinate semialdehyde dehydrogenase, the mammalian catalytic glutamic acid and cysteine residues are conserved in all the enzymes of this family (PS00687, PS00070).
Changed!
TIGR SSADH 451aa 0.0
in modified transcript
Changed!
PRK gabD 471aa 1e-160
in modified transcript
ALDH8A1
refseq_ALDH8A1.F1 refseq_ALDH8A1.R1 183 345
NCBIGene 36.3 64577
Single exon skipping, size difference: 162
Exclusion in the protein (no frameshift)
Reference transcript: NM_022568
Changed!
pfam Aldedh 464aa 1e-172
in ref transcript
Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyde dehydrogenase EC:1.2.1.16. Lactaldehyde dehydrogenase EC:1.2.1.22. Benzaldehyde dehydrogenase EC:1.2.1.28. Methylmalonate-semialdehyde dehydrogenase EC:1.2.1.27. Glyceraldehyde-3-phosphate dehydrogenase EC:1.2.1.9. Delta-1-pyrroline-5-carboxylate dehydrogenase EC: 1.5.1.12. Acetaldehyde dehydrogenase EC:1.2.1.10. Glutamate-5-semialdehyde dehydrogenase EC:1.2.1.41. This family also includes omega crystallin, an eye lens protein from squid and octopus that has little aldehyde dehydrogenase activity.
Changed!
COG PutA 476aa 1e-143
in ref transcript
NAD-dependent aldehyde dehydrogenases [Energy production and conversion].
Changed!
pfam Aldedh 410aa 1e-138
in modified transcript
Changed!
COG PutA 422aa 1e-115
in modified transcript
ALG8
refseq_ALG8.F1 refseq_ALG8.R1 308 383
NCBIGene 36.3 79053
Single exon skipping, size difference: 75
Inclusion in the protein causing a new stop codon
Reference transcript: NM_024079
Changed!
pfam Alg6_Alg8 494aa 1e-139
in ref transcript
ALG6, ALG8 glycosyltransferase family. N-linked (asparagine-linked) glycosylation of proteins is mediated by a highly conserved pathway in eukaryotes, in which a lipid (dolichol phosphate)-linked oligosaccharide is assembled at the endoplasmic reticulum membrane prior to the transfer of the oligosaccharide moiety to the target asparagine residues. This oligosaccharide is composed of Glc(3)Man(9)GlcNAc(2). The addition of the three glucose residues is the final series of steps in the synthesis of the oligosaccharide precursor. Alg6 transfers the first glucose residue, and Alg8 transfers the second one. In the human alg6 gene, a C->T transition, which causes Ala333 to be replaced with Val, has been identified as the cause of a congenital disorder of glycosylation, designated as type Ic OMIM:603147.
Changed!
pfam Alg6_Alg8 437aa 1e-121
in modified transcript
ALKBH1
refseq_ALKBH1.F2 refseq_ALKBH1.R2 117 280
NCBIGene 36.2 8846
Single exon skipping, size difference: 163
Exclusion in the protein causing a frameshift
Reference transcript: NM_006020
Changed!
TIGR alkb 171aa 3e-72
in ref transcript
Proteins in this family have an as of yet undetermined function in the repair of alkylation damage to DNA. Alignment and family designation based on phylogenomic analysis of Jonathan A. Eisen (PhD Thesis, Stanford University, 1999).
Changed!
COG AlkB 137aa 2e-16
in ref transcript
Alkylated DNA repair protein [DNA replication, recombination, and repair].
ALKBH6
refseq_ALKBH6.F2 refseq_ALKBH6.R2 137 175
NCBIGene 36.3 84964
Alternative 3-prime, size difference: 38
Exclusion in the protein causing a frameshift
Reference transcript: NM_032878
ALKBH6
refseq_ALKBH6.F4 refseq_ALKBH6.R4 145 177
NCBIGene 36.2 84964
Alternative 3-prime, size difference: 32
Exclusion in 5'UTR
Reference transcript: NM_032878
ALOX15B
refseq_ALOX15B.F1 refseq_ALOX15B.R1 128 263
NCBIGene 36.3 247
Exon skipping and alternative 3-prime or 5-prime, size difference: 135
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_001141
cd PLAT_LOX 120aa 7e-37
in ref transcript
PLAT domain of 12/15-lipoxygenase. As a unique subfamily of the mammalian lipoxygenases, they catalyze enzymatic lipid peroxidation in complex biological structures via direct dioxygenation of phospholipids and cholesterol esters of biomembranes and plasma lipoproteins. Both types of enzymes are cytosolic but need this domain to access their sequestered membrane or micelle bound substrates.
Changed!
pfam Lipoxygenase 437aa 1e-106
in ref transcript
Lipoxygenase.
pfam PLAT 114aa 2e-17
in ref transcript
PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this domain binds to procolipase pfam01114, which mediates membrane association. So it appears possible that this domain mediates membrane attachment via other protein binding partners. The structure of this domain is known for many members of the family and is composed of a beta sandwich.
Changed!
pfam Lipoxygenase 392aa 3e-85
in modified transcript
ALOX15B
refseq_ALOX15B.F4 refseq_ALOX15B.R4 134 221
NCBIGene 36.3 247
Single exon skipping, size difference: 87
Exclusion in the protein (no frameshift)
Reference transcript: NM_001141
cd PLAT_LOX 120aa 7e-37
in ref transcript
PLAT domain of 12/15-lipoxygenase. As a unique subfamily of the mammalian lipoxygenases, they catalyze enzymatic lipid peroxidation in complex biological structures via direct dioxygenation of phospholipids and cholesterol esters of biomembranes and plasma lipoproteins. Both types of enzymes are cytosolic but need this domain to access their sequestered membrane or micelle bound substrates.
Changed!
pfam Lipoxygenase 437aa 1e-106
in ref transcript
Lipoxygenase.
pfam PLAT 114aa 2e-17
in ref transcript
PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this domain binds to procolipase pfam01114, which mediates membrane association. So it appears possible that this domain mediates membrane attachment via other protein binding partners. The structure of this domain is known for many members of the family and is composed of a beta sandwich.
Changed!
pfam Lipoxygenase 408aa 2e-91
in modified transcript
PARD3B
refseq_ALS2CR19.F1 refseq_ALS2CR19.R1 119 326
NCBIGene 36.3 117583
Single exon skipping, size difference: 207
Exclusion in the protein (no frameshift)
Reference transcript: NM_152526
cd PDZ_signaling 79aa 9e-12
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
cd PDZ_signaling 88aa 1e-05
in ref transcript
cd PDZ_signaling 32aa 0.008
in ref transcript
smart PDZ 81aa 2e-13
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
cd PDZ_signaling 88aa 1e-05
in ref transcript
cd PDZ_signaling 32aa 0.008
in ref transcript
smart PDZ 81aa 2e-13
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_152526
cd PDZ_signaling 79aa 9e-12
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
cd PDZ_signaling 88aa 1e-05
in ref transcript
Changed!
cd PDZ_signaling 32aa 0.008
in ref transcript
smart PDZ 81aa 2e-13
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
Changed!
cd PDZ_signaling 78aa 2e-10
in modified transcript
Changed!
smart PDZ 88aa 2e-10
in modified transcript
Changed!
TIGR degP_htrA_DO 147aa 1e-05
in modified transcript
This family consists of a set proteins various designated DegP, heat shock protein HtrA, and protease DO. The ortholog in Pseudomonas aeruginosa is designated MucD and is found in an operon that controls mucoid phenotype. This family also includes the DegQ (HhoA) paralog in E. coli which can rescue a DegP mutant, but not the smaller DegS paralog, which cannot. Members of this family are located in the periplasm and have separable functions as both protease and chaperone. Members have a trypsin domain and two copies of a PDZ domain. This protein protects bacteria from thermal and other stresses and may be important for the survival of bacterial pathogens.// The chaperone function is dominant at low temperatures, whereas the proteolytic activity is turned on at elevated temperatures.
AMACR
refseq_AMACR.F1 refseq_AMACR.R1 189 350
NCBIGene 36.3 23600
Single exon skipping, size difference: 161
Exclusion in the protein causing a frameshift
Reference transcript: NM_014324
Changed!
pfam CoA_transf_3 192aa 1e-68
in ref transcript
CoA-transferase family III. CoA-transferases are found in organisms from all lines of descent. Most of these enzymes belong to two well-known enzyme families, but recent work on unusual biochemical pathways of anaerobic bacteria has revealed the existence of a third family of CoA-transferases. The members of this enzyme family differ in sequence and reaction mechanism from CoA-transferases of the other families. Currently known enzymes of the new family are a formyl-CoA: oxalate CoA-transferase, a succinyl-CoA: (R)-benzylsuccinate CoA-transferase, an (E)-cinnamoyl-CoA: (R)-phenyllactate CoA-transferase, and a butyrobetainyl-CoA: (R)-carnitine CoA-transferase. In addition, a large number of proteins of unknown or differently annotated function from Bacteria, Archaea and Eukarya apparently belong to this enzyme family. Properties and reaction mechanisms of the CoA-transferases of family III are described and compared to those of the previously known CoA-transferases.
Changed!
COG CaiB 371aa 2e-85
in ref transcript
Predicted acyl-CoA transferases/carnitine dehydratase [Energy production and conversion].
Changed!
pfam CoA_transf_3 79aa 6e-28
in modified transcript
Changed!
TIGR oxalate_frc 161aa 1e-21
in modified transcript
This enzyme, formyl-CoA transferase, transfers coenzyme A from formyl-CoA to oxalate. It forms a pathway, together with oxalyl-CoA decarboxylase, for oxalate degradation; decarboxylation by the latter gene regenerates formyl-CoA. The two enzymes typically are encoded by a two-gene operon.
Changed!
COG CaiB 129aa 3e-37
in modified transcript
AMD1
refseq_AMD1.F2 refseq_AMD1.R2 108 425
NCBIGene 36.3 262
Multiple exon skipping, size difference: 317
Exclusion in the protein (no frameshift), Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_001634
Changed!
pfam SAM_decarbox 328aa 1e-124
in ref transcript
Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermidine and spermine biosynthesis from putrescine. The Pfam alignment contains both the alpha and beta chains that are cleaved to form the active enzyme.
Changed!
pfam SAM_decarbox 41aa 2e-09
in modified transcript
AMMECR1
refseq_AMMECR1.F2 refseq_AMMECR1.R2 252 363
NCBIGene 36.3 9949
Single exon skipping, size difference: 111
Exclusion in the protein (no frameshift)
Reference transcript: NM_015365
Changed!
pfam AMMECR1 172aa 4e-45
in ref transcript
AMMECR1. This family consists of several AMMECR1 as well as several uncharacterised proteins. The contiguous gene deletion syndrome AMME is characterised by Alport syndrome, midface hypoplasia, mental retardation and elliptocytosis and is caused by a deletion in Xq22.3, comprising several genes including COL4A5, FACL4 and AMMECR1. This family contains sequences from several eukaryotic species as well as archaebacteria and it has been suggested that the AMMECR1 protein may have a basic cellular function, potentially in either the transcription, replication, repair or translation machinery.
Changed!
COG AMMECR1 153aa 1e-24
in ref transcript
Uncharacterized conserved protein [Function unknown].
Changed!
pfam AMMECR1 106aa 6e-28
in modified transcript
Changed!
COG AMMECR1 85aa 7e-15
in modified transcript
AMZ2
refseq_AMZ2.F1 refseq_AMZ2.R1 175 349
NCBIGene 36.3 51321
Single exon skipping, size difference: 174
Exclusion in the protein (no frameshift)
Reference transcript: NM_016627
Changed!
pfam Peptidase_M54 154aa 9e-12
in ref transcript
Peptidase family M54. This is a family of metallopeptidases. Two human proteins have been reported to degrade synthetic substrates and peptides.
Changed!
COG COG1913 178aa 2e-18
in ref transcript
Predicted Zn-dependent proteases [General function prediction only].
Changed!
pfam Peptidase_M54 128aa 6e-12
in modified transcript
Changed!
COG COG1913 143aa 3e-17
in modified transcript
ANAPC11
refseq_ANAPC11.F2 refseq_ANAPC11.R3 141 204
NCBIGene 36.3 51529
Single exon skipping, size difference: 63
Inclusion in 5'UTR
Reference transcript: NM_001002244
COG APC11 37aa 5e-07
in ref transcript
Component of SCF ubiquitin ligase and anaphase-promoting complex [Posttranslational modification, protein turnover, chaperones / Cell division and chromosome partitioning].
ANAPC11
refseq_ANAPC11.F3 refseq_ANAPC11.R4 102 403
NCBIGene 36.3 51529
Single exon skipping, size difference: 301
Exclusion in the protein causing a frameshift
Reference transcript: NM_001002244
Changed!
COG APC11 37aa 5e-07
in ref transcript
Component of SCF ubiquitin ligase and anaphase-promoting complex [Posttranslational modification, protein turnover, chaperones / Cell division and chromosome partitioning].
Changed!
COG APC11 84aa 1e-16
in modified transcript
ANGPTL4
refseq_ANGPTL4.F2 refseq_ANGPTL4.R2 104 218
NCBIGene 36.3 51129
Single exon skipping, size difference: 114
Exclusion in the protein (no frameshift)
Reference transcript: NM_139314
Changed!
cd FReD 218aa 1e-68
in ref transcript
Fibrinogen-related domains (FReDs); C terminal globular domain of fibrinogen. Fibrinogen is involved in blood clotting, being activated by thrombin to assemble into fibrin clots. The N-termini of 2 times 3 chains come together to form a globular arrangement called the disulfide knot. The C termini of fibrinogen chains end in globular domains, which are not completely equivalent. C terminal globular domains of the gamma chains (C-gamma) dimerize and bind to the GPR motif of the N-terminal domain of the alpha chain, while the GHR motif of N-terminal domain of the beta chain binds to the C terminal globular domains of another beta chain (C-beta), which leads to lattice formation.
Changed!
smart FBG 217aa 2e-62
in ref transcript
Fibrinogen-related domains (FReDs). Domain present at the C-termini of fibrinogen beta and gamma chains, and a variety of fibrinogen-related proteins, including tenascin and Drosophila scabrous.
Changed!
cd FReD 179aa 5e-56
in modified transcript
Changed!
smart FBG 179aa 1e-50
in modified transcript
ANK1
refseq_ANK1.F1 refseq_ANK1.R1 157 223
NCBIGene 36.2 286
Single exon skipping, size difference: 66
Exclusion in the protein (no frameshift)
Reference transcript: NM_000037
cd ANK 126aa 1e-31
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
cd ANK 126aa 2e-31
in ref transcript
cd ANK 126aa 3e-31
in ref transcript
cd ANK 124aa 1e-30
in ref transcript
cd ANK 126aa 1e-30
in ref transcript
cd ANK 120aa 2e-28
in ref transcript
cd ANK 126aa 3e-28
in ref transcript
cd ANK 155aa 2e-27
in ref transcript
cd ANK 126aa 8e-24
in ref transcript
pfam ZU5 105aa 2e-42
in ref transcript
ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function.
smart DEATH 87aa 1e-18
in ref transcript
DEATH domain, found in proteins involved in cell death (apoptosis). Alpha-helical domain present in a variety of proteins with apoptotic functions. Some (but not all) of these domains form homotypic and heterotypic dimers.
TIGR trp 253aa 4e-10
in ref transcript
after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
TIGR trp 143aa 4e-09
in ref transcript
TIGR trp 163aa 5e-09
in ref transcript
TIGR trp 237aa 5e-06
in ref transcript
pfam Ank 31aa 3e-05
in ref transcript
Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
COG Arp 158aa 2e-17
in ref transcript
FOG: Ankyrin repeat [General function prediction only].
COG Arp 178aa 2e-16
in ref transcript
COG Arp 187aa 4e-14
in ref transcript
COG Arp 128aa 1e-13
in ref transcript
COG Arp 160aa 1e-12
in ref transcript
ANK3
refseq_ANK3.F1 refseq_ANK3.R1 104 413
NCBIGene 36.3 288
Single exon skipping, size difference: 309
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_020987
cd ANK 126aa 2e-33
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
cd ANK 126aa 5e-33
in ref transcript
cd ANK 126aa 5e-33
in ref transcript
cd ANK 126aa 8e-33
in ref transcript
cd ANK 126aa 4e-32
in ref transcript
cd ANK 125aa 2e-31
in ref transcript
cd ANK 122aa 2e-31
in ref transcript
pfam ZU5 105aa 9e-37
in ref transcript
ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function.
smart DEATH 87aa 3e-21
in ref transcript
DEATH domain, found in proteins involved in cell death (apoptosis). Alpha-helical domain present in a variety of proteins with apoptotic functions. Some (but not all) of these domains form homotypic and heterotypic dimers.
TIGR trp 213aa 3e-14
in ref transcript
after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
TIGR trp 308aa 1e-07
in ref transcript
TIGR trp 186aa 2e-07
in ref transcript
TIGR trp 131aa 6e-06
in ref transcript
pfam Ank 32aa 3e-05
in ref transcript
Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
pfam Ank 32aa 7e-05
in ref transcript
COG Arp 191aa 2e-17
in ref transcript
FOG: Ankyrin repeat [General function prediction only].
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
cd ANK 126aa 5e-33
in ref transcript
cd ANK 126aa 5e-33
in ref transcript
cd ANK 126aa 8e-33
in ref transcript
cd ANK 126aa 4e-32
in ref transcript
cd ANK 125aa 2e-31
in ref transcript
cd ANK 122aa 2e-31
in ref transcript
pfam ZU5 105aa 9e-37
in ref transcript
ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function.
smart DEATH 87aa 3e-21
in ref transcript
DEATH domain, found in proteins involved in cell death (apoptosis). Alpha-helical domain present in a variety of proteins with apoptotic functions. Some (but not all) of these domains form homotypic and heterotypic dimers.
TIGR trp 213aa 3e-14
in ref transcript
after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
TIGR trp 308aa 1e-07
in ref transcript
TIGR trp 186aa 2e-07
in ref transcript
TIGR trp 131aa 6e-06
in ref transcript
pfam Ank 32aa 3e-05
in ref transcript
Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
pfam Ank 32aa 7e-05
in ref transcript
COG Arp 191aa 2e-17
in ref transcript
FOG: Ankyrin repeat [General function prediction only].
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_016552
Changed!
cd ANK 166aa 6e-13
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
cd ANK 83aa 1e-08
in ref transcript
Changed!
pfam Ank 34aa 3e-04
in ref transcript
Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
pfam Ank 33aa 0.004
in ref transcript
Changed!
COG Arp 119aa 1e-06
in ref transcript
FOG: Ankyrin repeat [General function prediction only].
Changed!
COG Arp 86aa 6e-05
in ref transcript
COG COG4642 83aa 1e-04
in ref transcript
Uncharacterized protein conserved in bacteria [Function unknown].
Changed!
cd ANK 105aa 1e-07
in modified transcript
Changed!
COG Arp 107aa 1e-05
in modified transcript
ANKMY1
refseq_ANKMY1.F4 refseq_ANKMY1.R4 234 397
NCBIGene 36.3 51281
Single exon skipping, size difference: 163
Inclusion in 5'UTR
Reference transcript: NM_016552
cd ANK 166aa 6e-13
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
cd ANK 83aa 1e-08
in ref transcript
pfam Ank 34aa 3e-04
in ref transcript
Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
pfam Ank 33aa 0.004
in ref transcript
COG Arp 119aa 1e-06
in ref transcript
FOG: Ankyrin repeat [General function prediction only].
COG Arp 86aa 6e-05
in ref transcript
COG COG4642 83aa 1e-04
in ref transcript
Uncharacterized protein conserved in bacteria [Function unknown].
ANKRD16
refseq_ANKRD16.F1 refseq_ANKRD16.R1 136 215
NCBIGene 36.3 54522
Single exon skipping, size difference: 79
Exclusion in the protein causing a frameshift
Reference transcript: NM_019046
cd ANK 121aa 5e-22
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
Changed!
cd ANK 156aa 5e-20
in ref transcript
cd ANK 128aa 3e-19
in ref transcript
TIGR trp 142aa 4e-05
in ref transcript
after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
Changed!
TIGR trp 223aa 3e-04
in ref transcript
Changed!
COG Arp 184aa 7e-12
in ref transcript
FOG: Ankyrin repeat [General function prediction only].
COG Arp 122aa 1e-10
in ref transcript
Changed!
cd ANK 112aa 2e-18
in modified transcript
Changed!
pfam Ank 32aa 0.003
in modified transcript
Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
Changed!
TIGR trp 177aa 0.004
in modified transcript
Changed!
COG Arp 145aa 4e-13
in modified transcript
ANKRD5
refseq_ANKRD5.F2 refseq_ANKRD5.R2 339 403
NCBIGene 36.3 63926
Single exon skipping, size difference: 64
Exclusion in 5'UTR
Reference transcript: NM_022096
cd ANK 125aa 1e-25
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
cd ANK 115aa 4e-15
in ref transcript
cd ANK 124aa 6e-14
in ref transcript
cd EFh 59aa 0.005
in ref transcript
EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
pfam Ank 30aa 3e-06
in ref transcript
Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
TIGR trp 119aa 7e-06
in ref transcript
after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
pfam Ank 33aa 4e-04
in ref transcript
pfam Ank 28aa 0.002
in ref transcript
COG Arp 196aa 2e-13
in ref transcript
FOG: Ankyrin repeat [General function prediction only].
COG Arp 143aa 4e-04
in ref transcript
COG Arp 91aa 0.003
in ref transcript
ANKS1B
refseq_ANKS1B.F2 refseq_ANKS1B.R2 169 241
NCBIGene 36.3 56899
Single exon skipping, size difference: 72
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_152788
cd AIDA-1b 132aa 1e-50
in ref transcript
AIDA-1b Phosphotyrosine-binding (PTB) domain. AIDA-1b is an amyloid-beta precursor protein interacting protein. It consists of ankyrin repeats, a SAM domain and a C-terminal PTB domain. PTB domains have a PH-like fold and are found in various eukaryotic signaling molecules. They were initially identified based upon their ability to recognize phosphorylated tyrosine residues In contrast to SH2 domains, which recognize phosphotyrosine and adjacent carboxy-terminal residues, PTB-domain binding specificity is conferred by residues amino-terminal to the phosphotyrosine. More recent studies have found that some types of PTB domains can bind to peptides which are not tyrosine phosphorylated or lack tyrosine residues altogether.
cd ANK 129aa 3e-29
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
cd ANK 125aa 4e-25
in ref transcript
cd SAM 62aa 2e-11
in ref transcript
Sterile alpha motif.; Widespread domain in signalling and nuclear proteins. In EPH-related tyrosine kinases, appears to mediate cell-cell initiated signal transduction via the binding of SH2-containing proteins to a conserved tyrosine that is phosphorylated. In many cases mediates homodimerization.
cd SAM 63aa 3e-08
in ref transcript
pfam PID 131aa 3e-29
in ref transcript
Phosphotyrosine interaction domain (PTB/PID).
smart SAM 66aa 2e-13
in ref transcript
Sterile alpha motif. Widespread domain in signalling and nuclear proteins. In EPH-related tyrosine kinases, appears to mediate cell-cell initiated signal transduction via the binding of SH2-containing proteins to a conserved tyrosine that is phosphorylated. In many cases mediates homodimerisation.
smart SAM 62aa 3e-11
in ref transcript
TIGR trp 191aa 1e-06
in ref transcript
after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
pfam Ank 32aa 4e-04
in ref transcript
Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
COG Arp 134aa 2e-14
in ref transcript
FOG: Ankyrin repeat [General function prediction only].
COG Arp 184aa 2e-08
in ref transcript
ANXA13
refseq_ANXA13.F1 refseq_ANXA13.R1 100 223
NCBIGene 36.3 312
Single exon skipping, size difference: 123
Exclusion in the protein (no frameshift)
Reference transcript: NM_001003954
pfam Annexin 66aa 6e-21
in ref transcript
Annexin. This family of annexins also includes giardin that has been shown to function as an annexin.
pfam Annexin 66aa 2e-20
in ref transcript
pfam Annexin 67aa 7e-17
in ref transcript
pfam Annexin 66aa 3e-12
in ref transcript
ANXA6
refseq_ANXA6.F1 refseq_ANXA6.R1 102 120
NCBIGene 36.3 309
Single exon skipping, size difference: 18
Exclusion in the protein (no frameshift)
Reference transcript: NM_001155
pfam Annexin 66aa 3e-21
in ref transcript
Annexin. This family of annexins also includes giardin that has been shown to function as an annexin.
pfam Annexin 66aa 8e-20
in ref transcript
pfam Annexin 66aa 2e-18
in ref transcript
pfam Annexin 65aa 8e-18
in ref transcript
pfam Annexin 66aa 3e-17
in ref transcript
pfam Annexin 67aa 2e-15
in ref transcript
pfam Annexin 65aa 1e-12
in ref transcript
Changed!
pfam Annexin 67aa 3e-11
in ref transcript
Changed!
pfam Annexin 67aa 6e-11
in modified transcript
ANXA7
refseq_ANXA7.F2 refseq_ANXA7.R2 288 354
NCBIGene 36.3 310
Single exon skipping, size difference: 66
Exclusion in the protein (no frameshift)
Reference transcript: NM_004034
pfam Annexin 66aa 7e-24
in ref transcript
Annexin. This family of annexins also includes giardin that has been shown to function as an annexin.
pfam Annexin 66aa 3e-22
in ref transcript
pfam Annexin 66aa 5e-21
in ref transcript
pfam Annexin 67aa 6e-14
in ref transcript
AOC2
refseq_AOC2.F1 refseq_AOC2.R1 140 221
NCBIGene 36.3 314
Alternative 5-prime, size difference: 81
Exclusion in the protein (no frameshift)
Reference transcript: NM_009590
Changed!
pfam Cu_amine_oxid 416aa 1e-136
in ref transcript
Copper amine oxidase, enzyme domain. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydrogen peroxide. The enzymes are dimers of identical 70-90 kDa subunits, each of which contains a single copper ion and a covalently bound cofactor formed by the post-translational modification of a tyrosine side chain to 2,4,5-trihydroxyphenylalanine quinone (TPQ). This family corresponds to the catalytic domain of the enzyme.
pfam Cu_amine_oxidN3 99aa 1e-23
in ref transcript
Copper amine oxidase, N3 domain. This domain is the second or third structural domain in copper amine oxidases, it is known as the N3 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydrogen peroxide. The enzymes are dimers of identical 70-90 kDa subunits, each of which contains a single copper ion and a covalently bound cofactor formed by the post-translational modification of a tyrosine side chain to 2,4,5-trihydroxyphenylalanine quinone (TPQ).
pfam Cu_amine_oxidN2 86aa 9e-20
in ref transcript
Copper amine oxidase, N2 domain. This domain is the first or second structural domain in copper amine oxidases, it is known as the N2 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydrogen peroxide. The enzymes are dimers of identical 70-90 kDa subunits, each of which contains a single copper ion and a covalently bound cofactor formed by the post-translational modification of a tyrosine side chain to 2,4,5-trihydroxyphenylalanine quinone (TPQ).
Changed!
COG TynA 416aa 1e-45
in ref transcript
Cu2+-containing amine oxidase [Secondary metabolites biosynthesis, transport, and catabolism].
Changed!
pfam Cu_amine_oxid 389aa 1e-126
in modified transcript
Changed!
COG TynA 389aa 1e-41
in modified transcript
AP1B1
refseq_AP1B1.F1 refseq_AP1B1.R1 119 140
NCBIGene 36.3 162
Single exon skipping, size difference: 21
Exclusion in the protein (no frameshift)
Reference transcript: NM_001127
cd ARM 113aa 2e-09
in ref transcript
Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
pfam Adaptin_N 524aa 1e-163
in ref transcript
Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pits and vesicles. The N-terminal region of the various adaptor proteins (APs) is constant by comparison to the C-terminal which is variable within members of the AP-2 family; and it has been proposed that this constant region interacts with another uniform component of the coated vesicles.
pfam B2-adapt-app_C 114aa 2e-34
in ref transcript
Beta2-adaptin appendage, C-terminal sub-domain. Members of this family adopt a structure consisting of a 5 stranded beta-sheet, flanked by one alpha helix on the outer side, and by two alpha helices on the inner side. This domain is required for binding to clathrin, and its subsequent polymerisation. Furthermore, a hydrophobic patch present in the domain also binds to a subset of D-phi-F/W motif-containing proteins that are bound by the alpha-adaptin appendage domain (epsin, AP180, eps15).
smart Alpha_adaptinC2 101aa 2e-16
in ref transcript
Adaptin C-terminal domain. Adaptins are components of the adaptor complexes which link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. Gamma-adaptin is a subunit of the golgi adaptor. Alpha adaptin is a heterotetramer that regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. This Ig-fold domain is found in alpha, beta and gamma adaptins and consists of a beta-sandwich containing 7 strands in 2 beta-sheets in a greek-key topology PUBMED:10430869, PUBMED:12176391. The adaptor appendage contains an additional N-terminal strand.
COG COG5096 548aa 1e-113
in ref transcript
Vesicle coat complex, various subunits [Intracellular trafficking and secretion].
AP1GBP1
refseq_AP1GBP1.F1 refseq_AP1GBP1.R1 159 195
NCBIGene 36.3 11276
Single exon skipping, size difference: 36
Exclusion in the protein (no frameshift)
Reference transcript: NM_007247
cd EH 52aa 3e-09
in ref transcript
Eps15 homology domain; found in proteins implicated in endocytosis, vesicle transport, and signal transduction. The alignment contains a pair of EF-hand motifs, typically one of them is canonical and binds to Ca2+, while the other may not bind to Ca2+. A hydrophobic binding pocket is formed by residues from both EF-hand motifs. The EH domain binds to proteins containing NPF (class I), [WF]W or SWG (class II), or H[TS]F (class III) sequence motifs.
smart EH 62aa 6e-08
in ref transcript
Eps15 homology domain. Pair of EF hand motifs that recognise proteins containing Asn-Pro-Phe (NPF) sequences.
AP2A1
refseq_AP2A1.F1 refseq_AP2A1.R1 132 198
NCBIGene 36.3 160
Single exon skipping, size difference: 66
Exclusion in the protein (no frameshift)
Reference transcript: NM_014203
pfam Adaptin_N 563aa 1e-151
in ref transcript
Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pits and vesicles. The N-terminal region of the various adaptor proteins (APs) is constant by comparison to the C-terminal which is variable within members of the AP-2 family; and it has been proposed that this constant region interacts with another uniform component of the coated vesicles.
pfam Alpha_adaptin_C 109aa 7e-50
in ref transcript
Alpha adaptin AP2, C-terminal domain. Alpha adaptin is a hetero tetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
pfam Alpha_adaptinC2 106aa 2e-13
in ref transcript
Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. This ig-fold domain is found in alpha, beta and gamma adaptins.
COG COG5096 549aa 3e-11
in ref transcript
Vesicle coat complex, various subunits [Intracellular trafficking and secretion].
AP2B1
refseq_AP2B1.F1 refseq_AP2B1.R1 196 238
NCBIGene 36.3 163
Single exon skipping, size difference: 42
Exclusion in the protein (no frameshift)
Reference transcript: NM_001030006
cd ARM 113aa 1e-09
in ref transcript
Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
pfam Adaptin_N 497aa 1e-160
in ref transcript
Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pits and vesicles. The N-terminal region of the various adaptor proteins (APs) is constant by comparison to the C-terminal which is variable within members of the AP-2 family; and it has been proposed that this constant region interacts with another uniform component of the coated vesicles.
pfam B2-adapt-app_C 111aa 4e-35
in ref transcript
Beta2-adaptin appendage, C-terminal sub-domain. Members of this family adopt a structure consisting of a 5 stranded beta-sheet, flanked by one alpha helix on the outer side, and by two alpha helices on the inner side. This domain is required for binding to clathrin, and its subsequent polymerisation. Furthermore, a hydrophobic patch present in the domain also binds to a subset of D-phi-F/W motif-containing proteins that are bound by the alpha-adaptin appendage domain (epsin, AP180, eps15).
smart Alpha_adaptinC2 101aa 3e-17
in ref transcript
Adaptin C-terminal domain. Adaptins are components of the adaptor complexes which link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. Gamma-adaptin is a subunit of the golgi adaptor. Alpha adaptin is a heterotetramer that regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. This Ig-fold domain is found in alpha, beta and gamma adaptins and consists of a beta-sandwich containing 7 strands in 2 beta-sheets in a greek-key topology PUBMED:10430869, PUBMED:12176391. The adaptor appendage contains an additional N-terminal strand.
COG COG5096 548aa 1e-112
in ref transcript
Vesicle coat complex, various subunits [Intracellular trafficking and secretion].
AP2S1
refseq_AP2S1.F1 refseq_AP2S1.R1 179 293
NCBIGene 36.3 1175
Single exon skipping, size difference: 114
Exclusion in the protein (no frameshift)
Reference transcript: NM_004069
Changed!
pfam Clat_adaptor_s 142aa 6e-57
in ref transcript
Clathrin adaptor complex small chain.
Changed!
COG APS2 142aa 6e-42
in ref transcript
Clathrin adaptor complex, small subunit [Intracellular trafficking and secretion].
Changed!
pfam Clat_adaptor_s 104aa 2e-33
in modified transcript
Changed!
COG APS2 104aa 3e-22
in modified transcript
AP3M1
refseq_AP3M1.F1 refseq_AP3M1.R1 268 394
NCBIGene 36.3 26985
Single exon skipping, size difference: 126
Exclusion in 5'UTR
Reference transcript: NM_207012
pfam Adap_comp_sub 254aa 3e-77
in ref transcript
Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits.
pfam Clat_adaptor_s 144aa 0.002
in ref transcript
Clathrin adaptor complex small chain.
COG APS2 140aa 0.002
in ref transcript
Clathrin adaptor complex, small subunit [Intracellular trafficking and secretion].
AP3S1
refseq_AP3S1.F1 refseq_AP3S1.R1 179 215
NCBIGene 36.2 1176
Single exon skipping, size difference: 36
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001284
Changed!
pfam Clat_adaptor_s 148aa 7e-47
in ref transcript
Clathrin adaptor complex small chain.
Changed!
COG APS2 152aa 5e-44
in ref transcript
Clathrin adaptor complex, small subunit [Intracellular trafficking and secretion].
Changed!
pfam Clat_adaptor_s 23aa 4e-05
in modified transcript
Changed!
COG APS2 22aa 2e-04
in modified transcript
APOL1
refseq_APOL1.F1 refseq_APOL1.R1 245 399
NCBIGene 36.3 8542
Single exon skipping, size difference: 154
Exclusion of the protein initiation site
Reference transcript: NM_145343
pfam ApoL 299aa 1e-108
in ref transcript
Apolipoprotein L. Apo L belongs to the high density lipoprotein family that plays a central role in cholesterol transport. The cholesterol content of membranes is important in cellular processes such as modulating gene transcription and signal transduction both in the adult brain and during neurodevelopment. There are six apo L genes located in close proximity to each other on chromosome 22q12 in humans. 22q12 is a confirmed high-susceptibility locus for schizophrenia and close to the region associated with velocardiofacial syndrome that includes symptoms of schizophrenia.
APOL3
refseq_APOL3.F1 refseq_APOL3.R1 112 175
NCBIGene 36.3 80833
Single exon skipping, size difference: 63
Exclusion in 5'UTR
Reference transcript: NM_145641
pfam ApoL 196aa 3e-50
in ref transcript
Apolipoprotein L. Apo L belongs to the high density lipoprotein family that plays a central role in cholesterol transport. The cholesterol content of membranes is important in cellular processes such as modulating gene transcription and signal transduction both in the adult brain and during neurodevelopment. There are six apo L genes located in close proximity to each other on chromosome 22q12 in humans. 22q12 is a confirmed high-susceptibility locus for schizophrenia and close to the region associated with velocardiofacial syndrome that includes symptoms of schizophrenia.
APOL4
refseq_APOL4.F1 refseq_APOL4.R1 218 407
NCBIGene 36.3 80832
Single exon skipping, size difference: 189
Exclusion in 5'UTR
Reference transcript: NM_030643
pfam ApoL 190aa 1e-49
in ref transcript
Apolipoprotein L. Apo L belongs to the high density lipoprotein family that plays a central role in cholesterol transport. The cholesterol content of membranes is important in cellular processes such as modulating gene transcription and signal transduction both in the adult brain and during neurodevelopment. There are six apo L genes located in close proximity to each other on chromosome 22q12 in humans. 22q12 is a confirmed high-susceptibility locus for schizophrenia and close to the region associated with velocardiofacial syndrome that includes symptoms of schizophrenia.
APRT
refseq_APRT.F1 refseq_APRT.R1 263 397
NCBIGene 36.3 353
Alternative 3-prime, size difference: 134
Exclusion in the protein causing a frameshift
Reference transcript: NM_000485
Changed!
TIGR apt 171aa 1e-70
in ref transcript
A phylogenetic analysis suggested omitting the bi-directional best hit homologs from the spirochetes from the seed for this HMM and making only tentative predictions of adenine phosphoribosyltransferase function for this lineage. The trusted cutoff score is made high for this reason. Most proteins scoring between the trusted and noise cutoffs are likely to act as adenine phosphotransferase.
Changed!
PRK PRK02304 175aa 7e-67
in ref transcript
adenine phosphoribosyltransferase; Provisional.
Changed!
TIGR apt 125aa 7e-53
in modified transcript
Changed!
PRK PRK02304 128aa 1e-49
in modified transcript
APTX
refseq_APTX.F1 refseq_APTX.R1 192 307
NCBIGene 36.3 54840
Alternative 3-prime, size difference: 115
Exclusion in the protein causing a frameshift
Reference transcript: NM_175073
Changed!
cd aprataxin_related 102aa 6e-35
in ref transcript
aprataxin related: Aprataxin, a HINT family hydrolase is mutated in ataxia oculomotor apraxia syndrome. All the members of this subgroup have the conserved HxHxHxx (where x is a hydrophobic residue) signature motif. Members of this subgroup are predominantly eukaryotic in origin.
Changed!
cd FHA 94aa 2e-05
in ref transcript
Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain. FHA domains may bind phosphothreonine, phosphoserine and sometimes phosphotyrosine. In eukaryotes, many FHA domain-containing proteins localize to the nucleus, where they participate in establishing or maintaining cell cycle checkpoints, DNA repair, or transcriptional regulation. Members of the FHA family include: Dun1, Rad53, Cds1, Mek1, KAPP(kinase-associated protein phosphatase),and Ki-67 (a human nuclear protein related to cell proliferation).
Changed!
TIGR PNK-3'Pase 105aa 1e-14
in ref transcript
Note that the EC number for the kinase function is: 2.7.1.78.
Changed!
pfam HIT 91aa 6e-06
in ref transcript
HIT domain.
Changed!
COG Hit 83aa 1e-05
in ref transcript
Diadenosine tetraphosphate (Ap4A) hydrolase and other HIT family hydrolases [Nucleotide transport and metabolism / Carbohydrate transport and metabolism / General function prediction only].
Changed!
TIGR PNK-3'Pase 58aa 2e-07
in modified transcript
APTX
refseq_APTX.F3 refseq_APTX.R3 195 231
NCBIGene 36.3 54840
Alternative 3-prime, size difference: 36
Inclusion in the protein causing a new stop codon
Reference transcript: NM_175073
cd aprataxin_related 102aa 6e-35
in ref transcript
aprataxin related: Aprataxin, a HINT family hydrolase is mutated in ataxia oculomotor apraxia syndrome. All the members of this subgroup have the conserved HxHxHxx (where x is a hydrophobic residue) signature motif. Members of this subgroup are predominantly eukaryotic in origin.
cd FHA 94aa 2e-05
in ref transcript
Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain. FHA domains may bind phosphothreonine, phosphoserine and sometimes phosphotyrosine. In eukaryotes, many FHA domain-containing proteins localize to the nucleus, where they participate in establishing or maintaining cell cycle checkpoints, DNA repair, or transcriptional regulation. Members of the FHA family include: Dun1, Rad53, Cds1, Mek1, KAPP(kinase-associated protein phosphatase),and Ki-67 (a human nuclear protein related to cell proliferation).
TIGR PNK-3'Pase 105aa 1e-14
in ref transcript
Note that the EC number for the kinase function is: 2.7.1.78.
pfam HIT 91aa 6e-06
in ref transcript
HIT domain.
COG Hit 83aa 1e-05
in ref transcript
Diadenosine tetraphosphate (Ap4A) hydrolase and other HIT family hydrolases [Nucleotide transport and metabolism / Carbohydrate transport and metabolism / General function prediction only].
APTX
refseq_APTX.F4 refseq_APTX.R4 114 220
NCBIGene 36.3 54840
Single exon skipping, size difference: 106
Exclusion in 5'UTR
Reference transcript: NM_175073
cd aprataxin_related 102aa 6e-35
in ref transcript
aprataxin related: Aprataxin, a HINT family hydrolase is mutated in ataxia oculomotor apraxia syndrome. All the members of this subgroup have the conserved HxHxHxx (where x is a hydrophobic residue) signature motif. Members of this subgroup are predominantly eukaryotic in origin.
cd FHA 94aa 2e-05
in ref transcript
Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain. FHA domains may bind phosphothreonine, phosphoserine and sometimes phosphotyrosine. In eukaryotes, many FHA domain-containing proteins localize to the nucleus, where they participate in establishing or maintaining cell cycle checkpoints, DNA repair, or transcriptional regulation. Members of the FHA family include: Dun1, Rad53, Cds1, Mek1, KAPP(kinase-associated protein phosphatase),and Ki-67 (a human nuclear protein related to cell proliferation).
TIGR PNK-3'Pase 105aa 1e-14
in ref transcript
Note that the EC number for the kinase function is: 2.7.1.78.
pfam HIT 91aa 6e-06
in ref transcript
HIT domain.
COG Hit 83aa 1e-05
in ref transcript
Diadenosine tetraphosphate (Ap4A) hydrolase and other HIT family hydrolases [Nucleotide transport and metabolism / Carbohydrate transport and metabolism / General function prediction only].
ARF1
refseq_ARF1.F1 refseq_ARF1.R1 114 136
NCBIGene 36.3 375
Alternative 5-prime, size difference: 22
Exclusion in 5'UTR
Reference transcript: NM_001024227
cd Arf1_5_like 159aa 1e-98
in ref transcript
Arf1-Arf5-like subfamily. This subfamily contains Arf1, Arf2, Arf3, Arf4, Arf5, and related proteins. Arfs1-5 are soluble proteins that are crucial for assembling coat proteins during vesicle formation. Each contains an N-terminal myristoylated amphipathic helix that is folded into the protein in the GDP-bound state. GDP/GTP exchange exposes the helix, which anchors to the membrane. Following GTP hydrolysis, the helix dissociates from the membrane and folds back into the protein. A general feature of Arf1-5 signaling may be the cooperation of two Arfs at the same site. Arfs1-5 are generally considered to be interchangeable in function and location, but some specific functions have been assigned. Arf1 localizes to the early/cis-Golgi, where it is activated by GBF1 and recruits the coat protein COPI. It also localizes to the trans-Golgi network (TGN), where it is activated by BIG1/BIG2 and recruits the AP1, AP3, AP4, and GGA proteins. Humans, but not rodents and other lower eukaryotes, lack Arf2. Human Arf3 shares 96% sequence identity with Arf1 and is believed to generally function interchangeably with Arf1. Human Arf4 in the activated (GTP-bound) state has been shown to interact with the cytoplasmic domain of epidermal growth factor receptor (EGFR) and mediate the EGF-dependent activation of phospholipase D2 (PLD2), leading to activation of the activator protein 1 (AP-1) transcription factor. Arf4 has also been shown to recognize the C-terminal sorting signal of rhodopsin and regulate its incorporation into specialized post-Golgi rhodopsin transport carriers (RTCs). There is some evidence that Arf5 functions at the early-Golgi and the trans-Golgi to affect Golgi-associated alpha-adaptin homology Arf-binding proteins (GGAs).
smart ARF 175aa 1e-98
in ref transcript
ARF-like small GTPases; ARF, ADP-ribosylation factor. Ras homologues involved in vesicular transport. Activator of phospholipase D isoforms. Unlike Ras proteins they lack cysteine residues at their C-termini and therefore are unlikely to be prenylated. ARFs are N-terminally myristoylated. Contains ATP/GTP-binding motif (P-loop).
PTZ PTZ00133 181aa 4e-96
in ref transcript
ADP-ribosylation factor; Provisional.
ARF1
refseq_ARF1.F3 refseq_ARF1.R3 187 272
NCBIGene 36.3 375
Alternative 5-prime, size difference: 85
Exclusion in 5'UTR
Reference transcript: NM_001024226
cd Arf1_5_like 159aa 1e-98
in ref transcript
Arf1-Arf5-like subfamily. This subfamily contains Arf1, Arf2, Arf3, Arf4, Arf5, and related proteins. Arfs1-5 are soluble proteins that are crucial for assembling coat proteins during vesicle formation. Each contains an N-terminal myristoylated amphipathic helix that is folded into the protein in the GDP-bound state. GDP/GTP exchange exposes the helix, which anchors to the membrane. Following GTP hydrolysis, the helix dissociates from the membrane and folds back into the protein. A general feature of Arf1-5 signaling may be the cooperation of two Arfs at the same site. Arfs1-5 are generally considered to be interchangeable in function and location, but some specific functions have been assigned. Arf1 localizes to the early/cis-Golgi, where it is activated by GBF1 and recruits the coat protein COPI. It also localizes to the trans-Golgi network (TGN), where it is activated by BIG1/BIG2 and recruits the AP1, AP3, AP4, and GGA proteins. Humans, but not rodents and other lower eukaryotes, lack Arf2. Human Arf3 shares 96% sequence identity with Arf1 and is believed to generally function interchangeably with Arf1. Human Arf4 in the activated (GTP-bound) state has been shown to interact with the cytoplasmic domain of epidermal growth factor receptor (EGFR) and mediate the EGF-dependent activation of phospholipase D2 (PLD2), leading to activation of the activator protein 1 (AP-1) transcription factor. Arf4 has also been shown to recognize the C-terminal sorting signal of rhodopsin and regulate its incorporation into specialized post-Golgi rhodopsin transport carriers (RTCs). There is some evidence that Arf5 functions at the early-Golgi and the trans-Golgi to affect Golgi-associated alpha-adaptin homology Arf-binding proteins (GGAs).
smart ARF 175aa 1e-98
in ref transcript
ARF-like small GTPases; ARF, ADP-ribosylation factor. Ras homologues involved in vesicular transport. Activator of phospholipase D isoforms. Unlike Ras proteins they lack cysteine residues at their C-termini and therefore are unlikely to be prenylated. ARFs are N-terminally myristoylated. Contains ATP/GTP-binding motif (P-loop).
PTZ PTZ00133 181aa 4e-96
in ref transcript
ADP-ribosylation factor; Provisional.
ARFGAP1
refseq_ARFGAP1.F1 refseq_ARFGAP1.R1 136 166
NCBIGene 36.3 55738
Single exon skipping, size difference: 30
Exclusion in the protein (no frameshift)
Reference transcript: NM_175609
pfam ArfGap 118aa 3e-48
in ref transcript
Putative GTPase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs.
COG COG5347 159aa 4e-31
in ref transcript
GTPase-activating protein that regulates ARFs (ADP-ribosylation factors), involved in ARF-mediated vesicular transport [Intracellular trafficking and secretion].
ARFIP1
refseq_ARFIP1.F1 refseq_ARFIP1.R1 145 280
NCBIGene 36.3 27236
Alternative 5-prime, size difference: 135
Exclusion in 5'UTR
Reference transcript: NM_001025595
cd Arfaptin 202aa 1e-51
in ref transcript
Arfaptin domain; arfaptin is a ubiquitously expressed protein implicated in mediating cross-talk between Rac, a member of the Rho family, and Arf small GTPases; Arfaptin binds to GTP-bound Arf1 and Arf6, but binds Rac.GTP and Rac.GDP with similar affinities. Structures of Arfaptin with Rac bound to either GDP or the slowly hydrolysable analogue GMPPNP show that the switch regions adopt similar conformations in both complexes. Arf1 and Arf6 are thought to bind to the same surface as Rac.
pfam Arfaptin 196aa 5e-70
in ref transcript
Arfaptin-like domain. Arfaptin interacts with ARF1, a small GTPase involved in vesicle budding at the Golgi complex and immature secretory granules. The structure of arfaptin shows that upon binding to a small GTPase, arfaptin forms a an elongated, crescent-shaped dimer of three-helix coiled-coils. The N-terminal region of ICA69 is similar to arfaptin.
ARFIP1
refseq_ARFIP1.F2 refseq_ARFIP1.R2 290 386
NCBIGene 36.3 27236
Single exon skipping, size difference: 96
Exclusion in the protein (no frameshift)
Reference transcript: NM_001025595
cd Arfaptin 202aa 1e-51
in ref transcript
Arfaptin domain; arfaptin is a ubiquitously expressed protein implicated in mediating cross-talk between Rac, a member of the Rho family, and Arf small GTPases; Arfaptin binds to GTP-bound Arf1 and Arf6, but binds Rac.GTP and Rac.GDP with similar affinities. Structures of Arfaptin with Rac bound to either GDP or the slowly hydrolysable analogue GMPPNP show that the switch regions adopt similar conformations in both complexes. Arf1 and Arf6 are thought to bind to the same surface as Rac.
pfam Arfaptin 196aa 5e-70
in ref transcript
Arfaptin-like domain. Arfaptin interacts with ARF1, a small GTPase involved in vesicle budding at the Golgi complex and immature secretory granules. The structure of arfaptin shows that upon binding to a small GTPase, arfaptin forms a an elongated, crescent-shaped dimer of three-helix coiled-coils. The N-terminal region of ICA69 is similar to arfaptin.
ARHGAP17
refseq_ARHGAP17.F1 refseq_ARHGAP17.R1 162 396
NCBIGene 36.3 55114
Single exon skipping, size difference: 234
Exclusion in the protein (no frameshift)
Reference transcript: NM_001006634
cd RhoGAP_nadrin 201aa 1e-85
in ref transcript
RhoGAP_nadrin: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of Nadrin-like proteins. Nadrin, also named Rich-1, has been shown to be involved in the regulation of Ca2+-dependent exocytosis in neurons and recently has been implicated in tight junction maintenance in mammalian epithelium. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude.
pfam BAR 238aa 1e-51
in ref transcript
BAR domain. BAR domains are dimerisation, lipid binding and curvature sensing modules found in many different protein families. A BAR domain with an additional N-terminal amphipathic helix (an N-BAR) can drive membrane curvature. These N-BAR domains are found in amphiphysin, endophilin, BRAP and Nadrin. BAR domains are also frequently found alongside domains that determine lipid specificity, like pfam00169 and pfam00787 domains in beta centaurins and sorting nexins respectively.
smart RhoGAP 176aa 6e-41
in ref transcript
GTPase-activator protein for Rho-like GTPases. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases. etter domain limits and outliers.
ARHGAP6
refseq_ARHGAP6.F1 refseq_ARHGAP6.R1 103 199
NCBIGene 36.3 395
Single exon skipping, size difference: 96
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_013427
cd RhoGAP_ARHGAP6 207aa 1e-93
in ref transcript
RhoGAP_ARHGAP6: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of ArhGAP6-like proteins. ArhGAP6 shows GAP activity towards RhoA, but not towards Cdc42 and Rac1. ArhGAP6 is often deleted in microphthalmia with linear skin defects syndrome (MLS); MLS is a severe X-linked developmental disorder. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude.
pfam RhoGAP 160aa 4e-45
in ref transcript
RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases.
ARHGAP6
refseq_ARHGAP6.F4 refseq_ARHGAP6.R4 186 276
NCBIGene 36.3 395
Single exon skipping, size difference: 90
Inclusion in the protein causing a new stop codon
Reference transcript: NM_013427
cd RhoGAP_ARHGAP6 207aa 1e-93
in ref transcript
RhoGAP_ARHGAP6: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of ArhGAP6-like proteins. ArhGAP6 shows GAP activity towards RhoA, but not towards Cdc42 and Rac1. ArhGAP6 is often deleted in microphthalmia with linear skin defects syndrome (MLS); MLS is a severe X-linked developmental disorder. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude.
pfam RhoGAP 160aa 4e-45
in ref transcript
RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases.
ARHGAP8
refseq_ARHGAP8.F2 refseq_ARHGAP8.R2 141 234
NCBIGene 36.3 23779
Single exon skipping, size difference: 93
Exclusion in the protein (no frameshift)
Reference transcript: NM_001017526
cd RhoGAP-p50rhoGAP 191aa 4e-73
in ref transcript
RhoGAP-p50rhoGAP: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of p50RhoGAP-like proteins; p50RhoGAP, also known as RhoGAP-1, contains a C-terminal RhoGAP domain and an N-terminal Sec14 domain which binds phosphatidylinositol 3,4,5-trisphosphate (PtdIns(3,4,5)P3). It is ubiquitously expressed and preferentially active on Cdc42. This subgroup also contains closely related ARHGAP8. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude.
Changed!
cd SEC14 177aa 2e-18
in ref transcript
Sec14p-like lipid-binding domain. Found in secretory proteins, such as S. cerevisiae phosphatidylinositol transfer protein (Sec14p), and in lipid regulated proteins such as RhoGAPs, RhoGEFs and neurofibromin (NF1). SEC14 domain of Dbl is known to associate with G protein beta/gamma subunits.
pfam RhoGAP 149aa 3e-41
in ref transcript
RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases.
Changed!
smart SEC14 181aa 1e-17
in ref transcript
Domain in homologues of a S. cerevisiae phosphatidylinositol transfer protein (Sec14p). Domain in homologues of a S. cerevisiae phosphatidylinositol transfer protein (Sec14p) and in RhoGAPs, RhoGEFs and the RasGAP, neurofibromin (NF1). Lipid-binding domain. The SEC14 domain of Dbl is known to associate with G protein beta/gamma subunits.
Changed!
cd SEC14 146aa 3e-22
in modified transcript
Changed!
smart SEC14 150aa 2e-22
in modified transcript
ARHGEF1
refseq_ARHGEF1.F2 refseq_ARHGEF1.R2 265 364
NCBIGene 36.3 9138
Single exon skipping, size difference: 99
Exclusion in the protein (no frameshift)
Reference transcript: NM_199002
cd RhoGEF 187aa 3e-37
in ref transcript
Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases; Also called Dbl-homologous (DH) domain. It appears that PH domains invariably occur C-terminal to RhoGEF/DH domains.
Changed!
pfam RGS-like 191aa 3e-59
in ref transcript
Regulator of G protein signalling-like domain. Members of this family adopt a structure consisting of twelve helices that fold into a compact domain that contains the overall structural scaffold observed in other RGS proteins and three additional helical elements that pack closely to it. Helices 1-9 comprise the RGS (pfam00615) fold, in which helices 4-7 form a classic antiparallel bundle adjacent to the other helices. Like other RGS structures, helices 7 and 8 span the length of the folded domain and form essentially one continuous helix with a kink in the middle. Helices 10-12 form an apparently stable C-terminal extension of the structural domain, and although other RGS proteins lack this structure, these elements are intimately associated with the rest of the structural framework by hydrophobic interactions. Members of the family bind to active G-alpha proteins, promoting GTP hydrolysis by the alpha subunit of heterotrimeric G proteins, thereby inactivating the G protein and rapidly switching off G protein-coupled receptor signalling pathways.
smart RhoGEF 185aa 2e-42
in ref transcript
Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that PH domains invariably occur C-terminal to RhoGEF/DH domains. Improved coverage.
Changed!
pfam RGS-like 158aa 1e-32
in modified transcript
ARHGEF10L
refseq_ARHGEF10L.F1 refseq_ARHGEF10L.R1 192 309
NCBIGene 36.3 55160
Single exon skipping, size difference: 117
Exclusion in the protein (no frameshift)
Reference transcript: NM_018125
cd RhoGEF 167aa 4e-31
in ref transcript
Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases; Also called Dbl-homologous (DH) domain. It appears that PH domains invariably occur C-terminal to RhoGEF/DH domains.
smart RhoGEF 164aa 2e-33
in ref transcript
Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that PH domains invariably occur C-terminal to RhoGEF/DH domains. Improved coverage.
Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases; Also called Dbl-homologous (DH) domain. It appears that PH domains invariably occur C-terminal to RhoGEF/DH domains.
cd PDZ_signaling 75aa 2e-13
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
pfam RGS-like 180aa 3e-69
in ref transcript
Regulator of G protein signalling-like domain. Members of this family adopt a structure consisting of twelve helices that fold into a compact domain that contains the overall structural scaffold observed in other RGS proteins and three additional helical elements that pack closely to it. Helices 1-9 comprise the RGS (pfam00615) fold, in which helices 4-7 form a classic antiparallel bundle adjacent to the other helices. Like other RGS structures, helices 7 and 8 span the length of the folded domain and form essentially one continuous helix with a kink in the middle. Helices 10-12 form an apparently stable C-terminal extension of the structural domain, and although other RGS proteins lack this structure, these elements are intimately associated with the rest of the structural framework by hydrophobic interactions. Members of the family bind to active G-alpha proteins, promoting GTP hydrolysis by the alpha subunit of heterotrimeric G proteins, thereby inactivating the G protein and rapidly switching off G protein-coupled receptor signalling pathways.
smart RhoGEF 185aa 7e-45
in ref transcript
Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that PH domains invariably occur C-terminal to RhoGEF/DH domains. Improved coverage.
smart PDZ 77aa 1e-15
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
Alternative 5-prime and 3-prime, size difference: 256
Exclusion of the protein initiation site, Exclusion in the protein causing a frameshift
Reference transcript: NM_012097
Changed!
cd Arl5_Arl8 173aa 1e-94
in ref transcript
Arl5/Arl8 subfamily. Arl5 (Arf-like 5) and Arl8, like Arl4 and Arl7, are localized to the nucleus and nucleolus. Arl5 is developmentally regulated during embryogenesis in mice. Human Arl5 interacts with the heterochromatin protein 1-alpha (HP1alpha), a nonhistone chromosomal protein that is associated with heterochromatin and telomeres, and prevents telomere fusion. Arl5 may also play a role in embryonic nuclear dynamics and/or signaling cascades. Arl8 was identified from a fetal cartilage cDNA library. It is found in brain, heart, lung, cartilage, and kidney. No function has been assigned for Arl8 to date.
Changed!
pfam Arf 158aa 2e-61
in ref transcript
ADP-ribosylation factor family. Pfam combines a number of different Prosite families together.
Changed!
PTZ PTZ00133 179aa 5e-55
in ref transcript
ADP-ribosylation factor; Provisional.
Changed!
cd Arl5_Arl8 138aa 6e-76
in modified transcript
Changed!
pfam Arf 135aa 3e-50
in modified transcript
Changed!
PTZ PTZ00133 142aa 4e-43
in modified transcript
Arl6 subfamily. Arl6 (Arf-like 6) forms a subfamily of the Arf family of small GTPases. Arl6 expression is limited to the brain and kidney in adult mice, but it is expressed in the neural plate and somites during embryogenesis, suggesting a possible role for Arl6 in early development. Arl6 is also believed to have a role in cilia or flagella function. Several proteins have been identified that bind Arl6, including Arl6 interacting protein (Arl6ip), and SEC61beta, a subunit of the heterotrimeric conducting channel SEC61p. Based on Arl6 binding to these effectors, Arl6 is also proposed to play a role in protein transport, membrane trafficking, or cell signaling during hematopoietic maturation. At least three specific homozygous Arl6 mutations in humans have been found to cause Bardet-Biedl syndrome, a disorder characterized by obesity, retinopathy, polydactyly, renal and cardiac malformations, learning disabilities, and hypogenitalism. Older literature suggests that Arl6 is a part of the Arl4/Arl7 subfamily, but analyses based on more recent sequence data place Arl6 in its own subfamily.
pfam Arf 167aa 3e-49
in ref transcript
ADP-ribosylation factor family. Pfam combines a number of different Prosite families together.
PTZ PTZ00133 182aa 2e-37
in ref transcript
ADP-ribosylation factor; Provisional.
ARMC8
refseq_ARMC8.F1 refseq_ARMC8.R1 281 335
NCBIGene 36.3 25852
Alternative 3-prime, size difference: 54
Inclusion in 5'UTR
Reference transcript: NM_015396
cd ARM 116aa 7e-07
in ref transcript
Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
cd ARM 111aa 2e-06
in ref transcript
cd ARM 112aa 0.007
in ref transcript
COG COG5369 234aa 1e-18
in ref transcript
Uncharacterized conserved protein [Function unknown].
COG SRP1 185aa 2e-05
in ref transcript
Karyopherin (importin) alpha [Intracellular trafficking and secretion].
ARMCX2
refseq_ARMCX2.F1 refseq_ARMCX2.R1 137 232
NCBIGene 36.3 9823
Exon skipping and alternative 3-prime or 5-prime, size difference: 95
Exclusion in 5'UTR, Exclusion in 5'UTR
Reference transcript: NM_177949
cd ARM 118aa 1e-06
in ref transcript
Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
pfam DUF634 244aa 1e-107
in ref transcript
Protein of unknown function (DUF634). Mammalian protein of unknown function.
ARMCX2
refseq_ARMCX2.F4 refseq_ARMCX2.R4 158 202
NCBIGene 36.3 9823
Alternative 3-prime, size difference: 44
Inclusion in 5'UTR
Reference transcript: NM_177949
cd ARM 118aa 1e-06
in ref transcript
Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
pfam DUF634 244aa 1e-107
in ref transcript
Protein of unknown function (DUF634). Mammalian protein of unknown function.
ARMCX3
refseq_ARMCX3.F1 refseq_ARMCX3.R1 102 130
NCBIGene 36.3 51566
Alternative 3-prime, size difference: 28
Inclusion in 5'UTR
Reference transcript: NM_177947
cd ARM 86aa 3e-04
in ref transcript
Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
pfam DUF634 253aa 1e-106
in ref transcript
Protein of unknown function (DUF634). Mammalian protein of unknown function.
ARMCX6
refseq_ARMCX6.F2 refseq_ARMCX6.R2 291 373
NCBIGene 36.3 54470
Single exon skipping, size difference: 82
Exclusion in 5'UTR
Reference transcript: NM_019007
pfam DUF634 191aa 6e-32
in ref transcript
Protein of unknown function (DUF634). Mammalian protein of unknown function.
ARNT
refseq_ARNT.F1 refseq_ARNT.R1 149 194
NCBIGene 36.3 405
Single exon skipping, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_001668
cd PAS 97aa 1e-11
in ref transcript
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.
Changed!
cd HLH 57aa 2e-09
in ref transcript
Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
cd PAS 63aa 1e-05
in ref transcript
pfam PAS_3 89aa 5e-14
in ref transcript
PAS fold. The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs. The PAS fold appears in archaea, eubacteria and eukarya.
Changed!
pfam HLH 54aa 3e-11
in ref transcript
Helix-loop-helix DNA-binding domain.
pfam PAS 107aa 4e-08
in ref transcript
PAS fold. The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs. The PAS fold appears in archaea, eubacteria and eukarya.
Changed!
cd HLH 53aa 1e-09
in modified transcript
Changed!
pfam HLH 54aa 3e-11
in modified transcript
ARNTL
refseq_ARNTL.F1 refseq_ARNTL.R1 162 217
NCBIGene 36.3 406
Single exon skipping, size difference: 55
Exclusion in 5'UTR
Reference transcript: NM_001178
cd PAS 96aa 9e-12
in ref transcript
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.
cd HLH 56aa 5e-11
in ref transcript
Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
cd PAS 95aa 4e-08
in ref transcript
pfam HLH 54aa 4e-13
in ref transcript
Helix-loop-helix DNA-binding domain.
smart PAS 60aa 1e-09
in ref transcript
PAS domain. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels ([1]; Ponting & Aravind, in press).
smart PAS 54aa 1e-07
in ref transcript
ARNTL
refseq_ARNTL.F4 refseq_ARNTL.R4 254 325
NCBIGene 36.3 406
Single exon skipping, size difference: 71
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001178
Changed!
cd PAS 96aa 9e-12
in ref transcript
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.
Changed!
cd HLH 56aa 5e-11
in ref transcript
Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
Changed!
cd PAS 95aa 4e-08
in ref transcript
Changed!
pfam HLH 54aa 4e-13
in ref transcript
Helix-loop-helix DNA-binding domain.
Changed!
smart PAS 60aa 1e-09
in ref transcript
PAS domain. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels ([1]; Ponting & Aravind, in press).
Changed!
pfam Arrestin_N 142aa 4e-23
in modified transcript
ARS2
refseq_ARS2.F1 refseq_ARS2.R1 135 209
NCBIGene 36.3 51593
Single exon skipping, size difference: 74
Inclusion in the protein causing a new stop codon
Reference transcript: NM_015908
Changed!
pfam ARS2 222aa 8e-79
in ref transcript
Arsenite-resistance protein 2. Arsenite is a carcinogenic compound which can act as a co-mutagen by inhibiting DNA repair. Arsenite-resistance protein 2 is thought to play a role in arsenite resistance.
ASB3
refseq_ASB3.F1 refseq_ASB3.R1 167 376
NCBIGene 36.3 51130
Single exon skipping, size difference: 209
Exclusion of the protein initiation site
Reference transcript: NM_016115
cd ANK 122aa 3e-22
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
Changed!
cd ANK 130aa 9e-22
in ref transcript
cd SOCS_ASB3 51aa 6e-20
in ref transcript
SOCS (suppressors of cytokine signaling) box of ASB3-like proteins. ASB family members have a C-terminal SOCS box and an N-terminal ankyrin-related sequence. ABS3 has been shown to be negative regulator of TNF-R2-mediated cellular responses to TNF-alpha by direct targeting of tumor necrosis factor receptor II (TNF-R2) for ubiquitination and proteasome-mediated degradation. The general function of the SOCS box is the recruitment of the ubiquitin-transferase system. The SOCS box interacts with Elongins B and C, Cullin-5 or Cullin-2, Rbx-1, and E2. Therefore, SOCS-box-containing proteins probably function as E3 ubiquitin ligases and mediate the degradation of proteins associated through their N-terminal regions.
cd ANK 101aa 3e-07
in ref transcript
pfam SOCS_box 39aa 2e-06
in ref transcript
SOCS box. The SOCS box acts as a bridge between specific substrate- binding domains and more generic proteins that comprise a large family of E3 ubiquitin protein ligases.
pfam Ank 32aa 3e-05
in ref transcript
Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
TIGR trp 154aa 1e-04
in ref transcript
after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
Changed!
COG Arp 183aa 4e-12
in ref transcript
FOG: Ankyrin repeat [General function prediction only].
Changed!
PTZ PTZ00322 82aa 0.004
in ref transcript
Changed!
cd ANK 122aa 3e-24
in modified transcript
Changed!
COG Arp 124aa 2e-13
in modified transcript
Changed!
COG Arp 153aa 1e-07
in modified transcript
ASB6
refseq_ASB6.F1 refseq_ASB6.R1 169 278
NCBIGene 36.3 140459
Single exon skipping, size difference: 109
Exclusion in the protein causing a frameshift
Reference transcript: NM_017873
Changed!
cd SOCS_ASB6 44aa 2e-18
in ref transcript
SOCS (suppressors of cytokine signaling) box of ASB6-like proteins. ASB family members have a C-terminal SOCS box and an N-terminal ankyrin-related sequence. ASB6 interacts with the adaptor protein APS and recruits elongin B/C to the insulin receptor signaling complex. The general function of the SOCS box is the recruitment of the ubiquitin-transferase system. The SOCS box interacts with Elongins B and C, Cullin-5 or Cullin-2, Rbx-1, and E2. Therefore, SOCS-box-containing proteins probably function as E3 ubiquitin ligases and mediate the degradation of proteins associated through their N-terminal regions.
Changed!
cd ANK 127aa 1e-14
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
Changed!
cd ANK 124aa 3e-04
in ref transcript
Changed!
pfam SOCS_box 37aa 8e-06
in ref transcript
SOCS box. The SOCS box acts as a bridge between specific substrate- binding domains and more generic proteins that comprise a large family of E3 ubiquitin protein ligases.
pfam Ank 31aa 2e-05
in ref transcript
Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
Changed!
TIGR trp 167aa 2e-04
in ref transcript
after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
Changed!
PTZ PTZ00322 83aa 1e-04
in ref transcript
Changed!
PTZ PTZ00322 57aa 0.001
in ref transcript
Changed!
PTZ PTZ00322 131aa 0.002
in ref transcript
Changed!
cd ANK 104aa 4e-10
in modified transcript
Changed!
TIGR trp 119aa 0.004
in modified transcript
Changed!
COG Arp 127aa 0.001
in modified transcript
FOG: Ankyrin repeat [General function prediction only].
ASB9
refseq_ASB9.F1 refseq_ASB9.R1 219 280
NCBIGene 36.3 140462
Alternative 5-prime, size difference: 61
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001031739
cd ANK 125aa 6e-22
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
Changed!
cd SOCS_ASB_9_11 42aa 9e-15
in ref transcript
SOCS (suppressors of cytokine signaling) box of ASB9 and 11 proteins. ASB family members have a C-terminal SOCS box and an N-terminal ankyrin-related sequence. The general function of the SOCS box is the recruitment of the ubiquitin-transferase system. The SOCS box interacts with Elongins B and C, Cullin-5 or Cullin-2, Rbx-1, and E2. Therefore, SOCS-box-containing proteins probably function as E3 ubiquitin ligases and mediate the degradation of proteins associated through their N-terminal regions.
Changed!
pfam SOCS_box 36aa 2e-06
in ref transcript
SOCS box. The SOCS box acts as a bridge between specific substrate- binding domains and more generic proteins that comprise a large family of E3 ubiquitin protein ligases.
TIGR trp 134aa 2e-05
in ref transcript
after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
pfam Ank 29aa 6e-04
in ref transcript
Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
COG Arp 173aa 2e-12
in ref transcript
FOG: Ankyrin repeat [General function prediction only].
Changed!
COG Arp 156aa 1e-05
in ref transcript
Changed!
PTZ PTZ00322 108aa 0.002
in modified transcript
cd CLECT_DC-SIGN_like 126aa 1e-40
in ref transcript
CLECT_DC-SIGN_like: C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR). This group also contains proteins similar to hepatic asialoglycoprotein receptor (ASGP-R) and langerin in human. These proteins are type II membrane proteins with a CTLD ectodomain. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. DC-SIGN is thought to mediate the initial contact between dendritic cells and resting T cells, and may also mediate the rolling of DCs on epithelium. DC-SIGN and DC-SIGNR bind to oligosaccharides present on human tissues, as well as, on pathogens including parasites, bacteria, and viruses. DC-SIGN and DC-SIGNR bind to HIV enhancing viral infection of T cells. DC-SIGN and DC-SIGNR are homotetrameric, and contain four CTLDs stabilized by a coiled coil of alpha helices. The hepatic ASGP-R is an endocytic recycling receptor which binds and internalizes desialylated glycoproteins having a terminal galactose or N-acetylgalactosamine residues on their N-linked carbohydrate chains, via the clathrin-coated pit mediated endocytic pathway, and delivers them to lysosomes for degradation. It has been proposed that glycoproteins bearing terminal Sia (sialic acid) alpha2, 6GalNAc and Sia alpha2, 6Gal are endogenous ligands for ASGP-R and that ASGP-R participates in regulating the relative concentration of serum glycoproteins bearing alpha 2,6-linked Sia. The human ASGP-R is a hetero-oligomer composed of two subunits, both of which are found within this group. Langerin is expressed in a subset of dendritic leukocytes, the Langerhans cells (LC). Langerin induces the formation of Birbeck Granules (BGs) and associates with these BGs following internalization. Langerin binds, in a calcium-dependent manner, to glyco-conjugates containing mannose and related sugars mediating their uptake and degradation. Langerin molecules oligomerize as trimers with three CTLDs held together by a coiled-coil of alpha helices.
pfam Lectin_N 143aa 5e-55
in ref transcript
Hepatic lectin, N-terminal domain.
pfam Lectin_C 109aa 4e-32
in ref transcript
Lectin C-type domain. This family includes both long and short form C-type.
ASL
refseq_ASL.F1 refseq_ASL.R1 130 190
NCBIGene 36.3 435
Single exon skipping, size difference: 60
Exclusion in the protein (no frameshift)
Reference transcript: NM_000048
Changed!
cd Argininosuccinate_lyase 430aa 1e-173
in ref transcript
Argininosuccinate lyase (argininosuccinase, ASAL). This group contains proteins similar to ASAL, a member of the Lyase class I family. Members of this family for the most part catalyze similar beta-elimination reactions in which a C-N or C-O bond is cleaved with the release of fumarate as one of the products. These proteins are active as tetramers. The four active sites of the homotetrameric enzyme are each formed by residues from three different subunits. ASAL is a cytosolic enzyme which catalyzes the reversible breakdown of argininosuccinate to arginine and fumarate during arginine biosynthesis. In ureotleic species ASAL also catalyzes a reaction involved in the production of urea. Included in this group are the major soluble avian eye lens proteins from duck, delta 1 and delta 2 crystallin. Of these two isoforms only delta 2 has retained ASAL activity. These crystallins may have evolved by, gene recruitment of ASAL followed by gene duplication. In humans, mutations in ASAL result in the autosomal recessive disorder argininosuccinic aciduria.
Changed!
TIGR argH 445aa 1e-163
in ref transcript
This model describes argininosuccinate lyase, but may include examples of avian delta crystallins, in which argininosuccinate lyase activity may or may not be present and the biological role is to provide the optically clear cellular protein of the eye lens.
Changed!
PRK PRK00855 452aa 0.0
in ref transcript
argininosuccinate lyase; Provisional.
Changed!
cd Argininosuccinate_lyase 410aa 1e-159
in modified transcript
Changed!
TIGR argH 425aa 1e-149
in modified transcript
Changed!
PRK PRK00855 432aa 1e-167
in modified transcript
ASL
refseq_ASL.F3 refseq_ASL.R3 119 197
NCBIGene 36.3 435
Single exon skipping, size difference: 78
Exclusion in the protein (no frameshift)
Reference transcript: NM_000048
Changed!
cd Argininosuccinate_lyase 430aa 1e-173
in ref transcript
Argininosuccinate lyase (argininosuccinase, ASAL). This group contains proteins similar to ASAL, a member of the Lyase class I family. Members of this family for the most part catalyze similar beta-elimination reactions in which a C-N or C-O bond is cleaved with the release of fumarate as one of the products. These proteins are active as tetramers. The four active sites of the homotetrameric enzyme are each formed by residues from three different subunits. ASAL is a cytosolic enzyme which catalyzes the reversible breakdown of argininosuccinate to arginine and fumarate during arginine biosynthesis. In ureotleic species ASAL also catalyzes a reaction involved in the production of urea. Included in this group are the major soluble avian eye lens proteins from duck, delta 1 and delta 2 crystallin. Of these two isoforms only delta 2 has retained ASAL activity. These crystallins may have evolved by, gene recruitment of ASAL followed by gene duplication. In humans, mutations in ASAL result in the autosomal recessive disorder argininosuccinic aciduria.
Changed!
TIGR argH 445aa 1e-163
in ref transcript
This model describes argininosuccinate lyase, but may include examples of avian delta crystallins, in which argininosuccinate lyase activity may or may not be present and the biological role is to provide the optically clear cellular protein of the eye lens.
Changed!
PRK PRK00855 452aa 0.0
in ref transcript
argininosuccinate lyase; Provisional.
Changed!
cd Argininosuccinate_lyase 404aa 1e-159
in modified transcript
Changed!
TIGR argH 419aa 1e-149
in modified transcript
Changed!
PRK PRK00855 426aa 1e-167
in modified transcript
ASNS
refseq_ASNS.F2 refseq_ASNS.R2 115 393
NCBIGene 36.3 440
Single exon skipping, size difference: 278
Exclusion in 5'UTR
Reference transcript: NM_183356
cd AsnB 190aa 2e-43
in ref transcript
Glutamine amidotransferases class-II (GATase) asparagine synthase_B type. Asparagine synthetase B catalyses the ATP-dependent conversion of aspartate to asparagine. This enzyme is a homodimer, with each monomer composed of a glutaminase domain and a synthetase domain. The N-terminal glutaminase domain hydrolyzes glutamine to glutamic acid and ammonia.
cd Asn_Synthase_B_C 238aa 2e-34
in ref transcript
The C-terminal domain of Asparagine Synthase B. This domain is always found associated n-terminal amidotransferase domain. Family members that contain this domain catalyse the conversion of aspartate to asparagine. Asparagine synthetase B catalyzes the assembly of asparagine from aspartate, Mg(2+)ATP, and glutamine. The three-dimensional architecture of the N-terminal domain of asparagine synthetase B is similar to that observed for glutamine phosphoribosylpyrophosphate amidotransferase while the molecular motif of the C-domain is reminiscent to that observed for GMP synthetase.
TIGR asn_synth_AEB 467aa 1e-134
in ref transcript
This model describes the glutamine-hydrolysing asparagine synthase. A poorly conserved C-terminal extension was removed from the model. Bacterial members of the family tend to have a long, poorly conserved insert lacking from archaeal and eukaryotic sequences. Multiple isozymes have been demonstrated, such as in Bacillus subtilis. Long-branch members of the phylogenetic tree (which typically were also second or third candidate members from their genomes) were removed from the seed alignment and score below trusted cutoff.
TIGR eps_aminotran_1 141aa 1e-12
in ref transcript
The predicted protein-sorting transpeptidase that we call exosortase (see TIGR02602) has distinct subclasses that associated with different types of exopolysaccharide production loci. This model represents a distinct clade among a set of amidotransferases largely annotated (not necessarily accurately) as glutatime-hydrolyzing asparagine synthases. Members of this clade are essentially restricted to the characteristic exopolysaccharide (EPS) regions that contain the exosortase 1 genome (xrtA), in genomes that also have numbers of PEP-CTERM domain (TIGR02595) proteins.
PRK asnB 555aa 1e-136
in ref transcript
asparagine synthetase B; Provisional.
ASPH
refseq_ASPH.F1 refseq_ASPH.R1 178 223
NCBIGene 36.3 444
Single exon skipping, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_020164
Changed!
pfam Asp-B-Hydro_N 71aa 1e-15
in ref transcript
Aspartyl beta-hydroxylase N-terminal region. This family includes the N-terminal regions of the junctin, junctate and aspartyl beta-hydroxylase proteins. Junctate is an integral ER/SR membrane calcium binding protein, which comes from an alternatively spliced form of the same gene that generates aspartyl beta-hydroxylase and junctin. Aspartyl beta-hydroxylase catalyses the post-translational hydroxylation of aspartic acid or asparagine residues contained within epidermal growth factor (EGF) domains of proteins.
pfam Asp-B-Hydro_N 77aa 1e-10
in ref transcript
Changed!
pfam Asp-B-Hydro_N 56aa 2e-18
in modified transcript
ASTN2
refseq_ASTN2.F1 refseq_ASTN2.R1 217 370
NCBIGene 36.3 23245
Single exon skipping, size difference: 153
Exclusion in the protein (no frameshift)
Reference transcript: NM_198187
smart MACPF 184aa 1e-36
in ref transcript
membrane-attack complex / perforin.
ATF3
refseq_ATF3.F2 refseq_ATF3.R2 208 359
NCBIGene 36.2 467
Exon skipping and alternative 3-prime or 5-prime, size difference: 151
Inclusion in the protein causing a new stop codon, Exclusion in the protein causing a frameshift
Reference transcript: NM_001030287
Changed!
smart BRLZ 45aa 7e-05
in ref transcript
basic region leucin zipper.
ATG16L1
refseq_ATG16L1.F1 refseq_ATG16L1.R1 254 311
NCBIGene 36.3 55054
Single exon skipping, size difference: 57
Exclusion in the protein (no frameshift)
Reference transcript: NM_030803
cd WD40 291aa 4e-48
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
pfam ATG16 195aa 2e-56
in ref transcript
Autophagy protein 16 (ATG16). Autophagy is a ubiquitous intracellular degradation system for eukaryotic cells. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. ATG16 (also known as Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate in the yeast autophagy pathway.
smart WD40 36aa 7e-06
in ref transcript
WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
smart WD40 38aa 1e-05
in ref transcript
TIGR propeller_TolB 52aa 2e-05
in ref transcript
The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
pfam SGL 128aa 0.009
in ref transcript
SMP-30/Gluconolaconase/LRE-like region. This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30), gluconolactonase and luciferin-regenerating enzyme (LRE). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 and LRE.
COG COG2319 260aa 6e-25
in ref transcript
FOG: WD40 repeat [General function prediction only].
Changed!
COG Smc 153aa 4e-06
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
Changed!
COG Smc 145aa 5e-06
in modified transcript
ATG4A
refseq_ATG4A.F1 refseq_ATG4A.R1 184 370
NCBIGene 36.3 115201
Exon skipping and alternative 3-prime or 5-prime, size difference: 186
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_052936
Changed!
pfam Peptidase_C54 300aa 1e-123
in ref transcript
Peptidase family C54.
Changed!
pfam Peptidase_C54 238aa 7e-93
in modified transcript
ATG4C
refseq_ATG4C.F1 refseq_ATG4C.R1 143 191
NCBIGene 36.3 84938
Alternative 5-prime, size difference: 48
Exclusion in 5'UTR
Reference transcript: NM_032852
pfam Peptidase_C54 330aa 1e-108
in ref transcript
Peptidase family C54.
ATP11C
refseq_ATP11C.F2 refseq_ATP11C.R2 127 232
NCBIGene 36.3 286410
Single exon skipping, size difference: 105
Exclusion of the stop codon
Reference transcript: NM_173694
TIGR ATPase-Plipid 1028aa 0.0
in ref transcript
This model describes the P-type ATPase responsible for transporting phospholipids from one leaflet of bilayer membranes to the other. These ATPases are found only in eukaryotes.
COG MgtA 659aa 5e-70
in ref transcript
Cation transport ATPase [Inorganic ion transport and metabolism].
COG MgtA 205aa 8e-24
in ref transcript
ATP2A1
refseq_ATP2A1.F1 refseq_ATP2A1.R1 103 145
NCBIGene 36.3 487
Single exon skipping, size difference: 42
Inclusion in the protein causing a new stop codon
Reference transcript: NM_173201
TIGR ATPase-IIA1_Ca 937aa 0.0
in ref transcript
The calcium P-type ATPases have been characterized as Type IIA based on a phylogenetic analysis which distinguishes this group from the Type IIB PMCA calcium pump modelled by TIGR01517. A separate analysis divides Type IIA into sub-types, SERCA and PMR1 the latter of which is modelled by TIGR01522.
pfam Cation_ATPase_N 75aa 1e-23
in ref transcript
Cation transporter/ATPase, N-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport.
COG MgtA 991aa 0.0
in ref transcript
Cation transport ATPase [Inorganic ion transport and metabolism].
ATP2B3
refseq_ATP2B3.F1 refseq_ATP2B3.R1 218 372
NCBIGene 36.3 492
Single exon skipping, size difference: 154
Inclusion in the protein causing a frameshift
Reference transcript: NM_001001344
TIGR ATPase-IIB_Ca 704aa 0.0
in ref transcript
The calcium P-type ATPases have been characterized as Type IIB based on a phylogenetic analysis which distinguishes this group from the Type IIA SERCA calcium pump. A separate analysis divides Type IIA into sub-types (SERCA and PMR1) which are modelled by two corresponding HMMs (TIGR01116 and TIGR01522). This model is well separated from those.
TIGR ATPase-IIB_Ca 266aa 2e-80
in ref transcript
TIGR ATPase-IB_hvy 313aa 2e-13
in ref transcript
This model encompasses two equivalog models for the copper and cadmium-type heavy metal transporting P-type ATPases (TIGR01511 and TIGR01512) as well as those species which score ambiguously between both models. For more comments and references, see the files on TIGR01511 and 01512.
COG MgtA 993aa 1e-154
in ref transcript
Cation transport ATPase [Inorganic ion transport and metabolism].
ATP2B4
refseq_ATP2B4.F1 refseq_ATP2B4.R1 198 376
NCBIGene 36.3 493
Single exon skipping, size difference: 178
Inclusion in the protein causing a frameshift
Reference transcript: NM_001684
cd HAD_like 159aa 0.007
in ref transcript
Haloacid dehalogenase-like hydrolases. The haloacid dehalogenase-like (HAD) superfamily includes L-2-haloacid dehalogenase, epoxide hydrolase, phosphoserine phosphatase, phosphomannomutase, phosphoglycolate phosphatase, P-type ATPase, and many others, all of which use a nucleophilic aspartate in their phosphoryl transfer reaction. All members possess a highly conserved alpha/beta core domain, and many also possess a small cap domain, the fold and function of which is variable. Members of this superfamily are sometimes referred to as belonging to the DDDD superfamily of phosphohydrolases.
TIGR ATPase-IIB_Ca 1037aa 0.0
in ref transcript
The calcium P-type ATPases have been characterized as Type IIB based on a phylogenetic analysis which distinguishes this group from the Type IIA SERCA calcium pump. A separate analysis divides Type IIA into sub-types (SERCA and PMR1) which are modelled by two corresponding HMMs (TIGR01116 and TIGR01522). This model is well separated from those.
COG MgtA 681aa 1e-132
in ref transcript
Cation transport ATPase [Inorganic ion transport and metabolism].
COG MgtA 236aa 3e-28
in ref transcript
ATP5C1
refseq_ATP5C1.F1 refseq_ATP5C1.R1 132 169
NCBIGene 36.3 509
Single exon skipping, size difference: 37
Exclusion of the stop codon
Reference transcript: NM_001001973
TIGR ATPsyn_F1gamma 272aa 6e-95
in ref transcript
This model describes the ATP synthase gamma subunit in bacteria and its equivalents in organelles, namely, mitochondria and chloroplast. F1/F0-ATP synthase is a multisubunit, membrane associated enzyme found in bacteria and organelles of higher eukaryotes, namely, mitochondria and chloroplast. This enzyme is principally involed in the synthesis of ATP from ADP and inorganic phosphate by coupling the energy derived from the proton electrochemical gradient across the biological membrane. A brief description of this multisubunit enzyme complex: F1 and F0 represent two major clusters of subunits. The gamma subunit is the part of F1 cluster. Surrounding the gamma subunit in a cylinder-like structure are three alpha and three subunits in an alternating fashion. This is the central catalytic unit whose different conformations permit the binding of ADP and inorganic phosphate and release of ATP.
PRK PRK05621 273aa 7e-66
in ref transcript
F0F1 ATP synthase subunit gamma; Validated.
ATP5G1
refseq_ATP5G1.F1 refseq_ATP5G1.R1 115 185
NCBIGene 36.3 516
Alternative 5-prime, size difference: 70
Exclusion in 5'UTR
Reference transcript: NM_005175
pfam ATP-synt_C 46aa 1e-10
in ref transcript
ATP synthase subunit C.
PRK PRK07558 46aa 7e-09
in ref transcript
F0F1 ATP synthase subunit C; Validated.
ATP5G2
refseq_ATP5G2.F1 refseq_ATP5G2.R1 213 336
NCBIGene 36.3 517
Alternative 5-prime, size difference: 123
Exclusion in the protein (no frameshift)
Reference transcript: NM_005176
pfam ATP-synt_C 46aa 3e-11
in ref transcript
ATP synthase subunit C.
PRK PRK07558 46aa 3e-09
in ref transcript
F0F1 ATP synthase subunit C; Validated.
ATP5H
refseq_ATP5H.F1 refseq_ATP5H.R1 104 176
NCBIGene 36.3 10476
Single exon skipping, size difference: 72
Exclusion in the protein (no frameshift)
Reference transcript: NM_006356
Changed!
pfam Mt_ATP-synt_D 160aa 1e-60
in ref transcript
ATP synthase D chain, mitochondrial (ATP5H). This family consists of several ATP synthase D chain, mitochondrial (ATP5H) proteins. Subunit d has no extensive hydrophobic sequences, and is not apparently related to any subunit described in the simpler ATP synthases in bacteria and chloroplasts.
Changed!
pfam Mt_ATP-synt_D 136aa 5e-47
in modified transcript
ATP5J
refseq_ATP5J.F2 refseq_ATP5J.R2 251 301
NCBIGene 36.3 522
Alternative 5-prime, size difference: 50
Exclusion of the protein initiation site
Reference transcript: NM_001003701
pfam ATP-synt_F6 92aa 6e-33
in ref transcript
Mitochondrial ATP synthase coupling factor 6. Coupling factor 6 (F6) is a component of mitochondrial ATP synthase which is required for the interactions of the catalytic and proton-translocating segments.
ATP5J2
refseq_ATP5J2.F2 refseq_ATP5J2.R2 100 118
NCBIGene 36.3 9551
Alternative 5-prime, size difference: 18
Exclusion in the protein (no frameshift)
Reference transcript: NM_004889
pfam WRW 76aa 3e-10
in ref transcript
Mitochondrial F1F0-ATP synthase, subunit f. This is a family of small proteins of approximately 110 amino acids, which are highly conserved from nematodes to humans. Some members of the family have been annotated in Swiss-Prot as being the f subunit of mitochondrial F1F0-ATP synthase but this could not be confirmed. The sequence has a well-conserved WRW motif. The exact function of the protein is not known.
ATP5J2
refseq_ATP5J2.F3 refseq_ATP5J2.R3 266 383
NCBIGene 36.3 9551
Single exon skipping, size difference: 117
Exclusion in the protein (no frameshift)
Reference transcript: NM_004889
Changed!
pfam WRW 76aa 3e-10
in ref transcript
Mitochondrial F1F0-ATP synthase, subunit f. This is a family of small proteins of approximately 110 amino acids, which are highly conserved from nematodes to humans. Some members of the family have been annotated in Swiss-Prot as being the f subunit of mitochondrial F1F0-ATP synthase but this could not be confirmed. The sequence has a well-conserved WRW motif. The exact function of the protein is not known.
Changed!
pfam WRW 37aa 0.001
in modified transcript
ATP5S
refseq_ATP5S.F1 refseq_ATP5S.R1 179 351
NCBIGene 36.3 27109
Single exon skipping, size difference: 172
Exclusion in the protein causing a frameshift
Reference transcript: NM_001003803
ATP6V0A4
refseq_ATP6V0A4.F1 refseq_ATP6V0A4.R1 292 395
NCBIGene 36.3 50617
Single exon skipping, size difference: 103
Exclusion in 5'UTR
Reference transcript: NM_020632
pfam V_ATPase_I 806aa 0.0
in ref transcript
V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryotic cells for example yeast central vacuoles, clathrin-coated and synaptic vesicles. They have important roles in membrane trafficking processes. The 116kDa subunit (subunit a) in the V-type ATPase is part of the V0 functional domain responsible for proton transport. The a subunit is a transmembrane glycoprotein with multiple putative transmembrane helices it has a hydrophilic amino terminal and a hydrophobic carboxy terminal. It has roles in proton transport and assembly of the V-type ATPase complex. This subunit is encoded by two homologous gene in yeast VPH1 and STV1.
COG NtpI 595aa 5e-45
in ref transcript
Archaeal/vacuolar-type H+-ATPase subunit I [Energy production and conversion].
COG NtpI 115aa 5e-22
in ref transcript
ATP6V0B
refseq_ATP6V0B.F2 refseq_ATP6V0B.R2 164 213
NCBIGene 36.3 533
Single exon skipping, size difference: 49
Exclusion in the protein causing a frameshift
Reference transcript: NM_004047
Changed!
TIGR V_ATP_synt_C 108aa 7e-06
in ref transcript
The principal role V-ATPases are the acidification of intracellular compartments of eukaryotic cells.
Changed!
pfam ATP-synt_C 62aa 0.002
in ref transcript
ATP synthase subunit C.
Changed!
COG AtpE 64aa 4e-04
in ref transcript
F0F1-type ATP synthase, subunit c/Archaeal/vacuolar-type H+-ATPase, subunit K [Energy production and conversion].
ATP6V1E1
refseq_ATP6V1E1.F1 refseq_ATP6V1E1.R1 149 215
NCBIGene 36.3 529
Single exon skipping, size difference: 66
Exclusion in the protein (no frameshift)
Reference transcript: NM_001696
Changed!
pfam vATP-synt_E 199aa 4e-33
in ref transcript
ATP synthase (E/31 kDa) subunit. This family includes the vacuolar ATP synthase E subunit, as well as the archaebacterial ATP synthase E subunit.
COG NtpE 146aa 4e-05
in ref transcript
Archaeal/vacuolar-type H+-ATPase subunit E [Energy production and conversion].
Changed!
pfam vATP-synt_E 188aa 5e-34
in modified transcript
ATP6V1E1
refseq_ATP6V1E1.F3 refseq_ATP6V1E1.R3 246 336
NCBIGene 36.3 529
Single exon skipping, size difference: 90
Exclusion in the protein (no frameshift)
Reference transcript: NM_001696
Changed!
pfam vATP-synt_E 199aa 4e-33
in ref transcript
ATP synthase (E/31 kDa) subunit. This family includes the vacuolar ATP synthase E subunit, as well as the archaebacterial ATP synthase E subunit.
Changed!
COG NtpE 146aa 4e-05
in ref transcript
Archaeal/vacuolar-type H+-ATPase subunit E [Energy production and conversion].
Changed!
pfam vATP-synt_E 169aa 2e-28
in modified transcript
Changed!
COG NtpE 116aa 7e-05
in modified transcript
ATP6V1G2
refseq_ATP6V1G2.F2 refseq_ATP6V1G2.R2 122 276
NCBIGene 36.3 534
Alternative 5-prime, size difference: 154
Exclusion of the protein initiation site
Reference transcript: NM_130463
Changed!
pfam V-ATPase_G 105aa 1e-13
in ref transcript
Vacuolar (H+)-ATPase G subunit. This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit. V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins in several organelles. They are also found in the plasma membranes of some specialised cells. V-ATPases consist of peripheral (V1) and membrane integral (V0) heteromultimeric complexes. The G subunit is part of the V1 subunit, but is also thought to be strongly attached to the V0 complex. It may be involved in the coupling of ATP degradation to H+ translocation.
Changed!
pfam V-ATPase_G 66aa 3e-11
in modified transcript
ATP6V1G3
refseq_ATP6V1G3.F1 refseq_ATP6V1G3.R1 119 165
NCBIGene 36.3 127124
Single exon skipping, size difference: 46
Inclusion in the protein causing a frameshift
Reference transcript: NM_133262
Changed!
pfam V-ATPase_G 105aa 3e-18
in ref transcript
Vacuolar (H+)-ATPase G subunit. This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit. V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins in several organelles. They are also found in the plasma membranes of some specialised cells. V-ATPases consist of peripheral (V1) and membrane integral (V0) heteromultimeric complexes. The G subunit is part of the V1 subunit, but is also thought to be strongly attached to the V0 complex. It may be involved in the coupling of ATP degradation to H+ translocation.
Changed!
pfam V-ATPase_G 25aa 2e-04
in modified transcript
ATP6V1H
refseq_ATP6V1H.F2 refseq_ATP6V1H.R2 181 235
NCBIGene 36.3 51606
Single exon skipping, size difference: 54
Exclusion in the protein (no frameshift)
Reference transcript: NM_015941
Changed!
cd VATPase_H 449aa 1e-176
in ref transcript
VATPase_H, regulatory vacuolar ATP synthase subunit H (Vma13p); activation component of the peripheral V1 complex of V-ATPase, a heteromultimeric enzyme which uses ATP to actively transport protons into organelles and extracellular compartments. The topology is that of a superhelical spiral, in part the geometry is similar to superhelices composed of armadillo repeat motifs, as found in importins for example.
Changed!
pfam V-ATPase_H 415aa 1e-178
in ref transcript
V-ATPase subunit H.
COG VMA13 261aa 6e-49
in ref transcript
Vacuolar H+-ATPase V1 sector, subunit H [Energy production and conversion].
Changed!
cd VATPase_H 431aa 1e-177
in modified transcript
Changed!
pfam V-ATPase_H 397aa 1e-178
in modified transcript
ATP7B
refseq_ATP7B.F1 refseq_ATP7B.R1 164 299
NCBIGene 36.3 540
Single exon skipping, size difference: 135
Exclusion in the protein (no frameshift)
Reference transcript: NM_000053
cd HMA 63aa 1e-14
in ref transcript
Heavy-metal-associated domain (HMA) is a conserved domain of approximately 30 amino acid residues found in a number of proteins that transport or detoxify heavy metals, for example, the CPx-type heavy metal ATPases and copper chaperones. HMA domain contains two cysteine residues that are important in binding and transfer of metal ions, such as copper, cadmium, cobalt and zinc. In the case of copper, stoichiometry of binding is one Cu+ ion per binding domain. Repeats of the HMA domain in copper chaperone has been associated with Menkes/Wilson disease due to binding of multiple copper ions.
cd HMA 64aa 1e-12
in ref transcript
cd HMA 63aa 5e-12
in ref transcript
cd HMA 59aa 8e-12
in ref transcript
cd HMA 64aa 1e-11
in ref transcript
cd HMA 64aa 2e-11
in ref transcript
cd HAD_like 106aa 0.009
in ref transcript
Haloacid dehalogenase-like hydrolases. The haloacid dehalogenase-like (HAD) superfamily includes L-2-haloacid dehalogenase, epoxide hydrolase, phosphoserine phosphatase, phosphomannomutase, phosphoglycolate phosphatase, P-type ATPase, and many others, all of which use a nucleophilic aspartate in their phosphoryl transfer reaction. All members possess a highly conserved alpha/beta core domain, and many also possess a small cap domain, the fold and function of which is variable. Members of this superfamily are sometimes referred to as belonging to the DDDD superfamily of phosphohydrolases.
Changed!
TIGR ATPase-IB1_Cu 646aa 0.0
in ref transcript
A member from Halobacterium is annotated as "molybdenum-binding protein" although no evidence can be found for this classification.
pfam HMA 64aa 2e-15
in ref transcript
Heavy-metal-associated domain.
pfam HMA 65aa 5e-14
in ref transcript
pfam HMA 64aa 9e-14
in ref transcript
pfam HMA 65aa 3e-13
in ref transcript
pfam HMA 59aa 6e-13
in ref transcript
pfam HMA 64aa 7e-13
in ref transcript
Changed!
COG ZntA 791aa 0.0
in ref transcript
Cation transport ATPase [Inorganic ion transport and metabolism].
PRK copA 199aa 3e-09
in ref transcript
copper exporting ATPase; Provisional.
COG CopZ 65aa 1e-08
in ref transcript
Copper chaperone [Inorganic ion transport and metabolism].
PRK copA 172aa 2e-07
in ref transcript
PRK copA 145aa 9e-07
in ref transcript
PRK copA 170aa 6e-06
in ref transcript
Changed!
TIGR ATPase-IB1_Cu 601aa 0.0
in modified transcript
Changed!
COG ZntA 746aa 0.0
in modified transcript
ATRX
refseq_ATRX.F1 refseq_ATRX.R1 112 235
NCBIGene 36.2 546
Single exon skipping, size difference: 123
Inclusion in the protein causing a new stop codon
Reference transcript: NM_000489
Changed!
cd HELICc 147aa 4e-16
in ref transcript
Helicase superfamily c-terminal domain; associated with DEXDc-, DEAD-, and DEAH-box proteins, yeast initiation factor 4A, Ski2p, and Hepatitis C virus NS3 helicases; this domain is found in a wide variety of helicases and helicase related proteins; may not be an autonomously folding unit, but an integral part of the helicase; 4 helicase superfamilies at present according to the organization of their signature motifs; all helicases share the ability to unwind nucleic acid duplexes with a distinct directional polarity; they utilize the free energy from nucleoside triphosphate hydrolysis to fuel their translocation along DNA, unwinding the duplex in the process.
Changed!
cd DEXDc 163aa 4e-11
in ref transcript
DEAD-like helicases superfamily. A diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.
Changed!
pfam SNF2_N 317aa 9e-84
in ref transcript
SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin unwinding (e.g., ISWI) as well as a variety of other proteins with little functional information (e.g., lodestar, ETL1).
Changed!
pfam Helicase_C 77aa 4e-16
in ref transcript
Helicase conserved C-terminal domain. The Prosite family is restricted to DEAD/H helicases, whereas this domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase.
Changed!
COG HepA 321aa 1e-27
in ref transcript
Superfamily II DNA/RNA helicases, SNF2 family [Transcription / DNA replication, recombination, and repair].
Changed!
COG HepA 194aa 6e-22
in ref transcript
ATXN3
refseq_ATXN3.F1 refseq_ATXN3.R1 240 326
NCBIGene 36.2 4287
Single exon skipping, size difference: 86
Exclusion in the protein causing a frameshift
Reference transcript: NM_004993
Changed!
pfam Josephin 161aa 2e-69
in ref transcript
Josephin.
Changed!
pfam Josephin 71aa 2e-25
in modified transcript
AURKA
refseq_AURKA.F2 refseq_AURKA.R2 133 341
NCBIGene 36.3 6790
Multiple exon skipping, size difference: 208
Exclusion in 5'UTR, Exclusion in 5'UTR
Reference transcript: NM_198433
cd S_TKc 252aa 1e-80
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
smart S_TKc 240aa 2e-80
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
PTZ PTZ00263 244aa 4e-53
in ref transcript
protein kinase A catalytic subunit; Provisional.
AURKA
refseq_AURKA.F3 refseq_AURKA.R3 97 111
NCBIGene 36.3 6790
Alternative 5-prime, size difference: 14
Exclusion in 5'UTR
Reference transcript: NM_198434
cd S_TKc 252aa 1e-80
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
smart S_TKc 240aa 2e-80
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
PTZ PTZ00263 244aa 4e-53
in ref transcript
protein kinase A catalytic subunit; Provisional.
AURKC
refseq_AURKC.F1 refseq_AURKC.R1 208 400
NCBIGene 36.3 6795
Alternative 5-prime, size difference: 192
Exclusion of the protein initiation site
Reference transcript: NM_001015878
cd S_TKc 252aa 7e-75
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
smart S_TKc 240aa 9e-75
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
PTZ PTZ00263 262aa 2e-50
in ref transcript
protein kinase A catalytic subunit; Provisional.
AXL
refseq_AXL.F1 refseq_AXL.R1 129 156
NCBIGene 36.3 558
Single exon skipping, size difference: 27
Exclusion in the protein (no frameshift)
Reference transcript: NM_021913
cd PTKc_Axl 272aa 1e-167
in ref transcript
Catalytic Domain of the Protein Tyrosine Kinase, Axl. Protein Tyrosine Kinase (PTK) family; Axl; catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. Axl is a member of the Axl subfamily, which is composed of receptor tyr kinases (RTKs) containing an extracellular ligand-binding region with two immunoglobulin-like domains followed by two fibronectin type III repeats, a transmembrane segment, and an intracellular catalytic domain. Binding to their ligands, Gas6 and protein S, leads to receptor dimerization, autophosphorylation, activation, and intracellular signaling. Axl is widely expressed in a variety of organs and cells including epithelial, mesenchymal, hematopoietic, as well as non-transformed cells. Axl signaling is important in many cellular functions such as survival, anti-apoptosis, proliferation, migration, and adhesion. Axl was originally isolated from patients with chronic myelogenous leukemia and a chronic myeloproliferative disorder. Axl is overexpressed in many human cancers including colon, squamous cell, thyroid, breast, and lung carcinomas.
cd FN3 104aa 2e-06
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd IG 77aa 2e-05
in ref transcript
Immunoglobulin domain family; members are components of immunoglobulins, neuroglia, cell surface glycoproteins, such as, T-cell receptors, CD2, CD4, CD8, and membrane glycoproteins, such as, butyrophilin and chondroitin sulfate proteoglycan core protein. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IGcam 84aa 0.006
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
pfam fn3 97aa 1e-04
in ref transcript
pfam I-set 85aa 0.008
in ref transcript
Immunoglobulin I-set domain.
COG SPS1 280aa 7e-23
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
AZI1
refseq_AZI1.F2 refseq_AZI1.R2 278 386
NCBIGene 36.3 22994
Single exon skipping, size difference: 108
Exclusion in the protein (no frameshift)
Reference transcript: NM_014984
TIGR SMC_prok_B 223aa 5e-09
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
COG Smc 243aa 5e-08
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
AZIN1
refseq_AZIN1.F2 refseq_AZIN1.R2 136 282
NCBIGene 36.3 51582
Single exon skipping, size difference: 146
Exclusion in 5'UTR
Reference transcript: NM_015878
pfam Orn_Arg_deC_N 236aa 3e-62
in ref transcript
Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold.
pfam Orn_DAP_Arg_deC 125aa 2e-25
in ref transcript
Pyridoxal-dependent decarboxylase, C-terminal sheet domain. These pyridoxal-dependent decarboxylases act on ornithine, lysine, arginine and related substrates.
COG LysA 367aa 2e-51
in ref transcript
Diaminopimelate decarboxylase [Amino acid transport and metabolism].
B3GALNT1
refseq_B3GALNT1.F1 refseq_B3GALNT1.R1 247 340
NCBIGene 36.3 8706
Single exon skipping, size difference: 93
Exclusion in 5'UTR
Reference transcript: NM_001038628
pfam Galactosyl_T 194aa 4e-55
in ref transcript
Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta- galactosyltrans ferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcNAc terminal chains in the synthesis of the lacto-series oligosaccharides types 1 and 2.
B3GALNT1
refseq_B3GALNT1.F3 refseq_B3GALNT1.R3 217 336
NCBIGene 36.3 8706
Single exon skipping, size difference: 119
Exclusion in 5'UTR
Reference transcript: NM_001038628
pfam Galactosyl_T 194aa 4e-55
in ref transcript
Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta- galactosyltrans ferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcNAc terminal chains in the synthesis of the lacto-series oligosaccharides types 1 and 2.
B3GALNT1
refseq_B3GALNT1.F5 refseq_B3GALNT1.R5 100 211
NCBIGene 36.3 8706
Single exon skipping, size difference: 111
Exclusion in 5'UTR
Reference transcript: NM_001038628
pfam Galactosyl_T 194aa 4e-55
in ref transcript
Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta- galactosyltrans ferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcNAc terminal chains in the synthesis of the lacto-series oligosaccharides types 1 and 2.
B3GALT5
refseq_B3GALT5.F2 refseq_B3GALT5.R2 274 383
NCBIGene 36.3 10317
Single exon skipping, size difference: 109
Exclusion in 5'UTR
Reference transcript: NM_033171
pfam Galactosyl_T 191aa 3e-44
in ref transcript
Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta- galactosyltrans ferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcNAc terminal chains in the synthesis of the lacto-series oligosaccharides types 1 and 2.
B4GALT4
refseq_B4GALT4.F1 refseq_B4GALT4.R1 228 311
NCBIGene 36.3 8702
Single exon skipping, size difference: 83
Exclusion in 5'UTR
Reference transcript: NM_212543
cd b4GalT 218aa 1e-102
in ref transcript
Beta-4-Galactosyltransferase is involved in the formation of the poly-N-acetyllactosamine core structures present in glycoproteins and glycosphingolipids. Beta-4-Galactosyltransferase transfers galactose from uridine diphosphogalactose to the terminal beta-N-acetylglucosamine residues, hereby forming the poly-N-acetyllactosamine core structures present in glycoproteins and glycosphingolipids. At least seven homologous beta-4-galactosyltransferase isoforms have been identified that use different types of glycoproteins and glycolipids as substrates. Of the seven identified members of the beta-1,4-galactosyltransferase subfamily (beta1,4-Gal-T1 to -T7), b1,4-Gal-T1 is most characterized (biochemically). It is a Golgi-resident type II membrane enzyme with a cytoplasmic domain, membrane spanning region, and a stem region and catalytic domain facing the lumen.
pfam Galactosyl_T_2 268aa 1e-127
in ref transcript
Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys; all three of which are possessed by one sequence in some cases. EC:2.4.1.90, N-acetyllactosamine synthase; EC:2.4.1.38, Beta-N-acetylglucosaminyl-glycopeptide beta-1,4- galactosyltransferase; and EC:2.4.1.22 Lactose synthase. Note that N-acetyllactosamine synthase is a component of Lactose synthase along with alpha-lactalbumin, in the absence of alpha-lactalbumin EC:2.4.1.90 is the catalysed reaction.
BAALC
refseq_BAALC.F1 refseq_BAALC.R1 215 382
NCBIGene 36.3 79870
Single exon skipping, size difference: 167
Exclusion in the protein causing a frameshift
Reference transcript: NM_024812
pfam BAALC_N 31aa 1e-11
in ref transcript
BAALC N-terminus. This family represents the N-terminal region of the mammalian BAALC proteins. BAALC (brain and acute leukaemia, cytoplasmic), that is highly conserved among mammals but evidently absent from lower organisms. Two isoforms are specifically expressed in neuroectoderm-derived tissues, but not in tumours or cancer cell lines of non-neural tissue origin. It has been shown that blasts from a subset of patients with acute leukaemia greatly overexpress eight different BAALC transcripts, resulting in five protein isoforms. Among patients with acute myeloid leukaemia, those overexpressing BAALC show distinctly poor prognosis, pointing to a key role of the BAALC products in leukaemia. It has been suggested that BAALC is a gene implicated in both neuroectodermal and hematopoietic cell functions.
BACE1
refseq_BACE1.F1 refseq_BACE1.R1 113 245
NCBIGene 36.3 23621
Alternative 5-prime, size difference: 132
Exclusion in the protein (no frameshift)
Reference transcript: NM_012104
Changed!
cd beta_secretase_like 366aa 0.0
in ref transcript
Beta-secretase, aspartic-acid protease important in the pathogenesis of Alzheimer's disease. Beta-secretase also called BACE (beta-site of APP cleaving enzyme) or memapsin-2. Beta-secretase is an aspartic-acid protease important in the pathogenesis of Alzheimer's disease, and in the formation of myelin sheaths in peripheral nerve cells. It cleaves amyloid precursor protein (APP) to reveal the N-terminus of the beta-amyloid peptides. The beta-amyloid peptides are the major components of the amyloid plaques formed in the brain of patients with Alzheimer's disease (AD). Since BACE mediates one of the cleavages responsible for generation of AD, it is regarded as a potential target for pharmacological intervention in AD. Beta-secretase is a member of pepsin family of aspartic proteases. Same as other aspartic proteases, beta-secretase is a bilobal enzyme, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The enzymes specifically cleave bonds in peptides which have at least six residues in length with hydrophobic residues in both the P1 and P1' positions. The active site is located at the groove formed by the two lobes, with an extended loop projecting over the cleft to form an 11-residue flap, which encloses substrates and inhibitors in the active site. Specificity is determined by nearest-neighbor hydrophobic residues surrounding the catalytic aspartates, and by three residues in the flap. The enzymes are mostly secreted from cells as inactive proenzymes that activate autocatalytically at acidic pH. This family of aspartate proteases is classified by MEROPS as the peptidase family A1 (pepsin A, clan AA).
Changed!
pfam Asp 342aa 5e-36
in ref transcript
Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), which are much smaller and appear to be homologous to a single domain of the eukaryotic asp proteases.
Changed!
PTZ PTZ00165 371aa 1e-15
in ref transcript
aspartyl protease; Provisional.
Changed!
cd beta_secretase_like 322aa 1e-178
in modified transcript
Changed!
pfam Asp 298aa 2e-26
in modified transcript
Changed!
PTZ PTZ00165 327aa 2e-10
in modified transcript
BACE1
refseq_BACE1.F4 refseq_BACE1.R4 284 359
NCBIGene 36.3 23621
Alternative 3-prime, size difference: 75
Exclusion in the protein (no frameshift)
Reference transcript: NM_012104
Changed!
cd beta_secretase_like 366aa 0.0
in ref transcript
Beta-secretase, aspartic-acid protease important in the pathogenesis of Alzheimer's disease. Beta-secretase also called BACE (beta-site of APP cleaving enzyme) or memapsin-2. Beta-secretase is an aspartic-acid protease important in the pathogenesis of Alzheimer's disease, and in the formation of myelin sheaths in peripheral nerve cells. It cleaves amyloid precursor protein (APP) to reveal the N-terminus of the beta-amyloid peptides. The beta-amyloid peptides are the major components of the amyloid plaques formed in the brain of patients with Alzheimer's disease (AD). Since BACE mediates one of the cleavages responsible for generation of AD, it is regarded as a potential target for pharmacological intervention in AD. Beta-secretase is a member of pepsin family of aspartic proteases. Same as other aspartic proteases, beta-secretase is a bilobal enzyme, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The enzymes specifically cleave bonds in peptides which have at least six residues in length with hydrophobic residues in both the P1 and P1' positions. The active site is located at the groove formed by the two lobes, with an extended loop projecting over the cleft to form an 11-residue flap, which encloses substrates and inhibitors in the active site. Specificity is determined by nearest-neighbor hydrophobic residues surrounding the catalytic aspartates, and by three residues in the flap. The enzymes are mostly secreted from cells as inactive proenzymes that activate autocatalytically at acidic pH. This family of aspartate proteases is classified by MEROPS as the peptidase family A1 (pepsin A, clan AA).
Changed!
pfam Asp 342aa 5e-36
in ref transcript
Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), which are much smaller and appear to be homologous to a single domain of the eukaryotic asp proteases.
Changed!
PTZ PTZ00165 371aa 1e-15
in ref transcript
aspartyl protease; Provisional.
Changed!
cd beta_secretase_like 341aa 0.0
in modified transcript
Changed!
pfam Asp 317aa 2e-31
in modified transcript
Changed!
PTZ PTZ00013 319aa 5e-12
in modified transcript
plasmepsin 4 (PM4); Provisional.
BAIAP2
refseq_BAIAP2.F1 refseq_BAIAP2.R1 102 148
NCBIGene 36.3 10458
Single exon skipping, size difference: 46
Inclusion in the protein causing a new stop codon
Reference transcript: NM_017451
cd SH3 57aa 8e-07
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
pfam IMD 220aa 5e-90
in ref transcript
IRSp53/MIM homology domain. The N-terminal predicted helical stretch of the insulin receptor tyrosine kinase substrate p53 (IRSp53) is an evolutionary conserved F-actin bundling domain involved in filopodium formation. The domain has been named IMD after the IRSp53 and missing in metastasis (MIM) proteins in which it occurs. Filopodium-inducing IMD activity is regulated by Cdc42 and Rac1 and is SH3-independent.
smart SH3 61aa 8e-07
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
Changed!
COG SbcC 220aa 0.008
in ref transcript
ATPase involved in DNA repair [DNA replication, recombination, and repair].
BAT1
refseq_BAT1.F1 refseq_BAT1.R1 233 363
NCBIGene 36.3 7919
Alternative 3-prime, size difference: 130
Inclusion in 5'UTR
Reference transcript: NM_080598
cd DEADc 204aa 9e-65
in ref transcript
DEAD-box helicases. A diverse family of proteins involved in ATP-dependent RNA unwinding, needed in a variety of cellular processes including splicing, ribosome biogenesis and RNA degradation. The name derives from the sequence of the Walker B motif (motif II). This domain contains the ATP- binding region.
cd HELICc 127aa 1e-25
in ref transcript
Helicase superfamily c-terminal domain; associated with DEXDc-, DEAD-, and DEAH-box proteins, yeast initiation factor 4A, Ski2p, and Hepatitis C virus NS3 helicases; this domain is found in a wide variety of helicases and helicase related proteins; may not be an autonomously folding unit, but an integral part of the helicase; 4 helicase superfamilies at present according to the organization of their signature motifs; all helicases share the ability to unwind nucleic acid duplexes with a distinct directional polarity; they utilize the free energy from nucleoside triphosphate hydrolysis to fuel their translocation along DNA, unwinding the duplex in the process.
pfam DEAD 168aa 3e-42
in ref transcript
DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA splicing, ribosome biogenesis, nucleocytoplasmic transport, translation, RNA decay and organellar gene expression.
pfam Helicase_C 78aa 1e-18
in ref transcript
Helicase conserved C-terminal domain. The Prosite family is restricted to DEAD/H helicases, whereas this domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase.
COG SrmB 383aa 8e-98
in ref transcript
Superfamily II DNA and RNA helicases [DNA replication, recombination, and repair / Transcription / Translation, ribosomal structure and biogenesis].
BAT3
refseq_BAT3.F1 refseq_BAT3.R1 116 140
NCBIGene 36.3 7917
Alternative 5-prime, size difference: 24
Exclusion in 5'UTR
Reference transcript: NM_004639
cd Scythe_N 71aa 5e-24
in ref transcript
Scythe protein (also known as Bat3) is an apoptotic regulator that is highly conserved in eukaryotes and contains a ubiquitin-like domain near its N-terminus. Scythe binds reaper, a potent apoptotic inducer, and Scythe/Reaper are thought to signal apoptosis, in part through regulating the folding and activity of apoptotic signaling molecules.
smart UBQ 62aa 3e-14
in ref transcript
Ubiquitin homologues. Ubiquitin-mediated proteolysis is involved in the regulated turnover of proteins required for controlling cell cycle progression.
PTZ PTZ00044 69aa 3e-05
in ref transcript
ubiquitin; Provisional.
BAT3
refseq_BAT3.F3 refseq_BAT3.R3 102 120
NCBIGene 36.3 7917
Alternative 5-prime and 3-prime, size difference: 18
Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_004639
cd Scythe_N 71aa 5e-24
in ref transcript
Scythe protein (also known as Bat3) is an apoptotic regulator that is highly conserved in eukaryotes and contains a ubiquitin-like domain near its N-terminus. Scythe binds reaper, a potent apoptotic inducer, and Scythe/Reaper are thought to signal apoptosis, in part through regulating the folding and activity of apoptotic signaling molecules.
smart UBQ 62aa 3e-14
in ref transcript
Ubiquitin homologues. Ubiquitin-mediated proteolysis is involved in the regulated turnover of proteins required for controlling cell cycle progression.
PTZ PTZ00044 69aa 3e-05
in ref transcript
ubiquitin; Provisional.
BAX
refseq_BAX.F1 refseq_BAX.R1 134 173
NCBIGene 36.3 581
Alternative 3-prime, size difference: 39
Exclusion in the protein (no frameshift)
Reference transcript: NM_138761
Changed!
cd Bcl-2_like 140aa 7e-33
in ref transcript
Apoptosis regulator proteins of the Bcl-2 family, named after B-cell lymphoma 2. This alignment model spans what have been described as Bcl-2 homology regions BH1, BH2, BH3, and BH4. Many members of this family have an additional C-terminal transmembrane segment. Some homologous proteins, which are not included in this model, may miss either the BH4 (Bax, Bak) or the BH2 (Bcl-X(S)) region, and some appear to only share the BH3 region (Bik, Bim, Bad, Bid, Egl-1). This family is involved in the regulation of the outer mitochondrial membrane's permeability and in promoting or preventing the release of apoptogenic factors, which in turn may trigger apoptosis by activating caspases. Bcl-2 and the closely related Bcl-X(L) are anti-apoptotic key regulators of programmed cell death. They are assumed to function via heterodimeric protein-protein interactions, binding pro-apoptotic proteins such as Bad (BCL2-antagonist of cell death), Bid, and Bim, by specifically interacting with their BH3 regions. Interfering with this heterodimeric interaction via small-molecule inhibitors may prove effective in targeting various cancers. This family also includes the Caenorhabditis elegans Bcl-2 homolog CED-9, which binds to CED-4, the C. Elegans homolog of mammalian Apaf-1. Apaf-1, however, does not seem to be inhibited by Bcl-2 directly.
Changed!
TIGR bcl-2 188aa 2e-63
in ref transcript
in artificial membranes at acidic pH, proapoptotic Bcl-2 family proteins (including Bax and Bak) probably induce the mitochondrial permeability transition and cytochrome c release by interacting with permeability transition pores, the most important component for pore fomation of which is VDAC.
Changed!
cd Bcl-2_like 134aa 7e-31
in modified transcript
Changed!
TIGR bcl-2 175aa 2e-55
in modified transcript
BAX
refseq_BAX.F3 refseq_BAX.R3 128 226
NCBIGene 36.3 581
Single exon skipping, size difference: 98
Inclusion in the protein causing a frameshift
Reference transcript: NM_004324
Changed!
cd Bcl-2_like 139aa 5e-31
in ref transcript
Apoptosis regulator proteins of the Bcl-2 family, named after B-cell lymphoma 2. This alignment model spans what have been described as Bcl-2 homology regions BH1, BH2, BH3, and BH4. Many members of this family have an additional C-terminal transmembrane segment. Some homologous proteins, which are not included in this model, may miss either the BH4 (Bax, Bak) or the BH2 (Bcl-X(S)) region, and some appear to only share the BH3 region (Bik, Bim, Bad, Bid, Egl-1). This family is involved in the regulation of the outer mitochondrial membrane's permeability and in promoting or preventing the release of apoptogenic factors, which in turn may trigger apoptosis by activating caspases. Bcl-2 and the closely related Bcl-X(L) are anti-apoptotic key regulators of programmed cell death. They are assumed to function via heterodimeric protein-protein interactions, binding pro-apoptotic proteins such as Bad (BCL2-antagonist of cell death), Bid, and Bim, by specifically interacting with their BH3 regions. Interfering with this heterodimeric interaction via small-molecule inhibitors may prove effective in targeting various cancers. This family also includes the Caenorhabditis elegans Bcl-2 homolog CED-9, which binds to CED-4, the C. Elegans homolog of mammalian Apaf-1. Apaf-1, however, does not seem to be inhibited by Bcl-2 directly.
Changed!
TIGR bcl-2 158aa 2e-51
in ref transcript
in artificial membranes at acidic pH, proapoptotic Bcl-2 family proteins (including Bax and Bak) probably induce the mitochondrial permeability transition and cytochrome c release by interacting with permeability transition pores, the most important component for pore fomation of which is VDAC.
Changed!
cd Bcl-2_like 100aa 6e-18
in modified transcript
Changed!
TIGR bcl-2 120aa 3e-35
in modified transcript
BAX
refseq_BAX.F4 bax.r.7 105 252
NCBIGene 36.3 581
Single exon skipping, size difference: 147
Exclusion in the protein (no frameshift)
Reference transcript: NM_004324
Changed!
cd Bcl-2_like 139aa 5e-31
in ref transcript
Apoptosis regulator proteins of the Bcl-2 family, named after B-cell lymphoma 2. This alignment model spans what have been described as Bcl-2 homology regions BH1, BH2, BH3, and BH4. Many members of this family have an additional C-terminal transmembrane segment. Some homologous proteins, which are not included in this model, may miss either the BH4 (Bax, Bak) or the BH2 (Bcl-X(S)) region, and some appear to only share the BH3 region (Bik, Bim, Bad, Bid, Egl-1). This family is involved in the regulation of the outer mitochondrial membrane's permeability and in promoting or preventing the release of apoptogenic factors, which in turn may trigger apoptosis by activating caspases. Bcl-2 and the closely related Bcl-X(L) are anti-apoptotic key regulators of programmed cell death. They are assumed to function via heterodimeric protein-protein interactions, binding pro-apoptotic proteins such as Bad (BCL2-antagonist of cell death), Bid, and Bim, by specifically interacting with their BH3 regions. Interfering with this heterodimeric interaction via small-molecule inhibitors may prove effective in targeting various cancers. This family also includes the Caenorhabditis elegans Bcl-2 homolog CED-9, which binds to CED-4, the C. Elegans homolog of mammalian Apaf-1. Apaf-1, however, does not seem to be inhibited by Bcl-2 directly.
Changed!
TIGR bcl-2 158aa 2e-51
in ref transcript
in artificial membranes at acidic pH, proapoptotic Bcl-2 family proteins (including Bax and Bak) probably induce the mitochondrial permeability transition and cytochrome c release by interacting with permeability transition pores, the most important component for pore fomation of which is VDAC.
Changed!
cd Bcl-2_like 81aa 2e-24
in modified transcript
Changed!
TIGR bcl-2 109aa 1e-30
in modified transcript
BAZ1A
refseq_BAZ1A.F1 refseq_BAZ1A.R1 207 303
NCBIGene 36.3 11177
Single exon skipping, size difference: 96
Exclusion in the protein (no frameshift)
Reference transcript: NM_013448
cd Bromo_Acf1_like 113aa 4e-51
in ref transcript
Bromodomain; Acf1_like or BAZ1A_like subfamily. Bromo adjacent to zinc finger 1A (BAZ1A) was identified as a novel human bromodomain gene by cDNA library screening. The Drosophila homologue, Acf1, is part of the CHRAC (chromatin accessibility complex) and regulates ISWI-induced nucleosome remodeling. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
cd BAH_plant_2 24aa 0.001
in ref transcript
BAH, or Bromo Adjacent Homology domain, plant-specific sub-family with unknown function. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
pfam WAC_Acf1_DNA_bd 101aa 2e-39
in ref transcript
ATP-utilising chromatin assembly and remodelling N-terminal. ACF (for ATP-utilising chromatin assembly and remodelling factor) is a chromatin-remodelling complex that catalyses the ATP-dependent assembly of periodic nucleosome arrays. The WAC (WSTF/Acf1/cbp146) domain is an approximately 110-residue module present at the N-termini of Acf1-related proteins in a variety of organisms. The DNA-binding region of Acf1 includes the WAC domain, which is necessary for the efficient binding of ACF complex to DNA.
smart BROMO 92aa 8e-26
in ref transcript
bromo domain.
pfam DDT 63aa 3e-16
in ref transcript
DDT domain. This domain is approximately 60 residues in length, and is predicted to be a DNA binding domain. The DDT domain is named after (DNA binding homeobox and Different Transcription factors). It is exclusively associated with nuclear domains, and is thought to be arranged into three alpha helices.
pfam PHD 46aa 2e-13
in ref transcript
PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
Changed!
COG COG5076 181aa 3e-14
in ref transcript
Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription].
Changed!
COG COG5076 143aa 2e-14
in modified transcript
BCAP29
refseq_BCAP29.F1 refseq_BCAP29.R1 208 287
NCBIGene 36.3 55973
Single exon skipping, size difference: 79
Exclusion in the protein causing a frameshift
Reference transcript: NM_001008405
pfam Bap31 220aa 4e-45
in ref transcript
B-cell receptor-associated protein 31-like. Bap31 is a polytopic integral protein of the endoplasmic reticulum membrane and a substrate of caspase-8. Bap31 is cleaved within its cytosolic domain, generating pro-apoptotic p20 Bap31.
Changed!
COG COG4372 160aa 0.001
in ref transcript
Uncharacterized protein conserved in bacteria with the myosin-like domain [Function unknown].
BCL11A
refseq_BCL11A.F1 refseq_BCL11A.R1 101 119
NCBIGene 36.2 53335
Alternative 3-prime, size difference: 18
Exclusion in the protein (no frameshift)
Reference transcript: NM_138553
BCL11B
refseq_BCL11B.F2 refseq_BCL11B.R2 129 342
NCBIGene 36.3 64919
Single exon skipping, size difference: 213
Exclusion in the protein (no frameshift)
Reference transcript: NM_138576
BCL2L14
refseq_BCL2L14.F2 refseq_BCL2L14.R2 199 352
NCBIGene 36.3 79370
Alternative 3-prime, size difference: 153
Inclusion in the protein causing a new stop codon
Reference transcript: NM_138723
Changed!
cd Bcl-2_like 135aa 2e-04
in ref transcript
Apoptosis regulator proteins of the Bcl-2 family, named after B-cell lymphoma 2. This alignment model spans what have been described as Bcl-2 homology regions BH1, BH2, BH3, and BH4. Many members of this family have an additional C-terminal transmembrane segment. Some homologous proteins, which are not included in this model, may miss either the BH4 (Bax, Bak) or the BH2 (Bcl-X(S)) region, and some appear to only share the BH3 region (Bik, Bim, Bad, Bid, Egl-1). This family is involved in the regulation of the outer mitochondrial membrane's permeability and in promoting or preventing the release of apoptogenic factors, which in turn may trigger apoptosis by activating caspases. Bcl-2 and the closely related Bcl-X(L) are anti-apoptotic key regulators of programmed cell death. They are assumed to function via heterodimeric protein-protein interactions, binding pro-apoptotic proteins such as Bad (BCL2-antagonist of cell death), Bid, and Bim, by specifically interacting with their BH3 regions. Interfering with this heterodimeric interaction via small-molecule inhibitors may prove effective in targeting various cancers. This family also includes the Caenorhabditis elegans Bcl-2 homolog CED-9, which binds to CED-4, the C. Elegans homolog of mammalian Apaf-1. Apaf-1, however, does not seem to be inhibited by Bcl-2 directly.
Changed!
TIGR bcl-2 29aa 0.008
in ref transcript
in artificial membranes at acidic pH, proapoptotic Bcl-2 family proteins (including Bax and Bak) probably induce the mitochondrial permeability transition and cytochrome c release by interacting with permeability transition pores, the most important component for pore fomation of which is VDAC.
BCL7A
refseq_BCL7A.F2 refseq_BCL7A.R2 334 397
NCBIGene 36.3 605
Alternative 5-prime, size difference: 63
Exclusion in the protein (no frameshift)
Reference transcript: NM_020993
pfam BCL_N 49aa 5e-17
in ref transcript
BCL7, N-terminal conserver region. Members of the BCL family have significant sequence similarity at their N-terminus, represented in this family. The function of BCL7 proteins is unknown. They may be involved in early development. In addition, BCL7B is commonly hemizygously deleted in patients with Williams syndrome.
BCR
refseq_BCR.F1 refseq_BCR.R1 138 270
NCBIGene 36.3 613
Single exon skipping, size difference: 132
Exclusion in the protein (no frameshift)
Reference transcript: NM_004327
cd RhoGAP_Bcr 201aa 1e-85
in ref transcript
RhoGAP_Bcr: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of Bcr (breakpoint cluster region protein)-like proteins. Bcr is a multidomain protein with a variety of enzymatic functions. It contains a RhoGAP and a Rho GEF domain, a Ser/Thr kinase domain, an N-terminal oligomerization domain, and a C-terminal PDZ binding domain, in addition to PH and C2 domains. Bcr is a negative regulator of: i) RacGTPase, via the Rho GAP domain, ii) the Ras-Raf-MEK-ERK pathway, via phosphorylation of the Ras binding protein AF-6, and iii) the Wnt signaling pathway through binding beta-catenin. Bcr can form a complex with beta-catenin and Tcf1. The Wnt signaling pathway is involved in cell proliferation, differentiation, and cell renewal. Bcr was discovered as a fusion partner of Abl. The Bcr-Abl fusion is characteristic for a large majority of chronic myelogenous leukemias (CML). Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude.
cd RhoGEF 191aa 3e-29
in ref transcript
Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases; Also called Dbl-homologous (DH) domain. It appears that PH domains invariably occur C-terminal to RhoGEF/DH domains.
cd PH_BCR-related 61aa 6e-26
in ref transcript
BCR (breakpoint cluster region)-related pleckstrin homology (PH) domain. The BCR-related protein has a RhoGEF(DH) domain followed by a PH domain, a C2 domain and a RhoGAP domain. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinases, tyrosine kinases, regulators of G-proteins, endocytotic GTPAses, adaptors, a well as cytoskeletal associated molecules and in lipid associated enzymes.
cd PH_BCR-related 38aa 7e-12
in ref transcript
Changed!
cd C2 94aa 5e-06
in ref transcript
Protein kinase C conserved region 2 (CalB); Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands.
pfam RhoGEF 189aa 5e-53
in ref transcript
RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains.
smart RhoGAP 164aa 3e-42
in ref transcript
GTPase-activator protein for Rho-like GTPases. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases. etter domain limits and outliers.
pfam Bcr-Abl_Oligo 71aa 3e-33
in ref transcript
Bcr-Abl oncoprotein oligomerisation domain. The Bcr-Abl oncoprotein oligomerisation domain consists of a short N-terminal helix (alpha-1), a flexible loop and a long C-terminal helix (alpha-2). Together these form an N-shaped structure, with the loop allowing the two helices to assume a parallel orientation. The monomeric domains associate into a dimer through the formation of an antiparallel coiled coil between the alpha-2 helices and domain swapping of two alpha-1 helices, where one alpha-1 helix swings back and packs against the alpha-2 helix from the second monomer. Two dimers then associate into a tetramer. The oligomerisation domain is essential for the oncogenicity of the Bcr-Abl protein.
Changed!
smart C2 105aa 2e-09
in ref transcript
Protein kinase C conserved region 2 (CalB). Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotamins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates, and intracellular proteins. Unusual occurrence in perforin. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands. SMART detects C2 domains using one or both of two profiles.
Changed!
smart C2 49aa 3e-04
in modified transcript
BDH1
refseq_BDH1.F1 refseq_BDH1.R1 100 252
NCBIGene 36.3 622
Single exon skipping, size difference: 152
Exclusion in 5'UTR
Reference transcript: NM_203314
TIGR 3oxo_ACP_reduc 215aa 3e-18
in ref transcript
This model represents 3-oxoacyl-[ACP] reductase, also called 3-ketoacyl-acyl carrier protein reductase, an enzyme of fatty acid biosynthesis.
PRK PRK06182 263aa 7e-37
in ref transcript
short chain dehydrogenase; Validated.
BICD1
refseq_BICD1.F2 refseq_BICD1.R2 101 405
NCBIGene 36.3 636
Exon skipping and alternative 3-prime or 5-prime, size difference: 304
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_001714
pfam BicD 726aa 0.0
in ref transcript
Microtubule-associated protein Bicaudal-D. BicD proteins consist of three coiled-coiled domains and are involved in dynein-mediated minus end-directed transport from the Golgi apparatus to the endoplasmic reticulum (ER). For full functioning they bind with GSK-3beta pfam05350 to maintain the anchoring of microtubules to the centromere. It appears that amino-acid residues 437-617 of BicD and the kinase activity of GSK-3 are necessary for the formation of a complex between BicD and GSK-3beta in intact cells.
COG Smc 393aa 9e-09
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
BIN1
refseq_BIN1.F1 refseq_BIN1.R1 128 260
NCBIGene 36.3 274
Multiple exon skipping, size difference: 132
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_139343
cd SH3 68aa 5e-08
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
smart BAR 253aa 2e-51
in ref transcript
BAR domain.
smart SH3 70aa 1e-09
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
Changed!
PRK PRK07764 241aa 0.002
in ref transcript
DNA polymerase III subunits gamma and tau; Validated.
Changed!
PRK PRK07764 191aa 2e-04
in modified transcript
BIN1
refseq_BIN1.F3 refseq_BIN1.R3 255 384
NCBIGene 36.3 274
Single exon skipping, size difference: 129
Exclusion in the protein (no frameshift)
Reference transcript: NM_139346
cd SH3 68aa 4e-08
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
smart BAR 222aa 2e-45
in ref transcript
BAR domain.
smart SH3 70aa 1e-09
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
BIN1
refseq_BIN1.F4 refseq_BIN1.R4 167 296
NCBIGene 36.3 274
Single exon skipping, size difference: 129
Exclusion in the protein (no frameshift)
Reference transcript: NM_139343
cd SH3 68aa 5e-08
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
smart BAR 253aa 2e-51
in ref transcript
BAR domain.
smart SH3 70aa 1e-09
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
Changed!
PRK PRK07764 241aa 0.002
in ref transcript
DNA polymerase III subunits gamma and tau; Validated.
Changed!
PRK PRK07764 175aa 1e-05
in modified transcript
BIN1
refseq_BIN1.F6 refseq_BIN1.R6 306 399
NCBIGene 36.3 274
Single exon skipping, size difference: 93
Exclusion in the protein (no frameshift)
Reference transcript: NM_139343
cd SH3 68aa 5e-08
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
Changed!
smart BAR 253aa 2e-51
in ref transcript
BAR domain.
smart SH3 70aa 1e-09
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
PRK PRK07764 241aa 0.002
in ref transcript
DNA polymerase III subunits gamma and tau; Validated.
Changed!
smart BAR 222aa 1e-47
in modified transcript
BIN1
refseq_BIN1.F7 refseq_BIN1.R7 126 171
NCBIGene 36.3 274
Single exon skipping, size difference: 45
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_139343
cd SH3 68aa 5e-08
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
smart BAR 253aa 2e-51
in ref transcript
BAR domain.
smart SH3 70aa 1e-09
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
Changed!
PRK PRK07764 241aa 0.002
in ref transcript
DNA polymerase III subunits gamma and tau; Validated.
Changed!
PRK PRK07764 212aa 0.010
in modified transcript
XAF1
refseq_BIRC4BP.F2 refseq_BIRC4BP.R2 217 274
NCBIGene 36.3 54739
Single exon skipping, size difference: 57
Exclusion in the protein (no frameshift)
Reference transcript: NM_017523
BMF
refseq_BMF.F1 refseq_BMF.R1 324 387
NCBIGene 36.3 90427
Alternative 5-prime, size difference: 63
Exclusion in the protein (no frameshift)
Reference transcript: NM_001003940
BMP1
refseq_BMP1.F1 refseq_BMP1.R1 103 469
NCBIGene 36.3 649
Single exon skipping, size difference: 366
Exclusion of the stop codon
Reference transcript: NM_006128
cd ZnMc_BMP1_TLD 195aa 1e-111
in ref transcript
Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
cd CUB 112aa 1e-31
in ref transcript
CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
cd CUB 110aa 2e-31
in ref transcript
cd CUB 112aa 8e-30
in ref transcript
cd EGF_CA 42aa 2e-05
in ref transcript
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
pfam Astacin 194aa 4e-89
in ref transcript
Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this family have an amino terminal propeptide which is cleaved to give the active protease domain. All other linked domains are found to the carboxyl terminus of this domain. This family includes: Astacin, a digestive enzyme from Crayfish. Meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain. Proteins involved in morphogenesis, and Tolloid from drosophila.
pfam CUB 110aa 6e-46
in ref transcript
CUB domain.
pfam CUB 110aa 4e-41
in ref transcript
pfam CUB 110aa 2e-39
in ref transcript
smart EGF_CA 42aa 2e-06
in ref transcript
Calcium-binding EGF-like domain.
BNIP1
refseq_BNIP1.F1 refseq_BNIP1.R1 130 232
NCBIGene 36.3 662
Single exon skipping, size difference: 102
Exclusion in the protein (no frameshift)
Reference transcript: NM_013979
pfam Sec20 92aa 6e-21
in ref transcript
Sec20. Sec20 is a membrane glycoprotein associated with secretory pathway.
BNIP1
refseq_BNIP1.F3 refseq_BNIP1.R3 119 248
NCBIGene 36.3 662
Single exon skipping, size difference: 129
Exclusion in the protein (no frameshift)
Reference transcript: NM_013979
pfam Sec20 92aa 6e-21
in ref transcript
Sec20. Sec20 is a membrane glycoprotein associated with secretory pathway.
BNIPL
refseq_BNIPL.F1 refseq_BNIPL.R1 125 279
NCBIGene 36.2 149428
Single exon skipping, size difference: 154
Inclusion in the protein causing a new stop codon
Reference transcript: NM_138278
Changed!
cd SEC14 147aa 2e-17
in ref transcript
Sec14p-like lipid-binding domain. Found in secretory proteins, such as S. cerevisiae phosphatidylinositol transfer protein (Sec14p), and in lipid regulated proteins such as RhoGAPs, RhoGEFs and neurofibromin (NF1). SEC14 domain of Dbl is known to associate with G protein beta/gamma subunits.
Changed!
smart SEC14 148aa 2e-14
in ref transcript
Domain in homologues of a S. cerevisiae phosphatidylinositol transfer protein (Sec14p). Domain in homologues of a S. cerevisiae phosphatidylinositol transfer protein (Sec14p) and in RhoGAPs, RhoGEFs and the RasGAP, neurofibromin (NF1). Lipid-binding domain. The SEC14 domain of Dbl is known to associate with G protein beta/gamma subunits.
Changed!
cd SEC14 89aa 0.004
in modified transcript
BOLA3
refseq_BOLA3.F1 refseq_BOLA3.R1 248 337
NCBIGene 36.3 388962
Single exon skipping, size difference: 89
Exclusion in the protein causing a frameshift
Reference transcript: NM_212552
Changed!
pfam BolA 68aa 8e-08
in ref transcript
BolA-like protein. This family consist of the morphoprotein BolA from Escherichia coli and its various homologues. In Escherichia coli over expression of this protein causes round morphology and may be involved in switching the cell between elongation and septation systems during cell division. The expression of BolA is growth rate regulated and is induced during the transition into the the stationary phase. BolA is also induced by stress during early stages of growth and may have a general role in stress response. It has also been suggested that BolA can induce the transcription of penicillin binding proteins 6 and 5.
Histidine phosphatase domain found in phosphoglycerate mutases and related proteins, mostly phosphatases; contains a His residue which is phosphorylated during the reaction. Subgroup of the catalytic domain of a functionally diverse set of proteins, most of which are phosphatases. The conserved catalytic core of this domain contains a His residue which is phosphorylated in the reaction. This subgroup contains cofactor-dependent and cofactor-independent phosphoglycerate mutases (dPGM, and BPGM respectively), fructose-2,6-bisphosphatase (F26BP)ase, Sts-1, SixA, and related proteins. Functions include roles in metabolism, signaling, or regulation, for example, F26BPase affects glycolysis and gluconeogenesis through controlling the concentration of F26BP; BPGM controls the concentration of 2,3-BPG (the main allosteric effector of hemoglobin in human blood cells); human Sts-1 is a T-cell regulator; Escherichia coli Six A participates in the ArcB-dependent His-to-Asp phosphorelay signaling system. Deficiency and mutation in many of the human members result in disease, for example erythrocyte BPGM deficiency is a disease associated with a decrease in the concentration of 2,3-BPG.
cd HP 67aa 2e-13
in ref transcript
Histidine phosphatase domain found in a functionally diverse set of proteins, mostly phosphatases; contains a His residue which is phosphorylated during the reaction. Catalytic domain of a functionally diverse set of proteins, most of which are phosphatases. The conserved catalytic core of this domain contains a His residue which is phosphorylated in the reaction. This set of proteins includes cofactor-dependent and cofactor-independent phosphoglycerate mutases (dPGM, and BPGM respectively), fructose-2,6-bisphosphatase (F26BP)ase, Sts-1, SixA, histidine acid phosphatases, phytases, and related proteins. Functions include roles in metabolism, signaling, or regulation, for example F26BPase affects glycolysis and gluconeogenesis through controlling the concentration of F26BP; BPGM controls the concentration of 2,3-BPG (the main allosteric effector of hemoglobin in human blood cells); human Sts-1 is a T-cell regulator; Escherichia coli Six A participates in the ArcB-dependent His-to-Asp phosphorelay signaling system; phytases scavenge phosphate from extracellular sources. Deficiency and mutation in many of the human members result in disease, for example erythrocyte BPGM deficiency is a disease associated with a decrease in the concentration of 2,3-BPG. Clinical applications include the use of prostatic acid phosphatase (PAP) as a serum marker for prostate cancer. Agricultural applications include the addition of phytases to animal feed.
TIGR pgm_1 249aa 2e-94
in ref transcript
Most members of this family are phosphoglycerate mutase (EC 5.4.2.1). This enzyme interconverts 2-phosphoglycerate and 3-phosphoglycerate. The enzyme is transiently phosphorylated on an active site histidine by 2,3-diphosphoglyerate, which is both substrate and product. Some members of this family have are phosphoglycerate mutase as a minor activity and act primarily as a bisphoglycerate mutase, interconverting 2,3-diphosphoglycerate and 1,3-diphosphoglycerate (EC 5.4.2.4). This model is designated as a subfamily for this reason. The second and third paralogs in S. cerevisiae are somewhat divergent and apparently inactive (see PUBMED:9544241) but are also part of this subfamily phylogenetically.
PTZ PTZ00123 240aa 2e-86
in ref transcript
phosphoglycerate mutase I; Provisional.
BRCC3
refseq_BRCC3.F1 refseq_BRCC3.R1 104 179
NCBIGene 36.3 79184
Single exon skipping, size difference: 75
Exclusion in the protein (no frameshift)
Reference transcript: NM_024332
smart JAB_MPN 166aa 3e-23
in ref transcript
JAB/MPN domain. Domain in Jun kinase activation domain binding protein and proteasomal subunits. Domain at Mpr1p and Pad1p N-termini. Domain of unknown function.
COG COG1310 31aa 0.001
in ref transcript
Predicted metal-dependent protease of the PAD1/JAB1 superfamily [General function prediction only].
BRD8
refseq_BRD8.F2 refseq_BRD8.R2 191 236
NCBIGene 36.3 10902
Single exon skipping, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_006696
cd Bromo_brd8_like 104aa 3e-53
in ref transcript
Bromodomain, brd8_like subgroup. In mammals, brd8 (bromodomain containing 8) interacts with the thyroid hormone receptor in a ligand-dependent fashion and enhances thyroid hormone-dependent activation from thyroid response elements. Brd8 is thought to be a nuclear receptor coactivator. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
smart BROMO 103aa 1e-30
in ref transcript
bromo domain.
COG COG5076 105aa 6e-13
in ref transcript
Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription].
BRD8
refseq_BRD8.F4 refseq_BRD8.R4 153 372
NCBIGene 36.3 10902
Single exon skipping, size difference: 219
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_139199
cd Bromo_brd8_like 104aa 8e-53
in ref transcript
Bromodomain, brd8_like subgroup. In mammals, brd8 (bromodomain containing 8) interacts with the thyroid hormone receptor in a ligand-dependent fashion and enhances thyroid hormone-dependent activation from thyroid response elements. Brd8 is thought to be a nuclear receptor coactivator. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
cd Bromo_brd8_like 104aa 3e-45
in ref transcript
smart BROMO 103aa 5e-30
in ref transcript
bromo domain.
smart BROMO 98aa 1e-26
in ref transcript
COG COG5076 201aa 8e-14
in ref transcript
Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription].
COG COG5076 105aa 8e-13
in ref transcript
BRD8
refseq_BRD8.F5 refseq_BRD8.R5 172 289
NCBIGene 36.3 10902
Alternative 3-prime, size difference: 117
Exclusion in the protein (no frameshift)
Reference transcript: NM_139199
cd Bromo_brd8_like 104aa 8e-53
in ref transcript
Bromodomain, brd8_like subgroup. In mammals, brd8 (bromodomain containing 8) interacts with the thyroid hormone receptor in a ligand-dependent fashion and enhances thyroid hormone-dependent activation from thyroid response elements. Brd8 is thought to be a nuclear receptor coactivator. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
cd Bromo_brd8_like 104aa 3e-45
in ref transcript
smart BROMO 103aa 5e-30
in ref transcript
bromo domain.
smart BROMO 98aa 1e-26
in ref transcript
COG COG5076 201aa 8e-14
in ref transcript
Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription].
Changed!
COG COG5076 105aa 8e-13
in ref transcript
Changed!
COG COG5076 81aa 1e-12
in modified transcript
BRD9
refseq_BRD9.F1 refseq_BRD9.R1 108 168
NCBIGene 36.3 65980
Alternative 3-prime, size difference: 60
Exclusion in the protein (no frameshift)
Reference transcript: NM_023924
cd Bromo_brd7_like 98aa 2e-45
in ref transcript
Bromodomain, brd7_like subgroup. The BRD7 gene encodes a nuclear protein that has been shown to inhibit cell growth and the progression of the cell cycle by regulating cell-cycle genes at the transcriptional level. BRD7 has been identified as a gene involved in nasopharyngeal carcinoma. The protein interacts with acetylated histone H3 via its bromodomain. Bromodomains are 110 amino acid long domains that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Changed!
smart BROMO 104aa 2e-22
in ref transcript
bromo domain.
Changed!
COG COG5076 125aa 2e-11
in ref transcript
Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription].
Changed!
smart BROMO 103aa 1e-22
in modified transcript
Changed!
COG COG5076 99aa 2e-11
in modified transcript
BRE
refseq_BRE.F1 refseq_BRE.R1 112 178
NCBIGene 36.3 9577
Single exon skipping, size difference: 66
Inclusion in 5'UTR
Reference transcript: NM_004899
pfam BRE 333aa 0.0
in ref transcript
Brain and reproductive organ-expressed protein (BRE). This family consists of several eukaryotic brain and reproductive organ-expressed (BRE) proteins. BRE is a putative stress-modulating gene, found able to down-regulate TNF-alpha-induced-NF-kappaB activation upon over expression. A total of six isoforms are produced by alternative splicing predominantly at either end of the gene.Compared to normal cells, immortalised human cell lines uniformly express higher levels of BRE. Peripheral blood monocytes respond to LPS by down-regulating the expression of all the BRE isoforms.It is thought that the function of BRE and its isoforms is to regulate peroxisomal activities.
BRMS1
refseq_BRMS1.F1 refseq_BRMS1.R1 255 337
NCBIGene 36.3 25855
Alternative 3-prime, size difference: 82
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001024957
pfam Sds3 170aa 1e-33
in ref transcript
Sds3-like. Repression of gene transcription is mediated by histone deacetylases containing repressor-co-repressor complexes, which are recruited to promoters of target genes via interactions with sequence-specific transcription factors. The co-repressor complex contains a core of at least seven proteins. This family represents the conserved region found in Sds3, Dep1 and BRMS1-homologue p40 proteins.
BRPF1
refseq_BRPF1.F2 refseq_BRPF1.R2 102 120
NCBIGene 36.3 7862
Alternative 5-prime, size difference: 18
Exclusion in the protein (no frameshift)
Reference transcript: NM_001003694
Changed!
cd Bromo_brd1_like 104aa 2e-50
in ref transcript
Bromodomain; brd1_like subfamily. BRD1 is a mammalian gene which encodes for a nuclear protein assumed to be a transcriptional regulator. BRD1 has been implicated with brain development and susceptibility to schizophrenia and bipolar affective disorder. Bromodomains are 110 amino acid long domains that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
cd BR140_related 116aa 2e-50
in ref transcript
The PWWP domain is found in the BR140 family, which includes peregrin and BR140-like proteins 1 and 2. BR140 is the only family to contain the PWWP domain at the C terminus, with PHD and bromo domains in the N-terminal region. In myeloid leukemias, BR140 is disrupted by chromosomal translocations, similar to translocations of WHSC1 in lymphoid multiple myeloma. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding proteins, that function as transcription factors regulating a variety of developmental processes.
Changed!
smart BROMO 113aa 9e-25
in ref transcript
bromo domain.
pfam EPL1 151aa 2e-15
in ref transcript
Enhancer of polycomb-like. This is a family of EPL1 (Enhancer of polycomb-like) proteins. The EPL1 protein is a member of a histone acetyltransferase complex which is involved in transcriptional activation of selected genes.
smart PWWP 84aa 5e-15
in ref transcript
domain with conserved PWWP motif. conservation of Pro-Trp-Trp-Pro residues.
smart PHD 46aa 3e-07
in ref transcript
PHD zinc finger. The plant homeodomain (PHD) finger is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in epigenetics and chromatin-mediated transcriptional regulation. The PHD finger binds two zinc ions using the so-called 'cross-brace' motif and is thus structurally related to the RI NG finger and the FYV E finger. It is not yet known if PHD fingers have a common molecular function. Several reports suggest that it can function as a protein-protein interacton domain and it was recently demonstrated that the PHD finger of p300 can cooperate with the adjacent BR OMO domain in nucleosome binding in vitro. Other reports suggesting that the PHD finger is a ubiquitin ligase have been refuted as these domains were RI NG fingers misidentified as PHD fingers.
COG COG5141 435aa 5e-72
in ref transcript
PHD zinc finger-containing protein [General function prediction only].
Changed!
COG COG5076 129aa 2e-09
in ref transcript
Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription].
Changed!
cd Bromo_brd1_like 98aa 2e-52
in modified transcript
Changed!
smart BROMO 107aa 1e-26
in modified transcript
Changed!
COG COG5076 123aa 4e-11
in modified transcript
BSG
refseq_BSG.F1 refseq_BSG.R1 100 448
NCBIGene 36.3 682
Single exon skipping, size difference: 348
Exclusion in the protein (no frameshift)
Reference transcript: NM_001728
Changed!
cd IGcam 89aa 3e-07
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IGcam 98aa 5e-04
in ref transcript
smart IG_like 92aa 7e-09
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
Changed!
smart IG_like 85aa 3e-07
in ref transcript
BSG
refseq_BSG.F3 refseq_BSG.R3 238 395
NCBIGene 36.3 682
Single exon skipping, size difference: 157
Inclusion in the protein causing a new stop codon
Reference transcript: NM_198591
Changed!
cd IGcam 98aa 2e-04
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
Changed!
smart IG_like 92aa 1e-09
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
BTBD1
refseq_BTBD1.F2 refseq_BTBD1.R2 250 338
NCBIGene 36.3 53339
Single exon skipping, size difference: 88
Exclusion in the protein causing a frameshift
Reference transcript: NM_025238
Changed!
pfam PHR 150aa 1e-49
in ref transcript
PHR domain. This domain is called PHR as it was original found in the proteins PAM, highwire and RPM. This domain can be duplicated in the highwire, PFAM and PRM sequence. The C-terminal region of the protein BTBD1 includes the PHR domain and is known to interact with Topoisomerase I, an enzyme which relaxes DNA supercoils.
pfam BTB 114aa 1e-22
in ref transcript
BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of pox virus proteins. The BTB/POZ domain mediates homomeric dimerisation and in some instances heteromeric dimerisation. The structure of the dimerised PLZF BTB/POZ domain has been solved and consists of a tightly intertwined homodimer. The central scaffolding of the protein is made up of a cluster of alpha-helices flanked by short beta-sheets at both the top and bottom of the molecule. POZ domains from several zinc finger proteins have been shown to mediate transcriptional repression and to interact with components of histone deacetylase co-repressor complexes including N-CoR and SMRT. The POZ or BTB domain is also known as BR-C/Ttk or ZiN.
smart BACK 100aa 9e-12
in ref transcript
BTB And C-terminal Kelch. The BACK domain is found juxtaposed to the BTB domain; they are separated by as little as two residues.
BTF3
refseq_BTF3.F2 refseq_BTF3.R2 113 354
NCBIGene 36.3 689
Alternative 5-prime, size difference: 241
Exclusion of the protein initiation site
Reference transcript: NM_001037637
pfam NAC 60aa 1e-18
in ref transcript
NAC domain.
BTN2A1
refseq_BTN2A1.F1 refseq_BTN2A1.R1 168 397
NCBIGene 36.3 11120
Alternative 3-prime, size difference: 229
Inclusion in the protein causing a new stop codon
Reference transcript: NM_007049
Changed!
smart SPRY 119aa 5e-29
in ref transcript
Domain in SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
Changed!
smart PRY 55aa 4e-19
in ref transcript
associated with SPRY domains.
pfam V-set 97aa 2e-10
in ref transcript
Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
pfam C2-set_2 83aa 0.003
in ref transcript
CD80-like C2-set immunoglobulin domain. These domains belong to the immunoglobulin superfamily.
BTN3A3
refseq_BTN3A3.F1 refseq_BTN3A3.R1 226 316
NCBIGene 36.3 10384
Single exon skipping, size difference: 90
Exclusion of the protein initiation site
Reference transcript: NM_006994
smart SPRY 123aa 9e-35
in ref transcript
Domain in SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
smart PRY 52aa 2e-17
in ref transcript
associated with SPRY domains.
Changed!
pfam V-set 105aa 1e-09
in ref transcript
Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
Changed!
pfam V-set 91aa 5e-07
in modified transcript
BTNL3
refseq_BTNL3.F1 refseq_BTNL3.R1 119 227
NCBIGene 36.2 10917
Alternative 5-prime, size difference: 108
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_197975
smart SPRY 124aa 5e-25
in ref transcript
Domain in SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
smart PRY 50aa 2e-11
in ref transcript
associated with SPRY domains.
pfam V-set 97aa 2e-06
in ref transcript
Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
BTNL3
refseq_BTNL3.F3 refseq_BTNL3.R3 108 318
NCBIGene 36.2 10917
Alternative 5-prime and 3-prime, size difference: 210
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_197975
smart SPRY 124aa 5e-25
in ref transcript
Domain in SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
smart PRY 50aa 2e-11
in ref transcript
associated with SPRY domains.
pfam V-set 97aa 2e-06
in ref transcript
Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
BTRC
refseq_BTRC.F1 refseq_BTRC.R1 284 392
NCBIGene 36.3 8945
Single exon skipping, size difference: 108
Exclusion in the protein (no frameshift)
Reference transcript: NM_033637
cd WD40 279aa 2e-60
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
smart WD40 38aa 7e-06
in ref transcript
WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
smart WD40 37aa 2e-05
in ref transcript
smart FBOX 39aa 1e-04
in ref transcript
A Receptor for Ubiquitination Targets.
pfam WD40 36aa 2e-04
in ref transcript
WD domain, G-beta repeat.
smart WD40 38aa 3e-04
in ref transcript
COG COG2319 311aa 8e-30
in ref transcript
FOG: WD40 repeat [General function prediction only].
Changed!
smart WD40 38aa 0.009
in modified transcript
Changed!
smart WD40 35aa 0.010
in modified transcript
C10orf128
refseq_C10orf128.F1 refseq_C10orf128.R1 106 184
NCBIGene 36.2 170371
Alternative 3-prime, size difference: 78
Exclusion in the protein (no frameshift)
Reference transcript: XM_926220
C10orf128
refseq_C10orf128.F1 refseq_C10orf128.R3 206 285
NCBIGene 36.2 170371
Single exon skipping, size difference: 79
Exclusion in the protein causing a frameshift
Reference transcript: XM_931097
C10orf61
refseq_C10orf61.F2 refseq_C10orf61.R2 116 197
NCBIGene 36.2 26123
Single exon skipping, size difference: 81
Exclusion in the protein (no frameshift)
Reference transcript: NM_001013840
Changed!
pfam DUF1619 301aa 8e-86
in ref transcript
Protein of unknown function (DUF1619). This is a family of sequences derived from hypothetical eukaryotic proteins. The region in question is approximately 330 residues long and has a cysteine rich amino-terminus.
Changed!
pfam DUF1619 274aa 2e-73
in modified transcript
C13orf23
refseq_C13orf23.F2 refseq_C13orf23.R2 216 282
NCBIGene 36.3 80209
Single exon skipping, size difference: 66
Exclusion in the protein (no frameshift)
Reference transcript: NM_025138
C14orf102
refseq_C14orf102.F2 refseq_C14orf102.R2 100 443
NCBIGene 36.3 55051
Multiple exon skipping, size difference: 343
Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
Reference transcript: NM_017970
Changed!
pfam DUF1740 238aa 8e-74
in ref transcript
Protein of unknown function (DUF1740). This is a family of eukaryotic proteins of unknown function.
INF2
refseq_C14orf173.F1 refseq_C14orf173.R1 295 352
NCBIGene 36.3 64423
Single exon skipping, size difference: 57
Exclusion of the stop codon
Reference transcript: NM_022489
pfam FH2 372aa 2e-66
in ref transcript
Formin Homology 2 Domain.
pfam Drf_FH3 67aa 2e-09
in ref transcript
Diaphanous FH3 Domain. This region is found in the Formin-like and and diaphanous proteins.
pfam Drf_GBD 60aa 6e-08
in ref transcript
Diaphanous GTPase-binding Domain. This domain is bound to by GTP-attached Rho proteins, leading to activation of the Drf protein.
ABHD12B
refseq_C14orf29.F1 refseq_C14orf29.R1 144 247
NCBIGene 36.3 145447
Single exon skipping, size difference: 103
Inclusion in the protein causing a new stop codon
Reference transcript: NM_181814
Changed!
TIGR hydr2_PEP 125aa 1e-09
in ref transcript
This group of proteins are members of the alpha/beta hydrolase superfamily. These proteins are generally found in genomes containing the exosortase/PEP-CTERM protein expoert system, specifically the type 1 variant of this system described by the Genome Property GenProp0652. When found in this context they are invariably present in the vicinity of a second, relatively unrelated enzyme (ortholog 1, TIGR03100) of the same superfamily.
Changed!
pfam Peptidase_S9 173aa 2e-04
in ref transcript
Prolyl oligopeptidase family.
Changed!
COG COG1073 222aa 2e-15
in ref transcript
Hydrolases of the alpha/beta superfamily [General function prediction only].
C15orf21
refseq_C15orf21.F1 refseq_C15orf21.R1 200 305
NCBIGene 36.3 283651
Exon skipping and alternative 3-prime or 5-prime, size difference: 105
Exclusion of the protein initiation site, Exclusion in the protein causing a frameshift
Reference transcript: NM_001005266
C15orf21
refseq_C15orf21.F4 refseq_C15orf21.R4 148 268
NCBIGene 36.3 283651
Single exon skipping, size difference: 120
Exclusion in 5'UTR
Reference transcript: NM_001005266
C15orf21
refseq_C15orf21.F4 refseq_C15orf21.R5 104 349
NCBIGene 36.3 283651
Multiple exon skipping, size difference: 245
Exclusion in 5'UTR, Exclusion in 5'UTR
Reference transcript: NM_001005266
C15orf21
refseq_C15orf21.F6 refseq_C15orf21.R7 194 271
NCBIGene 36.3 283651
Single exon skipping, size difference: 77
Inclusion in 5'UTR
Reference transcript: NM_001005267
C17orf58
refseq_C17orf58.F1 refseq_C17orf58.R1 227 370
NCBIGene 36.3 284018
Alternative 5-prime, size difference: 143
Inclusion in the protein causing a new stop codon
Reference transcript: NM_181655
Changed!
cd NTR_PCOLCE 52aa 3e-07
in ref transcript
NTR domain, PCOLCE subfamily; Procollagen C-endopeptidase enhancers (PCOLCEs) are extracellular matrix proteins that enhance the activity of procollagen C-proteases, by binding to the procollagen I C-peptide. They contain a C-terminal NTR domain, which have been suggested to possess inhibitory functions towards specific serine proteases but not towards metzincins, which are inhibited by the related TIMPs.
C18orf1
refseq_C18orf1.F1 refseq_C18orf1.R1 256 310
NCBIGene 36.3 753
Single exon skipping, size difference: 54
Exclusion in the protein (no frameshift)
Reference transcript: NM_181481
cd LDLa 32aa 2e-05
in ref transcript
Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure.
pfam Ldl_recept_a 32aa 1e-05
in ref transcript
Low-density lipoprotein receptor domain class A.
C18orf24
refseq_C18orf24.F1 refseq_C18orf24.R1 143 188
NCBIGene 36.3 220134
Alternative 5-prime, size difference: 45
Exclusion in 5'UTR
Reference transcript: NM_001039535
pfam DUF1395 235aa 9e-68
in ref transcript
Protein of unknown function (DUF1395). This family consists of several hypothetical eukaryotic proteins of around 250 residues in length. The function of this family is unknown.
C19orf12
refseq_C19orf12.F2 refseq_C19orf12.R2 175 345
NCBIGene 36.2 83636
Single exon skipping, size difference: 170
Exclusion of the protein initiation site
Reference transcript: NM_001031726
C19orf2
refseq_C19orf2.F1 refseq_C19orf2.R1 107 142
NCBIGene 36.3 8725
Single exon skipping, size difference: 35
Exclusion in the protein causing a frameshift
Reference transcript: NM_003796
Changed!
cd Prefoldin_alpha 103aa 2e-08
in ref transcript
Prefoldin alpha subunit; Prefoldin is a hexameric molecular chaperone complex, found in both eukaryotes and archaea, that binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The complex contains two alpha and four beta subunits, the two subunits being evolutionarily related. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the structure of the hexamer. The structure of the complex consists of a double beta barrel assembly with six protruding coiled-coils.
Changed!
pfam Prefoldin 110aa 1e-09
in ref transcript
Prefoldin subunit. This family comprises of several prefoldin subunits. The biogenesis of the cytoskeletal proteins actin and tubulin involves interaction of nascent chains of each of the two proteins with the oligomeric protein prefoldin (PFD) and their subsequent transfer to the cytosolic chaperonin CCT (chaperonin containing TCP-1). Electron microscopy shows that eukaryotic PFD, which has a similar structure to its archaeal counterpart, interacts with unfolded actin along the tips of its projecting arms. In its PFD-bound state, actin seems to acquire a conformation similar to that adopted when it is bound to CCT.
Changed!
PRK PRK03947 93aa 0.001
in ref transcript
prefoldin subunit alpha; Reviewed.
C19orf36
refseq_C19orf36.F1 refseq_C19orf36.R1 178 232
NCBIGene 36.3 113177
Single exon skipping, size difference: 54
Exclusion in the protein (no frameshift)
Reference transcript: NM_001039846
C19orf48
refseq_C19orf48.F1 refseq_C19orf48.R1 162 323
NCBIGene 36.3 84798
Single exon skipping, size difference: 161
Exclusion in 5'UTR
Reference transcript: NM_199249
C19orf6
refseq_C19orf6.F1 refseq_C19orf6.R1 308 408
NCBIGene 36.3 91304
Single exon skipping, size difference: 100
Exclusion in the protein causing a frameshift
Reference transcript: NM_001033026
pfam Membralin 370aa 1e-161
in ref transcript
Tumour-associated protein. Membralin is evolutionarily highly conserved; though it seems to represent a unique protein family. The protein appears to contain several transmembrane regions. In humans it is expressed in certain cancers, particularly ovarian cancers.
C1GALT1C1
refseq_C1GALT1C1.F1 refseq_C1GALT1C1.R1 185 357
NCBIGene 36.3 29071
Single exon skipping, size difference: 172
Exclusion in 5'UTR
Reference transcript: NM_152692
CAPRIN2
refseq_C1QDC1.F1 refseq_C1QDC1.R1 161 244
NCBIGene 36.3 65981
Single exon skipping, size difference: 83
Exclusion in the protein causing a frameshift
Reference transcript: NM_001002259
Changed!
pfam C1q 126aa 8e-34
in ref transcript
C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system.
Changed!
pfam Herpes_BLLF1 204aa 0.009
in ref transcript
Herpes virus major outer envelope glycoprotein (BLLF1). This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
CAPRIN2
refseq_C1QDC1.F3 refseq_C1QDC1.R3 205 352
NCBIGene 36.3 65981
Single exon skipping, size difference: 147
Exclusion in the protein (no frameshift)
Reference transcript: NM_001002259
pfam C1q 126aa 8e-34
in ref transcript
C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system.
Changed!
pfam Herpes_BLLF1 204aa 0.009
in ref transcript
Herpes virus major outer envelope glycoprotein (BLLF1). This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
C1QTNF1
refseq_C1QTNF1.F1 refseq_C1QTNF1.R1 231 400
NCBIGene 36.3 114897
Single exon skipping, size difference: 169
Exclusion in the protein causing a frameshift
Reference transcript: NM_030968
Changed!
pfam C1q 129aa 1e-36
in ref transcript
C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system.
Changed!
pfam Collagen 42aa 1e-05
in ref transcript
Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxyproline. Collagens are post translationally modified by proline hydroxylase to form the hydroxyproline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure.
C1QTNF3
refseq_C1QTNF3.F2 refseq_C1QTNF3.R2 131 350
NCBIGene 36.3 114899
Alternative 5-prime, size difference: 219
Exclusion in the protein (no frameshift)
Reference transcript: NM_181435
pfam C1q 124aa 2e-35
in ref transcript
C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system.
C1S
refseq_C1S.F1 refseq_C1S.R1 112 199
NCBIGene 36.3 716
Alternative 3-prime, size difference: 87
Inclusion in 5'UTR
Reference transcript: NM_001734
cd Tryp_SPc 241aa 2e-53
in ref transcript
Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.
cd CUB 112aa 1e-20
in ref transcript
CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
cd CUB 113aa 9e-18
in ref transcript
cd CCP 62aa 1e-04
in ref transcript
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
cd EGF_CA 35aa 0.005
in ref transcript
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
smart Tryp_SPc 239aa 3e-57
in ref transcript
Trypsin-like serine protease. Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.
pfam CUB 113aa 1e-24
in ref transcript
CUB domain.
smart CUB 107aa 7e-22
in ref transcript
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein. This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.
pfam Sushi 61aa 6e-06
in ref transcript
Sushi domain (SCR repeat).
smart EGF_CA 41aa 5e-04
in ref transcript
Calcium-binding EGF-like domain.
pfam Sushi 63aa 0.008
in ref transcript
COG COG5640 251aa 1e-16
in ref transcript
Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones].
C1orf102
refseq_C1orf102.F1 refseq_C1orf102.R1 166 196
NCBIGene 36.3 127700
Single exon skipping, size difference: 30
Exclusion in the protein (no frameshift)
Reference transcript: NM_145047
Changed!
pfam Oscp1 184aa 7e-63
in ref transcript
Organic solute transport protein 1. Oscp1 is a family of proteins conserved from plants to humans. It is called organic solute transport protein or oxido-red- nitro domain-containing protein 1, however no reference could be find to confirm the function of the protein.
Changed!
pfam Oscp1 174aa 4e-65
in modified transcript
C1orf125
refseq_C1orf125.F1 refseq_C1orf125.R1 260 482
NCBIGene 36.3 126859
Multiple exon skipping, size difference: 222
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_144696
pfam Ax_dynein_light 109aa 1e-04
in ref transcript
Axonemal dynein light chain. Axonemal dynein light chain proteins play a dynamic role in flagellar and cilia motility. Eukaryotic cilia and flagella are complex organelles consisting of a core structure, the axoneme, which is composed of nine microtubule doublets forming a cylinder that surrounds a pair of central singlet microtubules. This ultra-structural arrangement seems to be one of the most stable micro-tubular assemblies known and is responsible for the flagellar and ciliary movement of a large number of organisms ranging from protozoan to mammals. This light chain interacts directly with the N-terminal half of the heavy chains.
C1orf178
refseq_C1orf178.F1 refseq_C1orf178.R1 180 302
NCBIGene 36.2 440603
Single exon skipping, size difference: 122
Exclusion in the protein causing a frameshift
Reference transcript: NM_001010922
C1orf178
refseq_C1orf178.F3 refseq_C1orf178.R3 102 327
NCBIGene 36.2 440603
Single exon skipping, size difference: 225
Exclusion in the protein (no frameshift)
Reference transcript: NM_001010922
C1orf2
refseq_C1orf2.F1 refseq_C1orf2.R1 112 400
NCBIGene 36.3 10712
Multiple exon skipping, size difference: 288
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_006589
C1orf34
refseq_C1orf34.F1 refseq_C1orf34.R1 130 235
NCBIGene 36.2 22996
Alternative 3-prime, size difference: 105
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: XM_930328
Changed!
pfam Deme6 441aa 1e-157
in ref transcript
Breast cancer protein. This is a family of proteins conserved from fungi to humans, which, in humans is expressed in primary breast carcinomas but not in normal breast tissue. There appears to be a putative eukaryotic RNP-1 motif and a candidate anchoring transmembrane domain. Deme6 is coordinately regulated with oestrogen receptor, but is not necessarily oestradiol-responsive. Members of this family also carry a TPR_2 domain pfam07719 at their C-terminus.
Changed!
pfam Deme6 476aa 1e-155
in modified transcript
C1orf43
refseq_C1orf43.F2 refseq_C1orf43.R2 253 307
NCBIGene 36.3 25912
Single exon skipping, size difference: 54
Exclusion in the protein (no frameshift)
Reference transcript: NM_001098616
Changed!
pfam NICE-3 189aa 1e-85
in ref transcript
NICE-3 protein. This family consists of several eukaryotic NICE-3 and related proteins. The gene coding for NICE-3 is part of the epidermal differentiation complex (EDC) which comprises a large number of genes that are of crucial importance for the maturation of the human epidermis. The function of NICE-3 is unknown.
Changed!
pfam NICE-3 171aa 4e-72
in modified transcript
C1orf9
refseq_C1orf9.F2 refseq_C1orf9.R2 310 421
NCBIGene 36.3 51430
Single exon skipping, size difference: 111
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_016227
pfam Sad1_UNC 123aa 6e-40
in ref transcript
Sad1 / UNC-like C-terminal. The Caenorhabditis elegans UNC-84 protein is a nuclear envelope protein that is involved in nuclear anchoring and migration during development. The S. pombe Sad1 protein localises at the spindle pole body. UNC-84 and and Sad1 share a common C-terminal region, that is often termed the SUN (Sad1 and UNC) domain. In mammals, the SUN domain is present in two proteins, Sun1 and Sun2. The SUN domain of Sun2 has been demonstrated to be in the periplasm.
pfam SMC_N 159aa 0.008
in ref transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
C1orf9
refseq_C1orf9.F3 refseq_C1orf9.R3 108 129
NCBIGene 36.3 51430
Single exon skipping, size difference: 21
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_016227
Changed!
pfam Sad1_UNC 123aa 6e-40
in ref transcript
Sad1 / UNC-like C-terminal. The Caenorhabditis elegans UNC-84 protein is a nuclear envelope protein that is involved in nuclear anchoring and migration during development. The S. pombe Sad1 protein localises at the spindle pole body. UNC-84 and and Sad1 share a common C-terminal region, that is often termed the SUN (Sad1 and UNC) domain. In mammals, the SUN domain is present in two proteins, Sun1 and Sun2. The SUN domain of Sun2 has been demonstrated to be in the periplasm.
pfam SMC_N 159aa 0.008
in ref transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Changed!
pfam Sad1_UNC 130aa 9e-40
in modified transcript
C20orf121
refseq_C20orf121.F1 refseq_C20orf121.R1 103 127
NCBIGene 36.3 79183
Single exon skipping, size difference: 24
Exclusion in 5'UTR
Reference transcript: NM_024331
cd SEC14 157aa 2e-31
in ref transcript
Sec14p-like lipid-binding domain. Found in secretory proteins, such as S. cerevisiae phosphatidylinositol transfer protein (Sec14p), and in lipid regulated proteins such as RhoGAPs, RhoGEFs and neurofibromin (NF1). SEC14 domain of Dbl is known to associate with G protein beta/gamma subunits.
pfam CRAL_TRIO 116aa 2e-25
in ref transcript
CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14.
pfam CRAL_TRIO_N 67aa 2e-07
in ref transcript
CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650.
C20orf132
refseq_C20orf132.F1 refseq_C20orf132.R1 210 315
NCBIGene 36.3 140699
Single exon skipping, size difference: 105
Exclusion in the protein (no frameshift)
Reference transcript: NM_152503
MACROD2
refseq_C20orf133.F2 refseq_C20orf133.R2 218 287
NCBIGene 36.3 140733
Single exon skipping, size difference: 69
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_080676
cd Macro_Appr_pase_like 167aa 3e-69
in ref transcript
Macro domain, Appr-1"-pase_like family. The macro domain is a high-affinity ADP-ribose binding module found in a variety of proteins as a stand-alone domain or in combination with other domains like in histone macroH2A and some PARPs (poly ADP-ribose polymerases). Some macro domains recognize poly ADP-ribose as a ligand. Previously identified as displaying an Appr-1"-p (ADP-ribose-1"-monophosphate) processing activity, the macro domain may play roles in distinct ADP-ribose pathways, such as the ADP-ribosylation of proteins, an important post-translational modification which occurs in DNA repair, transcription, chromatin biology, and long-term memory formation, among other processes. This family is composed of uncharacterized proteins that show similarity to Appr-1"-pase, containing conserved putative active site residues. Appr-1"-pase is a phosphatase specific for ADP-ribose-1"-monophosphate.
pfam Macro 114aa 5e-42
in ref transcript
Macro domain. This domain is an ADP-ribose binding module. It is found in a number of otherwise unrelated proteins. It is found at the C-terminus of the macro-H2A histone protein. This domain is found in the non-structural proteins of several types of ssRNA viruses such as NSP3 from alphaviruses. This domain is also found on its own in a family of proteins from bacteria, archaebacteria, and eukaryotes.
Changed!
pfam UPF0560 113aa 4e-04
in ref transcript
Uncharacterised protein family UPF0560. This family of proteins has no known function.
PRK PRK00431 164aa 1e-55
in ref transcript
hypothetical protein; Provisional.
Changed!
pfam UPF0560 125aa 2e-04
in modified transcript
Changed!
COG MDN1 169aa 0.007
in modified transcript
AAA ATPase containing von Willebrand factor type A (vWA) domain [General function prediction only].
BANF2
refseq_C20orf179.F2 refseq_C20orf179.R2 202 365
NCBIGene 36.3 140836
Single exon skipping, size difference: 163
Exclusion in 5'UTR
Reference transcript: NM_178477
pfam BAF 89aa 2e-31
in ref transcript
Barrier to autointegration factor. The BAF protein has a SAM-domain-like bundle of orthogonally packed alpha-hairpins - one classic and one pseudo helix-hairpin-helix motif. The protein is involved in the prevention of retroviral DNA integration.
C20orf24
refseq_C20orf24.F1 refseq_C20orf24.R1 105 235
NCBIGene 36.3 55969
Single exon skipping, size difference: 130
Inclusion in the protein causing a frameshift
Reference transcript: NM_199483
Changed!
pfam Rab5ip 64aa 2e-23
in ref transcript
Rab5-interacting protein (Rab5ip). This family consists of several Rab5-interacting protein (RIP5 or Rab5ip ) sequences. The ras-related GTPase rab5 is rate-limiting for homotypic early endosome fusion. Rab5ip represents a novel rab5 interacting protein that may function on endocytic vesicles as a receptor for rab5-GDP and participate in the activation of rab5.
Changed!
pfam Rab5ip 114aa 2e-49
in modified transcript
C20orf30
refseq_C20orf30.F2 refseq_C20orf30.R2 127 220
NCBIGene 36.3 29058
Exon skipping and alternative 3-prime or 5-prime, size difference: 93
Inclusion in 5'UTR, Exclusion in 5'UTR
Reference transcript: NM_001009924
pfam DUF872 96aa 4e-15
in ref transcript
Eukaryotic protein of unknown function (DUF872). This family consists of several uncharacterised eukaryotic proteins. The function of this family is unknown.
UQCC
refseq_C20orf44.F1 refseq_C20orf44.R1 111 192
NCBIGene 36.3 55245
Exon skipping and alternative 3-prime or 5-prime, size difference: 81
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_018244
Changed!
pfam Ubiq_cyt_C_chap 149aa 3e-64
in ref transcript
Ubiquinol-cytochrome C chaperone.
Changed!
COG COG5452 170aa 2e-11
in ref transcript
Uncharacterized conserved protein [Function unknown].
Changed!
pfam Ubiq_cyt_C_chap 148aa 2e-64
in modified transcript
Changed!
COG COG5452 138aa 3e-09
in modified transcript
C20orf7
refseq_C20orf7.F1 refseq_C20orf7.R1 133 217
NCBIGene 36.3 79133
Single exon skipping, size difference: 104
Exclusion in the protein causing a frameshift
Reference transcript: NM_024120
Changed!
cd AdoMet_MTases 92aa 7e-09
in ref transcript
S-adenosylmethionine-dependent methyltransferases (SAM or AdoMet-MTase), class I; AdoMet-MTases are enzymes that use S-adenosyl-L-methionine (SAM or AdoMet) as a substrate for methyltransfer, creating the product S-adenosyl-L-homocysteine (AdoHcy). There are at least five structurally distinct families of AdoMet-MTases, class I being the largest and most diverse. Within this class enzymes can be classified by different substrate specificities (small molecules, lipids, nucleic acids, etc.) and different target atoms for methylation (nitrogen, oxygen, carbon, sulfur, etc.).
Changed!
TIGR BioC 237aa 1e-27
in ref transcript
This enzyme, which is found in biotin biosynthetic gene clusters in proteobacteria, firmicutes, green-sulfur bacteria, fusobacterium and bacteroides, is believed to carry out an enzymatic step prior to the formation of pimeloyl-CoA (although attribution of this annotation is not traceable). The enzyme appears related to methyltransferases by homology.
Changed!
COG UbiE 105aa 3e-12
in ref transcript
Methylase involved in ubiquinone/menaquinone biosynthesis [Coenzyme metabolism].
Changed!
PRK PRK10258 217aa 8e-08
in ref transcript
biotin biosynthesis protein BioC; Provisional.
C20orf7
refseq_C20orf7.F2 refseq_C20orf7.R2 285 369
NCBIGene 36.3 79133
Single exon skipping, size difference: 104
Exclusion in the protein causing a frameshift
Reference transcript: NM_024120
Changed!
cd AdoMet_MTases 92aa 7e-09
in ref transcript
S-adenosylmethionine-dependent methyltransferases (SAM or AdoMet-MTase), class I; AdoMet-MTases are enzymes that use S-adenosyl-L-methionine (SAM or AdoMet) as a substrate for methyltransfer, creating the product S-adenosyl-L-homocysteine (AdoHcy). There are at least five structurally distinct families of AdoMet-MTases, class I being the largest and most diverse. Within this class enzymes can be classified by different substrate specificities (small molecules, lipids, nucleic acids, etc.) and different target atoms for methylation (nitrogen, oxygen, carbon, sulfur, etc.).
Changed!
TIGR BioC 237aa 1e-27
in ref transcript
This enzyme, which is found in biotin biosynthetic gene clusters in proteobacteria, firmicutes, green-sulfur bacteria, fusobacterium and bacteroides, is believed to carry out an enzymatic step prior to the formation of pimeloyl-CoA (although attribution of this annotation is not traceable). The enzyme appears related to methyltransferases by homology.
Changed!
COG UbiE 105aa 3e-12
in ref transcript
Methylase involved in ubiquinone/menaquinone biosynthesis [Coenzyme metabolism].
Changed!
PRK PRK10258 217aa 8e-08
in ref transcript
biotin biosynthesis protein BioC; Provisional.
C20orf7
refseq_C20orf7.F3 refseq_C20orf7.R3 214 298
NCBIGene 36.3 79133
Single exon skipping, size difference: 104
Exclusion in the protein causing a frameshift
Reference transcript: NM_024120
Changed!
cd AdoMet_MTases 92aa 7e-09
in ref transcript
S-adenosylmethionine-dependent methyltransferases (SAM or AdoMet-MTase), class I; AdoMet-MTases are enzymes that use S-adenosyl-L-methionine (SAM or AdoMet) as a substrate for methyltransfer, creating the product S-adenosyl-L-homocysteine (AdoHcy). There are at least five structurally distinct families of AdoMet-MTases, class I being the largest and most diverse. Within this class enzymes can be classified by different substrate specificities (small molecules, lipids, nucleic acids, etc.) and different target atoms for methylation (nitrogen, oxygen, carbon, sulfur, etc.).
Changed!
TIGR BioC 237aa 1e-27
in ref transcript
This enzyme, which is found in biotin biosynthetic gene clusters in proteobacteria, firmicutes, green-sulfur bacteria, fusobacterium and bacteroides, is believed to carry out an enzymatic step prior to the formation of pimeloyl-CoA (although attribution of this annotation is not traceable). The enzyme appears related to methyltransferases by homology.
Changed!
COG UbiE 105aa 3e-12
in ref transcript
Methylase involved in ubiquinone/menaquinone biosynthesis [Coenzyme metabolism].
Changed!
PRK PRK10258 217aa 8e-08
in ref transcript
biotin biosynthesis protein BioC; Provisional.
C21orf57
refseq_C21orf57.F1 refseq_C21orf57.R1 105 427
NCBIGene 36.3 54059
Alternative 5-prime, size difference: 322
Exclusion in 5'UTR
Reference transcript: NM_058181
TIGR TIGR00043 115aa 8e-36
in ref transcript
This uncharacterized protein family is represented by a single member sequence only in nearly every bacterium.
PRK PRK00016 117aa 5e-24
in ref transcript
putative metalloprotease; Provisional.
C21orf58
refseq_C21orf58.F1 refseq_C21orf58.R1 203 372
NCBIGene 36.2 54058
Alternative 3-prime, size difference: 169
Inclusion in the protein causing a new stop codon
Reference transcript: NM_058180
C21orf58
refseq_C21orf58.F3 refseq_C21orf58.R3 176 256
NCBIGene 36.2 54058
Alternative 3-prime, size difference: 80
Inclusion in the protein causing a new stop codon
Reference transcript: NM_058180
C21orf66
refseq_C21orf66.F1 refseq_C21orf66.R1 96 113
NCBIGene 36.2 94104
Alternative 5-prime, size difference: 17
Inclusion in the protein causing a frameshift
Reference transcript: NM_016631
Changed!
pfam GCFC 501aa 1e-172
in ref transcript
GC-rich sequence DNA-binding factor-like protein. Sequences found in this family are similar to a region of a human GC-rich sequence DNA-binding factor homolog. This is thought to be a protein involved in transcriptional regulation due to partial homologies to a transcription repressor and histone-interacting protein.
Changed!
pfam GCFC 106aa 4e-38
in modified transcript
C2orf28
refseq_C2orf28.F1 refseq_C2orf28.R1 201 341
NCBIGene 36.3 51374
Alternative 5-prime, size difference: 140
Inclusion in the protein causing a new stop codon
Reference transcript: NM_080592
C3orf17
refseq_C3orf17.F1 refseq_C3orf17.R1 131 286
NCBIGene 36.3 25871
Single exon skipping, size difference: 155
Exclusion in the protein causing a frameshift
Reference transcript: NM_015412
C3orf43
refseq_C3orf43.F1 refseq_C3orf43.R1 176 232
NCBIGene 36.2 255798
Alternative 5-prime, size difference: 56
Inclusion in the protein causing a frameshift
Reference transcript: XM_931549
C4orf13
refseq_C4orf13.F1 refseq_C4orf13.R1 232 450
NCBIGene 36.2 84068
Multiple exon skipping, size difference: 218
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_001029998
Changed!
COG COG0385 336aa 1e-21
in ref transcript
Predicted Na+-dependent transporter [General function prediction only].
Changed!
COG COG0385 185aa 2e-09
in modified transcript
C4orf18
refseq_C4orf18.F1 refseq_C4orf18.R1 124 148
NCBIGene 36.3 51313
Exon skipping and alternative 3-prime or 5-prime, size difference: 24
Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_001031700
C6orf1
refseq_C6orf1.F1 refseq_C6orf1.R1 179 239
NCBIGene 36.3 221491
Alternative 5-prime, size difference: 60
Exclusion in 5'UTR
Reference transcript: NM_001008703
TIGR FadB 33aa 0.002
in ref transcript
Members represent alpha subunit of multifunctional enzyme complex of the fatty acid degradation cycle. Activities include: enoyl-CoA hydratase (EC 4.2.1.17), dodecenoyl-CoA delta-isomerase activity (EC 5.3.3.8), 3-hydroxyacyl-CoA dehydrogenase (EC 1.1.1.35), 3-hydroxybutyryl-CoA epimerase (EC 5.1.2.3). A representative is E. coli FadB. This model excludes the FadJ family.
C6orf1
refseq_C6orf1.F2 refseq_C6orf1.R2 134 386
NCBIGene 36.3 221491
Alternative 5-prime, size difference: 252
Exclusion in 5'UTR
Reference transcript: NM_178508
TIGR FadB 33aa 0.002
in ref transcript
Members represent alpha subunit of multifunctional enzyme complex of the fatty acid degradation cycle. Activities include: enoyl-CoA hydratase (EC 4.2.1.17), dodecenoyl-CoA delta-isomerase activity (EC 5.3.3.8), 3-hydroxyacyl-CoA dehydrogenase (EC 1.1.1.35), 3-hydroxybutyryl-CoA epimerase (EC 5.1.2.3). A representative is E. coli FadB. This model excludes the FadJ family.
C6orf106
refseq_C6orf106.F2 refseq_C6orf106.R2 171 369
NCBIGene 36.3 64771
Single exon skipping, size difference: 198
Exclusion in the protein (no frameshift)
Reference transcript: NM_024294
C6orf134
refseq_C6orf134.F1 refseq_C6orf134.R1 121 403
NCBIGene 36.3 79969
Alternative 5-prime, size difference: 282
Exclusion of the protein initiation site
Reference transcript: NM_001031722
pfam DUF738 121aa 1e-54
in ref transcript
Protein of unknown function (DUF738). This family consists of several uncharacterised eukaryotic proteins of unknown function.
C6orf25
refseq_C6orf25.F1 refseq_C6orf25.R1 278 369
NCBIGene 36.3 80739
Single exon skipping, size difference: 91
Exclusion in the protein causing a frameshift
Reference transcript: NM_138272
Changed!
pfam V-set 105aa 0.008
in modified transcript
Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
C6orf48
refseq_C6orf48.F1 refseq_C6orf48.R1 175 269
NCBIGene 36.3 50854
Single exon skipping, size difference: 94
Exclusion in 5'UTR
Reference transcript: NM_001040437
C9orf105
refseq_C9orf105.F1 refseq_C9orf105.R1 240 342
NCBIGene 36.2 401505
Alternative 5-prime, size difference: 102
Exclusion in the protein (no frameshift)
Reference transcript: XM_933960
C9orf131
refseq_C9orf131.F1 refseq_C9orf131.R1 194 299
NCBIGene 36.3 138724
Alternative 5-prime, size difference: 105
Exclusion in the protein (no frameshift)
Reference transcript: NM_203299
C9orf23
refseq_C9orf23.F1 refseq_C9orf23.R1 156 360
NCBIGene 36.3 138716
Alternative 5-prime, size difference: 204
Exclusion in 5'UTR
Reference transcript: NM_148179
pfam Alba 65aa 5e-14
in ref transcript
Alba. Alba is a novel chromosomal protein that coats archaeal DNA without compacting it.
C9orf24
refseq_C9orf24.F1 refseq_C9orf24.R1 152 306
NCBIGene 36.3 84688
Alternative 5-prime, size difference: 154
Exclusion in the protein causing a frameshift
Reference transcript: NM_032596
C9orf58
refseq_C9orf58.F1 refseq_C9orf58.R1 97 119
NCBIGene 36.2 83543
Alternative 3-prime, size difference: 22
Inclusion in the protein causing a frameshift
Reference transcript: NM_031426
Changed!
cd EFh 61aa 1e-04
in ref transcript
EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
Changed!
PTZ PTZ00184 69aa 2e-04
in ref transcript
calmodulin; Provisional.
CA12
refseq_CA12.F2 refseq_CA12.R2 143 176
NCBIGene 36.3 771
Single exon skipping, size difference: 33
Exclusion in the protein (no frameshift)
Reference transcript: NM_001218
cd alpha_CA_XII_XIV 251aa 1e-120
in ref transcript
Carbonic anhydrase alpha, isozymes XII and XIV. Carbonic anhydrases (CAs) are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism: a nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide, followed by the regeneration of the active site by ionization of the zinc-bound water molecule and removal of a proton from the active site. They are ubiquitous enzymes involved in fundamental processes like photosynthesis, respiration, pH homeostasis and ion transport. There are three evolutionary distinct groups - alpha, beta and gamma carbonic anhydrases - which show no significant sequence identity or structural similarity. Most alpha CAs are monomeric enzymes. The zinc ion is complexed by three histidine residues. This sub-family comprises the membrane proteins CA XII and XIV.
pfam Carb_anhydrase 258aa 3e-66
in ref transcript
Eukaryotic-type carbonic anhydrase.
COG Cah 264aa 6e-29
in ref transcript
Carbonic anhydrase [Inorganic ion transport and metabolism].
CABP1
refseq_CABP1.F1 refseq_CABP1.R1 126 306
NCBIGene 36.3 9478
Single exon skipping, size difference: 180
Exclusion in the protein (no frameshift)
Reference transcript: NM_031205
cd EFh 64aa 8e-13
in ref transcript
EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
cd EFh 54aa 5e-04
in ref transcript
smart EFh 29aa 7e-05
in ref transcript
EF-hand, calcium binding motif. EF-hands are calcium-binding motifs that occur at least in pairs. Links between disease states and genes encoding EF-hands, particularly the S100 subclass, are emerging. Each motif consists of a 12 residue loop flanked on either side by a 12 residue alpha-helix. EF-hands undergo a conformational change unpon binding calcium ions.
smart EFh 28aa 6e-04
in ref transcript
PTZ PTZ00184 132aa 4e-24
in ref transcript
calmodulin; Provisional.
Changed!
smart EH 58aa 0.007
in modified transcript
Eps15 homology domain. Pair of EF hand motifs that recognise proteins containing Asn-Pro-Phe (NPF) sequences.
CABP2
refseq_CABP2.F1 refseq_CABP2.R1 109 280
NCBIGene 36.3 51475
Single exon skipping, size difference: 171
Exclusion in the protein (no frameshift)
Reference transcript: NM_016366
cd EFh 64aa 5e-14
in ref transcript
EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
cd EFh 59aa 6e-06
in ref transcript
pfam efhand 29aa 1e-04
in ref transcript
EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typically undergo a calcium-dependent conformational change which opens a target binding site. The latter group is represented by calbindin D9k and do not undergo calcium dependent conformational changes.
smart EFh 28aa 5e-04
in ref transcript
EF-hand, calcium binding motif. EF-hands are calcium-binding motifs that occur at least in pairs. Links between disease states and genes encoding EF-hands, particularly the S100 subclass, are emerging. Each motif consists of a 12 residue loop flanked on either side by a 12 residue alpha-helix. EF-hands undergo a conformational change unpon binding calcium ions.
Changed!
pfam efhand 29aa 0.006
in ref transcript
PTZ PTZ00184 145aa 1e-27
in ref transcript
calmodulin; Provisional.
CABYR
refseq_CABYR.F2 refseq_CABYR.R2 306 360
NCBIGene 36.3 26256
Single exon skipping, size difference: 54
Exclusion in the protein (no frameshift)
Reference transcript: NM_012189
smart RIIa 38aa 3e-08
in ref transcript
RIIalpha, Regulatory subunit portion of type II PKA R-subunit. RIIalpha, Regulatory subunit portion of type II PKA R-subunit. Contains dimerisation interface and binding site for A-kinase-anchoring proteins (AKAPs).
CACNA1G
refseq_CACNA1G.F2 refseq_CACNA1G.R2 106 175
NCBIGene 36.3 8913
Single exon skipping, size difference: 69
Exclusion in the protein (no frameshift)
Reference transcript: NM_018896
pfam Ion_trans 187aa 4e-37
in ref transcript
Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
pfam Ion_trans 206aa 6e-35
in ref transcript
pfam Ion_trans 223aa 2e-26
in ref transcript
pfam Ion_trans 112aa 2e-16
in ref transcript
pfam Ion_trans 79aa 4e-11
in ref transcript
CACNA1G
refseq_CACNA1G.F3 refseq_CACNA1G.R3 194 338
NCBIGene 36.3 8913
Single exon skipping, size difference: 144
Exclusion in the protein (no frameshift)
Reference transcript: NM_018896
pfam Ion_trans 187aa 4e-37
in ref transcript
Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
pfam Ion_trans 206aa 6e-35
in ref transcript
pfam Ion_trans 223aa 2e-26
in ref transcript
pfam Ion_trans 112aa 2e-16
in ref transcript
pfam Ion_trans 79aa 4e-11
in ref transcript
CACNA1G
refseq_CACNA1G.F4 refseq_CACNA1G.R4 187 322
NCBIGene 36.3 8913
Single exon skipping, size difference: 135
Exclusion in the protein (no frameshift)
Reference transcript: NM_018896
pfam Ion_trans 187aa 4e-37
in ref transcript
Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
pfam Ion_trans 206aa 6e-35
in ref transcript
pfam Ion_trans 223aa 2e-26
in ref transcript
pfam Ion_trans 112aa 2e-16
in ref transcript
pfam Ion_trans 79aa 4e-11
in ref transcript
Changed!
pfam Atrophin-1 286aa 0.001
in modified transcript
Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
CACNA1H
refseq_CACNA1H.F1 refseq_CACNA1H.R1 100 118
NCBIGene 36.3 8912
Single exon skipping, size difference: 18
Exclusion in the protein (no frameshift)
Reference transcript: NM_021098
pfam Ion_trans 223aa 4e-33
in ref transcript
Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
pfam Ion_trans 169aa 9e-32
in ref transcript
pfam Ion_trans 188aa 2e-29
in ref transcript
pfam Ion_trans 126aa 3e-15
in ref transcript
pfam Ion_trans 79aa 3e-11
in ref transcript
CACNA1I
refseq_CACNA1I.F1 refseq_CACNA1I.R1 294 399
NCBIGene 36.3 8911
Single exon skipping, size difference: 105
Exclusion in the protein (no frameshift)
Reference transcript: NM_021096
pfam Ion_trans 223aa 4e-35
in ref transcript
Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
pfam Ion_trans 209aa 3e-31
in ref transcript
pfam Ion_trans 188aa 4e-18
in ref transcript
pfam Ion_trans 89aa 7e-16
in ref transcript
pfam Ion_trans 79aa 2e-08
in ref transcript
CACNA2D4
refseq_CACNA2D4.F1 refseq_CACNA2D4.R1 104 149
NCBIGene 36.2 93589
Single exon skipping, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_172364
cd vWA_VGCC_like 184aa 9e-66
in ref transcript
VWA Voltage gated Calcium channel like: Voltage-gated calcium channels are a complex of five proteins: alpha 1, beta 1, gamma, alpha 2 and delta. The alpha 2 and delta subunits result from proteolytic processing of a single gene product and carries at its N-terminus the VWA and cache domains, The alpha 2 delta gene family has orthologues in D. melanogaster and C. elegans but none have been detected in aither A. thaliana or yeast. The exact biochemical function of the VWA domain is not known but the alpha 2 delta complex has been shown to regulate various functional properties of the channel complex.
pfam VWA_N 117aa 1e-46
in ref transcript
VWA N-terminal. This domain is found at the N-terminus of proteins containing von Willebrand factor type A (VWA, pfam00092) and Cache (pfam02743) domains. It has been found in vertebrates, Drosophila and Caenorhabditis elegans but has not yet been identified in other eukaryotes. It is probably involved in the function of some voltage-dependent calcium channel subunits.
pfam Cache_1 93aa 6e-15
in ref transcript
Cache domain.
smart VWA 180aa 4e-14
in ref transcript
von Willebrand factor (vWF) type A domain. VWA domains in extracellular eukaryotic proteins mediate adhesion via metal ion-dependent adhesion sites (MIDAS). Intracellular VWA domains and homologues in prokaryotes have recently been identified. The proposed VWA domains in integrin beta subunits have recently been substantiated using sequence-based methods.
COG COG2425 346aa 5e-04
in ref transcript
Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [General function prediction only].
CACNA2D4
refseq_CACNA2D4.F3 refseq_CACNA2D4.R3 108 179
NCBIGene 36.2 93589
Single exon skipping, size difference: 71
Exclusion in the protein causing a frameshift
Reference transcript: NM_172364
cd vWA_VGCC_like 184aa 9e-66
in ref transcript
VWA Voltage gated Calcium channel like: Voltage-gated calcium channels are a complex of five proteins: alpha 1, beta 1, gamma, alpha 2 and delta. The alpha 2 and delta subunits result from proteolytic processing of a single gene product and carries at its N-terminus the VWA and cache domains, The alpha 2 delta gene family has orthologues in D. melanogaster and C. elegans but none have been detected in aither A. thaliana or yeast. The exact biochemical function of the VWA domain is not known but the alpha 2 delta complex has been shown to regulate various functional properties of the channel complex.
pfam VWA_N 117aa 1e-46
in ref transcript
VWA N-terminal. This domain is found at the N-terminus of proteins containing von Willebrand factor type A (VWA, pfam00092) and Cache (pfam02743) domains. It has been found in vertebrates, Drosophila and Caenorhabditis elegans but has not yet been identified in other eukaryotes. It is probably involved in the function of some voltage-dependent calcium channel subunits.
pfam Cache_1 93aa 6e-15
in ref transcript
Cache domain.
smart VWA 180aa 4e-14
in ref transcript
von Willebrand factor (vWF) type A domain. VWA domains in extracellular eukaryotic proteins mediate adhesion via metal ion-dependent adhesion sites (MIDAS). Intracellular VWA domains and homologues in prokaryotes have recently been identified. The proposed VWA domains in integrin beta subunits have recently been substantiated using sequence-based methods.
Changed!
COG COG2425 346aa 5e-04
in ref transcript
Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [General function prediction only].
Changed!
COG COG2425 325aa 0.003
in modified transcript
Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_000723
cd SH3 57aa 6e-04
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
CACNG6
refseq_CACNG6.F1 refseq_CACNG6.R1 133 271
NCBIGene 36.3 59285
Single exon skipping, size difference: 138
Exclusion in the protein (no frameshift)
Reference transcript: NM_145814
CADPS
refseq_CADPS.F1 refseq_CADPS.R1 118 139
NCBIGene 36.3 8618
Single exon skipping, size difference: 21
Exclusion in the protein (no frameshift)
Reference transcript: NM_003716
cd PH_CADPS 110aa 2e-57
in ref transcript
CADPS (Ca2+-dependent activator protein) Pleckstrin homology (PH) domain. CADPS is a calcium-dependent activator involved in secretion. It contains a central PH domain that binds to phosphoinositide 4,5 bisphosphate containing liposomes. However, membrane association may also be mediated by binding to phosphatidlyserine via general electrostatic interactions. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
Changed!
pfam DUF1041 106aa 4e-13
in ref transcript
Domain of Unknown Function (DUF1041). This family consists of several eukaryotic domains of unknown function. Members of this family are often found in tandem repeats and co-occur with pfam00168, pfam00130 and pfam00169 domains.
smart PH 102aa 6e-09
in ref transcript
Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
Changed!
pfam DUF1041 99aa 6e-15
in modified transcript
CADPS
refseq_CADPS.F4 refseq_CADPS.R4 109 160
NCBIGene 36.3 8618
Alternative 5-prime, size difference: 51
Exclusion in the protein (no frameshift)
Reference transcript: NM_003716
cd PH_CADPS 110aa 2e-57
in ref transcript
CADPS (Ca2+-dependent activator protein) Pleckstrin homology (PH) domain. CADPS is a calcium-dependent activator involved in secretion. It contains a central PH domain that binds to phosphoinositide 4,5 bisphosphate containing liposomes. However, membrane association may also be mediated by binding to phosphatidlyserine via general electrostatic interactions. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
pfam DUF1041 106aa 4e-13
in ref transcript
Domain of Unknown Function (DUF1041). This family consists of several eukaryotic domains of unknown function. Members of this family are often found in tandem repeats and co-occur with pfam00168, pfam00130 and pfam00169 domains.
smart PH 102aa 6e-09
in ref transcript
Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
CADPS
refseq_CADPS.F6 refseq_CADPS.R6 170 317
NCBIGene 36.3 8618
Single exon skipping, size difference: 147
Exclusion in the protein (no frameshift)
Reference transcript: NM_003716
cd PH_CADPS 110aa 2e-57
in ref transcript
CADPS (Ca2+-dependent activator protein) Pleckstrin homology (PH) domain. CADPS is a calcium-dependent activator involved in secretion. It contains a central PH domain that binds to phosphoinositide 4,5 bisphosphate containing liposomes. However, membrane association may also be mediated by binding to phosphatidlyserine via general electrostatic interactions. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
pfam DUF1041 106aa 4e-13
in ref transcript
Domain of Unknown Function (DUF1041). This family consists of several eukaryotic domains of unknown function. Members of this family are often found in tandem repeats and co-occur with pfam00168, pfam00130 and pfam00169 domains.
smart PH 102aa 6e-09
in ref transcript
Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
CADPS2
refseq_CADPS2.F1 refseq_CADPS2.R1 382 487
NCBIGene 36.3 93664
Single exon skipping, size difference: 120
Exclusion in the protein (no frameshift)
Reference transcript: NM_017954
cd PH_CADPS 110aa 2e-52
in ref transcript
CADPS (Ca2+-dependent activator protein) Pleckstrin homology (PH) domain. CADPS is a calcium-dependent activator involved in secretion. It contains a central PH domain that binds to phosphoinositide 4,5 bisphosphate containing liposomes. However, membrane association may also be mediated by binding to phosphatidlyserine via general electrostatic interactions. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
pfam DUF1041 95aa 1e-13
in ref transcript
Domain of Unknown Function (DUF1041). This family consists of several eukaryotic domains of unknown function. Members of this family are often found in tandem repeats and co-occur with pfam00168, pfam00130 and pfam00169 domains.
smart PH 102aa 2e-09
in ref transcript
Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
CADPS2
refseq_CADPS2.F1 refseq_CADPS2.R2 116 236
NCBIGene 36.3 93664
Single exon skipping, size difference: 120
Exclusion in the protein (no frameshift)
Reference transcript: NM_017954
cd PH_CADPS 110aa 2e-52
in ref transcript
CADPS (Ca2+-dependent activator protein) Pleckstrin homology (PH) domain. CADPS is a calcium-dependent activator involved in secretion. It contains a central PH domain that binds to phosphoinositide 4,5 bisphosphate containing liposomes. However, membrane association may also be mediated by binding to phosphatidlyserine via general electrostatic interactions. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
pfam DUF1041 95aa 1e-13
in ref transcript
Domain of Unknown Function (DUF1041). This family consists of several eukaryotic domains of unknown function. Members of this family are often found in tandem repeats and co-occur with pfam00168, pfam00130 and pfam00169 domains.
smart PH 102aa 2e-09
in ref transcript
Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
CAGE1
refseq_CAGE1.F1 refseq_CAGE1.R1 119 224
NCBIGene 36.2 285782
Single exon skipping, size difference: 105
Exclusion in 3'UTR
Reference transcript: NM_175745
pfam SMC_N 256aa 7e-06
in ref transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
COG Smc 189aa 2e-05
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
CAGE1
refseq_CAGE1.F3 refseq_CAGE1.R3 112 248
NCBIGene 36.2 285782
Exon skipping and alternative 3-prime or 5-prime, size difference: 136
Exclusion in the protein (no frameshift), Exclusion of the stop codon
Reference transcript: NM_175745
pfam SMC_N 256aa 7e-06
in ref transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
COG Smc 189aa 2e-05
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
CALB2
refseq_CALB2.F2 refseq_CALB2.R2 114 154
NCBIGene 36.3 794
Single exon skipping, size difference: 40
Exclusion in the protein causing a frameshift
Reference transcript: NM_001740
Changed!
cd EFh 70aa 4e-07
in ref transcript
EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
cd EFh 73aa 2e-05
in ref transcript
Changed!
cd EFh 67aa 4e-05
in ref transcript
Changed!
PTZ PTZ00184 169aa 4e-07
in ref transcript
calmodulin; Provisional.
Changed!
cd EFh 69aa 3e-07
in modified transcript
Changed!
PTZ PTZ00184 167aa 8e-07
in modified transcript
CALML4
refseq_CALML4.F1 refseq_CALML4.R1 104 434
NCBIGene 36.2 91860
Multiple exon skipping, size difference: 330
Exclusion of the protein initiation site, Exclusion in the protein (no frameshift)
Reference transcript: NM_001031733
Changed!
cd EFh 63aa 1e-05
in ref transcript
EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
Changed!
PTZ PTZ00184 107aa 2e-21
in ref transcript
calmodulin; Provisional.
CAMK2B
refseq_CAMK2B.F2 refseq_CAMK2B.R2 117 189
NCBIGene 36.3 816
Single exon skipping, size difference: 72
Exclusion in the protein (no frameshift)
Reference transcript: NM_001220
cd S_TKc 260aa 2e-82
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
smart S_TKc 249aa 1e-85
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
pfam CaMKII_AD 128aa 1e-58
in ref transcript
Calcium/calmodulin dependent protein kinase II Association. This domain is found at the C-terminus of the Calcium/calmodulin dependent protein kinases II (CaMKII). These proteins also have a Ser/Thr protein kinase domain (pfam00069) at their N-terminus. The function of the CaMKII association domain is the assembly of the single proteins into large (8 to 14 subunits) multimers.
PTZ PTZ00263 249aa 6e-40
in ref transcript
protein kinase A catalytic subunit; Provisional.
CAMK2B
refseq_CAMK2B.F4 refseq_CAMK2B.R4 100 472
NCBIGene 36.3 816
Multiple exon skipping, size difference: 372
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_001220
cd S_TKc 260aa 2e-82
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
smart S_TKc 249aa 1e-85
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
pfam CaMKII_AD 128aa 1e-58
in ref transcript
Calcium/calmodulin dependent protein kinase II Association. This domain is found at the C-terminus of the Calcium/calmodulin dependent protein kinases II (CaMKII). These proteins also have a Ser/Thr protein kinase domain (pfam00069) at their N-terminus. The function of the CaMKII association domain is the assembly of the single proteins into large (8 to 14 subunits) multimers.
Changed!
PTZ PTZ00263 249aa 6e-40
in ref transcript
protein kinase A catalytic subunit; Provisional.
Changed!
COG SPS1 265aa 4e-40
in modified transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
CAMK2B
refseq_CAMK2B.F5 refseq_CAMK2B.R5 171 249
NCBIGene 36.3 816
Alternative 3-prime, size difference: 78
Exclusion in the protein (no frameshift)
Reference transcript: NM_001220
cd S_TKc 260aa 2e-82
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
smart S_TKc 249aa 1e-85
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
Changed!
pfam CaMKII_AD 128aa 1e-58
in ref transcript
Calcium/calmodulin dependent protein kinase II Association. This domain is found at the C-terminus of the Calcium/calmodulin dependent protein kinases II (CaMKII). These proteins also have a Ser/Thr protein kinase domain (pfam00069) at their N-terminus. The function of the CaMKII association domain is the assembly of the single proteins into large (8 to 14 subunits) multimers.
PTZ PTZ00263 249aa 6e-40
in ref transcript
protein kinase A catalytic subunit; Provisional.
Changed!
pfam CaMKII_AD 102aa 8e-40
in modified transcript
CAMK2G
refseq_CAMK2G.F2 refseq_CAMK2G.R2 187 257
NCBIGene 36.2 818
Alternative 5-prime, size difference: 42
Exclusion in the protein (no frameshift)
Reference transcript: NM_172171
cd S_TKc 260aa 5e-84
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
smart S_TKc 249aa 2e-86
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
pfam CaMKII_AD 128aa 5e-59
in ref transcript
Calcium/calmodulin dependent protein kinase II Association. This domain is found at the C-terminus of the Calcium/calmodulin dependent protein kinases II (CaMKII). These proteins also have a Ser/Thr protein kinase domain (pfam00069) at their N-terminus. The function of the CaMKII association domain is the assembly of the single proteins into large (8 to 14 subunits) multimers.
PTZ PTZ00263 249aa 3e-43
in ref transcript
protein kinase A catalytic subunit; Provisional.
CAMK2G
refseq_CAMK2G.F3 refseq_CAMK2G.R3 215 329
NCBIGene 36.3 818
Single exon skipping, size difference: 114
Exclusion in the protein (no frameshift)
Reference transcript: NM_172171
cd S_TKc 260aa 5e-84
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
smart S_TKc 249aa 2e-86
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
pfam CaMKII_AD 128aa 5e-59
in ref transcript
Calcium/calmodulin dependent protein kinase II Association. This domain is found at the C-terminus of the Calcium/calmodulin dependent protein kinases II (CaMKII). These proteins also have a Ser/Thr protein kinase domain (pfam00069) at their N-terminus. The function of the CaMKII association domain is the assembly of the single proteins into large (8 to 14 subunits) multimers.
PTZ PTZ00263 249aa 3e-43
in ref transcript
protein kinase A catalytic subunit; Provisional.
CAMK2G
refseq_CAMK2G.F4 refseq_CAMK2G.R4 126 189
NCBIGene 36.3 818
Single exon skipping, size difference: 63
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_172171
cd S_TKc 260aa 5e-84
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
smart S_TKc 249aa 2e-86
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
pfam CaMKII_AD 128aa 5e-59
in ref transcript
Calcium/calmodulin dependent protein kinase II Association. This domain is found at the C-terminus of the Calcium/calmodulin dependent protein kinases II (CaMKII). These proteins also have a Ser/Thr protein kinase domain (pfam00069) at their N-terminus. The function of the CaMKII association domain is the assembly of the single proteins into large (8 to 14 subunits) multimers.
PTZ PTZ00263 249aa 3e-43
in ref transcript
protein kinase A catalytic subunit; Provisional.
CAMK2G
refseq_CAMK2G.F6 refseq_CAMK2G.R1 106 134
NCBIGene 36.2 818
Alternative 3-prime, size difference: 28
Exclusion in the protein causing a frameshift
Reference transcript: NM_172171
cd S_TKc 260aa 5e-84
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
smart S_TKc 249aa 2e-86
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
Changed!
pfam CaMKII_AD 128aa 5e-59
in ref transcript
Calcium/calmodulin dependent protein kinase II Association. This domain is found at the C-terminus of the Calcium/calmodulin dependent protein kinases II (CaMKII). These proteins also have a Ser/Thr protein kinase domain (pfam00069) at their N-terminus. The function of the CaMKII association domain is the assembly of the single proteins into large (8 to 14 subunits) multimers.
PTZ PTZ00263 249aa 3e-43
in ref transcript
protein kinase A catalytic subunit; Provisional.
CAMKK1
refseq_CAMKK1.F1 refseq_CAMKK1.R1 133 247
NCBIGene 36.3 84254
Single exon skipping, size difference: 114
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_172206
Changed!
cd S_TKc 283aa 9e-69
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
Changed!
smart S_TKc 272aa 6e-73
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
Changed!
COG SPS1 359aa 2e-36
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
Changed!
cd S_TKc 321aa 3e-62
in modified transcript
Changed!
smart S_TKc 310aa 1e-65
in modified transcript
Changed!
COG SPS1 397aa 4e-31
in modified transcript
CAMKK2
refseq_CAMKK2.F2 refseq_CAMKK2.R2 199 242
NCBIGene 36.3 10645
Single exon skipping, size difference: 43
Exclusion in the protein causing a frameshift
Reference transcript: NM_006549
cd S_TKc 283aa 5e-68
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
smart S_TKc 272aa 4e-70
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
PTZ PTZ00263 295aa 6e-38
in ref transcript
protein kinase A catalytic subunit; Provisional.
CAMKK2
refseq_CAMKK2.F3 refseq_CAMKK2.R3 134 263
NCBIGene 36.3 10645
Single exon skipping, size difference: 129
Exclusion in the protein (no frameshift)
Reference transcript: NM_006549
Changed!
cd S_TKc 283aa 5e-68
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
Changed!
smart S_TKc 272aa 4e-70
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
Changed!
PTZ PTZ00263 295aa 6e-38
in ref transcript
protein kinase A catalytic subunit; Provisional.
Changed!
cd S_TKc 278aa 4e-65
in modified transcript
Changed!
smart S_TKc 267aa 3e-67
in modified transcript
Changed!
PTZ PTZ00263 275aa 3e-37
in modified transcript
CANX
refseq_CANX.F1 refseq_CANX.R1 132 185
NCBIGene 36.3 821
Alternative 5-prime, size difference: 53
Exclusion in 5'UTR
Reference transcript: NM_001746
pfam Calreticulin 372aa 1e-143
in ref transcript
Calreticulin family.
CAPN9
refseq_CAPN9.F1 refseq_CAPN9.R1 167 245
NCBIGene 36.3 10753
Single exon skipping, size difference: 78
Exclusion in the protein (no frameshift)
Reference transcript: NM_006615
Changed!
cd CysPc 305aa 1e-105
in ref transcript
Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. Functions in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction.
cd Calpain_III 150aa 5e-61
in ref transcript
Calpain, subdomain III. Calpains are calcium-activated cytoplasmic cysteine proteinases, participate in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction. Catalytic domain and the two calmodulin-like domains are separated by C2-like domain III. Domain III plays an important role in calcium-induced activation of calpain involving electrostatic interactions with subdomain II. Proposed to mediate calpain's interaction with phospholipids and translocation to cytoplasmic/nuclear membranes. CD includes subdomain III of typical and atypical calpains.
cd EFh 56aa 0.002
in ref transcript
EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
Changed!
smart CysPc 316aa 1e-148
in ref transcript
Calpain-like thiol protease family. Calpain-like thiol protease family (peptidase family C2). Calcium activated neutral protease (large subunit).
pfam Calpain_III 147aa 1e-69
in ref transcript
Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated by calcium ions.
Changed!
cd CysPc 279aa 1e-94
in modified transcript
Changed!
smart CysPc 290aa 1e-132
in modified transcript
CAPS
refseq_CAPS.F1 refseq_CAPS.R1 159 240
NCBIGene 36.3 828
Alternative 5-prime, size difference: 81
Exclusion in the protein (no frameshift)
Reference transcript: NM_004058
cd EFh 62aa 2e-08
in ref transcript
EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
cd EFh 58aa 6e-04
in ref transcript
Changed!
COG FRQ1 146aa 6e-07
in ref transcript
Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].
Changed!
PTZ PTZ00184 129aa 4e-07
in modified transcript
calmodulin; Provisional.
CARS
refseq_CARS.F1 refseq_CARS.R1 115 162
NCBIGene 36.3 833
Alternative 3-prime, size difference: 47
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001014437
Changed!
cd CysRS_core 102aa 1e-47
in ref transcript
This is the catalytic core domain of cysteinyl tRNA synthetase (CysRS). This class I enzyme is a monomer, which aminoacylates the 2'-OH of the nucleotide at the 3' of the appropriate tRNA. The core domain is based on the Rossman fold and is responsible for the ATP-dependent formation of the enzyme bound aminoacyl-adenylate. It contains the characteristic class I HIGH and KMSKS motifs, which are involved in ATP binding.
Changed!
cd CysRS_core 102aa 5e-37
in ref transcript
Changed!
cd CysRS_core 34aa 4e-05
in ref transcript
Changed!
pfam tRNA-synt_1e 246aa 9e-88
in ref transcript
tRNA synthetases class I (C) catalytic domain. This family includes only cysteinyl tRNA synthetases.
Changed!
TIGR cysS 86aa 7e-35
in ref transcript
This model finds the cysteinyl-tRNA synthetase from most but not from all species. The enzyme from one archaeal species, Archaeoglobus fulgidus, is found but the equivalent enzymes from some other Archaea, including Methanococcus jannaschii, are not found, although biochemical evidence suggests that tRNA(Cys) in these species are charged directly with Cys rather than through a misacylation and correction pathway as for tRNA(Gln).
Changed!
TIGR leuS_arch 169aa 2e-07
in ref transcript
The leucyl-tRNA synthetases belong to two families so broadly different that they are represented by separate models. This model includes both archaeal and cytosolic eukaryotic leucyl-tRNA synthetases; the eubacterial and mitochondrial forms differ so substantially that some other tRNA ligases score higher by this model than does any eubacterial LeuS.
Changed!
COG CysS 424aa 2e-96
in ref transcript
Cysteinyl-tRNA synthetase [Translation, ribosomal structure and biogenesis].
Changed!
PRK cysS 81aa 2e-36
in ref transcript
cysteinyl-tRNA synthetase; Validated.
Changed!
cd GST_C_EFB1gamma 48aa 0.004
in modified transcript
GST_C family, Gamma subunit of Elongation Factor 1B (EFB1gamma) subfamily; EF1Bgamma is part of the eukaryotic translation elongation factor-1 (EF1) complex which plays a central role in the elongation cycle during protein biosynthesis. EF1 consists of two functionally distinct units, EF1A and EF1B. EF1A catalyzes the GTP-dependent binding of aminoacyl-tRNA to the ribosomal A site concomitant with the hydrolysis of GTP. The resulting inactive EF1A:GDP complex is recycled to the active GTP form by the guanine-nucleotide exchange factor EF1B, a complex composed of at least two subunits, alpha and gamma. Metazoan EFB1 contain a third subunit, beta. The EF1B gamma subunit contains a GST fold consisting of an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain. The GST-like domain of EF1Bgamma is believed to mediate the dimerization of the EF1 complex, which in yeast is a dimer of the heterotrimer EF1A:EF1Balpha:EF1Bgamma. In addition to its role in protein biosynthesis, EF1Bgamma may also display other functions. The recombinant rice protein has been shown to possess GSH conjugating activity. The yeast EF1Bgamma binds membranes in a calcium dependent manner and is also part of a complex that binds to the msrA (methionine sulfoxide reductase) promoter suggesting a function in the regulation of its gene expression. Also included in this subfamily is the GST_C-like domain at the N-terminus of human valyl-tRNA synthetase and its homologs from zebrafish and Xenopus. Although not included in the alignment, the GST_C-like domain containing a deletion present in some aminoacyl-tRNA synthetases is recognized by this model. This domain will be represented in the future by a deletion model of GST_C.
CARS
refseq_CARS.F3 refseq_CARS.R3 149 398
NCBIGene 36.3 833
Single exon skipping, size difference: 249
Exclusion in the protein (no frameshift)
Reference transcript: NM_001014437
cd CysRS_core 102aa 1e-47
in ref transcript
This is the catalytic core domain of cysteinyl tRNA synthetase (CysRS). This class I enzyme is a monomer, which aminoacylates the 2'-OH of the nucleotide at the 3' of the appropriate tRNA. The core domain is based on the Rossman fold and is responsible for the ATP-dependent formation of the enzyme bound aminoacyl-adenylate. It contains the characteristic class I HIGH and KMSKS motifs, which are involved in ATP binding.
cd CysRS_core 102aa 5e-37
in ref transcript
cd CysRS_core 34aa 4e-05
in ref transcript
pfam tRNA-synt_1e 246aa 9e-88
in ref transcript
tRNA synthetases class I (C) catalytic domain. This family includes only cysteinyl tRNA synthetases.
TIGR cysS 86aa 7e-35
in ref transcript
This model finds the cysteinyl-tRNA synthetase from most but not from all species. The enzyme from one archaeal species, Archaeoglobus fulgidus, is found but the equivalent enzymes from some other Archaea, including Methanococcus jannaschii, are not found, although biochemical evidence suggests that tRNA(Cys) in these species are charged directly with Cys rather than through a misacylation and correction pathway as for tRNA(Gln).
TIGR leuS_arch 169aa 2e-07
in ref transcript
The leucyl-tRNA synthetases belong to two families so broadly different that they are represented by separate models. This model includes both archaeal and cytosolic eukaryotic leucyl-tRNA synthetases; the eubacterial and mitochondrial forms differ so substantially that some other tRNA ligases score higher by this model than does any eubacterial LeuS.
COG CysS 424aa 2e-96
in ref transcript
Cysteinyl-tRNA synthetase [Translation, ribosomal structure and biogenesis].
PRK cysS 81aa 2e-36
in ref transcript
cysteinyl-tRNA synthetase; Validated.
CASC5
refseq_CASC5.F1 refseq_CASC5.R1 416 494
NCBIGene 36.3 57082
Single exon skipping, size difference: 78
Exclusion in the protein (no frameshift)
Reference transcript: NM_170589
CASP1
refseq_CASP1.F2 refseq_CASP1.R2 306 369
NCBIGene 36.3 834
Single exon skipping, size difference: 63
Exclusion in the protein (no frameshift)
Reference transcript: NM_033292
cd CASc 250aa 1e-72
in ref transcript
Caspase, interleukin-1 beta converting enzyme (ICE) homologues; Cysteine-dependent aspartate-directed proteases that mediate programmed cell death (apoptosis). Caspases are synthesized as inactive zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologs.
smart CASc 250aa 5e-94
in ref transcript
Caspase, interleukin-1 beta converting enzyme (ICE) homologues. Cysteine aspartases that mediate programmed cell death (apoptosis). Caspases are synthesised as zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologues.
pfam CARD 88aa 5e-20
in ref transcript
Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold.
CASP4
refseq_CASP4.F1 refseq_CASP4.R1 141 199
NCBIGene 36.2 837
Alternative 3-prime, size difference: 58
Inclusion in the protein causing a frameshift
Reference transcript: NM_001225
Changed!
cd CASc 250aa 3e-69
in ref transcript
Caspase, interleukin-1 beta converting enzyme (ICE) homologues; Cysteine-dependent aspartate-directed proteases that mediate programmed cell death (apoptosis). Caspases are synthesized as inactive zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologs.
Changed!
smart CASc 250aa 2e-93
in ref transcript
Caspase, interleukin-1 beta converting enzyme (ICE) homologues. Cysteine aspartases that mediate programmed cell death (apoptosis). Caspases are synthesised as zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologues.
Changed!
smart CARD 89aa 2e-10
in ref transcript
Caspase recruitment domain. Motif contained in proteins involved in apoptotic signalling. Mediates homodimerisation. Structure consists of six antiparallel helices arranged in a topology homologue to the DEATH and the DED domain.
Changed!
smart CARD 89aa 3e-11
in modified transcript
CASP6
refseq_CASP6.F2 refseq_CASP6.R2 102 369
NCBIGene 36.3 839
Multiple exon skipping, size difference: 267
Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift), Exclusion in the protein causing a frameshift
Reference transcript: NM_001226
Changed!
cd CASc 253aa 2e-82
in ref transcript
Caspase, interleukin-1 beta converting enzyme (ICE) homologues; Cysteine-dependent aspartate-directed proteases that mediate programmed cell death (apoptosis). Caspases are synthesized as inactive zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologs.
Changed!
smart CASc 254aa 7e-91
in ref transcript
Caspase, interleukin-1 beta converting enzyme (ICE) homologues. Cysteine aspartases that mediate programmed cell death (apoptosis). Caspases are synthesised as zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologues.
Changed!
cd CASc 187aa 6e-60
in modified transcript
Changed!
smart CASc 184aa 1e-64
in modified transcript
CASP7
refseq_CASP7.F1 refseq_CASP7.R1 192 226
NCBIGene 36.3 840
Exon skipping and alternative 3-prime or 5-prime, size difference: 34
Inclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
Reference transcript: NM_033338
Changed!
cd CASc 242aa 4e-84
in ref transcript
Caspase, interleukin-1 beta converting enzyme (ICE) homologues; Cysteine-dependent aspartate-directed proteases that mediate programmed cell death (apoptosis). Caspases are synthesized as inactive zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologs.
Changed!
smart CASc 243aa 2e-90
in ref transcript
Caspase, interleukin-1 beta converting enzyme (ICE) homologues. Cysteine aspartases that mediate programmed cell death (apoptosis). Caspases are synthesised as zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologues.
Changed!
cd CASc 105aa 2e-26
in modified transcript
Changed!
smart CASc 90aa 5e-28
in modified transcript
CASP8
refseq_CASP8.F1 refseq_CASP8.R1 277 373
NCBIGene 36.3 841
Single exon skipping, size difference: 96
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_001080125
cd CASc 253aa 3e-85
in ref transcript
Caspase, interleukin-1 beta converting enzyme (ICE) homologues; Cysteine-dependent aspartate-directed proteases that mediate programmed cell death (apoptosis). Caspases are synthesized as inactive zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologs.
cd DED 75aa 3e-17
in ref transcript
Death effector domain. DED is part of a superfamily of death domains which also includes death-domain (DD) and caspase recruitment domain (CARD). Protein-protein interactions involving these domains occur through homotypic interactions, such as DED-DED. Caspases are the primary executioners of apoptosis via proteolytic cascades, and upstream caspases such as caspase-8 and caspase-9 are activated by signaling complexes such as the death inducing signaling complex (DISC) and the apoptosome. Binding of caspases to specific adaptor molecules via DED or CARD domains leads to autoactivation of caspases.
Changed!
cd DED 78aa 7e-13
in ref transcript
smart CASc 252aa 4e-90
in ref transcript
Caspase, interleukin-1 beta converting enzyme (ICE) homologues. Cysteine aspartases that mediate programmed cell death (apoptosis). Caspases are synthesised as zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologues.
pfam DED 84aa 5e-23
in ref transcript
Death effector domain.
Changed!
pfam DED 82aa 2e-17
in ref transcript
Changed!
cd DED 75aa 7e-12
in modified transcript
Changed!
pfam DED 81aa 7e-17
in modified transcript
CAST
refseq_CAST.F2 refseq_CAST.R2 103 282
NCBIGene 36.2 831
Exon skipping and alternative 3-prime or 5-prime, size difference: 179
Inclusion in the protein causing a frameshift, Inclusion in the protein causing a new stop codon
Reference transcript: NM_001750
Changed!
pfam Calpain_inhib 127aa 3e-19
in ref transcript
Calpain inhibitor. This region is found multiple times in calpain inhibitor proteins.
Changed!
pfam Calpain_inhib 118aa 2e-16
in ref transcript
Changed!
pfam Calpain_inhib 99aa 1e-13
in ref transcript
pfam Calpain_inhib 126aa 2e-11
in ref transcript
Changed!
pfam Calpain_inhib 86aa 2e-12
in modified transcript
CAV2
refseq_CAV2.F1 refseq_CAV2.R1 207 395
NCBIGene 36.3 858
Single exon skipping, size difference: 188
Exclusion in the protein causing a frameshift
Reference transcript: NM_001233
Changed!
pfam Caveolin 149aa 4e-55
in ref transcript
Caveolin. All three known Caveolin forms have the FEDVIAEP caveolin 'signature motif' within their hydrophilic N-terminal domain. Caveolin 2 (Cav-2) is co-localised and co-expressed with Cav-1/VIP21, forms heterodimers with it and needs Cav-1 for proper membrane localisation. Cav-3 has greater protein sequence similarity to Cav-1 than to Cav-2. Cellular processes caveolins are involved in include vesicular transport, cholesterol homeostasis, signal transduction, and tumour suppression.
Changed!
pfam Caveolin 41aa 2e-04
in modified transcript
CBFB
refseq_CBFB.F1 refseq_CBFB.R1 141 172
NCBIGene 36.3 865
Alternative 5-prime, size difference: 31
Inclusion in the protein causing a frameshift
Reference transcript: NM_022845
Changed!
pfam CBF_beta 178aa 4e-73
in ref transcript
Core binding factor beta subunit. Core binding factor (CBF) is a heterodimeric transcription factor essential for genetic regulation of hematopoiesis and osteogenesis. The beta subunit enhances DNA-binding ability of the alpha subunit in vitro, and has been show to have a structure related to the OB fold.
Changed!
pfam CBF_beta 170aa 2e-70
in modified transcript
CCDC108
refseq_CCDC108.F1 refseq_CCDC108.R1 132 287
NCBIGene 36.3 255101
Single exon skipping, size difference: 155
Exclusion of the protein initiation site
Reference transcript: NM_194302
CCDC55
refseq_CCDC55.F1 refseq_CCDC55.R1 220 316
NCBIGene 36.2 84081
Single exon skipping, size difference: 96
Inclusion in the protein causing a new stop codon
Reference transcript: NM_032141
Changed!
pfam DUF2040 126aa 7e-07
in ref transcript
Coiled-coil domain-containing protein 55 (DUF2040). This entry is a conserved domain of approximately 130 residues of proteins conserved from fungi to humans. The proteins do contain a coiled-coil domain, but the function is unknown.
Changed!
TIGR U2AF_lg 110aa 0.009
in ref transcript
Members of this subfamily are found in plants, metazoa and fungi.
CCDC62
refseq_CCDC62.F1 refseq_CCDC62.R1 260 354
NCBIGene 36.3 84660
Single exon skipping, size difference: 94
Exclusion of the stop codon
Reference transcript: NM_201435
pfam SMC_N 256aa 5e-07
in ref transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
COG Smc 261aa 6e-06
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
CCDC7
refseq_CCDC7.F2 refseq_CCDC7.R2 164 398
NCBIGene 36.3 221016
Alternative 5-prime, size difference: 234
Exclusion in 5'UTR
Reference transcript: NM_145023
CCL14
refseq_CCL14.F1 refseq_CCL14.R1 100 148
NCBIGene 36.3 6358
Single exon skipping, size difference: 48
Exclusion in 3'UTR
Reference transcript: NM_032962
cd Chemokine_CC 57aa 4e-17
in ref transcript
Chemokine_CC: 1 of 4 subgroup designations based on the arrangement of the two N-terminal cysteine residues; includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity; some members (e.g. 2HCC) contain an additional disulfide bond which is thought to compensate for the highly conserved Trp missing in these; chemotatic for monocytes, macrophages, eosinophils, basophils, and T cells, but not neutrophils; exist as monomers and dimers, but are believed to be functional as monomers; found only in vertebrates and a few viruses; a subgroup of CC, identified by an N-terminal DCCL motif (Exodus-1, Exodus-2, and Exodus-3), has been shown to inhibit specific types of human cancer cell growth in a mouse model. See CDs: Chemokine (cd00169) for the general alignment of chemokines, or Chemokine_CXC (cd00273), Chemokine_C (cd00271), and Chemokine_CX3C (cd00274) for the additional chemokine subgroups, and Chemokine_CC_DCCL for the DCCL subgroup of this CD.
smart SCY 57aa 3e-19
in ref transcript
Intercrine alpha family (small cytokine C-X-C) (chemokine CXC). Family of cytokines involved in cell-specific chemotaxis, mediation of cell growth, and the inflammatory response.
CCL23
refseq_CCL23.F2 refseq_CCL23.R2 270 321
NCBIGene 36.3 6368
Alternative 3-prime, size difference: 51
Exclusion in the protein (no frameshift)
Reference transcript: NM_005064
cd Chemokine_CC 57aa 9e-15
in ref transcript
Chemokine_CC: 1 of 4 subgroup designations based on the arrangement of the two N-terminal cysteine residues; includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity; some members (e.g. 2HCC) contain an additional disulfide bond which is thought to compensate for the highly conserved Trp missing in these; chemotatic for monocytes, macrophages, eosinophils, basophils, and T cells, but not neutrophils; exist as monomers and dimers, but are believed to be functional as monomers; found only in vertebrates and a few viruses; a subgroup of CC, identified by an N-terminal DCCL motif (Exodus-1, Exodus-2, and Exodus-3), has been shown to inhibit specific types of human cancer cell growth in a mouse model. See CDs: Chemokine (cd00169) for the general alignment of chemokines, or Chemokine_CXC (cd00273), Chemokine_C (cd00271), and Chemokine_CX3C (cd00274) for the additional chemokine subgroups, and Chemokine_CC_DCCL for the DCCL subgroup of this CD.
smart SCY 59aa 3e-18
in ref transcript
Intercrine alpha family (small cytokine C-X-C) (chemokine CXC). Family of cytokines involved in cell-specific chemotaxis, mediation of cell growth, and the inflammatory response.
CCNL2
refseq_CCNL2.F1 refseq_CCNL2.R1 126 342
NCBIGene 36.3 81669
Multiple exon skipping, size difference: 216
Inclusion in the protein causing a new stop codon, Inclusion in the protein causing a frameshift
Reference transcript: NM_030937
cd CYCLIN 90aa 5e-09
in ref transcript
Cyclin box fold. Protein binding domain functioning in cell-cycle and transcription control. Present in cyclins, TFIIB and Retinoblastoma (RB).The cyclins consist of 8 classes of cell cycle regulators that regulate cyclin dependent kinases (CDKs). TFIIB is a transcription factor that binds the TATA box. Cyclins, TFIIB and RB contain 2 copies of the domain.
cd CYCLIN 74aa 2e-07
in ref transcript
smart CYCLIN 83aa 3e-12
in ref transcript
domain present in cyclins, TFIIB and Retinoblastoma. A helical domain present in cyclins and TFIIB (twice) and Retinoblastoma (once). A protein recognition domain functioning in cell-cycle and transcription control.
smart CYCLIN 65aa 8e-07
in ref transcript
TIGR ccl1 224aa 2e-04
in ref transcript
University).
COG CCL1 205aa 2e-27
in ref transcript
Cdk activating kinase (CAK)/RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH/TFIIK, cyclin H subunit [Cell division and chromosome partitioning / Transcription / DNA replication, recombination, and repair].
CCR3
refseq_CCR3.F1 refseq_CCR3.R1 204 283
NCBIGene 36.3 1232
Single exon skipping, size difference: 79
Exclusion in 5'UTR
Reference transcript: NM_001837
pfam 7tm_1 245aa 4e-42
in ref transcript
7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
CCR9
refseq_CCR9.F2 refseq_CCR9.R2 246 295
NCBIGene 36.3 10803
Single exon skipping, size difference: 49
Exclusion of the protein initiation site
Reference transcript: NM_031200
pfam 7tm_1 247aa 3e-35
in ref transcript
7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
CCRK
refseq_CCRK.F1 refseq_CCRK.R1 176 239
NCBIGene 36.3 23552
Single exon skipping, size difference: 63
Exclusion in the protein (no frameshift)
Reference transcript: NM_001039803
Changed!
cd S_TKc 286aa 2e-64
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
Changed!
smart S_TKc 275aa 6e-63
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
Changed!
PTZ PTZ00024 292aa 4e-49
in ref transcript
cyclin-dependent protein kinase; Provisional.
Changed!
cd S_TKc 265aa 2e-52
in modified transcript
Changed!
smart S_TKc 254aa 3e-50
in modified transcript
Changed!
PTZ PTZ00024 271aa 6e-37
in modified transcript
CCRK
refseq_CCRK.F4 refseq_CCRK.R4 209 248
NCBIGene 36.3 23552
Alternative 3-prime, size difference: 39
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_001039803
Changed!
cd S_TKc 286aa 2e-64
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
Changed!
smart S_TKc 275aa 6e-63
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
Changed!
PTZ PTZ00024 292aa 4e-49
in ref transcript
cyclin-dependent protein kinase; Provisional.
Changed!
cd S_TKc 299aa 7e-62
in modified transcript
Changed!
smart S_TKc 288aa 3e-60
in modified transcript
Changed!
PTZ PTZ00024 305aa 1e-47
in modified transcript
CCT3
refseq_CCT3.F1 refseq_CCT3.R1 129 243
NCBIGene 36.3 7203
Multiple exon skipping, size difference: 114
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_005998
Changed!
cd TCP1_gamma 520aa 0.0
in ref transcript
TCP-1 (CTT or eukaryotic type II) chaperonin family, gamma subunit. Chaperonins are involved in productive folding of proteins. They share a common general morphology, a double toroid of 2 stacked rings. In contrast to bacterial group I chaperonins (GroEL), each ring of the eukaryotic cytosolic chaperonin (CTT) consists of eight different, but homologous subunits. Their common function is to sequester nonnative proteins inside their central cavity and promote folding by using energy derived from ATP hydrolysis. The best studied in vivo substrates of CTT are actin and tubulin.
Changed!
TIGR chap_CCT_gamma 524aa 0.0
in ref transcript
Members of this family, all eukaryotic, are part of the group II chaperonin complex called CCT (chaperonin containing TCP-1) or TRiC. The archaeal equivalent group II chaperonin is often called the thermosome. Both are somewhat related to the group I chaperonin of bacterial, GroEL/GroES. This family consists exclusively of the CCT gamma chain (part of a paralogous family) from animals, plants, fungi, and other eukaryotes.
Changed!
COG GroL 521aa 1e-117
in ref transcript
Chaperonin GroEL (HSP60 family) [Posttranslational modification, protein turnover, chaperones].
Changed!
cd TCP1_gamma 482aa 0.0
in modified transcript
Changed!
TIGR chap_CCT_gamma 486aa 0.0
in modified transcript
Changed!
COG GroL 464aa 5e-99
in modified transcript
CCT6A
refseq_CCT6A.F1 refseq_CCT6A.R1 171 306
NCBIGene 36.3 908
Single exon skipping, size difference: 135
Exclusion in the protein (no frameshift)
Reference transcript: NM_001762
Changed!
cd TCP1_zeta 520aa 0.0
in ref transcript
TCP-1 (CTT or eukaryotic type II) chaperonin family, zeta subunit. Chaperonins are involved in productive folding of proteins. They share a common general morphology, a double toroid of 2 stacked rings. In contrast to bacterial group I chaperonins (GroEL), each ring of the eukaryotic cytosolic chaperonin (CTT) consists of eight different, but homologous subunits. Their common function is to sequester nonnative proteins inside their central cavity and promote folding by using energy derived from ATP hydrolysis. The best studied in vivo substrates of CTT are actin and tubulin.
Changed!
TIGR chap_CCT_zeta 528aa 0.0
in ref transcript
Members of this family, all eukaryotic, are part of the group II chaperonin complex called CCT (chaperonin containing TCP-1) or TRiC. The archaeal equivalent group II chaperonin is often called the thermosome. Both are somewhat related to the group I chaperonin of bacterial, GroEL/GroES. This family consists exclusively of the CCT zeta chain (part of a paralogous family) from animals, plants, fungi, and other eukaryotes.
Changed!
COG GroL 503aa 5e-93
in ref transcript
Chaperonin GroEL (HSP60 family) [Posttranslational modification, protein turnover, chaperones].
Changed!
cd TCP1_zeta 475aa 0.0
in modified transcript
Changed!
TIGR chap_CCT_zeta 483aa 0.0
in modified transcript
Changed!
COG GroL 458aa 8e-73
in modified transcript
CD151
refseq_CD151.F1 refseq_CD151.R1 101 163
NCBIGene 36.3 977
Single exon skipping, size difference: 62
Exclusion in 5'UTR
Reference transcript: NM_004357
cd CD151_like_LEL 109aa 3e-40
in ref transcript
Tetraspanin, extracellular domain or large extracellular loop (LEL), CD151_Like family. Tetraspanins are trans-membrane proteins with 4 trans-membrane segments. Both the N- and C-termini lie on the intracellular side of the membrane. This alignment model spans the extracellular domain between the 3rd and 4th trans-membrane segment. Tetraspanins are involved in diverse processes and their various functions may relate to their ability to act as molecular facilitators. Tetraspanins associate laterally with one another and cluster dynamically with numerous parnter domains in membrane microdomains, forming a network of multimolecular complexes, the "tetraspanin web". CD151strongly associates with integrins, especially alpha3beta1, alpha6beta1, alpha7beta1, and alpha6beta4; it may play roles in cell-cell adhesion, cell migration, platelet aggregation, and angiogenesis. For example, CD151 is is involved in regulation of migration of neutrophils, endothelial cells, and various tumor cell lines; it associates specifically with laminin-binding integrins and strengthens alpha6beta1 integrin-mediated adhesion to laminin-1; CD151 also specifically attenuates adhesion-dependent activation of Ras and correspdonding downstream effects, and is involved in epithelial cell-cell adhesion as a modulator of PKC- and Cdc42-dependent actin cytoskeletal reorganization.
pfam Tetraspannin 234aa 2e-31
in ref transcript
Tetraspanin family.
CD163
refseq_CD163.F2 refseq_CD163.R2 309 392
NCBIGene 36.3 9332
Alternative 3-prime, size difference: 83
Exclusion in the protein causing a frameshift
Reference transcript: NM_004244
smart SR 101aa 4e-39
in ref transcript
Scavenger receptor Cys-rich. The sea ucrhin egg peptide speract contains 4 repeats of SR domains that contain 6 conserved cysteines. May bind bacterial antigens in the protein MARCO.
smart SR 101aa 1e-35
in ref transcript
smart SR 102aa 6e-35
in ref transcript
smart SR 101aa 1e-34
in ref transcript
smart SR 101aa 5e-31
in ref transcript
smart SR 100aa 5e-30
in ref transcript
smart SR 101aa 1e-28
in ref transcript
smart SR 101aa 4e-27
in ref transcript
smart SR 102aa 1e-17
in ref transcript
CD200
refseq_CD200.F2 refseq_CD200.R2 280 355
NCBIGene 36.3 4345
Single exon skipping, size difference: 75
Exclusion in the protein (no frameshift)
Reference transcript: NM_001004196
smart IGv 77aa 2e-07
in ref transcript
Immunoglobulin V-Type.
pfam ig 64aa 0.003
in ref transcript
Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involved in protein-protein and protein-ligand interactions. The Pfam alignments do not include the first and last strand of the immunoglobulin-like domain.
CD200R1
refseq_CD200R1.F2 refseq_CD200R1.R2 318 387
NCBIGene 36.3 131450
Single exon skipping, size difference: 69
Exclusion in the protein (no frameshift)
Reference transcript: NM_138806
pfam C2-set_2 70aa 0.002
in ref transcript
CD80-like C2-set immunoglobulin domain. These domains belong to the immunoglobulin superfamily.
CD34
refseq_CD34.F1 refseq_CD34.R1 151 346
NCBIGene 36.3 947
Alternative 3-prime, size difference: 195
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001025109
Changed!
pfam CD34_antigen 196aa 3e-55
in ref transcript
CD34/Podocalyxin family. This family consists of several mammalian CD34 antigen proteins. The CD34 antigen is a human leukocyte membrane protein expressed specifically by lymphohematopoietic progenitor cells. CD34 is a phosphoprotein. Activation of protein kinase C (PKC) has been found to enhance CD34 phosphorylation. This family contains several eukaryotic podocalyxin proteins. Podocalyxin is a major membrane protein of the glomerular epithelium and is thought to be involved in maintenance of the architecture of the foot processes and filtration slits characteristic of this unique epithelium by virtue of its high negative charge. Podocalyxin functions as an anti-adhesin that maintains an open filtration pathway between neighbouring foot processes in the glomerular epithelium by charge repulsion.
Changed!
pfam CD34_antigen 139aa 3e-38
in modified transcript
CD37
refseq_CD37.F1 refseq_CD37.R1 172 206
NCBIGene 36.3 951
Alternative 5-prime, size difference: 34
Exclusion in the protein causing a frameshift
Reference transcript: NM_001774
Changed!
cd CD37_CD82_like_LEL 136aa 9e-38
in ref transcript
Tetraspanin, extracellular domain or large extracellular loop (LEL), CD37_CD82_Like family. Tetraspanins are trans-membrane proteins with 4 trans-membrane segments. Both the N- and C-termini lie on the intracellular side of the membrane. This alignment model spans the extracellular domain between the 3rd and 4th trans-membrane segment. Tetraspanins are involved in diverse processes and their various functions may relate to their ability to act as molecular facilitators. Tetraspanins associate laterally with one another and cluster dynamically with numerous parnter domains in membrane microdomains, forming a network of multimolecular complexes, the "tetraspanin web". CD37 is a leukocyte-specific protein, and its restricted expression pattern suggests a role in the immune system. A regulatory role in T-cell proliferation has been suggested. CD82 is a metastasis suppressor implicated in biological processes ranging from fusion, adhesion, and migration to apoptosis and alterations of cell morphology.
Changed!
pfam Tetraspannin 262aa 5e-25
in ref transcript
Tetraspanin family.
CD46
refseq_CD46.F1 refseq_CD46.R1 197 239
NCBIGene 36.3 4179
Single exon skipping, size difference: 42
Exclusion in the protein (no frameshift)
Reference transcript: NM_172359
cd CCP 60aa 4e-08
in ref transcript
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
cd CCP 63aa 1e-06
in ref transcript
cd CCP 50aa 4e-05
in ref transcript
pfam Sushi 62aa 5e-09
in ref transcript
Sushi domain (SCR repeat).
pfam Sushi 59aa 2e-08
in ref transcript
smart CCP 50aa 6e-06
in ref transcript
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.
smart CCP 54aa 5e-04
in ref transcript
CD46
refseq_CD46.F3 refseq_CD46.R3 120 213
NCBIGene 36.3 4179
Single exon skipping, size difference: 93
Inclusion in the protein causing a new stop codon
Reference transcript: NM_172359
cd CCP 60aa 4e-08
in ref transcript
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
cd CCP 63aa 1e-06
in ref transcript
cd CCP 50aa 4e-05
in ref transcript
pfam Sushi 62aa 5e-09
in ref transcript
Sushi domain (SCR repeat).
pfam Sushi 59aa 2e-08
in ref transcript
smart CCP 50aa 6e-06
in ref transcript
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.
smart CCP 54aa 5e-04
in ref transcript
CD46
refseq_CD46.F4 refseq_CD46.R4 228 273
NCBIGene 36.3 4179
Single exon skipping, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_172359
cd CCP 60aa 4e-08
in ref transcript
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
cd CCP 63aa 1e-06
in ref transcript
Changed!
cd CCP 50aa 4e-05
in ref transcript
pfam Sushi 62aa 5e-09
in ref transcript
Sushi domain (SCR repeat).
pfam Sushi 59aa 2e-08
in ref transcript
Changed!
smart CCP 50aa 6e-06
in ref transcript
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.
smart CCP 54aa 5e-04
in ref transcript
Changed!
cd CCP 56aa 6e-08
in modified transcript
Changed!
smart CCP 56aa 5e-09
in modified transcript
CD68
refseq_CD68.F1 refseq_CD68.R1 234 315
NCBIGene 36.3 968
Alternative 3-prime, size difference: 81
Exclusion in the protein (no frameshift)
Reference transcript: NM_001251
pfam Lamp 201aa 2e-34
in ref transcript
Lysosome-associated membrane glycoprotein (Lamp).
CD74
refseq_CD74.F1 refseq_CD74.R1 118 310
NCBIGene 36.3 972
Single exon skipping, size difference: 192
Exclusion in the protein (no frameshift)
Reference transcript: NM_001025159
Changed!
cd TY 60aa 5e-14
in ref transcript
Thyroglobulin type I repeats.; The N-terminal region of human thyroglobulin contains 11 type-1 repeats TY repeats are proposed to be inhibitors of cysteine proteases.
pfam MHC2-interact 113aa 2e-35
in ref transcript
CLIP, MHC2 interacting. Members of this family are found in class II invariant chain-associated peptide (CLIP), and are required for association with class II major histocompatibility complex (MHC) in the MHC class II processing pathway.
pfam MHCassoc_trimer 72aa 1e-31
in ref transcript
Class II MHC-associated invariant chain trimerisation domain. The class II associated invariant chain peptide is required for folding and localisation of MHC class II heterodimers. This domain is involved in trimerisation of the ectoderm and interferes with DM/class II binding. The trimeric protein forms a cylindrical shape which is thought to be important for interactions between the invariant chain and class II molecules.
Changed!
pfam Thyroglobulin_1 59aa 1e-16
in ref transcript
Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 and 5 with 6.
CD79A
refseq_CD79A.F1 refseq_CD79A.R1 202 316
NCBIGene 36.3 973
Alternative 5-prime, size difference: 114
Exclusion in the protein (no frameshift)
Reference transcript: NM_001783
Changed!
cd IGcam 91aa 2e-04
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
Changed!
smart IG_like 88aa 2e-11
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
Changed!
pfam ITAM 21aa 0.010
in modified transcript
Immunoreceptor tyrosine-based activation motif.
CD79B
refseq_CD79B.F1 refseq_CD79B.R1 101 413
NCBIGene 36.3 974
Single exon skipping, size difference: 312
Exclusion in the protein (no frameshift)
Reference transcript: NM_001039933
Changed!
cd IGv 87aa 0.006
in ref transcript
Immunoglobulin domain variable region (v) subfamily; members of the IGv subfamily are components of immunoglobulins and T-cell receptors. In immunoglobulins, each chain is composed of one variable domain (IGv) and one or more constant domains (IGc); these names reflect the fact that the variability in sequences is higher in the variable domain than in the constant domain. Within the variable domain, there are regions of even more variability called the hypervariable or complementarity-determining regions (CDRs) which are responsible for antigen binding. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
Changed!
pfam V-set 101aa 5e-11
in ref transcript
Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
CD8A
refseq_CD8A.F2 refseq_CD8A.R2 155 266
NCBIGene 36.3 925
Single exon skipping, size difference: 111
Exclusion in the protein (no frameshift)
Reference transcript: NM_001768
cd IGv 94aa 3e-05
in ref transcript
Immunoglobulin domain variable region (v) subfamily; members of the IGv subfamily are components of immunoglobulins and T-cell receptors. In immunoglobulins, each chain is composed of one variable domain (IGv) and one or more constant domains (IGc); these names reflect the fact that the variability in sequences is higher in the variable domain than in the constant domain. Within the variable domain, there are regions of even more variability called the hypervariable or complementarity-determining regions (CDRs) which are responsible for antigen binding. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
pfam V-set 107aa 4e-11
in ref transcript
Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
CD8B
refseq_CD8B.F2 refseq_CD8B.R2 192 250
NCBIGene 36.3 926
Alternative 3-prime, size difference: 58
Exclusion in the protein causing a frameshift
Reference transcript: NM_172099
cd IGv 105aa 1e-09
in ref transcript
Immunoglobulin domain variable region (v) subfamily; members of the IGv subfamily are components of immunoglobulins and T-cell receptors. In immunoglobulins, each chain is composed of one variable domain (IGv) and one or more constant domains (IGc); these names reflect the fact that the variability in sequences is higher in the variable domain than in the constant domain. Within the variable domain, there are regions of even more variability called the hypervariable or complementarity-determining regions (CDRs) which are responsible for antigen binding. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
pfam V-set 116aa 3e-19
in ref transcript
Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
CD8B
refseq_CD8B.F2 refseq_CD8B.R5 180 234
NCBIGene 36.3 926
Single exon skipping, size difference: 54
Inclusion in the protein causing a new stop codon
Reference transcript: NM_172213
cd IGv 105aa 4e-09
in ref transcript
Immunoglobulin domain variable region (v) subfamily; members of the IGv subfamily are components of immunoglobulins and T-cell receptors. In immunoglobulins, each chain is composed of one variable domain (IGv) and one or more constant domains (IGc); these names reflect the fact that the variability in sequences is higher in the variable domain than in the constant domain. Within the variable domain, there are regions of even more variability called the hypervariable or complementarity-determining regions (CDRs) which are responsible for antigen binding. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
pfam V-set 116aa 6e-19
in ref transcript
Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
CD8B
refseq_CD8B.F3 refseq_CD8B.R3 123 213
NCBIGene 36.3 926
Single exon skipping, size difference: 90
Exclusion in the protein (no frameshift)
Reference transcript: NM_172099
cd IGv 105aa 1e-09
in ref transcript
Immunoglobulin domain variable region (v) subfamily; members of the IGv subfamily are components of immunoglobulins and T-cell receptors. In immunoglobulins, each chain is composed of one variable domain (IGv) and one or more constant domains (IGc); these names reflect the fact that the variability in sequences is higher in the variable domain than in the constant domain. Within the variable domain, there are regions of even more variability called the hypervariable or complementarity-determining regions (CDRs) which are responsible for antigen binding. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
pfam V-set 116aa 3e-19
in ref transcript
Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
CD96
refseq_CD96.F1 refseq_CD96.R1 191 239
NCBIGene 36.3 10225
Single exon skipping, size difference: 48
Exclusion in the protein (no frameshift)
Reference transcript: NM_198196
COG COG5099 153aa 0.008
in ref transcript
RNA-binding protein of the Puf family, translational repressor [Translation, ribosomal structure and biogenesis].
CD97
refseq_CD97.F1 refseq_CD97.R1 204 351
NCBIGene 36.3 976
Single exon skipping, size difference: 147
Exclusion in the protein (no frameshift)
Reference transcript: NM_078481
Changed!
cd EGF_CA 37aa 3e-05
in ref transcript
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
cd EGF_CA 33aa 4e-05
in ref transcript
Changed!
cd EGF_CA 34aa 6e-05
in ref transcript
cd EGF_CA 35aa 0.001
in ref transcript
pfam 7tm_2 250aa 1e-78
in ref transcript
7 transmembrane receptor (Secretin family). This family is known as Family B, the secretin-receptor family or family 2 of the G-protein-coupled receptors (GCPRs).They have been described in many animal species, but not in plants, fungi or prokaryotes. Three distinct sub-families are recognised. Subfamily B1 contains classical hormone receptors, such as receptors for secretin and glucagon, that are all involved in cAMP-mediated signalling pathways. Subfamily B2 contains receptors with long extracellular N-termini, such as the leukocyte cell-surface antigen CD97; calcium-independent receptors for latrotoxin, and brain-specific angiogenesis inhibitors amongst others. Subfamily B3 includes Methuselah and other Drosophila proteins. Other than the typical seven-transmembrane region, characteristic structural features include an amino-terminal extracellular domain involved in ligand binding, and an intracellular loop (IC3) required for specific G-protein coupling.
Changed!
pfam EGF_CA 48aa 4e-10
in ref transcript
Calcium binding EGF domain.
smart GPS 51aa 7e-10
in ref transcript
G-protein-coupled receptor proteolytic site domain. Present in latrophilin/CL-1, sea urchin REJ and polycystin.
Changed!
pfam EGF_CA 35aa 2e-09
in ref transcript
smart EGF_CA 33aa 3e-06
in ref transcript
Calcium-binding EGF-like domain.
pfam EGF_CA 51aa 9e-06
in ref transcript
COG PutP 125aa 0.004
in ref transcript
Na+/proline symporter [Amino acid transport and metabolism / General function prediction only].
Changed!
cd EGF_CA 37aa 1e-05
in modified transcript
Changed!
pfam EGF_CA 35aa 6e-10
in modified transcript
Changed!
smart EGF_CA 44aa 1e-06
in modified transcript
CD99L2
refseq_CD99L2.F1 refseq_CD99L2.R1 209 356
NCBIGene 36.3 83692
Multiple exon skipping, size difference: 147
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_031462
CDC14B
refseq_CDC14B.F2 refseq_CDC14B.R2 231 348
NCBIGene 36.3 8555
Single exon skipping, size difference: 117
Exclusion in the protein (no frameshift)
Reference transcript: NM_033331
cd DSPc 108aa 1e-09
in ref transcript
Dual specificity phosphatases (DSP); Ser/Thr and Tyr protein phosphatases. Structurally similar to tyrosine-specific phosphatases but with a shallower active site cleft and a distinctive active site signature motif, HCxxGxxR. Characterized as VHR- or Cdc25-like.
smart DSPc 116aa 7e-12
in ref transcript
Dual specificity phosphatase, catalytic domain.
PTZ PTZ00242 139aa 2e-20
in ref transcript
protein tyrosine phosphatase; Provisional.
CDC25A
refseq_CDC25A.F1 refseq_CDC25A.R1 145 265
NCBIGene 36.3 993
Single exon skipping, size difference: 120
Exclusion in the protein (no frameshift)
Reference transcript: NM_001789
cd Cdc25 119aa 5e-50
in ref transcript
Cdc25 phosphatases are members of the Rhodanese Homology Domain superfamily. They activate the cell division kinases throughout the cell cycle progression. Cdc25 phosphatases dephosphorylate phosphotyrosine and phosphothreonine residues, in order to activate their Cdk/cyclin substrates. Cdc25A phosphatase functions to regulate S phase entry and Cdc25B is required for G2/M phase transition of the cell cycle. The Cdc25 domain binds oxyanions at the catalytic site and has the signature motif (H/YCxxxxxR).
Changed!
pfam M-inducer_phosp 244aa 4e-65
in ref transcript
M-phase inducer phosphatase. This family represents a region within eukaryotic M-phase inducer phosphatases (EC:3.1.3.48), which also contain the pfam00581 domain. These proteins are involved in the control of mitosis.
pfam Rhodanese 108aa 4e-24
in ref transcript
Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases.
COG MIH1 233aa 1e-31
in ref transcript
Mitotic inducer, protein phosphatase [Cell division and chromosome partitioning].
Changed!
pfam M-inducer_phosp 204aa 2e-54
in modified transcript
CDC25B
refseq_CDC25B.F2 refseq_CDC25B.R2 129 171
NCBIGene 36.3 994
Alternative 3-prime, size difference: 42
Exclusion in the protein (no frameshift)
Reference transcript: NM_021873
cd Cdc25 120aa 9e-49
in ref transcript
Cdc25 phosphatases are members of the Rhodanese Homology Domain superfamily. They activate the cell division kinases throughout the cell cycle progression. Cdc25 phosphatases dephosphorylate phosphotyrosine and phosphothreonine residues, in order to activate their Cdk/cyclin substrates. Cdc25A phosphatase functions to regulate S phase entry and Cdc25B is required for G2/M phase transition of the cell cycle. The Cdc25 domain binds oxyanions at the catalytic site and has the signature motif (H/YCxxxxxR).
pfam M-inducer_phosp 272aa 1e-76
in ref transcript
M-phase inducer phosphatase. This family represents a region within eukaryotic M-phase inducer phosphatases (EC:3.1.3.48), which also contain the pfam00581 domain. These proteins are involved in the control of mitosis.
pfam Rhodanese 109aa 9e-25
in ref transcript
Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases.
COG MIH1 158aa 7e-34
in ref transcript
Mitotic inducer, protein phosphatase [Cell division and chromosome partitioning].
CDC25B
refseq_CDC25B.F3 refseq_CDC25B.R3 101 224
NCBIGene 36.3 994
Single exon skipping, size difference: 123
Exclusion in the protein (no frameshift)
Reference transcript: NM_021873
cd Cdc25 120aa 9e-49
in ref transcript
Cdc25 phosphatases are members of the Rhodanese Homology Domain superfamily. They activate the cell division kinases throughout the cell cycle progression. Cdc25 phosphatases dephosphorylate phosphotyrosine and phosphothreonine residues, in order to activate their Cdk/cyclin substrates. Cdc25A phosphatase functions to regulate S phase entry and Cdc25B is required for G2/M phase transition of the cell cycle. The Cdc25 domain binds oxyanions at the catalytic site and has the signature motif (H/YCxxxxxR).
Changed!
pfam M-inducer_phosp 272aa 1e-76
in ref transcript
M-phase inducer phosphatase. This family represents a region within eukaryotic M-phase inducer phosphatases (EC:3.1.3.48), which also contain the pfam00581 domain. These proteins are involved in the control of mitosis.
pfam Rhodanese 109aa 9e-25
in ref transcript
Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases.
COG MIH1 158aa 7e-34
in ref transcript
Mitotic inducer, protein phosphatase [Cell division and chromosome partitioning].
Changed!
pfam M-inducer_phosp 231aa 9e-65
in modified transcript
CDC25C
refseq_CDC25C.F1 refseq_CDC25C.R1 133 228
NCBIGene 36.3 995
Single exon skipping, size difference: 95
Exclusion in the protein causing a frameshift
Reference transcript: NM_001790
Changed!
cd Cdc25 120aa 4e-45
in ref transcript
Cdc25 phosphatases are members of the Rhodanese Homology Domain superfamily. They activate the cell division kinases throughout the cell cycle progression. Cdc25 phosphatases dephosphorylate phosphotyrosine and phosphothreonine residues, in order to activate their Cdk/cyclin substrates. Cdc25A phosphatase functions to regulate S phase entry and Cdc25B is required for G2/M phase transition of the cell cycle. The Cdc25 domain binds oxyanions at the catalytic site and has the signature motif (H/YCxxxxxR).
Changed!
pfam M-inducer_phosp 180aa 1e-43
in ref transcript
M-phase inducer phosphatase. This family represents a region within eukaryotic M-phase inducer phosphatases (EC:3.1.3.48), which also contain the pfam00581 domain. These proteins are involved in the control of mitosis.
Changed!
pfam Rhodanese 107aa 6e-22
in ref transcript
Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases.
Changed!
COG MIH1 241aa 1e-35
in ref transcript
Mitotic inducer, protein phosphatase [Cell division and chromosome partitioning].
CDC25C
refseq_CDC25C.F2 refseq_CDC25C.R2 141 360
NCBIGene 36.3 995
Multiple exon skipping, size difference: 124
Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
Reference transcript: NM_001790
Changed!
cd Cdc25 120aa 4e-45
in ref transcript
Cdc25 phosphatases are members of the Rhodanese Homology Domain superfamily. They activate the cell division kinases throughout the cell cycle progression. Cdc25 phosphatases dephosphorylate phosphotyrosine and phosphothreonine residues, in order to activate their Cdk/cyclin substrates. Cdc25A phosphatase functions to regulate S phase entry and Cdc25B is required for G2/M phase transition of the cell cycle. The Cdc25 domain binds oxyanions at the catalytic site and has the signature motif (H/YCxxxxxR).
Changed!
pfam M-inducer_phosp 180aa 1e-43
in ref transcript
M-phase inducer phosphatase. This family represents a region within eukaryotic M-phase inducer phosphatases (EC:3.1.3.48), which also contain the pfam00581 domain. These proteins are involved in the control of mitosis.
Changed!
pfam Rhodanese 107aa 6e-22
in ref transcript
Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases.
Changed!
COG MIH1 241aa 1e-35
in ref transcript
Mitotic inducer, protein phosphatase [Cell division and chromosome partitioning].
CDC25C
refseq_CDC25C.F3 refseq_CDC25C.R3 281 500
NCBIGene 36.3 995
Multiple exon skipping, size difference: 124
Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
Reference transcript: NM_001790
Changed!
cd Cdc25 120aa 4e-45
in ref transcript
Cdc25 phosphatases are members of the Rhodanese Homology Domain superfamily. They activate the cell division kinases throughout the cell cycle progression. Cdc25 phosphatases dephosphorylate phosphotyrosine and phosphothreonine residues, in order to activate their Cdk/cyclin substrates. Cdc25A phosphatase functions to regulate S phase entry and Cdc25B is required for G2/M phase transition of the cell cycle. The Cdc25 domain binds oxyanions at the catalytic site and has the signature motif (H/YCxxxxxR).
Changed!
pfam M-inducer_phosp 180aa 1e-43
in ref transcript
M-phase inducer phosphatase. This family represents a region within eukaryotic M-phase inducer phosphatases (EC:3.1.3.48), which also contain the pfam00581 domain. These proteins are involved in the control of mitosis.
Changed!
pfam Rhodanese 107aa 6e-22
in ref transcript
Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases.
Changed!
COG MIH1 241aa 1e-35
in ref transcript
Mitotic inducer, protein phosphatase [Cell division and chromosome partitioning].
CDC2L1
refseq_CDC2L1.F2 refseq_CDC2L1.R5 134 167
NCBIGene 36.3 984
Alternative 5-prime and 3-prime, size difference: 33
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_033486
cd S_TKc 287aa 3e-68
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
smart S_TKc 276aa 1e-68
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
PTZ PTZ00024 295aa 4e-55
in ref transcript
cyclin-dependent protein kinase; Provisional.
CDC2L1
refseq_CDC2L1.F3 refseq_CDC2L1.R3 106 133
NCBIGene 36.3 984
Alternative 3-prime, size difference: 27
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_033486
cd S_TKc 287aa 3e-68
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
smart S_TKc 276aa 1e-68
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
PTZ PTZ00024 295aa 4e-55
in ref transcript
cyclin-dependent protein kinase; Provisional.
CDC2L2
refseq_CDC2L2.F3 refseq_CDC2L2.R3 128 244
NCBIGene 36.2 985
Alternative 5-prime, size difference: 116
Exclusion in the protein causing a frameshift
Reference transcript: NM_033531
Changed!
cd S_TKc 287aa 1e-68
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
Changed!
smart S_TKc 276aa 5e-69
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
Changed!
PTZ PTZ00024 295aa 4e-55
in ref transcript
cyclin-dependent protein kinase; Provisional.
CDC2L5
refseq_CDC2L5.F1 refseq_CDC2L5.R1 141 321
NCBIGene 36.3 8621
Alternative 3-prime, size difference: 180
Exclusion in the protein (no frameshift)
Reference transcript: NM_003718
cd S_TKc 295aa 4e-72
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
pfam Pkinase 294aa 2e-74
in ref transcript
Protein kinase domain.
PTZ PTZ00024 301aa 2e-47
in ref transcript
cyclin-dependent protein kinase; Provisional.
CDC42
refseq_CDC42.F1 refseq_CDC42.R1 235 361
NCBIGene 36.3 998
Single exon skipping, size difference: 126
Exclusion in 5'UTR
Reference transcript: NM_001039802
cd Cdc42 175aa 1e-104
in ref transcript
Cdc42 subfamily. Cdc42 is an essential GTPase that belongs to the Rho family of Ras-like GTPases. These proteins act as molecular switches by responding to exogenous and/or endogenous signals and relaying those signals to activate downstream components of a biological pathway. Cdc42 transduces signals to the actin cytoskeleton to initiate and maintain polarized growth and to mitogen-activated protein morphogenesis. In the budding yeast Saccharomyces cerevisiae, Cdc42 plays an important role in multiple actin-dependent morphogenetic events such as bud emergence, mating-projection formation, and pseudohyphal growth. In mammalian cells, Cdc42 regulates a variety of actin-dependent events and induces the JNK/SAPK protein kinase cascade, which leads to the activation of transcription factors within the nucleus. Cdc42 mediates these processes through interactions with a myriad of downstream effectors, whose number and regulation we are just starting to understand. In addition, Cdc42 has been implicated in a number of human diseases through interactions with its regulators and downstream effectors. Most Rho proteins contain a lipid modification site at the C-terminus, with a typical sequence motif CaaX, where a = an aliphatic amino acid and X = any amino acid. Lipid binding is essential for membrane attachment, a key feature of most Rho proteins. Due to the presence of truncated sequences in this CD, the lipid modification site is not available for annotation.
smart RHO 174aa 5e-94
in ref transcript
Rho (Ras homology) subfamily of Ras-like small GTPases. Members of this subfamily of Ras-like small GTPases include Cdc42 and Rac, as well as Rho isoforms.
COG COG1100 183aa 2e-30
in ref transcript
GTPase SAR1 and related small G proteins [General function prediction only].
CDC42SE1
refseq_CDC42SE1.F2 refseq_CDC42SE1.R2 100 236
NCBIGene 36.3 56882
Single exon skipping, size difference: 136
Exclusion in 5'UTR
Reference transcript: NM_001038707
CDC42SE2
refseq_CDC42SE2.F1 refseq_CDC42SE2.R1 232 401
NCBIGene 36.3 56990
Single exon skipping, size difference: 169
Exclusion in 5'UTR
Reference transcript: NM_020240
cd CRIB 41aa 1e-06
in ref transcript
PAK (p21 activated kinase) Binding Domain (PBD), binds Cdc42p- and/or Rho-like small GTPases; also known as the Cdc42/Rac interactive binding (CRIB) motif; has been shown to inhibit transcriptional activation and cell transformation mediated by the Ras-Rac pathway. CRIB-containing effector proteins are functionally diverse and include serine/threonine kinases, tyrosine kinases, actin-binding proteins, and adapter molecules.
pfam PBD 15aa 0.002
in ref transcript
P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB).
NUF2
refseq_CDCA1.F1 refseq_CDCA1.R1 243 389
NCBIGene 36.3 83540
Alternative 5-prime, size difference: 146
Exclusion in 5'UTR
Reference transcript: NM_145697
pfam Nuf2 148aa 3e-62
in ref transcript
Nuf2 family. Members of this family are components of the mitotic spindle. It has been shown that Nuf2 from yeast is part of a complex called the Ndc80p complex. This complex is thought to bind to the microtubules of the spindle. An arabidopsis protein has been included in this family that has previously not been identified as a member of this family. The match is not strong, but in common with other members of this family contains coiled-coil to the C terminus of this region.
pfam SMC_N 307aa 4e-10
in ref transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
COG Smc 316aa 4e-07
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
CDCA7
refseq_CDCA7.F1 refseq_CDCA7.R1 152 389
NCBIGene 36.3 83879
Single exon skipping, size difference: 237
Exclusion in the protein (no frameshift)
Reference transcript: NM_031942
pfam zf-4CXXC_R1 101aa 6e-51
in ref transcript
Zinc-finger domain of monoamine-oxidase A repressor R1. R1 is a transcription factor repressor that inhibits monoamine oxidase A gene expression. This domain is a four-CXXC zinc finger putative DNA-binding domain found at the C-terminal end of R1. The domain carries 12 cysteines of which four pairs are of the CXXC type.
CDH24
refseq_CDH24.F1 refseq_CDH24.R1 112 226
NCBIGene 36.3 64403
Single exon skipping, size difference: 114
Exclusion in the protein (no frameshift)
Reference transcript: NM_022478
Changed!
cd CA 192aa 2e-37
in ref transcript
Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here.
cd CA 186aa 2e-36
in ref transcript
cd CA 214aa 4e-34
in ref transcript
Changed!
cd CA 237aa 2e-25
in ref transcript
pfam Cadherin_C 150aa 9e-23
in ref transcript
Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic binding units. A key determinant to the strength of the binding that it is mediated by cadherins is the juxtamembrane region of the cadherin. This region induces clustering and also binds to the protein p120ctn.
smart CA 79aa 1e-16
in ref transcript
Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
pfam Cadherin 103aa 1e-15
in ref transcript
Cadherin domain.
Changed!
pfam Cadherin 77aa 3e-15
in ref transcript
pfam Cadherin 81aa 1e-11
in ref transcript
Changed!
smart CA 116aa 1e-10
in ref transcript
pfam Cadherin 95aa 9e-05
in ref transcript
Changed!
cd CA 208aa 3e-39
in modified transcript
Changed!
cd CA 199aa 6e-30
in modified transcript
Changed!
pfam Cadherin 90aa 2e-16
in modified transcript
CDK10
refseq_CDK10.F3 refseq_CDK10.R3 195 329
NCBIGene 36.3 8558
Alternative 3-prime, size difference: 134
Exclusion of the stop codon
Reference transcript: NM_052988
cd S_TKc 286aa 1e-62
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
pfam Pkinase 285aa 5e-62
in ref transcript
Protein kinase domain.
Changed!
PTZ PTZ00024 296aa 2e-52
in ref transcript
cyclin-dependent protein kinase; Provisional.
Changed!
PTZ PTZ00024 293aa 6e-50
in modified transcript
CDK10
refseq_CDK10.F5 refseq_CDK10.R6 109 257
NCBIGene 36.2 8558
Exon skipping and alternative 3-prime or 5-prime, size difference: 148
Exclusion in 5'UTR, Exclusion in 5'UTR
Reference transcript: NM_003674
cd S_TKc 252aa 3e-49
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
pfam Pkinase 251aa 2e-51
in ref transcript
Protein kinase domain.
PTZ PTZ00024 259aa 2e-44
in ref transcript
cyclin-dependent protein kinase; Provisional.
CDK2
refseq_CDK2.F2 refseq_CDK2.R2 187 289
NCBIGene 36.3 1017
Single exon skipping, size difference: 102
Exclusion in the protein (no frameshift)
Reference transcript: NM_001798
Changed!
cd S_TKc 284aa 1e-81
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
Changed!
pfam Pkinase 283aa 2e-86
in ref transcript
Protein kinase domain.
Changed!
PTZ PTZ00024 285aa 3e-57
in ref transcript
cyclin-dependent protein kinase; Provisional.
Changed!
cd S_TKc 250aa 4e-60
in modified transcript
Changed!
pfam Pkinase 249aa 2e-62
in modified transcript
Changed!
PTZ PTZ00024 251aa 1e-35
in modified transcript
CDK5RAP1
refseq_CDK5RAP1.F2 refseq_CDK5RAP1.R2 113 261
NCBIGene 36.3 51654
Alternative 3-prime, size difference: 148
Exclusion of the protein initiation site
Reference transcript: NM_016408
TIGR TIGR00089 472aa 1e-121
in ref transcript
This subfamily is aparrently a part of a larger superfamily of enzymes utilizing both a 4Fe4S cluster and S-adenosyl methionine (SAM) to initiate radical reactions. MiaB acts on a particular isoprenylated Adenine base of certain tRNAs causing thiolation at an aromatic carbon, and probably also transferring a methyl grouyp from SAM to the thiol. The particular substrate of the three other clades is unknown but may be very closely related.
COG MiaB 476aa 1e-114
in ref transcript
2-methylthioadenine synthetase [Translation, ribosomal structure and biogenesis].
CDK5RAP2
refseq_CDK5RAP2.F1 refseq_CDK5RAP2.R1 113 350
NCBIGene 36.3 55755
Single exon skipping, size difference: 237
Exclusion in the protein (no frameshift)
Reference transcript: NM_018249
pfam Microtub_assoc 75aa 3e-15
in ref transcript
Microtubule associated. This presumed domain has been identified in two microtubule associated proteins in Schizosaccharomyces pombe, Mto1 and Pcp1. Mto1 has been identified in association with spindle pole body and non-spindle pole body microtubules. The pericentrin homolog Pcp1 is also associated with the fungal centrosome or spindle pole body (SPB).
TIGR SMC_prok_B 306aa 5e-10
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
Changed!
TIGR SMC_prok_B 259aa 1e-05
in ref transcript
pfam SMC_N 625aa 3e-04
in ref transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
COG Smc 363aa 4e-12
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
Changed!
COG Smc 270aa 0.001
in ref transcript
Changed!
TIGR SMC_prok_B 182aa 0.001
in modified transcript
CDKN2B
refseq_CDKN2B.F2 refseq_CDKN2B.R2 170 293
NCBIGene 36.3 1030
Alternative 5-prime, size difference: 123
Inclusion in the protein causing a new stop codon
Reference transcript: NM_004936
Changed!
cd ANK 115aa 1e-15
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
Changed!
PTZ PTZ00322 81aa 9e-06
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IGcam 75aa 5e-05
in ref transcript
cd IGcam 71aa 0.003
in ref transcript
pfam V-set 106aa 3e-11
in ref transcript
Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
smart IGc2 62aa 2e-09
in ref transcript
Immunoglobulin C-2 Type.
smart IG_like 61aa 1e-08
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
smart IG_like 78aa 1e-05
in ref transcript
CENPM
refseq_CENPM.F2 refseq_CENPM.R2 145 237
NCBIGene 36.3 79019
Single exon skipping, size difference: 92
Exclusion in the protein causing a frameshift
Reference transcript: NM_024053
CENTG2
refseq_CENTG2.F2 refseq_CENTG2.R2 166 325
NCBIGene 36.3 116987
Single exon skipping, size difference: 159
Exclusion in the protein (no frameshift)
Reference transcript: NM_001037131
cd Centaurin_gamma 158aa 4e-90
in ref transcript
Centaurin gamma. The centaurins (alpha, beta, gamma, and delta) are large, multi-domain proteins that all contain an ArfGAP domain and ankyrin repeats, and in some cases, numerous additional domains. Centaurin gamma contains an additional GTPase domain near its N-terminus. The specific function of this GTPase domain has not been well characterized, but centaurin gamma 2 (CENTG2) may play a role in the development of autism. Centaurin gamma 1 is also called PIKE (phosphatidyl inositol (PI) 3-kinase enhancer) and centaurin gamma 2 is also known as AGAP (ArfGAP protein with a GTPase-like domain, ankyrin repeats and a Pleckstrin homology domain) or GGAP. Three isoforms of PIKE have been identified. PIKE-S (short) and PIKE-L (long) are brain-specific isoforms, with PIKE-S restricted to the nucleus and PIKE-L found in multiple cellular compartments. A third isoform, PIKE-A was identified in human glioblastoma brain cancers and has been found in various tissues. GGAP has been shown to have high GTPase activity due to a direct intramolecular interaction between the N-terminal GTPase domain and the C-terminal ArfGAP domain. In human tissue, AGAP mRNA was detected in skeletal muscle, kidney, placenta, brain, heart, colon, and lung. Reduced expression levels were also observed in the spleen, liver, and small intestine.
cd PH_centaurin 64aa 9e-16
in ref transcript
Centaurin Pleckstrin homology (PH) domain. Centaurin beta and gamma consist of a PH domain, an ArfGAP domain and three ankyrin repeats. Centaurain gamma also has an N-terminal Ras homology domain. Centaurin alpha has a different domain architecture and its PH domain is in a different subfamily. Centaurin can bind to phosphatidlyinositol (3,4,5)P3. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
cd ANK 90aa 3e-11
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
cd PH_centaurin 42aa 2e-05
in ref transcript
pfam ArfGap 117aa 7e-37
in ref transcript
Putative GTPase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs.
pfam Miro 108aa 3e-22
in ref transcript
Miro-like protein. Mitochondrial Rho proteins (Miro-1 and Miro-2), are atypical Rho GTPases. They have a unique domain organisation, with tandem GTP-binding domains and two EF hand domains (pfam00036), that may bind calcium. They are also larger than classical small GTPases. It has been proposed that they are involved in mitochondrial homeostasis and apoptosis.
pfam Ank 32aa 7e-06
in ref transcript
Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
smart PH 65aa 9e-05
in ref transcript
Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
smart PH 40aa 0.001
in ref transcript
COG COG5347 118aa 3e-24
in ref transcript
GTPase-activating protein that regulates ARFs (ADP-ribosylation factors), involved in ARF-mediated vesicular transport [Intracellular trafficking and secretion].
COG COG1100 170aa 6e-08
in ref transcript
GTPase SAR1 and related small G proteins [General function prediction only].
Changed!
pfam DAGK_cat 165aa 2e-15
in ref transcript
Diacylglycerol kinase catalytic domain. Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues. YegS is the Escherichia coli protein in this family whose crystal structure reveals an active site in the inter-domain cleft formed by four conserved sequence motifs, revealing a novel metal-binding site. The residues of this site are conserved across the family.
Changed!
COG LCB5 384aa 3e-16
in ref transcript
Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase [Lipid metabolism / General function prediction only].
Changed!
pfam DAGK_cat 139aa 1e-11
in modified transcript
Changed!
COG LCB5 358aa 7e-13
in modified transcript
CES2
refseq_CES2.F1 refseq_CES2.R1 117 165
NCBIGene 36.3 8824
Alternative 5-prime, size difference: 48
Exclusion in the protein (no frameshift)
Reference transcript: NM_003869
Changed!
cd Esterase_lipase 490aa 1e-134
in ref transcript
Esterases and lipases (includes fungal lipases, cholinesterases, etc.) These enzymes act on carboxylic esters (EC: 3.1.1.-). The catalytic apparatus involves three residues (catalytic triad): a serine, a glutamate or aspartate and a histidine.These catalytic residues are responsible for the nucleophilic attack on the carbonyl carbon atom of the ester bond. In contrast with other alpha/beta hydrolase fold family members, p-nitrobenzyl esterase and acetylcholine esterase have a Glu instead of Asp at the active site carboxylate.
Changed!
pfam COesterase 510aa 1e-171
in ref transcript
Carboxylesterase.
Changed!
COG PnbA 510aa 7e-90
in ref transcript
Carboxylesterase type B [Lipid metabolism].
Changed!
cd Esterase_lipase 474aa 1e-128
in modified transcript
Changed!
pfam COesterase 494aa 1e-164
in modified transcript
Changed!
COG PnbA 494aa 3e-84
in modified transcript
CGGBP1
refseq_CGGBP1.F1 refseq_CGGBP1.R1 199 301
NCBIGene 36.3 8545
Single exon skipping, size difference: 102
Exclusion in 5'UTR
Reference transcript: NM_001008390
TPPP3
refseq_CGI-38.F1 refseq_CGI-38.R1 149 240
NCBIGene 36.3 51673
Single exon skipping, size difference: 91
Exclusion in 5'UTR
Reference transcript: NM_016140
pfam p25-alpha 165aa 2e-60
in ref transcript
p25-alpha. This family encodes a 25 kDa protein that is phosphorylated by a Ser/Thr-Pro kinase. It has been described as a brain specific protein, but it is found in Tetrahymena thermophila.
CHAT
refseq_CHAT.F1 refseq_CHAT.R1 128 300
NCBIGene 36.3 1103
Single exon skipping, size difference: 172
Exclusion in 5'UTR
Reference transcript: NM_020985
pfam Carn_acyltransf 590aa 0.0
in ref transcript
Choline/Carnitine o-acyltransferase.
CHCHD7
refseq_CHCHD7.F1 refseq_CHCHD7.R1 105 125
NCBIGene 36.3 79145
Alternative 5-prime, size difference: 20
Exclusion in the protein causing a frameshift
Reference transcript: NM_001011668
Changed!
pfam CHCH 36aa 0.008
in ref transcript
CHCH domain. we have identified a conserved motif in the LOC118487 protein that we have called the CHCH motif. Alignment of this protein with related members showed the presence of three subgroups of proteins, which are called the S (Small), N (N-terminal extended) and C (C-terminal extended) subgroups. All three sub-groups of proteins have in common that they contain a predicted conserved [coiled coil 1]-[helix 1]-[coiled coil 2]-[helix 2] domain (CHCH domain). Within each helix of the CHCH domain, there are two cysteines present in a C-X9-C motif. The N-group contains an additional double helix domain, and each helix contains the C-X9-C motif. This family contains a number of characterised proteins: Cox19 protein - a nuclear gene of Saccharomyces cerevisiae, codes for an 11-kDa protein (Cox19p) required for expression of cytochrome oxidase. Because cox19 mutants are able to synthesise the mitochondrial and nuclear gene products of cytochrome oxidase, Cox19p probably functions post-translationally during assembly of the enzyme. Cox19p is present in the cytoplasm and mitochondria, where it exists as a soluble intermembrane protein. This dual location is similar to what was previously reported for Cox17p, a low molecular weight copper protein thought to be required for maturation of the CuA centre of subunit 2 of cytochrome oxidase. Cox19p have four conserved potential metal ligands, these are three cysteines and one histidine. Mrp10 - belongs to the class of yeast mitochondrial ribosomal proteins that are essential for translation. Eukaryotic NADH-ubiquinone oxidoreductase 19 kDa (NDUFA8) subunit. The CHCH domain was previously called DUF657.
CHCHD7
refseq_CHCHD7.F3 refseq_CHCHD7.R3 133 177
NCBIGene 36.3 79145
Alternative 3-prime, size difference: 44
Inclusion in the protein causing a frameshift
Reference transcript: NM_001011668
Changed!
pfam CHCH 36aa 0.008
in ref transcript
CHCH domain. we have identified a conserved motif in the LOC118487 protein that we have called the CHCH motif. Alignment of this protein with related members showed the presence of three subgroups of proteins, which are called the S (Small), N (N-terminal extended) and C (C-terminal extended) subgroups. All three sub-groups of proteins have in common that they contain a predicted conserved [coiled coil 1]-[helix 1]-[coiled coil 2]-[helix 2] domain (CHCH domain). Within each helix of the CHCH domain, there are two cysteines present in a C-X9-C motif. The N-group contains an additional double helix domain, and each helix contains the C-X9-C motif. This family contains a number of characterised proteins: Cox19 protein - a nuclear gene of Saccharomyces cerevisiae, codes for an 11-kDa protein (Cox19p) required for expression of cytochrome oxidase. Because cox19 mutants are able to synthesise the mitochondrial and nuclear gene products of cytochrome oxidase, Cox19p probably functions post-translationally during assembly of the enzyme. Cox19p is present in the cytoplasm and mitochondria, where it exists as a soluble intermembrane protein. This dual location is similar to what was previously reported for Cox17p, a low molecular weight copper protein thought to be required for maturation of the CuA centre of subunit 2 of cytochrome oxidase. Cox19p have four conserved potential metal ligands, these are three cysteines and one histidine. Mrp10 - belongs to the class of yeast mitochondrial ribosomal proteins that are essential for translation. Eukaryotic NADH-ubiquinone oxidoreductase 19 kDa (NDUFA8) subunit. The CHCH domain was previously called DUF657.
CHD3
refseq_CHD3.F1 refseq_CHD3.R1 160 262
NCBIGene 36.3 1107
Single exon skipping, size difference: 102
Exclusion in the protein (no frameshift)
Reference transcript: NM_001005271
cd HELICc 109aa 8e-21
in ref transcript
Helicase superfamily c-terminal domain; associated with DEXDc-, DEAD-, and DEAH-box proteins, yeast initiation factor 4A, Ski2p, and Hepatitis C virus NS3 helicases; this domain is found in a wide variety of helicases and helicase related proteins; may not be an autonomously folding unit, but an integral part of the helicase; 4 helicase superfamilies at present according to the organization of their signature motifs; all helicases share the ability to unwind nucleic acid duplexes with a distinct directional polarity; they utilize the free energy from nucleoside triphosphate hydrolysis to fuel their translocation along DNA, unwinding the duplex in the process.
cd DEXDc 159aa 1e-16
in ref transcript
DEAD-like helicases superfamily. A diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.
cd CHROMO 56aa 4e-06
in ref transcript
Chromatin organization modifier (chromo) domain is a conserved region of around 50 amino acids found in a variety of chromosomal proteins, which appear to play a role in the functional organization of the eukaryotic nucleus. Experimental evidence implicates the chromo domain in the binding activity of these proteins to methylated histone tails and maybe RNA. May occur as single instance, in a tandem arrangement or followd by a related "chromo shadow" domain.
cd CHROMO 45aa 4e-04
in ref transcript
cd BAH_plant_2 24aa 0.008
in ref transcript
BAH, or Bromo Adjacent Homology domain, plant-specific sub-family with unknown function. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
pfam CHDCT2 173aa 8e-92
in ref transcript
CHDCT2 (NUC038) domain. The CHDCT2 C-terminal domain is found in PHD/RING finger and chromo domain-associated CHD-like helicases.
pfam SNF2_N 287aa 1e-62
in ref transcript
SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin unwinding (e.g., ISWI) as well as a variety of other proteins with little functional information (e.g., lodestar, ETL1).
pfam DUF1086 133aa 5e-61
in ref transcript
Domain of Unknown Function (DUF1086). This family consists of several eukaryotic domains of unknown function which are present in chromodomain helicase DNA binding proteins. This domain is often found in conjunction with pfam00176, pfam00271, pfam06465, pfam00385 and pfam00628.
pfam CHDNT 55aa 7e-23
in ref transcript
CHDNT (NUC034) domain. The CHDNT domain is found in PHD/RING finger and chromo domain-associated helicases.
pfam Helicase_C 81aa 3e-19
in ref transcript
Helicase conserved C-terminal domain. The Prosite family is restricted to DEAD/H helicases, whereas this domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase.
pfam DUF1087 46aa 4e-16
in ref transcript
Domain of Unknown Function (DUF1087). Members of this family are found in various chromatin remodelling factors and transposases. Their exact function is, as yet, unknown.
pfam PHD 45aa 8e-12
in ref transcript
PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
cd GH18_chitolectin_chitotriosidase 359aa 1e-152
in ref transcript
This conserved domain family includes a large number of catalytically inactive chitinase-like lectins (chitolectins) including YKL-39, YKL-40 (HCGP39), YM1, oviductin, and AMCase (acidic mammalian chitinase), as well as catalytically active chitotriosidases. The conserved domain is an eight-stranded alpha/beta barrel fold belonging to the family 18 glycosyl hydrolases. The fold has a pronounced active-site cleft at the C-terminal end of the beta-barrel. The chitolectins lack a key active site glutamate (the proton donor required for hydrolytic activity) but retain highly conserved residues involved in oligosaccharide binding. Chitotriosidase is a chitinolytic enzyme expressed in maturing macrophages, which suggests that it plays a part in antimicrobial defense. Chitotriosidase hydrolyzes chitotriose, as well as colloidal chitin to yield chitobiose and is therefore considered an exochitinase. Chitotriosidase occurs in two major forms, the large form being converted to the small form by either RNA or post-translational processing. Although the small form, containing the chitinase domain alone, is sufficient for the chitinolytic activity, the additional C-terminal chitin-binding domain of the large form plays a role in processing colloidal chitin. The chitotriosidase gene is nonessential in humans, as about 35% of the population are heterozygous and 6% homozygous for an inactivated form of the gene. HCGP39 is a 39-kDa human cartilage glycoprotein thought to play a role in connective tissue remodeling and defense against pathogens.
smart Glyco_18 338aa 2e-94
in ref transcript
Glycosyl hydrolase family 18.
COG ChiA 340aa 6e-39
in ref transcript
Chitinase [Carbohydrate transport and metabolism].
CHIA
refseq_CHIA.F2 refseq_CHIA.R2 100 162
NCBIGene 36.2 27159
Multiple exon skipping, size difference: 62
Exclusion in 5'UTR, Exclusion in 5'UTR
Reference transcript: NM_201653
cd GH18_chitolectin_chitotriosidase 279aa 1e-134
in ref transcript
This conserved domain family includes a large number of catalytically inactive chitinase-like lectins (chitolectins) including YKL-39, YKL-40 (HCGP39), YM1, oviductin, and AMCase (acidic mammalian chitinase), as well as catalytically active chitotriosidases. The conserved domain is an eight-stranded alpha/beta barrel fold belonging to the family 18 glycosyl hydrolases. The fold has a pronounced active-site cleft at the C-terminal end of the beta-barrel. The chitolectins lack a key active site glutamate (the proton donor required for hydrolytic activity) but retain highly conserved residues involved in oligosaccharide binding. Chitotriosidase is a chitinolytic enzyme expressed in maturing macrophages, which suggests that it plays a part in antimicrobial defense. Chitotriosidase hydrolyzes chitotriose, as well as colloidal chitin to yield chitobiose and is therefore considered an exochitinase. Chitotriosidase occurs in two major forms, the large form being converted to the small form by either RNA or post-translational processing. Although the small form, containing the chitinase domain alone, is sufficient for the chitinolytic activity, the additional C-terminal chitin-binding domain of the large form plays a role in processing colloidal chitin. The chitotriosidase gene is nonessential in humans, as about 35% of the population are heterozygous and 6% homozygous for an inactivated form of the gene. HCGP39 is a 39-kDa human cartilage glycoprotein thought to play a role in connective tissue remodeling and defense against pathogens.
smart Glyco_18 257aa 3e-80
in ref transcript
Glycosyl hydrolase family 18.
smart ChtBD2 48aa 3e-09
in ref transcript
Chitin-binding domain type 2.
COG ChiA 257aa 1e-37
in ref transcript
Chitinase [Carbohydrate transport and metabolism].
CHKA
refseq_CHKA.F1 refseq_CHKA.R1 121 175
NCBIGene 36.3 1119
Single exon skipping, size difference: 54
Exclusion in the protein (no frameshift)
Reference transcript: NM_001277
Changed!
cd ChoK_euk 317aa 1e-91
in ref transcript
Choline Kinase (ChoK) in eukaryotes. The ChoK subfamily is part of a larger superfamily that includes the catalytic domains of other kinases, such as the typical serine/threonine/tyrosine protein kinases (PKs), RIO kinases, actin-fragmin kinase (AFK), and phosphoinositide 3-kinase (PI3K). It is composed of bacterial and eukaryotic choline kinases, as well as eukaryotic ethanolamine kinase. ChoK catalyzes the transfer of the gamma-phosphoryl group from ATP (or CTP) to its substrate, choline, producing phosphorylcholine (PCho), a precursor to the biosynthesis of two major membrane phospholipids, phosphatidylcholine (PC) and sphingomyelin (SM). Although choline is the preferred substrate, ChoK also shows substantial activity towards ethanolamine and its N-methylated derivatives. ChoK plays an important role in cell signaling pathways and the regulation of cell growth. Along with PCho, it is involved in malignant transformation through Ras oncogenes in various human cancers such as breast, lung, colon, prostate, neuroblastoma, and hepatic lymphoma. In mammalian cells, there are three ChoK isoforms (A-1, A-2, and B) which are active in homo or heterodimeric forms.
Changed!
pfam Choline_kinase 236aa 4e-62
in ref transcript
Choline/ethanolamine kinase. Choline kinase catalyses the committed step in the synthesis of phosphatidylcholine by the CDP-choline pathway. This alignment covers the protein kinase portion of the protein. The divergence of this family makes it very difficult to create a model that specifically predicts choline/ethanolamine kinases only.
Changed!
PTZ PTZ00296 361aa 6e-36
in ref transcript
choline kinase; Provisional.
Changed!
cd ChoK_euk 299aa 1e-93
in modified transcript
Changed!
pfam Choline_kinase 218aa 3e-64
in modified transcript
Changed!
PTZ PTZ00296 343aa 1e-36
in modified transcript
CHRFAM7A
refseq_CHRFAM7A.F2 refseq_CHRFAM7A.R2 146 210
NCBIGene 36.3 89832
Single exon skipping, size difference: 64
Exclusion of the protein initiation site
Reference transcript: NM_139320
Changed!
TIGR LIC 373aa 1e-107
in ref transcript
selective while glycine receptors are anion selective).
Changed!
TIGR LIC 307aa 3e-78
in modified transcript
CHRM2
refseq_CHRM2.F1 refseq_CHRM2.R1 130 186
NCBIGene 36.3 1129
Alternative 5-prime, size difference: 56
Exclusion in 5'UTR
Reference transcript: NM_001006632
pfam 7tm_1 174aa 7e-36
in ref transcript
7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
pfam 7tm_1 81aa 9e-06
in ref transcript
CHRM2
refseq_CHRM2.F3 refseq_CHRM2.R3 108 186
NCBIGene 36.3 1129
Single exon skipping, size difference: 78
Exclusion in 5'UTR
Reference transcript: NM_001006626
pfam 7tm_1 174aa 7e-36
in ref transcript
7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
pfam 7tm_1 81aa 9e-06
in ref transcript
CHRNA1
refseq_CHRNA1.F1 refseq_CHRNA1.R1 116 191
NCBIGene 36.3 1134
Single exon skipping, size difference: 75
Exclusion in the protein (no frameshift)
Reference transcript: NM_001039523
Changed!
pfam Neur_chan_LBD 233aa 3e-70
in ref transcript
Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure.
Changed!
pfam Neur_chan_memb 209aa 7e-45
in ref transcript
Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel.
Changed!
pfam Neur_chan_LBD 208aa 4e-74
in modified transcript
Changed!
TIGR LIC 427aa 1e-65
in modified transcript
selective while glycine receptors are anion selective).
NLRP3
refseq_CIAS1.F1 refseq_CIAS1.R1 125 296
NCBIGene 36.3 114548
Single exon skipping, size difference: 171
Exclusion in the protein (no frameshift)
Reference transcript: NM_001079821
Changed!
cd LRR_RI 317aa 6e-50
in ref transcript
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
Changed!
cd LRR_RI 207aa 1e-07
in ref transcript
pfam NACHT 170aa 1e-41
in ref transcript
NACHT domain. This NTPase domain is found in apoptosis proteins as well as those involved in MHC transcription activation. This family is closely related to pfam00931.
pfam PAAD_DAPIN 84aa 6e-21
in ref transcript
PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain.
smart LRR_RI 28aa 0.001
in ref transcript
Leucine rich repeat, ribonuclease inhibitor type.
Changed!
smart LRR_RI 28aa 0.002
in ref transcript
Changed!
COG RNA1 230aa 0.003
in ref transcript
Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Signal transduction mechanisms / RNA processing and modification].
Changed!
cd LRR_RI 320aa 5e-54
in modified transcript
NLRP3
refseq_CIAS1.F3 refseq_CIAS1.R3 215 386
NCBIGene 36.3 114548
Single exon skipping, size difference: 171
Exclusion in the protein (no frameshift)
Reference transcript: NM_001079821
Changed!
cd LRR_RI 317aa 6e-50
in ref transcript
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
Changed!
cd LRR_RI 207aa 1e-07
in ref transcript
pfam NACHT 170aa 1e-41
in ref transcript
NACHT domain. This NTPase domain is found in apoptosis proteins as well as those involved in MHC transcription activation. This family is closely related to pfam00931.
pfam PAAD_DAPIN 84aa 6e-21
in ref transcript
PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain.
smart LRR_RI 28aa 0.001
in ref transcript
Leucine rich repeat, ribonuclease inhibitor type.
smart LRR_RI 28aa 0.002
in ref transcript
Changed!
COG RNA1 230aa 0.003
in ref transcript
Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Signal transduction mechanisms / RNA processing and modification].
Changed!
cd LRR_RI 320aa 2e-48
in modified transcript
Changed!
COG RNA1 114aa 0.001
in modified transcript
CKAP5
refseq_CKAP5.F2 refseq_CKAP5.R2 183 363
NCBIGene 36.3 9793
Single exon skipping, size difference: 180
Exclusion in the protein (no frameshift)
Reference transcript: NM_001008938
CKLF
refseq_CKLF.F1 refseq_CKLF.R1 173 269
NCBIGene 36.3 51192
Single exon skipping, size difference: 96
Exclusion in the protein (no frameshift)
Reference transcript: NM_016951
Changed!
pfam MARVEL 118aa 3e-05
in ref transcript
Membrane-associating domain. MARVEL domain-containing proteins are often found in lipid-associating proteins - such as Occludin and MAL family proteins. It may be part of the machinery of membrane apposition events, such as transport vesicle biogenesis.
CKLF
refseq_CKLF.F3 refseq_CKLF.R3 206 365
NCBIGene 36.3 51192
Single exon skipping, size difference: 159
Exclusion in the protein (no frameshift)
Reference transcript: NM_016951
Changed!
pfam MARVEL 118aa 3e-05
in ref transcript
Membrane-associating domain. MARVEL domain-containing proteins are often found in lipid-associating proteins - such as Occludin and MAL family proteins. It may be part of the machinery of membrane apposition events, such as transport vesicle biogenesis.
CLCN3
refseq_CLCN3.F2 refseq_CLCN3.R2 286 362
NCBIGene 36.3 1182
Single exon skipping, size difference: 76
Exclusion in the protein causing a frameshift
Reference transcript: NM_173872
cd ClC_3_like 435aa 1e-176
in ref transcript
ClC-3-like chloride channel proteins. This CD includes ClC-3, ClC-4, ClC-5 and ClC-Y1. ClC-3 was initially cloned from rat kidney. Expression of ClC-3 produces outwardly-rectifying Cl currents that are inhibited by protein kinase C activation. It has been suggested that ClC-3 may be a ubiquitous swelling-activated Cl channel that has very similar characteristics to those of native volume-regulated Cl currents. The function of ClC-4 is unclear. Studies of human ClC-4 have revealed that it gives rise to Cl currents that rapidly activate at positive voltages, and are sensitive to extracellular pH, with currents decreasing when pH falls below 6.5. ClC-4 is broadly distributed, especially in brain and heart. ClC-5 is predominantly expressed in the kidney, but can be found in the brain and liver. Mutations in the ClC-5 gene cause certain hereditary diseases, including Dent's disease, an X-chromosome linked syndrome characterised by proteinuria, hypercalciuria, and kidney stones (nephrolithiasis), leading to progressive renal failure. These proteins belong to the ClC superfamily of chloride ion channels, which share the unique double-barreled architecture and voltage-dependent gating mechanism. The gating is conferred by the permeating anion itself, acting as the gating charge. This domain is found in the eukaryotic halogen ion (Cl- and I-) channel proteins, that perform a variety of functions including cell volume regulation, the membrane potential stabilization, transepithelial chloride transport and charge compensation necessary for the acidification of intracellular organelles.
Changed!
cd CBS_pair_EriC_assoc_euk_bac 139aa 4e-24
in ref transcript
This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the EriC CIC-type chloride channels in eukaryotes and bacteria. These ion channels are proteins with a seemingly simple task of allowing the passive flow of chloride ions across biological membranes. CIC-type chloride channels come from all kingdoms of life, have several gene families, and can be gated by voltage. The members of the CIC-type chloride channel are double-barreled: two proteins forming homodimers at a broad interface formed by four helices from each protein. The two pores are not found at this interface, but are completely contained within each subunit, as deduced from the mutational analyses, unlike many other channels, in which four or five identical or structurally related subunits jointly form one pore. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain in CLC chloride channel family members have been associated with classic Bartter syndrome, Osteopetrosis, Dent's disease, idiopathic generalized epilepsy, and myotonia.
pfam Voltage_CLC 403aa 1e-62
in ref transcript
Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. In terms of primary structure, they are unrelated to known cation channels or other types of anion channels. Three ClC subfamilies are found in animals. ClC-1 is involved in setting and restoring the resting membrane potential of skeletal muscle, while other channels play important parts in solute concentration mechanisms in the kidney. These proteins contain two pfam00571 domains.
Changed!
pfam CBS 149aa 3e-10
in ref transcript
CBS domain pair. CBS domains are small intracellular modules that pair together to form a stable globular domain. This family represents a pair of CBS domains, that has been termed a Bateman domain. CBS domains have been shown to bind ligands with an adenosyl group such as AMP, ATP and S-AdoMet. CBS domains are found attached to a wide range of other protein domains suggesting that CBS domains may play a regulatory role making proteins sensitive to adenosyl carrying ligands. The region containing the CBS domains in Cystathionine-beta synthase is involved in regulation by S-AdoMet. CBS domain pairs from AMPK bind AMP or ATP. The CBS domains from IMPDH and the chloride channel CLC2 bind ATP.
COG EriC 455aa 3e-38
in ref transcript
Chloride channel protein EriC [Inorganic ion transport and metabolism].
Changed!
COG TlyC 153aa 0.001
in ref transcript
Hemolysins and related proteins containing CBS domains [General function prediction only].
Changed!
cd CBS_pair_EriC_assoc_euk_bac 139aa 3e-26
in modified transcript
Changed!
pfam CBS 150aa 1e-11
in modified transcript
Changed!
COG COG0517 54aa 3e-05
in modified transcript
FOG: CBS domain [General function prediction only].
Changed!
COG COG4109 164aa 7e-05
in modified transcript
Predicted transcriptional regulator containing CBS domains [Transcription].
CLCN6
refseq_CLCN6.F1 refseq_CLCN6.R1 134 301
NCBIGene 36.3 1185
Single exon skipping, size difference: 167
Exclusion in the protein causing a frameshift
Reference transcript: NM_001286
Changed!
cd ClC_6_like 322aa 1e-105
in ref transcript
ClC-6-like chloride channel proteins. This CD includes ClC-6, ClC-7 and ClC-B, C, D in plants. Proteins in this family are ubiquitous in eukarotes and their functions are unclear. They are expressed in intracellular organelles membranes. This family belongs to the ClC superfamily of chloride ion channels, which share the unique double-barreled architecture and voltage-dependent gating mechanism. The gating is conferred by the permeating anion itself, acting as the gating charge. ClC chloride ion channel superfamily perform a variety of functions including cellular excitability regulation, cell volume regulation, membrane potential stabilization, acidification of intracellular organelles, signal transduction, and transepithelial transport in animals.
Changed!
cd ClC_6_like 112aa 5e-39
in ref transcript
Changed!
cd CBS_pair_EriC_assoc_euk_bac 54aa 2e-13
in ref transcript
This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the EriC CIC-type chloride channels in eukaryotes and bacteria. These ion channels are proteins with a seemingly simple task of allowing the passive flow of chloride ions across biological membranes. CIC-type chloride channels come from all kingdoms of life, have several gene families, and can be gated by voltage. The members of the CIC-type chloride channel are double-barreled: two proteins forming homodimers at a broad interface formed by four helices from each protein. The two pores are not found at this interface, but are completely contained within each subunit, as deduced from the mutational analyses, unlike many other channels, in which four or five identical or structurally related subunits jointly form one pore. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain in CLC chloride channel family members have been associated with classic Bartter syndrome, Osteopetrosis, Dent's disease, idiopathic generalized epilepsy, and myotonia.
Changed!
pfam Voltage_CLC 231aa 4e-28
in ref transcript
Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. In terms of primary structure, they are unrelated to known cation channels or other types of anion channels. Three ClC subfamilies are found in animals. ClC-1 is involved in setting and restoring the resting membrane potential of skeletal muscle, while other channels play important parts in solute concentration mechanisms in the kidney. These proteins contain two pfam00571 domains.
Changed!
pfam Voltage_CLC 129aa 2e-15
in ref transcript
Changed!
pfam CBS 58aa 3e-08
in ref transcript
CBS domain pair. CBS domains are small intracellular modules that pair together to form a stable globular domain. This family represents a pair of CBS domains, that has been termed a Bateman domain. CBS domains have been shown to bind ligands with an adenosyl group such as AMP, ATP and S-AdoMet. CBS domains are found attached to a wide range of other protein domains suggesting that CBS domains may play a regulatory role making proteins sensitive to adenosyl carrying ligands. The region containing the CBS domains in Cystathionine-beta synthase is involved in regulation by S-AdoMet. CBS domain pairs from AMPK bind AMP or ATP. The CBS domains from IMPDH and the chloride channel CLC2 bind ATP.
Changed!
smart CBS 48aa 0.009
in ref transcript
Domain in cystathionine beta-synthase and other proteins. Domain present in all 3 forms of cellular life. Present in two copies in inosine monophosphate dehydrogenase, of which one is disordered in the crystal structure [3]. A number of disease states are associated with CBS-containing proteins including homocystinuria, Becker's and Thomsen disease.
Changed!
COG EriC 287aa 2e-13
in ref transcript
Chloride channel protein EriC [Inorganic ion transport and metabolism].
Changed!
COG EriC 111aa 3e-09
in ref transcript
Changed!
PRK PRK05567 58aa 3e-05
in ref transcript
Changed!
cd ClC_6_like 270aa 7e-85
in modified transcript
Changed!
pfam Voltage_CLC 175aa 2e-20
in modified transcript
Changed!
COG EriC 239aa 2e-09
in modified transcript
CLDN15
refseq_CLDN15.F1 refseq_CLDN15.R1 252 392
NCBIGene 36.2 24146
Single exon skipping, size difference: 140
Inclusion in the protein causing a frameshift
Reference transcript: NM_014343
Changed!
pfam PMP22_Claudin 169aa 5e-16
in ref transcript
PMP-22/EMP/MP20/Claudin family.
CLEC10A
refseq_CLEC10A.F1 refseq_CLEC10A.R1 156 228
NCBIGene 36.3 10462
Alternative 3-prime, size difference: 81
Exclusion in the protein (no frameshift)
Reference transcript: NM_182906
cd CLECT_DC-SIGN_like 125aa 8e-41
in ref transcript
CLECT_DC-SIGN_like: C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR). This group also contains proteins similar to hepatic asialoglycoprotein receptor (ASGP-R) and langerin in human. These proteins are type II membrane proteins with a CTLD ectodomain. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. DC-SIGN is thought to mediate the initial contact between dendritic cells and resting T cells, and may also mediate the rolling of DCs on epithelium. DC-SIGN and DC-SIGNR bind to oligosaccharides present on human tissues, as well as, on pathogens including parasites, bacteria, and viruses. DC-SIGN and DC-SIGNR bind to HIV enhancing viral infection of T cells. DC-SIGN and DC-SIGNR are homotetrameric, and contain four CTLDs stabilized by a coiled coil of alpha helices. The hepatic ASGP-R is an endocytic recycling receptor which binds and internalizes desialylated glycoproteins having a terminal galactose or N-acetylgalactosamine residues on their N-linked carbohydrate chains, via the clathrin-coated pit mediated endocytic pathway, and delivers them to lysosomes for degradation. It has been proposed that glycoproteins bearing terminal Sia (sialic acid) alpha2, 6GalNAc and Sia alpha2, 6Gal are endogenous ligands for ASGP-R and that ASGP-R participates in regulating the relative concentration of serum glycoproteins bearing alpha 2,6-linked Sia. The human ASGP-R is a hetero-oligomer composed of two subunits, both of which are found within this group. Langerin is expressed in a subset of dendritic leukocytes, the Langerhans cells (LC). Langerin induces the formation of Birbeck Granules (BGs) and associates with these BGs following internalization. Langerin binds, in a calcium-dependent manner, to glyco-conjugates containing mannose and related sugars mediating their uptake and degradation. Langerin molecules oligomerize as trimers with three CTLDs held together by a coiled-coil of alpha helices.
pfam Lectin_C 108aa 1e-28
in ref transcript
Lectin C-type domain. This family includes both long and short form C-type.
Changed!
pfam Lectin_N 165aa 2e-26
in ref transcript
Hepatic lectin, N-terminal domain.
Changed!
pfam Lectin_N 138aa 2e-30
in modified transcript
CLEC10A
refseq_CLEC10A.F1 refseq_CLEC10A.R3 109 190
NCBIGene 36.3 10462
Alternative 3-prime, size difference: 81
Exclusion in the protein (no frameshift)
Reference transcript: NM_182906
cd CLECT_DC-SIGN_like 125aa 8e-41
in ref transcript
CLECT_DC-SIGN_like: C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR). This group also contains proteins similar to hepatic asialoglycoprotein receptor (ASGP-R) and langerin in human. These proteins are type II membrane proteins with a CTLD ectodomain. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. DC-SIGN is thought to mediate the initial contact between dendritic cells and resting T cells, and may also mediate the rolling of DCs on epithelium. DC-SIGN and DC-SIGNR bind to oligosaccharides present on human tissues, as well as, on pathogens including parasites, bacteria, and viruses. DC-SIGN and DC-SIGNR bind to HIV enhancing viral infection of T cells. DC-SIGN and DC-SIGNR are homotetrameric, and contain four CTLDs stabilized by a coiled coil of alpha helices. The hepatic ASGP-R is an endocytic recycling receptor which binds and internalizes desialylated glycoproteins having a terminal galactose or N-acetylgalactosamine residues on their N-linked carbohydrate chains, via the clathrin-coated pit mediated endocytic pathway, and delivers them to lysosomes for degradation. It has been proposed that glycoproteins bearing terminal Sia (sialic acid) alpha2, 6GalNAc and Sia alpha2, 6Gal are endogenous ligands for ASGP-R and that ASGP-R participates in regulating the relative concentration of serum glycoproteins bearing alpha 2,6-linked Sia. The human ASGP-R is a hetero-oligomer composed of two subunits, both of which are found within this group. Langerin is expressed in a subset of dendritic leukocytes, the Langerhans cells (LC). Langerin induces the formation of Birbeck Granules (BGs) and associates with these BGs following internalization. Langerin binds, in a calcium-dependent manner, to glyco-conjugates containing mannose and related sugars mediating their uptake and degradation. Langerin molecules oligomerize as trimers with three CTLDs held together by a coiled-coil of alpha helices.
pfam Lectin_C 108aa 1e-28
in ref transcript
Lectin C-type domain. This family includes both long and short form C-type.
Changed!
pfam Lectin_N 165aa 2e-26
in ref transcript
Hepatic lectin, N-terminal domain.
Changed!
pfam Lectin_N 138aa 2e-30
in modified transcript
CLEC2D
refseq_CLEC2D.F1 refseq_CLEC2D.R1 161 243
NCBIGene 36.3 29121
Single exon skipping, size difference: 82
Exclusion in 5'UTR
Reference transcript: NM_001004419
pfam Nucleoplasmin 52aa 6e-17
in ref transcript
Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays.
CLEC2D
refseq_CLEC2D.F3 refseq_CLEC2D.R3 107 182
NCBIGene 36.2 29121
Alternative 5-prime, size difference: 75
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001004419
Changed!
cd CLECT_NK_receptors_like 97aa 9e-25
in ref transcript
CLECT_NK_receptors_like: C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs), including proteins similar to oxidized low density lipoprotein (OxLDL) receptor (LOX-1), CD94, CD69, NKG2-A and -D, osteoclast inhibitory lectin (OCIL), dendritic cell-associated C-type lectin-1 (dectin-1), human myeloid inhibitory C-type lectin-like receptor (MICL), mast cell-associated functional antigen (MAFA), killer cell lectin-like receptors: subfamily F, member 1 (KLRF1) and subfamily B, member 1 (KLRB1), and lys49 receptors. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. NKRs are variously associated with activation or inhibition of natural killer (NK) cells. Activating NKRs stimulate cytolysis by NK cells of virally infected or transformed cells; inhibitory NKRs block cytolysis upon recognition of markers of healthy self cells. Most Lys49 receptors are inhibitory; some are stimulatory. OCIL inhibits NK cell function via binding to the receptor NKRP1D. Murine OCIL in addition to inhibiting NK cell function inhibits osteoclast differentiation. MAFA clusters with the type I Fc epsilon receptor (FcepsilonRI) and inhibits the mast cells secretory response to FcepsilonRI stimulus. CD72 is a negative regulator of B cell receptor signaling. NKG2D is an activating receptor for stress-induced antigens; human NKG2D ligands include the stress induced MHC-I homologs, MICA, MICB, and ULBP family of glycoproteins Several NKRs have a carbohydrate-binding capacity which is not mediated through calcium ions (e.g. OCIL binds a range of high molecular weight sulfated glycosaminoglycans including dextran sulfate, fucoidan, and gamma-carrageenan sugars). Dectin-1 binds fungal beta-glucans and in involved in the innate immune responses to fungal pathogens. MAFA binds saccharides having terminal alpha-D mannose residues in a calcium-dependent manner. LOX-1 is the major receptor for OxLDL in endothelial cells and thought to play a role in the pathology of atherosclerosis. Some NKRs exist as homodimers (e.g.Lys49, NKG2D, CD69, LOX-1) and some as heterodimers (e.g. CD94/NKG2A). Dectin-1 can function as a monomer in vitro.
Changed!
smart CLECT 78aa 2e-16
in ref transcript
C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.
Changed!
cd CLECT_NK_receptors_like 45aa 1e-11
in modified transcript
Changed!
smart CLECT 45aa 2e-09
in modified transcript
CLEC4A
refseq_CLEC4A.F1 refseq_CLEC4A.R1 130 229
NCBIGene 36.3 50856
Single exon skipping, size difference: 99
Exclusion in the protein (no frameshift)
Reference transcript: NM_016184
cd CLECT_DC-SIGN_like 127aa 8e-40
in ref transcript
CLECT_DC-SIGN_like: C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR). This group also contains proteins similar to hepatic asialoglycoprotein receptor (ASGP-R) and langerin in human. These proteins are type II membrane proteins with a CTLD ectodomain. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. DC-SIGN is thought to mediate the initial contact between dendritic cells and resting T cells, and may also mediate the rolling of DCs on epithelium. DC-SIGN and DC-SIGNR bind to oligosaccharides present on human tissues, as well as, on pathogens including parasites, bacteria, and viruses. DC-SIGN and DC-SIGNR bind to HIV enhancing viral infection of T cells. DC-SIGN and DC-SIGNR are homotetrameric, and contain four CTLDs stabilized by a coiled coil of alpha helices. The hepatic ASGP-R is an endocytic recycling receptor which binds and internalizes desialylated glycoproteins having a terminal galactose or N-acetylgalactosamine residues on their N-linked carbohydrate chains, via the clathrin-coated pit mediated endocytic pathway, and delivers them to lysosomes for degradation. It has been proposed that glycoproteins bearing terminal Sia (sialic acid) alpha2, 6GalNAc and Sia alpha2, 6Gal are endogenous ligands for ASGP-R and that ASGP-R participates in regulating the relative concentration of serum glycoproteins bearing alpha 2,6-linked Sia. The human ASGP-R is a hetero-oligomer composed of two subunits, both of which are found within this group. Langerin is expressed in a subset of dendritic leukocytes, the Langerhans cells (LC). Langerin induces the formation of Birbeck Granules (BGs) and associates with these BGs following internalization. Langerin binds, in a calcium-dependent manner, to glyco-conjugates containing mannose and related sugars mediating their uptake and degradation. Langerin molecules oligomerize as trimers with three CTLDs held together by a coiled-coil of alpha helices.
smart CLECT 126aa 7e-26
in ref transcript
C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.
CLEC4A
refseq_CLEC4A.F3 refseq_CLEC4A.R3 122 242
NCBIGene 36.3 50856
Single exon skipping, size difference: 117
Exclusion in the protein (no frameshift)
Reference transcript: NM_016184
cd CLECT_DC-SIGN_like 127aa 8e-40
in ref transcript
CLECT_DC-SIGN_like: C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR). This group also contains proteins similar to hepatic asialoglycoprotein receptor (ASGP-R) and langerin in human. These proteins are type II membrane proteins with a CTLD ectodomain. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. DC-SIGN is thought to mediate the initial contact between dendritic cells and resting T cells, and may also mediate the rolling of DCs on epithelium. DC-SIGN and DC-SIGNR bind to oligosaccharides present on human tissues, as well as, on pathogens including parasites, bacteria, and viruses. DC-SIGN and DC-SIGNR bind to HIV enhancing viral infection of T cells. DC-SIGN and DC-SIGNR are homotetrameric, and contain four CTLDs stabilized by a coiled coil of alpha helices. The hepatic ASGP-R is an endocytic recycling receptor which binds and internalizes desialylated glycoproteins having a terminal galactose or N-acetylgalactosamine residues on their N-linked carbohydrate chains, via the clathrin-coated pit mediated endocytic pathway, and delivers them to lysosomes for degradation. It has been proposed that glycoproteins bearing terminal Sia (sialic acid) alpha2, 6GalNAc and Sia alpha2, 6Gal are endogenous ligands for ASGP-R and that ASGP-R participates in regulating the relative concentration of serum glycoproteins bearing alpha 2,6-linked Sia. The human ASGP-R is a hetero-oligomer composed of two subunits, both of which are found within this group. Langerin is expressed in a subset of dendritic leukocytes, the Langerhans cells (LC). Langerin induces the formation of Birbeck Granules (BGs) and associates with these BGs following internalization. Langerin binds, in a calcium-dependent manner, to glyco-conjugates containing mannose and related sugars mediating their uptake and degradation. Langerin molecules oligomerize as trimers with three CTLDs held together by a coiled-coil of alpha helices.
smart CLECT 126aa 7e-26
in ref transcript
C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.
CLEC4C
refseq_CLEC4C.F1 refseq_CLEC4C.R1 254 347
NCBIGene 36.3 170482
Single exon skipping, size difference: 93
Exclusion in the protein (no frameshift)
Reference transcript: NM_130441
cd CLECT_DC-SIGN_like 126aa 3e-38
in ref transcript
CLECT_DC-SIGN_like: C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR). This group also contains proteins similar to hepatic asialoglycoprotein receptor (ASGP-R) and langerin in human. These proteins are type II membrane proteins with a CTLD ectodomain. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. DC-SIGN is thought to mediate the initial contact between dendritic cells and resting T cells, and may also mediate the rolling of DCs on epithelium. DC-SIGN and DC-SIGNR bind to oligosaccharides present on human tissues, as well as, on pathogens including parasites, bacteria, and viruses. DC-SIGN and DC-SIGNR bind to HIV enhancing viral infection of T cells. DC-SIGN and DC-SIGNR are homotetrameric, and contain four CTLDs stabilized by a coiled coil of alpha helices. The hepatic ASGP-R is an endocytic recycling receptor which binds and internalizes desialylated glycoproteins having a terminal galactose or N-acetylgalactosamine residues on their N-linked carbohydrate chains, via the clathrin-coated pit mediated endocytic pathway, and delivers them to lysosomes for degradation. It has been proposed that glycoproteins bearing terminal Sia (sialic acid) alpha2, 6GalNAc and Sia alpha2, 6Gal are endogenous ligands for ASGP-R and that ASGP-R participates in regulating the relative concentration of serum glycoproteins bearing alpha 2,6-linked Sia. The human ASGP-R is a hetero-oligomer composed of two subunits, both of which are found within this group. Langerin is expressed in a subset of dendritic leukocytes, the Langerhans cells (LC). Langerin induces the formation of Birbeck Granules (BGs) and associates with these BGs following internalization. Langerin binds, in a calcium-dependent manner, to glyco-conjugates containing mannose and related sugars mediating their uptake and degradation. Langerin molecules oligomerize as trimers with three CTLDs held together by a coiled-coil of alpha helices.
smart CLECT 125aa 2e-24
in ref transcript
C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.
CLEC4M
refseq_CLEC4M.F1 refseq_CLEC4M.R1 156 219
NCBIGene 36.3 10332
Exon skipping and alternative 3-prime or 5-prime, size difference: 63
Inclusion in the protein (no stop codon or frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_214675
cd CLECT_DC-SIGN_like 124aa 5e-42
in ref transcript
CLECT_DC-SIGN_like: C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR). This group also contains proteins similar to hepatic asialoglycoprotein receptor (ASGP-R) and langerin in human. These proteins are type II membrane proteins with a CTLD ectodomain. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. DC-SIGN is thought to mediate the initial contact between dendritic cells and resting T cells, and may also mediate the rolling of DCs on epithelium. DC-SIGN and DC-SIGNR bind to oligosaccharides present on human tissues, as well as, on pathogens including parasites, bacteria, and viruses. DC-SIGN and DC-SIGNR bind to HIV enhancing viral infection of T cells. DC-SIGN and DC-SIGNR are homotetrameric, and contain four CTLDs stabilized by a coiled coil of alpha helices. The hepatic ASGP-R is an endocytic recycling receptor which binds and internalizes desialylated glycoproteins having a terminal galactose or N-acetylgalactosamine residues on their N-linked carbohydrate chains, via the clathrin-coated pit mediated endocytic pathway, and delivers them to lysosomes for degradation. It has been proposed that glycoproteins bearing terminal Sia (sialic acid) alpha2, 6GalNAc and Sia alpha2, 6Gal are endogenous ligands for ASGP-R and that ASGP-R participates in regulating the relative concentration of serum glycoproteins bearing alpha 2,6-linked Sia. The human ASGP-R is a hetero-oligomer composed of two subunits, both of which are found within this group. Langerin is expressed in a subset of dendritic leukocytes, the Langerhans cells (LC). Langerin induces the formation of Birbeck Granules (BGs) and associates with these BGs following internalization. Langerin binds, in a calcium-dependent manner, to glyco-conjugates containing mannose and related sugars mediating their uptake and degradation. Langerin molecules oligomerize as trimers with three CTLDs held together by a coiled-coil of alpha helices.
smart CLECT 123aa 6e-30
in ref transcript
C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.
TIGR SMC_prok_B 182aa 1e-07
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
Changed!
PRK PRK03918 172aa 4e-06
in ref transcript
chromosome segregation protein; Provisional.
Changed!
PRK PRK05771 186aa 2e-05
in modified transcript
V-type ATP synthase subunit I; Validated.
CLEC4M
refseq_CLEC4M.F3 refseq_CLEC4M.R3 183 296
NCBIGene 36.3 10332
Single exon skipping, size difference: 113
Exclusion in the protein causing a frameshift
Reference transcript: NM_214675
Changed!
cd CLECT_DC-SIGN_like 124aa 5e-42
in ref transcript
CLECT_DC-SIGN_like: C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR). This group also contains proteins similar to hepatic asialoglycoprotein receptor (ASGP-R) and langerin in human. These proteins are type II membrane proteins with a CTLD ectodomain. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. DC-SIGN is thought to mediate the initial contact between dendritic cells and resting T cells, and may also mediate the rolling of DCs on epithelium. DC-SIGN and DC-SIGNR bind to oligosaccharides present on human tissues, as well as, on pathogens including parasites, bacteria, and viruses. DC-SIGN and DC-SIGNR bind to HIV enhancing viral infection of T cells. DC-SIGN and DC-SIGNR are homotetrameric, and contain four CTLDs stabilized by a coiled coil of alpha helices. The hepatic ASGP-R is an endocytic recycling receptor which binds and internalizes desialylated glycoproteins having a terminal galactose or N-acetylgalactosamine residues on their N-linked carbohydrate chains, via the clathrin-coated pit mediated endocytic pathway, and delivers them to lysosomes for degradation. It has been proposed that glycoproteins bearing terminal Sia (sialic acid) alpha2, 6GalNAc and Sia alpha2, 6Gal are endogenous ligands for ASGP-R and that ASGP-R participates in regulating the relative concentration of serum glycoproteins bearing alpha 2,6-linked Sia. The human ASGP-R is a hetero-oligomer composed of two subunits, both of which are found within this group. Langerin is expressed in a subset of dendritic leukocytes, the Langerhans cells (LC). Langerin induces the formation of Birbeck Granules (BGs) and associates with these BGs following internalization. Langerin binds, in a calcium-dependent manner, to glyco-conjugates containing mannose and related sugars mediating their uptake and degradation. Langerin molecules oligomerize as trimers with three CTLDs held together by a coiled-coil of alpha helices.
Changed!
smart CLECT 123aa 6e-30
in ref transcript
C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.
TIGR SMC_prok_B 182aa 1e-07
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
PRK PRK03918 172aa 4e-06
in ref transcript
chromosome segregation protein; Provisional.
Changed!
cd CLECT_DC-SIGN_like 45aa 5e-13
in modified transcript
Changed!
smart CLECT 45aa 7e-11
in modified transcript
CLEC7A
refseq_CLEC7A.F1 refseq_CLEC7A.R1 247 366
NCBIGene 36.3 64581
Single exon skipping, size difference: 119
Exclusion in the protein causing a frameshift
Reference transcript: NM_197947
Changed!
cd CLECT_NK_receptors_like 124aa 2e-27
in ref transcript
CLECT_NK_receptors_like: C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs), including proteins similar to oxidized low density lipoprotein (OxLDL) receptor (LOX-1), CD94, CD69, NKG2-A and -D, osteoclast inhibitory lectin (OCIL), dendritic cell-associated C-type lectin-1 (dectin-1), human myeloid inhibitory C-type lectin-like receptor (MICL), mast cell-associated functional antigen (MAFA), killer cell lectin-like receptors: subfamily F, member 1 (KLRF1) and subfamily B, member 1 (KLRB1), and lys49 receptors. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. NKRs are variously associated with activation or inhibition of natural killer (NK) cells. Activating NKRs stimulate cytolysis by NK cells of virally infected or transformed cells; inhibitory NKRs block cytolysis upon recognition of markers of healthy self cells. Most Lys49 receptors are inhibitory; some are stimulatory. OCIL inhibits NK cell function via binding to the receptor NKRP1D. Murine OCIL in addition to inhibiting NK cell function inhibits osteoclast differentiation. MAFA clusters with the type I Fc epsilon receptor (FcepsilonRI) and inhibits the mast cells secretory response to FcepsilonRI stimulus. CD72 is a negative regulator of B cell receptor signaling. NKG2D is an activating receptor for stress-induced antigens; human NKG2D ligands include the stress induced MHC-I homologs, MICA, MICB, and ULBP family of glycoproteins Several NKRs have a carbohydrate-binding capacity which is not mediated through calcium ions (e.g. OCIL binds a range of high molecular weight sulfated glycosaminoglycans including dextran sulfate, fucoidan, and gamma-carrageenan sugars). Dectin-1 binds fungal beta-glucans and in involved in the innate immune responses to fungal pathogens. MAFA binds saccharides having terminal alpha-D mannose residues in a calcium-dependent manner. LOX-1 is the major receptor for OxLDL in endothelial cells and thought to play a role in the pathology of atherosclerosis. Some NKRs exist as homodimers (e.g.Lys49, NKG2D, CD69, LOX-1) and some as heterodimers (e.g. CD94/NKG2A). Dectin-1 can function as a monomer in vitro.
Changed!
smart CLECT 123aa 1e-18
in ref transcript
C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.
Changed!
cd CLECT_NK_receptors_like 45aa 9e-10
in modified transcript
Changed!
smart CLECT 45aa 2e-08
in modified transcript
CLEC7A
refseq_CLEC7A.F3 refseq_CLEC7A.R3 158 296
NCBIGene 36.3 64581
Single exon skipping, size difference: 138
Exclusion in the protein (no frameshift)
Reference transcript: NM_197947
cd CLECT_NK_receptors_like 124aa 2e-27
in ref transcript
CLECT_NK_receptors_like: C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs), including proteins similar to oxidized low density lipoprotein (OxLDL) receptor (LOX-1), CD94, CD69, NKG2-A and -D, osteoclast inhibitory lectin (OCIL), dendritic cell-associated C-type lectin-1 (dectin-1), human myeloid inhibitory C-type lectin-like receptor (MICL), mast cell-associated functional antigen (MAFA), killer cell lectin-like receptors: subfamily F, member 1 (KLRF1) and subfamily B, member 1 (KLRB1), and lys49 receptors. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. NKRs are variously associated with activation or inhibition of natural killer (NK) cells. Activating NKRs stimulate cytolysis by NK cells of virally infected or transformed cells; inhibitory NKRs block cytolysis upon recognition of markers of healthy self cells. Most Lys49 receptors are inhibitory; some are stimulatory. OCIL inhibits NK cell function via binding to the receptor NKRP1D. Murine OCIL in addition to inhibiting NK cell function inhibits osteoclast differentiation. MAFA clusters with the type I Fc epsilon receptor (FcepsilonRI) and inhibits the mast cells secretory response to FcepsilonRI stimulus. CD72 is a negative regulator of B cell receptor signaling. NKG2D is an activating receptor for stress-induced antigens; human NKG2D ligands include the stress induced MHC-I homologs, MICA, MICB, and ULBP family of glycoproteins Several NKRs have a carbohydrate-binding capacity which is not mediated through calcium ions (e.g. OCIL binds a range of high molecular weight sulfated glycosaminoglycans including dextran sulfate, fucoidan, and gamma-carrageenan sugars). Dectin-1 binds fungal beta-glucans and in involved in the innate immune responses to fungal pathogens. MAFA binds saccharides having terminal alpha-D mannose residues in a calcium-dependent manner. LOX-1 is the major receptor for OxLDL in endothelial cells and thought to play a role in the pathology of atherosclerosis. Some NKRs exist as homodimers (e.g.Lys49, NKG2D, CD69, LOX-1) and some as heterodimers (e.g. CD94/NKG2A). Dectin-1 can function as a monomer in vitro.
smart CLECT 123aa 1e-18
in ref transcript
C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.
CLK1
refseq_CLK1.F1 refseq_CLK1.R1 109 200
NCBIGene 36.2 1195
Single exon skipping, size difference: 91
Exclusion in the protein causing a frameshift
Reference transcript: NM_004071
Changed!
cd S_TKc 318aa 1e-52
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
Changed!
pfam Pkinase 317aa 4e-60
in ref transcript
Protein kinase domain.
Changed!
PTZ PTZ00284 332aa 1e-52
in ref transcript
protein kinase; Provisional.
CLK2
refseq_CLK2.F1 refseq_CLK2.R1 228 313
NCBIGene 36.2 1196
Single exon skipping, size difference: 85
Exclusion in the protein causing a frameshift
Reference transcript: NM_003993
Changed!
cd S_TKc 318aa 1e-46
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
Changed!
pfam Pkinase 317aa 3e-53
in ref transcript
Protein kinase domain.
Changed!
PTZ PTZ00284 331aa 6e-49
in ref transcript
protein kinase; Provisional.
CLK3
refseq_CLK3.F2 refseq_CLK3.R2 197 294
NCBIGene 36.3 1198
Single exon skipping, size difference: 97
Exclusion in the protein causing a frameshift
Reference transcript: NM_003992
Changed!
cd S_TKc 318aa 8e-45
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
Changed!
smart S_TKc 307aa 4e-49
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
Changed!
PTZ PTZ00284 341aa 3e-46
in ref transcript
protein kinase; Provisional.
CLSTN1
refseq_CLSTN1.F1 refseq_CLSTN1.R1 165 195
NCBIGene 36.3 22883
Single exon skipping, size difference: 30
Exclusion in the protein (no frameshift)
Reference transcript: NM_001009566
Changed!
cd CA 218aa 7e-21
in ref transcript
Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here.
smart CA 76aa 1e-07
in ref transcript
Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
pfam Cadherin 89aa 3e-05
in ref transcript
Cadherin domain.
Changed!
cd CA 208aa 6e-22
in modified transcript
CLTA
refseq_CLTA.F2 refseq_CLTA.R2 287 377
NCBIGene 36.3 1211
Multiple exon skipping, size difference: 90
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_007096
Changed!
pfam Clathrin_lg_ch 220aa 2e-69
in ref transcript
Clathrin light chain.
Changed!
pfam Clathrin_lg_ch 190aa 7e-58
in modified transcript
CLTB
refseq_CLTB.F1 refseq_CLTB.R1 219 273
NCBIGene 36.3 1212
Single exon skipping, size difference: 54
Exclusion in the protein (no frameshift)
Reference transcript: NM_007097
Changed!
pfam Clathrin_lg_ch 201aa 2e-71
in ref transcript
Clathrin light chain.
Changed!
pfam Clathrin_lg_ch 183aa 1e-59
in modified transcript
CLTCL1
refseq_CLTCL1.F1 refseq_CLTCL1.R1 186 357
NCBIGene 36.2 8218
Single exon skipping, size difference: 171
Exclusion in 3'UTR
Reference transcript: NM_007098
pfam Clathrin 143aa 1e-33
in ref transcript
Region in Clathrin and VPS. Each region is about 140 amino acids long. The regions are composed of multiple alpha helical repeats. They occur in the arm region of the Clathrin heavy chain.
pfam Clathrin 140aa 8e-33
in ref transcript
pfam Clathrin 146aa 9e-32
in ref transcript
pfam Clathrin 143aa 8e-27
in ref transcript
pfam Clathrin 74aa 8e-11
in ref transcript
pfam Clathrin-link 24aa 2e-04
in ref transcript
Clathrin, heavy-chain linker. Members of this family adopt a structure consisting of alpha-alpha superhelix. They are predominantly found in clathrin, where they act as a heavy-chain linker domain.
CMTM3
refseq_CMTM3.F2 refseq_CMTM3.R2 139 245
NCBIGene 36.3 123920
Alternative 5-prime, size difference: 106
Exclusion in 5'UTR
Reference transcript: NM_144601
pfam MARVEL 55aa 2e-04
in ref transcript
Membrane-associating domain. MARVEL domain-containing proteins are often found in lipid-associating proteins - such as Occludin and MAL family proteins. It may be part of the machinery of membrane apposition events, such as transport vesicle biogenesis.
CMTM5
refseq_CMTM5.F1 refseq_CMTM5.R1 204 297
NCBIGene 36.3 116173
Single exon skipping, size difference: 93
Exclusion in the protein (no frameshift)
Reference transcript: NM_138460
Changed!
pfam MARVEL 61aa 3e-05
in ref transcript
Membrane-associating domain. MARVEL domain-containing proteins are often found in lipid-associating proteins - such as Occludin and MAL family proteins. It may be part of the machinery of membrane apposition events, such as transport vesicle biogenesis.
Changed!
pfam MARVEL 53aa 2e-04
in modified transcript
CMTM7
refseq_CMTM7.F2 refseq_CMTM7.R2 159 258
NCBIGene 36.3 112616
Single exon skipping, size difference: 99
Exclusion in the protein (no frameshift)
Reference transcript: NM_138410
Changed!
pfam MARVEL 124aa 7e-10
in ref transcript
Membrane-associating domain. MARVEL domain-containing proteins are often found in lipid-associating proteins - such as Occludin and MAL family proteins. It may be part of the machinery of membrane apposition events, such as transport vesicle biogenesis.
Changed!
pfam MARVEL 82aa 6e-06
in modified transcript
CNIH
refseq_CNIH.F1 refseq_CNIH.R1 121 234
NCBIGene 36.2 10175
Single exon skipping, size difference: 113
Exclusion in the protein causing a frameshift
Reference transcript: NM_005776
Changed!
pfam Cornichon 117aa 2e-39
in ref transcript
Cornichon protein.
Changed!
pfam Cornichon 45aa 1e-07
in modified transcript
CNN2
refseq_CNN2.F1 refseq_CNN2.R1 226 343
NCBIGene 36.3 1265
Single exon skipping, size difference: 117
Exclusion in the protein (no frameshift)
Reference transcript: NM_004368
Changed!
cd CH 101aa 1e-16
in ref transcript
Calponin homology domain; actin-binding domain which may be present as a single copy or in tandem repeats (which increases binding affinity). The CH domain is found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).
Changed!
pfam CH 102aa 8e-18
in ref transcript
Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is not involved in its actin binding activity. Most proteins have two copies of the CH domain, however some proteins such as calponin have only a single copy.
Changed!
pfam Calponin 26aa 1e-05
in ref transcript
Calponin family repeat.
pfam Calponin 26aa 3e-04
in ref transcript
pfam Calponin 25aa 0.001
in ref transcript
Changed!
COG SCP1 167aa 2e-20
in ref transcript
Calponin [Cytoskeleton].
Changed!
cd CH 100aa 8e-17
in modified transcript
Changed!
pfam CH 100aa 7e-18
in modified transcript
Changed!
pfam Calponin 23aa 9e-04
in modified transcript
Changed!
COG SCP1 168aa 5e-16
in modified transcript
CNNM2
refseq_CNNM2.F1 refseq_CNNM2.R1 151 217
NCBIGene 36.3 54805
Single exon skipping, size difference: 66
Exclusion in the protein (no frameshift)
Reference transcript: NM_017649
cd CBS_pair_CorC_HlyC_assoc 106aa 2e-16
in ref transcript
This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the CorC_HlyC domain. CorC_HlyC is a transporter associated domain. This small domain is found in Na+/H+ antiporters, in proteins involved in magnesium and cobalt efflux, and in association with some proteins of unknown function. The function of the CorC_HlyC domain is uncertain but it might be involved in modulating transport of ion substrates. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. The second CBS domain in this CD is degenerate.
TIGR GldE 326aa 7e-29
in ref transcript
Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldC is a protein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae and Cytophaga hutchinsonii. GldE was discovered because of its adjacency to GldD in F. johnsonii. Overexpression of GldE partially supresses the effects of a GldB point mutant suggesting that GldB and GldE interact. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Not all Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility and in fact some do not appear to express the gliding phenotype.
COG TlyC 331aa 9e-37
in ref transcript
Hemolysins and related proteins containing CBS domains [General function prediction only].
CNNM3
refseq_CNNM3.F2 refseq_CNNM3.R2 276 420
NCBIGene 36.3 26505
Single exon skipping, size difference: 144
Exclusion in the protein (no frameshift)
Reference transcript: NM_017623
Changed!
cd CBS_pair_CorC_HlyC_assoc 122aa 8e-19
in ref transcript
This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the CorC_HlyC domain. CorC_HlyC is a transporter associated domain. This small domain is found in Na+/H+ antiporters, in proteins involved in magnesium and cobalt efflux, and in association with some proteins of unknown function. The function of the CorC_HlyC domain is uncertain but it might be involved in modulating transport of ion substrates. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. The second CBS domain in this CD is degenerate.
Changed!
TIGR GldE 143aa 8e-20
in ref transcript
Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldC is a protein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae and Cytophaga hutchinsonii. GldE was discovered because of its adjacency to GldD in F. johnsonii. Overexpression of GldE partially supresses the effects of a GldB point mutant suggesting that GldB and GldE interact. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Not all Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility and in fact some do not appear to express the gliding phenotype.
Changed!
COG TlyC 250aa 1e-22
in ref transcript
Hemolysins and related proteins containing CBS domains [General function prediction only].
Changed!
cd CBS_pair_CorC_HlyC_assoc 87aa 5e-10
in modified transcript
Changed!
TIGR GldE 95aa 5e-09
in modified transcript
Changed!
COG TlyC 191aa 2e-12
in modified transcript
CNTFR
refseq_CNTFR.F1 refseq_CNTFR.R1 144 255
NCBIGene 36.3 1271
Single exon skipping, size difference: 111
Exclusion in 5'UTR
Reference transcript: NM_147164
cd FN3 97aa 1e-06
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
smart IGc2 58aa 2e-07
in ref transcript
Immunoglobulin C-2 Type.
pfam fn3 91aa 1e-04
in ref transcript
Fibronectin type III domain.
CNTN1
refseq_CNTN1.F1 refseq_CNTN1.R1 134 167
NCBIGene 36.3 1272
Single exon skipping, size difference: 33
Exclusion in the protein (no frameshift)
Reference transcript: NM_001843
cd IGcam 88aa 2e-12
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IGcam 86aa 8e-12
in ref transcript
cd IGcam 76aa 9e-11
in ref transcript
cd FN3 88aa 3e-10
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd IGcam 86aa 6e-10
in ref transcript
cd FN3 91aa 1e-07
in ref transcript
cd FN3 93aa 6e-07
in ref transcript
cd IGcam 96aa 4e-06
in ref transcript
cd IGcam 69aa 0.002
in ref transcript
cd FN3 85aa 0.005
in ref transcript
smart IGc2 60aa 2e-14
in ref transcript
Immunoglobulin C-2 Type.
smart IG_like 81aa 2e-11
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
pfam fn3 83aa 8e-11
in ref transcript
Fibronectin type III domain.
pfam I-set 78aa 3e-10
in ref transcript
Immunoglobulin I-set domain.
smart IG_like 68aa 5e-09
in ref transcript
smart IG_like 91aa 2e-07
in ref transcript
pfam fn3 84aa 3e-07
in ref transcript
smart FN3 91aa 2e-04
in ref transcript
Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
pfam fn3 81aa 0.007
in ref transcript
smart IG_like 73aa 0.009
in ref transcript
CNTROB
refseq_CNTROB.F1 refseq_CNTROB.R1 183 249
NCBIGene 36.3 116840
Alternative 5-prime, size difference: 66
Exclusion in the protein (no frameshift)
Reference transcript: NM_001037144
COG5
refseq_COG5.F1 refseq_COG5.R1 183 246
NCBIGene 36.3 10466
Single exon skipping, size difference: 63
Exclusion in the protein (no frameshift)
Reference transcript: NM_006348
pfam COG5 124aa 4e-26
in ref transcript
Golgi transport complex subunit 5. The COG complex, the peripheral membrane oligomeric protein complex involved in intra-Golgi protein trafficking, consists of eight subunits arranged in two lobes bridged by Cog1. Cog5 is in the smaller, B lobe, bound in with Cog6-8, and is itself bound to Cog1 as well as, strongly, to Cog7.
COL11A2
refseq_COL11A2.F1 refseq_COL11A2.R1 108 288
NCBIGene 36.3 1302
Single exon skipping, size difference: 180
Exclusion in the protein (no frameshift)
Reference transcript: NM_080680
cd LamG 152aa 1e-05
in ref transcript
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
smart COLFI 196aa 6e-85
in ref transcript
Fibrillar collagens C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc.
smart TSPN 183aa 7e-56
in ref transcript
Thrombospondin N-terminal -like domains. Heparin-binding and cell adhesion domain of thrombospondin.
COL13A1
refseq_COL13A1.F1 refseq_COL13A1.R1 126 213
NCBIGene 36.3 1305
Single exon skipping, size difference: 87
Exclusion in the protein (no frameshift)
Reference transcript: NM_005203
Changed!
pfam Collagen 63aa 0.007
in modified transcript
Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxyproline. Collagens are post translationally modified by proline hydroxylase to form the hydroxyproline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure.
COL13A1
refseq_COL13A1.F3 refseq_COL13A1.R3 155 221
NCBIGene 36.3 1305
Single exon skipping, size difference: 66
Exclusion in the protein (no frameshift)
Reference transcript: NM_005203
COL13A1
refseq_COL13A1.F4 refseq_COL13A1.R4 93 150
NCBIGene 36.3 1305
Single exon skipping, size difference: 57
Exclusion in the protein (no frameshift)
Reference transcript: NM_005203
COL13A1
refseq_COL13A1.F6 refseq_COL13A1.R6 92 143
NCBIGene 36.3 1305
Single exon skipping, size difference: 51
Exclusion in the protein (no frameshift)
Reference transcript: NM_005203
COL2A1
refseq_COL2A1.F1 refseq_COL2A1.R1 129 336
NCBIGene 36.3 1280
Single exon skipping, size difference: 207
Exclusion in the protein (no frameshift)
Reference transcript: NM_001844
smart COLFI 236aa 1e-115
in ref transcript
Fibrillar collagens C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc.
Changed!
pfam VWC 56aa 2e-18
in ref transcript
von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094.
pfam Collagen 59aa 1e-04
in ref transcript
Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxyproline. Collagens are post translationally modified by proline hydroxylase to form the hydroxyproline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure.
COL4A3
refseq_COL4A3.F1 refseq_COL4A3.R1 182 355
NCBIGene 36.3 1285
Single exon skipping, size difference: 173
Exclusion in the protein causing a frameshift
Reference transcript: NM_000091
pfam C4 110aa 6e-54
in ref transcript
C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome.
Changed!
pfam C4 113aa 5e-47
in ref transcript
Changed!
pfam C4 31aa 2e-10
in modified transcript
COL4A3BP
refseq_COL4A3BP.F1 refseq_COL4A3BP.R1 195 273
NCBIGene 36.3 10087
Single exon skipping, size difference: 78
Exclusion in the protein (no frameshift)
Reference transcript: NM_005713
cd PH_GPBP 90aa 3e-42
in ref transcript
Goodpasture antigen binding protein (GPBP) Pleckstrin homology (PH) domain. The GPBP protein is a kinase that phosphorylates an N-terminal region of the alpha 3 chain of type IV collagen , which is commonly known as the goodpasture antigen. It has has an N-terminal PH domain and a C-terminal START domain. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinsases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.
Changed!
cd START 220aa 1e-25
in ref transcript
START(STeroidogenic Acute Regulatory (STAR) related lipid Transfer) Domain. These domains are 200-210 amino acid in length and occur in proteins involved in lipid transport (phosphatidylcholine) and metabolism, signal transduction, and transcriptional regulation. The most striking feature of the START domain structure is a predominantly hydrophobic tunnel extending nearly the entire protein and used to binding a single molecule of large lipophilic compounds, like cholesterol.
smart START 202aa 3e-18
in ref transcript
in StAR and phosphatidylcholine transfer protein. putative lipid-binding domain in StAR and phosphatidylcholine transfer protein.
pfam PH 92aa 3e-13
in ref transcript
PH domain. PH stands for pleckstrin homology.
Changed!
cd START 224aa 7e-26
in modified transcript
COL6A2
refseq_COL6A2.F2 refseq_COL6A2.R2 142 435
NCBIGene 36.3 1292
Alternative 3-prime, size difference: 293
Inclusion in the protein causing a new stop codon
Reference transcript: NM_058174
cd vWA_collagen_alpha_1-VI-type 192aa 1e-51
in ref transcript
VWA_collagen alpha(VI) type: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.
cd vWA_collagen_alpha_1-VI-type 189aa 1e-46
in ref transcript
pfam VWA 187aa 2e-29
in ref transcript
von Willebrand factor type A domain.
smart VWA 169aa 1e-18
in ref transcript
von Willebrand factor (vWF) type A domain. VWA domains in extracellular eukaryotic proteins mediate adhesion via metal ion-dependent adhesion sites (MIDAS). Intracellular VWA domains and homologues in prokaryotes have recently been identified. The proposed VWA domains in integrin beta subunits have recently been substantiated using sequence-based methods.
pfam Collagen 46aa 8e-06
in ref transcript
Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxyproline. Collagens are post translationally modified by proline hydroxylase to form the hydroxyproline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure.
pfam Collagen 62aa 4e-05
in ref transcript
pfam Collagen 88aa 6e-05
in ref transcript
COL8A1
refseq_COL8A1.F1 refseq_COL8A1.R1 119 181
NCBIGene 36.3 1295
Single exon skipping, size difference: 62
Exclusion in 5'UTR
Reference transcript: NM_001850
smart C1Q 136aa 9e-55
in ref transcript
Complement component C1q domain. Globular domain found in many collagens and eponymously in complement C1q. When part of full length proteins these domains form a 'bouquet' due to the multimerization of heterotrimers. The C1q fold is similar to that of tumour necrosis factor.
COLEC11
refseq_COLEC11.F1 refseq_COLEC11.R1 100 179
NCBIGene 36.3 78989
Single exon skipping, size difference: 79
Inclusion in the protein causing a new stop codon
Reference transcript: NM_024027
Changed!
cd CLECT_collectin_like 116aa 5e-45
in ref transcript
CLECT_collectin_like: C-type lectin-like domain (CTLD) of the type found in human collectins including lung surfactant proteins A and D, mannose- or mannan binding lectin (MBL), and CL-L1 (collectin liver 1). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. The CTLDs of these collectins bind carbohydrates on surfaces (e.g. pathogens, allergens, necrotic, or apoptotic cells) and mediate functions associated with killing and phagocytosis. MBPs recognize high mannose oligosaccharides in a calcium dependent manner, bind to a broad range of pathogens, and trigger cell killing by activating the complement pathway. MBP also acts directly as an opsonin. SP-A and SP-D in addition to functioning as host defense components, are components of pulmonary surfactant which play a role in surfactant homeostasis. Pulmonary surfactant is a phospholipid-protein complex which reduces the surface tension within the lungs. SP-A binds the major surfactant lipid: dipalmitoylphosphatidylcholine (DPPC). SP-D binds two minor components of surfactant that contain sugar moieties: glucosylceramide and phosphatidylinositol (PI). MBP and SP-A, -D monomers are homotrimers with an N-terminal collagen region and three CTLDs. Multiple homotrimeric units associate to form supramolecular complexes. MBL deficiency results in an increased susceptibility to a large number of different infections and to inflammatory disease, such as rheumatoid arthritis.
Changed!
pfam Lectin_C 108aa 2e-19
in ref transcript
Lectin C-type domain. This family includes both long and short form C-type.
COLQ
refseq_COLQ.F1 refseq_COLQ.R1 142 169
NCBIGene 36.3 8292
Single exon skipping, size difference: 27
Exclusion in the protein (no frameshift)
Reference transcript: NM_005677
TIGR myxo_disulf_rpt 25aa 5e-04
in ref transcript
This model represents a sequence region shared between several proteins of Myxococcus xanthus DK 1622 and some eukaryotic proteins that include human pappalysin-1. The region of about 40 amino acids contains several conserved Cys residues presumed to form disulfide bonds. The region appears in up to 13 repeats in Myxococcus.
pfam Collagen 63aa 0.004
in ref transcript
Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxyproline. Collagens are post translationally modified by proline hydroxylase to form the hydroxyproline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure.
COLQ
refseq_COLQ.F4 refseq_COLQ.R4 136 276
NCBIGene 36.2 8292
Single exon skipping, size difference: 140
Exclusion in the protein causing a frameshift
Reference transcript: NM_005677
Changed!
TIGR myxo_disulf_rpt 25aa 5e-04
in ref transcript
This model represents a sequence region shared between several proteins of Myxococcus xanthus DK 1622 and some eukaryotic proteins that include human pappalysin-1. The region of about 40 amino acids contains several conserved Cys residues presumed to form disulfide bonds. The region appears in up to 13 repeats in Myxococcus.
Changed!
pfam Collagen 63aa 0.004
in ref transcript
Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxyproline. Collagens are post translationally modified by proline hydroxylase to form the hydroxyproline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure.
COMMD6
refseq_COMMD6.F1 refseq_COMMD6.R1 147 186
NCBIGene 36.3 170622
Single exon skipping, size difference: 39
Exclusion in 5'UTR
Reference transcript: NM_203497
COP1
refseq_COP1.F1 refseq_COP1.R1 333 396
NCBIGene 36.3 114769
Single exon skipping, size difference: 63
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001017534
pfam CARD 88aa 7e-19
in ref transcript
Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold.
COPE
refseq_COPE.F1 refseq_COPE.R1 222 378
NCBIGene 36.3 11316
Single exon skipping, size difference: 156
Exclusion in the protein (no frameshift)
Reference transcript: NM_007263
Changed!
cd TPR 83aa 0.007
in ref transcript
Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
Changed!
pfam Coatomer_E 291aa 1e-109
in ref transcript
Coatomer epsilon subunit. This family represents the epsilon subunit of the coatomer complex, which is involved in the regulation of intracellular protein trafficking between the endoplasmic reticulum and the Golgi complex.
Changed!
pfam Coatomer_E 239aa 5e-75
in modified transcript
COPS8
refseq_COPS8.F1 refseq_COPS8.R1 191 256
NCBIGene 36.3 10920
Single exon skipping, size difference: 65
Inclusion in the protein causing a new stop codon
Reference transcript: NM_006710
Changed!
pfam PCI_Csn8 175aa 3e-60
in ref transcript
COP9 signalosome, subunit CSN8. This PCI_Csn8 domain is conserved from plants to humans. It is a signature protein motif found in components of CSN (COP9 signalosome). It functions as a structural scaffold for subunit-subunit interactions within the complex and is a key regulator of photomorphogenic development.
COQ6
refseq_COQ6.F1 refseq_COQ6.R1 168 227
NCBIGene 36.3 51004
Single exon skipping, size difference: 59
Exclusion in the protein causing a frameshift
Reference transcript: NM_182476
Changed!
TIGR COQ6 428aa 0.0
in ref transcript
This model represents the monooxygenase responsible for the 4-hydroxylateion of the phenol ring in the aerobic biosynthesis of ubiquinone.
Changed!
PRK PRK08850 390aa 3e-61
in ref transcript
Changed!
TIGR COQ6 67aa 2e-14
in modified transcript
ENOX2
refseq_COVA1.F2 refseq_COVA1.R2 184 273
NCBIGene 36.3 10495
Single exon skipping, size difference: 89
Exclusion of the protein initiation site
Reference transcript: NM_182314
cd RRM 58aa 1e-06
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
pfam RRM_1 58aa 3e-07
in ref transcript
RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternative splicing, and protein components of snRNPs. The motif also appears in a few single stranded DNA binding proteins. The RRM structure consists of four strands and two helices arranged in an alpha/beta sandwich, with a third helix present during RNA binding in some cases The C-terminal beta strand (4th strand) and final helix are hard to align and have been omitted in the SEED alignment The LA proteins have a N terminus rrm which is included in the seed. There is a second region towards the C terminus that has some features of a rrm but does not appear to have the important structural core of a rrm. The LA proteins are one of the main autoantigens in Systemic lupus erythematosus (SLE), an autoimmune disease.
COG COG0724 79aa 0.001
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
CPB2
refseq_CPB2.F1 refseq_CPB2.R1 100 211
NCBIGene 36.3 1361
Single exon skipping, size difference: 111
Exclusion in the protein (no frameshift)
Reference transcript: NM_001872
Changed!
cd M14_CPB2 302aa 1e-170
in ref transcript
Peptidase M14 Carboxypeptidase (CP) B2 (CPB2, also known as plasma carboxypeptidase B, carboxypeptidase U, and CPU), belongs to the carboxpeptidase A/B subfamily of the M14 family of metallocarboxypeptidases (MCPs). The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPB2 enzyme displays B-like activity; it only cleaves the basic residues lysine or arginine. It is produced and secreted by the liver as the inactive precursor, procarboxypeptidase U or PCPB2, commonly referred to as thrombin-activatable fibrinolysis inhibitor (TAFI). It circulates in plasma as a zymogen bound to plasminogen, and the active enzyme, TAFIa, inhibits fibrinolysis. It is highly regulated, increased TAFI concentrations are thought to increase the risk of thrombosis and coronary artery disease by reducing fibrinolytic activity while low TAFI levels have been correlated with chronic liver disease.
Changed!
smart Zn_pept 283aa 2e-92
in ref transcript
Zn_pept.
pfam Propep_M14 78aa 2e-20
in ref transcript
Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of folding and activity of the pro-enzyme.
Changed!
COG COG2866 310aa 3e-08
in ref transcript
Predicted carboxypeptidase [Amino acid transport and metabolism].
Changed!
cd M14_CPB2 265aa 1e-145
in modified transcript
Changed!
smart Zn_pept 246aa 3e-76
in modified transcript
CPEB2
refseq_CPEB2.F2 refseq_CPEB2.R2 316 406
NCBIGene 36.3 132864
Single exon skipping, size difference: 90
Exclusion in the protein (no frameshift)
Reference transcript: NM_182485
cd RRM 88aa 1e-09
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
cd RRM 79aa 0.002
in ref transcript
smart RRM_2 63aa 9e-08
in ref transcript
RNA recognition motif.
Changed!
COG COG0724 176aa 1e-04
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
Changed!
COG COG0724 272aa 3e-04
in modified transcript
CPM
refseq_CPM.F1 refseq_CPM.R1 102 120
NCBIGene 36.3 1368
Alternative 5-prime, size difference: 18
Exclusion in 5'UTR
Reference transcript: NM_198320
cd M14_CPM 376aa 0.0
in ref transcript
Peptidase M14 Carboxypeptidase (CP) M (CPM) belongs to the N/E subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPM is an extracellular glycoprotein, bound to cell membranes via a glycosyl-phosphatidylinositol on the C-terminus of the protein. It specifically removes C-terminal basic residues such as lysine and arginine from peptides and proteins. The highest levels of CPM have been found in human lung and placenta, but significant amounts are present in kidney, blood vessels, intestine, brain, and peripheral nerves. CPM has also been found in soluble form in various body fluids, including amniotic fluid, seminal plasma and urine. Due to its wide distribution in a variety of tissues, it is believed that it plays an important role in the control of peptide hormones and growth factor activity on the cell surface and in the membrane-localized degradation of extracellular proteins, for example it hydrolyses the C-terminal arginine of epidermal growth factor (EGF) resulting in des-Arg-EGF which binds to the EGF receptor (EGFR) with an equal or greater affinity than native EGF. CPM is a required processing enzyme that generates specific agonists for the B1 receptor.
smart Zn_pept 273aa 8e-69
in ref transcript
Zn_pept.
COG COG2866 138aa 1e-04
in ref transcript
Predicted carboxypeptidase [Amino acid transport and metabolism].
COG PcaH 71aa 0.002
in ref transcript
Protocatechuate 3,4-dioxygenase beta subunit [Secondary metabolites biosynthesis, transport, and catabolism].
CPNE1
refseq_CPNE1.F1 refseq_CPNE1.R1 152 295
NCBIGene 36.3 8904
Single exon skipping, size difference: 143
Exclusion in 5'UTR
Reference transcript: NM_152930
cd vWA_copine_like 259aa 4e-88
in ref transcript
VWA Copine: Copines are phospholipid-binding proteins originally identified in paramecium. They are found in human and orthologues have been found in C. elegans and Arabidopsis Thaliana. None have been found in D. Melanogaster or S. Cereviciae. Phylogenetic distribution suggests that copines have been lost in some eukaryotes. No functional properties have been assigned to the VWA domains present in copines. The members of this subgroup contain a functional MIDAS motif based on their preferential binding to magnesium and manganese. However, the MIDAS motif is not totally conserved, in most cases the MIDAS consists of the sequence DxTxS instead of the motif DxSxS that is found in most cases. The C2 domains present in copines mediate phospholipid binding.
cd C2 109aa 7e-15
in ref transcript
Protein kinase C conserved region 2 (CalB); Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands.
cd C2 99aa 2e-10
in ref transcript
pfam Copine 148aa 4e-65
in ref transcript
Copine. This family represents a conserved region approximately 180 residues long within eukaryotic copines. Copines are Ca(2+)-dependent phospholipid-binding proteins that are thought to be involved in membrane-trafficking, and may also be involved in cell division and growth.
pfam C2 85aa 3e-14
in ref transcript
C2 domain.
smart C2 97aa 3e-10
in ref transcript
Protein kinase C conserved region 2 (CalB). Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotamins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates, and intracellular proteins. Unusual occurrence in perforin. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands. SMART detects C2 domains using one or both of two profiles.
COG COG5038 94aa 3e-07
in ref transcript
Ca2+-dependent lipid-binding protein, contains C2 domain [General function prediction only].
CPNE1
refseq_CPNE1.F3 refseq_CPNE1.R3 180 212
NCBIGene 36.3 8904
Alternative 3-prime, size difference: 32
Inclusion in 5'UTR
Reference transcript: NM_152931
cd vWA_copine_like 259aa 4e-88
in ref transcript
VWA Copine: Copines are phospholipid-binding proteins originally identified in paramecium. They are found in human and orthologues have been found in C. elegans and Arabidopsis Thaliana. None have been found in D. Melanogaster or S. Cereviciae. Phylogenetic distribution suggests that copines have been lost in some eukaryotes. No functional properties have been assigned to the VWA domains present in copines. The members of this subgroup contain a functional MIDAS motif based on their preferential binding to magnesium and manganese. However, the MIDAS motif is not totally conserved, in most cases the MIDAS consists of the sequence DxTxS instead of the motif DxSxS that is found in most cases. The C2 domains present in copines mediate phospholipid binding.
cd C2 109aa 7e-15
in ref transcript
Protein kinase C conserved region 2 (CalB); Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands.
cd C2 99aa 2e-10
in ref transcript
pfam Copine 148aa 4e-65
in ref transcript
Copine. This family represents a conserved region approximately 180 residues long within eukaryotic copines. Copines are Ca(2+)-dependent phospholipid-binding proteins that are thought to be involved in membrane-trafficking, and may also be involved in cell division and growth.
pfam C2 85aa 3e-14
in ref transcript
C2 domain.
smart C2 97aa 3e-10
in ref transcript
Protein kinase C conserved region 2 (CalB). Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotamins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates, and intracellular proteins. Unusual occurrence in perforin. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands. SMART detects C2 domains using one or both of two profiles.
COG COG5038 94aa 3e-07
in ref transcript
Ca2+-dependent lipid-binding protein, contains C2 domain [General function prediction only].
CPNE7
refseq_CPNE7.F1 refseq_CPNE7.R1 140 365
NCBIGene 36.3 27132
Multiple exon skipping, size difference: 225
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_014427
cd vWA_copine_like 258aa 4e-84
in ref transcript
VWA Copine: Copines are phospholipid-binding proteins originally identified in paramecium. They are found in human and orthologues have been found in C. elegans and Arabidopsis Thaliana. None have been found in D. Melanogaster or S. Cereviciae. Phylogenetic distribution suggests that copines have been lost in some eukaryotes. No functional properties have been assigned to the VWA domains present in copines. The members of this subgroup contain a functional MIDAS motif based on their preferential binding to magnesium and manganese. However, the MIDAS motif is not totally conserved, in most cases the MIDAS consists of the sequence DxTxS instead of the motif DxSxS that is found in most cases. The C2 domains present in copines mediate phospholipid binding.
Changed!
cd C2 98aa 5e-11
in ref transcript
Protein kinase C conserved region 2 (CalB); Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands.
cd C2 87aa 9e-09
in ref transcript
pfam Copine 148aa 2e-70
in ref transcript
Copine. This family represents a conserved region approximately 180 residues long within eukaryotic copines. Copines are Ca(2+)-dependent phospholipid-binding proteins that are thought to be involved in membrane-trafficking, and may also be involved in cell division and growth.
Changed!
pfam C2 87aa 2e-11
in ref transcript
C2 domain.
pfam C2 86aa 1e-10
in ref transcript
Changed!
COG COG5038 151aa 6e-04
in ref transcript
Ca2+-dependent lipid-binding protein, contains C2 domain [General function prediction only].
Changed!
cd C2 100aa 3e-11
in modified transcript
Changed!
smart C2 98aa 2e-11
in modified transcript
Protein kinase C conserved region 2 (CalB). Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotamins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates, and intracellular proteins. Unusual occurrence in perforin. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands. SMART detects C2 domains using one or both of two profiles.
Changed!
COG COG5038 112aa 0.001
in modified transcript
CPSF3L
refseq_CPSF3L.F2 refseq_CPSF3L.R2 211 277
NCBIGene 36.2 54973
Alternative 3-prime, size difference: 66
Exclusion in the protein (no frameshift)
Reference transcript: NM_017871
Changed!
TIGR arCOG00543 443aa 5e-86
in ref transcript
This family of proteins is universal in the archaea and consistsof an N-terminal type-1 KH-domain (pfam00013) a central beta-lactamase-domain (pfam00753) with a C-terminal motif associated with RNA metabolism (pfam07521). KH-domains are associated with RNA-binding, so taken together, this protein is a likely metal-dependent RNAase. This family was defined in as arCOG01782.
Changed!
COG YSH1 443aa 1e-94
in ref transcript
Predicted exonuclease of the beta-lactamase fold involved in RNA processing [Translation, ribosomal structure and biogenesis].
Changed!
TIGR arCOG00543 421aa 9e-80
in modified transcript
Changed!
COG YSH1 421aa 3e-83
in modified transcript
CPVL
refseq_CPVL.F2 refseq_CPVL.R2 185 273
NCBIGene 36.3 54504
Alternative 5-prime, size difference: 88
Exclusion in 5'UTR
Reference transcript: NM_019029
pfam Peptidase_S10 400aa 2e-97
in ref transcript
Serine carboxypeptidase.
COG COG2939 383aa 1e-35
in ref transcript
Carboxypeptidase C (cathepsin A) [Amino acid transport and metabolism].
CR2
refseq_CR2.F1 refseq_CR2.R1 145 322
NCBIGene 36.3 1380
Single exon skipping, size difference: 177
Exclusion in the protein (no frameshift)
Reference transcript: NM_001006658
cd CCP 58aa 1e-09
in ref transcript
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
Changed!
cd CCP 58aa 9e-09
in ref transcript
cd CCP 68aa 1e-08
in ref transcript
cd CCP 58aa 2e-08
in ref transcript
cd CCP 57aa 6e-08
in ref transcript
cd CCP 61aa 6e-08
in ref transcript
cd CCP 58aa 9e-08
in ref transcript
cd CCP 57aa 2e-07
in ref transcript
cd CCP 61aa 2e-07
in ref transcript
cd CCP 68aa 6e-07
in ref transcript
cd CCP 57aa 4e-06
in ref transcript
cd CCP 57aa 1e-05
in ref transcript
cd CCP 39aa 5e-05
in ref transcript
smart CCP 57aa 1e-11
in ref transcript
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.
smart CCP 67aa 6e-11
in ref transcript
Changed!
smart CCP 57aa 1e-10
in ref transcript
smart CCP 57aa 8e-10
in ref transcript
smart CCP 61aa 1e-09
in ref transcript
smart CCP 57aa 2e-09
in ref transcript
smart CCP 51aa 7e-09
in ref transcript
pfam Sushi 60aa 9e-09
in ref transcript
Sushi domain (SCR repeat).
smart CCP 56aa 2e-08
in ref transcript
smart CCP 67aa 2e-08
in ref transcript
smart CCP 56aa 6e-07
in ref transcript
pfam Sushi 56aa 1e-06
in ref transcript
smart CCP 57aa 1e-05
in ref transcript
pfam Sushi 57aa 5e-04
in ref transcript
Changed!
cd CCP 58aa 0.010
in modified transcript
CRAT
refseq_CRAT.F2 refseq_CRAT.R2 139 385
NCBIGene 36.2 1384
Exon skipping and alternative 3-prime or 5-prime, size difference: 246
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_000755
Changed!
pfam Carn_acyltransf 582aa 0.0
in ref transcript
Choline/Carnitine o-acyltransferase.
Changed!
pfam Carn_acyltransf 250aa 1e-107
in modified transcript
Changed!
pfam Carn_acyltransf 248aa 2e-94
in modified transcript
CREB1
refseq_CREB1.F1 refseq_CREB1.R1 151 193
NCBIGene 36.3 1385
Single exon skipping, size difference: 42
Exclusion in the protein (no frameshift)
Reference transcript: NM_134442
pfam pKID 29aa 3e-10
in ref transcript
pKID domain. CBP and P300 bind to the pKID (phosphorylated kinase-inducible-domain) domain of CREB.
pfam bZIP_1 57aa 2e-06
in ref transcript
bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region.
CRELD1
refseq_CRELD1.F1 refseq_CRELD1.R1 168 362
NCBIGene 36.3 78987
Single exon skipping, size difference: 194
Exclusion in the protein causing a frameshift
Reference transcript: NM_001031717
smart EGF_CA 32aa 0.003
in ref transcript
Calcium-binding EGF-like domain.
pfam EGF_CA 28aa 0.006
in ref transcript
Calcium binding EGF domain.
CREM
refseq_CREM.F1 refseq_CREM.R1 306 495
NCBIGene 36.3 1390
Single exon skipping, size difference: 189
Exclusion in the protein (no frameshift)
Reference transcript: NM_182769
Changed!
pfam pKID 41aa 6e-11
in ref transcript
pKID domain. CBP and P300 bind to the pKID (phosphorylated kinase-inducible-domain) domain of CREB.
pfam bZIP_1 53aa 8e-07
in ref transcript
bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region.
Changed!
pfam pKID 37aa 4e-10
in modified transcript
CREM
refseq_CREM.F2 refseq_CREM.R2 106 204
NCBIGene 36.3 1390
Single exon skipping, size difference: 98
Exclusion of the protein initiation site
Reference transcript: NM_183013
pfam pKID 41aa 2e-10
in ref transcript
pKID domain. CBP and P300 bind to the pKID (phosphorylated kinase-inducible-domain) domain of CREB.
pfam bZIP_1 53aa 1e-04
in ref transcript
bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region.
CREM
refseq_CREM.F5 refseq_CREM.R4 185 221
NCBIGene 36.3 1390
Single exon skipping, size difference: 36
Exclusion of the protein initiation site
Reference transcript: NM_182723
pfam bZIP_1 53aa 3e-04
in ref transcript
bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region.
CREM
refseq_CREM.F7 refseq_CREM.R5 153 251
NCBIGene 36.3 1390
Single exon skipping, size difference: 98
Exclusion of the protein initiation site
Reference transcript: NM_181571
pfam pKID 41aa 9e-11
in ref transcript
pKID domain. CBP and P300 bind to the pKID (phosphorylated kinase-inducible-domain) domain of CREB.
pfam bZIP_1 53aa 4e-06
in ref transcript
bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region.
CREM
refseq_CREM.F8 refseq_CREM.R6 98 500
NCBIGene 36.3 1390
Alternative 3-prime, size difference: 402
Inclusion in the protein causing a new stop codon
Reference transcript: NM_181571
pfam pKID 41aa 9e-11
in ref transcript
pKID domain. CBP and P300 bind to the pKID (phosphorylated kinase-inducible-domain) domain of CREB.
Changed!
pfam bZIP_1 53aa 4e-06
in ref transcript
bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region.
Changed!
pfam bZIP_1 53aa 3e-06
in modified transcript
CRISP1
refseq_CRISP1.F1 refseq_CRISP1.R1 258 347
NCBIGene 36.3 167
Single exon skipping, size difference: 89
Exclusion in the protein causing a frameshift
Reference transcript: NM_001131
cd SCP_CRISP 139aa 3e-51
in ref transcript
SCP_CRISP: SCP-like extracellular protein domain, CRISP-like sub-family. The wider family of SCP containing proteins includes plant pathogenesis-related protein 1 (PR-1), CRISPs, mammalian cysteine-rich secretory proteins, which combine SCP with a C-terminal cysteine rich domain, and allergen 5 from vespid venom. Involvement of CRISP in response to pathogens, fertilization, and sperm maturation have been proposed. One member, Tex31 from the venom duct of Conus textile, has been shown to possess proteolytic activity sensitive to serine protease inhibitors. SCP has also been proposed to be a Ca++ chelating serine protease. The Ca++-chelating function would fit with various signaling processes that members of this family, such as the CRISPs, are involved in, and is supported by sequence and structural evidence of a conserved pocket containing two histidines and a glutamate. It also may explain how helothermine, a toxic peptide secreted by the beaded lizard, blocks Ca++ transporting ryanodine receptors. One member, DE or CRISP-1, has been shown to mediate gamete fusion by binding to the egg surface; a sequence motif in the SCP domain plays a role in that binding.
smart SCP 139aa 1e-31
in ref transcript
SCP / Tpx-1 / Ag5 / PR-1 / Sc7 family of extracellular domains. Human glioma pathogenesis-related protein GliPR and the plant pathogenesis-related protein represent functional links between plant defense systems and human immune system. This family has no known function.
Changed!
pfam Crisp 55aa 3e-18
in ref transcript
Crisp. This domain is found on Crisp proteins which contain pfam00188 and has been termed the Crisp domain. It is found in the mammalian reproductive tract and the venom of reptiles, and has been shown to regulate ryanodine receptor Ca2+ signalling. It contains 10 conserved cysteines which are all involved in disulphide bonds and is structurally related to the ion channel inhibitor toxins BgK and ShK.
Changed!
cd SCP_CRISP 140aa 1e-51
in modified transcript
CRK
refseq_CRK.F1 refseq_CRK.R1 156 326
NCBIGene 36.3 1398
Alternative 5-prime, size difference: 170
Exclusion in the protein causing a frameshift
Reference transcript: NM_016823
cd SH2 106aa 2e-17
in ref transcript
Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
cd SH3 54aa 1e-12
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
Changed!
cd SH3 55aa 0.009
in ref transcript
smart SH2 97aa 9e-18
in ref transcript
Src homology 2 domains. Src homology 2 domains bind phosphotyrosine-containing polypeptides via 2 surface pockets. Specificity is provided via interaction with residues that are distinct from the phosphotyrosine. Only a single occurrence of a SH2 domain has been found in S. cerevisiae.
smart SH3 58aa 1e-14
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
Changed!
pfam SH3_2 55aa 5e-11
in ref transcript
Variant SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organisation. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel.
MED23
refseq_CRSP3.F1 refseq_CRSP3.R1 102 120
NCBIGene 36.3 9439
Single exon skipping, size difference: 18
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_004830
CRTC1
refseq_CRTC1.F2 refseq_CRTC1.R2 100 187
NCBIGene 36.2 23373
Single exon skipping, size difference: 87
Exclusion in the protein (no frameshift)
Reference transcript: NM_015321
CS
refseq_CS.F1 refseq_CS.R1 149 232
NCBIGene 36.3 1431
Single exon skipping, size difference: 83
Inclusion in the protein causing a new stop codon
Reference transcript: NM_004077
Changed!
cd ScCit1-2_like 427aa 0.0
in ref transcript
Saccharomyces cerevisiae (Sc) citrate synthases Cit1-2_like. Citrate synthases (CS) catalyzes the condensation of acetyl coenzyme A (AcCoA) with oxaloacetate (OAA) to form citrate and coenzyme A (CoA), the first step in the citric acid cycle (TCA or Krebs cycle). Some CS proteins function as 2-methylcitrate synthase (2MCS). 2MCS catalyzes the condensation of propionyl-coenzyme A (PrCoA) and OAA to form 2-methylcitrate and CoA during propionate metabolism. The overall CS reaction is thought to proceed through three partial reactions and involves both closed and open conformational forms of the enzyme: a) the carbanion or equivalent is generated from AcCoA by base abstraction of a proton, b) the nucleophilic attack of this carbanion on OAA to generate citryl-CoA, and c) the hydrolysis of citryl-CoA to produce citrate and CoA. There are two types of CSs: type I CS and type II CSs. Type I CSs are found in eukarya, gram-positive bacteria, archaea, and in some gram-negative bacteria and are homodimers with both subunits participating in the active site. Type II CSs are unique to gram-negative bacteria and are homohexamers of identical subunits (approximated as a trimer of dimers). ScCit1 is a nuclear-encoded mitochondrial CS with highly specificity for AcCoA. In addition to its CS function, ScCit1 plays a part in the construction of the TCA cycle metabolon. Yeast cells deleted for Cit1 are hyper-susceptible to apoptosis induced by heat and aging stress. ScCit2 is a peroxisomal CS involved in the glyoxylate cycle; in addition to having activity with AcCoA, it may have activity with PrCoA. Chicken and pig heart CS, two Arabidopsis thaliana (Ath) CSs, CSY4 and -5, and Aspergillus niger (An) CS also belong to this group. Ath CSY4 has a mitochondrial targeting sequence; AthCSY5 has no identifiable targeting sequence. AnCS encoded by the citA gene has both an N-terminal mitochondrial import signal and a C-terminal peroxisiomal target sequence; it is not known if both these signals are functional in vivo. This group contains proteins which functions exclusively as either a CS or a 2MCS, as well as those with relaxed specificity which have dual functions as both a CS and a 2MCS.
Changed!
TIGR cit_synth_euk 428aa 0.0
in ref transcript
This model includes both mitochondrial and peroxisomal forms of citrate synthase. Citrate synthase is the entry point to the TCA cycle from acetyl-CoA. Peroxisomal forms, such as a member from yeast (recognized by the C-terminal targeting motif SKL) act in the glyoxylate cycle. Eukaryotic homologs excluded by the high trusted cutoff of this model include a Tetrahymena thermophila citrate synthase that doubles as a filament protein, a putative citrate synthase from Plasmodium falciparum (no TCA cycle), and a methylcitrate synthase from Aspergillus nidulans.
Changed!
PRK PRK09569 428aa 1e-148
in ref transcript
type I citrate synthase; Reviewed.
CSAG1
refseq_CSAG1.F1 refseq_CSAG1.R1 100 316
NCBIGene 36.3 158511
Single exon skipping, size difference: 216
Exclusion in 5'UTR
Reference transcript: NM_153478
CSDE1
refseq_CSDE1.F1 refseq_CSDE1.R1 260 353
NCBIGene 36.3 7812
Single exon skipping, size difference: 93
Exclusion in the protein (no frameshift)
Reference transcript: NM_001007553
cd CSP_CDS 61aa 6e-08
in ref transcript
Cold-Shock Protein (CSP) contains an S1-like cold-shock domain (CSD) that is found in eukaryotes, prokaryotes, and archaea. CSP's include the major cold-shock proteins CspA and CspB in bacteria and the eukaryotic gene regulatory factor Y-box protein. CSP expression is up-regulated by an abrupt drop in growth temperature. CSP's are also expressed under normal condition at lower level. The function of cold-shock proteins is not fully understood. They preferentially bind poly-pyrimidine region of single-stranded RNA and DNA. CSP's are thought to bind mRNA and regulate ribosomal translation, mRNA degradation, and the rate of transcription termination. The human Y-box protein, which contains a CSD, regulates transcription and translation of genes that contain the Y-box sequence in their promoters. This specific ssDNA-binding properties of CSD are required for the binding of Y-box protein to the promoter's Y-box sequence, thereby regulating transcription.
cd CSP_CDS 50aa 1e-06
in ref transcript
cd CSP_CDS 53aa 0.003
in ref transcript
Changed!
cd CSP_CDS 54aa 0.010
in ref transcript
pfam CSD 63aa 3e-13
in ref transcript
'Cold-shock' DNA-binding domain.
smart CSP 64aa 5e-12
in ref transcript
Cold shock protein domain. RNA-binding domain that functions as a RNA-chaperone in bacteria and is involved in regulating translation in eukaryotes. Contains sub-family of RNA-binding domains in the Rho transcription termination factor.
pfam CSD 65aa 4e-11
in ref transcript
Changed!
pfam CSD 62aa 3e-10
in ref transcript
smart CSP 64aa 9e-10
in ref transcript
TIGR 3_prime_RNase 174aa 3e-04
in ref transcript
This model is defined to identify a pair of paralogous 3-prime exoribonucleases in E. coli, plus the set of proteins apparently orthologous to one or the other in other eubacteria. VacB was characterized originally as required for the expression of virulence genes, but is now recognized as the exoribonuclease RNase R (Rnr). Its paralog in E. coli and H. influenzae is designated exoribonuclease II (Rnb). Both are involved in the degradation of mRNA, and consequently have strong pleiotropic effects that may be difficult to disentangle. Both these proteins share domain-level similarity (RNB, S1) with a considerable number of other proteins, and full-length similarity scoring below the trusted cutoff to proteins associated with various phenotypes but uncertain biochemistry; it may be that these latter proteins are also 3-prime exoribonucleases.
COG CspC 64aa 7e-06
in ref transcript
Cold shock proteins [Transcription].
Changed!
smart CSP 61aa 3e-10
in modified transcript
CSF3R
refseq_CSF3R.F1 refseq_CSF3R.R1 170 251
NCBIGene 36.3 1441
Alternative 3-prime, size difference: 81
Exclusion in the protein (no frameshift)
Reference transcript: NM_156039
cd FN3 81aa 1e-04
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd FN3 91aa 0.002
in ref transcript
cd FN3 79aa 0.007
in ref transcript
pfam Lep_receptor_Ig 90aa 5e-27
in ref transcript
Ig-like C2-type domain. This domain is a ligand-binding immunoglobulin-like domain. The two cysteine residues form a disulphide bridge.
smart FN3 81aa 7e-06
in ref transcript
Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
pfam fn3 85aa 0.002
in ref transcript
Fibronectin type III domain.
CSF3R
refseq_CSF3R.F4 refseq_CSF3R.R4 122 216
NCBIGene 36.3 1441
Single exon skipping, size difference: 94
Exclusion in the protein causing a frameshift
Reference transcript: NM_156039
cd FN3 81aa 1e-04
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd FN3 91aa 0.002
in ref transcript
cd FN3 79aa 0.007
in ref transcript
pfam Lep_receptor_Ig 90aa 5e-27
in ref transcript
Ig-like C2-type domain. This domain is a ligand-binding immunoglobulin-like domain. The two cysteine residues form a disulphide bridge.
smart FN3 81aa 7e-06
in ref transcript
Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
pfam fn3 85aa 0.002
in ref transcript
Fibronectin type III domain.
CSMD3
refseq_CSMD3.F1 refseq_CSMD3.R1 100 412
NCBIGene 36.3 114788
Single exon skipping, size difference: 312
Exclusion in the protein (no frameshift)
Reference transcript: NM_198123
cd CUB 108aa 3e-26
in ref transcript
CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
cd CUB 108aa 3e-25
in ref transcript
cd CUB 108aa 2e-23
in ref transcript
cd CUB 107aa 4e-23
in ref transcript
cd CUB 108aa 6e-23
in ref transcript
cd CUB 109aa 1e-21
in ref transcript
cd CUB 107aa 1e-20
in ref transcript
cd CUB 108aa 7e-20
in ref transcript
cd CUB 109aa 8e-18
in ref transcript
cd CUB 109aa 1e-17
in ref transcript
cd CUB 107aa 4e-17
in ref transcript
cd CUB 108aa 4e-16
in ref transcript
Changed!
cd CUB 85aa 2e-14
in ref transcript
cd CCP 59aa 6e-09
in ref transcript
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
cd CCP 55aa 7e-09
in ref transcript
cd CCP 55aa 1e-08
in ref transcript
cd CCP 55aa 2e-08
in ref transcript
cd CCP 56aa 2e-08
in ref transcript
cd CCP 57aa 4e-08
in ref transcript
cd CCP 54aa 4e-08
in ref transcript
cd CCP 54aa 4e-08
in ref transcript
cd CCP 55aa 6e-08
in ref transcript
cd CCP 54aa 2e-07
in ref transcript
cd CCP 55aa 4e-07
in ref transcript
cd CCP 59aa 5e-07
in ref transcript
cd CCP 55aa 5e-07
in ref transcript
cd CUB 108aa 1e-06
in ref transcript
cd CCP 57aa 1e-06
in ref transcript
cd CCP 57aa 1e-06
in ref transcript
cd CCP 59aa 2e-06
in ref transcript
cd CCP 54aa 3e-06
in ref transcript
cd CCP 58aa 8e-06
in ref transcript
cd CCP 56aa 9e-06
in ref transcript
cd CCP 62aa 2e-05
in ref transcript
cd CCP 59aa 3e-05
in ref transcript
cd CCP 58aa 3e-05
in ref transcript
cd CCP 59aa 4e-05
in ref transcript
cd CCP 57aa 1e-04
in ref transcript
cd CCP 60aa 3e-04
in ref transcript
cd CCP 58aa 4e-04
in ref transcript
cd CCP 59aa 4e-04
in ref transcript
cd CCP 58aa 0.001
in ref transcript
pfam CUB 106aa 3e-26
in ref transcript
CUB domain.
pfam CUB 106aa 1e-25
in ref transcript
pfam CUB 106aa 4e-23
in ref transcript
pfam CUB 106aa 4e-23
in ref transcript
pfam CUB 101aa 3e-22
in ref transcript
pfam CUB 106aa 8e-21
in ref transcript
pfam CUB 107aa 4e-20
in ref transcript
pfam CUB 109aa 9e-20
in ref transcript
pfam CUB 106aa 5e-19
in ref transcript
pfam CUB 108aa 7e-18
in ref transcript
smart CUB 98aa 2e-17
in ref transcript
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein. This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.
pfam CUB 106aa 5e-16
in ref transcript
pfam CUB 102aa 5e-14
in ref transcript
smart CCP 54aa 7e-10
in ref transcript
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.
smart CCP 58aa 3e-09
in ref transcript
smart CCP 54aa 6e-09
in ref transcript
smart CCP 55aa 7e-09
in ref transcript
smart CCP 56aa 1e-08
in ref transcript
smart CCP 54aa 1e-08
in ref transcript
smart CCP 54aa 1e-08
in ref transcript
smart CCP 55aa 6e-08
in ref transcript
smart CCP 54aa 8e-08
in ref transcript
pfam CUB 106aa 1e-07
in ref transcript
smart CCP 54aa 1e-07
in ref transcript
smart CCP 56aa 1e-07
in ref transcript
pfam Sushi 58aa 1e-07
in ref transcript
Sushi domain (SCR repeat).
smart CCP 54aa 2e-07
in ref transcript
smart CCP 61aa 2e-07
in ref transcript
smart CCP 54aa 3e-07
in ref transcript
smart CCP 54aa 4e-07
in ref transcript
smart CCP 56aa 6e-07
in ref transcript
smart CCP 58aa 1e-06
in ref transcript
smart CCP 58aa 1e-06
in ref transcript
smart CCP 56aa 3e-06
in ref transcript
smart CCP 57aa 4e-06
in ref transcript
pfam Sushi 58aa 7e-06
in ref transcript
pfam Sushi 57aa 3e-05
in ref transcript
smart CCP 58aa 5e-05
in ref transcript
smart CCP 56aa 7e-05
in ref transcript
smart CCP 59aa 2e-04
in ref transcript
smart CCP 58aa 4e-04
in ref transcript
smart CCP 58aa 0.001
in ref transcript
Changed!
cd CUB 86aa 2e-14
in modified transcript
CSMD3
refseq_CSMD3.F2 refseq_CSMD3.R2 155 350
NCBIGene 36.3 114788
Single exon skipping, size difference: 195
Exclusion in the protein (no frameshift)
Reference transcript: NM_198123
cd CUB 108aa 3e-26
in ref transcript
CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
cd CUB 108aa 3e-25
in ref transcript
cd CUB 108aa 2e-23
in ref transcript
cd CUB 107aa 4e-23
in ref transcript
cd CUB 108aa 6e-23
in ref transcript
cd CUB 109aa 1e-21
in ref transcript
cd CUB 107aa 1e-20
in ref transcript
cd CUB 108aa 7e-20
in ref transcript
cd CUB 109aa 8e-18
in ref transcript
cd CUB 109aa 1e-17
in ref transcript
cd CUB 107aa 4e-17
in ref transcript
cd CUB 108aa 4e-16
in ref transcript
cd CUB 85aa 2e-14
in ref transcript
cd CCP 59aa 6e-09
in ref transcript
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
cd CCP 55aa 7e-09
in ref transcript
cd CCP 55aa 1e-08
in ref transcript
cd CCP 55aa 2e-08
in ref transcript
cd CCP 56aa 2e-08
in ref transcript
cd CCP 57aa 4e-08
in ref transcript
cd CCP 54aa 4e-08
in ref transcript
cd CCP 54aa 4e-08
in ref transcript
cd CCP 55aa 6e-08
in ref transcript
cd CCP 54aa 2e-07
in ref transcript
cd CCP 55aa 4e-07
in ref transcript
cd CCP 59aa 5e-07
in ref transcript
cd CCP 55aa 5e-07
in ref transcript
cd CUB 108aa 1e-06
in ref transcript
cd CCP 57aa 1e-06
in ref transcript
cd CCP 57aa 1e-06
in ref transcript
cd CCP 59aa 2e-06
in ref transcript
cd CCP 54aa 3e-06
in ref transcript
cd CCP 58aa 8e-06
in ref transcript
cd CCP 56aa 9e-06
in ref transcript
Changed!
cd CCP 62aa 2e-05
in ref transcript
cd CCP 59aa 3e-05
in ref transcript
cd CCP 58aa 3e-05
in ref transcript
cd CCP 59aa 4e-05
in ref transcript
cd CCP 57aa 1e-04
in ref transcript
cd CCP 60aa 3e-04
in ref transcript
cd CCP 58aa 4e-04
in ref transcript
cd CCP 59aa 4e-04
in ref transcript
cd CCP 58aa 0.001
in ref transcript
pfam CUB 106aa 3e-26
in ref transcript
CUB domain.
pfam CUB 106aa 1e-25
in ref transcript
pfam CUB 106aa 4e-23
in ref transcript
pfam CUB 106aa 4e-23
in ref transcript
pfam CUB 101aa 3e-22
in ref transcript
pfam CUB 106aa 8e-21
in ref transcript
pfam CUB 107aa 4e-20
in ref transcript
pfam CUB 109aa 9e-20
in ref transcript
pfam CUB 106aa 5e-19
in ref transcript
pfam CUB 108aa 7e-18
in ref transcript
smart CUB 98aa 2e-17
in ref transcript
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein. This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.
pfam CUB 106aa 5e-16
in ref transcript
pfam CUB 102aa 5e-14
in ref transcript
smart CCP 54aa 7e-10
in ref transcript
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.
smart CCP 58aa 3e-09
in ref transcript
smart CCP 54aa 6e-09
in ref transcript
smart CCP 55aa 7e-09
in ref transcript
smart CCP 56aa 1e-08
in ref transcript
smart CCP 54aa 1e-08
in ref transcript
smart CCP 54aa 1e-08
in ref transcript
smart CCP 55aa 6e-08
in ref transcript
smart CCP 54aa 8e-08
in ref transcript
pfam CUB 106aa 1e-07
in ref transcript
smart CCP 54aa 1e-07
in ref transcript
smart CCP 56aa 1e-07
in ref transcript
pfam Sushi 58aa 1e-07
in ref transcript
Sushi domain (SCR repeat).
smart CCP 54aa 2e-07
in ref transcript
Changed!
smart CCP 61aa 2e-07
in ref transcript
smart CCP 54aa 3e-07
in ref transcript
smart CCP 54aa 4e-07
in ref transcript
smart CCP 56aa 6e-07
in ref transcript
smart CCP 58aa 1e-06
in ref transcript
smart CCP 58aa 1e-06
in ref transcript
smart CCP 56aa 3e-06
in ref transcript
smart CCP 57aa 4e-06
in ref transcript
pfam Sushi 58aa 7e-06
in ref transcript
pfam Sushi 57aa 3e-05
in ref transcript
smart CCP 58aa 5e-05
in ref transcript
smart CCP 56aa 7e-05
in ref transcript
smart CCP 59aa 2e-04
in ref transcript
smart CCP 58aa 4e-04
in ref transcript
smart CCP 58aa 0.001
in ref transcript
CSNK1A1
refseq_CSNK1A1.F2 refseq_CSNK1A1.R2 138 222
NCBIGene 36.3 1452
Single exon skipping, size difference: 84
Exclusion in the protein (no frameshift)
Reference transcript: NM_001025105
Changed!
cd S_TKc 292aa 1e-31
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
Changed!
pfam Pkinase 240aa 2e-30
in ref transcript
Protein kinase domain.
Changed!
COG SPS1 239aa 5e-22
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
Changed!
cd S_TKc 264aa 1e-34
in modified transcript
Changed!
pfam Pkinase 212aa 3e-32
in modified transcript
Changed!
COG SPS1 211aa 2e-25
in modified transcript
CSNK1D
refseq_CSNK1D.F2 refseq_CSNK1D.R2 238 302
NCBIGene 36.3 1453
Single exon skipping, size difference: 64
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001893
cd S_TKc 264aa 7e-42
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
pfam Pkinase 253aa 4e-42
in ref transcript
Protein kinase domain.
COG SPS1 277aa 1e-26
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
CSNK1G3
refseq_CSNK1G3.F1 refseq_CSNK1G3.R1 122 146
NCBIGene 36.3 1456
Single exon skipping, size difference: 24
Exclusion in the protein (no frameshift)
Reference transcript: NM_001044723
cd S_TKc 265aa 6e-37
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
pfam Pkinase 257aa 3e-37
in ref transcript
Protein kinase domain.
COG SPS1 325aa 4e-27
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
CSNK1G3
refseq_CSNK1G3.F3 refseq_CSNK1G3.R3 333 405
NCBIGene 36.3 1456
Single exon skipping, size difference: 96
Exclusion in the protein (no frameshift)
Reference transcript: NM_001044723
cd S_TKc 265aa 6e-37
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
pfam Pkinase 257aa 3e-37
in ref transcript
Protein kinase domain.
Changed!
COG SPS1 325aa 4e-27
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
Changed!
COG SPS1 349aa 2e-27
in modified transcript
CSNK2A1
refseq_CSNK2A1.F2 refseq_CSNK2A1.R2 225 342
NCBIGene 36.3 1457
Single exon skipping, size difference: 117
Exclusion in 5'UTR
Reference transcript: NM_177559
cd S_TKc 287aa 4e-46
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
pfam Pkinase 286aa 1e-57
in ref transcript
Protein kinase domain.
COG SPS1 287aa 3e-26
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
CST11
refseq_CST11.F1 refseq_CST11.R1 227 332
NCBIGene 36.3 140880
Single exon skipping, size difference: 105
Exclusion in the protein (no frameshift)
Reference transcript: NM_130794
Changed!
cd CY 92aa 3e-09
in ref transcript
Cystatin-like domain; Cystatins are a family of cysteine protease inhibitors that occur mainly as single domain proteins. However some extracellular proteins such as kininogen, His-rich glycoprotein and fetuin also contain these domains.
Changed!
smart CY 83aa 5e-11
in ref transcript
Cystatin-like domain. Cystatins are a family of cysteine protease inhibitors that occur mainly as single domain proteins. However some extracellular proteins such as kininogen, His-rich glycoprotein and fetuin also contain these domains.
Changed!
smart CY 26aa 0.003
in modified transcript
CSTF1
refseq_CSTF1.F2 refseq_CSTF1.R2 117 219
NCBIGene 36.3 1477
Alternative 5-prime, size difference: 102
Exclusion in 5'UTR
Reference transcript: NM_001033522
cd WD40 325aa 1e-44
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
pfam WD40 36aa 2e-06
in ref transcript
WD domain, G-beta repeat.
pfam WD40 30aa 7e-04
in ref transcript
COG COG2319 345aa 5e-21
in ref transcript
FOG: WD40 repeat [General function prediction only].
CSTF1
refseq_CSTF1.F3 refseq_CSTF1.R1 149 324
NCBIGene 36.3 1477
Alternative 5-prime, size difference: 175
Exclusion in 5'UTR
Reference transcript: NM_001324
cd WD40 325aa 1e-44
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
pfam WD40 36aa 2e-06
in ref transcript
WD domain, G-beta repeat.
pfam WD40 30aa 7e-04
in ref transcript
COG COG2319 345aa 5e-21
in ref transcript
FOG: WD40 repeat [General function prediction only].
CTBP1
refseq_CTBP1.F1 refseq_CTBP1.R1 186 381
NCBIGene 36.3 1487
Single exon skipping, size difference: 195
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001328
Changed!
pfam 2-Hacid_dh 291aa 2e-58
in ref transcript
D-isomer specific 2-hydroxyacid dehydrogenase, catalytic domain. This family represents the largest portion of the catalytic domain of 2-hydroxyacid dehydrogenases as the NAD binding domain is inserted within the structural domain.
Changed!
COG LdhA 296aa 3e-61
in ref transcript
Lactate dehydrogenase and related dehydrogenases [Energy production and conversion / Coenzyme metabolism / General function prediction only].
CTDSPL
refseq_CTDSPL.F2 refseq_CTDSPL.R2 144 177
NCBIGene 36.3 10217
Single exon skipping, size difference: 33
Exclusion in the protein (no frameshift)
Reference transcript: NM_001008392
Changed!
TIGR HIF-SF_euk 160aa 5e-72
in ref transcript
This domain is related to domains found in FCP1-like phosphatases (TIGR02250), and together both are detected by the pfam03031.
COG FCP1 176aa 7e-47
in ref transcript
TFIIF-interacting CTD phosphatases, including NLI-interacting factor [Transcription].
Changed!
pfam NIF 165aa 6e-72
in modified transcript
NLI interacting factor-like phosphatase. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase. This region has been identified as the minimal phosphatase domain.
CTF8
refseq_CTF8.F1 refseq_CTF8.R1 149 207
NCBIGene 36.3 54921
Single exon skipping, size difference: 58
Exclusion in 5'UTR
Reference transcript: NM_001039690
CTLA4
refseq_CTLA4.F2 refseq_CTLA4.R2 213 323
NCBIGene 36.3 1493
Single exon skipping, size difference: 110
Exclusion in the protein causing a frameshift
Reference transcript: NM_005214
smart IGv 74aa 2e-06
in ref transcript
Immunoglobulin V-Type.
CTNNBIP1
refseq_CTNNBIP1.F1 refseq_CTNNBIP1.R1 187 221
NCBIGene 36.3 56998
Single exon skipping, size difference: 34
Exclusion in 5'UTR
Reference transcript: NM_020248
CTSE
refseq_CTSE.F1 refseq_CTSE.R1 109 251
NCBIGene 36.3 1510
Single exon skipping, size difference: 142
Exclusion in the protein causing a frameshift
Reference transcript: NM_001910
Changed!
cd Cathespin_E 316aa 1e-176
in ref transcript
Cathepsin E, non-lysosomal aspartic protease. Cathepsin E is an intracellular, non-lysosomal aspartic protease expressed in a variety of cells and tissues. The protease has proposed physiological roles in antigen presentation by the MHC class II system, in the biogenesis of the vasoconstrictor peptide endothelin, and in neurodegeneration associated with brain ischemia and aging. Cathepsin E is the only A1 aspartic protease that exists as a homodimer with a disulfide bridge linking the two monomers. Like many other aspartic proteases, it is synthesized as a zymogen which is catalytically inactive towards its natural substrates at neutral pH and which auto-activates in an acidic environment. The overall structure follows the general fold of aspartic proteases of the A1 family, it is composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalytic Asp residues are contained in an Asp-Thr-Gly-Ser/thr motif in both N- and C-terminal lobes of the enzyme. The aspartic acid residues act together to allow a water molecule to attack the peptide bond. One aspartic acid residue (in its deprotonated form) activates the attacking water molecule, whereas the other aspartic acid residue (in its protonated form) polarizes the peptide carbonyl, increasing its susceptibility to attack. This family of aspartate proteases is classified by MEROPS as the peptidase family A1 (pepsin A, clan AA).
Changed!
pfam Asp 318aa 1e-128
in ref transcript
Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), which are much smaller and appear to be homologous to a single domain of the eukaryotic asp proteases.
pfam A1_Propeptide 29aa 5e-05
in ref transcript
A1 Propeptide. Most eukaryotic endopeptidases (Merops Family A1) are synthesised with signal and propeptides. The animal pepsin-like endopeptidase propeptides form a distinct family of propeptides, which contain a conserved motif approximately 30 residues long. In pepsinogen A, the first 11 residues of the mature pepsin sequence are displaced by residues of the propeptide. The propeptide contains two helices that block the active site cleft, in particular the conserved Asp11 residue, in pepsin, hydrogen bonds to a conserved Arg residues in the propeptide. This hydrogen bond stabilises the propeptide conformation and is probably responsible for triggering the conversion of pepsinogen to pepsin under acidic conditions.
Changed!
PTZ PTZ00165 368aa 2e-67
in ref transcript
aspartyl protease; Provisional.
Changed!
cd Cathespin_E 186aa 1e-101
in modified transcript
Changed!
pfam Asp 185aa 3e-77
in modified transcript
Changed!
PTZ PTZ00165 238aa 4e-40
in modified transcript
CTSL1
refseq_CTSL.F2 refseq_CTSL.R2 224 369
NCBIGene 36.3 1514
Alternative 5-prime, size difference: 145
Exclusion in 5'UTR
Reference transcript: NM_001912
cd Peptidase_C1A 217aa 4e-83
in ref transcript
Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to hatch or to evade the host immune system. Mammalian CPs are primarily lysosomal enzymes with the exception of cathepsin W, which is retained in the endoplasmic reticulum. They are responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. In addition to its inhibitory role, the propeptide is required for proper folding of the newly synthesized enzyme and its stabilization in denaturing pH conditions. Residues within the propeptide region also play a role in the transport of the proenzyme to lysosomes or acidified vesicles. Also included in this subfamily are proteins classified as non-peptidase homologs, which lack peptidase activity or have missing active site residues.
pfam Peptidase_C1 219aa 9e-99
in ref transcript
Papain family cysteine protease.
pfam Inhibitor_I29 60aa 1e-14
in ref transcript
Cathepsin propeptide inhibitor domain (I29). This domain is found at the N-terminus of some C1 peptidases such as Cathepsin L where it acts as a propeptide. There are also a number of proteins that are composed solely of multiple copies of this domain such as the peptidase inhibitor salarin. This family is classified as I29 by MEROPS.
PTZ PTZ00203 288aa 1e-44
in ref transcript
cathepsin L protease; Provisional.
CTTN
refseq_CTTN.F1 refseq_CTTN.R1 123 234
NCBIGene 36.3 2017
Single exon skipping, size difference: 111
Exclusion in the protein (no frameshift)
Reference transcript: NM_005231
cd SH3 53aa 7e-14
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
smart SH3 53aa 3e-16
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
pfam HS1_rep 37aa 5e-12
in ref transcript
Repeat in HS1/Cortactin. The function of this repeat is unknown. Seven copies are found in cortactin and four copies are found in HS1. The repeats are always found amino terminal to an SH3 domain pfam00018.
pfam HS1_rep 37aa 2e-11
in ref transcript
pfam HS1_rep 37aa 6e-11
in ref transcript
Changed!
pfam HS1_rep 37aa 2e-10
in ref transcript
Changed!
pfam HS1_rep 35aa 4e-10
in ref transcript
pfam HS1_rep 37aa 1e-09
in ref transcript
Changed!
pfam HS1_rep 37aa 3e-11
in modified transcript
CYB5A
refseq_CYB5A.F1 refseq_CYB5A.R1 124 148
NCBIGene 36.3 1528
Single exon skipping, size difference: 24
Inclusion in the protein causing a new stop codon
Reference transcript: NM_148923
pfam Cyt-b5 75aa 1e-24
in ref transcript
Cytochrome b5-like Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anchored by an N-terminal transmembrane alpha helix. This family also includes a domain in some chitin synthases. There is no known ligand for this domain in the chitin synthases.
COG CYB5 81aa 6e-16
in ref transcript
Cytochrome b involved in lipid metabolism [Energy production and conversion / Lipid metabolism].
CLIP2
refseq_CYLN2.F1 refseq_CYLN2.R1 257 362
NCBIGene 36.3 7461
Single exon skipping, size difference: 105
Exclusion in the protein (no frameshift)
Reference transcript: NM_003388
pfam CAP_GLY 66aa 1e-26
in ref transcript
CAP-Gly domain. Cytoskeleton-associated proteins (CAPs) are involved in the organisation of microtubules and transportation of vesicles and organelles along the cytoskeletal network. A conserved motif, CAP-Gly, has been identified in a number of CAPs, including CLIP-170 and dynactins. The crystal structure of Caenorhabditis elegans F53F4.3 protein CAP-Gly domain was recently solved. The domain contains three beta-strands. The most conserved sequence, GKNDG, is located in two consecutive sharp turns on the surface, forming the entrance to a groove.
pfam CAP_GLY 66aa 2e-24
in ref transcript
Changed!
TIGR SMC_prok_B 354aa 2e-08
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
Changed!
pfam Myosin_tail_1 592aa 2e-07
in ref transcript
Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
COG NIP100 61aa 5e-13
in ref transcript
Dynactin complex subunit involved in mitotic spindle partitioning in anaphase B [Cell division and chromosome partitioning].
COG NIP100 62aa 5e-12
in ref transcript
Changed!
COG Smc 657aa 6e-10
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
Changed!
TIGR SMC_prok_B 313aa 8e-09
in modified transcript
Changed!
TIGR SMC_prok_B 217aa 2e-07
in modified transcript
Changed!
pfam Myosin_tail_1 557aa 2e-07
in modified transcript
Changed!
COG Smc 399aa 2e-08
in modified transcript
Changed!
COG Smc 350aa 2e-06
in modified transcript
CYP11B1
refseq_CYP11B1.F2 refseq_CYP11B1.R2 173 371
NCBIGene 36.3 1584
Single exon skipping, size difference: 198
Exclusion in the protein (no frameshift)
Reference transcript: NM_000497
Changed!
pfam p450 448aa 2e-93
in ref transcript
Cytochrome P450. Cytochrome P450s are haem-thiolate proteins involved in the oxidative degradation of various compounds. They are particularly well known for their role in the degradation of environmental toxins and mutagens. They can be divided into 4 classes, according to the method by which electrons from NAD(P)H are delivered to the catalytic site. Sequence conservation is relatively low within the family - there are only 3 absolutely conserved residues - but their general topography and structural fold are highly conserved. The conserved core is composed of a coil termed the 'meander', a four-helix bundle, helices J and K, and two sets of beta-sheets. These constitute the haem-binding loop (with an absolutely conserved cysteine that serves as the 5th ligand for the haem iron), the proton-transfer groove and the absolutely conserved EXXR motif in helix K. While prokaryotic P450s are soluble proteins, most eukaryotic P450s are associated with microsomal membranes. their general enzymatic function is to catalyse regiospecific and stereospecific oxidation of non-activated hydrocarbons at physiological temperatures.
Changed!
COG CypX 460aa 5e-29
in ref transcript
Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism].
Changed!
pfam p450 359aa 1e-66
in modified transcript
Changed!
COG CypX 360aa 1e-15
in modified transcript
CYP19A1
refseq_CYP19A1.F1 refseq_CYP19A1.R1 232 341
NCBIGene 36.3 1588
Single exon skipping, size difference: 109
Exclusion in 5'UTR
Reference transcript: NM_031226
pfam p450 424aa 1e-90
in ref transcript
Cytochrome P450. Cytochrome P450s are haem-thiolate proteins involved in the oxidative degradation of various compounds. They are particularly well known for their role in the degradation of environmental toxins and mutagens. They can be divided into 4 classes, according to the method by which electrons from NAD(P)H are delivered to the catalytic site. Sequence conservation is relatively low within the family - there are only 3 absolutely conserved residues - but their general topography and structural fold are highly conserved. The conserved core is composed of a coil termed the 'meander', a four-helix bundle, helices J and K, and two sets of beta-sheets. These constitute the haem-binding loop (with an absolutely conserved cysteine that serves as the 5th ligand for the haem iron), the proton-transfer groove and the absolutely conserved EXXR motif in helix K. While prokaryotic P450s are soluble proteins, most eukaryotic P450s are associated with microsomal membranes. their general enzymatic function is to catalyse regiospecific and stereospecific oxidation of non-activated hydrocarbons at physiological temperatures.
COG CypX 415aa 2e-29
in ref transcript
Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism].
CYP20A1
refseq_CYP20A1.F1 refseq_CYP20A1.R1 303 382
NCBIGene 36.2 57404
Single exon skipping, size difference: 79
Exclusion in the protein causing a frameshift
Reference transcript: NM_177538
Changed!
pfam p450 375aa 1e-26
in ref transcript
Cytochrome P450. Cytochrome P450s are haem-thiolate proteins involved in the oxidative degradation of various compounds. They are particularly well known for their role in the degradation of environmental toxins and mutagens. They can be divided into 4 classes, according to the method by which electrons from NAD(P)H are delivered to the catalytic site. Sequence conservation is relatively low within the family - there are only 3 absolutely conserved residues - but their general topography and structural fold are highly conserved. The conserved core is composed of a coil termed the 'meander', a four-helix bundle, helices J and K, and two sets of beta-sheets. These constitute the haem-binding loop (with an absolutely conserved cysteine that serves as the 5th ligand for the haem iron), the proton-transfer groove and the absolutely conserved EXXR motif in helix K. While prokaryotic P450s are soluble proteins, most eukaryotic P450s are associated with microsomal membranes. their general enzymatic function is to catalyse regiospecific and stereospecific oxidation of non-activated hydrocarbons at physiological temperatures.
Changed!
COG CypX 238aa 4e-13
in ref transcript
Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism].
Changed!
pfam p450 46aa 2e-05
in modified transcript
CYP2A7
refseq_CYP2A7.F1 refseq_CYP2A7.R1 147 300
NCBIGene 36.3 1549
Exon skipping and alternative 3-prime or 5-prime, size difference: 153
Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_000764
Changed!
pfam p450 458aa 1e-131
in ref transcript
Cytochrome P450. Cytochrome P450s are haem-thiolate proteins involved in the oxidative degradation of various compounds. They are particularly well known for their role in the degradation of environmental toxins and mutagens. They can be divided into 4 classes, according to the method by which electrons from NAD(P)H are delivered to the catalytic site. Sequence conservation is relatively low within the family - there are only 3 absolutely conserved residues - but their general topography and structural fold are highly conserved. The conserved core is composed of a coil termed the 'meander', a four-helix bundle, helices J and K, and two sets of beta-sheets. These constitute the haem-binding loop (with an absolutely conserved cysteine that serves as the 5th ligand for the haem iron), the proton-transfer groove and the absolutely conserved EXXR motif in helix K. While prokaryotic P450s are soluble proteins, most eukaryotic P450s are associated with microsomal membranes. their general enzymatic function is to catalyse regiospecific and stereospecific oxidation of non-activated hydrocarbons at physiological temperatures.
Changed!
COG CypX 429aa 2e-32
in ref transcript
Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism].
Changed!
pfam p450 378aa 1e-105
in modified transcript
Changed!
COG CypX 233aa 2e-27
in modified transcript
CYP2D6
refseq_CYP2D6.F1 refseq_CYP2D6.R1 218 371
NCBIGene 36.3 1565
Single exon skipping, size difference: 153
Exclusion in the protein (no frameshift)
Reference transcript: NM_000106
Changed!
pfam p450 436aa 1e-120
in ref transcript
Cytochrome P450. Cytochrome P450s are haem-thiolate proteins involved in the oxidative degradation of various compounds. They are particularly well known for their role in the degradation of environmental toxins and mutagens. They can be divided into 4 classes, according to the method by which electrons from NAD(P)H are delivered to the catalytic site. Sequence conservation is relatively low within the family - there are only 3 absolutely conserved residues - but their general topography and structural fold are highly conserved. The conserved core is composed of a coil termed the 'meander', a four-helix bundle, helices J and K, and two sets of beta-sheets. These constitute the haem-binding loop (with an absolutely conserved cysteine that serves as the 5th ligand for the haem iron), the proton-transfer groove and the absolutely conserved EXXR motif in helix K. While prokaryotic P450s are soluble proteins, most eukaryotic P450s are associated with microsomal membranes. their general enzymatic function is to catalyse regiospecific and stereospecific oxidation of non-activated hydrocarbons at physiological temperatures.
Changed!
COG CypX 415aa 1e-31
in ref transcript
Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism].
Changed!
pfam p450 385aa 1e-101
in modified transcript
Changed!
COG CypX 263aa 1e-26
in modified transcript
DAP3
refseq_DAP3.F2 refseq_DAP3.R2 202 271
NCBIGene 36.3 7818
Alternative 5-prime, size difference: 69
Exclusion in 5'UTR
Reference transcript: NM_033657
pfam DAP3 296aa 8e-93
in ref transcript
Mitochondrial ribosomal death-associated protein 3. This is a family of conserved proteins which were originally described as death-associated-protein-3 (DAP-3). The proteins carry a P-loop DNA-binding motif, and induce apoptosis. DAP3 has been shown to be a pro-apoptotic factor in the mitochondrial matrix and to be crucial for mitochondrial biogenesis and so has also been designated as MRP-S29 (mitochondrial ribosomal protein subunit 29).
DAZAP1
refseq_DAZAP1.F1 refseq_DAZAP1.R1 189 278
NCBIGene 36.3 26528
Single exon skipping, size difference: 89
Inclusion in the protein causing a new stop codon
Reference transcript: NM_018959
cd RRM 74aa 1e-17
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
cd RRM 70aa 1e-15
in ref transcript
smart RRM 70aa 8e-18
in ref transcript
RNA recognition motif.
smart RRM 70aa 2e-16
in ref transcript
TIGR SF-CC1 174aa 4e-15
in ref transcript
A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.
COG COG0724 172aa 2e-14
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
DCHS2
refseq_DCHS2.F2 refseq_DCHS2.R2 100 136
NCBIGene 36.2 54798
Single exon skipping, size difference: 36
Exclusion in the protein (no frameshift)
Reference transcript: NM_017639
cd CA 199aa 1e-48
in ref transcript
Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here.
cd CA 197aa 4e-45
in ref transcript
cd CA 198aa 2e-40
in ref transcript
cd CA 194aa 2e-39
in ref transcript
cd CA 209aa 4e-39
in ref transcript
cd CA 206aa 1e-38
in ref transcript
cd CA 196aa 2e-38
in ref transcript
cd CA 200aa 2e-37
in ref transcript
cd CA 239aa 7e-34
in ref transcript
cd CA 195aa 2e-33
in ref transcript
cd CA 189aa 3e-33
in ref transcript
cd CA 201aa 1e-32
in ref transcript
cd CA 192aa 3e-32
in ref transcript
cd CA 201aa 2e-30
in ref transcript
Changed!
cd CA 167aa 1e-26
in ref transcript
cd CA 207aa 5e-25
in ref transcript
cd CA 197aa 2e-24
in ref transcript
cd CA 193aa 2e-20
in ref transcript
smart CA 79aa 7e-23
in ref transcript
Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
smart CA 79aa 5e-21
in ref transcript
smart CA 78aa 1e-19
in ref transcript
smart CA 79aa 1e-18
in ref transcript
pfam Cadherin 91aa 3e-17
in ref transcript
Cadherin domain.
smart CA 77aa 3e-16
in ref transcript
pfam Cadherin 89aa 6e-16
in ref transcript
smart CA 90aa 1e-15
in ref transcript
smart CA 73aa 8e-15
in ref transcript
smart CA 68aa 1e-13
in ref transcript
smart CA 75aa 2e-12
in ref transcript
pfam Cadherin 79aa 4e-12
in ref transcript
Changed!
smart CA 94aa 4e-11
in ref transcript
pfam Cadherin 95aa 2e-10
in ref transcript
pfam Cadherin 89aa 5e-10
in ref transcript
pfam Cadherin 90aa 6e-10
in ref transcript
smart CA 121aa 7e-10
in ref transcript
smart CA 70aa 3e-09
in ref transcript
smart CA 67aa 1e-07
in ref transcript
pfam Cadherin 81aa 2e-06
in ref transcript
pfam Cadherin 51aa 1e-04
in ref transcript
Changed!
cd CA 155aa 6e-28
in modified transcript
Changed!
smart CA 82aa 2e-12
in modified transcript
DCHS2
refseq_DCHS2.F4 refseq_DCHS2.R4 103 388
NCBIGene 36.2 54798
Multiple exon skipping, size difference: 285
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_017639
cd CA 199aa 1e-48
in ref transcript
Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here.
cd CA 197aa 4e-45
in ref transcript
cd CA 198aa 2e-40
in ref transcript
cd CA 194aa 2e-39
in ref transcript
cd CA 209aa 4e-39
in ref transcript
cd CA 206aa 1e-38
in ref transcript
cd CA 196aa 2e-38
in ref transcript
cd CA 200aa 2e-37
in ref transcript
cd CA 239aa 7e-34
in ref transcript
cd CA 195aa 2e-33
in ref transcript
cd CA 189aa 3e-33
in ref transcript
cd CA 201aa 1e-32
in ref transcript
cd CA 192aa 3e-32
in ref transcript
cd CA 201aa 2e-30
in ref transcript
Changed!
cd CA 167aa 1e-26
in ref transcript
cd CA 207aa 5e-25
in ref transcript
cd CA 197aa 2e-24
in ref transcript
cd CA 193aa 2e-20
in ref transcript
smart CA 79aa 7e-23
in ref transcript
Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
smart CA 79aa 5e-21
in ref transcript
smart CA 78aa 1e-19
in ref transcript
smart CA 79aa 1e-18
in ref transcript
pfam Cadherin 91aa 3e-17
in ref transcript
Cadherin domain.
smart CA 77aa 3e-16
in ref transcript
pfam Cadherin 89aa 6e-16
in ref transcript
smart CA 90aa 1e-15
in ref transcript
smart CA 73aa 8e-15
in ref transcript
smart CA 68aa 1e-13
in ref transcript
smart CA 75aa 2e-12
in ref transcript
pfam Cadherin 79aa 4e-12
in ref transcript
smart CA 94aa 4e-11
in ref transcript
pfam Cadherin 95aa 2e-10
in ref transcript
pfam Cadherin 89aa 5e-10
in ref transcript
pfam Cadherin 90aa 6e-10
in ref transcript
smart CA 121aa 7e-10
in ref transcript
smart CA 70aa 3e-09
in ref transcript
smart CA 67aa 1e-07
in ref transcript
pfam Cadherin 81aa 2e-06
in ref transcript
Changed!
pfam Cadherin 51aa 1e-04
in ref transcript
Changed!
cd CA 207aa 2e-32
in modified transcript
Changed!
cd CA 197aa 3e-31
in modified transcript
Changed!
smart CA 71aa 3e-09
in modified transcript
DCHS2
refseq_DCHS2.F6 refseq_DCHS2.R6 162 294
NCBIGene 36.2 54798
Single exon skipping, size difference: 132
Exclusion in the protein (no frameshift)
Reference transcript: NM_017639
cd CA 199aa 1e-48
in ref transcript
Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here.
cd CA 197aa 4e-45
in ref transcript
cd CA 198aa 2e-40
in ref transcript
cd CA 194aa 2e-39
in ref transcript
cd CA 209aa 4e-39
in ref transcript
cd CA 206aa 1e-38
in ref transcript
cd CA 196aa 2e-38
in ref transcript
cd CA 200aa 2e-37
in ref transcript
Changed!
cd CA 239aa 7e-34
in ref transcript
cd CA 195aa 2e-33
in ref transcript
cd CA 189aa 3e-33
in ref transcript
cd CA 201aa 1e-32
in ref transcript
cd CA 192aa 3e-32
in ref transcript
cd CA 201aa 2e-30
in ref transcript
cd CA 167aa 1e-26
in ref transcript
cd CA 207aa 5e-25
in ref transcript
cd CA 197aa 2e-24
in ref transcript
cd CA 193aa 2e-20
in ref transcript
smart CA 79aa 7e-23
in ref transcript
Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
smart CA 79aa 5e-21
in ref transcript
smart CA 78aa 1e-19
in ref transcript
smart CA 79aa 1e-18
in ref transcript
pfam Cadherin 91aa 3e-17
in ref transcript
Cadherin domain.
smart CA 77aa 3e-16
in ref transcript
pfam Cadherin 89aa 6e-16
in ref transcript
smart CA 90aa 1e-15
in ref transcript
smart CA 73aa 8e-15
in ref transcript
smart CA 68aa 1e-13
in ref transcript
smart CA 75aa 2e-12
in ref transcript
pfam Cadherin 79aa 4e-12
in ref transcript
smart CA 94aa 4e-11
in ref transcript
pfam Cadherin 95aa 2e-10
in ref transcript
pfam Cadherin 89aa 5e-10
in ref transcript
pfam Cadherin 90aa 6e-10
in ref transcript
Changed!
smart CA 121aa 7e-10
in ref transcript
smart CA 70aa 3e-09
in ref transcript
smart CA 67aa 1e-07
in ref transcript
pfam Cadherin 81aa 2e-06
in ref transcript
pfam Cadherin 51aa 1e-04
in ref transcript
Changed!
cd CA 195aa 8e-40
in modified transcript
Changed!
cd CA 199aa 3e-32
in modified transcript
Changed!
smart CA 77aa 3e-16
in modified transcript
DCLRE1C
refseq_DCLRE1C.F1 refseq_DCLRE1C.R1 116 172
NCBIGene 36.3 64421
Single exon skipping, size difference: 56
Exclusion in the protein causing a frameshift
Reference transcript: NM_001033855
Changed!
pfam DRMBL 99aa 2e-16
in ref transcript
DNA repair metallo-beta-lactamase. The metallo-beta-lactamase fold contains five sequence motifs. The first four motifs are found in pfam00753 and are common to all metallo-beta-lactamases. The fifth motif appears to be specific to function. This entry represents the fifth motif from metallo-beta-lactamases involved in DNA repair.
Changed!
smart Lactamase_B 115aa 3e-08
in ref transcript
Metallo-beta-lactamase superfamily. Apart from the beta-lactamases a number of other proteins contain this domain PUBMED:7588620. These proteins include thiolesterases, members of the glyoxalase II family, that catalyse the hydrolysis of S-D-lactoyl-glutathione to form glutathione and D-lactic acid and a competence protein that is essential for natural transformation in Neisseria gonorrhoeae and could be a transporter involved in DNA uptake. Except for the competence protein these proteins bind two zinc ions per molecule as cofactor.
Changed!
COG YSH1 175aa 1e-11
in ref transcript
Predicted exonuclease of the beta-lactamase fold involved in RNA processing [Translation, ribosomal structure and biogenesis].
Changed!
smart Lactamase_B 62aa 0.002
in modified transcript
Changed!
COG COG0595 79aa 0.009
in modified transcript
Predicted hydrolase of the metallo-beta-lactamase superfamily [General function prediction only].
DCLRE1C
refseq_DCLRE1C.F3 refseq_DCLRE1C.R3 172 257
NCBIGene 36.3 64421
Single exon skipping, size difference: 85
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001033855
Changed!
pfam DRMBL 99aa 2e-16
in ref transcript
DNA repair metallo-beta-lactamase. The metallo-beta-lactamase fold contains five sequence motifs. The first four motifs are found in pfam00753 and are common to all metallo-beta-lactamases. The fifth motif appears to be specific to function. This entry represents the fifth motif from metallo-beta-lactamases involved in DNA repair.
Changed!
smart Lactamase_B 115aa 3e-08
in ref transcript
Metallo-beta-lactamase superfamily. Apart from the beta-lactamases a number of other proteins contain this domain PUBMED:7588620. These proteins include thiolesterases, members of the glyoxalase II family, that catalyse the hydrolysis of S-D-lactoyl-glutathione to form glutathione and D-lactic acid and a competence protein that is essential for natural transformation in Neisseria gonorrhoeae and could be a transporter involved in DNA uptake. Except for the competence protein these proteins bind two zinc ions per molecule as cofactor.
Changed!
COG YSH1 175aa 1e-11
in ref transcript
Predicted exonuclease of the beta-lactamase fold involved in RNA processing [Translation, ribosomal structure and biogenesis].
DCN
refseq_DCN.F2 refseq_DCN.R2 102 429
NCBIGene 36.3 1634
Multiple exon skipping, size difference: 327
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_001920
Changed!
cd LRR_RI 217aa 0.002
in ref transcript
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
Changed!
pfam LRRNT 24aa 1e-05
in ref transcript
Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is often found at the N-terminus of tandem leucine rich repeats.
Changed!
COG COG4886 150aa 5e-05
in ref transcript
Leucine-rich repeat (LRR) protein [Function unknown].
Changed!
smart LRRNT 24aa 9e-04
in modified transcript
Leucine rich repeat N-terminal domain.
Changed!
COG COG4886 134aa 6e-05
in modified transcript
DCTD
refseq_DCTD.F2 refseq_DCTD.R2 101 124
NCBIGene 36.3 1635
Alternative 5-prime, size difference: 23
Exclusion in the protein causing a frameshift
Reference transcript: NM_001012732
Changed!
cd deoxycytidylate_deaminase 122aa 1e-37
in ref transcript
Deoxycytidylate deaminase domain. Deoxycytidylate deaminase catalyzes the deamination of dCMP to dUMP, providing the nucleotide substrate for thymidylate synthase. The enzyme binds Zn++, which is required for catalytic activity. The activity of the enzyme is allosterically regulated by the ratio of dCTP to dTTP not only in eukaryotic cells but also in T-even phage-infected Escherichia coli, with dCTP acting as an activator and dTTP as an inhibitor.
Changed!
pfam dCMP_cyt_deam_1 115aa 4e-24
in ref transcript
Cytidine and deoxycytidylate deaminase zinc-binding region.
Changed!
COG ComEB 152aa 1e-32
in ref transcript
Deoxycytidylate deaminase [Nucleotide transport and metabolism].
DCTN3
refseq_DCTN3.F1 refseq_DCTN3.R1 303 400
NCBIGene 36.3 11258
Alternative 3-prime, size difference: 97
Inclusion in the protein causing a frameshift
Reference transcript: NM_007234
Changed!
pfam Dynactin_p22 172aa 1e-82
in ref transcript
Dynactin subunit p22. This family contains p22, the smallest subunit of dynactin, a complex that binds to cytoplasmic dynein and is a required activator for cytoplasmic dynein-mediated vesicular transport. Dynactin localises to the cleavage furrow and to the midbodies of dividing cells, suggesting that it may function in cytokinesis. Family members are approximately 170 residues long and seem to be restricted to mammals.
Changed!
pfam Dynactin_p22 137aa 6e-70
in modified transcript
DCUN1D4
refseq_DCUN1D4.F1 refseq_DCUN1D4.R1 374 479
NCBIGene 36.3 23142
Single exon skipping, size difference: 105
Exclusion in the protein (no frameshift)
Reference transcript: NM_001040402
Changed!
pfam DUF298 113aa 3e-33
in ref transcript
Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif. This domain is implicated in some aspect of neddylation of the cullin 3 family and has a possible role in the regulation of the protein modifier Nedd8 E3 ligase. Neddylation is the process by which the C-terminal glycine of the ubiquitin-like protein Nedd8 is covalently linked to lysine residues in a protein through an isopeptide bond.
Changed!
pfam DUF298 78aa 2e-18
in modified transcript
DDO
refseq_DDO.F1 refseq_DDO.R1 165 342
NCBIGene 36.3 8528
Single exon skipping, size difference: 177
Exclusion in the protein (no frameshift)
Reference transcript: NM_003649
TIGR thiamin_ThiO 152aa 2e-09
in ref transcript
This family consists of the homotetrameric, FAD-dependent glycine oxidase ThiO, from species such as Bacillus subtilis that use glycine in thiamine biosynthesis. In general, members of this family will not be found in species such as E. coli that instead use tyrosine and the ThiH protein.
Changed!
pfam DAO 309aa 2e-09
in ref transcript
FAD dependent oxidoreductase. This family includes various FAD dependent oxidoreductases: Glycerol-3-phosphate dehydrogenase EC:1.1.99.5, Sarcosine oxidase beta subunit EC:1.5.3.1, D-alanine oxidase EC:1.4.99.1, D-aspartate oxidase EC:1.4.3.1.
Changed!
COG DadA 150aa 1e-04
in ref transcript
Glycine/D-amino acid oxidases (deaminating) [Amino acid transport and metabolism].
Changed!
COG DadA 193aa 1e-04
in modified transcript
DDR2
refseq_DDR2.F2 refseq_DDR2.R2 171 251
NCBIGene 36.3 4921
Single exon skipping, size difference: 80
Exclusion in 5'UTR
Reference transcript: NM_001014796
cd PTKc_DDR2 295aa 0.0
in ref transcript
Catalytic Domain of the Protein Tyrosine Kinase, Discoidin Domain Receptor 2. Protein Tyrosine Kinase (PTK) family; mammalian Discoidin domain receptor 2 (DDR2) and homologs; catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. DDR2 is a member of the DDR subfamily, which are receptor tyr kinases (RTKs) containing an extracellular discoidin homology domain, a transmembrane segment, an extended juxtamembrane region, and an intracellular catalytic domain. The binding of the ligand, collagen, to DDRs results in a slow but sustained receptor activation. DDR2 binds mostly to fibrillar collagens. More recently, it has been reported to also bind collagen X. DDR2 is widely expressed in many tissues with the highest levels found in skeletal muscle, skin, kidney and lung. It is important in cell proliferation and development. Mice, with a deletion of DDR2, suffer from dwarfism and delayed healing of epidermal wounds. DDR2 also contributes to collagen (type I) regulation by inhibiting fibrillogenesis and altering the morphology of collagen fibers. It is also expressed in immature dendritic cells (DCs), where it plays a role in DC activation and function.
cd FA58C 153aa 2e-37
in ref transcript
Coagulation factor 5/8 C-terminal domain, discoidin domain; Cell surface-attached carbohydrate-binding domain, present in eukaryotes and assumed to have horizontally transferred to eubacterial genomes.
Coagulation factor 5/8 C-terminal domain, discoidin domain. Cell surface-attached carbohydrate-binding domain, present in eukaryotes and assumed to have horizontally transferred to eubacterial genomes.
COG SPS1 289aa 1e-20
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
DDX11
refseq_DDX11.F1 refseq_DDX11.R1 122 272
NCBIGene 36.3 1663
Single exon skipping, size difference: 150
Exclusion in the protein (no frameshift)
Reference transcript: NM_152438
Changed!
TIGR rad3 596aa 1e-101
in ref transcript
All proteins in this family for which funcitons are known are DNA-DNA helicases that funciton in the initiation of transcription and nucleotide excision repair as part of the TFIIH complex. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
smart DEXDc3 68aa 6e-25
in ref transcript
DEAD-like helicases superfamily.
Changed!
COG DinG 590aa 3e-29
in ref transcript
Rad3-related DNA helicases [Transcription / DNA replication, recombination, and repair].
COG DinG 53aa 8e-10
in ref transcript
Changed!
TIGR rad3 546aa 7e-88
in modified transcript
Changed!
COG DinG 540aa 2e-23
in modified transcript
DDX19B
refseq_DDX19B.F1 refseq_DDX19B.R1 264 357
NCBIGene 36.3 11269
Single exon skipping, size difference: 93
Exclusion in the protein (no frameshift)
Reference transcript: NM_007242
Changed!
cd DEADc 203aa 2e-51
in ref transcript
DEAD-box helicases. A diverse family of proteins involved in ATP-dependent RNA unwinding, needed in a variety of cellular processes including splicing, ribosome biogenesis and RNA degradation. The name derives from the sequence of the Walker B motif (motif II). This domain contains the ATP- binding region.
cd HELICc 136aa 2e-26
in ref transcript
Helicase superfamily c-terminal domain; associated with DEXDc-, DEAD-, and DEAH-box proteins, yeast initiation factor 4A, Ski2p, and Hepatitis C virus NS3 helicases; this domain is found in a wide variety of helicases and helicase related proteins; may not be an autonomously folding unit, but an integral part of the helicase; 4 helicase superfamilies at present according to the organization of their signature motifs; all helicases share the ability to unwind nucleic acid duplexes with a distinct directional polarity; they utilize the free energy from nucleoside triphosphate hydrolysis to fuel their translocation along DNA, unwinding the duplex in the process.
Changed!
pfam DEAD 166aa 4e-32
in ref transcript
DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA splicing, ribosome biogenesis, nucleocytoplasmic transport, translation, RNA decay and organellar gene expression.
pfam Helicase_C 83aa 2e-22
in ref transcript
Helicase conserved C-terminal domain. The Prosite family is restricted to DEAD/H helicases, whereas this domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase.
TIGR recQ_fam 201aa 6e-07
in ref transcript
This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Changed!
COG SrmB 399aa 8e-84
in ref transcript
Superfamily II DNA and RNA helicases [DNA replication, recombination, and repair / Transcription / Translation, ribosomal structure and biogenesis].
Changed!
cd DEADc 166aa 7e-40
in modified transcript
Changed!
pfam DEAD 159aa 2e-30
in modified transcript
Changed!
COG SrmB 348aa 1e-73
in modified transcript
DDX19B
refseq_DDX19B.F3 refseq_DDX19B.R3 219 322
NCBIGene 36.3 11269
Multiple exon skipping, size difference: 103
Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
Reference transcript: NM_007242
Changed!
cd DEADc 203aa 2e-51
in ref transcript
DEAD-box helicases. A diverse family of proteins involved in ATP-dependent RNA unwinding, needed in a variety of cellular processes including splicing, ribosome biogenesis and RNA degradation. The name derives from the sequence of the Walker B motif (motif II). This domain contains the ATP- binding region.
Changed!
cd HELICc 136aa 2e-26
in ref transcript
Helicase superfamily c-terminal domain; associated with DEXDc-, DEAD-, and DEAH-box proteins, yeast initiation factor 4A, Ski2p, and Hepatitis C virus NS3 helicases; this domain is found in a wide variety of helicases and helicase related proteins; may not be an autonomously folding unit, but an integral part of the helicase; 4 helicase superfamilies at present according to the organization of their signature motifs; all helicases share the ability to unwind nucleic acid duplexes with a distinct directional polarity; they utilize the free energy from nucleoside triphosphate hydrolysis to fuel their translocation along DNA, unwinding the duplex in the process.
Changed!
pfam DEAD 166aa 4e-32
in ref transcript
DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA splicing, ribosome biogenesis, nucleocytoplasmic transport, translation, RNA decay and organellar gene expression.
Changed!
pfam Helicase_C 83aa 2e-22
in ref transcript
Helicase conserved C-terminal domain. The Prosite family is restricted to DEAD/H helicases, whereas this domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase.
Changed!
TIGR recQ_fam 201aa 6e-07
in ref transcript
This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Changed!
COG SrmB 399aa 8e-84
in ref transcript
Superfamily II DNA and RNA helicases [DNA replication, recombination, and repair / Transcription / Translation, ribosomal structure and biogenesis].
DDX42
refseq_DDX42.F1 refseq_DDX42.R1 113 184
NCBIGene 36.3 11325
Single exon skipping, size difference: 71
Exclusion in 5'UTR
Reference transcript: NM_007372
cd DEADc 205aa 2e-73
in ref transcript
DEAD-box helicases. A diverse family of proteins involved in ATP-dependent RNA unwinding, needed in a variety of cellular processes including splicing, ribosome biogenesis and RNA degradation. The name derives from the sequence of the Walker B motif (motif II). This domain contains the ATP- binding region.
cd HELICc 126aa 5e-28
in ref transcript
Helicase superfamily c-terminal domain; associated with DEXDc-, DEAD-, and DEAH-box proteins, yeast initiation factor 4A, Ski2p, and Hepatitis C virus NS3 helicases; this domain is found in a wide variety of helicases and helicase related proteins; may not be an autonomously folding unit, but an integral part of the helicase; 4 helicase superfamilies at present according to the organization of their signature motifs; all helicases share the ability to unwind nucleic acid duplexes with a distinct directional polarity; they utilize the free energy from nucleoside triphosphate hydrolysis to fuel their translocation along DNA, unwinding the duplex in the process.
pfam DEAD 172aa 1e-56
in ref transcript
DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA splicing, ribosome biogenesis, nucleocytoplasmic transport, translation, RNA decay and organellar gene expression.
smart HELICc 82aa 2e-24
in ref transcript
helicase superfamily c-terminal domain.
TIGR recG 309aa 1e-06
in ref transcript
PTZ PTZ00110 445aa 1e-116
in ref transcript
helicase; Provisional.
DDX47
refseq_DDX47.F2 refseq_DDX47.R2 162 309
NCBIGene 36.3 51202
Single exon skipping, size difference: 147
Exclusion in the protein (no frameshift)
Reference transcript: NM_016355
cd DEADc 202aa 5e-66
in ref transcript
DEAD-box helicases. A diverse family of proteins involved in ATP-dependent RNA unwinding, needed in a variety of cellular processes including splicing, ribosome biogenesis and RNA degradation. The name derives from the sequence of the Walker B motif (motif II). This domain contains the ATP- binding region.
Changed!
cd HELICc 130aa 3e-28
in ref transcript
Helicase superfamily c-terminal domain; associated with DEXDc-, DEAD-, and DEAH-box proteins, yeast initiation factor 4A, Ski2p, and Hepatitis C virus NS3 helicases; this domain is found in a wide variety of helicases and helicase related proteins; may not be an autonomously folding unit, but an integral part of the helicase; 4 helicase superfamilies at present according to the organization of their signature motifs; all helicases share the ability to unwind nucleic acid duplexes with a distinct directional polarity; they utilize the free energy from nucleoside triphosphate hydrolysis to fuel their translocation along DNA, unwinding the duplex in the process.
pfam DEAD 168aa 3e-48
in ref transcript
DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA splicing, ribosome biogenesis, nucleocytoplasmic transport, translation, RNA decay and organellar gene expression.
Changed!
pfam Helicase_C 78aa 2e-23
in ref transcript
Helicase conserved C-terminal domain. The Prosite family is restricted to DEAD/H helicases, whereas this domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase.
Changed!
TIGR recQ_fam 373aa 4e-18
in ref transcript
This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Changed!
COG SrmB 424aa 1e-103
in ref transcript
Superfamily II DNA and RNA helicases [DNA replication, recombination, and repair / Transcription / Translation, ribosomal structure and biogenesis].
Changed!
cd HELICc 67aa 9e-17
in modified transcript
Changed!
pfam Helicase_C 60aa 9e-17
in modified transcript
Changed!
COG SrmB 375aa 2e-84
in modified transcript
DDX52
refseq_DDX52.F1 refseq_DDX52.R1 247 362
NCBIGene 36.3 11056
Single exon skipping, size difference: 115
Inclusion in the protein causing a new stop codon
Reference transcript: NM_007010
Changed!
cd DEADc 208aa 1e-41
in ref transcript
DEAD-box helicases. A diverse family of proteins involved in ATP-dependent RNA unwinding, needed in a variety of cellular processes including splicing, ribosome biogenesis and RNA degradation. The name derives from the sequence of the Walker B motif (motif II). This domain contains the ATP- binding region.
Changed!
cd HELICc 124aa 2e-27
in ref transcript
Helicase superfamily c-terminal domain; associated with DEXDc-, DEAD-, and DEAH-box proteins, yeast initiation factor 4A, Ski2p, and Hepatitis C virus NS3 helicases; this domain is found in a wide variety of helicases and helicase related proteins; may not be an autonomously folding unit, but an integral part of the helicase; 4 helicase superfamilies at present according to the organization of their signature motifs; all helicases share the ability to unwind nucleic acid duplexes with a distinct directional polarity; they utilize the free energy from nucleoside triphosphate hydrolysis to fuel their translocation along DNA, unwinding the duplex in the process.
Changed!
pfam DEAD 173aa 2e-37
in ref transcript
DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA splicing, ribosome biogenesis, nucleocytoplasmic transport, translation, RNA decay and organellar gene expression.
Changed!
pfam Helicase_C 77aa 3e-24
in ref transcript
Helicase conserved C-terminal domain. The Prosite family is restricted to DEAD/H helicases, whereas this domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase.
Changed!
TIGR recQ 349aa 1e-09
in ref transcript
The ATP-dependent DNA helicase RecQ of E. coli is about 600 residues long. This model represents bacterial proteins with a high degree of similarity in domain architecture and in primary sequence to E. coli RecQ. The model excludes eukaryotic and archaeal proteins with RecQ-like regions, as well as more distantly related bacterial helicases related to RecQ.
Changed!
COG SrmB 398aa 7e-80
in ref transcript
Superfamily II DNA and RNA helicases [DNA replication, recombination, and repair / Transcription / Translation, ribosomal structure and biogenesis].
DEDD
refseq_DEDD.F1 refseq_DEDD.R1 168 201
NCBIGene 36.3 9191
Single exon skipping, size difference: 33
Exclusion in 5'UTR
Reference transcript: NM_032998
cd DED 78aa 7e-11
in ref transcript
Death effector domain. DED is part of a superfamily of death domains which also includes death-domain (DD) and caspase recruitment domain (CARD). Protein-protein interactions involving these domains occur through homotypic interactions, such as DED-DED. Caspases are the primary executioners of apoptosis via proteolytic cascades, and upstream caspases such as caspase-8 and caspase-9 are activated by signaling complexes such as the death inducing signaling complex (DISC) and the apoptosome. Binding of caspases to specific adaptor molecules via DED or CARD domains leads to autoactivation of caspases.
pfam DED 85aa 3e-15
in ref transcript
Death effector domain.
DEFB119
refseq_DEFB119.F1 refseq_DEFB119.R1 224 264
NCBIGene 36.3 245932
Single exon skipping, size difference: 40
Inclusion in the protein causing a frameshift
Reference transcript: NM_153289
DGKH
refseq_DGKH.F2 refseq_DGKH.R2 109 240
NCBIGene 36.3 160851
Single exon skipping, size difference: 131
Exclusion in the protein causing a frameshift
Reference transcript: NM_178009
Changed!
cd SAM 62aa 2e-14
in ref transcript
Sterile alpha motif.; Widespread domain in signalling and nuclear proteins. In EPH-related tyrosine kinases, appears to mediate cell-cell initiated signal transduction via the binding of SH2-containing proteins to a conserved tyrosine that is phosphorylated. In many cases mediates homodimerization.
cd PH 89aa 5e-13
in ref transcript
Pleckstrin homology (PH) domain. PH domains are only found in eukaryotes. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.
cd C1 50aa 5e-12
in ref transcript
Protein kinase C conserved region 1 (C1) . Cysteine-rich zinc binding domain. Some members of this domain family bind phorbol esters and diacylglycerol, some are reported to bind RasGTP. May occur in tandem arrangement. Diacylglycerol (DAG) is a second messenger, released by activation of Phospholipase D. Phorbol Esters (PE) can act as analogues of DAG and mimic its downstream effects in, for example, tumor promotion. Protein Kinases C are activated by DAG/PE, this activation is mediated by their N-terminal conserved region (C1). DAG/PE binding may be phospholipid dependent. C1 domains may also mediate DAG/PE signals in chimaerins (a family of Rac GTPase activating proteins), RasGRPs (exchange factors for Ras/Rap1), and Munc13 isoforms (scaffolding proteins involved in exocytosis).
cd C1 51aa 4e-06
in ref transcript
smart DAGKa 158aa 3e-68
in ref transcript
Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. DAG can be produced from the hydrolysis of phosphatidylinositol 4,5-bisphosphate (PIP2) by a phosphoinositide-specific phospholipase C and by the degradation of phosphatidylcholine (PC) by a phospholipase C or the concerted actions of phospholipase D and phosphatidate phosphohydrolase. This domain might either be an accessory domain or else contribute to the catalytic domain. Bacterial homologues are known.
smart DAGKc 123aa 2e-41
in ref transcript
Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. DAG can be produced from the hydrolysis of phosphatidylinositol 4,5-bisphosphate (PIP2) by a phosphoinositide-specific phospholipase C and by the degradation of phosphatidylcholine (PC) by a phospholipase C or the concerted actions of phospholipase D and phosphatidate phosphohydrolase. This domain is presumed to be the catalytic domain. Bacterial homologues areknown.
smart PH 91aa 2e-14
in ref transcript
Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
Changed!
smart SAM 64aa 7e-14
in ref transcript
Sterile alpha motif. Widespread domain in signalling and nuclear proteins. In EPH-related tyrosine kinases, appears to mediate cell-cell initiated signal transduction via the binding of SH2-containing proteins to a conserved tyrosine that is phosphorylated. In many cases mediates homodimerisation.
smart C1 50aa 6e-12
in ref transcript
Protein kinase C conserved region 1 (C1) domains (Cysteine-rich domains). Some bind phorbol esters and diacylglycerol. Some bind RasGTP. Zinc-binding domains.
smart C1 45aa 2e-06
in ref transcript
COG LCB5 122aa 3e-07
in ref transcript
Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase [Lipid metabolism / General function prediction only].
DGUOK
refseq_DGUOK.F1 refseq_DGUOK.R1 120 384
NCBIGene 36.3 1716
Multiple exon skipping, size difference: 264
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_080916
Changed!
cd dNK 215aa 2e-48
in ref transcript
Deoxyribonucleoside kinase (dNK) catalyzes the phosphorylation of deoxyribonucleosides to yield corresponding monophosphates (dNMPs). This family consists of various deoxynucleoside kinases including deoxyribo- cytidine (EC 2.7.1.74), guanosine (EC 2.7.1.113), adenosine (EC 2.7.1.76), and thymidine (EC 2.7.1.21) kinases. They are key enzymes in the salvage of deoxyribonucleosides originating from extra- or intracellular breakdown of DNA.
Changed!
pfam dNK 158aa 2e-44
in ref transcript
Deoxynucleoside kinase. This family consists of various deoxynucleoside kinases cytidine EC:2.7.1.74, guanosine EC:2.7.1.113, adenosine EC:2.7.1.76 and thymidine kinase EC:2.7.1.21 (which also phosphorylates deoxyuridine and deoxycytosine.) These enzymes catalyse the production of deoxynucleotide 5'-monophosphate from a deoxynucleoside. Using ATP and yielding ADP in the process.
Changed!
COG COG1428 239aa 3e-22
in ref transcript
Deoxynucleoside kinases [Nucleotide transport and metabolism].
Changed!
cd dNK 117aa 4e-21
in modified transcript
Changed!
pfam dNK 47aa 6e-06
in modified transcript
Changed!
COG COG1428 144aa 6e-08
in modified transcript
DHODH
refseq_DHODH.F1 refseq_DHODH.R1 100 402
NCBIGene 36.2 1723
Multiple exon skipping, size difference: 302
Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
Reference transcript: NM_001361
Changed!
cd DHOD_2_like 331aa 1e-125
in ref transcript
Dihydroorotate dehydrogenase (DHOD) class 2. DHOD catalyzes the oxidation of (S)-dihydroorotate to orotate. This is the fourth step and the only redox reaction in the de novo biosynthesis of UMP, the precursor of all pyrimidine nucleotides. DHOD requires FMN as co-factor. DHOD divides into class 1 and class 2 based on their amino acid sequences, their cellular location and their natural electron acceptor used to reoxidize the flavin group. Members of class 1 are cytosolic enzymes and multimers, while class 2 enzymes are membrane associated, monomeric and use respiratory quinones as their physiological electron acceptors.
Changed!
TIGR pyrD_sub2 335aa 1e-136
in ref transcript
The subfamilies 1 and 2 share extensive homology, particularly toward the C-terminus. This subfamily has a longer N-terminal region.
Changed!
PRK PRK05286 330aa 1e-123
in ref transcript
dihydroorotate dehydrogenase 2; Reviewed.
Changed!
cd DHOD_2_like 125aa 2e-49
in modified transcript
Changed!
TIGR pyrD_sub2 124aa 1e-54
in modified transcript
Changed!
PRK PRK05286 120aa 1e-47
in modified transcript
DHPS
refseq_DHPS.F2 refseq_DHPS.R2 111 279
NCBIGene 36.3 1725
Exon skipping and alternative 3-prime or 5-prime, size difference: 168
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_001930
Changed!
pfam DS 315aa 1e-149
in ref transcript
Deoxyhypusine synthase. Eukaryotic initiation factor 5A (eIF-5A) contains an unusual amino acid, hypusine [N epsilon-(4-aminobutyl-2-hydroxy)lysine]. The first step in the post-translational formation of hypusine is catalysed by the enzyme deoxyhypusine synthase (DS) EC:1.1.1.249. The modified version of eIF-5A, and DS, are required for eukaryotic cell proliferation.
Changed!
COG DYS1 331aa 1e-101
in ref transcript
Deoxyhypusine synthase [Posttranslational modification, protein turnover, chaperones].
Changed!
pfam DS 259aa 1e-105
in modified transcript
Changed!
COG DYS1 275aa 7e-67
in modified transcript
DHX30
refseq_DHX30.F1 refseq_DHX30.R1 163 192
NCBIGene 36.3 22907
Single exon skipping, size difference: 29
Inclusion in the protein causing a new stop codon
Reference transcript: NM_138615
Changed!
cd DEXDc 142aa 2e-16
in ref transcript
DEAD-like helicases superfamily. A diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.
Changed!
cd HELICc 128aa 1e-05
in ref transcript
Helicase superfamily c-terminal domain; associated with DEXDc-, DEAD-, and DEAH-box proteins, yeast initiation factor 4A, Ski2p, and Hepatitis C virus NS3 helicases; this domain is found in a wide variety of helicases and helicase related proteins; may not be an autonomously folding unit, but an integral part of the helicase; 4 helicase superfamilies at present according to the organization of their signature motifs; all helicases share the ability to unwind nucleic acid duplexes with a distinct directional polarity; they utilize the free energy from nucleoside triphosphate hydrolysis to fuel their translocation along DNA, unwinding the duplex in the process.
Changed!
TIGR DEAH_box_HrpA 541aa 8e-86
in ref transcript
This model represents HrpA, one of two related but uncharacterized DEAH-box ATP-dependent helicases in many Proteobacteria and a few high-GC Gram-positive bacteria. HrpA is about 1300 amino acids long, while its paralog HrpB, also uncharacterized, is about 800 amino acids long. Related characterized eukarotic proteins are RNA helicases associated with pre-mRNA processing.
Changed!
pfam DUF1605 123aa 1e-04
in ref transcript
Domain of unknown function (DUF1605). This family is found towards the C-terminus of the DEAD-box helicases (pfam00270). In these helicases it apparently always found in association with pfam04408. There do seem to be a couple of instances where it occurs by itself.
Changed!
COG HrpA 650aa 1e-122
in ref transcript
HrpA-like helicases [DNA replication, recombination, and repair].
DHX34
refseq_DHX34.F1 refseq_DHX34.R1 103 178
NCBIGene 36.2 9704
Single exon skipping, size difference: 75
Inclusion in the protein causing a new stop codon
Reference transcript: NM_014681
cd DEXDc 136aa 1e-18
in ref transcript
DEAD-like helicases superfamily. A diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.
cd HELICc 94aa 2e-05
in ref transcript
Helicase superfamily c-terminal domain; associated with DEXDc-, DEAD-, and DEAH-box proteins, yeast initiation factor 4A, Ski2p, and Hepatitis C virus NS3 helicases; this domain is found in a wide variety of helicases and helicase related proteins; may not be an autonomously folding unit, but an integral part of the helicase; 4 helicase superfamilies at present according to the organization of their signature motifs; all helicases share the ability to unwind nucleic acid duplexes with a distinct directional polarity; they utilize the free energy from nucleoside triphosphate hydrolysis to fuel their translocation along DNA, unwinding the duplex in the process.
TIGR DEAH_box_HrpA 474aa 1e-101
in ref transcript
This model represents HrpA, one of two related but uncharacterized DEAH-box ATP-dependent helicases in many Proteobacteria and a few high-GC Gram-positive bacteria. HrpA is about 1300 amino acids long, while its paralog HrpB, also uncharacterized, is about 800 amino acids long. Related characterized eukarotic proteins are RNA helicases associated with pre-mRNA processing.
Changed!
pfam DUF1605 100aa 0.001
in ref transcript
Domain of unknown function (DUF1605). This family is found towards the C-terminus of the DEAD-box helicases (pfam00270). In these helicases it apparently always found in association with pfam04408. There do seem to be a couple of instances where it occurs by itself.
COG HrpA 505aa 1e-131
in ref transcript
HrpA-like helicases [DNA replication, recombination, and repair].
DIABLO
refseq_DIABLO.F1 refseq_DIABLO.R1 130 262
NCBIGene 36.3 56616
Single exon skipping, size difference: 132
Exclusion in the protein (no frameshift)
Reference transcript: NM_019887
Changed!
pfam Smac_DIABLO 234aa 1e-100
in ref transcript
Second Mitochondria-derived Activator of Caspases. Second Mitochondria-derived Activator of Caspases promotes apoptosis by activating caspases in the cytochrome c/Apaf-1/caspase-9 pathway, and by opposing the inhibitory activity of inhibitor of apoptosis proteins (XIAP-BIR3). The protein assumes an elongated three-helix bundle structure, and forms a dimer in solution.
Changed!
pfam Smac_DIABLO 190aa 3e-73
in modified transcript
DIO1
refseq_DIO1.F1 refseq_DIO1.R1 189 333
NCBIGene 36.3 1733
Single exon skipping, size difference: 144
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001039715
Changed!
pfam T4_deiodinase 168aa 5e-55
in ref transcript
Iodothyronine deiodinase. Iodothyronine deiodinase converts thyroxine (T4) to 3,5,3'-triiodothyronine (T3).
Changed!
pfam T4_deiodinase 93aa 2e-28
in modified transcript
DIO1
refseq_DIO1.F3 refseq_DIO1.R3 154 346
NCBIGene 36.3 1733
Alternative 5-prime, size difference: 192
Exclusion in the protein (no frameshift)
Reference transcript: NM_000792
Changed!
pfam T4_deiodinase 93aa 2e-28
in ref transcript
Iodothyronine deiodinase. Iodothyronine deiodinase converts thyroxine (T4) to 3,5,3'-triiodothyronine (T3).
Changed!
pfam T4_deiodinase 13aa 0.005
in modified transcript
DIO1
refseq_DIO1.F4 refseq_DIO1.R4 102 302
NCBIGene 36.3 1733
Single exon skipping, size difference: 200
Exclusion in 3'UTR
Reference transcript: NM_000792
pfam T4_deiodinase 93aa 2e-28
in ref transcript
Iodothyronine deiodinase. Iodothyronine deiodinase converts thyroxine (T4) to 3,5,3'-triiodothyronine (T3).
Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.
cd CUB 107aa 5e-25
in ref transcript
CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
cd CCP 64aa 2e-07
in ref transcript
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
cd EGF_CA 33aa 7e-04
in ref transcript
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
smart Tryp_SPc 255aa 1e-27
in ref transcript
Trypsin-like serine protease. Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.
smart CUB 92aa 5e-23
in ref transcript
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein. This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.
smart CCP 63aa 4e-08
in ref transcript
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.
smart EGF_CA 33aa 0.002
in ref transcript
Calcium-binding EGF-like domain.
smart CCP 35aa 0.003
in ref transcript
COG COG5640 259aa 5e-07
in ref transcript
Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones].
cd vWA_transcription_factor_IIH_type 181aa 1e-75
in ref transcript
Transcription factors IIH type: TFIIH is a multiprotein complex that is one of the five general transcription factors that binds RNA polymerase II holoenzyme. Orthologues of these genes are found in all completed eukaryotic genomes and all these proteins contain a VWA domain. The p44 subunit of TFIIH functions as a DNA helicase in RNA polymerase II transcription initiation and DNA repair, and its transcriptional activity is dependent on its C-terminal Zn-binding domains. The function of the vWA domain is unclear, but may be involved in complex assembly. The MIDAS motif is not conserved in this sub-group.
pfam Ssl1 249aa 1e-117
in ref transcript
Ssl1-like. Ssl1-like proteins are 40kDa subunits of the Transcription factor II H complex.
TIGR ssl1 101aa 3e-41
in ref transcript
This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
COG SSL1 375aa 6e-76
in ref transcript
RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair].
MIZ/SP-RING zinc finger. This domain has SUMO (small ubiquitin-like modifier) ligase activity and is involved in DNA repair and chromosome organisation.
TIGR PABP-1234 81aa 5e-04
in ref transcript
There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, and of unknown tissue range.
DLEC1
refseq_DLEC1.F1 refseq_DLEC1.R1 141 265
NCBIGene 36.3 9940
Single exon skipping, size difference: 124
Inclusion in the protein causing a frameshift
Reference transcript: NM_007337
DLX1
refseq_DLX1.F2 refseq_DLX1.R2 161 361
NCBIGene 36.3 1745
Single exon skipping, size difference: 200
Exclusion in the protein causing a frameshift
Reference transcript: NM_178120
Changed!
cd homeodomain 57aa 3e-15
in ref transcript
Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
Changed!
pfam Homeobox 57aa 9e-21
in ref transcript
Changed!
cd ZZ_dystrophin 49aa 2e-23
in ref transcript
Zinc finger, ZZ type. Zinc finger present in dystrophin and dystrobrevin. The ZZ motif coordinates two zinc ions and most likely participates in ligand binding or molecular scaffolding. Dystrophin attaches actin filaments to an integral membrane glycoprotein complex in muscle cells. The ZZ domain in dystrophin has been shown to be essential for binding to the membrane protein beta-dystroglycan.
Changed!
cd SPEC 217aa 8e-14
in ref transcript
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here.
Changed!
cd SPEC 216aa 4e-11
in ref transcript
Changed!
cd SPEC 244aa 5e-10
in ref transcript
Changed!
cd SPEC 209aa 4e-09
in ref transcript
Changed!
cd SPEC 111aa 1e-06
in ref transcript
Changed!
cd WW 30aa 1e-05
in ref transcript
Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other peptide recognition proteins such as antibodies and receptors; WW domains have a single groove formed by a conserved Trp and Tyr which recognizes a pair of residues of the sequence X-Pro; variable loops and neighboring domains confer specificity in this domain; there are five distinct groups based on binding: 1) PPXY motifs 2) the PPLP motif; 3) PGM motifs; 4) PSP or PTP motifs; 5) PR motifs.
Changed!
cd SPEC 189aa 2e-05
in ref transcript
Changed!
cd SPEC 210aa 3e-05
in ref transcript
Changed!
cd SPEC 226aa 4e-04
in ref transcript
Changed!
pfam efhand_1 121aa 8e-45
in ref transcript
EF hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
Changed!
pfam efhand_2 92aa 4e-40
in ref transcript
EF-hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
Changed!
pfam ZZ 46aa 1e-18
in ref transcript
Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin. Putative zinc finger; binding not yet shown. Four to six cysteine residues in its sequence are responsible for coordinating zinc ions, to reinforce the structure.
Changed!
smart SPEC 101aa 9e-09
in ref transcript
Spectrin repeats.
Changed!
pfam Spectrin 107aa 2e-08
in ref transcript
Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectrin repeat forms a three helix bundle. The second helix is interrupted by proline in some sequences. The repeats are defined by a characteristic tryptophan (W) residue at position 17 in helix A and a leucine (L) at 2 residues from the carboxyl end of helix C.
Changed!
smart SPEC 102aa 5e-08
in ref transcript
Changed!
smart SPEC 110aa 6e-07
in ref transcript
Changed!
TIGR SMC_prok_B 582aa 6e-07
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
Changed!
smart WW 32aa 4e-06
in ref transcript
Domain with 2 conserved Trp (W) residues. Also known as the WWP or rsp5 domain. Binds proline-rich polypeptides.
Changed!
pfam Spectrin 117aa 2e-05
in ref transcript
Changed!
TIGR SMC_prok_B 278aa 8e-05
in ref transcript
Changed!
COG SbcC 548aa 1e-08
in ref transcript
ATPase involved in DNA repair [DNA replication, recombination, and repair].
Changed!
PRK PRK00409 138aa 4e-04
in ref transcript
recombination and DNA strand exchange inhibitor protein; Reviewed.
DMD
refseq_DMD.F3 refseq_DMD.R3 166 198
NCBIGene 36.3 1756
Single exon skipping, size difference: 32
Exclusion in the protein causing a frameshift
Reference transcript: NM_004006
cd ZZ_dystrophin 49aa 3e-23
in ref transcript
Zinc finger, ZZ type. Zinc finger present in dystrophin and dystrobrevin. The ZZ motif coordinates two zinc ions and most likely participates in ligand binding or molecular scaffolding. Dystrophin attaches actin filaments to an integral membrane glycoprotein complex in muscle cells. The ZZ domain in dystrophin has been shown to be essential for binding to the membrane protein beta-dystroglycan.
cd CH 106aa 6e-16
in ref transcript
Calponin homology domain; actin-binding domain which may be present as a single copy or in tandem repeats (which increases binding affinity). The CH domain is found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).
cd SPEC 217aa 7e-14
in ref transcript
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here.
cd SPEC 218aa 5e-13
in ref transcript
cd SPEC 216aa 3e-11
in ref transcript
cd SPEC 244aa 5e-10
in ref transcript
cd SPEC 209aa 4e-09
in ref transcript
cd CH 104aa 6e-09
in ref transcript
cd SPEC 207aa 7e-09
in ref transcript
cd SPEC 215aa 1e-07
in ref transcript
cd SPEC 111aa 1e-06
in ref transcript
cd SPEC 183aa 5e-06
in ref transcript
cd SPEC 207aa 5e-06
in ref transcript
cd SPEC 189aa 2e-05
in ref transcript
cd SPEC 210aa 3e-05
in ref transcript
cd WW 30aa 4e-05
in ref transcript
Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other peptide recognition proteins such as antibodies and receptors; WW domains have a single groove formed by a conserved Trp and Tyr which recognizes a pair of residues of the sequence X-Pro; variable loops and neighboring domains confer specificity in this domain; there are five distinct groups based on binding: 1) PPXY motifs 2) the PPLP motif; 3) PGM motifs; 4) PSP or PTP motifs; 5) PR motifs.
cd SPEC 226aa 4e-04
in ref transcript
cd SPEC 208aa 0.004
in ref transcript
pfam efhand_1 121aa 1e-44
in ref transcript
EF hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
pfam efhand_2 92aa 7e-40
in ref transcript
EF-hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
pfam ZZ 46aa 2e-18
in ref transcript
Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin. Putative zinc finger; binding not yet shown. Four to six cysteine residues in its sequence are responsible for coordinating zinc ions, to reinforce the structure.
smart CH 100aa 4e-18
in ref transcript
Calponin homology domain. Actin binding domains present in duplicate at the N-termini of spectrin-like proteins (including dystrophin, alpha-actinin). These domains cross-link actin filaments into bundles and networks. A calponin homology domain is predicted in yeasst Cdc24p.
smart CH 101aa 4e-11
in ref transcript
TIGR SMC_prok_B 595aa 6e-09
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
smart SPEC 101aa 1e-08
in ref transcript
Spectrin repeats.
pfam Spectrin 107aa 2e-08
in ref transcript
Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectrin repeat forms a three helix bundle. The second helix is interrupted by proline in some sequences. The repeats are defined by a characteristic tryptophan (W) residue at position 17 in helix A and a leucine (L) at 2 residues from the carboxyl end of helix C.
smart SPEC 103aa 5e-08
in ref transcript
smart SPEC 102aa 6e-08
in ref transcript
pfam Spectrin 105aa 1e-07
in ref transcript
smart SPEC 110aa 8e-07
in ref transcript
smart WW 32aa 1e-05
in ref transcript
Domain with 2 conserved Trp (W) residues. Also known as the WWP or rsp5 domain. Binds proline-rich polypeptides.
pfam Spectrin 117aa 3e-05
in ref transcript
TIGR SMC_prok_B 278aa 8e-05
in ref transcript
smart SPEC 101aa 3e-04
in ref transcript
smart SPEC 98aa 0.003
in ref transcript
pfam Spectrin 66aa 0.007
in ref transcript
TIGR SMC_prok_B 391aa 0.010
in ref transcript
COG SAC6 225aa 6e-25
in ref transcript
Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton].
COG SbcC 643aa 3e-09
in ref transcript
ATPase involved in DNA repair [DNA replication, recombination, and repair].
PRK PRK00409 138aa 5e-04
in ref transcript
recombination and DNA strand exchange inhibitor protein; Reviewed.
DMD
refseq_DMD.F5 refseq_DMD.R5 270 354
NCBIGene 36.3 1756
Alternative 5-prime, size difference: 84
Inclusion in the protein causing a new stop codon
Reference transcript: NM_004009
Changed!
cd ZZ_dystrophin 49aa 3e-23
in ref transcript
Zinc finger, ZZ type. Zinc finger present in dystrophin and dystrobrevin. The ZZ motif coordinates two zinc ions and most likely participates in ligand binding or molecular scaffolding. Dystrophin attaches actin filaments to an integral membrane glycoprotein complex in muscle cells. The ZZ domain in dystrophin has been shown to be essential for binding to the membrane protein beta-dystroglycan.
Changed!
cd CH 106aa 5e-16
in ref transcript
Calponin homology domain; actin-binding domain which may be present as a single copy or in tandem repeats (which increases binding affinity). The CH domain is found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).
Changed!
cd SPEC 217aa 8e-14
in ref transcript
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here.
Changed!
cd SPEC 218aa 5e-13
in ref transcript
Changed!
cd SPEC 216aa 3e-11
in ref transcript
Changed!
cd SPEC 244aa 5e-10
in ref transcript
Changed!
cd SPEC 209aa 4e-09
in ref transcript
Changed!
cd CH 104aa 6e-09
in ref transcript
Changed!
cd SPEC 207aa 8e-09
in ref transcript
Changed!
cd SPEC 215aa 1e-07
in ref transcript
Changed!
cd SPEC 111aa 1e-06
in ref transcript
Changed!
cd SPEC 183aa 5e-06
in ref transcript
Changed!
cd SPEC 207aa 5e-06
in ref transcript
Changed!
cd SPEC 189aa 2e-05
in ref transcript
Changed!
cd SPEC 210aa 3e-05
in ref transcript
Changed!
cd WW 30aa 3e-05
in ref transcript
Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other peptide recognition proteins such as antibodies and receptors; WW domains have a single groove formed by a conserved Trp and Tyr which recognizes a pair of residues of the sequence X-Pro; variable loops and neighboring domains confer specificity in this domain; there are five distinct groups based on binding: 1) PPXY motifs 2) the PPLP motif; 3) PGM motifs; 4) PSP or PTP motifs; 5) PR motifs.
Changed!
cd SPEC 226aa 4e-04
in ref transcript
Changed!
cd SPEC 208aa 0.004
in ref transcript
Changed!
pfam efhand_1 121aa 1e-44
in ref transcript
EF hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
Changed!
pfam efhand_2 92aa 6e-40
in ref transcript
EF-hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
Changed!
pfam ZZ 46aa 2e-18
in ref transcript
Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin. Putative zinc finger; binding not yet shown. Four to six cysteine residues in its sequence are responsible for coordinating zinc ions, to reinforce the structure.
Changed!
smart CH 100aa 4e-18
in ref transcript
Calponin homology domain. Actin binding domains present in duplicate at the N-termini of spectrin-like proteins (including dystrophin, alpha-actinin). These domains cross-link actin filaments into bundles and networks. A calponin homology domain is predicted in yeasst Cdc24p.
Changed!
smart CH 101aa 4e-11
in ref transcript
Changed!
TIGR SMC_prok_B 595aa 6e-09
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
Changed!
smart SPEC 101aa 1e-08
in ref transcript
Spectrin repeats.
Changed!
pfam Spectrin 107aa 2e-08
in ref transcript
Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectrin repeat forms a three helix bundle. The second helix is interrupted by proline in some sequences. The repeats are defined by a characteristic tryptophan (W) residue at position 17 in helix A and a leucine (L) at 2 residues from the carboxyl end of helix C.
Changed!
smart SPEC 103aa 5e-08
in ref transcript
Changed!
smart SPEC 102aa 6e-08
in ref transcript
Changed!
pfam Spectrin 105aa 1e-07
in ref transcript
Changed!
smart SPEC 110aa 8e-07
in ref transcript
Changed!
smart WW 32aa 1e-05
in ref transcript
Domain with 2 conserved Trp (W) residues. Also known as the WWP or rsp5 domain. Binds proline-rich polypeptides.
Changed!
pfam Spectrin 117aa 3e-05
in ref transcript
Changed!
TIGR SMC_prok_B 278aa 8e-05
in ref transcript
Changed!
smart SPEC 101aa 3e-04
in ref transcript
Changed!
smart SPEC 98aa 0.003
in ref transcript
Changed!
pfam Spectrin 66aa 0.007
in ref transcript
Changed!
COG SAC6 225aa 6e-25
in ref transcript
Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton].
Changed!
COG SbcC 643aa 3e-09
in ref transcript
ATPase involved in DNA repair [DNA replication, recombination, and repair].
Changed!
PRK PRK00409 138aa 5e-04
in ref transcript
recombination and DNA strand exchange inhibitor protein; Reviewed.
DMD
refseq_DMD.F7 refseq_DMD.R7 116 155
NCBIGene 36.3 1756
Single exon skipping, size difference: 39
Exclusion in the protein (no frameshift)
Reference transcript: NM_004006
cd ZZ_dystrophin 49aa 3e-23
in ref transcript
Zinc finger, ZZ type. Zinc finger present in dystrophin and dystrobrevin. The ZZ motif coordinates two zinc ions and most likely participates in ligand binding or molecular scaffolding. Dystrophin attaches actin filaments to an integral membrane glycoprotein complex in muscle cells. The ZZ domain in dystrophin has been shown to be essential for binding to the membrane protein beta-dystroglycan.
cd CH 106aa 6e-16
in ref transcript
Calponin homology domain; actin-binding domain which may be present as a single copy or in tandem repeats (which increases binding affinity). The CH domain is found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).
cd SPEC 217aa 7e-14
in ref transcript
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here.
cd SPEC 218aa 5e-13
in ref transcript
cd SPEC 216aa 3e-11
in ref transcript
cd SPEC 244aa 5e-10
in ref transcript
cd SPEC 209aa 4e-09
in ref transcript
cd CH 104aa 6e-09
in ref transcript
cd SPEC 207aa 7e-09
in ref transcript
cd SPEC 215aa 1e-07
in ref transcript
cd SPEC 111aa 1e-06
in ref transcript
cd SPEC 183aa 5e-06
in ref transcript
cd SPEC 207aa 5e-06
in ref transcript
cd SPEC 189aa 2e-05
in ref transcript
cd SPEC 210aa 3e-05
in ref transcript
cd WW 30aa 4e-05
in ref transcript
Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other peptide recognition proteins such as antibodies and receptors; WW domains have a single groove formed by a conserved Trp and Tyr which recognizes a pair of residues of the sequence X-Pro; variable loops and neighboring domains confer specificity in this domain; there are five distinct groups based on binding: 1) PPXY motifs 2) the PPLP motif; 3) PGM motifs; 4) PSP or PTP motifs; 5) PR motifs.
cd SPEC 226aa 4e-04
in ref transcript
cd SPEC 208aa 0.004
in ref transcript
pfam efhand_1 121aa 1e-44
in ref transcript
EF hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
pfam efhand_2 92aa 7e-40
in ref transcript
EF-hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
pfam ZZ 46aa 2e-18
in ref transcript
Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin. Putative zinc finger; binding not yet shown. Four to six cysteine residues in its sequence are responsible for coordinating zinc ions, to reinforce the structure.
smart CH 100aa 4e-18
in ref transcript
Calponin homology domain. Actin binding domains present in duplicate at the N-termini of spectrin-like proteins (including dystrophin, alpha-actinin). These domains cross-link actin filaments into bundles and networks. A calponin homology domain is predicted in yeasst Cdc24p.
smart CH 101aa 4e-11
in ref transcript
TIGR SMC_prok_B 595aa 6e-09
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
smart SPEC 101aa 1e-08
in ref transcript
Spectrin repeats.
pfam Spectrin 107aa 2e-08
in ref transcript
Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectrin repeat forms a three helix bundle. The second helix is interrupted by proline in some sequences. The repeats are defined by a characteristic tryptophan (W) residue at position 17 in helix A and a leucine (L) at 2 residues from the carboxyl end of helix C.
smart SPEC 103aa 5e-08
in ref transcript
smart SPEC 102aa 6e-08
in ref transcript
pfam Spectrin 105aa 1e-07
in ref transcript
smart SPEC 110aa 8e-07
in ref transcript
smart WW 32aa 1e-05
in ref transcript
Domain with 2 conserved Trp (W) residues. Also known as the WWP or rsp5 domain. Binds proline-rich polypeptides.
pfam Spectrin 117aa 3e-05
in ref transcript
TIGR SMC_prok_B 278aa 8e-05
in ref transcript
smart SPEC 101aa 3e-04
in ref transcript
smart SPEC 98aa 0.003
in ref transcript
pfam Spectrin 66aa 0.007
in ref transcript
Changed!
TIGR SMC_prok_B 391aa 0.010
in ref transcript
COG SAC6 225aa 6e-25
in ref transcript
Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton].
COG SbcC 643aa 3e-09
in ref transcript
ATPase involved in DNA repair [DNA replication, recombination, and repair].
PRK PRK00409 138aa 5e-04
in ref transcript
recombination and DNA strand exchange inhibitor protein; Reviewed.
DNAJC21
refseq_DNAJA5.F1 refseq_DNAJA5.R1 216 351
NCBIGene 36.3 134218
Single exon skipping, size difference: 135
Exclusion in the protein (no frameshift)
Reference transcript: NM_194283
cd DnaJ 56aa 1e-17
in ref transcript
DnaJ domain or J-domain. DnaJ/Hsp40 (heat shock protein 40) proteins are highly conserved and play crucial roles in protein translation, folding, unfolding, translocation, and degradation. They act primarily by stimulating the ATPase activity of Hsp70s, an important chaperonine family. Hsp40 proteins are characterized by the presence of a J domain, which mediates the interaction with Hsp70. They may contain other domains as well, and the architectures provide a means of classification.
TIGR DnaJ_bact 88aa 2e-25
in ref transcript
This model represents bacterial forms of DnaJ, part of the DnaK-DnaJ-GrpE chaperone system. The three components typically are encoded by consecutive genes. DnaJ homologs occur in many genomes, typically not near DnaK and GrpE-like genes; most such genes are not included by this family. Eukaryotic (mitochondrial and chloroplast) forms are not included in the scope of this family.
smart ZnF_U1 30aa 8e-04
in ref transcript
U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
COG DnaJ 75aa 8e-23
in ref transcript
DnaJ-class molecular chaperone with C-terminal Zn finger domain [Posttranslational modification, protein turnover, chaperones].
Changed!
COG ZUO1 264aa 4e-12
in ref transcript
Ribosome-associated chaperone zuotin [Translation, ribosomal structure and biogenesis / Posttranslational modification, protein turnover, chaperones].
Changed!
COG ZUO1 245aa 9e-12
in modified transcript
DNAJC19
refseq_DNAJC19.F1 refseq_DNAJC19.R1 137 189
NCBIGene 36.2 131118
Single exon skipping, size difference: 52
Exclusion in the protein causing a frameshift
Reference transcript: NM_145261
Changed!
cd DnaJ 45aa 5e-07
in ref transcript
DnaJ domain or J-domain. DnaJ/Hsp40 (heat shock protein 40) proteins are highly conserved and play crucial roles in protein translation, folding, unfolding, translocation, and degradation. They act primarily by stimulating the ATPase activity of Hsp70s, an important chaperonine family. Hsp40 proteins are characterized by the presence of a J domain, which mediates the interaction with Hsp70. They may contain other domains as well, and the architectures provide a means of classification.
Changed!
smart DnaJ 46aa 2e-07
in ref transcript
DnaJ molecular chaperone homology domain.
Changed!
PTZ PTZ00100 58aa 4e-20
in ref transcript
DnaJ chaperone protein; Provisional.
DNASE1L1
refseq_DNASE1L1.F1 refseq_DNASE1L1.R1 111 369
NCBIGene 36.3 1774
Single exon skipping, size difference: 258
Exclusion in 5'UTR
Reference transcript: NM_001009932
smart DNaseIc 272aa 1e-114
in ref transcript
deoxyribonuclease I. Deoxyribonuclease I catalyzes the endonucleolytic cleavage of double-stranded DNA. The enzyme is secreted outside the cell and also involved in apoptosis in the nucleus.
DNM1
refseq_DNM1.F2 refseq_DNM1.R2 159 196
NCBIGene 36.3 1759
Single exon skipping, size difference: 37
Inclusion in the protein causing a new stop codon
Reference transcript: NM_004408
cd PH_dynamin 110aa 1e-50
in ref transcript
Dynamin pleckstrin homology (PH) domain. Dynamin is a GTPase that regulates endocytic vesicle formation. It has an N-terminal GTPase domain, followed by a PH domain, a GTPase effector domain and a C-terminal proline arginine rich domain. Dynamin-like proteins, which are found in metazoa, plants and yeast have the same domain architecture as dynamin, but lack the PH domain. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
smart DYNc 240aa 1e-130
in ref transcript
Dynamin, GTPase. Large GTPases that mediate vesicle trafficking. Dynamin participates in the endocytic uptake of receptors, associated ligands, and plasma membrane following an exocytic event.
pfam Dynamin_M 294aa 1e-118
in ref transcript
Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169.
pfam GED 92aa 2e-22
in ref transcript
Dynamin GTPase effector domain.
smart PH 104aa 2e-05
in ref transcript
Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
COG COG0699 424aa 5e-34
in ref transcript
Predicted GTPases (dynamin-related) [General function prediction only].
DPAGT1
refseq_DPAGT1.F1 refseq_DPAGT1.R1 194 315
NCBIGene 36.3 1798
Single exon skipping, size difference: 121
Exclusion in the protein causing a frameshift
Reference transcript: NM_001382
Changed!
cd GT_GPT_euk 289aa 1e-113
in ref transcript
UDP-GlcNAc:dolichol-P GlcNAc-1-P transferase (GPT) catalyzes the transfer of GlcNAc-1-P from UDP-GlcNAc to dolichol-P to form GlcNAc-P-P-dolichol. The reaction is the first step in the assembly of dolichol-linked oligosaccharide intermediates and is essential for eukaryotic N-glycosylation. GPT activity has been identified in all eukaryotic cells examined to date. A series of six conserved motifs designated A through F, ranging in length from 5 to 13 amino acid residues, has been identified in this family. They have been determined to be important for stable expression, substrate binding, or catalytic activities.
Changed!
pfam Glycos_transf_4 178aa 1e-44
in ref transcript
Changed!
cd GT_GPT_euk 26aa 0.008
in modified transcript
DPP8
refseq_DPP8.F1 refseq_DPP8.R1 218 354
NCBIGene 36.3 54878
Single exon skipping, size difference: 136
Inclusion in 5'UTR
Reference transcript: NM_130434
pfam DPPIV_N 421aa 7e-72
in ref transcript
Dipeptidyl peptidase IV (DPP IV) N-terminal region. This family is an alignment of the region to the N-terminal side of the active site. The Prosite motif does not correspond to this Pfam entry.
pfam Peptidase_S9 206aa 9e-53
in ref transcript
Prolyl oligopeptidase family.
COG DAP2 330aa 7e-48
in ref transcript
Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [Amino acid transport and metabolism].
DPP8
refseq_DPP8.F3 refseq_DPP8.R3 184 337
NCBIGene 36.3 54878
Single exon skipping, size difference: 153
Exclusion in the protein (no frameshift)
Reference transcript: NM_197960
pfam DPPIV_N 421aa 4e-72
in ref transcript
Dipeptidyl peptidase IV (DPP IV) N-terminal region. This family is an alignment of the region to the N-terminal side of the active site. The Prosite motif does not correspond to this Pfam entry.
Changed!
pfam Peptidase_S9 206aa 6e-53
in ref transcript
Prolyl oligopeptidase family.
Changed!
COG DAP2 330aa 4e-48
in ref transcript
Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [Amino acid transport and metabolism].
Changed!
pfam Peptidase_S9 155aa 7e-25
in modified transcript
Changed!
COG DAP2 279aa 1e-27
in modified transcript
DRD2
refseq_DRD2.F1 refseq_DRD2.R1 276 363
NCBIGene 36.3 1813
Single exon skipping, size difference: 87
Exclusion in the protein (no frameshift)
Reference transcript: NM_000795
pfam 7tm_1 131aa 8e-23
in ref transcript
7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
pfam 7tm_1 68aa 7e-17
in ref transcript
DSC1
refseq_DSC1.F2 refseq_DSC1.R2 251 297
NCBIGene 36.3 1823
Single exon skipping, size difference: 46
Inclusion in the protein causing a new stop codon
Reference transcript: NM_024421
cd CA 201aa 1e-34
in ref transcript
Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here.
cd CA 209aa 2e-34
in ref transcript
cd CA 182aa 6e-23
in ref transcript
cd CA 212aa 4e-21
in ref transcript
pfam Cadherin 97aa 4e-20
in ref transcript
Cadherin domain.
pfam Cadherin_pro 81aa 4e-17
in ref transcript
Cadherin prodomain like. Cadherins are a family of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This domain corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions.
pfam Cadherin 95aa 9e-16
in ref transcript
pfam Cadherin 83aa 3e-13
in ref transcript
pfam Cadherin 91aa 2e-11
in ref transcript
pfam Cadherin 86aa 0.002
in ref transcript
DSC2
refseq_DSC2.F1 refseq_DSC2.R1 217 263
NCBIGene 36.3 1824
Single exon skipping, size difference: 46
Inclusion in the protein causing a new stop codon
Reference transcript: NM_024422
cd CA 210aa 5e-32
in ref transcript
Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here.
cd CA 209aa 2e-24
in ref transcript
cd CA 196aa 2e-22
in ref transcript
pfam Cadherin 104aa 8e-14
in ref transcript
Cadherin domain.
smart CA 86aa 9e-14
in ref transcript
Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
pfam Cadherin_pro 81aa 2e-13
in ref transcript
Cadherin prodomain like. Cadherins are a family of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This domain corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions.
pfam Cadherin 93aa 6e-13
in ref transcript
pfam Cadherin 95aa 1e-11
in ref transcript
Changed!
pfam Cadherin_C 45aa 0.002
in ref transcript
Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic binding units. A key determinant to the strength of the binding that it is mediated by cadherins is the juxtamembrane region of the cadherin. This region induces clustering and also binds to the protein p120ctn.
DSCAM
refseq_DSCAM.F1 refseq_DSCAM.R1 141 332
NCBIGene 36.2 1826
Alternative 5-prime and 3-prime, size difference: 191
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_001389
cd FN3 91aa 1e-12
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd IGcam 94aa 1e-12
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IGcam 77aa 3e-12
in ref transcript
cd IGcam 91aa 2e-11
in ref transcript
cd IGcam 88aa 2e-11
in ref transcript
cd IGcam 70aa 1e-09
in ref transcript
cd FN3 98aa 3e-09
in ref transcript
cd IGcam 90aa 1e-08
in ref transcript
cd IGcam 95aa 5e-08
in ref transcript
cd FN3 86aa 6e-08
in ref transcript
cd FN3 94aa 2e-07
in ref transcript
cd FN3 74aa 1e-05
in ref transcript
cd IGcam 64aa 3e-04
in ref transcript
cd IGcam 90aa 0.002
in ref transcript
cd FN3 73aa 0.005
in ref transcript
pfam I-set 76aa 5e-13
in ref transcript
Immunoglobulin I-set domain.
pfam fn3 84aa 1e-12
in ref transcript
Fibronectin type III domain.
pfam I-set 95aa 1e-11
in ref transcript
pfam I-set 79aa 3e-11
in ref transcript
pfam I-set 85aa 5e-11
in ref transcript
smart IGc2 65aa 1e-09
in ref transcript
Immunoglobulin C-2 Type.
smart FN3 88aa 2e-09
in ref transcript
Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
smart IGc2 58aa 8e-09
in ref transcript
pfam fn3 87aa 2e-08
in ref transcript
smart IG_like 90aa 2e-08
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
pfam fn3 79aa 8e-08
in ref transcript
pfam fn3 69aa 9e-06
in ref transcript
PSMG1
refseq_DSCR2.F1 refseq_DSCR2.R1 214 277
NCBIGene 36.3 8624
Single exon skipping, size difference: 63
Exclusion in the protein (no frameshift)
Reference transcript: NM_003720
DSTN
refseq_DSTN.F1 refseq_DSTN.R1 152 284
NCBIGene 36.3 11034
Single exon skipping, size difference: 132
Inclusion in the protein causing a new stop codon
Reference transcript: NM_006870
Changed!
cd ADF 150aa 6e-32
in ref transcript
Actin depolymerisation factor/cofilin -like domains; present in a family of essential eukaryotic actin regulatory proteins; these proteins enhance the turnover rate of actin and interact with actin monomers as well as actin filaments.
Changed!
smart ADF 135aa 4e-35
in ref transcript
Actin depolymerisation factor/cofilin -like domains. Severs actin filaments and binds to actin monomers.
Changed!
PTZ PTZ00152 131aa 1e-05
in ref transcript
Zinc finger, ZZ type. Zinc finger present in dystrophin and dystrobrevin. The ZZ motif coordinates two zinc ions and most likely participates in ligand binding or molecular scaffolding. Dystrophin attaches actin filaments to an integral membrane glycoprotein complex in muscle cells. The ZZ domain in dystrophin has been shown to be essential for binding to the membrane protein beta-dystroglycan.
pfam efhand_1 127aa 6e-50
in ref transcript
EF hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
pfam efhand_2 89aa 2e-35
in ref transcript
EF-hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
smart ZnF_ZZ 45aa 3e-13
in ref transcript
Zinc-binding domain, present in Dystrophin, CREB-binding protein. Putative zinc-binding domain present in dystrophin-like proteins, and CREB-binding protein/p300 homologues. The ZZ in dystrophin appears to bind calmodulin. A missense mutation of one of the conserved cysteines in dystrophin results in a patient with Duchenne muscular dystrophy [3].
pfam Cast 103aa 0.005
in ref transcript
RIM-binding protein of the cytomatrix active zone. This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
Zinc finger, ZZ type. Zinc finger present in dystrophin and dystrobrevin. The ZZ motif coordinates two zinc ions and most likely participates in ligand binding or molecular scaffolding. Dystrophin attaches actin filaments to an integral membrane glycoprotein complex in muscle cells. The ZZ domain in dystrophin has been shown to be essential for binding to the membrane protein beta-dystroglycan.
pfam efhand_1 127aa 6e-50
in ref transcript
EF hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
pfam efhand_2 89aa 2e-35
in ref transcript
EF-hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
smart ZnF_ZZ 45aa 3e-13
in ref transcript
Zinc-binding domain, present in Dystrophin, CREB-binding protein. Putative zinc-binding domain present in dystrophin-like proteins, and CREB-binding protein/p300 homologues. The ZZ in dystrophin appears to bind calmodulin. A missense mutation of one of the conserved cysteines in dystrophin results in a patient with Duchenne muscular dystrophy [3].
Changed!
pfam Cast 103aa 0.005
in ref transcript
RIM-binding protein of the cytomatrix active zone. This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
Changed!
TIGR SMC_prok_B 77aa 0.006
in modified transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
DTNB
refseq_DTNB.F1 refseq_DTNB.R1 114 204
NCBIGene 36.3 1838
Single exon skipping, size difference: 90
Exclusion in the protein (no frameshift)
Reference transcript: NM_021907
cd ZZ_dystrophin 49aa 4e-18
in ref transcript
Zinc finger, ZZ type. Zinc finger present in dystrophin and dystrobrevin. The ZZ motif coordinates two zinc ions and most likely participates in ligand binding or molecular scaffolding. Dystrophin attaches actin filaments to an integral membrane glycoprotein complex in muscle cells. The ZZ domain in dystrophin has been shown to be essential for binding to the membrane protein beta-dystroglycan.
pfam efhand_1 127aa 3e-47
in ref transcript
EF hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
pfam efhand_2 89aa 3e-37
in ref transcript
EF-hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
smart ZnF_ZZ 45aa 3e-11
in ref transcript
Zinc-binding domain, present in Dystrophin, CREB-binding protein. Putative zinc-binding domain present in dystrophin-like proteins, and CREB-binding protein/p300 homologues. The ZZ in dystrophin appears to bind calmodulin. A missense mutation of one of the conserved cysteines in dystrophin results in a patient with Duchenne muscular dystrophy [3].
TIGR SMC_prok_B 66aa 0.008
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
PRK PRK05431 65aa 0.003
in ref transcript
seryl-tRNA synthetase; Provisional.
DTNB
refseq_DTNB.F2 refseq_DTNB.R2 115 205
NCBIGene 36.3 1838
Single exon skipping, size difference: 90
Exclusion in the protein (no frameshift)
Reference transcript: NM_021907
cd ZZ_dystrophin 49aa 4e-18
in ref transcript
Zinc finger, ZZ type. Zinc finger present in dystrophin and dystrobrevin. The ZZ motif coordinates two zinc ions and most likely participates in ligand binding or molecular scaffolding. Dystrophin attaches actin filaments to an integral membrane glycoprotein complex in muscle cells. The ZZ domain in dystrophin has been shown to be essential for binding to the membrane protein beta-dystroglycan.
pfam efhand_1 127aa 3e-47
in ref transcript
EF hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
pfam efhand_2 89aa 3e-37
in ref transcript
EF-hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
smart ZnF_ZZ 45aa 3e-11
in ref transcript
Zinc-binding domain, present in Dystrophin, CREB-binding protein. Putative zinc-binding domain present in dystrophin-like proteins, and CREB-binding protein/p300 homologues. The ZZ in dystrophin appears to bind calmodulin. A missense mutation of one of the conserved cysteines in dystrophin results in a patient with Duchenne muscular dystrophy [3].
TIGR SMC_prok_B 66aa 0.008
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
PRK PRK05431 65aa 0.003
in ref transcript
seryl-tRNA synthetase; Provisional.
DTNB
refseq_DTNB.F5 refseq_DTNB.R5 110 164
NCBIGene 36.3 1838
Single exon skipping, size difference: 54
Exclusion in the protein (no frameshift)
Reference transcript: NM_021907
cd ZZ_dystrophin 49aa 4e-18
in ref transcript
Zinc finger, ZZ type. Zinc finger present in dystrophin and dystrobrevin. The ZZ motif coordinates two zinc ions and most likely participates in ligand binding or molecular scaffolding. Dystrophin attaches actin filaments to an integral membrane glycoprotein complex in muscle cells. The ZZ domain in dystrophin has been shown to be essential for binding to the membrane protein beta-dystroglycan.
pfam efhand_1 127aa 3e-47
in ref transcript
EF hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
pfam efhand_2 89aa 3e-37
in ref transcript
EF-hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
smart ZnF_ZZ 45aa 3e-11
in ref transcript
Zinc-binding domain, present in Dystrophin, CREB-binding protein. Putative zinc-binding domain present in dystrophin-like proteins, and CREB-binding protein/p300 homologues. The ZZ in dystrophin appears to bind calmodulin. A missense mutation of one of the conserved cysteines in dystrophin results in a patient with Duchenne muscular dystrophy [3].
TIGR SMC_prok_B 66aa 0.008
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
PRK PRK05431 65aa 0.003
in ref transcript
seryl-tRNA synthetase; Provisional.
DTNBP1
refseq_DTNBP1.F1 refseq_DTNBP1.R1 226 271
NCBIGene 36.3 84062
Alternative 3-prime, size difference: 45
Inclusion in the protein causing a new stop codon
Reference transcript: NM_032122
Changed!
pfam Dysbindin 121aa 7e-16
in ref transcript
Dysbindin (Dystrobrevin binding protein 1). Dysbindin is an evolutionary conserved 40-kDa coiled-coil-containing protein that binds to alpha- and beta-dystrobrevin in muscle and brain. Dystrophin and alpha-dystrobrevin are co-immunoprecipitated with dysbindin, indicating that dysbindin is DPC-associated in muscle. Dysbindin co-localises with alpha-dystrobrevin at the sarcolemma and is up-regulated in dystrophin-deficient muscle. In the brain, dysbindin is found primarily in axon bundles and especially in certain axon terminals, notably mossy fibre synaptic terminals in the cerebellum and hippocampus. Dysbindin may have implications for the molecular pathology of Duchenne muscular dystrophy and may provide an alternative route for anchoring dystrobrevin and the DPC to the muscle membrane. Genetic variation in the human dysbindin gene is also thought to be associated with Schizophrenia.
DUOX1
refseq_DUOX1.F2 refseq_DUOX1.R2 127 319
NCBIGene 36.3 53905
Single exon skipping, size difference: 192
Exclusion in 5'UTR
Reference transcript: NM_017434
cd NOX_Duox_like_FAD_NADP 180aa 9e-45
in ref transcript
NADPH oxidase (NOX) catalyzes the generation of reactive oxygen species (ROS) such as superoxide and hydrogen peroxide. ROS were originally identified as bactericidal agents in phagocytes, but are now also implicated in cell signaling and metabolism. NOX has a 6-alpha helix heme-binding transmembrane domain fused to a flavoprotein with the nucleotide binding domain located in the cytoplasm. Duox enzymes link a peroxidase domain to the NOX domain via a single transmembrane and EF-hand Ca2+ binding sites. The flavoprotein module has a ferredoxin like FAD/NADPH binding domain. In classical phagocytic NOX2, electron transfer occurs from NADPH to FAD to the heme of cytb to oxygen leading to superoxide formation.
cd EFh 62aa 1e-11
in ref transcript
EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
cd EFh 62aa 8e-05
in ref transcript
cd NOX_Duox_like_FAD_NADP 36aa 0.007
in ref transcript
pfam An_peroxidase 528aa 1e-139
in ref transcript
Animal haem peroxidase.
pfam NAD_binding_6 155aa 2e-27
in ref transcript
Ferric reductase NAD binding domain.
pfam FAD_binding_8 97aa 4e-17
in ref transcript
FAD-binding domain.
pfam Ferric_reduct 70aa 7e-08
in ref transcript
Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mouse-ear cress. This may be a family of flavocytochromes capable of moving electrons across the plasma membrane. The Frp1 protein from S. pombe is a ferric reductase component and is required for cell surface ferric reductase activity, mutants in frp1 are deficient in ferric iron uptake. Cytochrome B-245 heavy chain is a FAD-dependent dehydrogenase it is also has electron transferase activity which reduces molecular oxygen to superoxide anion, a precursor in the production of microbicidal oxidants. Mutations in the sequence of cytochrome B-245 heavy chain (gp91-phox) lead to the X-linked chronic granulomatous disease. The bacteriocidal ability of phagocytic cells is reduced and is characterised by the absence of a functional plasma membrane associated NADPH oxidase. The chronic granulomatous disease gene codes for the beta chain of cytochrome B-245 and cytochrome B-245 is missing from patients with the disease.
smart EFh 25aa 4e-04
in ref transcript
EF-hand, calcium binding motif. EF-hands are calcium-binding motifs that occur at least in pairs. Links between disease states and genes encoding EF-hands, particularly the S100 subclass, are emerging. Each motif consists of a 12 residue loop flanked on either side by a 12 residue alpha-helix. EF-hands undergo a conformational change unpon binding calcium ions.
smart EFh 25aa 0.002
in ref transcript
COG FRQ1 171aa 2e-11
in ref transcript
Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].
COG UbiB 195aa 3e-10
in ref transcript
2-polyprenylphenol hydroxylase and related flavodoxin oxidoreductases [Coenzyme metabolism / Energy production and conversion].
DUSP13
refseq_DUSP13.F1 refseq_DUSP13.R1 144 352
NCBIGene 36.2 51207
Single exon skipping, size difference: 208
Exclusion of the protein initiation site
Reference transcript: NM_001007271
cd DSPc 144aa 2e-32
in ref transcript
Dual specificity phosphatases (DSP); Ser/Thr and Tyr protein phosphatases. Structurally similar to tyrosine-specific phosphatases but with a shallower active site cleft and a distinctive active site signature motif, HCxxGxxR. Characterized as VHR- or Cdc25-like.
Changed!
PRK PRK07764 117aa 0.006
in ref transcript
DNA polymerase III subunits gamma and tau; Validated.
DYRK1A
refseq_DYRK1A.F2 refseq_DYRK1A.R2 143 170
NCBIGene 36.3 1859
Alternative 3-prime, size difference: 27
Exclusion in the protein (no frameshift)
Reference transcript: NM_001396
cd S_TKc 322aa 6e-56
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
smart S_TKc 230aa 2e-54
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
smart S_TKc 28aa 4e-04
in ref transcript
PTZ PTZ00284 225aa 9e-30
in ref transcript
protein kinase; Provisional.
DYRK1A
refseq_DYRK1A.F3 refseq_DYRK1A.R3 133 264
NCBIGene 36.3 1859
Alternative 5-prime, size difference: 131
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001396
Changed!
cd S_TKc 322aa 6e-56
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
smart S_TKc 230aa 2e-54
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
smart S_TKc 28aa 4e-04
in ref transcript
PTZ PTZ00284 225aa 9e-30
in ref transcript
protein kinase; Provisional.
Changed!
cd S_TKc 241aa 9e-54
in modified transcript
DYRK1A
refseq_DYRK1A.F5 refseq_DYRK1A.R5 101 273
NCBIGene 36.3 1859
Alternative 3-prime, size difference: 172
Exclusion in the protein causing a frameshift
Reference transcript: NM_101395
cd S_TKc 241aa 9e-54
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
smart S_TKc 230aa 2e-52
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
smart S_TKc 28aa 7e-04
in ref transcript
PTZ PTZ00284 225aa 4e-30
in ref transcript
protein kinase; Provisional.
DYRK2
refseq_DYRK2.F1 refseq_DYRK2.R1 198 347
NCBIGene 36.3 8445
Single exon skipping, size difference: 149
Exclusion in the protein causing a frameshift
Reference transcript: NM_006482
Changed!
cd S_TKc 237aa 4e-60
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
Changed!
cd STKc_Cdc7_like 30aa 3e-04
in ref transcript
Serine/threonine kinases (STKs), cell division control protein 7 (Cdc7)-like subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The Cdc7-like subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. Members of this subfamily include Schizosaccharomyces pombe Cdc7, Saccharomyces cerevisiae Cdc15, Arabidopsis thaliana mitogen-activated protein kinase (MAPK) kinase kinase (MAPKKK) epsilon, and related proteins. MAPKKKs phosphorylate and activate MAPK kinases (MAPKKs or MKKs or MAP2Ks), which in turn phosphorylate and activate MAPKs during signaling cascades that are important in mediating cellular responses to extracellular signals. Fission yeast Cdc7 is essential for cell division by playing a key role in the initiation of septum formation and cytokinesis. Budding yeast Cdc15 functions to coordinate mitotic exit with cytokinesis. Arabidopsis MAPKKK epsilon is required for pollen development in the plasma membrane.
Changed!
smart S_TKc 304aa 2e-63
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
Changed!
COG SPS1 335aa 6e-31
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
DYRK3
refseq_DYRK3.F1 refseq_DYRK3.R1 260 354
NCBIGene 36.3 8444
Single exon skipping, size difference: 94
Inclusion in the protein causing a frameshift
Reference transcript: NM_003582
Changed!
cd S_TKc 315aa 1e-60
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
Changed!
smart S_TKc 304aa 8e-65
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
Changed!
COG SPS1 315aa 4e-28
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
DYX1C1
refseq_DYX1C1.F1 refseq_DYX1C1.R1 181 287
NCBIGene 36.3 161582
Single exon skipping, size difference: 106
Exclusion in the protein causing a frameshift
Reference transcript: NM_130810
cd p23_DYX1C1_like 78aa 3e-31
in ref transcript
p23_like domain found in proteins similar to dyslexia susceptibility 1 (DYX1) candidate 1 (C1) protein, DYX1C1. The human gene encoding this protein is a positional candidate gene for developmental dyslexia (DD), it is located on 15q21.3 by the DYX1 DD susceptibility locus (15q15-21). Independent association studies have reported conflicting results. However, association of short-term memory, which plays a role in DD, with a variant within the DYX1C1 gene has been reported. Most proteins belonging to this group contain a C-terminal tetratricopeptide repeat (TPR) protein binding region.
Changed!
cd TPR 106aa 5e-05
in ref transcript
Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
pfam CS 70aa 7e-12
in ref transcript
CS domain. The CS and CHORD (pfam04968) are fused into a single polypeptide chain in metazoans but are found in separate proteins in plants; this is thought to be indicative of an interaction between CS and CHORD. It has been suggested that the CS domain is a binding module for HSP90, implying that CS domain-containing proteins are involved in recruiting heat shock proteins to multiprotein assemblies.
Changed!
TIGR 3a0801s09 121aa 4e-06
in ref transcript
Changed!
TIGR 3a0801s09 75aa 0.009
in modified transcript
DZIP1
refseq_DZIP1.F1 refseq_DZIP1.R1 148 205
NCBIGene 36.3 22873
Single exon skipping, size difference: 57
Exclusion in the protein (no frameshift)
Reference transcript: NM_198968
EBF2
refseq_EBF2.F1 refseq_EBF2.R1 109 178
NCBIGene 36.2 64641
Single exon skipping, size difference: 69
Exclusion in the protein (no frameshift)
Reference transcript: XM_927203
cd IPT_COE 85aa 3e-43
in ref transcript
IPT domain of the COE family (Col/Olf-1/EBF) of non-basic, helix-loop-helix (HLH)-containing transcription factors. COE family proteins are all transcription factors and play an important role in variety of developmental processes. Mouse EBF is involved in the regulation of the early stages of B-cell differentiation, Drosophila collier is a regulator of the head patterning, and a related protein in Xenopus is involved in primary neurogenesis. All COE family members have a well conserved DNA binding domain that contains an atypical Zn finger motif. The function of the IPT domain is unknown.
pfam TIG 83aa 5e-09
in ref transcript
IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: This family does not currently recognise a significant number of members.
ECM1
refseq_ECM1.F2 refseq_ECM1.R2 100 475
NCBIGene 36.3 1893
Single exon skipping, size difference: 375
Exclusion in the protein (no frameshift)
Reference transcript: NM_004425
Changed!
pfam ECM1 279aa 1e-120
in ref transcript
Extracellular matrix protein 1 (ECM1). This family consists of several eukaryotic extracellular matrix protein 1 (ECM1) sequences. ECM1 has been shown to regulate endochondral bone formation, stimulate the proliferation of endothelial cells and induce angiogenesis. Mutations in the ECM1 gene can cause lipoid proteinosis, a disorder which causes generalised thickening of skin, mucosae and certain viscera. Classical features include beaded eyelid papules and laryngeal infiltration leading to hoarseness.
Changed!
pfam ECM1 242aa 1e-97
in ref transcript
Changed!
pfam ECM1 415aa 0.0
in modified transcript
EFEMP1
refseq_EFEMP1.F1 refseq_EFEMP1.R1 165 206
NCBIGene 36.3 2202
Single exon skipping, size difference: 41
Exclusion in 5'UTR
Reference transcript: NM_001039348
cd vWA_Matrilin 40aa 2e-04
in ref transcript
VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
cd vWA_Matrilin 42aa 0.001
in ref transcript
cd vWA_Matrilin 33aa 0.004
in ref transcript
cd vWA_Matrilin 40aa 0.004
in ref transcript
pfam EGF_CA 34aa 3e-06
in ref transcript
Calcium binding EGF domain.
smart EGF_CA 31aa 4e-05
in ref transcript
Calcium-binding EGF-like domain.
smart EGF_CA 40aa 0.002
in ref transcript
EFNA1
refseq_EFNA1.F1 refseq_EFNA1.R1 320 386
NCBIGene 36.3 1942
Single exon skipping, size difference: 66
Exclusion in the protein (no frameshift)
Reference transcript: NM_004428
Changed!
pfam Ephrin 135aa 4e-61
in ref transcript
Ephrin.
Changed!
pfam Ephrin 119aa 4e-53
in modified transcript
EFNA4
refseq_EFNA4.F1 refseq_EFNA4.R1 143 289
NCBIGene 36.3 1945
Alternative 3-prime, size difference: 146
Exclusion of the stop codon
Reference transcript: NM_005227
pfam Ephrin 131aa 3e-43
in ref transcript
Ephrin.
EFS
refseq_EFS.F1 refseq_EFS.R1 101 380
NCBIGene 36.3 10278
Single exon skipping, size difference: 279
Exclusion in the protein (no frameshift)
Reference transcript: NM_005864
Changed!
cd SH3 56aa 9e-11
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
Changed!
smart SH3 57aa 6e-13
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
EGFLAM
refseq_EGFLAM.F1 refseq_EGFLAM.R1 199 389
NCBIGene 36.3 133584
Single exon skipping, size difference: 190
Exclusion in the protein causing a frameshift
Reference transcript: NM_152403
Changed!
cd LamG 154aa 5e-26
in ref transcript
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
Changed!
cd LamG 138aa 9e-25
in ref transcript
Changed!
cd LamG 151aa 3e-23
in ref transcript
cd FN3 98aa 3e-13
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd FN3 93aa 2e-12
in ref transcript
Changed!
cd EGF_CA 29aa 1e-04
in ref transcript
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Changed!
cd EGF_CA 43aa 2e-04
in ref transcript
Changed!
pfam Laminin_G_2 126aa 6e-29
in ref transcript
Laminin G domain. This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.
Changed!
smart LamG 133aa 4e-27
in ref transcript
Laminin G domain.
Changed!
smart LamG 135aa 2e-22
in ref transcript
pfam fn3 87aa 8e-13
in ref transcript
Fibronectin type III domain.
pfam fn3 91aa 1e-09
in ref transcript
Changed!
pfam EGF 28aa 7e-04
in ref transcript
EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.
Changed!
pfam EGF 32aa 0.004
in ref transcript
Changed!
cd LamG 112aa 3e-15
in modified transcript
Changed!
smart LamG 91aa 2e-12
in modified transcript
EHMT2
refseq_EHMT2.F1 refseq_EHMT2.R1 117 219
NCBIGene 36.3 10919
Single exon skipping, size difference: 102
Exclusion in the protein (no frameshift)
Reference transcript: NM_006709
cd ANK 120aa 3e-29
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
cd ANK 126aa 2e-28
in ref transcript
pfam SET 127aa 2e-36
in ref transcript
SET domain. SET domains are protein lysine methyltransferase enzymes. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains have been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as SET-N and SET-C. SET-C forms an unusual and conserved knot-like structure of probably functional importance. Additionally to SET-N and SET-C, an insert region (SET-I) and flanking regions of high structural variability form part of the overall structure.
smart PreSET 100aa 4e-27
in ref transcript
N-terminal to some SET domains. A Cys-rich putative Zn2+-binding domain that occurs N-terminal to some SET domains. Function is unknown. Unpublished.
TIGR trp 177aa 1e-08
in ref transcript
after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
pfam Ank 31aa 4e-06
in ref transcript
Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
COG COG2940 242aa 3e-20
in ref transcript
Proteins containing SET domain [General function prediction only].
COG Arp 156aa 1e-17
in ref transcript
FOG: Ankyrin repeat [General function prediction only].
COG Arp 158aa 1e-13
in ref transcript
EI24
refseq_EI24.F1 refseq_EI24.R1 159 271
NCBIGene 36.3 9538
Single exon skipping, size difference: 112
Inclusion in the protein causing a frameshift
Reference transcript: NM_001007277
Changed!
pfam EI24 154aa 6e-44
in ref transcript
Etoposide-induced protein 2.4 (EI24). This family contains a number of eukaryotic etoposide-induced 2.4 (EI24) proteins approximately 350 residues long. In cells treated with the cytotoxic drug etoposide, EI24 is induced by p53. It has been suggested to play an important role in negative cell growth control.
Changed!
pfam EI24 212aa 2e-66
in modified transcript
EIF2C3
refseq_EIF2C3.F1 refseq_EIF2C3.R1 187 324
NCBIGene 36.3 192669
Single exon skipping, size difference: 137
Exclusion in the protein causing a frameshift
Reference transcript: NM_024852
Changed!
cd Piwi_ago-like 426aa 1e-173
in ref transcript
Piwi_ago-like: PIWI domain, Argonaute-like subfamily. Argonaute is the central component of the RNA-induced silencing complex (RISC) and related complexes. The PIWI domain is the C-terminal portion of Argonaute and consists of two subdomains, one of which provides the 5' anchoring of the guide RNA and the other, the catalytic site for slicing.
Changed!
cd PAZ_argonaute_like 121aa 1e-32
in ref transcript
PAZ domain, argonaute_like subfamily. Argonaute is part of the RNA-induced silencing complex (RISC), and is an endonuclease that plays a key role in the RNA interference pathway. The PAZ domain has been named after the proteins Piwi,Argonaut, and Zwille. PAZ is found in two families of proteins that are essential components of RNA-mediated gene-silencing pathways, including RNA interference, the Piwi and Dicer families. PAZ functions as a nucleic acid binding domain, with a strong preference for single-stranded nucleic acids (RNA or DNA) or RNA duplexes with single-stranded 3' overhangs. It has been suggested that the PAZ domain provides a unique mode for the recognition of the two 3'-terminal nucleotides in single-stranded nucleic acids and buries the 3' OH group, and that it might recognize characteristic 3' overhangs in siRNAs within RISC (RNA-induced silencing) and other complexes.
Changed!
pfam Piwi 302aa 1e-103
in ref transcript
Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is the dsRNA guided hydrolysis of ssRNA. Determination of the crystal structure of Argonaute reveals that PIWI is an RNase H domain, and identifies Argonaute as Slicer, the enzyme that cleaves mRNA in the RNAi RISC complex. In addition, Mg+2 dependence and production of 3'-OH and 5' phosphate products are shared characteristics of RNaseH and RISC. The PIWI domain core has a tertiary structure belonging to the RNase H family of enzymes. RNase H fold proteins all have a five-stranded mixed beta-sheet surrounded by helices. By analogy to RNase H enzymes which cleave single-stranded RNA guided by the DNA strand in an RNA/DNA hybrid, the PIWI domain can be inferred to cleave single-stranded RNA, for example mRNA, guided by double stranded siRNA.
Changed!
pfam PAZ 117aa 4e-32
in ref transcript
PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the Carpel factory protein. The function of the domains is unknown but has been suggested to mediate complex formation between proteins of the Piwi and Dicer families by hetero-dimerisation. The three-dimensional structure of this domain has been solved. The PAZ domain is composed of two subdomains. One subdomain is similar to the OB fold, albeit with a different topology. The OB-fold is well known as a single-stranded nucleic acid binding fold. The second subdomain is composed of a beta-hairpin followed by an alpha-helix. The PAZ domains shows low-affinity nucleic acid binding and appears to interact with the 3' ends of single-stranded regions of RNA in the cleft between the two subdomains. PAZ can bind the characteristic two-base 3' overhangs of siRNAs, indicating that although PAZ may not be a primary nucleic acid binding site in Dicer or RISC, it may contribute to the specific and productive incorporation of siRNAs and miRNAs into the RNAi pathway.
Changed!
pfam DUF1785 53aa 3e-16
in ref transcript
Domain of unknown function (DUF1785). This region is found in argonaute proteins and often co-occurs with pfam02179 and pfam02171.
EIF2C3
refseq_EIF2C3.F4 refseq_EIF2C3.R4 233 354
NCBIGene 36.3 192669
Single exon skipping, size difference: 121
Exclusion in the protein causing a frameshift
Reference transcript: NM_024852
Changed!
cd Piwi_ago-like 426aa 1e-173
in ref transcript
Piwi_ago-like: PIWI domain, Argonaute-like subfamily. Argonaute is the central component of the RNA-induced silencing complex (RISC) and related complexes. The PIWI domain is the C-terminal portion of Argonaute and consists of two subdomains, one of which provides the 5' anchoring of the guide RNA and the other, the catalytic site for slicing.
Changed!
cd PAZ_argonaute_like 121aa 1e-32
in ref transcript
PAZ domain, argonaute_like subfamily. Argonaute is part of the RNA-induced silencing complex (RISC), and is an endonuclease that plays a key role in the RNA interference pathway. The PAZ domain has been named after the proteins Piwi,Argonaut, and Zwille. PAZ is found in two families of proteins that are essential components of RNA-mediated gene-silencing pathways, including RNA interference, the Piwi and Dicer families. PAZ functions as a nucleic acid binding domain, with a strong preference for single-stranded nucleic acids (RNA or DNA) or RNA duplexes with single-stranded 3' overhangs. It has been suggested that the PAZ domain provides a unique mode for the recognition of the two 3'-terminal nucleotides in single-stranded nucleic acids and buries the 3' OH group, and that it might recognize characteristic 3' overhangs in siRNAs within RISC (RNA-induced silencing) and other complexes.
Changed!
pfam Piwi 302aa 1e-103
in ref transcript
Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is the dsRNA guided hydrolysis of ssRNA. Determination of the crystal structure of Argonaute reveals that PIWI is an RNase H domain, and identifies Argonaute as Slicer, the enzyme that cleaves mRNA in the RNAi RISC complex. In addition, Mg+2 dependence and production of 3'-OH and 5' phosphate products are shared characteristics of RNaseH and RISC. The PIWI domain core has a tertiary structure belonging to the RNase H family of enzymes. RNase H fold proteins all have a five-stranded mixed beta-sheet surrounded by helices. By analogy to RNase H enzymes which cleave single-stranded RNA guided by the DNA strand in an RNA/DNA hybrid, the PIWI domain can be inferred to cleave single-stranded RNA, for example mRNA, guided by double stranded siRNA.
Changed!
pfam PAZ 117aa 4e-32
in ref transcript
PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the Carpel factory protein. The function of the domains is unknown but has been suggested to mediate complex formation between proteins of the Piwi and Dicer families by hetero-dimerisation. The three-dimensional structure of this domain has been solved. The PAZ domain is composed of two subdomains. One subdomain is similar to the OB fold, albeit with a different topology. The OB-fold is well known as a single-stranded nucleic acid binding fold. The second subdomain is composed of a beta-hairpin followed by an alpha-helix. The PAZ domains shows low-affinity nucleic acid binding and appears to interact with the 3' ends of single-stranded regions of RNA in the cleft between the two subdomains. PAZ can bind the characteristic two-base 3' overhangs of siRNAs, indicating that although PAZ may not be a primary nucleic acid binding site in Dicer or RISC, it may contribute to the specific and productive incorporation of siRNAs and miRNAs into the RNAi pathway.
Changed!
pfam DUF1785 53aa 3e-16
in ref transcript
Domain of unknown function (DUF1785). This region is found in argonaute proteins and often co-occurs with pfam02179 and pfam02171.
EIF3C
refseq_EIF3S8.F1 refseq_EIF3S8.R1 189 238
NCBIGene 36.3 8663
Alternative 5-prime, size difference: 49
Exclusion in 5'UTR
Reference transcript: NM_001037808
pfam eIF-3c_N 389aa 1e-168
in ref transcript
Eukaryotic translation initiation factor 3 subunit 8 N-terminus (eIF3c_N). The largest of the mammalian translation initiation factors, eIF3, consists of at least eight subunits ranging in mass from 35 to 170 kDa. eIF3 binds to the 40 S ribosome in an early step of translation initiation and promotes the binding of methionyl-tRNAi and mRNA.
pfam eIF-3c_N 146aa 3e-52
in ref transcript
smart PAM 89aa 8e-16
in ref transcript
PCI/PINT associated module.
EIF3B
refseq_EIF3S9.F2 refseq_EIF3S9.R2 323 398
NCBIGene 36.3 8662
Alternative 3-prime, size difference: 75
Inclusion in 3'UTR
Reference transcript: NM_003751
cd RRM 53aa 3e-06
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
pfam eIF2A 196aa 4e-69
in ref transcript
Eukaryotic translation initiation factor eIF2A. This is a family of eukaryotic translation initiation factors.
smart RRM_2 73aa 1e-08
in ref transcript
RNA recognition motif.
COG COG5354 459aa 2e-79
in ref transcript
Uncharacterized protein, contains Trp-Asp (WD) repeat [General function prediction only].
EIF4G1
refseq_EIF4G1.F1 refseq_EIF4G1.R1 132 370
NCBIGene 36.3 1981
Multiple exon skipping, size difference: 238
Exclusion in 5'UTR, Exclusion of the protein initiation site, Exclusion in the protein (no frameshift)
Reference transcript: NM_198241
pfam MIF4G 229aa 3e-45
in ref transcript
MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.
pfam MA3 102aa 6e-27
in ref transcript
MA3 domain. Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.
pfam W2 62aa 5e-16
in ref transcript
eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors.
ELAVL3
refseq_ELAVL3.F1 refseq_ELAVL3.R1 119 140
NCBIGene 36.3 1995
Alternative 3-prime, size difference: 21
Exclusion in the protein (no frameshift)
Reference transcript: NM_001420
cd RRM 75aa 3e-18
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
cd RRM 75aa 1e-16
in ref transcript
cd RRM 77aa 1e-12
in ref transcript
Changed!
TIGR ELAV_HUD_SF 331aa 1e-151
in ref transcript
These proteins contain 3 RNA-recognition motifs (rrm: pfam00076).
COG COG0724 160aa 2e-14
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
COG COG0724 81aa 2e-11
in ref transcript
Changed!
TIGR ELAV_HUD_SF 324aa 1e-150
in modified transcript
Changed!
COG COG0724 271aa 6e-11
in modified transcript
ELF2
refseq_ELF2.F1 refseq_ELF2.R1 137 173
NCBIGene 36.3 1998
Alternative 3-prime, size difference: 36
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_201999
smart ETS 83aa 7e-39
in ref transcript
erythroblast transformation specific domain. variation of the helix-turn-helix motif.
ELMO2
refseq_ELMO2.F1 refseq_ELMO2.R1 132 236
NCBIGene 36.3 63916
Exon skipping and alternative 3-prime or 5-prime, size difference: 104
Exclusion in 5'UTR, Exclusion in 5'UTR
Reference transcript: NM_133171
cd PH_PLC 68aa 1e-04
in ref transcript
Phospholipase C (PLC) pleckstrin homology (PH) domain. There are several isozymes of PLC (beta, gamma, delta, epsilon. zeta). While, PLC beta, gamma and delta all have N-terminal PH domains, lipid binding specificity is not conserved between them. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
pfam ELMO_CED12 184aa 3e-49
in ref transcript
ELMO/CED-12 family. This family represents a conserved domain which is found in a number of eukaryotic proteins including CED-12, ELMO I and ELMO II. ELMO1 is a component of signalling pathways that regulate phagocytosis and cell migration and is the mammalian orthologue of the Caenorhabditis elegans gene, ced-12. CED-12 is required for the engulfment of dying cells and cell migration. In mammalian cells, ELMO1 interacts with Dock180 as part of the CrkII/Dock180/Rac pathway responsible for phagocytosis and cell migration. ELMO1 is ubiquitously expressed, although its expression is highest in the spleen, an organ rich in immune cells. ELMO1 has a PH domain and a polyproline sequence motif at its C terminus which are not present in this alignment.
EML1
refseq_EML1.F1 refseq_EML1.R1 129 186
NCBIGene 36.3 2009
Single exon skipping, size difference: 57
Exclusion in the protein (no frameshift)
Reference transcript: NM_001008707
cd WD40 389aa 3e-22
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
cd WD40 197aa 2e-09
in ref transcript
pfam HELP 78aa 6e-36
in ref transcript
HELP motif. The HELP (Hydrophobic ELP) motif is found in EMAP and EMAP-like proteins (ELPs). The HELP motif contains a predicted transmembrane helix so probably does not form a globular domain. It is also not clear if these proteins localise to membranes. A preliminary study has shown that the N terminus of Sea urchin EMAP containing HELP is sufficient for microtubule binding in vitro (Eichenmuller et al In press).
pfam WD40 49aa 0.009
in ref transcript
WD domain, G-beta repeat.
COG COG2319 404aa 2e-16
in ref transcript
FOG: WD40 repeat [General function prediction only].
EMR2
refseq_EMR2.F2 refseq_EMR2.R2 244 391
NCBIGene 36.3 30817
Single exon skipping, size difference: 147
Exclusion in the protein (no frameshift)
Reference transcript: NM_013447
cd EGF_CA 36aa 2e-05
in ref transcript
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Changed!
cd EGF_CA 37aa 2e-05
in ref transcript
Changed!
cd EGF_CA 34aa 5e-05
in ref transcript
cd EGF_CA 35aa 0.002
in ref transcript
pfam 7tm_2 253aa 9e-46
in ref transcript
7 transmembrane receptor (Secretin family). This family is known as Family B, the secretin-receptor family or family 2 of the G-protein-coupled receptors (GCPRs).They have been described in many animal species, but not in plants, fungi or prokaryotes. Three distinct sub-families are recognised. Subfamily B1 contains classical hormone receptors, such as receptors for secretin and glucagon, that are all involved in cAMP-mediated signalling pathways. Subfamily B2 contains receptors with long extracellular N-termini, such as the leukocyte cell-surface antigen CD97; calcium-independent receptors for latrotoxin, and brain-specific angiogenesis inhibitors amongst others. Subfamily B3 includes Methuselah and other Drosophila proteins. Other than the typical seven-transmembrane region, characteristic structural features include an amino-terminal extracellular domain involved in ligand binding, and an intracellular loop (IC3) required for specific G-protein coupling.
smart GPS 51aa 9e-12
in ref transcript
G-protein-coupled receptor proteolytic site domain. Present in latrophilin/CL-1, sea urchin REJ and polycystin.
Changed!
pfam EGF_CA 48aa 4e-10
in ref transcript
Calcium binding EGF domain.
Changed!
pfam EGF_CA 35aa 1e-09
in ref transcript
smart EGF_CA 33aa 3e-06
in ref transcript
Calcium-binding EGF-like domain.
pfam EGF_CA 51aa 3e-05
in ref transcript
Changed!
cd EGF_CA 37aa 9e-06
in modified transcript
Changed!
pfam EGF_CA 35aa 5e-10
in modified transcript
EMR2
refseq_EMR2.F3 refseq_EMR2.R3 175 208
NCBIGene 36.3 30817
Alternative 3-prime, size difference: 33
Exclusion in the protein (no frameshift)
Reference transcript: NM_013447
cd EGF_CA 36aa 2e-05
in ref transcript
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
cd EGF_CA 37aa 2e-05
in ref transcript
cd EGF_CA 34aa 5e-05
in ref transcript
cd EGF_CA 35aa 0.002
in ref transcript
pfam 7tm_2 253aa 9e-46
in ref transcript
7 transmembrane receptor (Secretin family). This family is known as Family B, the secretin-receptor family or family 2 of the G-protein-coupled receptors (GCPRs).They have been described in many animal species, but not in plants, fungi or prokaryotes. Three distinct sub-families are recognised. Subfamily B1 contains classical hormone receptors, such as receptors for secretin and glucagon, that are all involved in cAMP-mediated signalling pathways. Subfamily B2 contains receptors with long extracellular N-termini, such as the leukocyte cell-surface antigen CD97; calcium-independent receptors for latrotoxin, and brain-specific angiogenesis inhibitors amongst others. Subfamily B3 includes Methuselah and other Drosophila proteins. Other than the typical seven-transmembrane region, characteristic structural features include an amino-terminal extracellular domain involved in ligand binding, and an intracellular loop (IC3) required for specific G-protein coupling.
smart GPS 51aa 9e-12
in ref transcript
G-protein-coupled receptor proteolytic site domain. Present in latrophilin/CL-1, sea urchin REJ and polycystin.
pfam EGF_CA 48aa 4e-10
in ref transcript
Calcium binding EGF domain.
pfam EGF_CA 35aa 1e-09
in ref transcript
smart EGF_CA 33aa 3e-06
in ref transcript
Calcium-binding EGF-like domain.
pfam EGF_CA 51aa 3e-05
in ref transcript
EMR3
refseq_EMR3.F1 refseq_EMR3.R1 212 253
NCBIGene 36.2 84658
Exon skipping and alternative 3-prime or 5-prime, size difference: 41
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_032571
Changed!
cd EGF_CA 38aa 3e-05
in ref transcript
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Changed!
pfam 7tm_2 253aa 6e-50
in ref transcript
7 transmembrane receptor (Secretin family). This family is known as Family B, the secretin-receptor family or family 2 of the G-protein-coupled receptors (GCPRs).They have been described in many animal species, but not in plants, fungi or prokaryotes. Three distinct sub-families are recognised. Subfamily B1 contains classical hormone receptors, such as receptors for secretin and glucagon, that are all involved in cAMP-mediated signalling pathways. Subfamily B2 contains receptors with long extracellular N-termini, such as the leukocyte cell-surface antigen CD97; calcium-independent receptors for latrotoxin, and brain-specific angiogenesis inhibitors amongst others. Subfamily B3 includes Methuselah and other Drosophila proteins. Other than the typical seven-transmembrane region, characteristic structural features include an amino-terminal extracellular domain involved in ligand binding, and an intracellular loop (IC3) required for specific G-protein coupling.
Changed!
pfam GPS 46aa 2e-09
in ref transcript
Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin.
smart EGF_CA 35aa 2e-05
in ref transcript
Calcium-binding EGF-like domain.
ENAH
refseq_ENAH.F1 refseq_ENAH.R1 133 196
NCBIGene 36.3 55740
Single exon skipping, size difference: 63
Exclusion in the protein (no frameshift)
Reference transcript: NM_001008493
cd Ena-Vasp 110aa 5e-54
in ref transcript
Enabled-VASP-type homology (EVH1) domain. The EVH1 domain binds to other proteins at proline rich sequences. It is found in proteins involved in cytoskeletal reorganization such as Enabled and VASP. Ena-VASP type EVH1 domains specifically recognize FPPPP motifs in the focal adhesion proteins zyxin and vinculin, and the ActA surface protein of Listeria monocytogenes. It has a PH-like fold, despite having minimal sequence similarity to PH or PTB domains.
pfam WH1 107aa 2e-38
in ref transcript
WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mGluR5 bind a protein called homer, which is a WH1 domain homologue. A subset of WH1 domains has been termed a "EVH1" domain and appear to bind a polyproline motif.
pfam VASP_tetra 40aa 3e-13
in ref transcript
VASP tetramerisation domain. Vasodilator-stimulated phosphoprotein (VASP) is an actin cytoskeletal regulatory protein. This region corresponds to the tetramerisation domain which forms a right handed alpha helical coiled coil structure.
ENO3
refseq_ENO3.F1 refseq_ENO3.R1 109 151
NCBIGene 36.3 2027
Alternative 5-prime, size difference: 42
Exclusion in 5'UTR
Reference transcript: NM_001976
cd enolase 411aa 0.0
in ref transcript
Enolase: Enolases are homodimeric enzymes that catalyse the reversible dehydration of 2-phospho-D-glycerate to phosphoenolpyruvate as part of the glycolytic and gluconeogenesis pathways. The reaction is facilitated by the presence of metal ions.
Changed!
pfam Endosulfine 64aa 2e-05
in ref transcript
cAMP-regulated phosphoprotein/endosulfine conserved region. Conserved region found in both cAMP-regulated phosphoprotein 19 (ARPP-19) and Alpha/Beta endosulfine. No function has yet been assigned to ARPP-19. Endosulfine is the endogenous ligand for the ATP-dependent potassium (K ATP) channels which occupy a key position in the control of insulin release from the pancreatic beta cell by coupling cell polarity to metabolism. In both cases the region occupies the majority of the protein.
Changed!
pfam Endosulfine 48aa 1e-06
in modified transcript
ENTPD2
refseq_ENTPD2.F1 refseq_ENTPD2.R1 133 202
NCBIGene 36.3 954
Alternative 3-prime, size difference: 69
Exclusion in the protein (no frameshift)
Reference transcript: NM_203468
Changed!
pfam GDA1_CD39 419aa 2e-82
in ref transcript
GDA1/CD39 (nucleoside phosphatase) family.
COG COG5371 212aa 1e-12
in ref transcript
Golgi nucleoside diphosphatase [Carbohydrate transport and metabolism / Posttranslational modification, protein turnover, chaperones].
Changed!
pfam GDA1_CD39 396aa 9e-77
in modified transcript
EPB41
refseq_EPB41.F1 refseq_EPB41.R1 204 306
NCBIGene 36.3 2035
Single exon skipping, size difference: 102
Exclusion in the protein (no frameshift)
Reference transcript: NM_203343
cd FERM_C 94aa 1e-32
in ref transcript
The FERM_C domain is the third structural domain within the FERM domain. The FERM domain is found in the cytoskeletal-associated proteins such as ezrin, moesin, radixin, 4.1R, and merlin. These proteins provide a link between the membrane and cytoskeleton and are involved in signal transduction pathways. The FERM_C domain is also found in protein tyrosine phosphatases (PTPs) , the tryosine kinases FAKand JAK, in addition to other proteins involved in signaling. This domain is structuraly similar to the PH and PTB domains and consequently is capable of binding to both peptides and phospholipids at different sites.
smart B41 157aa 1e-40
in ref transcript
Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
Changed!
pfam 4_1_CTD 115aa 3e-26
in ref transcript
4.1 protein C-terminal domain (CTD). At the C-terminus of all known 4.1 proteins is a sequence domain unique to these proteins, known as the C-terminal domain (CTD). Mammalian CTDs are associated with a growing number of protein-protein interactions, although such activities have yet to be associated with invertebrate CTDs. Mammalian CTDs are generally defined by sequence alignment as encoded by exons 18-21. Comparison of known vertebrate 4.1 proteins with invertebrate 4.1 proteins indicates that mammalian 4.1 exon 19 represents a vertebrate adaptation that extends the sequence of the CTD with a Ser/Thr-rich sequence. The CTD was first described as a 22/24-kDa domain by chymotryptic digestion of erythrocyte 4.1 (4.1R). CTD is thought to represent an independent folding structure which has gained function since the divergence of vertebrates from invertebrates.
pfam FERM_C 85aa 2e-20
in ref transcript
FERM C-terminal PH-like domain.
pfam SAB 47aa 5e-14
in ref transcript
SAB domain. This presumed domain is found in proteins containing FERM domains pfam00373. This domain is found to bind to both spectrin and actin, hence the name SAB (Spectrin and Actin Binding) domain.
pfam FA 45aa 5e-13
in ref transcript
FERM adjacent (FA). This region is found adjacent to Band 4.1 / FERM domains (pfam00373) in a subset of FERM containing protein. The region has been hypothesised to play a role in regulatory adaptation, based on similarity to other protein kinase substrates.
Changed!
pfam 4_1_CTD 81aa 7e-29
in modified transcript
EPB41
refseq_EPB41.F3 refseq_EPB41.R3 313 393
NCBIGene 36.3 2035
Single exon skipping, size difference: 80
Inclusion in the protein causing a new stop codon
Reference transcript: NM_203343
Changed!
cd FERM_C 94aa 1e-32
in ref transcript
The FERM_C domain is the third structural domain within the FERM domain. The FERM domain is found in the cytoskeletal-associated proteins such as ezrin, moesin, radixin, 4.1R, and merlin. These proteins provide a link between the membrane and cytoskeleton and are involved in signal transduction pathways. The FERM_C domain is also found in protein tyrosine phosphatases (PTPs) , the tryosine kinases FAKand JAK, in addition to other proteins involved in signaling. This domain is structuraly similar to the PH and PTB domains and consequently is capable of binding to both peptides and phospholipids at different sites.
Changed!
smart B41 157aa 1e-40
in ref transcript
Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
Changed!
pfam 4_1_CTD 115aa 3e-26
in ref transcript
4.1 protein C-terminal domain (CTD). At the C-terminus of all known 4.1 proteins is a sequence domain unique to these proteins, known as the C-terminal domain (CTD). Mammalian CTDs are associated with a growing number of protein-protein interactions, although such activities have yet to be associated with invertebrate CTDs. Mammalian CTDs are generally defined by sequence alignment as encoded by exons 18-21. Comparison of known vertebrate 4.1 proteins with invertebrate 4.1 proteins indicates that mammalian 4.1 exon 19 represents a vertebrate adaptation that extends the sequence of the CTD with a Ser/Thr-rich sequence. The CTD was first described as a 22/24-kDa domain by chymotryptic digestion of erythrocyte 4.1 (4.1R). CTD is thought to represent an independent folding structure which has gained function since the divergence of vertebrates from invertebrates.
Changed!
pfam FERM_C 85aa 2e-20
in ref transcript
FERM C-terminal PH-like domain.
Changed!
pfam SAB 47aa 5e-14
in ref transcript
SAB domain. This presumed domain is found in proteins containing FERM domains pfam00373. This domain is found to bind to both spectrin and actin, hence the name SAB (Spectrin and Actin Binding) domain.
Changed!
pfam FA 45aa 5e-13
in ref transcript
FERM adjacent (FA). This region is found adjacent to Band 4.1 / FERM domains (pfam00373) in a subset of FERM containing protein. The region has been hypothesised to play a role in regulatory adaptation, based on similarity to other protein kinase substrates.
EPB41
refseq_EPB41.F4 refseq_EPB41.R4 253 358
NCBIGene 36.3 2035
Single exon skipping, size difference: 105
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_203343
cd FERM_C 94aa 1e-32
in ref transcript
The FERM_C domain is the third structural domain within the FERM domain. The FERM domain is found in the cytoskeletal-associated proteins such as ezrin, moesin, radixin, 4.1R, and merlin. These proteins provide a link between the membrane and cytoskeleton and are involved in signal transduction pathways. The FERM_C domain is also found in protein tyrosine phosphatases (PTPs) , the tryosine kinases FAKand JAK, in addition to other proteins involved in signaling. This domain is structuraly similar to the PH and PTB domains and consequently is capable of binding to both peptides and phospholipids at different sites.
Changed!
smart B41 157aa 1e-40
in ref transcript
Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
pfam 4_1_CTD 115aa 3e-26
in ref transcript
4.1 protein C-terminal domain (CTD). At the C-terminus of all known 4.1 proteins is a sequence domain unique to these proteins, known as the C-terminal domain (CTD). Mammalian CTDs are associated with a growing number of protein-protein interactions, although such activities have yet to be associated with invertebrate CTDs. Mammalian CTDs are generally defined by sequence alignment as encoded by exons 18-21. Comparison of known vertebrate 4.1 proteins with invertebrate 4.1 proteins indicates that mammalian 4.1 exon 19 represents a vertebrate adaptation that extends the sequence of the CTD with a Ser/Thr-rich sequence. The CTD was first described as a 22/24-kDa domain by chymotryptic digestion of erythrocyte 4.1 (4.1R). CTD is thought to represent an independent folding structure which has gained function since the divergence of vertebrates from invertebrates.
pfam FERM_C 85aa 2e-20
in ref transcript
FERM C-terminal PH-like domain.
pfam SAB 47aa 5e-14
in ref transcript
SAB domain. This presumed domain is found in proteins containing FERM domains pfam00373. This domain is found to bind to both spectrin and actin, hence the name SAB (Spectrin and Actin Binding) domain.
pfam FA 45aa 5e-13
in ref transcript
FERM adjacent (FA). This region is found adjacent to Band 4.1 / FERM domains (pfam00373) in a subset of FERM containing protein. The region has been hypothesised to play a role in regulatory adaptation, based on similarity to other protein kinase substrates.
Changed!
smart B41 192aa 2e-55
in modified transcript
EPB41
refseq_EPB41.F5 refseq_EPB41.R5 96 113
NCBIGene 36.3 2035
Alternative 3-prime, size difference: 17
Exclusion of the protein initiation site
Reference transcript: NM_203343
cd FERM_C 94aa 1e-32
in ref transcript
The FERM_C domain is the third structural domain within the FERM domain. The FERM domain is found in the cytoskeletal-associated proteins such as ezrin, moesin, radixin, 4.1R, and merlin. These proteins provide a link between the membrane and cytoskeleton and are involved in signal transduction pathways. The FERM_C domain is also found in protein tyrosine phosphatases (PTPs) , the tryosine kinases FAKand JAK, in addition to other proteins involved in signaling. This domain is structuraly similar to the PH and PTB domains and consequently is capable of binding to both peptides and phospholipids at different sites.
smart B41 157aa 1e-40
in ref transcript
Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
pfam 4_1_CTD 115aa 3e-26
in ref transcript
4.1 protein C-terminal domain (CTD). At the C-terminus of all known 4.1 proteins is a sequence domain unique to these proteins, known as the C-terminal domain (CTD). Mammalian CTDs are associated with a growing number of protein-protein interactions, although such activities have yet to be associated with invertebrate CTDs. Mammalian CTDs are generally defined by sequence alignment as encoded by exons 18-21. Comparison of known vertebrate 4.1 proteins with invertebrate 4.1 proteins indicates that mammalian 4.1 exon 19 represents a vertebrate adaptation that extends the sequence of the CTD with a Ser/Thr-rich sequence. The CTD was first described as a 22/24-kDa domain by chymotryptic digestion of erythrocyte 4.1 (4.1R). CTD is thought to represent an independent folding structure which has gained function since the divergence of vertebrates from invertebrates.
pfam FERM_C 85aa 2e-20
in ref transcript
FERM C-terminal PH-like domain.
pfam SAB 47aa 5e-14
in ref transcript
SAB domain. This presumed domain is found in proteins containing FERM domains pfam00373. This domain is found to bind to both spectrin and actin, hence the name SAB (Spectrin and Actin Binding) domain.
pfam FA 45aa 5e-13
in ref transcript
FERM adjacent (FA). This region is found adjacent to Band 4.1 / FERM domains (pfam00373) in a subset of FERM containing protein. The region has been hypothesised to play a role in regulatory adaptation, based on similarity to other protein kinase substrates.
EPB41L1
refseq_EPB41L1.F2 refseq_EPB41L1.R2 186 222
NCBIGene 36.3 2036
Single exon skipping, size difference: 36
Exclusion in the protein (no frameshift)
Reference transcript: NM_012156
cd FERM_C 94aa 1e-31
in ref transcript
The FERM_C domain is the third structural domain within the FERM domain. The FERM domain is found in the cytoskeletal-associated proteins such as ezrin, moesin, radixin, 4.1R, and merlin. These proteins provide a link between the membrane and cytoskeleton and are involved in signal transduction pathways. The FERM_C domain is also found in protein tyrosine phosphatases (PTPs) , the tryosine kinases FAKand JAK, in addition to other proteins involved in signaling. This domain is structuraly similar to the PH and PTB domains and consequently is capable of binding to both peptides and phospholipids at different sites.
smart B41 190aa 1e-54
in ref transcript
Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
pfam 4_1_CTD 93aa 1e-24
in ref transcript
4.1 protein C-terminal domain (CTD). At the C-terminus of all known 4.1 proteins is a sequence domain unique to these proteins, known as the C-terminal domain (CTD). Mammalian CTDs are associated with a growing number of protein-protein interactions, although such activities have yet to be associated with invertebrate CTDs. Mammalian CTDs are generally defined by sequence alignment as encoded by exons 18-21. Comparison of known vertebrate 4.1 proteins with invertebrate 4.1 proteins indicates that mammalian 4.1 exon 19 represents a vertebrate adaptation that extends the sequence of the CTD with a Ser/Thr-rich sequence. The CTD was first described as a 22/24-kDa domain by chymotryptic digestion of erythrocyte 4.1 (4.1R). CTD is thought to represent an independent folding structure which has gained function since the divergence of vertebrates from invertebrates.
pfam FERM_C 89aa 1e-22
in ref transcript
FERM C-terminal PH-like domain.
pfam FA 47aa 9e-13
in ref transcript
FERM adjacent (FA). This region is found adjacent to Band 4.1 / FERM domains (pfam00373) in a subset of FERM containing protein. The region has been hypothesised to play a role in regulatory adaptation, based on similarity to other protein kinase substrates.
Changed!
pfam SAB 44aa 4e-09
in ref transcript
SAB domain. This presumed domain is found in proteins containing FERM domains pfam00373. This domain is found to bind to both spectrin and actin, hence the name SAB (Spectrin and Actin Binding) domain.
Changed!
pfam SAB 43aa 2e-08
in modified transcript
EPB41L1
refseq_EPB41L1.F3 refseq_EPB41L1.R3 117 201
NCBIGene 36.3 2036
Single exon skipping, size difference: 84
Exclusion in the protein (no frameshift)
Reference transcript: NM_012156
cd FERM_C 94aa 1e-31
in ref transcript
The FERM_C domain is the third structural domain within the FERM domain. The FERM domain is found in the cytoskeletal-associated proteins such as ezrin, moesin, radixin, 4.1R, and merlin. These proteins provide a link between the membrane and cytoskeleton and are involved in signal transduction pathways. The FERM_C domain is also found in protein tyrosine phosphatases (PTPs) , the tryosine kinases FAKand JAK, in addition to other proteins involved in signaling. This domain is structuraly similar to the PH and PTB domains and consequently is capable of binding to both peptides and phospholipids at different sites.
smart B41 190aa 1e-54
in ref transcript
Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
pfam 4_1_CTD 93aa 1e-24
in ref transcript
4.1 protein C-terminal domain (CTD). At the C-terminus of all known 4.1 proteins is a sequence domain unique to these proteins, known as the C-terminal domain (CTD). Mammalian CTDs are associated with a growing number of protein-protein interactions, although such activities have yet to be associated with invertebrate CTDs. Mammalian CTDs are generally defined by sequence alignment as encoded by exons 18-21. Comparison of known vertebrate 4.1 proteins with invertebrate 4.1 proteins indicates that mammalian 4.1 exon 19 represents a vertebrate adaptation that extends the sequence of the CTD with a Ser/Thr-rich sequence. The CTD was first described as a 22/24-kDa domain by chymotryptic digestion of erythrocyte 4.1 (4.1R). CTD is thought to represent an independent folding structure which has gained function since the divergence of vertebrates from invertebrates.
pfam FERM_C 89aa 1e-22
in ref transcript
FERM C-terminal PH-like domain.
pfam FA 47aa 9e-13
in ref transcript
FERM adjacent (FA). This region is found adjacent to Band 4.1 / FERM domains (pfam00373) in a subset of FERM containing protein. The region has been hypothesised to play a role in regulatory adaptation, based on similarity to other protein kinase substrates.
pfam SAB 44aa 4e-09
in ref transcript
SAB domain. This presumed domain is found in proteins containing FERM domains pfam00373. This domain is found to bind to both spectrin and actin, hence the name SAB (Spectrin and Actin Binding) domain.
EPHA5
refseq_EPHA5.F1 refseq_EPHA5.R1 187 253
NCBIGene 36.3 2044
Single exon skipping, size difference: 66
Exclusion in the protein (no frameshift)
Reference transcript: NM_004439
cd PTKc_EphR_A 267aa 1e-169
in ref transcript
Catalytic Domain of the Protein Tyrosine Kinases, Class EphA Ephrin Receptors. Protein Tyrosine Kinase (PTK) family; Ephrin Receptor (EphR) subfamily; most class EphA receptors including EphA3, EphA4, EphA5, and EphA7, but excluding EphA1, EphA2 and EphA10; catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. EphRs comprise the largest subfamily of receptor tyr kinases (RTKs). In general, class EphA receptors bind GPI-anchored ephrin-A ligands. There are ten vertebrate EphA receptors (EphA1-10), which display promiscuous interactions with six ephrin-A ligands. One exception is EphA4, which also binds ephrins-B2/B3. EphRs contain an ephrin-binding domain and two fibronectin repeats extracellularly, a transmembrane segment, and a cytoplasmic tyr kinase domain. Binding of the ephrin ligand to EphR requires cell-cell contact since both are anchored to the plasma membrane. The resulting downstream signals occur bidirectionally in both EphR-expressing cells (forward signaling) and ephrin-expressing cells (reverse signaling). Ephrin/EphR interaction mainly results in cell-cell repulsion or adhesion, making it important in neural development and plasticity, cell morphogenesis, cell-fate determination, embryonic development, tissue patterning, and angiogenesis. EphARs and ephrin-A ligands are expressed in multiple areas of the developing brain, especially in the retina and tectum. They are part of a system controlling retinotectal mapping.
cd SAM 60aa 7e-16
in ref transcript
Sterile alpha motif.; Widespread domain in signalling and nuclear proteins. In EPH-related tyrosine kinases, appears to mediate cell-cell initiated signal transduction via the binding of SH2-containing proteins to a conserved tyrosine that is phosphorylated. In many cases mediates homodimerization.
cd FN3 91aa 9e-14
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd FN3 95aa 3e-05
in ref transcript
pfam Pkinase_Tyr 258aa 1e-114
in ref transcript
Protein tyrosine kinase.
pfam Ephrin_lbd 173aa 6e-97
in ref transcript
Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand.
pfam SAM_1 61aa 6e-19
in ref transcript
SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as a protein interaction module through its ability to homo- and heterooligomerise with other SAM domains.
smart FN3 81aa 1e-12
in ref transcript
Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
pfam fn3 90aa 2e-07
in ref transcript
Fibronectin type III domain.
COG SPS1 295aa 3e-23
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
EPN2
refseq_EPN2.F1 refseq_EPN2.R1 213 384
NCBIGene 36.3 22905
Single exon skipping, size difference: 171
Exclusion in the protein (no frameshift)
Reference transcript: NM_014964
cd ENTH_epsin 123aa 2e-61
in ref transcript
ENTH domain, Epsin family; The epsin (Eps15 interactor) N-terminal homology (ENTH) domain is an evolutionarily conserved protein module found primarily in proteins that participate in clathrin-mediated endocytosis. A set of proteins previously designated as harboring an ENTH domain in fact contains a highly similar, yet unique module referred to as an AP180 N-terminal homology (ANTH) domain. ENTH and ANTH (E/ANTH) domains are structurally similar to the VHS domain and are composed of a superhelix of eight alpha helices. E/ANTH domains bind both inositol phospholipids and proteins and contribute to the nucleation and formation of clathrin coats on membranes. ENTH domains also function in the development of membrane curvature through lipid remodeling during the formation of clathrin-coated vesicles. E/ANTH-bearing proteins have recently been shown to function with adaptor protein-1 and GGA adaptors at the trans-Golgi network, which suggests that E/ANTH domains are universal components of the machinery for clathrin-mediated membrane budding.
pfam ENTH 124aa 1e-62
in ref transcript
ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown.
EPS8L3
refseq_EPS8L3.F1 refseq_EPS8L3.R1 129 219
NCBIGene 36.3 79574
Alternative 3-prime, size difference: 90
Exclusion in the protein (no frameshift)
Reference transcript: NM_139053
cd EPS8 126aa 4e-46
in ref transcript
Epidermal growth factor receptor kinase substrate (EPS8) Phosphotyrosine-binding (PTB) domain. EPS8 is a regulator of Rac signaling. It consists of a PTB and an SH3 domain. PTB domains have a PH-like fold and are found in various eukaryotic signaling molecules. They were initially identified based upon their ability to recognize phosphorylated tyrosine residues. In contrast to SH2 domains, which recognize phosphotyrosine and adjacent carboxy-terminal residues, PTB-domain binding specificity is conferred by residues amino-terminal to the phosphotyrosine. More recent studies have found that some types of PTB domains can bind to peptides which are not tyrosine phosphorylated or lack tyrosine residues altogether.
cd SH3 53aa 1e-09
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
pfam PTB 120aa 2e-37
in ref transcript
Phosphotyrosine-binding domain. The phosphotyrosine-binding domain (PTB, also phosphotyrosine-interaction or PI domain) in the protein tensin tends to be found at the C-terminus. Tensin is a multi-domain protein that binds to actin filaments and functions as a focal-adhesion molecule (focal adhesions are regions of plasma membrane through which cells attach to the extracellular matrix). Human tensin has actin-binding sites, an SH2 (pfam00017) domain and a region similar to the tumour suppressor PTEN. The PTB domain interacts with the cytoplasmic tails of beta integrin by binding to an NPXY motif.
smart SH3 57aa 3e-11
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
EPSTI1
refseq_EPSTI1.F2 refseq_EPSTI1.R2 164 196
NCBIGene 36.3 94240
Single exon skipping, size difference: 32
Exclusion in the protein causing a frameshift
Reference transcript: NM_001002264
ERBB2IP
refseq_ERBB2IP.F1 refseq_ERBB2IP.R1 178 385
NCBIGene 36.3 55914
Single exon skipping, size difference: 207
Exclusion in the protein (no frameshift)
Reference transcript: NM_018695
cd PDZ_signaling 60aa 3e-09
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
cd LRR_RI 162aa 2e-04
in ref transcript
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
Changed!
smart PDZ 90aa 4e-10
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
COG COG4886 348aa 3e-18
in ref transcript
Leucine-rich repeat (LRR) protein [Function unknown].
Changed!
smart PDZ 90aa 6e-10
in modified transcript
ERG
refseq_ERG.F1 refseq_ERG.R1 100 172
NCBIGene 36.3 2078
Single exon skipping, size difference: 72
Exclusion in the protein (no frameshift)
Reference transcript: NM_182918
pfam Ets 84aa 1e-36
in ref transcript
Ets-domain.
smart SAM_PNT 83aa 1e-23
in ref transcript
SAM / Pointed domain. A subfamily of the SAM domain.
ERP29
refseq_ERP29.F1 refseq_ERP29.R1 187 326
NCBIGene 36.3 10961
Single exon skipping, size difference: 139
Exclusion in the protein causing a frameshift
Reference transcript: NM_006817
Changed!
cd PDI_a_ERp29_N 114aa 3e-46
in ref transcript
PDIa family, endoplasmic reticulum protein 29 (ERp29) subfamily; ERp29 is a ubiquitous ER-resident protein expressed in high levels in secretory cells. It forms homodimers and higher oligomers in vitro and in vivo. It contains a redox inactive TRX-like domain at the N-terminus, which is homologous to the redox active TRX (a) domains of PDI, and a C-terminal helical domain similar to the C-terminal domain of P5. The expression profile of ERp29 suggests a role in secretory protein production distinct from that of PDI. It has also been identified as a member of the thyroglobulin folding complex. The Drosophila homolog, Wind, is the product of windbeutel, an essential gene in the development of dorsal-ventral patterning. Wind is required for correct targeting of Pipe, a Golgi-resident type II transmembrane protein with homology to 2-O-sulfotransferase.
Changed!
cd ERp29c 94aa 3e-20
in ref transcript
ERp29 and ERp38, C-terminal domain; composed of the protein disulfide isomerase (PDI)-like proteins ERp29 and ERp38. ERp29 (also called ERp28) is a ubiquitous endoplasmic reticulum (ER)-resident protein expressed in high levels in secretory cells. It contains a redox inactive TRX-like domain at the N-terminus. The expression profile of ERp29 suggests a role in secretory protein production, distinct from that of PDI. It has also been identified as a member of the thyroglobulin folding complex and is essential in regulating the secretion of thyroglobulin. The Drosophila homolog, Wind, is the product of windbeutel, an essential gene in the development of dorsal-ventral patterning. Wind is required for correct targeting of Pipe, a Golgi-resident type II transmembrane protein with homology to 2-O-sulfotransferase. ERp38 is a P5-like protein, first isolated from alfalfa (the cDNA clone was named G1), which contains two redox active TRX domains at the N-terminus, like human P5. However, unlike human P5, ERp38 also contains a C-terminal domain with homology to the C-terminal domain of ERp29. It may be a glucose-regulated protein. The function of the all-helical C-terminal domain of ERp29 and ERp38 remains unclear. The C-terminal domain of Wind is thought to provide a distinct site required for interaction with its substrate, Pipe.
Changed!
pfam ERp29_N 123aa 1e-48
in ref transcript
ERp29, N-terminal domain. ERp29 is a ubiquitously expressed endoplasmic reticulum protein, and is involved in the processes of protein maturation and protein secretion in this organelle. The protein exists as a homodimer, with each monomer being composed of two domains. The N-terminal domain featured in this family is organised into a thioredoxin-like fold that resembles the a domain of human protein disulphide isomerase (PDI). However, this domain lacks the C-X-X-C motif required for the redox function of PDI; it is therefore thought that ERp29's function is similar to the chaperone function of PDI. The N-terminal domain is exclusively responsible for the homodimerisation of the protein, without covalent linkages or additional contacts with other domains.
Changed!
pfam ERp29 97aa 3e-23
in ref transcript
Endoplasmic reticulum protein ERp29, C-terminal domain. ERp29 is a ubiquitously expressed endoplasmic reticulum protein found in mammals. ERp29 is comprised of two domains. This domain, the C-terminal domain, has an all helical fold. ERp29 is thought to form part of the thyroglobulin folding complex.
ESRRG
refseq_ESRRG.F1 refseq_ESRRG.R1 218 336
NCBIGene 36.3 2104
Single exon skipping, size difference: 118
Exclusion in 5'UTR
Reference transcript: NM_206594
cd NR_LBD_ERR 220aa 1e-111
in ref transcript
The ligand binding domain of estrogen receptor-related nuclear receptors. The ligand binding domain of estrogen receptor-related receptors (ERRs): The family of estrogen receptor-related receptors (ERRs), a subfamily of nuclear receptors, is closely related to the estrogen receptor (ER) family, but it lacks the ability to bind estrogen. ERRs can interfere with the classic ER-mediated estrogen signaling pathway, positively or negatively. ERRs share target genes, co-regulators and promoters with the estrogen receptor (ER) family. There are three subtypes of ERRs: alpha, beta and gamma. ERRs bind at least two types of DNA sequence, the estrogen response element and another site, originally characterized as SF-1 (steroidogenic factor 1) response element. Like other members of the nuclear receptor (NR) superfamily of ligand-activated transcription factors, ERR has a central well conserved DNA binding domain (DBD), a variable N-terminal domain, a flexible hinge and a C-terminal ligand binding domain (LBD).
pfam Hormone_recep 181aa 2e-42
in ref transcript
Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors.
pfam zf-C4 76aa 9e-37
in ref transcript
Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other.
EVI2A
refseq_EVI2A.F2 refseq_EVI2A.R2 144 285
NCBIGene 36.3 2123
Single exon skipping, size difference: 141
Exclusion of the protein initiation site
Reference transcript: NM_001003927
pfam EVI2A 227aa 1e-104
in ref transcript
Ectropic viral integration site 2A protein (EVI2A). This family contains several mammalian ectropic viral integration site 2A (EVI2A) proteins. The function of this protein is unknown although it is thought to be a membrane protein and may function as an oncogene in retrovirus induced myeloid tumours.
EWSR1
refseq_EWSR1.F1 refseq_EWSR1.R1 174 393
NCBIGene 36.3 2130
Multiple exon skipping, size difference: 219
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_005243
cd RRM 82aa 3e-13
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
smart RRM_2 81aa 1e-14
in ref transcript
RNA recognition motif.
pfam zf-RanBP 31aa 2e-07
in ref transcript
Zn-finger in Ran binding protein and others.
COG COG0724 93aa 2e-07
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
EXO1
refseq_EXO1.F1 refseq_EXO1.R1 290 395
NCBIGene 36.3 9156
Single exon skipping, size difference: 105
Exclusion in 5'UTR
Reference transcript: NM_130398
cd XPG 308aa 1e-73
in ref transcript
Xeroderma pigmentosum G N- and I-regions (XPGN, XPGI); contains the HhH2 motif; domain in nucleases. XPG is a eukaryotic enzyme that functions in nucleotide-excision repair and transcription-coupled repair of oxidative DNA damage. Functionally/structurally related to FEN-1; divalent metal ion-dependent exo- and endonuclease, and bacterial and bacteriophage 5'3' exonucleases.
smart XPGN 83aa 1e-24
in ref transcript
Xeroderma pigmentosum G N-region. domain in nucleases.
pfam XPG_I 92aa 1e-23
in ref transcript
XPG I-region.
TIGR fen_arch 308aa 7e-17
in ref transcript
Endonuclease that cleaves the 5'-overhanging flap structure that is generated by displacement synthesis when DNA polymerase encounters the 5'-end of a downstream Okazaki fragment. Has 5'-endo-/exonuclease and 5'-pseudo-Y-endonuclease activities. Cleaves the junction between single and double-stranded regions of flap DNA.
PTZ PTZ00217 290aa 2e-27
in ref transcript
flap endonuclease-1; Provisional.
EXOC1
refseq_EXOC1.F2 refseq_EXOC1.R2 161 317
NCBIGene 36.3 55763
Alternative 5-prime, size difference: 156
Exclusion in 5'UTR
Reference transcript: NM_001024924
pfam Sec3 703aa 0.0
in ref transcript
Exocyst complex component Sec3. This entry is the conserved middle and C-terminus of the Sec3 protein. Sec3 binds to the C-terminal cytoplasmic domain of GLYT1 (glycine transporter protein 1). Sec3 is the exocyst component that is closest to the plasma membrane docking site and it serves as a spatial landmark in the plasma membrane for incoming secretory vesicles. Sec3 is recruited to the sites of polarised membrane growth through its interaction with Rho1p, a small GTP-binding protein.
EXOC7
refseq_EXOC7.F2 refseq_EXOC7.R2 198 291
NCBIGene 36.3 23265
Single exon skipping, size difference: 93
Exclusion in the protein (no frameshift)
Reference transcript: NM_001013839
pfam Exo70 367aa 2e-74
in ref transcript
Exo70 exocyst complex subunit. The Exo70 protein forms one subunit of the exocyst complex. First discovered in Saccharomyces cerevisiae, Exo70 and other exocyst proteins have been observed in several other eukaryotes, including humans. In Saccharomyces cerevisiae, the exocyst complex is involved in the late stages of exocytosis, and is localised at the tip of the bud, the major site of exocytosis in yeast. Exo70 interacts with the Rho3 GTPase. This interaction mediates one of the three known functions of Rho3 in cell polarity: vesicle docking and fusion with the plasma membrane (the other two functions are regulation of actin polarity and transport of exocytic vesicles from the mother cell to the bud). In humans, the functions of Exo70 and the exocyst complex are less well characterised: Exo70 is expressed in several tissues and is thought to also be involved in exocytosis.
EXOSC10
refseq_EXOSC10.F1 refseq_EXOSC10.R1 159 234
NCBIGene 36.3 5394
Single exon skipping, size difference: 75
Exclusion in the protein (no frameshift)
Reference transcript: NM_001001998
cd Rrp6p_like_exo 191aa 1e-108
in ref transcript
Yeast Rrp6p and its human homolog, the Polymyositis/scleroderma autoantigen 100kDa (PM/Scl-100), are exosome-associated proteins involved in the degradation and processing of precursors to stable RNAs. Both proteins contain a DEDDy-type DnaQ-like 3'-5' exonuclease domain possessing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. The motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. PM/Scl-100, an autoantigen present in the nucleolar compartment of the cell, reacts with autoantibodies produced by about 50% of patients with polymyositis-scleroderma overlap syndrome.
pfam 3_5_exonuc 169aa 8e-54
in ref transcript
3'-5' exonuclease. This domain is responsible for the 3'-5' exonuclease proofreading activity of Escherichia coli DNA polymerase I (polI) and other enzymes, it catalyses the hydrolysis of unpaired or mismatched nucleotides. This domain consists of the amino-terminal half of the Klenow fragment in Escherichia coli polI it is also found in the Werner syndrome helicase (WRN), focus forming activity 1 protein (FFA-1) and ribonuclease D (RNase D). Werner syndrome is a human genetic disorder causing premature aging; the WRN protein has helicase activity in the 3'-5' direction. The FFA-1 protein is required for formation of a replication foci and also has helicase activity; it is a homologue of the WRN protein. RNase D is a 3'-5' exonuclease involved in tRNA processing. Also found in this family is the autoantigen PM/Scl thought to be involved in polymyositis-scleroderma overlap syndrome.
pfam PMC2NT 92aa 1e-26
in ref transcript
PMC2NT (NUC016) domain. This domain is found at the N-terminus of 3'-5' exonucleases with HRDC domains, and also in putative exosome components.
smart HRDC 81aa 5e-18
in ref transcript
Helicase and RNase D C-terminal. Hypothetical role in nucleic acid binding. Mutations in the HRDC domain cause human disease.
COG Rnd 290aa 5e-49
in ref transcript
Ribonuclease D [Translation, ribosomal structure and biogenesis].
EXOSC3
refseq_EXOSC3.F1 refseq_EXOSC3.R1 226 378
NCBIGene 36.3 51010
Single exon skipping, size difference: 152
Exclusion in the protein causing a frameshift
Reference transcript: NM_016042
Changed!
cd S1_Rrp40 86aa 3e-38
in ref transcript
S1_Rrp40: Rrp40 S1-like RNA-binding domain. S1-like RNA-binding domains are found in a wide variety of RNA-associated proteins. Rrp4 protein is a subunit of the exosome complex. The exosome plays a central role in 3' to 5' RNA processing and degradation in eukarytes and archaea. Its functions include the removal of incorrectly processed RNA and the maintenance of proper levels of mRNA, rRNA and a number of small RNA species. In Saccharomyces cerevisiae, the exosome includes nine core components, six of which are homologous to bacterial RNase PH. These form a hexameric ring structure. The other three subunits (RrP4, Rrp40, and Csl4) contain an S1 RNA binding domain and are part of the "S1 pore structure".
Changed!
COG RRP4 207aa 2e-24
in ref transcript
RNA-binding protein Rrp4 and related proteins (contain S1 domain and KH domain) [Translation, ribosomal structure and biogenesis].
Changed!
cd S1_Rrp40 52aa 1e-20
in modified transcript
Changed!
COG RRP4 91aa 1e-07
in modified transcript
EXOSC9
refseq_EXOSC9.F1 refseq_EXOSC9.R1 208 259
NCBIGene 36.3 5393
Single exon skipping, size difference: 51
Exclusion in the protein (no frameshift)
Reference transcript: NM_001034194
pfam RNase_PH 133aa 9e-10
in ref transcript
3' exoribonuclease family, domain 1. This family includes 3'-5' exoribonucleases. Ribonuclease PH contains a single copy of this domain, and removes nucleotide residues following the -CCA terminus of tRNA. Polyribonucleotide nucleotidyltransferase (PNPase) contains two tandem copies of the domain. PNPase is involved in mRNA degradation in a 3'-5' direction. The exosome is a 3'-5' exoribonuclease complex that is required for 3' processing of the 5.8S rRNA. Three of its five protein components contain a copy of this domain. A hypothetical protein from S. pombe appears to belong to an uncharacterised subfamily. This subfamily is found in both eukaryotes and archaebacteria.
pfam RNase_PH_C 66aa 0.002
in ref transcript
3' exoribonuclease family, domain 2. This family includes 3'-5' exoribonucleases. Ribonuclease PH contains a single copy of this domain, and removes nucleotide residues following the -CCA terminus of tRNA. Polyribonucleotide nucleotidyltransferase (PNPase) contains two tandem copies of the domain. PNPase is involved in mRNA degradation in a 3'-5' direction. The exosome is a 3'-5' exoribonuclease complex that is required for 3' processing of the 5.8S rRNA. Three of its five protein components contain a copy of this domain. A hypothetical protein from S. pombe appears to belong to an uncharacterised subfamily. This subfamily is found in both eukaryotes and archaebacteria.
COG COG2123 271aa 1e-55
in ref transcript
RNase PH-related exoribonuclease [Translation, ribosomal structure and biogenesis].
EXT2
refseq_EXT2.F1 refseq_EXT2.R1 143 296
NCBIGene 36.3 2132
Multiple exon skipping, size difference: 153
Exclusion in 5'UTR, Exclusion of the protein initiation site
Reference transcript: NM_000401
pfam Glyco_transf_64 246aa 1e-101
in ref transcript
Glycosyl transferase family 64 domain. Members of this family catalyse the transfer reaction of N-acetylglucosamine and N-acetylgalactosamine from the respective UDP-sugars to the non-reducing end of [glucuronic acid]beta 1-3[galactose]beta 1-O-naphthalenemethanol, an acceptor substrate analog of the natural common linker of various glycosylaminoglycans. They are also required for the biosynthesis of heparan-sulphate.
pfam Exostosin 281aa 5e-54
in ref transcript
Exostosin family. The EXT family is a family of tumour suppressor genes. Mutations of EXT1 on 8q24.1, EXT2 on 11p11-13, and EXT3 on 19p have been associated with the autosomal dominant disorder known as hereditary multiple exostoses (HME). This is the most common known skeletal dysplasia. The chromosomal locations of other EXT genes suggest association with other forms of neoplasia. EXT1 and EXT2 have both been shown to encode a heparan sulphate polymerase with both D-glucuronyl (GlcA) and N-acetyl-D-glucosaminoglycan (GlcNAC) transferase activities. The nature of the defect in heparan sulphate biosynthesis in HME is unclear.
EXTL2
refseq_EXTL2.F2 refseq_EXTL2.R2 183 219
NCBIGene 36.3 2135
Alternative 3-prime, size difference: 36
Inclusion in 5'UTR
Reference transcript: NM_001033025
pfam Glyco_transf_64 256aa 3e-86
in ref transcript
Glycosyl transferase family 64 domain. Members of this family catalyse the transfer reaction of N-acetylglucosamine and N-acetylgalactosamine from the respective UDP-sugars to the non-reducing end of [glucuronic acid]beta 1-3[galactose]beta 1-O-naphthalenemethanol, an acceptor substrate analog of the natural common linker of various glycosylaminoglycans. They are also required for the biosynthesis of heparan-sulphate.
EYA1
refseq_EYA1.F2 refseq_EYA1.R2 187 277
NCBIGene 36.3 2138
Single exon skipping, size difference: 90
Exclusion in the protein (no frameshift)
Reference transcript: NM_000503
Changed!
TIGR EYA-cons_domain 271aa 1e-126
in ref transcript
This domain is common to all eyes absent (EYA) homologs. Metazoan EYA's also contain a variable N-terminal domain consisting largely of low-complexity sequences.
Changed!
TIGR EYA-cons_domain 241aa 1e-107
in modified transcript
EYA1
refseq_EYA1.F4 refseq_EYA1.R4 171 299
NCBIGene 36.3 2138
Single exon skipping, size difference: 128
Exclusion of the protein initiation site
Reference transcript: NM_172058
TIGR EYA-cons_domain 271aa 1e-126
in ref transcript
This domain is common to all eyes absent (EYA) homologs. Metazoan EYA's also contain a variable N-terminal domain consisting largely of low-complexity sequences.
EYA4
refseq_EYA4.F1 refseq_EYA4.R1 113 182
NCBIGene 36.3 2070
Single exon skipping, size difference: 69
Exclusion in the protein (no frameshift)
Reference transcript: NM_172105
TIGR EYA-cons_domain 272aa 1e-129
in ref transcript
This domain is common to all eyes absent (EYA) homologs. Metazoan EYA's also contain a variable N-terminal domain consisting largely of low-complexity sequences.
EZH2
refseq_EZH2.F1 refseq_EZH2.R1 156 273
NCBIGene 36.3 2146
Single exon skipping, size difference: 117
Exclusion in the protein (no frameshift)
Reference transcript: NM_004456
pfam SET 124aa 3e-36
in ref transcript
SET domain. SET domains are protein lysine methyltransferase enzymes. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains have been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as SET-N and SET-C. SET-C forms an unusual and conserved knot-like structure of probably functional importance. Additionally to SET-N and SET-C, an insert region (SET-I) and flanking regions of high structural variability form part of the overall structure.
Changed!
COG COG2940 208aa 4e-15
in ref transcript
Proteins containing SET domain [General function prediction only].
Changed!
COG COG2940 180aa 7e-15
in modified transcript
F11
refseq_F11.F1 refseq_F11.R1 141 301
NCBIGene 36.2 2160
Single exon skipping, size difference: 160
Exclusion in the protein causing a frameshift
Reference transcript: NM_000128
Changed!
cd Tryp_SPc 231aa 4e-76
in ref transcript
Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.
Changed!
cd APPLE_Factor_XI_like 84aa 3e-09
in ref transcript
Subfamily of PAN/APPLE-like domains; present in plasma prekallikrein/coagulation factor XI, microneme antigen proteins, and a few prokaryotic proteins. PAN/APPLE domains fulfill diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
cd APPLE_Factor_XI_like 85aa 4e-09
in ref transcript
Changed!
cd APPLE_Factor_XI_like 85aa 9e-09
in ref transcript
Changed!
cd APPLE_Factor_XI_like 85aa 2e-08
in ref transcript
Changed!
smart Tryp_SPc 232aa 3e-81
in ref transcript
Trypsin-like serine protease. Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.
Changed!
smart APPLE 84aa 7e-25
in ref transcript
APPLE domain. Four-fold repeat in plasma kallikrein and coagulation factor XI. Factor XI apple 3 mediates binding to platelets. Factor XI apple 1 binds high-molecular-mass kininogen. Apple 4 in factor XI mediates dimer formation and binds to factor XIIa. Mutations in apple 4 cause factor XI deficiency, an inherited bleeding disorder.
smart APPLE 84aa 1e-23
in ref transcript
Changed!
smart APPLE 84aa 6e-23
in ref transcript
Changed!
smart APPLE 84aa 4e-22
in ref transcript
Changed!
COG COG5640 239aa 4e-29
in ref transcript
Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones].
F11R
refseq_F11R.F1 refseq_F11R.R1 102 162
NCBIGene 36.2 50848
Alternative 5-prime and 3-prime, size difference: 60
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_144503
cd IGcam 83aa 1e-09
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
Changed!
cd IGcam 91aa 2e-05
in ref transcript
Changed!
smart IG_like 90aa 2e-11
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
smart IG_like 79aa 3e-07
in ref transcript
Changed!
cd IGcam 84aa 4e-05
in modified transcript
Changed!
smart IG_like 84aa 2e-11
in modified transcript
F11R
refseq_F11R.F1 refseq_F11R.R4 159 279
NCBIGene 36.2 50848
Alternative 5-prime and 3-prime, size difference: 60
Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_144503
cd IGcam 83aa 1e-09
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
Changed!
cd IGcam 91aa 2e-05
in ref transcript
Changed!
smart IG_like 90aa 2e-11
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
smart IG_like 79aa 3e-07
in ref transcript
Changed!
cd IGcam 71aa 4e-04
in modified transcript
Changed!
smart IG_like 70aa 3e-07
in modified transcript
F11R
refseq_F11R.F4 refseq_F11R.R5 143 177
NCBIGene 36.2 50848
Alternative 5-prime and 3-prime, size difference: 34
Inclusion in 5'UTR, Exclusion in 5'UTR
Reference transcript: NM_144503
cd IGcam 83aa 1e-09
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IGcam 91aa 2e-05
in ref transcript
smart IG_like 90aa 2e-11
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
smart IG_like 79aa 3e-07
in ref transcript
F11R
refseq_F11R.F6 refseq_F11R.R7 115 193
NCBIGene 36.2 50848
Single exon skipping, size difference: 78
Exclusion in 5'UTR
Reference transcript: NM_144503
cd IGcam 83aa 1e-09
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IGcam 91aa 2e-05
in ref transcript
smart IG_like 90aa 2e-11
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
smart IG_like 79aa 3e-07
in ref transcript
BPTF
refseq_FALZ.F1 refseq_FALZ.R1 100 478
NCBIGene 36.3 2186
Multiple exon skipping, size difference: 378
Inclusion in the protein (no stop codon or frameshift), Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_182641
cd Bromo_gcn5_like 101aa 1e-46
in ref transcript
Bromodomain; Gcn5_like subfamily. Gcn5p is a histone acetyltransferase (HAT) which mediates acetylation of histones at lysine residues; such acetylation is generally correlated with the activation of transcription. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
cd BAH_plant_2 27aa 8e-04
in ref transcript
BAH, or Bromo Adjacent Homology domain, plant-specific sub-family with unknown function. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
smart BROMO 102aa 1e-27
in ref transcript
bromo domain.
pfam DDT 61aa 1e-20
in ref transcript
DDT domain. This domain is approximately 60 residues in length, and is predicted to be a DNA binding domain. The DDT domain is named after (DNA binding homeobox and Different Transcription factors). It is exclusively associated with nuclear domains, and is thought to be arranged into three alpha helices.
pfam PHD 45aa 6e-10
in ref transcript
PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
smart PHD 47aa 5e-08
in ref transcript
PHD zinc finger. The plant homeodomain (PHD) finger is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in epigenetics and chromatin-mediated transcriptional regulation. The PHD finger binds two zinc ions using the so-called 'cross-brace' motif and is thus structurally related to the RI NG finger and the FYV E finger. It is not yet known if PHD fingers have a common molecular function. Several reports suggest that it can function as a protein-protein interacton domain and it was recently demonstrated that the PHD finger of p300 can cooperate with the adjacent BR OMO domain in nucleosome binding in vitro. Other reports suggesting that the PHD finger is a ubiquitin ligase have been refuted as these domains were RI NG fingers misidentified as PHD fingers.
COG COG5076 105aa 8e-14
in ref transcript
Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription].
FAM111A
refseq_FAM111A.F1 refseq_FAM111A.R1 122 161
NCBIGene 36.3 63901
Alternative 3-prime, size difference: 39
Inclusion in 5'UTR
Reference transcript: NM_198847
FAM119B
refseq_FAM119B.F2 refseq_FAM119B.R2 155 294
NCBIGene 36.3 25895
Single exon skipping, size difference: 139
Inclusion in the protein causing a frameshift
Reference transcript: NM_015433
Changed!
pfam Methyltransf_16 159aa 5e-35
in ref transcript
Putative methyltransferase.
Changed!
COG COG3897 173aa 8e-08
in ref transcript
Predicted methyltransferase [General function prediction only].
Changed!
pfam Methyltransf_16 75aa 8e-12
in modified transcript
Changed!
COG COG3897 93aa 2e-05
in modified transcript
FAM13C1
refseq_FAM13C1.F1 refseq_FAM13C1.R1 106 400
NCBIGene 36.3 220965
Multiple exon skipping, size difference: 294
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_198215
FAM14B
refseq_FAM14B.F1 refseq_FAM14B.R1 174 220
NCBIGene 36.3 122509
Alternative 3-prime, size difference: 46
Inclusion in 5'UTR
Reference transcript: NM_206949
FAM19A3
refseq_FAM19A3.F2 refseq_FAM19A3.R2 148 216
NCBIGene 36.3 284467
Alternative 3-prime, size difference: 68
Exclusion in the protein causing a frameshift
Reference transcript: NM_001004440
FAM19A4
refseq_FAM19A4.F1 refseq_FAM19A4.R1 266 348
NCBIGene 36.3 151647
Alternative 5-prime, size difference: 82
Exclusion in 5'UTR
Reference transcript: NM_182522
FAM3B
refseq_FAM3B.F1 refseq_FAM3B.R1 208 352
NCBIGene 36.3 54097
Single exon skipping, size difference: 144
Exclusion in the protein (no frameshift)
Reference transcript: NM_058186
FAM3C
refseq_FAM3C.F1 refseq_FAM3C.R1 174 214
NCBIGene 36.3 10447
Alternative 5-prime and 3-prime, size difference: 40
Inclusion in 5'UTR, Exclusion in 5'UTR
Reference transcript: NM_014888
FAM48A
refseq_FAM48A.F2 refseq_FAM48A.R2 111 215
NCBIGene 36.3 55578
Single exon skipping, size difference: 104
Exclusion in the protein causing a frameshift
Reference transcript: NM_001014286
FAM48A
refseq_FAM48A.F4 refseq_FAM48A.R4 126 205
NCBIGene 36.3 55578
Alternative 5-prime, size difference: 79
Inclusion in 5'UTR
Reference transcript: NM_001014286
FAM86A
refseq_FAM86A.F1 refseq_FAM86A.R1 109 211
NCBIGene 36.3 196483
Single exon skipping, size difference: 102
Exclusion in the protein (no frameshift)
Reference transcript: NM_201400
pfam Methyltransf_16 152aa 1e-12
in ref transcript
Putative methyltransferase.
COG COG3897 130aa 6e-04
in ref transcript
Predicted methyltransferase [General function prediction only].
Changed!
PRK PRK03910 88aa 0.003
in ref transcript
D-cysteine desulfhydrase; Validated.
FAM96A
refseq_FAM96A.F2 refseq_FAM96A.R2 137 187
NCBIGene 36.3 84191
Single exon skipping, size difference: 50
Exclusion in the protein causing a frameshift
Reference transcript: NM_032231
Changed!
pfam DUF59 78aa 0.001
in ref transcript
Domain of unknown function DUF59. This family includes prokaryotic proteins of unknown function. The family also includes PhaH from Pseudomonas putida. PhaH forms a complex with PhaF, PhaG and PhaI, which hydroxylates phenylacetic acid to 2-hydroxyphenylacetic acid. So members of this family may all be components of ring hydroxylating complexes.
Changed!
COG COG5133 118aa 1e-24
in ref transcript
Uncharacterized conserved protein [Function unknown].
Changed!
pfam DUF59 59aa 0.005
in modified transcript
Changed!
COG COG5133 58aa 8e-07
in modified transcript
FANCB
refseq_FANCB.F2 refseq_FANCB.R2 142 263
NCBIGene 36.3 2187
Single exon skipping, size difference: 121
Exclusion in 5'UTR
Reference transcript: NM_001018113
FBF1
refseq_FBF1.F1 refseq_FBF1.R1 163 205
NCBIGene 36.2 85302
Single exon skipping, size difference: 42
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: XM_932831
TIGR SMC_prok_B 283aa 8e-09
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
COG Smc 294aa 1e-07
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
FBLIM1
refseq_FBLIM1.F1 refseq_FBLIM1.R1 121 412
NCBIGene 36.3 54751
Multiple exon skipping, size difference: 291
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_001024215
pfam LIM 56aa 2e-10
in ref transcript
LIM domain. This family represents two copies of the LIM structural domain.
pfam LIM 56aa 5e-05
in ref transcript
FBLN2
refseq_FBLN2.F2 refseq_FBLN2.R2 124 265
NCBIGene 36.3 2199
Single exon skipping, size difference: 141
Exclusion in the protein (no frameshift)
Reference transcript: NM_001004019
cd ANATO 61aa 2e-05
in ref transcript
Anaphylatoxin homologous domain; C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to repeats in fibulins.
cd ANATO 40aa 0.003
in ref transcript
cd vWA_Matrilin 44aa 0.005
in ref transcript
VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
cd EGF_CA 34aa 0.005
in ref transcript
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
pfam EGF_CA 49aa 2e-05
in ref transcript
Calcium binding EGF domain.
smart EGF_CA 43aa 8e-05
in ref transcript
Calcium-binding EGF-like domain.
pfam EGF_CA 45aa 2e-04
in ref transcript
smart EGF_CA 48aa 3e-04
in ref transcript
smart EGF_CA 42aa 0.002
in ref transcript
pfam ANATO 36aa 0.004
in ref transcript
Anaphylotoxin-like domain. C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-fold repeat in fibulins.
Changed!
pfam EGF_CA 43aa 0.005
in ref transcript
Changed!
pfam EGF_CA 43aa 0.005
in modified transcript
FBXL10
refseq_FBXL10.F1 refseq_FBXL10.R1 193 307
NCBIGene 36.3 84678
Single exon skipping, size difference: 114
Exclusion in the protein (no frameshift)
Reference transcript: NM_032590
pfam JmjC 107aa 6e-09
in ref transcript
JmjC domain. The JmjC domain belongs to the Cupin superfamily. JmjC-domain proteins may be protein hydroxylases that catalyse a novel histone modification.
pfam zf-CXXC 31aa 1e-08
in ref transcript
CXXC zinc finger domain. This domain contains eight conserved cysteine residues that bind to two zinc ions. The CXXC domain is found in a variety of chromatin-associated proteins. This domain binds to nonmethyl-CpG dinucleotides. The domain is characterised by two CGXCXXC repeats. The RecQ helicase has a single repeat that also binds to zinc, but this has not been included in this family. The DNA binding interface has been identified by NMR.
smart JmjC 76aa 5e-05
in ref transcript
A domain family that is part of the cupin metalloenzyme superfamily. Probable enzymes, but of unknown functions, that regulate chromatin reorganisation processes (Clissold and Ponting, in press).
pfam F-box 43aa 7e-04
in ref transcript
F-box domain. This domain is approximately 50 amino acids long, and is usually found in the N-terminal half of a variety of proteins. Two motifs that are commonly found associated with the F-box domain are the leucine rich repeats (LRRs; pfam00560 and pfam07723) and the WD repeat (pfam00400). The F-box domain has a role in mediating protein-protein interactions in a variety of contexts, such as polyubiquitination, transcription elongation, centromere binding and translational repression.
FBXL5
refseq_FBXL5.F1 refseq_FBXL5.R1 267 371
NCBIGene 36.3 26234
Single exon skipping, size difference: 104
Inclusion in the protein causing a new stop codon
Reference transcript: NM_012161
Changed!
pfam F-box 45aa 5e-06
in ref transcript
F-box domain. This domain is approximately 50 amino acids long, and is usually found in the N-terminal half of a variety of proteins. Two motifs that are commonly found associated with the F-box domain are the leucine rich repeats (LRRs; pfam00560 and pfam07723) and the WD repeat (pfam00400). The F-box domain has a role in mediating protein-protein interactions in a variety of contexts, such as polyubiquitination, transcription elongation, centromere binding and translational repression.
Changed!
smart LRR_CC 25aa 0.005
in ref transcript
Leucine-rich repeat - CC (cysteine-containing) subfamily.
FBXL6
refseq_FBXL6.F1 refseq_FBXL6.R1 100 118
NCBIGene 36.3 26233
Alternative 5-prime, size difference: 18
Exclusion in the protein (no frameshift)
Reference transcript: NM_012162
FBXO21
refseq_FBXO21.F1 refseq_FBXO21.R1 119 140
NCBIGene 36.3 23014
Alternative 3-prime, size difference: 21
Exclusion in the protein (no frameshift)
Reference transcript: NM_033624
TIGR yccV 98aa 1e-36
in ref transcript
This model describes the small protein from E. coli YccV and its homologs in other Proteobacteria. YccV is now described as a hemimethylated DNA binding protein. The model also describes a domain in longer eukaryotic proteins.
COG COG2912 189aa 9e-18
in ref transcript
Uncharacterized conserved protein [Function unknown].
COG COG3785 96aa 2e-06
in ref transcript
Uncharacterized conserved protein [Function unknown].
FBXO24
refseq_FBXO24.F1 refseq_FBXO24.R1 127 169
NCBIGene 36.3 26261
Alternative 5-prime, size difference: 42
Exclusion in the protein (no frameshift)
Reference transcript: NM_033506
pfam F-box 47aa 9e-05
in ref transcript
F-box domain. This domain is approximately 50 amino acids long, and is usually found in the N-terminal half of a variety of proteins. Two motifs that are commonly found associated with the F-box domain are the leucine rich repeats (LRRs; pfam00560 and pfam07723) and the WD repeat (pfam00400). The F-box domain has a role in mediating protein-protein interactions in a variety of contexts, such as polyubiquitination, transcription elongation, centromere binding and translational repression.
COG ATS1 167aa 3e-04
in ref transcript
Alpha-tubulin suppressor and related RCC1 domain-containing proteins [Cell division and chromosome partitioning / Cytoskeleton].
FBXO24
refseq_FBXO24.F3 refseq_FBXO24.R3 225 347
NCBIGene 36.3 26261
Single exon skipping, size difference: 122
Exclusion in the protein causing a frameshift
Reference transcript: NM_033506
pfam F-box 47aa 9e-05
in ref transcript
F-box domain. This domain is approximately 50 amino acids long, and is usually found in the N-terminal half of a variety of proteins. Two motifs that are commonly found associated with the F-box domain are the leucine rich repeats (LRRs; pfam00560 and pfam07723) and the WD repeat (pfam00400). The F-box domain has a role in mediating protein-protein interactions in a variety of contexts, such as polyubiquitination, transcription elongation, centromere binding and translational repression.
Changed!
COG ATS1 167aa 3e-04
in ref transcript
Alpha-tubulin suppressor and related RCC1 domain-containing proteins [Cell division and chromosome partitioning / Cytoskeleton].
FBXO25
refseq_FBXO25.F1 refseq_FBXO25.R1 142 169
NCBIGene 36.3 26260
Single exon skipping, size difference: 27
Exclusion in the protein (no frameshift)
Reference transcript: NM_183421
FBXO25
refseq_FBXO25.F4 refseq_FBXO25.R4 270 320
NCBIGene 36.3 26260
Single exon skipping, size difference: 50
Exclusion in the protein causing a frameshift
Reference transcript: NM_183421
FBXO38
refseq_FBXO38.F2 refseq_FBXO38.R2 146 371
NCBIGene 36.3 81545
Alternative 5-prime, size difference: 225
Exclusion in the protein (no frameshift)
Reference transcript: NM_205836
FBXO44
refseq_FBXO44.F1 refseq_FBXO44.R1 101 473
NCBIGene 36.3 93611
Single exon skipping, size difference: 372
Exclusion in 5'UTR
Reference transcript: NM_001014765
pfam FBA 185aa 2e-69
in ref transcript
F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. The human F-box only protein 2 is involved in binding to N-glycosylated proteins.
FBXO44
refseq_FBXO44.F3 refseq_FBXO44.R3 116 241
NCBIGene 36.3 93611
Exon skipping and alternative 3-prime or 5-prime, size difference: 125
Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
Reference transcript: NM_001014765
Changed!
pfam FBA 185aa 2e-69
in ref transcript
F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. The human F-box only protein 2 is involved in binding to N-glycosylated proteins.
Changed!
pfam FBA 56aa 2e-13
in modified transcript
FBXW8
refseq_FBXW8.F1 refseq_FBXW8.R1 158 356
NCBIGene 36.3 26259
Alternative 5-prime, size difference: 198
Exclusion in the protein (no frameshift)
Reference transcript: NM_153348
cd WD40 288aa 6e-14
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
pfam F-box 43aa 3e-06
in ref transcript
F-box domain. This domain is approximately 50 amino acids long, and is usually found in the N-terminal half of a variety of proteins. Two motifs that are commonly found associated with the F-box domain are the leucine rich repeats (LRRs; pfam00560 and pfam07723) and the WD repeat (pfam00400). The F-box domain has a role in mediating protein-protein interactions in a variety of contexts, such as polyubiquitination, transcription elongation, centromere binding and translational repression.
COG COG2319 302aa 6e-12
in ref transcript
FOG: WD40 repeat [General function prediction only].
FCAR
refseq_FCAR.F4 refseq_FCAR.R4 166 202
NCBIGene 36.3 2204
Single exon skipping, size difference: 36
Exclusion in the protein (no frameshift)
Reference transcript: NM_002000
FCGR2B
refseq_FCGR2B.F1 refseq_FCGR2B.R1 152 209
NCBIGene 36.3 2213
Single exon skipping, size difference: 57
Exclusion in the protein (no frameshift)
Reference transcript: NM_004001
cd IGcam 54aa 0.002
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
smart IG_like 78aa 4e-06
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
smart IGc2 50aa 5e-05
in ref transcript
Immunoglobulin C-2 Type.
FCGR2C
refseq_FCGR2C.F2 refseq_FCGR2C.R2 161 246
NCBIGene 36.2 9103
Single exon skipping, size difference: 85
Inclusion in 5'UTR
Reference transcript: NM_201563
FCN2
refseq_FCN2.F1 refseq_FCN2.R1 206 320
NCBIGene 36.3 2220
Single exon skipping, size difference: 114
Exclusion in the protein (no frameshift)
Reference transcript: NM_004108
cd FReD 211aa 7e-82
in ref transcript
Fibrinogen-related domains (FReDs); C terminal globular domain of fibrinogen. Fibrinogen is involved in blood clotting, being activated by thrombin to assemble into fibrin clots. The N-termini of 2 times 3 chains come together to form a globular arrangement called the disulfide knot. The C termini of fibrinogen chains end in globular domains, which are not completely equivalent. C terminal globular domains of the gamma chains (C-gamma) dimerize and bind to the GPR motif of the N-terminal domain of the alpha chain, while the GHR motif of N-terminal domain of the beta chain binds to the C terminal globular domains of another beta chain (C-beta), which leads to lattice formation.
smart FBG 212aa 3e-86
in ref transcript
Fibrinogen-related domains (FReDs). Domain present at the C-termini of fibrinogen beta and gamma chains, and a variety of fibrinogen-related proteins, including tenascin and Drosophila scabrous.
FCN3
refseq_FCN3.F1 refseq_FCN3.R1 146 179
NCBIGene 36.3 8547
Single exon skipping, size difference: 33
Exclusion in the protein (no frameshift)
Reference transcript: NM_003665
Changed!
cd FReD 210aa 7e-77
in ref transcript
Fibrinogen-related domains (FReDs); C terminal globular domain of fibrinogen. Fibrinogen is involved in blood clotting, being activated by thrombin to assemble into fibrin clots. The N-termini of 2 times 3 chains come together to form a globular arrangement called the disulfide knot. The C termini of fibrinogen chains end in globular domains, which are not completely equivalent. C terminal globular domains of the gamma chains (C-gamma) dimerize and bind to the GPR motif of the N-terminal domain of the alpha chain, while the GHR motif of N-terminal domain of the beta chain binds to the C terminal globular domains of another beta chain (C-beta), which leads to lattice formation.
Changed!
smart FBG 210aa 3e-74
in ref transcript
Fibrinogen-related domains (FReDs). Domain present at the C-termini of fibrinogen beta and gamma chains, and a variety of fibrinogen-related proteins, including tenascin and Drosophila scabrous.
Changed!
cd FReD 209aa 1e-75
in modified transcript
Changed!
smart FBG 209aa 4e-73
in modified transcript
FDXR
refseq_FDXR.F2 refseq_FDXR.R2 102 120
NCBIGene 36.3 2232
Alternative 3-prime, size difference: 18
Exclusion in the protein (no frameshift)
Reference transcript: NM_004110
TIGR gltA 158aa 3e-15
in ref transcript
This protein is homologous to the small subunit of NADPH and NADH forms of glutamate synthase as found in eukaryotes and some bacteria. This protein is found in numerous species having no homolog of the glutamate synthase large subunit. The prototype of the family, from Pyrococcus sp. KOD1, was shown to be active as a homotetramer and to require NADPH.
Changed!
PTZ PTZ00188 236aa 1e-44
in ref transcript
adrenodoxin reductase; Provisional.
Changed!
COG GltD 451aa 3e-36
in ref transcript
NADPH-dependent glutamate synthase beta chain and related oxidoreductases [Amino acid transport and metabolism / General function prediction only].
Changed!
TIGR Se_ygfK 380aa 2e-09
in modified transcript
Members of this protein family are YgfK, predicted to be one subunit of a three-subunit, molybdopterin-containing selenate reductase. This enzyme is found, typically, in genomic regions associated with xanthine dehydrogenase homologs predicted to belong to the selenium-dependent molybdenum hydroxylases (SDMH). Therefore, the selenate reductase is suggested to play a role in furnishing selenide for SelD, the selenophosphate synthase.
Changed!
PTZ PTZ00188 230aa 2e-46
in modified transcript
Changed!
COG GltD 445aa 4e-38
in modified transcript
FECH
refseq_FECH.F1 refseq_FECH.R1 100 118
NCBIGene 36.3 2235
Alternative 5-prime, size difference: 18
Exclusion in the protein (no frameshift)
Reference transcript: NM_001012515
cd Ferrochelatase_C 138aa 7e-44
in ref transcript
Ferrochelatase, C-terminal domain: Ferrochelatase (protoheme ferrolyase or HemH) is the terminal enzyme of the heme biosynthetic pathway. It catalyzes the insertion of ferrous iron into the protoporphyrin IX ring yielding protoheme. This enzyme is ubiquitous in nature and widely distributed in bacteria and eukaryotes. Recently, some archaeal members have been identified. The oligomeric state of these enzymes varies depending on the presence of a dimerization motif at the C-terminus.
cd Ferrochelatase_N 162aa 3e-41
in ref transcript
Ferrochelatase, N-terminal domain: Ferrochelatase (protoheme ferrolyase or HemH) is the terminal enzyme of the heme biosynthetic pathway. It catalyzes the insertion of ferrous iron into the protoporphyrin IX ring yielding protoheme. This enzyme is ubiquitous in nature and widely distributed in bacteria and eukaryotes. Recently, some archaeal members have been identified. The oligomeric state of these enzymes varies depending on the presence of a dimerization motif at the C-terminus.
pfam Ferrochelatase 321aa 1e-122
in ref transcript
Ferrochelatase.
Changed!
PRK hemH 325aa 1e-89
in ref transcript
ferrochelatase; Reviewed.
Changed!
PRK hemH 322aa 9e-89
in modified transcript
FER1L3
refseq_FER1L3.F1 refseq_FER1L3.R1 216 255
NCBIGene 36.3 26509
Single exon skipping, size difference: 39
Exclusion in the protein (no frameshift)
Reference transcript: NM_013451
cd C2 103aa 3e-15
in ref transcript
Protein kinase C conserved region 2 (CalB); Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands.
cd C2_1 114aa 5e-11
in ref transcript
Protein kinase C conserved region 2, subgroup 1; C2 Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (amongst others); some PKCs lack calcium dependence. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Two distinct C2 topologies generated by permutation of the sequence with respect to the N- and C-terminal beta strands are seen. In this subgroup, containing synaptotagmins, specific protein kinases C (PKC) subtypes and other proteins, the N-terminal beta strand occupies the position of what is the C-terminal strand in subgroup 2.
Changed!
cd C2 114aa 1e-10
in ref transcript
cd C2 95aa 2e-07
in ref transcript
cd C2 103aa 3e-06
in ref transcript
cd C2 117aa 1e-04
in ref transcript
pfam FerB 76aa 9e-38
in ref transcript
FerB (NUC096) domain. This is central domain B in proteins of the Ferlin family.
pfam FerI 72aa 5e-32
in ref transcript
FerI (NUC094) domain. This domain is present in proteins of the Ferlin family. It is often located between two C2 domains.
pfam FerA 66aa 1e-20
in ref transcript
FerA (NUC095) domain. This is central domain A in proteins of the Ferlin family.
pfam C2 84aa 6e-15
in ref transcript
C2 domain.
smart DysFN 57aa 3e-13
in ref transcript
Dysferlin domain, N-terminal region. Domain of unknown function present in yeast peroxisomal proteins, dysferlin, myoferlin and hypothetical proteins. Due to an insertion of a dysferlin domain within a second dysferlin domain we have chosen to predict these domains in two parts: the N-terminal region and the C-terminal region.
pfam C2 89aa 4e-13
in ref transcript
pfam C2 97aa 2e-11
in ref transcript
smart DysFN 59aa 2e-11
in ref transcript
pfam C2 84aa 2e-09
in ref transcript
smart C2 94aa 1e-07
in ref transcript
Protein kinase C conserved region 2 (CalB). Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotamins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates, and intracellular proteins. Unusual occurrence in perforin. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands. SMART detects C2 domains using one or both of two profiles.
smart C2 96aa 2e-04
in ref transcript
COG COG5038 77aa 8e-07
in ref transcript
Ca2+-dependent lipid-binding protein, contains C2 domain [General function prediction only].
COG COG5038 87aa 6e-05
in ref transcript
COG COG5038 90aa 0.009
in ref transcript
Changed!
cd C2 106aa 5e-10
in modified transcript
FGF1
refseq_FGF1.F2 refseq_FGF1.R2 288 395
NCBIGene 36.3 2246
Exon skipping and alternative 3-prime or 5-prime, size difference: 107
Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
Reference transcript: NM_000800
Changed!
cd FGF 123aa 1e-38
in ref transcript
Acidic and basic fibroblast growth factor family; FGFs are mitogens, which stimulate growth or differentiation of cells of mesodermal or neuroectodermal origin. The family plays essential roles in patterning and differentiation during vertebrate embryogenesis, and has neurotrophic activities. FGFs have a high affinity for heparan sulfate proteoglycans and require heparan sulfate to activate one of four cell surface FGF receptors. Upon binding to FGF, the receptors dimerize and their intracellular tyrosine kinase domains become active. FGFs have internal pseudo-threefold symmetry (beta-trefoil topology).
Changed!
smart FGF 127aa 1e-44
in ref transcript
Acidic and basic fibroblast growth factor family. Mitogens that stimulate growth or differentiation of cells of mesodermal or neuroectodermal origin. The family play essential roles in patterning and differentiation during vertebrate embryogenesis, and have neurotrophic activities.
Changed!
cd FGF 29aa 1e-05
in modified transcript
Changed!
smart FGF 35aa 1e-06
in modified transcript
FGF5
refseq_FGF5.F2 refseq_FGF5.R2 256 360
NCBIGene 36.3 2250
Single exon skipping, size difference: 104
Exclusion in the protein causing a frameshift
Reference transcript: NM_004464
Changed!
cd FGF 129aa 6e-37
in ref transcript
Acidic and basic fibroblast growth factor family; FGFs are mitogens, which stimulate growth or differentiation of cells of mesodermal or neuroectodermal origin. The family plays essential roles in patterning and differentiation during vertebrate embryogenesis, and has neurotrophic activities. FGFs have a high affinity for heparan sulfate proteoglycans and require heparan sulfate to activate one of four cell surface FGF receptors. Upon binding to FGF, the receptors dimerize and their intracellular tyrosine kinase domains become active. FGFs have internal pseudo-threefold symmetry (beta-trefoil topology).
Changed!
smart FGF 134aa 4e-44
in ref transcript
Acidic and basic fibroblast growth factor family. Mitogens that stimulate growth or differentiation of cells of mesodermal or neuroectodermal origin. The family play essential roles in patterning and differentiation during vertebrate embryogenesis, and have neurotrophic activities.
Changed!
cd FGF 26aa 0.001
in modified transcript
Changed!
pfam FGF 37aa 2e-06
in modified transcript
Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. They are found in a wide range of organisms, from nematodes to humans. Most share an internal core region of high similarity, conserved residues in which are involved in binding with their receptors. On binding, they cause dimerisation of their tyrosine kinase receptors leading to intracellular signalling. There are currently four known tyrosine kinase receptors for fibroblast growth factors. These receptors can each bind several different members of this family. Members of this family have a beta trefoil structure. Most have N-terminal signal peptides and are secreted. A few lack signal sequences but are secreted anyway; still others also lack the signal peptide but are found on the cell surface and within the extracellular matrix. A third group remain intracellular. They have central roles in development, regulating cell proliferation, migration and differentiation. On the other hand, they are important in tissue repair following injury in adult organisms.
FGFR1OP
refseq_FGFR1OP.F1 refseq_FGFR1OP.R1 194 254
NCBIGene 36.3 11116
Single exon skipping, size difference: 60
Exclusion in the protein (no frameshift)
Reference transcript: NM_007045
pfam FOP_dimer 81aa 8e-35
in ref transcript
FOP N terminal dimerisation domain. Fibroblast growth factor receptor 1 (FGFR1) oncogene partner (FOP) is a centrosomal protein that is involved in anchoring microtubules to subcellular structures. This domain includes a Lis-homology motif. It forms an alpha helical bundle and is involved in dimerisation.
FGL1
refseq_FGL1.F2 refseq_FGL1.R2 216 343
NCBIGene 36.3 2267
Single exon skipping, size difference: 127
Exclusion in 5'UTR
Reference transcript: NM_201553
cd FReD 227aa 1e-83
in ref transcript
Fibrinogen-related domains (FReDs); C terminal globular domain of fibrinogen. Fibrinogen is involved in blood clotting, being activated by thrombin to assemble into fibrin clots. The N-termini of 2 times 3 chains come together to form a globular arrangement called the disulfide knot. The C termini of fibrinogen chains end in globular domains, which are not completely equivalent. C terminal globular domains of the gamma chains (C-gamma) dimerize and bind to the GPR motif of the N-terminal domain of the alpha chain, while the GHR motif of N-terminal domain of the beta chain binds to the C terminal globular domains of another beta chain (C-beta), which leads to lattice formation.
smart FBG 226aa 2e-90
in ref transcript
Fibrinogen-related domains (FReDs). Domain present at the C-termini of fibrinogen beta and gamma chains, and a variety of fibrinogen-related proteins, including tenascin and Drosophila scabrous.
FGL1
refseq_FGL1.F3 refseq_FGL1.R1 114 241
NCBIGene 36.3 2267
Single exon skipping, size difference: 127
Exclusion in 5'UTR
Reference transcript: NM_201552
cd FReD 227aa 1e-83
in ref transcript
Fibrinogen-related domains (FReDs); C terminal globular domain of fibrinogen. Fibrinogen is involved in blood clotting, being activated by thrombin to assemble into fibrin clots. The N-termini of 2 times 3 chains come together to form a globular arrangement called the disulfide knot. The C termini of fibrinogen chains end in globular domains, which are not completely equivalent. C terminal globular domains of the gamma chains (C-gamma) dimerize and bind to the GPR motif of the N-terminal domain of the alpha chain, while the GHR motif of N-terminal domain of the beta chain binds to the C terminal globular domains of another beta chain (C-beta), which leads to lattice formation.
smart FBG 226aa 2e-90
in ref transcript
Fibrinogen-related domains (FReDs). Domain present at the C-termini of fibrinogen beta and gamma chains, and a variety of fibrinogen-related proteins, including tenascin and Drosophila scabrous.
FHL2
refseq_FHL2.F1 refseq_FHL2.R1 205 377
NCBIGene 36.3 2274
Multiple exon skipping, size difference: 172
Inclusion in the protein (no stop codon or frameshift), Inclusion in the protein causing a frameshift
Reference transcript: NM_001450
Changed!
pfam LIM 56aa 7e-06
in ref transcript
LIM domain. This family represents two copies of the LIM structural domain.
Changed!
smart LIM 38aa 3e-04
in ref transcript
Zinc-binding domain present in Lin-11, Isl-1, Mec-3. Zinc-binding domain family. Some LIM domains bind protein partners via tyrosine-containing motifs. LIM domains are found in many key regulators of developmental pathways.
Changed!
pfam LIM 58aa 5e-04
in ref transcript
Changed!
smart LIM 41aa 0.008
in ref transcript
FHL2
refseq_FHL2.F3 refseq_FHL2.R1 123 244
NCBIGene 36.3 2274
Single exon skipping, size difference: 121
Exclusion in 5'UTR
Reference transcript: NM_201555
pfam LIM 56aa 5e-05
in ref transcript
LIM domain. This family represents two copies of the LIM structural domain.
smart LIM 38aa 0.001
in ref transcript
Zinc-binding domain present in Lin-11, Isl-1, Mec-3. Zinc-binding domain family. Some LIM domains bind protein partners via tyrosine-containing motifs. LIM domains are found in many key regulators of developmental pathways.
pfam LIM 58aa 0.002
in ref transcript
FHL2
refseq_FHL2.F5 refseq_FHL2.R4 187 315
NCBIGene 36.3 2274
Alternative 5-prime, size difference: 128
Exclusion in 5'UTR
Reference transcript: NM_201555
pfam LIM 56aa 5e-05
in ref transcript
LIM domain. This family represents two copies of the LIM structural domain.
smart LIM 38aa 0.001
in ref transcript
Zinc-binding domain present in Lin-11, Isl-1, Mec-3. Zinc-binding domain family. Some LIM domains bind protein partners via tyrosine-containing motifs. LIM domains are found in many key regulators of developmental pathways.
pfam LIM 58aa 0.002
in ref transcript
FIBP
refseq_FIBP.F1 refseq_FIBP.R1 116 137
NCBIGene 36.3 9158
Alternative 5-prime, size difference: 21
Exclusion in the protein (no frameshift)
Reference transcript: NM_198897
Changed!
pfam FIBP 363aa 1e-162
in ref transcript
Acidic fibroblast growth factor binding (FIBP). Acidic fibroblast growth factor (aFGF) intracellular binding protein (FIBP) is a protein found mainly in the nucleus that is thought to be involved in the intracellular function of aFGF.
Changed!
pfam FIBP 356aa 1e-164
in modified transcript
FKBP1B
refseq_FKBP1B.F2 refseq_FKBP1B.R2 144 189
NCBIGene 36.3 2281
Alternative 3-prime, size difference: 45
Inclusion in the protein causing a new stop codon
Reference transcript: NM_004116
Changed!
pfam FKBP_C 95aa 6e-33
in ref transcript
FKBP-type peptidyl-prolyl cis-trans isomerase.
Changed!
COG FkpA 104aa 3e-30
in ref transcript
FKBP-type peptidyl-prolyl cis-trans isomerases 1 [Posttranslational modification, protein turnover, chaperones].
Changed!
pfam FKBP_C 53aa 3e-17
in modified transcript
Changed!
COG FkpA 66aa 9e-14
in modified transcript
FKBP7
refseq_FKBP7.F1 refseq_FKBP7.R1 113 227
NCBIGene 36.2 51661
Single exon skipping, size difference: 114
Inclusion in the protein causing a new stop codon
Reference transcript: NM_181342
Changed!
pfam FKBP_C 98aa 2e-28
in ref transcript
FKBP-type peptidyl-prolyl cis-trans isomerase.
Changed!
COG FkpA 97aa 1e-17
in ref transcript
FKBP-type peptidyl-prolyl cis-trans isomerases 1 [Posttranslational modification, protein turnover, chaperones].
Changed!
pfam FKBP_C 81aa 3e-20
in modified transcript
Changed!
COG FkpA 103aa 1e-11
in modified transcript
FKRP
refseq_FKRP.F2 refseq_FKRP.R2 173 392
NCBIGene 36.3 79147
Alternative 5-prime, size difference: 219
Exclusion in 5'UTR
Reference transcript: NM_001039885
pfam LicD 141aa 2e-16
in ref transcript
LICD Protein Family. The LICD family of proteins show high sequence similarity and are involved in phosphorylcholine metabolism. There is evidence to show that LicD2 mutants have a reduced ability to take up choline, have decreased ability to adhere to host cells and are less virulent.
COG COG3475 48aa 6e-08
in ref transcript
LPS biosynthesis protein [Cell envelope biogenesis, outer membrane].
KRI1-like family. The yeast member of this family (Kri1p) is found to be required for 40S ribosome biogenesis in the nucleolus.
FLJ20105
refseq_FLJ20105.F2 refseq_FLJ20105.R2 241 359
NCBIGene 36.2 54821
Single exon skipping, size difference: 118
Inclusion in the protein causing a new stop codon
Reference transcript: NM_017669
Changed!
cd DEXDc 144aa 8e-17
in ref transcript
DEAD-like helicases superfamily. A diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.
Changed!
cd HELICc 134aa 3e-14
in ref transcript
Helicase superfamily c-terminal domain; associated with DEXDc-, DEAD-, and DEAH-box proteins, yeast initiation factor 4A, Ski2p, and Hepatitis C virus NS3 helicases; this domain is found in a wide variety of helicases and helicase related proteins; may not be an autonomously folding unit, but an integral part of the helicase; 4 helicase superfamilies at present according to the organization of their signature motifs; all helicases share the ability to unwind nucleic acid duplexes with a distinct directional polarity; they utilize the free energy from nucleoside triphosphate hydrolysis to fuel their translocation along DNA, unwinding the duplex in the process.
Changed!
pfam SNF2_N 286aa 3e-60
in ref transcript
SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin unwinding (e.g., ISWI) as well as a variety of other proteins with little functional information (e.g., lodestar, ETL1).
Changed!
smart HELICc 84aa 5e-09
in ref transcript
helicase superfamily c-terminal domain.
Changed!
COG HepA 571aa 2e-66
in ref transcript
Superfamily II DNA/RNA helicases, SNF2 family [Transcription / DNA replication, recombination, and repair].
DEF8
refseq_FLJ20186.F1 refseq_FLJ20186.R1 139 264
NCBIGene 36.3 54849
Alternative 5-prime, size difference: 125
Exclusion in the protein causing a frameshift
Reference transcript: NM_207514
Changed!
cd C1 51aa 0.001
in ref transcript
Protein kinase C conserved region 1 (C1) . Cysteine-rich zinc binding domain. Some members of this domain family bind phorbol esters and diacylglycerol, some are reported to bind RasGTP. May occur in tandem arrangement. Diacylglycerol (DAG) is a second messenger, released by activation of Phospholipase D. Phorbol Esters (PE) can act as analogues of DAG and mimic its downstream effects in, for example, tumor promotion. Protein Kinases C are activated by DAG/PE, this activation is mediated by their N-terminal conserved region (C1). DAG/PE binding may be phospholipid dependent. C1 domains may also mediate DAG/PE signals in chimaerins (a family of Rac GTPase activating proteins), RasGRPs (exchange factors for Ras/Rap1), and Munc13 isoforms (scaffolding proteins involved in exocytosis).
Changed!
pfam C1_1 54aa 0.002
in ref transcript
Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain.
FLJ21839
refseq_FLJ21839.F1 refseq_FLJ21839.R1 200 247
NCBIGene 36.2 60509
Single exon skipping, size difference: 47
Inclusion in the protein causing a frameshift
Reference transcript: NM_001035506
cd M14_AGBL5_like 167aa 2e-83
in ref transcript
Peptidase M14-like domain of ATP/GTP binding protein_like (AGBL)-5, and related proteins. The Peptidase M14 family of metallocarboxypeptidases are zinc-binding carboxypeptidases (CPs) which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. This eukaryotic subgroup includes the human AGBL5 and the mouse cytosolic carboxypeptidase (CCP)-5. ATP/GTP binding protein (AGTPBP-1/Nna1)-like proteins are active metallopeptidases that are thought to act on cytosolic proteins such as alpha-tubulin, to remove a C-terminal tyrosine. Mutations in AGTPBP-1/Nna1 cause Purkinje cell degeneration (pcd). AGTPBP-1/Nna1 however does not belong to this subgroup. AGTPBP-1/Nna1-like proteins from the different phyla are highly diverse, but they all contain a unique N-terminal conserved domain right before the CP domain. It has been suggested that this N-terminal domain might act as a folding domain.
cd M14_AGBL5_like 145aa 8e-65
in ref transcript
pfam Peptidase_M14 79aa 1e-16
in ref transcript
Zinc carboxypeptidase.
FLJ22222
refseq_FLJ22222.F2 refseq_FLJ22222.R2 193 406
NCBIGene 36.3 79701
Single exon skipping, size difference: 213
Exclusion of the stop codon
Reference transcript: NM_175902
Changed!
smart P4Hc 115aa 5e-05
in ref transcript
Prolyl 4-hydroxylase alpha subunit homologues. Mammalian enzymes catalyse hydroxylation of collagen, for example. Prokaryotic enzymes might catalyse hydroxylation of antibiotic peptides. These are 2-oxoglutarate-dependent dioxygenases, requiring 2-oxoglutarate and dioxygen as cosubstrates and ferrous iron as a cofactor.
Changed!
smart P4Hc 145aa 1e-14
in modified transcript
FLJ31033
refseq_FLJ31033.F2 refseq_FLJ31033.R2 228 276
NCBIGene 36.2 91351
Alternative 5-prime, size difference: 48
Exclusion in the protein (no frameshift)
Reference transcript: XM_930988
cd DEXDc 143aa 3e-11
in ref transcript
DEAD-like helicases superfamily. A diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.
cd HELICc 103aa 1e-07
in ref transcript
Helicase superfamily c-terminal domain; associated with DEXDc-, DEAD-, and DEAH-box proteins, yeast initiation factor 4A, Ski2p, and Hepatitis C virus NS3 helicases; this domain is found in a wide variety of helicases and helicase related proteins; may not be an autonomously folding unit, but an integral part of the helicase; 4 helicase superfamilies at present according to the organization of their signature motifs; all helicases share the ability to unwind nucleic acid duplexes with a distinct directional polarity; they utilize the free energy from nucleoside triphosphate hydrolysis to fuel their translocation along DNA, unwinding the duplex in the process.
pfam DEAD 155aa 3e-15
in ref transcript
DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA splicing, ribosome biogenesis, nucleocytoplasmic transport, translation, RNA decay and organellar gene expression.
smart HELICc 77aa 8e-09
in ref transcript
helicase superfamily c-terminal domain.
COG COG4581 123aa 1e-24
in ref transcript
Superfamily II RNA helicase [DNA replication, recombination, and repair].
COG COG4581 176aa 2e-22
in ref transcript
FLJ31033
refseq_FLJ31033.F4 refseq_FLJ31033.R4 111 207
NCBIGene 36.2 91351
Single exon skipping, size difference: 96
Exclusion in the protein (no frameshift)
Reference transcript: XM_930988
Changed!
cd DEXDc 143aa 3e-11
in ref transcript
DEAD-like helicases superfamily. A diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.
cd HELICc 103aa 1e-07
in ref transcript
Helicase superfamily c-terminal domain; associated with DEXDc-, DEAD-, and DEAH-box proteins, yeast initiation factor 4A, Ski2p, and Hepatitis C virus NS3 helicases; this domain is found in a wide variety of helicases and helicase related proteins; may not be an autonomously folding unit, but an integral part of the helicase; 4 helicase superfamilies at present according to the organization of their signature motifs; all helicases share the ability to unwind nucleic acid duplexes with a distinct directional polarity; they utilize the free energy from nucleoside triphosphate hydrolysis to fuel their translocation along DNA, unwinding the duplex in the process.
Changed!
pfam DEAD 155aa 3e-15
in ref transcript
DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA splicing, ribosome biogenesis, nucleocytoplasmic transport, translation, RNA decay and organellar gene expression.
smart HELICc 77aa 8e-09
in ref transcript
helicase superfamily c-terminal domain.
Changed!
COG COG4581 123aa 1e-24
in ref transcript
Superfamily II RNA helicase [DNA replication, recombination, and repair].
Changed!
COG COG4581 176aa 2e-22
in ref transcript
Changed!
cd DEXDc 61aa 6e-04
in modified transcript
Changed!
smart DEXDc 74aa 1e-05
in modified transcript
DEAD-like helicases superfamily.
Changed!
COG COG4581 299aa 9e-25
in modified transcript
Changed!
COG COG4581 144aa 3e-13
in modified transcript
FLJ44379
refseq_FLJ44379.F1 refseq_FLJ44379.R1 311 401
NCBIGene 36.2 132203
Single exon skipping, size difference: 90
Exclusion in the protein (no frameshift)
Reference transcript: XM_059578
Changed!
cd S-100 115aa 2e-10
in ref transcript
S-100: S-100 domain, which represents the largest family within the superfamily of proteins carrying the Ca-binding EF-hand motif. Note that this S-100 hierarchy contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins are expressed exclusively in vertebrates, and are implicated in intracellular and extracellular regulatory activities. Intracellularly, S100 proteins act as Ca-signaling or Ca-buffering proteins. The most unusual characteristic of certain S100 proteins is their occurrence in extracellular space, where they act in a cytokine-like manner through RAGE, the receptor for advanced glycation products. Structural data suggest that many S100 members exist within cells as homo- or heterodimers and even oligomers; oligomerization contributes to their functional diversification. Upon binding calcium, most S100 proteins change conformation to a more open structure exposing a hydrophobic cleft. This hydrophobic surface represents the interaction site of S100 proteins with their target proteins. There is experimental evidence showing that many S100 proteins have multiple binding partners with diverse mode of interaction with different targets. In addition to S100 proteins (such as S100A1,-3,-4,-6,-7,-10,-11,and -13), this group includes the ''fused'' gene family, a group of calcium binding S100-related proteins. The ''fused'' gene family includes multifunctional epidermal differentiation proteins - profilaggrin, trichohyalin, repetin, hornerin, and cornulin; functionally these proteins are associated with keratin intermediate filaments and partially crosslinked to the cell envelope. These ''fused'' gene proteins contain N-terminal sequence with two Ca-binding EF-hands motif, which may be associated with calcium signaling in epidermal cells and autoprocessing in a calcium-dependent manner. In contrast to S100 proteins, "fused" gene family proteins contain an extraordinary high number of almost perfect peptide repeats with regular array of polar and charged residues similar to many known cell envelope proteins.
Changed!
cd S-100 85aa 7e-15
in modified transcript
FLRT3
refseq_FLRT3.F1 refseq_FLRT3.R1 191 385
NCBIGene 36.3 23767
Single exon skipping, size difference: 194
Exclusion in 5'UTR
Reference transcript: NM_198391
cd LRR_RI 271aa 1e-04
in ref transcript
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
TIGR PCC 78aa 2e-08
in ref transcript
Note: this model is restricted to the amino half because a full-length model is incompatible with the HMM software package.
smart LRRNT 31aa 0.005
in ref transcript
Leucine rich repeat N-terminal domain.
COG COG4886 241aa 9e-07
in ref transcript
Leucine-rich repeat (LRR) protein [Function unknown].
FMNL3
refseq_FMNL3.F1 refseq_FMNL3.R1 127 280
NCBIGene 36.3 91010
Single exon skipping, size difference: 153
Exclusion in the protein (no frameshift)
Reference transcript: NM_175736
pfam FH2 366aa 1e-77
in ref transcript
Formin Homology 2 Domain.
pfam Drf_FH3 202aa 8e-42
in ref transcript
Diaphanous FH3 Domain. This region is found in the Formin-like and and diaphanous proteins.
Changed!
pfam Drf_GBD 116aa 6e-26
in ref transcript
Diaphanous GTPase-binding Domain. This domain is bound to by GTP-attached Rho proteins, leading to activation of the Drf protein.
Changed!
pfam Drf_GBD 49aa 1e-12
in ref transcript
Changed!
pfam Drf_GBD 183aa 7e-44
in modified transcript
FMO3
refseq_FMO3.F2 refseq_FMO3.R2 96 113
NCBIGene 36.3 2328
Alternative 5-prime, size difference: 17
Exclusion in 5'UTR
Reference transcript: NM_001002294
pfam FMO-like 531aa 0.0
in ref transcript
Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase.
COG TrkA 340aa 4e-34
in ref transcript
Predicted flavoprotein involved in K+ transport [Inorganic ion transport and metabolism].
FOLH1
refseq_FOLH1.F1 refseq_FOLH1.R1 249 342
NCBIGene 36.3 2346
Single exon skipping, size difference: 93
Exclusion in the protein (no frameshift)
Reference transcript: NM_004476
cd PA_GCPII_like 227aa 9e-72
in ref transcript
PA_GCPII_like: Protease-associated domain containing protein, glutamate carboxypeptidase II (GCPII)-like. This group contains various PA domain-containing proteins similar to GCPII including, GCPIII (NAALADase2) and NAALADase L. These proteins belong to the peptidase M28 family. GCPII is also known N-acetylated-alpha-linked acidic dipeptidase (NAALDase1), folate hydrolase or prostate-specific membrane antigen (PSMA). GCPII is found in various human tissues including prostate, small intestine, and the central nervous system. In the brain, GCPII is known as NAALDase1, it functions as a NAALDase hydrolyzing the neuropeptide N-acetyl-L-aspartyl-L-glutamate (alpha-NAAG), to release free glutamate. In the small intestine, GCPII releases the terminal glutamate from poly-gamma-glutamated folates. GCPII (PSMA) is a useful cancer marker; its expression is markedly increased in prostate cancer and in tumor-associated neovasculature. GCPIII hydrolyzes alpha-NAAG with a lower efficiency than does GCPII; NAALADase L is not able to hydrolyze alpha-NAAG. The GCPII PA domain (referred to as the apical domain) participates in substrate binding and may act as a protein-protein interaction domain.
Changed!
pfam TFR_dimer 138aa 8e-48
in ref transcript
Transferrin receptor-like dimerisation domain. This domain is involved in dimerisation of the transferrin receptor as shown in its crystal structure.
pfam PA 93aa 5e-15
in ref transcript
PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family. It has been suggested that this domain forms a lid-like structure that covers the active site in active proteases, and is involved in protein recognition in vacuolar sorting receptors.
pfam Peptidase_M28 85aa 8e-13
in ref transcript
Peptidase family M28.
Changed!
COG Iap 219aa 2e-09
in ref transcript
Predicted aminopeptidases [General function prediction only].
Changed!
pfam TFR_dimer 107aa 8e-26
in modified transcript
Changed!
COG Iap 238aa 2e-09
in modified transcript
FOLR1
refseq_FOLR1.F1 refseq_FOLR1.R1 157 223
NCBIGene 36.3 2348
Single exon skipping, size difference: 66
Exclusion in 5'UTR
Reference transcript: NM_016724
pfam Folate_rec 185aa 4e-78
in ref transcript
Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anchor. The proteins contain 16 conserved cysteines that form eight disulphide bridges.
FOXI1
refseq_FOXI1.F1 refseq_FOXI1.R1 100 385
NCBIGene 36.3 2299
Alternative 3-prime, size difference: 285
Exclusion in the protein (no frameshift)
Reference transcript: NM_012188
Changed!
cd FH 78aa 2e-36
in ref transcript
Forkhead (FH), also known as a "winged helix". FH is named for the Drosophila fork head protein, a transcription factor which promotes terminal rather than segmental development. This family of transcription factor domains, which bind to B-DNA as monomers, are also found in the Hepatocyte nuclear factor (HNF) proteins, which provide tissue-specific gene regulation. The structure contains 2 flexible loops or "wings" in the C-terminal region, hence the term winged helix.
Changed!
smart FH 89aa 7e-43
in ref transcript
FORKHEAD. FORKHEAD, also known as a "winged helix".
Changed!
COG COG5025 92aa 2e-15
in ref transcript
Transcription factor of the Forkhead/HNF3 family [Transcription].
Changed!
cd FH 69aa 4e-30
in modified transcript
Changed!
smart FH 69aa 3e-31
in modified transcript
Changed!
COG COG5025 121aa 1e-12
in modified transcript
FOXM1
refseq_FOXM1.F2 refseq_FOXM1.R2 100 145
NCBIGene 36.3 2305
Single exon skipping, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_202002
cd FH 76aa 2e-28
in ref transcript
Forkhead (FH), also known as a "winged helix". FH is named for the Drosophila fork head protein, a transcription factor which promotes terminal rather than segmental development. This family of transcription factor domains, which bind to B-DNA as monomers, are also found in the Hepatocyte nuclear factor (HNF) proteins, which provide tissue-specific gene regulation. The structure contains 2 flexible loops or "wings" in the C-terminal region, hence the term winged helix.
smart FH 81aa 2e-30
in ref transcript
FORKHEAD. FORKHEAD, also known as a "winged helix".
COG COG5025 78aa 9e-10
in ref transcript
Transcription factor of the Forkhead/HNF3 family [Transcription].
FOXM1
refseq_FOXM1.F3 refseq_FOXM1.R3 284 398
NCBIGene 36.3 2305
Single exon skipping, size difference: 114
Exclusion in the protein (no frameshift)
Reference transcript: NM_202002
cd FH 76aa 2e-28
in ref transcript
Forkhead (FH), also known as a "winged helix". FH is named for the Drosophila fork head protein, a transcription factor which promotes terminal rather than segmental development. This family of transcription factor domains, which bind to B-DNA as monomers, are also found in the Hepatocyte nuclear factor (HNF) proteins, which provide tissue-specific gene regulation. The structure contains 2 flexible loops or "wings" in the C-terminal region, hence the term winged helix.
smart FH 81aa 2e-30
in ref transcript
FORKHEAD. FORKHEAD, also known as a "winged helix".
COG COG5025 78aa 9e-10
in ref transcript
Transcription factor of the Forkhead/HNF3 family [Transcription].
FRMPD2
refseq_FRMPD2.F1 refseq_FRMPD2.R1 169 235
NCBIGene 36.3 143162
Single exon skipping, size difference: 66
Exclusion in the protein (no frameshift)
Reference transcript: NM_001018071
cd PDZ_signaling 82aa 1e-12
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
cd FERM_C 93aa 9e-11
in ref transcript
The FERM_C domain is the third structural domain within the FERM domain. The FERM domain is found in the cytoskeletal-associated proteins such as ezrin, moesin, radixin, 4.1R, and merlin. These proteins provide a link between the membrane and cytoskeleton and are involved in signal transduction pathways. The FERM_C domain is also found in protein tyrosine phosphatases (PTPs) , the tryosine kinases FAKand JAK, in addition to other proteins involved in signaling. This domain is structuraly similar to the PH and PTB domains and consequently is capable of binding to both peptides and phospholipids at different sites.
cd PDZ_signaling 87aa 8e-10
in ref transcript
cd PDZ_signaling 85aa 2e-08
in ref transcript
Changed!
smart KIND 173aa 6e-39
in ref transcript
kinase non-catalytic C-lobe domain. It is an interaction domain identified as being similar to the C-terminal protein kinase catalytic fold (C lobe). Its presence at the N terminus of signalling proteins and the absence of the active-site residues in the catalytic and activation loops suggest that it folds independently and is likely to be non-catalytic. The occurrence of KIND only in metazoa implies that it has evolved from the catalytic protein kinase domain into an interaction domain possibly by keeping the substrate-binding features.
smart B41 205aa 2e-32
in ref transcript
Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
pfam FERM_C 88aa 2e-25
in ref transcript
FERM C-terminal PH-like domain.
smart PDZ 88aa 4e-16
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
smart PDZ 90aa 3e-13
in ref transcript
TIGR degP_htrA_DO 191aa 2e-06
in ref transcript
This family consists of a set proteins various designated DegP, heat shock protein HtrA, and protease DO. The ortholog in Pseudomonas aeruginosa is designated MucD and is found in an operon that controls mucoid phenotype. This family also includes the DegQ (HhoA) paralog in E. coli which can rescue a DegP mutant, but not the smaller DegS paralog, which cannot. Members of this family are located in the periplasm and have separable functions as both protease and chaperone. Members have a trypsin domain and two copies of a PDZ domain. This protein protects bacteria from thermal and other stresses and may be important for the survival of bacterial pathogens.// The chaperone function is dominant at low temperatures, whereas the proteolytic activity is turned on at elevated temperatures.
Changed!
smart KIND 151aa 6e-28
in modified transcript
FSHB
refseq_FSHB.F1 refseq_FSHB.R1 144 174
NCBIGene 36.3 2488
Alternative 5-prime, size difference: 30
Exclusion in 5'UTR
Reference transcript: NM_000510
cd GHB 102aa 5e-34
in ref transcript
Glycoprotein hormone beta chain homologues. Gonadotropins; reproductive hormones consisting of two glycosylated chains (alpha and beta) of similar topology with Cysteine-knot motifs.
smart GHB 107aa 2e-39
in ref transcript
Glycoprotein hormone beta chain homologues. Also called gonadotropins. Glycoprotein hormones consist of two glycosylated chains (alpha and beta) of similar topology.
FST
refseq_FST.F1 refseq_FST.R1 134 398
NCBIGene 36.3 10468
Alternative 3-prime, size difference: 264
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_013409
cd FSL_SPARC 56aa 2e-05
in ref transcript
Follistatin-like SPARC (secreted protein, acidic, and rich in cysteines) domain; SPARC/BM-40/osteonectin is a multifunctional glycoprotein which modulates cellular interaction with the extracellular matrix by its binding to structural matrix proteins such as collagen and vitronectin. The protein it composed of an N-terminal acidic region, a follistatin (FS) domain and an EF-hand calcium binding domain. The FS domain consists of an N-terminal beta hairpin (FOLN/EGF-like domain) and a small hydrophobic core of alpha/beta structure (Kazal domain) and has five disulfide bonds and a conserved N-glycosylation site. The FSL_SPARC domain is a member of the superfamily of kazal-like proteinase inhibitors and follistatin-like proteins.
cd KAZAL_FS 33aa 5e-04
in ref transcript
Kazal type serine protease inhibitors and follistatin-like domains. Kazal inhibitors inhibit serine proteases, such as, trypsin, chyomotrypsin, avian ovomucoids, and elastases. The inhibitory domain has one reactive site peptide bond, which serves the cognate enzyme as substrate. The reactive site peptide bond is a combining loop which has an identical conformation in all Kazal inhibitors and in all enzyme/inhibitor complexes. These Kazal domains (small hydrophobic core of alpha/beta structure with 3 to 4 disulfide bonds) often occur in tandem arrays. Similar domains are also present in follistatin (FS) and follistatin-like family members, which play an important role in tissue specific regulation. The FS domain consists of an N-terminal beta hairpin (FOLN/EGF-like domain) and a Kazal-like domain and has five disulfide bonds. Although the Kazal-like FS substructure is similar to Kazal proteinase inhibitors, no FS domain has yet been shown to be a proteinase inhibitor. Follistatin-like family members include SPARC, also known as, BM-40 or osteonectin, the Gallus gallus Flik protein, as well as, agrin which has a long array of FS domains. The kazal-type inhibitor domain has also been detected in an extracellular loop region of solute carrier 21 (SLC21) family members (organic anion transporters) , which may regulate the specificity of anion uptake. The distant homolog, Ascidian trypsin inhibitor, is included in this CD.
cd KAZAL_FS 40aa 7e-04
in ref transcript
Changed!
cd FSL_SPARC 60aa 0.009
in ref transcript
smart KAZAL 48aa 7e-07
in ref transcript
Kazal type serine protease inhibitors. Kazal type serine protease inhibitors and follistatin-like domains.
smart KAZAL 47aa 4e-06
in ref transcript
smart KAZAL 48aa 8e-06
in ref transcript
pfam PRKCSH 55aa 3e-05
in ref transcript
Glucosidase II beta subunit-like protein. The sequences found in this family are similar to a region found in the beta-subunit of glucosidase II, which is also known as protein kinase C substrate 80K-H (PRKCSH). The enzyme catalyses the sequential removal of two alpha-1,3-linked glucose residues in the second step of N-linked oligosaccharide processing. The beta subunit is required for the solubility and stability of the heterodimeric enzyme, and is involved in retaining the enzyme within the endoplasmic reticulum. Mutations in the gene coding for PRKCSH have been found to be involved in the development of autosomal dominant polycystic liver disease (ADPLD), but the precise role the protein has in the pathogenesis of this disease is unknown. This family also includes an ER sensor for misfolded glycoproteins and is therefore likely to be a generic sugar binding domain.
pfam FOLN 22aa 0.004
in ref transcript
Follistatin/Osteonectin-like EGF domain. Members of this family are predominantly found in osteonectin and follistatin and adopt an EGF-like structure.
AKTIP
refseq_FTS.F2 refseq_FTS.R2 142 171
NCBIGene 36.3 64400
Alternative 5-prime, size difference: 29
Exclusion in 5'UTR
Reference transcript: NM_001012398
cd UBCc 135aa 4e-11
in ref transcript
Ubiquitin-conjugating enzyme E2, catalytic (UBCc) domain. This is part of the ubiquitin-mediated protein degradation pathway in which a thiol-ester linkage forms between a conserved cysteine and the C-terminus of ubiquitin and complexes with ubiquitin protein ligase enzymes, E3. This pathway regulates many fundamental cellular processes. There are also other E2s which form thiol-ester linkages without the use of E3s as well as several UBC homologs (TSG101, Mms2, Croc-1 and similar proteins) which lack the active site cysteine essential for ubiquitination and appear to function in DNA repair pathways which were omitted from the scope of this CD.
smart UBCc 142aa 2e-24
in ref transcript
Ubiquitin-conjugating enzyme E2, catalytic domain homologues. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. This pathway functions in regulating many fundamental processes required for cell viability.TSG101 is one of several UBC homologues that lacks this active site cysteine.
COG COG5078 140aa 1e-12
in ref transcript
Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
FTSJ1
refseq_FTSJ1.F2 refseq_FTSJ1.R2 254 360
NCBIGene 36.3 24140
Single exon skipping, size difference: 106
Inclusion in 5'UTR
Reference transcript: NM_012280
pfam FtsJ 179aa 2e-58
in ref transcript
FtsJ-like methyltransferase. This family consists of FtsJ from various bacterial and archaeal sources FtsJ is a methyltransferase, but actually has no effect on cell division. FtsJ's substrate is the 23S rRNA. The 1.5 A crystal structure of FtsJ in complex with its cofactor S-adenosylmethionine revealed that FtsJ has a methyltransferase fold. This family also includes the N terminus of flaviviral NS5 protein. It has been hypothesised that the N-terminal domain of NS5 is a methyltransferase involved in viral RNA capping.
COG FtsJ 200aa 1e-56
in ref transcript
23S rRNA methylase [Translation, ribosomal structure and biogenesis].
FUS
refseq_FUS.F2 refseq_FUS.R2 155 190
NCBIGene 36.2 2521
Single exon skipping, size difference: 35
Exclusion in the protein causing a frameshift
Reference transcript: NM_004960
Changed!
cd RRM 83aa 2e-11
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
Changed!
smart RRM_2 82aa 3e-11
in ref transcript
RNA recognition motif.
Changed!
pfam zf-RanBP 24aa 0.007
in ref transcript
Zn-finger in Ran binding protein and others.
Changed!
COG COG0724 107aa 1e-06
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
FUT8
refseq_FUT8.F1 refseq_FUT8.R1 264 366
NCBIGene 36.3 2530
Single exon skipping, size difference: 102
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_178155
cd SH3 53aa 5e-05
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
smart SH3 54aa 1e-06
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
FUT8
refseq_FUT8.F3 refseq_FUT8.R3 186 284
NCBIGene 36.3 2530
Single exon skipping, size difference: 98
Exclusion in 5'UTR
Reference transcript: NM_178155
cd SH3 53aa 5e-05
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
smart SH3 54aa 1e-06
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
FXR1
refseq_FXR1.F1 refseq_FXR1.R1 103 447
NCBIGene 36.3 8087
Single exon skipping, size difference: 344
Inclusion in the protein causing a new stop codon
Reference transcript: NM_005087
Changed!
cd KH-I 67aa 1e-05
in ref transcript
K homology RNA-binding domain, type I. KH binds single-stranded RNA or DNA. It is found in a wide variety of proteins including ribosomal proteins, transcription factors and post-transcriptional modifiers of mRNA. There are two different KH domains that belong to different protein folds, but they share a single KH motif. The KH motif is folded into a beta alpha alpha beta unit. In addition to the core, type II KH domains (e.g. ribosomal protein S3) include N-terminal extension and type I KH domains (e.g. hnRNP K) contain C-terminal extension.
Changed!
pfam Agenet 60aa 1e-07
in ref transcript
Agenet domain. This domain is related to the TUDOR domain pfam00567. The function of the agenet domain is unknown. This family currently only matches one of the two Agenet domains in the FMR proteins.
Changed!
pfam KH_1 59aa 9e-06
in ref transcript
KH domain. KH motifs can bind RNA in vitro. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia.
Changed!
TIGR polynuc_phos 89aa 0.002
in ref transcript
Members of this protein family are polyribonucleotide nucleotidyltransferase, also called polynucleotide phosphorylase. Some members have been shown also to have additional functions as guanosine pentaphosphate synthetase and as poly(A) polymerase (see model TIGR02696 for an exception clade, within this family).
Changed!
PRK PRK11824 59aa 7e-04
in ref transcript
Polyribonucleotide nucleotidyltransferase (polynucleotide phosphorylase) [Translation, ribosomal structure and biogenesis].
FXR1
refseq_FXR1.F3 refseq_FXR1.R3 270 362
NCBIGene 36.3 8087
Single exon skipping, size difference: 92
Exclusion in the protein causing a frameshift
Reference transcript: NM_005087
cd KH-I 67aa 1e-05
in ref transcript
K homology RNA-binding domain, type I. KH binds single-stranded RNA or DNA. It is found in a wide variety of proteins including ribosomal proteins, transcription factors and post-transcriptional modifiers of mRNA. There are two different KH domains that belong to different protein folds, but they share a single KH motif. The KH motif is folded into a beta alpha alpha beta unit. In addition to the core, type II KH domains (e.g. ribosomal protein S3) include N-terminal extension and type I KH domains (e.g. hnRNP K) contain C-terminal extension.
pfam Agenet 60aa 1e-07
in ref transcript
Agenet domain. This domain is related to the TUDOR domain pfam00567. The function of the agenet domain is unknown. This family currently only matches one of the two Agenet domains in the FMR proteins.
pfam KH_1 59aa 9e-06
in ref transcript
KH domain. KH motifs can bind RNA in vitro. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia.
Changed!
TIGR polynuc_phos 89aa 0.002
in ref transcript
Members of this protein family are polyribonucleotide nucleotidyltransferase, also called polynucleotide phosphorylase. Some members have been shown also to have additional functions as guanosine pentaphosphate synthetase and as poly(A) polymerase (see model TIGR02696 for an exception clade, within this family).
Polyribonucleotide nucleotidyltransferase (polynucleotide phosphorylase) [Translation, ribosomal structure and biogenesis].
Changed!
TIGR nusA_arch 87aa 0.001
in modified transcript
This model represents a family of archaeal proteins found in a single copy per genome. It contains two KH domains (pfam00013) and is most closely related to the central region bacterial NusA, a transcription termination factor named for its iteraction with phage lambda protein N in E. coli. The proteins required for antitermination by N include NusA, NusB, nusE (ribosomal protein S10), and nusG. This system, on the whole, appears not to be present in the Archaea.
FXYD3
refseq_FXYD3.F1 refseq_FXYD3.R1 102 186
NCBIGene 36.3 5349
Single exon skipping, size difference: 84
Exclusion in 5'UTR
Reference transcript: NM_021910
pfam ATP1G1_PLM_MAT8 74aa 2e-15
in ref transcript
ATP1G1/PLM/MAT8 family.
FYB
refseq_FYB.F1 refseq_FYB.R1 93 231
NCBIGene 36.3 2533
Single exon skipping, size difference: 138
Exclusion in the protein (no frameshift)
Reference transcript: NM_001465
cd SH3 54aa 0.003
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
smart SH3 58aa 5e-05
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
FYTTD1
refseq_FYTTD1.F1 refseq_FYTTD1.R1 149 322
NCBIGene 36.3 84248
Single exon skipping, size difference: 173
Inclusion in the protein causing a new stop codon
Reference transcript: NM_032288
Changed!
pfam FYTT 316aa 1e-143
in ref transcript
Forty-two-three protein. This family consists of several mammalian proteins of around 320 residues in length called 40-2-3 proteins. The function of this family is unknown.
Changed!
pfam FYTT 76aa 2e-29
in modified transcript
G3BP1
refseq_G3BP.F2 refseq_G3BP.R2 131 162
NCBIGene 36.3 10146
Alternative 3-prime, size difference: 31
Inclusion in 5'UTR
Reference transcript: NM_198395
cd NTF2 129aa 8e-32
in ref transcript
Nuclear transport factor 2 (NTF2) domain plays an important role in the trafficking of macromolecules, ions and small molecules between the cytoplasm and nucleus. This bi-directional transport of macromolecules across the nuclear envelope requires many soluble factors that includes GDP-binding protein Ran (RanGDP). RanGDP is required for both import and export of proteins and poly(A) RNA. RanGDP also has been implicated in cell cycle control, specifically in mitotic spindle assembly. In interphase cells, RanGDP is predominately nuclear and thought to be GTP bound, but it is also present in the cytoplasm, probably in the GDP-bound state. NTF2 mediates the nuclear import of RanGDP. NTF2 binds to both RanGDP and FxFG repeat-containing nucleoporins.
cd RRM 70aa 9e-11
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
pfam NTF2 123aa 1e-24
in ref transcript
Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins.
pfam RRM_1 56aa 1e-10
in ref transcript
RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternative splicing, and protein components of snRNPs. The motif also appears in a few single stranded DNA binding proteins. The RRM structure consists of four strands and two helices arranged in an alpha/beta sandwich, with a third helix present during RNA binding in some cases The C-terminal beta strand (4th strand) and final helix are hard to align and have been omitted in the SEED alignment The LA proteins have a N terminus rrm which is included in the seed. There is a second region towards the C terminus that has some features of a rrm but does not appear to have the important structural core of a rrm. The LA proteins are one of the main autoantigens in Systemic lupus erythematosus (SLE), an autoimmune disease.
COG COG0724 105aa 2e-05
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
GAB1
refseq_GAB1.F2 refseq_GAB1.R2 275 365
NCBIGene 36.3 2549
Single exon skipping, size difference: 90
Exclusion in the protein (no frameshift)
Reference transcript: NM_207123
cd PH_Gab 108aa 2e-52
in ref transcript
Gab (Grb2-associated binder) pleckstrin homology (PH) domain. The Gab subfamily includes several Gab proteins, Drosophila DOS and C. elegans SOC-1. They are scaffolding adaptor proteins, which possess N-terminal PH domains and a C-terminus with proline-rich regions and multiple phosphorylation sites. Following activation of growth factor receptors, Gab proteins are tyrosine phosphorylated and activate PI3K, which generates 3-phosphoinositide lipids. By binding to these lipids via the PH domain, Gab proteins remain in proximity to the receptor, leading to further signaling. While not all Gab proteins depend on the PH domain for recruitment, it is required for Gab activity. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
pfam PH 110aa 8e-17
in ref transcript
PH domain. PH stands for pleckstrin homology.
GABBR1
refseq_GABBR1.F1 refseq_GABBR1.R1 129 315
NCBIGene 36.3 2550
Single exon skipping, size difference: 186
Exclusion in the protein (no frameshift)
Reference transcript: NM_001470
cd PBP1_GABAb_receptor 393aa 1e-118
in ref transcript
Ligand-binding domain of GABAb receptors, which are metabotropic transmembrane receptors for gamma-aminobutyric acid (GABA). GABA is the major inhibitory neurotransmitter in the mammalian CNS and, like glutamate and other transmitters, acts via both ligand gated ion channels (GABAa receptors) and G-protein coupled receptors (GABAb). GABAa receptors are members of the ionotropic receptor superfamily which includes alpha-adrenergic and glycine receptors. The GABAb receptor is a member of a receptor superfamily which includes the mGlu receptors. The GABAb receptor is coupled to G alpha_i proteins, and activation causes a decrease in calcium, an increase in potassium membrane conductance, and inhibition of cAMP formation. The response is thus inhibitory and leads to hyperpolarization and decreased neurotransmitter release, for example.
Changed!
cd CCP 53aa 1e-05
in ref transcript
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
pfam ANF_receptor 357aa 5e-63
in ref transcript
Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure.
pfam 7tm_3 259aa 5e-38
in ref transcript
7 transmembrane sweet-taste receptor of 3 GCPR. This is a domain of seven transmembrane regions that forms the C-terminus of some subclass 3 G-coupled-protein receptors. It is often associated with a downstream cysteine-rich linker domain, NCD3G pfam07562, which is the human sweet-taste receptor, and the N-terminal domain, ANF_receptor pfam01094. The seven TM regions assemble in such a way as to produce a docking pocket into which such molecules as cyclamate and lactisole have been found to bind and consequently confer the taste of sweetness.
Changed!
smart CCP 52aa 7e-06
in ref transcript
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.
COG LivK 386aa 5e-27
in ref transcript
ABC-type branched-chain amino acid transport systems, periplasmic component [Amino acid transport and metabolism].
GABBR1
refseq_GABBR1.F3 refseq_GABBR1.R3 144 295
NCBIGene 36.3 2550
Single exon skipping, size difference: 151
Exclusion in the protein causing a frameshift
Reference transcript: NM_001470
cd PBP1_GABAb_receptor 393aa 1e-118
in ref transcript
Ligand-binding domain of GABAb receptors, which are metabotropic transmembrane receptors for gamma-aminobutyric acid (GABA). GABA is the major inhibitory neurotransmitter in the mammalian CNS and, like glutamate and other transmitters, acts via both ligand gated ion channels (GABAa receptors) and G-protein coupled receptors (GABAb). GABAa receptors are members of the ionotropic receptor superfamily which includes alpha-adrenergic and glycine receptors. The GABAb receptor is a member of a receptor superfamily which includes the mGlu receptors. The GABAb receptor is coupled to G alpha_i proteins, and activation causes a decrease in calcium, an increase in potassium membrane conductance, and inhibition of cAMP formation. The response is thus inhibitory and leads to hyperpolarization and decreased neurotransmitter release, for example.
cd CCP 53aa 1e-05
in ref transcript
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
pfam ANF_receptor 357aa 5e-63
in ref transcript
Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure.
Changed!
pfam 7tm_3 259aa 5e-38
in ref transcript
7 transmembrane sweet-taste receptor of 3 GCPR. This is a domain of seven transmembrane regions that forms the C-terminus of some subclass 3 G-coupled-protein receptors. It is often associated with a downstream cysteine-rich linker domain, NCD3G pfam07562, which is the human sweet-taste receptor, and the N-terminal domain, ANF_receptor pfam01094. The seven TM regions assemble in such a way as to produce a docking pocket into which such molecules as cyclamate and lactisole have been found to bind and consequently confer the taste of sweetness.
smart CCP 52aa 7e-06
in ref transcript
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.
COG LivK 386aa 5e-27
in ref transcript
ABC-type branched-chain amino acid transport systems, periplasmic component [Amino acid transport and metabolism].
GABPB2
refseq_GABPB2.F1 refseq_GABPB2.R1 135 159
NCBIGene 36.3 2553
Single exon skipping, size difference: 24
Inclusion in 5'UTR
Reference transcript: NM_005254
cd ANK 123aa 4e-29
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
pfam Ank 30aa 1e-07
in ref transcript
Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
TIGR trp 156aa 1e-06
in ref transcript
after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
COG Arp 132aa 5e-14
in ref transcript
FOG: Ankyrin repeat [General function prediction only].
GABPB2
refseq_GABPB2.F3 refseq_GABPB2.R3 199 235
NCBIGene 36.3 2553
Alternative 5-prime, size difference: 36
Exclusion in the protein (no frameshift)
Reference transcript: NM_005254
cd ANK 123aa 4e-29
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
pfam Ank 30aa 1e-07
in ref transcript
Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
TIGR trp 156aa 1e-06
in ref transcript
after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
COG Arp 132aa 5e-14
in ref transcript
FOG: Ankyrin repeat [General function prediction only].
GABRB2
refseq_GABRB2.F1 refseq_GABRB2.R1 285 399
NCBIGene 36.3 2561
Single exon skipping, size difference: 114
Exclusion in the protein (no frameshift)
Reference transcript: NM_021911
Changed!
TIGR LIC 502aa 1e-129
in ref transcript
selective while glycine receptors are anion selective).
Changed!
TIGR LIC 464aa 1e-131
in modified transcript
GABRG2
refseq_GABRG2.F1 refseq_GABRG2.R1 101 125
NCBIGene 36.3 2566
Single exon skipping, size difference: 24
Exclusion in the protein (no frameshift)
Reference transcript: NM_198904
Changed!
TIGR LIC 404aa 2e-73
in ref transcript
selective while glycine receptors are anion selective).
Changed!
TIGR LIC 396aa 7e-74
in modified transcript
GABRG2
refseq_GABRG2.F2 refseq_GABRG2.R2 127 189
NCBIGene 36.3 2566
Alternative 3-prime, size difference: 62
Exclusion in the protein causing a frameshift
Reference transcript: NM_198904
Changed!
TIGR LIC 404aa 2e-73
in ref transcript
selective while glycine receptors are anion selective).
Changed!
pfam Neur_chan_LBD 41aa 2e-05
in modified transcript
Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure.
GANAB
refseq_GANAB.F1 refseq_GANAB.R1 169 235
NCBIGene 36.3 23193
Single exon skipping, size difference: 66
Exclusion in the protein (no frameshift)
Reference transcript: NM_198335
cd GH31_GANC_GANAB_alpha 339aa 0.0
in ref transcript
This family includes the closely related glycosyl hydrolase family 31 (GH31) isozymes, neutral alpha-glucosidase C (GANC) and the alpha subunit of heterodimeric neutral alpha-glucosidase AB (GANAB). Initially distinguished on the basis of differences in electrophoretic mobility in starch gel, GANC and GANAB have been shown to have other differences, including those of substrate specificity. GANC and GANAB are key enzymes in glycogen metabolism that hydrolyze terminal, non-reducing 1,4-linked alpha-D-glucose residues from glycogen in the endoplasmic reticulum. The GANC/GANAB family includes the alpha-glucosidase II (ModA) from Dictyostelium discoideum as well as the alpha-glucosidase II (GLS2, or ROT2 - Reversal of TOR2 lethality protein 2) from Saccharomyces cerevisiae.
pfam Glyco_hydro_31 446aa 1e-171
in ref transcript
Glycosyl hydrolases family 31. Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases.
COG COG1501 634aa 1e-147
in ref transcript
Alpha-glucosidases, family 31 of glycosyl hydrolases [Carbohydrate transport and metabolism].
GARNL1
refseq_GARNL1.F1 refseq_GARNL1.R1 121 152
NCBIGene 36.3 253959
Single exon skipping, size difference: 31
Inclusion in the protein causing a new stop codon
Reference transcript: NM_194301
pfam Rap_GAP 180aa 2e-19
in ref transcript
Rap/ran-GAP.
GCDH
refseq_GCDH.F1 refseq_GCDH.R1 160 392
NCBIGene 36.3 2639
Alternative 3-prime, size difference: 232
Exclusion of the stop codon
Reference transcript: NM_000159
Changed!
cd GCD 387aa 0.0
in ref transcript
Glutaryl-CoA dehydrogenase (GCD). GCD is an acyl-CoA dehydrogenase, which catalyzes the oxidative decarboxylation of glutaryl-CoA to crotonyl-CoA and carbon dioxide in the catabolism of lysine, hydroxylysine, and tryptophan. It uses electron transfer flavoprotein (ETF) as an electron acceptor. GCD is a homotetramer. GCD deficiency leads to a severe neurological disorder in humans.
pfam Acyl-CoA_dh_N 112aa 4e-32
in ref transcript
Acyl-CoA dehydrogenase, N-terminal domain. The N-terminal domain of Acyl-CoA dehydrogenase is an all-alpha domain.
Changed!
TIGR cyc_hxne_CoA_dh 367aa 1e-25
in ref transcript
Cyclohex-1-ene-1carboxyl-CoA is an intermediate in the anaerobic degradation of benzoyl-CoA derived from varioius aromatic compounds, in Rhodopseudomonas palustris but not Thauera aromatica. The aliphatic compound cyclohexanecarboxylate, can be converted to the same intermediate in two steps. The first step is its ligation to coenzyme A. The second is the action of this enzyme, cyclohexanecarboxyl-CoA dehydrogenase.
Changed!
COG CaiA 375aa 2e-70
in ref transcript
Acyl-CoA dehydrogenases [Lipid metabolism].
Changed!
cd GCD 367aa 1e-180
in modified transcript
Changed!
TIGR cyc_hxne_CoA_dh 300aa 1e-25
in modified transcript
Changed!
COG CaiA 355aa 1e-65
in modified transcript
GCET2
refseq_GCET2.F2 refseq_GCET2.R2 318 485
NCBIGene 36.3 257144
Alternative 3-prime, size difference: 167
Inclusion in the protein causing a new stop codon
Reference transcript: NM_152785
GCH1
refseq_GCH1.F1 refseq_GCH1.R1 101 375
NCBIGene 36.3 2643
Alternative 3-prime, size difference: 274
Exclusion of the stop codon
Reference transcript: NM_000161
Changed!
cd GTP_cyclohydro1 168aa 7e-83
in ref transcript
GTP cyclohydrolase I (GTP-CH-I) catalyzes the conversion of GTP into dihydroneopterin triphosphate. The enzyme product is the precursor of tetrahydrofolate in eubacteria, fungi, and plants and of the folate analogs in methanogenic bacteria. In vertebrates and insects it is the biosynthtic precursor of tetrahydrobiopterin (BH4) which is involved in the formation of catacholamines, nitric oxide, and the stimulation of T lymphocytes. The biosynthetic reaction of BH4 is controlled by a regulatory protein GFRP which mediates feedback inhibition of GTP-CH-I by BH4. This inhibition is reversed by phenylalanine. The decameric GTP-CH-I forms a complex with two pentameric GFRP in the presence of phenylalanine or a combination of GTP and BH4, respectively.
Changed!
TIGR folE 168aa 1e-73
in ref transcript
GTP cyclohydrolase I (EC 3.5.4.16) catalyzes the biosynthesis of formic acid and dihydroneopterin triphosphate from GTP. This reaction is the first step in the biosynthesis of tetrahydrofolate in prokaryotes, of tetrahydrobiopterin in vertebrates, and of pteridine-containing pigments in insects.
Changed!
PRK folE 168aa 4e-81
in ref transcript
GTP cyclohydrolase I; Provisional.
Changed!
cd GTP_cyclohydro1 127aa 9e-62
in modified transcript
Changed!
TIGR folE 127aa 2e-53
in modified transcript
Changed!
PRK folE 127aa 3e-59
in modified transcript
Changed!
PRK PRK01297 54aa 0.010
in modified transcript
ATP-dependent RNA helicase RhlB; Provisional.
GEM
refseq_GEM.F2 refseq_GEM.R2 110 225
NCBIGene 36.3 2669
Alternative 5-prime, size difference: 115
Exclusion in 5'UTR
Reference transcript: NM_005261
cd RGK 221aa 1e-99
in ref transcript
RGK subfamily. The RGK (Rem, Rem2, Rad, Gem/Kir) subfamily of Ras GTPases are expressed in a tissue-specific manner and are dynamically regulated by transcriptional and posttranscriptional mechanisms in response to environmental cues. RGK proteins bind to the beta subunit of L-type calcium channels, causing functional down-regulation of these voltage-dependent calcium channels, and either termination of calcium-dependent secretion or modulation of electrical conduction and contractile function. Inhibition of L-type calcium channels by Rem2 may provide a mechanism for modulating calcium-triggered exocytosis in hormone-secreting cells, and has been proposed to influence the secretion of insulin in pancreatic beta cells. RGK proteins also interact with and inhibit the Rho/Rho kinase pathway to modulate remodeling of the cytoskeleton. Two characteristics of RGK proteins cited in the literature are N-terminal and C-terminal extensions beyond the GTPase domain typical of Ras superfamily members. The N-terminal extension is not conserved among family members; the C-terminal extension is reported to be conserved among the family and lack the CaaX prenylation motif typical of membrane-associated Ras proteins. However, a putative CaaX motif has been identified in the alignment of the C-terminal residues of this CD.
smart RAS 165aa 2e-34
in ref transcript
Ras subfamily of RAS small GTPases. Similar in fold and function to the bacterial EF-Tu GTPase. p21Ras couples receptor Tyr kinases and G protein receptors to protein kinase cascades.
COG COG1100 191aa 1e-13
in ref transcript
GTPase SAR1 and related small G proteins [General function prediction only].
GEMIN7
refseq_GEMIN7.F2 refseq_GEMIN7.R2 272 395
NCBIGene 36.3 79760
Single exon skipping, size difference: 123
Exclusion in 5'UTR
Reference transcript: NM_024707
GFM2
refseq_GFM2.F1 refseq_GFM2.R1 168 309
NCBIGene 36.3 84340
Single exon skipping, size difference: 141
Exclusion in the protein (no frameshift)
Reference transcript: NM_032380
cd EF-G 281aa 1e-125
in ref transcript
Elongation factor G (EF-G) subfamily. Translocation is mediated by EF-G (also called translocase). The structure of EF-G closely resembles that of the complex between EF-Tu and tRNA. This is an example of molecular mimicry; a protein domain evolved so that it mimics the shape of a tRNA molecule. EF-G in the GTP form binds to the ribosome, primarily through the interaction of its EF-Tu-like domain with the 50S subunit. The binding of EF-G to the ribosome in this manner stimulates the GTPase activity of EF-G. On GTP hydrolysis, EF-G undergoes a conformational change that forces its arm deeper into the A site on the 30S subunit. To accommodate this domain, the peptidyl-tRNA in the A site moves to the P site, carrying the mRNA and the deacylated tRNA with it. The ribosome may be prepared for these rearrangements by the initial binding of EF-G as well. The dissociation of EF-G leaves the ribosome ready to accept the next aminoacyl-tRNA into the A site. This group contains both eukaryotic and bacterial members.
cd mtEFG2_like_IV 121aa 1e-44
in ref transcript
mtEF-G2 domain IV. This subfamily is a part the of mitochondrial transcriptional elongation factor, mtEF-G2. Mitochondrial translation is crucial for maintaining mitochondrial function and mutations in this system lead to a breakdown in the respiratory chain-oxidative phosphorylation system and to impaired maintenance of mitochondrial DNA. In complex with GTP, EF-G promotes the translocation step of translation. During translocation the peptidyl-tRNA is moved from the A site to the P site of the small subunit of ribosome and the mRNA is shifted one codon relative to the ribosome.
Changed!
cd mtEFG2_II_like 82aa 3e-35
in ref transcript
mtEFG2_C: C-terminus of mitochondrial Elongation factor G2 (mtEFG2)-like proteins found in eukaryotes. Eukaryotic cells harbor 2 protein synthesis systems: one localized in the cytoplasm, the other in the mitochondria. Most factors regulating mitochondrial protein synthesis are encoded by nuclear genes, translated in the cytoplasm, and then transported to the mitochondria. The eukaryotic system of elongation factor (EF) components is more complex than that in prokaryotes, with both cytoplasmic and mitochondrial elongation factors and multiple isoforms being expressed in certain species. Eukaryotic EF-2 operates in the cytosolic protein synthesis machinery of eukaryotes, EF-Gs in protein synthesis in bacteria. Eukaryotic mtEFG1 proteins show significant homology to bacterial EF-Gs. No clear phenotype has been found for mutants in the yeast homologue of mtEFG2, MEF2. There are two forms of mtEFG present in mammals (designated mtEFG1s and mtEFG2s) mtEFG1s are not present in this group.
cd EFG_mtEFG_C 77aa 1e-21
in ref transcript
EFG_mtEFG_C: domains similar to the C-terminal domain of the bacterial translational elongation factor (EF) EF-G. Included in this group is the C-terminus of mitochondrial Elongation factor G1 (mtEFG1) and G2 (mtEFG2) proteins. Eukaryotic cells harbor 2 protein synthesis systems: one localized in the cytoplasm, the other in the mitochondria. Most factors regulating mitochondrial protein synthesis are encoded by nuclear genes, translated in the cytoplasm, and then transported to the mitochondria. The eukaryotic system of elongation factor (EF) components is more complex than that in prokaryotes, with both cytoplasmic and mitochondrial elongation factors and multiple isoforms being expressed in certain species. During the process of peptide synthesis and tRNA site changes, the ribosome is moved along the mRNA a distance equal to one codon with the addition of each amino acid. In bacteria this translocation step is catalyzed by EF-G_GTP, which is hydrolyzed to provide the required energy. Thus, this action releases the uncharged tRNA from the P site and transfers the newly formed peptidyl-tRNA from the A site to the P site. Eukaryotic mtEFG1 proteins show significant homology to bacterial EF-Gs. Mutants in yeast mtEFG1 have impaired mitochondrial protein synthesis, respiratory defects and a tendency to lose mitochondrial DNA. No clear phenotype has been found for mutants in the yeast homologue of mtEFG2, MEF2.
Changed!
TIGR EF-G 695aa 1e-157
in ref transcript
After peptide bond formation, this elongation factor of bacteria and organelles catalyzes the translocation of the tRNA-mRNA complex, with its attached nascent polypeptide chain, from the A-site to the P-site of the ribosome. Every completed bacterial genome has at least one copy, but some species have additional EF-G-like proteins. The closest homolog to canonical (e.g. E. coli) EF-G in the spirochetes clusters as if it is derived from mitochondrial forms, while a more distant second copy is also present. Synechocystis PCC6803 has a few proteins more closely related to EF-G than to any other characterized protein. Two of these resemble E. coli EF-G more closely than does the best match from the spirochetes; it may be that both function as authentic EF-G.
Changed!
PRK PRK12739 712aa 0.0
in ref transcript
elongation factor G; Reviewed.
Changed!
cd mtEFG2_II_like 41aa 1e-13
in modified transcript
Changed!
TIGR EF-G 289aa 4e-89
in modified transcript
Changed!
TIGR EF-G 353aa 6e-55
in modified transcript
Changed!
COG FusA 663aa 1e-132
in modified transcript
Translation elongation factors (GTPases) [Translation, ribosomal structure and biogenesis].
GFRA4
refseq_GFRA4.F2 refseq_GFRA4.R2 288 378
NCBIGene 36.3 64096
Intron retention, size difference: 79
Exclusion in the protein causing a frameshift
Reference transcript: NM_145762
Changed!
pfam GDNF 80aa 0.001
in ref transcript
GDNF/GAS1 domain. This cysteine rich domain is found in multiple copies in GNDF and GAS1 proteins. GDNF and neurturin (NTN) receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuronal survival by signaling through similar multicomponent receptors that consist of a common receptor tyrosine kinase and a member of a GPI-linked family of receptors that determines ligand specificity.
Changed!
pfam GDNF 60aa 7e-04
in modified transcript
GGA1
refseq_GGA1.F1 refseq_GGA1.R1 100 361
NCBIGene 36.3 26088
Multiple exon skipping, size difference: 261
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_013365
cd VHS_GGA 139aa 3e-64
in ref transcript
VHS domain family, GGA subfamily; GGA (Golgi-localized, Gamma-ear-containing, Arf-binding) comprise a subfamily of ubiquitously expressed, monomeric, motif-binding cargo/clathrin adaptor proteins. The VHS domain has a superhelical structure similar to the structure of the ARM (Armadillo) repeats and is present at the N-termini of proteins. GGA proteins have a multidomain structure consisting of an N-terminal VHS domain linked by a short proline-rich linker to a GAT (GGA and TOM) domain, which is followed by a long flexible linker to the C-terminal appendage, GAE (gamma-adaptin ear) domain. The VHS domain of GGA proteins binds to the acidic-cluster dileucine (DxxLL) motif found on the cytoplasmic tails of cargo proteins trafficked between the trans-Golgi network and the endosomal system.
pfam VHS 135aa 1e-38
in ref transcript
VHS domain. Domain present in VPS-27, Hrs and STAM.
Changed!
pfam GAT 97aa 6e-27
in ref transcript
GAT domain. The GAT domain is responsible for binding of GGA proteins to several members of the ARF family including ARF1 and ARF3. The GAT domain stabilises membrane bound ARF1 in its GTP bound state, by interfering with GAP proteins.
pfam Alpha_adaptinC2 124aa 7e-24
in ref transcript
Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. This ig-fold domain is found in alpha, beta and gamma adaptins.
Changed!
pfam GAT 73aa 2e-16
in modified transcript
GGA3
refseq_GGA3.F1 refseq_GGA3.R1 127 226
NCBIGene 36.3 23163
Single exon skipping, size difference: 99
Exclusion in the protein (no frameshift)
Reference transcript: NM_138619
Changed!
cd VHS_GGA 139aa 9e-69
in ref transcript
VHS domain family, GGA subfamily; GGA (Golgi-localized, Gamma-ear-containing, Arf-binding) comprise a subfamily of ubiquitously expressed, monomeric, motif-binding cargo/clathrin adaptor proteins. The VHS domain has a superhelical structure similar to the structure of the ARM (Armadillo) repeats and is present at the N-termini of proteins. GGA proteins have a multidomain structure consisting of an N-terminal VHS domain linked by a short proline-rich linker to a GAT (GGA and TOM) domain, which is followed by a long flexible linker to the C-terminal appendage, GAE (gamma-adaptin ear) domain. The VHS domain of GGA proteins binds to the acidic-cluster dileucine (DxxLL) motif found on the cytoplasmic tails of cargo proteins trafficked between the trans-Golgi network and the endosomal system.
Changed!
pfam VHS 135aa 3e-48
in ref transcript
VHS domain. Domain present in VPS-27, Hrs and STAM.
pfam GAT 95aa 4e-30
in ref transcript
GAT domain. The GAT domain is responsible for binding of GGA proteins to several members of the ARF family including ARF1 and ARF3. The GAT domain stabilises membrane bound ARF1 in its GTP bound state, by interfering with GAP proteins.
pfam Alpha_adaptinC2 124aa 3e-26
in ref transcript
Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. This ig-fold domain is found in alpha, beta and gamma adaptins.
Changed!
cd VHS_GGA 106aa 4e-43
in modified transcript
Changed!
smart VHS 132aa 5e-29
in modified transcript
Domain present in VPS-27, Hrs and STAM. Unpublished observations. Domain of unknown function.
GGPS1
refseq_GGPS1.F1 refseq_GGPS1.R1 235 328
NCBIGene 36.3 9453
Single exon skipping, size difference: 93
Exclusion of the protein initiation site
Reference transcript: NM_004837
Changed!
cd Trans_IPPS_HT 212aa 4e-52
in ref transcript
Trans-Isoprenyl Diphosphate Synthases (Trans_IPPS), head-to-tail (HT) (1'-4) condensation reactions. This CD includes all-trans (E)-isoprenyl diphosphate synthases which synthesis various chain length (C10, C15, C20, C25, C30, C35, C40, C45, and C50) linear isoprenyl diphosphates from precursors, isopentenyl diphosphate (IPP) and dimethylallyl diphosphate (DMAPP). They catalyze the successive 1'-4 condensation of the 5-carbon IPP to allylic substrates geranyl-, farnesyl-, or geranylgeranyl-diphosphate. Isoprenoid chain elongation reactions proceed via electrophilic alkylations in which a new carbon-carbon single bond is generated through interaction between a highly reactive electron-deficient allylic carbocation and an electron-rich carbon-carbon double bond. The catalytic site consists of a large central cavity formed by mostly antiparallel alpha helices with two aspartate-rich regions (DDXX(XX)D) located on opposite walls. These residues mediate binding of prenyl phosphates via bridging Mg2+ ions, inducing proposed conformational changes that close the active site to solvent, protecting and stabilizing reactive carbocation intermediates. Farnesyl diphosphate synthases produce the precursors of steroids, cholesterol, sesquiterpenes, farnsylated proteins, heme, and vitamin K12; and geranylgeranyl diphosphate and longer chain synthases produce the precursors of carotenoids, retinoids, diterpenes, geranylgeranylated chlorophylls, ubiquinone, and archaeal ether linked lipids. Isoprenyl diphosphate synthases are widely distributed among archaea, bacteria, and eukareya.
Changed!
pfam polyprenyl_synt 251aa 4e-43
in ref transcript
Changed!
cd Trans_IPPS_HT 167aa 2e-43
in modified transcript
Changed!
pfam polyprenyl_synt 208aa 5e-36
in modified transcript
Changed!
COG IspA 233aa 8e-34
in modified transcript
GGT1
refseq_GGT1.F1 refseq_GGT1.R1 364 472
NCBIGene 36.3 2678
Alternative 3-prime, size difference: 108
Inclusion in 5'UTR
Reference transcript: NM_001032364
pfam G_glu_transpept 508aa 1e-174
in ref transcript
Gamma-glutamyltranspeptidase.
COG Ggt 543aa 1e-124
in ref transcript
Gamma-glutamyltransferase [Amino acid transport and metabolism].
GH2
refseq_GH2.F1 refseq_GH2.R1 207 252
NCBIGene 36.3 2689
Alternative 3-prime, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_022557
Changed!
pfam Hormone_1 131aa 3e-26
in ref transcript
Somatotropin hormone family.
Changed!
pfam Hormone_1 116aa 8e-23
in modified transcript
GIMAP6
refseq_GIMAP6.F1 refseq_GIMAP6.R1 100 324
NCBIGene 36.3 474344
Alternative 3-prime, size difference: 224
Exclusion in the protein causing a frameshift
Reference transcript: NM_024711
Changed!
cd AIG1 196aa 1e-78
in ref transcript
AIG1 (avrRpt2-induced gene 1). This represents Arabidoposis protein AIG1 that appears to be involved in plant resistance to bacteria. The Arabidopsis disease resistance gene RPS2 is involved in recognition of bacterial pathogens carrying the avirulence gene avrRpt2. AIG1 exhibits RPS2- and avrRpt1-dependent induction early after infection with Pseudomonas syringae carrying avrRpt2. This subfamily also includes IAN-4 protein, which has GTP-binding activity and shares sequence homology with a novel family of putative GTP-binding proteins: the immuno-associated nucleotide (IAN) family. The evolutionary conservation of the IAN family provides a unique example of a plant pathogen response gene conserved in animals. The IAN/IMAP subfamily has been proposed to regulate apoptosis in vertebrates and angiosperm plants, particularly in relation to cancer, diabetes, and infections. The human IAN genes were renamed GIMAP (GTPase of the immunity associated proteins).
Changed!
pfam AIG1 209aa 4e-73
in ref transcript
AIG1 family. Arabidopsis protein AIG1 appears to be involved in plant resistance to bacteria.
Changed!
PRK rbgA 87aa 3e-04
in ref transcript
ribosomal biogenesis GTPase; Reviewed.
GIPC1
refseq_GIPC1.F1 refseq_GIPC1.R1 100 418
NCBIGene 36.3 10755
Single exon skipping, size difference: 318
Exclusion of the protein initiation site
Reference transcript: NM_202468
cd PDZ_signaling 79aa 8e-08
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
smart PDZ 82aa 5e-09
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
GIT2
refseq_GIT2.F1 refseq_GIT2.R1 249 339
NCBIGene 36.3 9815
Single exon skipping, size difference: 90
Exclusion in the protein (no frameshift)
Reference transcript: NM_057169
cd ANK 112aa 1e-11
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
smart ArfGap 119aa 4e-39
in ref transcript
Putative GTP-ase activating proteins for the small GTPase, ARF. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs.
pfam GIT_SHD 31aa 1e-07
in ref transcript
Spa2 homology domain (SHD) of GIT. GIT proteins are signaling integrators with GTPase-activating function which may be involved in the organisation of the cytoskeletal matrix assembled at active zones (CAZ). The function of the CAZ might be to define sites of neurotransmitter release. Mutations in the Spa2 homology domain (SHD) domain of GIT1 described here interfere with the association of GIT1 with Piccolo, beta-PIX, and focal adhesion kinase.
pfam GIT_SHD 22aa 7e-04
in ref transcript
COG COG5347 115aa 2e-15
in ref transcript
GTPase-activating protein that regulates ARFs (ADP-ribosylation factors), involved in ARF-mediated vesicular transport [Intracellular trafficking and secretion].
Exon skipping and alternative 3-prime or 5-prime, size difference: 384
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_057169
cd ANK 112aa 1e-11
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
smart ArfGap 119aa 4e-39
in ref transcript
Putative GTP-ase activating proteins for the small GTPase, ARF. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs.
pfam GIT_SHD 31aa 1e-07
in ref transcript
Spa2 homology domain (SHD) of GIT. GIT proteins are signaling integrators with GTPase-activating function which may be involved in the organisation of the cytoskeletal matrix assembled at active zones (CAZ). The function of the CAZ might be to define sites of neurotransmitter release. Mutations in the Spa2 homology domain (SHD) domain of GIT1 described here interfere with the association of GIT1 with Piccolo, beta-PIX, and focal adhesion kinase.
pfam GIT_SHD 22aa 7e-04
in ref transcript
COG COG5347 115aa 2e-15
in ref transcript
GTPase-activating protein that regulates ARFs (ADP-ribosylation factors), involved in ARF-mediated vesicular transport [Intracellular trafficking and secretion].
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
smart ArfGap 119aa 4e-39
in ref transcript
Putative GTP-ase activating proteins for the small GTPase, ARF. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs.
pfam GIT_SHD 31aa 1e-07
in ref transcript
Spa2 homology domain (SHD) of GIT. GIT proteins are signaling integrators with GTPase-activating function which may be involved in the organisation of the cytoskeletal matrix assembled at active zones (CAZ). The function of the CAZ might be to define sites of neurotransmitter release. Mutations in the Spa2 homology domain (SHD) domain of GIT1 described here interfere with the association of GIT1 with Piccolo, beta-PIX, and focal adhesion kinase.
pfam GIT_SHD 22aa 7e-04
in ref transcript
COG COG5347 115aa 2e-15
in ref transcript
GTPase-activating protein that regulates ARFs (ADP-ribosylation factors), involved in ARF-mediated vesicular transport [Intracellular trafficking and secretion].
Changed!
pfam GIY-YIG 77aa 3e-10
in ref transcript
GIY-YIG catalytic domain. This domain called GIY-YIG is found in the amino terminal region of excinuclease abc subunit c (uvrC), bacteriophage T4 endonucleases segA, segB, segC, segD and segE; it is also found in putative endonucleases encoded by group I introns of fungi and phage. The structure of I-TevI a GIY-YIG endonuclease, reveals a novel alpha/beta-fold with a central three-stranded antiparallel beta-sheet flanked by three helices. The most conserved and putative catalytic residues are located on a shallow, concave surface and include a metal coordination site.
Changed!
PRK PRK00329 69aa 1e-05
in ref transcript
GIY-YIG nuclease superfamily protein; Validated.
Changed!
pfam GIY-YIG 62aa 2e-07
in modified transcript
Changed!
PRK PRK00329 67aa 3e-05
in modified transcript
GK
refseq_GK.F1 refseq_GK.R1 102 120
NCBIGene 36.3 2710
Single exon skipping, size difference: 18
Exclusion in the protein (no frameshift)
Reference transcript: NM_203391
Changed!
TIGR glycerol_kin 509aa 0.0
in ref transcript
This model describes glycerol kinase, a member of the FGGY family of carbohydrate kinases.
Changed!
PRK glpK 508aa 0.0
in ref transcript
glycerol kinase; Provisional.
Changed!
TIGR glycerol_kin 503aa 0.0
in modified transcript
Changed!
PRK glpK 502aa 0.0
in modified transcript
GLT8D1
refseq_GLT8D1.F1 refseq_GLT8D1.R1 176 405
NCBIGene 36.3 55830
Single exon skipping, size difference: 229
Exclusion in 5'UTR
Reference transcript: NM_001010983
cd GT8_like_1 286aa 2e-89
in ref transcript
GT8_like_1 represents a subfamily of GT8 with unknown function. A subfamily of glycosyltransferase family 8 with unknown function: Glycosyltransferase family 8 comprises enzymes with a number of known activities; lipopolysaccharide galactosyltransferase lipopolysaccharide glucosyltransferase 1, glycogenin glucosyltransferase and inositol 1-alpha-galactosyltransferase. It is classified as a retaining glycosyltransferase, based on the relative anomeric stereochemistry of the substrate and product in the reaction catalyzed.
pfam Glyco_transf_8 274aa 2e-45
in ref transcript
Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosyltransferase, lipopolysaccharide glucosyltransferase 1, and glycogenin glucosyltransferase.
Glutamine synthetase [Amino acid transport and metabolism].
GMEB1
refseq_GMEB1.F1 refseq_GMEB1.R1 141 171
NCBIGene 36.3 10691
Alternative 3-prime, size difference: 30
Exclusion in the protein (no frameshift)
Reference transcript: NM_006582
pfam SAND 81aa 2e-30
in ref transcript
SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation.
GMPPA
refseq_GMPPA.F1 refseq_GMPPA.R1 106 417
NCBIGene 36.3 29926
Alternative 5-prime and 3-prime, size difference: 311
Inclusion in 5'UTR, Exclusion in 5'UTR
Reference transcript: NM_205847
cd M1P_guanylylT_A_like_N 258aa 1e-136
in ref transcript
N-terminal domain of M1P_guanylyl_A_ like proteins are likely to be a isoform of GDP-mannose pyrophosphorylase. N-terminal domain of the M1P-guanylyltransferase A-isoform like proteins: The proteins of this family are likely to be a isoform of GDP-mannose pyrophosphorylase. Their sequences are highly conserved with mannose-1-phosphate guanyltransferase, but generally about 40-60 bases longer. GDP-mannose pyrophosphorylase (GTP: alpha-d-mannose-1-phosphate guanyltransferase) catalyzes the formation of GDP-d-mannose from GTP and alpha-d-mannose-1-Phosphate. It contains an N-terminal catalytic domain that resembles a dinucleotide-binding Rossmann fold and a C-terminal LbH fold domain. GDP-d-mannose is the activated form of mannose for formation of cell wall lipoarabinomannan and various mannose-containing glycolipids and polysaccharides. The function of GDP-mannose pyrophosphorylase is essential for cell wall integrity, morphogenesis and viability. Repression of GDP-mannose pyrophosphorylase in yeast leads to phenotypes including cell lysis, defective cell wall, and failure of polarized growth and cell separation.
cd LbH_M1P_guanylylT_C 67aa 1e-19
in ref transcript
Mannose-1-phosphate guanylyltransferase, C-terminal Left-handed parallel beta helix (LbH) domain: Mannose-1-phosphate guanylyltransferase is also known as GDP-mannose pyrophosphorylase. It catalyzes the synthesis of GDP-mannose from GTP and mannose-1-phosphate, and is involved in the maintenance of cell wall integrity and glycosylation. Similar to ADP-glucose pyrophosphorylase, it contains an N-terminal catalytic domain that resembles a dinucleotide-binding Rossmann fold and a C-terminal LbH fold domain, presumably with 4 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity.
pfam NTP_transferase 189aa 2e-25
in ref transcript
Nucleotidyl transferase. This family includes a wide range of enzymes which transfer nucleotides onto phosphosugars.
TIGR lipid_A_lpxA 53aa 3e-07
in ref transcript
This model describes LpxA, an enzyme for the biosynthesis of lipid A, a component oflipopolysaccharide (LPS) in the outer membrane outer leaflet of most Gram-negative bacteria. Some differences are found between lipid A of different species, but this protein represents the first step (from UDP-N-acetyl-D-glucosamine) and appears to be conserved in function. Proteins from this family contain many copies of the bacterial transferase hexapeptide repeat (pfam00132).
IMPDH: The catalytic domain of the inosine monophosphate dehydrogenase. IMPDH catalyzes the NAD-dependent oxidation of inosine 5'-monophosphate (IMP) to xanthosine 5' monophosphate (XMP). It is a rate-limiting step in the de novo synthesis of the guanine nucleotides. There is often a CBS domain inserted in the middle of this domain, which is proposed to play a regulatory role. IMPDH is a key enzyme in the regulation of cell proliferation and differentiation. It has been identified as an attractive target for developing chemotherapeutic agents.
TIGR GMP_reduct_1 341aa 1e-179
in ref transcript
A deep split separates two families of GMP reductase. This family includes both eukaryotic and some proteobacterial sequences, while the other family contains other bacterial sequences.
Inclusion in the protein (no stop codon or frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_000516
Changed!
cd G-alpha 350aa 1e-138
in ref transcript
G protein alpha subunit. The alpha subunit of G proteins contains the guanine nucleotide binding site. The heterotrimeric GNP-binding proteins are signal transducers that communicate signals from many hormones, neurotransmitters, chemokines, and autocrine and paracrine factors. Extracellular signals are received by receptors, which activate the G proteins, which in turn route the signals to several distinct intracellular signaling pathways. The alpha subunit of G proteins is a weak GTPase. In the resting state, heterotrimeric G proteins are associated at the cytosolic face of the plasma membrane and the alpha subunit binds to GDP. Upon activation by a receptor GDP is replaced with GTP, and the G-alpha/GTP complex dissociates from the beta and gamma subunits. This results in activation of downstream signaling pathways, such as cAMP synthesis by adenylyl cyclase, which is terminated when GTP is hydrolized and the heterotrimers reconstitute.
Changed!
pfam G-alpha 372aa 1e-143
in ref transcript
G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase.
PTZ PTZ00133 80aa 1e-04
in ref transcript
ADP-ribosylation factor; Provisional.
Changed!
cd G-alpha 336aa 1e-139
in modified transcript
Changed!
pfam G-alpha 358aa 1e-143
in modified transcript
GNLY
refseq_GNLY.F1 refseq_GNLY.R1 112 356
NCBIGene 36.3 10578
Single exon skipping, size difference: 244
Inclusion in the protein causing a new stop codon
Reference transcript: NM_006433
Changed!
smart SapB 71aa 4e-08
in ref transcript
Saposin (B) Domains. Present in multiple copies in prosaposin and in pulmonary surfactant-associated protein B. In plant aspartic proteinases, a saposin domain is circularly permuted. This causes the prediction algorithm to predict two such domains, where only one is truly present.
GNRHR
refseq_GNRHR.F1 refseq_GNRHR.R1 229 357
NCBIGene 36.3 2798
Alternative 3-prime, size difference: 128
Exclusion in the protein causing a frameshift
Reference transcript: NM_000406
Changed!
pfam 7tm_1 218aa 2e-22
in ref transcript
7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
Changed!
pfam 7tm_1 70aa 3e-09
in modified transcript
GOPC
refseq_GOPC.F1 refseq_GOPC.R1 136 160
NCBIGene 36.3 57120
Single exon skipping, size difference: 24
Exclusion in the protein (no frameshift)
Reference transcript: NM_020399
cd PDZ_signaling 83aa 2e-17
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
smart PDZ 84aa 8e-16
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
COG COG3975 207aa 0.001
in ref transcript
Predicted protease with the C-terminal PDZ domain [General function prediction only].
Changed!
PRK PRK09039 178aa 0.006
in ref transcript
hypothetical protein; Validated.
Changed!
PRK PRK09039 170aa 2e-04
in modified transcript
GPATCH4
refseq_GPATC4.F2 refseq_GPATC4.R2 118 192
NCBIGene 36.3 54865
Alternative 3-prime, size difference: 74
Inclusion in the protein causing a new stop codon
Reference transcript: NM_015590
Changed!
pfam G-patch 44aa 7e-06
in ref transcript
G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines.
GPHN
refseq_GPHN.F2 refseq_GPHN.R2 296 395
NCBIGene 36.3 10243
Single exon skipping, size difference: 99
Exclusion in the protein (no frameshift)
Reference transcript: NM_020806
cd MoeA 407aa 1e-118
in ref transcript
MoeA family. Members of this family are involved in biosynthesis of the molybdenum cofactor (MoCF), an essential cofactor of a diverse group of redox enzymes. MoCF biosynthesis is an evolutionarily conserved pathway present in eubacteria, archaea and eukaryotes. MoCF contains a tricyclic pyranopterin, termed molybdopterin (MPT). MoeA, together with MoaB, is responsible for the metal incorporation into MPT, the third step in MoCF biosynthesis. The plant homolog Cnx1 is a MoeA-MogA fusion protein. The mammalian homolog gephyrin is a MogA-MoeA fusion protein, that plays a critical role in postsynaptic anchoring of inhibitory glycine receptors and major GABAa receptor subtypes.
cd MogA_MoaB 155aa 5e-51
in ref transcript
MogA_MoaB family. Members of this family are involved in biosynthesis of the molybdenum cofactor (MoCF) an essential cofactor of a diverse group of redox enzymes. MoCF biosynthesis is an evolutionarily conserved pathway present in eubacteria, archaea, and eukaryotes. MoCF contains a tricyclic pyranopterin, termed molybdopterin (MPT). MogA, together with MoeA, is responsible for the metal incorporation into MPT, the third step in MoCF biosynthesis. The plant homolog Cnx1 is a MoeA-MogA fusion protein. The mammalian homolog gephyrin is a MogA-MoeA fusion protein, that plays a critical role in postsynaptic anchoring of inhibitory glycine receptors and major GABAa receptor subtypes. In contrast, MoaB shows high similarity to MogA, but little is known about its physiological role. All well studied members of this family form highly stable trimers.
pfam MoeA_N 167aa 2e-51
in ref transcript
MoeA N-terminal region (domain I and II). This family contains two structural domains. One of these contains the conserved DGXA motif. This region is found in proteins involved in biosynthesis of molybdopterin cofactor however the exact molecular function of this region is uncertain.
TIGR molyb_syn 146aa 1e-33
in ref transcript
The Drosophila protein cinnamon, the Arabidopsis protein cnx1, and rat protein gephyrin each have one domain like MoeA and one like MoaB and Mog. These domains are, however, distantly related to each other, as captured by this HMM. Gephyrin is unusual in that it seems to be a tubulin-binding neuroprotein involved in the clustering of both blycine receptors and GABA receptors, rather than a protein of molybdenum cofactor biosynthesis.
TIGR molyb_syn 143aa 1e-30
in ref transcript
pfam MoeA_C 77aa 1e-13
in ref transcript
MoeA C-terminal region (domain IV). This domain is found in proteins involved in biosynthesis of molybdopterin cofactor however the exact molecular function of this domain is uncertain. The structure of this domain is known and forms an incomplete beta barrel.
bifunctional molybdenum cofactor biosynthesis protein C/molybdopterin-binding protein; Provisional.
GPM6A
refseq_GPM6A.F2 refseq_GPM6A.R2 333 392
NCBIGene 36.3 2823
Single exon skipping, size difference: 59
Exclusion of the protein initiation site
Reference transcript: NM_005277
Changed!
pfam Myelin_PLP 242aa 3e-89
in ref transcript
Myelin proteolipid protein (PLP or lipophilin).
Changed!
pfam Myelin_PLP 242aa 2e-87
in modified transcript
GPM6B
refseq_GPM6B.F1 refseq_GPM6B.R1 220 340
NCBIGene 36.3 2824
Single exon skipping, size difference: 120
Exclusion in the protein (no frameshift)
Reference transcript: NM_001001995
Changed!
pfam Myelin_PLP 245aa 1e-127
in ref transcript
Myelin proteolipid protein (PLP or lipophilin).
Changed!
pfam Myelin_PLP 245aa 1e-123
in modified transcript
GPNMB
refseq_GPNMB.F1 refseq_GPNMB.R1 181 217
NCBIGene 36.3 10457
Alternative 3-prime, size difference: 36
Exclusion in the protein (no frameshift)
Reference transcript: NM_001005340
cd PKD 64aa 2e-05
in ref transcript
polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
smart PKD 58aa 6e-06
in ref transcript
Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
TIGR PCC 114aa 0.004
in ref transcript
Note: this model is restricted to the amino half because a full-length model is incompatible with the HMM software package.
GPR108
refseq_GPR108.F1 refseq_GPR108.R1 257 399
NCBIGene 36.2 56927
Alternative 3-prime, size difference: 142
Inclusion in the protein causing a new stop codon
Reference transcript: XM_933929
pfam Lung_7-TM_R 289aa 5e-85
in ref transcript
Lung seven transmembrane receptor. This family represents a conserved region with eukaryotic lung seven transmembrane receptors and related proteins.
GPR126
refseq_GPR126.F2 refseq_GPR126.R2 148 232
NCBIGene 36.3 57211
Single exon skipping, size difference: 84
Exclusion in the protein (no frameshift)
Reference transcript: NM_198569
cd CUB 104aa 8e-25
in ref transcript
CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
cd PTX 185aa 1e-07
in ref transcript
Pentraxins are plasma proteins characterized by their pentameric discoid assembly and their Ca2+ dependent ligand binding, such as Serum amyloid P component (SAP) and C-reactive Protein (CRP), which are cytokine-inducible acute-phase proteins implicated in innate immunity. CRP binds to ligands containing phosphocholine, SAP binds to amyloid fibrils, DNA, chromatin, fibronectin, C4-binding proteins and glycosaminoglycans. "Long" pentraxins have N-terminal extensions to the common pentraxin domain; one group, the neuronal pentraxins, may be involved in synapse formation and remodeling, and they may also be able to form heteromultimers.
pfam 7tm_2 259aa 9e-32
in ref transcript
7 transmembrane receptor (Secretin family). This family is known as Family B, the secretin-receptor family or family 2 of the G-protein-coupled receptors (GCPRs).They have been described in many animal species, but not in plants, fungi or prokaryotes. Three distinct sub-families are recognised. Subfamily B1 contains classical hormone receptors, such as receptors for secretin and glucagon, that are all involved in cAMP-mediated signalling pathways. Subfamily B2 contains receptors with long extracellular N-termini, such as the leukocyte cell-surface antigen CD97; calcium-independent receptors for latrotoxin, and brain-specific angiogenesis inhibitors amongst others. Subfamily B3 includes Methuselah and other Drosophila proteins. Other than the typical seven-transmembrane region, characteristic structural features include an amino-terminal extracellular domain involved in ligand binding, and an intracellular loop (IC3) required for specific G-protein coupling.
smart CUB 97aa 9e-26
in ref transcript
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein. This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.
pfam GPS 49aa 3e-10
in ref transcript
Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin.
smart PTX 185aa 3e-08
in ref transcript
Pentraxin / C-reactive protein / pentaxin family. This family form a doscoid pentameric structure. Human serum amyloid P demonstrates calcium-mediated ligand-binding.
GPR126
refseq_GPR126.F3 refseq_GPR126.R3 169 215
NCBIGene 36.3 57211
Single exon skipping, size difference: 46
Inclusion in the protein causing a frameshift
Reference transcript: NM_198569
cd CUB 104aa 8e-25
in ref transcript
CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
cd PTX 185aa 1e-07
in ref transcript
Pentraxins are plasma proteins characterized by their pentameric discoid assembly and their Ca2+ dependent ligand binding, such as Serum amyloid P component (SAP) and C-reactive Protein (CRP), which are cytokine-inducible acute-phase proteins implicated in innate immunity. CRP binds to ligands containing phosphocholine, SAP binds to amyloid fibrils, DNA, chromatin, fibronectin, C4-binding proteins and glycosaminoglycans. "Long" pentraxins have N-terminal extensions to the common pentraxin domain; one group, the neuronal pentraxins, may be involved in synapse formation and remodeling, and they may also be able to form heteromultimers.
pfam 7tm_2 259aa 9e-32
in ref transcript
7 transmembrane receptor (Secretin family). This family is known as Family B, the secretin-receptor family or family 2 of the G-protein-coupled receptors (GCPRs).They have been described in many animal species, but not in plants, fungi or prokaryotes. Three distinct sub-families are recognised. Subfamily B1 contains classical hormone receptors, such as receptors for secretin and glucagon, that are all involved in cAMP-mediated signalling pathways. Subfamily B2 contains receptors with long extracellular N-termini, such as the leukocyte cell-surface antigen CD97; calcium-independent receptors for latrotoxin, and brain-specific angiogenesis inhibitors amongst others. Subfamily B3 includes Methuselah and other Drosophila proteins. Other than the typical seven-transmembrane region, characteristic structural features include an amino-terminal extracellular domain involved in ligand binding, and an intracellular loop (IC3) required for specific G-protein coupling.
smart CUB 97aa 9e-26
in ref transcript
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein. This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.
pfam GPS 49aa 3e-10
in ref transcript
Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin.
smart PTX 185aa 3e-08
in ref transcript
Pentraxin / C-reactive protein / pentaxin family. This family form a doscoid pentameric structure. Human serum amyloid P demonstrates calcium-mediated ligand-binding.
GPR155
refseq_GPR155.F2 refseq_GPR155.R2 223 372
NCBIGene 36.3 151556
Single exon skipping, size difference: 149
Exclusion in 5'UTR
Reference transcript: NM_001033045
cd DEP_GPR155 83aa 4e-41
in ref transcript
DEP (Dishevelled, Egl-10, and Pleckstrin) domain found in GPR155-like proteins. GRP155-like proteins, also known as PGR22, contain an N-terminal permease domain, a central transmembrane region and a C-terminal DEP domain. They are orphan receptors of the class B G protein-coupled receptors. Their function is unknown.
TIGR 2a69 329aa 2e-17
in ref transcript
smart DEP 68aa 6e-10
in ref transcript
Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signalling proteins that contain PH, rasGEF, rhoGEF, rhoGAP, RGS, PDZ domains. DEP domain in Drosophila dishevelled is essential to rescue planar polarity defects and induce JNK signalling (Cell 94, 109-118).
COG COG0679 329aa 4e-19
in ref transcript
Predicted permeases [General function prediction only].
GPER
refseq_GPR30.F2 refseq_GPR30.R2 276 400
NCBIGene 36.3 2852
Single exon skipping, size difference: 124
Exclusion in 5'UTR
Reference transcript: NM_001039966
pfam 7tm_1 243aa 1e-34
in ref transcript
7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
GPR56
refseq_GPR56.F1 refseq_GPR56.R1 275 397
NCBIGene 36.3 9289
Single exon skipping, size difference: 122
Exclusion in 5'UTR
Reference transcript: NM_201524
pfam 7tm_2 250aa 3e-25
in ref transcript
7 transmembrane receptor (Secretin family). This family is known as Family B, the secretin-receptor family or family 2 of the G-protein-coupled receptors (GCPRs).They have been described in many animal species, but not in plants, fungi or prokaryotes. Three distinct sub-families are recognised. Subfamily B1 contains classical hormone receptors, such as receptors for secretin and glucagon, that are all involved in cAMP-mediated signalling pathways. Subfamily B2 contains receptors with long extracellular N-termini, such as the leukocyte cell-surface antigen CD97; calcium-independent receptors for latrotoxin, and brain-specific angiogenesis inhibitors amongst others. Subfamily B3 includes Methuselah and other Drosophila proteins. Other than the typical seven-transmembrane region, characteristic structural features include an amino-terminal extracellular domain involved in ligand binding, and an intracellular loop (IC3) required for specific G-protein coupling.
pfam GPS 47aa 3e-09
in ref transcript
Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin.
GPR56
refseq_GPR56.F3 refseq_GPR56.R3 100 118
NCBIGene 36.3 9289
Alternative 3-prime, size difference: 18
Exclusion in the protein (no frameshift)
Reference transcript: NM_005682
Changed!
pfam 7tm_2 256aa 1e-24
in ref transcript
7 transmembrane receptor (Secretin family). This family is known as Family B, the secretin-receptor family or family 2 of the G-protein-coupled receptors (GCPRs).They have been described in many animal species, but not in plants, fungi or prokaryotes. Three distinct sub-families are recognised. Subfamily B1 contains classical hormone receptors, such as receptors for secretin and glucagon, that are all involved in cAMP-mediated signalling pathways. Subfamily B2 contains receptors with long extracellular N-termini, such as the leukocyte cell-surface antigen CD97; calcium-independent receptors for latrotoxin, and brain-specific angiogenesis inhibitors amongst others. Subfamily B3 includes Methuselah and other Drosophila proteins. Other than the typical seven-transmembrane region, characteristic structural features include an amino-terminal extracellular domain involved in ligand binding, and an intracellular loop (IC3) required for specific G-protein coupling.
pfam GPS 47aa 3e-09
in ref transcript
Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin.
Changed!
pfam 7tm_2 250aa 3e-25
in modified transcript
GPRASP2
refseq_GPRASP2.F1 refseq_GPRASP2.R1 128 216
NCBIGene 36.3 114928
Single exon skipping, size difference: 88
Exclusion in 5'UTR
Reference transcript: NM_001004051
pfam DUF634 245aa 1e-53
in ref transcript
Protein of unknown function (DUF634). Mammalian protein of unknown function.
PRK PRK12323 129aa 0.008
in ref transcript
DNA polymerase III subunits gamma and tau; Provisional.
GPX4
refseq_GPX4.F2 refseq_GPX4.R2 102 124
NCBIGene 36.3 2879
Alternative 3-prime, size difference: 22
Inclusion in 3'UTR
Reference transcript: NM_001039848
cd GSH_Peroxidase 33aa 2e-08
in ref transcript
Glutathione (GSH) peroxidase family; tetrameric selenoenzymes that catalyze the reduction of a variety of hydroperoxides including lipid peroxidases, using GSH as a specific electron donor substrate. GSH peroxidase contains one selenocysteine residue per subunit, which is involved in catalysis. Different isoenzymes are known in mammals,which are involved in protection against reactive oxygen species, redox regulation of many metabolic processes, peroxinitrite scavenging, and modulation of inflammatory processes.
pfam GSHPx 32aa 1e-10
in ref transcript
Glutathione peroxidase.
COG BtuE 33aa 2e-05
in ref transcript
Glutathione peroxidase [Posttranslational modification, protein turnover, chaperones].
GPX5
refseq_GPX5.F2 refseq_GPX5.R2 128 246
NCBIGene 36.3 2880
Single exon skipping, size difference: 118
Exclusion in the protein causing a frameshift
Reference transcript: NM_001509
Changed!
cd GSH_Peroxidase 173aa 5e-45
in ref transcript
Glutathione (GSH) peroxidase family; tetrameric selenoenzymes that catalyze the reduction of a variety of hydroperoxides including lipid peroxidases, using GSH as a specific electron donor substrate. GSH peroxidase contains one selenocysteine residue per subunit, which is involved in catalysis. Different isoenzymes are known in mammals,which are involved in protection against reactive oxygen species, redox regulation of many metabolic processes, peroxinitrite scavenging, and modulation of inflammatory processes.
Changed!
pfam GSHPx 114aa 4e-42
in ref transcript
Glutathione peroxidase.
Changed!
COG BtuE 179aa 2e-33
in ref transcript
Glutathione peroxidase [Posttranslational modification, protein turnover, chaperones].
Changed!
cd GSH_Peroxidase 52aa 9e-09
in modified transcript
Changed!
pfam GSHPx 51aa 4e-11
in modified transcript
Changed!
COG BtuE 52aa 2e-07
in modified transcript
GRB10
refseq_GRB10.F1 refseq_GRB10.R1 142 280
NCBIGene 36.3 2887
Single exon skipping, size difference: 138
Exclusion in the protein (no frameshift)
Reference transcript: NM_005311
cd GRB7_RA 83aa 2e-32
in ref transcript
Grb7_RA The RA (RAS-associated like) domain of Grb7. Grb7 is an adaptor molecule that mediates signal transduction from multiple cell surface receptors to various downstream signaling pathways. Grb7 and its related family members Grb10 and Grb14 share a conserved domain architecture that includes an amino-terminal proline-rich region, a central segment termed the GM region (for Grb and Mig) which includes the RA, PIR, and PH domains, and a carboxyl-terminal SH2 domain. Grb7/10/14 family proteins are phosphorylated on serine/threonine as well as tyrosine residues and are mainly localized to the cytoplasm.
Changed!
cd PH_Apbb1ip 113aa 8e-20
in ref transcript
Apbb1ip (Amyloid beta (A4) Precursor protein-Binding, family B, member 1 Interacting Protein) pleckstrin homology (PH) domain. Apbb1ip consists of a Ras-associated domain and a PH domain. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
cd SH2 97aa 9e-18
in ref transcript
Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
smart SH2 90aa 2e-21
in ref transcript
Src homology 2 domains. Src homology 2 domains bind phosphotyrosine-containing polypeptides via 2 surface pockets. Specificity is provided via interaction with residues that are distinct from the phosphotyrosine. Only a single occurrence of a SH2 domain has been found in S. cerevisiae.
pfam BPS 48aa 5e-19
in ref transcript
BPS (Between PH and SH2). The BPS (Between PH and SH2) domain, comprised of 2 beta strands and a C-terminal helix, is an approximately 45 residue region found in the adaptor proteins Grb7/10/14 that mediates inhibition of the tyrosine kinase domain of the insulin receptor by binding of the N-terminal portion of the BPS domain to the substrate peptide groove of the kinase, acting as a pseudosubstrate inhibitor.
smart RA 84aa 1e-13
in ref transcript
Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS); putative RasGTP effectors in other cases. Kalhammer et al. have shown that not all RA domains bind RasGTP. Predicted structure similar to that determined, and that of the RasGTP-binding domain of Raf kinase. Predicted RA domains in PLC210 and nore1 found to bind RasGTP. Included outliers (Grb7, Grb14, adenylyl cyclases etc.).
Changed!
smart PH 109aa 6e-09
in ref transcript
Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
Changed!
cd PH_Apbb1ip 74aa 8e-08
in modified transcript
GRB2
refseq_GRB2.F2 refseq_GRB2.R2 240 363
NCBIGene 36.3 2885
Single exon skipping, size difference: 123
Exclusion in the protein (no frameshift)
Reference transcript: NM_002086
Changed!
cd SH2 91aa 2e-21
in ref transcript
Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
cd SH3 53aa 1e-13
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
cd SH3 52aa 3e-11
in ref transcript
Changed!
pfam SH2 76aa 9e-27
in ref transcript
SH2 domain.
smart SH3 57aa 4e-16
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
pfam SH3_1 56aa 1e-13
in ref transcript
SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organisation. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel.
Changed!
cd SH2 49aa 6e-04
in modified transcript
Changed!
pfam SH2 35aa 1e-05
in modified transcript
GRHL1
refseq_GRHL1.F1 refseq_GRHL1.R1 211 288
NCBIGene 36.3 29841
Single exon skipping, size difference: 77
Exclusion in the protein causing a frameshift
Reference transcript: NM_014552
Changed!
pfam CP2 229aa 1e-108
in ref transcript
CP2 transcription factor. This family represents a conserved region in the CP2 transcription factor family.
GRHL3
refseq_GRHL3.F1 refseq_GRHL3.R1 107 294
NCBIGene 36.3 57822
Single exon skipping, size difference: 187
Exclusion in the protein causing a frameshift
Reference transcript: NM_021180
Changed!
pfam CP2 227aa 2e-96
in ref transcript
CP2 transcription factor. This family represents a conserved region in the CP2 transcription factor family.
GRIK1
refseq_GRIK1.F1 refseq_GRIK1.R1 140 185
NCBIGene 36.3 2897
Single exon skipping, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_000830
Changed!
cd PBP1_iGluR_Kainate_GluR5_7 396aa 0.0
in ref transcript
N-terminal leucine/isoleucine/valine-binding protein (LIVBP)-like domain of the GluR5-7 subunits of Kainate receptor. While this N-terminal domain belongs to the periplasmic-binding fold type I superfamily, the glutamate-binding domain of the iGluR is structurally homologous to the periplasmic-binding fold type II. The LIVBP-like domain of iGluRs is thought to play a role in the initial assembly of iGluR subunits, but it is not well understood how this domain is arranged and functions in intact iGluR. There are five types of kainate receptors, GluR5, GluR6, GluR7, KA1, and KA2, which are structurally similar to AMPA and NMDA subunits of ionotropic glutamate receptors. KA1 and KA2 subunits can only form functional receptors with one of the GluR5-7 subunits. Moreover, GluR5-7 can also form functional homomeric receptor channels activated by kainate and glutamate when expressed in heterologous systems. Kainate receptors are involved in excitatory neurotransmission by activating postsynaptic receptors and in inhibitory neurotransmission by modulating release of the inhibitory neurotransmitter GABA through a presynaptic mechanism. Kainate receptors are closely related to AMAP receptors. In contrast of AMPA receptors, kainate receptors play only a minor role in signaling at synapses and their function is not well defined.
cd PBPb 117aa 1e-10
in ref transcript
Bacterial periplasmic transport systems use membrane-bound complexes and substrate-bound, membrane-associated, periplasmic binding proteins (PBPs) to transport a wide variety of substrates, such as, amino acids, peptides, sugars, vitamins and inorganic ions. PBPs have two cell-membrane translocation functions: bind substrate, and interact with the membrane bound complex. A diverse group of periplasmic transport receptors for lysine/arginine/ornithine (LAO), glutamine, histidine, sulfate, phosphate, molybdate, and methanol are included in the PBPb CD.
pfam Lig_chan 282aa 1e-105
in ref transcript
Ligand-gated ion channel. This family includes the four transmembrane regions of the ionotropic glutamate receptors and NMDA receptors.
pfam ANF_receptor 341aa 8e-51
in ref transcript
Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure.
pfam Lig_chan-Glu_bd 65aa 9e-27
in ref transcript
Ligated ion channel L-glutamate- and glycine-binding site. This region, sometimes called the S1 domain, is the luminal domain just upstream of the first, M1, transmembrane region of transmembrane ion-channel proteins, and it binds L-glutamate and glycine. It is found in association with Lig_chan, pfam00060.
COG HisJ 91aa 2e-05
in ref transcript
ABC-type amino acid transport/signal transduction systems, periplasmic component/domain [Amino acid transport and metabolism / Signal transduction mechanisms].
Changed!
COG LivK 300aa 3e-04
in ref transcript
ABC-type branched-chain amino acid transport systems, periplasmic component [Amino acid transport and metabolism].
Changed!
cd PBP1_iGluR_Kainate_GluR5_7 381aa 0.0
in modified transcript
Changed!
COG LivK 307aa 1e-04
in modified transcript
GRIK2
refseq_GRIK2.F1 refseq_GRIK2.R1 315 402
NCBIGene 36.3 2898
Single exon skipping, size difference: 87
Inclusion in the protein causing a new stop codon
Reference transcript: NM_021956
cd PBP1_iGluR_Kainate_GluR5_7 381aa 0.0
in ref transcript
N-terminal leucine/isoleucine/valine-binding protein (LIVBP)-like domain of the GluR5-7 subunits of Kainate receptor. While this N-terminal domain belongs to the periplasmic-binding fold type I superfamily, the glutamate-binding domain of the iGluR is structurally homologous to the periplasmic-binding fold type II. The LIVBP-like domain of iGluRs is thought to play a role in the initial assembly of iGluR subunits, but it is not well understood how this domain is arranged and functions in intact iGluR. There are five types of kainate receptors, GluR5, GluR6, GluR7, KA1, and KA2, which are structurally similar to AMPA and NMDA subunits of ionotropic glutamate receptors. KA1 and KA2 subunits can only form functional receptors with one of the GluR5-7 subunits. Moreover, GluR5-7 can also form functional homomeric receptor channels activated by kainate and glutamate when expressed in heterologous systems. Kainate receptors are involved in excitatory neurotransmission by activating postsynaptic receptors and in inhibitory neurotransmission by modulating release of the inhibitory neurotransmitter GABA through a presynaptic mechanism. Kainate receptors are closely related to AMAP receptors. In contrast of AMPA receptors, kainate receptors play only a minor role in signaling at synapses and their function is not well defined.
cd PBPb 118aa 8e-11
in ref transcript
Bacterial periplasmic transport systems use membrane-bound complexes and substrate-bound, membrane-associated, periplasmic binding proteins (PBPs) to transport a wide variety of substrates, such as, amino acids, peptides, sugars, vitamins and inorganic ions. PBPs have two cell-membrane translocation functions: bind substrate, and interact with the membrane bound complex. A diverse group of periplasmic transport receptors for lysine/arginine/ornithine (LAO), glutamine, histidine, sulfate, phosphate, molybdate, and methanol are included in the PBPb CD.
pfam Lig_chan 282aa 1e-106
in ref transcript
Ligand-gated ion channel. This family includes the four transmembrane regions of the ionotropic glutamate receptors and NMDA receptors.
pfam ANF_receptor 344aa 2e-55
in ref transcript
Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure.
pfam Lig_chan-Glu_bd 66aa 1e-26
in ref transcript
Ligated ion channel L-glutamate- and glycine-binding site. This region, sometimes called the S1 domain, is the luminal domain just upstream of the first, M1, transmembrane region of transmembrane ion-channel proteins, and it binds L-glutamate and glycine. It is found in association with Lig_chan, pfam00060.
COG LivK 235aa 9e-08
in ref transcript
ABC-type branched-chain amino acid transport systems, periplasmic component [Amino acid transport and metabolism].
COG HisJ 121aa 4e-05
in ref transcript
ABC-type amino acid transport/signal transduction systems, periplasmic component/domain [Amino acid transport and metabolism / Signal transduction mechanisms].
GRIN1
refseq_GRIN1.F2 refseq_GRIN1.R2 190 301
NCBIGene 36.3 2902
Single exon skipping, size difference: 111
Exclusion in the protein (no frameshift)
Reference transcript: NM_007327
cd PBP1_iGluR_NMDA_NR1 376aa 0.0
in ref transcript
N-terminal leucine/isoleucine/valine-binding protein (LIVBP)-like domain of the NR1, an essential channel-forming subunit of the NMDA receptor. The ionotropic N-methyl-d-asparate (NMDA) subtype of glutamate receptor serves critical functions in neuronal development, functioning, and degeneration in the mammalian central nervous system. The functional NMDA receptor is a heterotetramer ccomposed of two NR1 and two NR2 (A, B, C, and D) or of NR3 (A and B) subunits. The receptor controls a cation channel that is highly permeable to monovalent ions and calcium and exhibits voltage-dependent inhibition by magnesium. Dual agonists, glutamate and glycine, are required for efficient activation of the NMDA receptor. When co-expressed with NR1, the NR3 subunits form receptors that are activated by glycine alone and therefore can be classified as excitatory glycine receptors. NR1/NR3 receptors are calcium-impermeable and unaffected by ligands acting at the NR2 glutamate-binding site.
cd PBPb 96aa 2e-09
in ref transcript
Bacterial periplasmic transport systems use membrane-bound complexes and substrate-bound, membrane-associated, periplasmic binding proteins (PBPs) to transport a wide variety of substrates, such as, amino acids, peptides, sugars, vitamins and inorganic ions. PBPs have two cell-membrane translocation functions: bind substrate, and interact with the membrane bound complex. A diverse group of periplasmic transport receptors for lysine/arginine/ornithine (LAO), glutamine, histidine, sulfate, phosphate, molybdate, and methanol are included in the PBPb CD.
cd PBPb 111aa 6e-04
in ref transcript
pfam Lig_chan 276aa 7e-79
in ref transcript
Ligand-gated ion channel. This family includes the four transmembrane regions of the ionotropic glutamate receptors and NMDA receptors.
pfam ANF_receptor 316aa 5e-28
in ref transcript
Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure.
pfam Lig_chan-Glu_bd 58aa 6e-16
in ref transcript
Ligated ion channel L-glutamate- and glycine-binding site. This region, sometimes called the S1 domain, is the luminal domain just upstream of the first, M1, transmembrane region of transmembrane ion-channel proteins, and it binds L-glutamate and glycine. It is found in association with Lig_chan, pfam00060.
pfam CaM_bdg_C0 29aa 9e-08
in ref transcript
Calmodulin-binding domain C0 of NMDA receptor NR1 subunit. This is a very short highly conserved domain that is C-terminal to the cytosolic transmembrane region IV of the NMDA-receptor 1. It has been shown to bind Calmodulin-Calcium with high affinity. The ionotropic N-methyl-D-aspartate receptor (NMDAR) is a major source of calcium flux into neurons in the brain and plays a critical role in learning, memory, neural development, and synaptic plasticity. Calmodulin (CaM) regulates NMDARs by binding tightly to the C0 and C1 regions of their NR1 subunit. The conserved tryptophan is considered to be the anchor residue.
COG LivK 320aa 1e-10
in ref transcript
ABC-type branched-chain amino acid transport systems, periplasmic component [Amino acid transport and metabolism].
COG HisJ 89aa 1e-05
in ref transcript
ABC-type amino acid transport/signal transduction systems, periplasmic component/domain [Amino acid transport and metabolism / Signal transduction mechanisms].
COG HisJ 82aa 0.002
in ref transcript
GRINA
refseq_GRINA.F2 refseq_GRINA.R2 164 276
NCBIGene 36.3 2907
Alternative 5-prime, size difference: 112
Exclusion in 5'UTR
Reference transcript: NM_000837
cd BI-1-like 162aa 6e-40
in ref transcript
BAX inhibitor (BI)-1 like protein family. Mammalian members of this family of small transmembrane proteins have been shown to have an antiapoptotic effect either by stimulating the antiapoptotic function of Bcl-2, a well characterized oncogene, or inhibiting the proapoptotic effect of Bax, another member of the Bcl-2 family. Their broad tissue distribution and high degree of conservation suggests an important regulatory role. In plants, BI-1 like proteins play a role in pathogen resistance. A prokaryotic member, E.coli YccA, has been shown to interact with ATP-dependent protease FtsH, which degrades abnormal membrane proteins as part of a quality control mechanism to keep the integrity of biological membranes.
pfam UPF0005 167aa 8e-28
in ref transcript
Uncharacterised protein family UPF0005. The Pfam entry finds members not in the Prosite definition.
COG COG0670 114aa 2e-10
in ref transcript
Integral membrane protein, interacts with FtsH [General function prediction only].
GRINL1A
refseq_GRINL1A.F1 refseq_GRINL1A.R1 198 384
NCBIGene 36.3 81488
Alternative 5-prime, size difference: 186
Inclusion in the protein causing a new stop codon
Reference transcript: NM_015532
GRIP1
refseq_GRIP1.F1 refseq_GRIP1.R1 180 225
NCBIGene 36.2 23426
Single exon skipping, size difference: 45
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: XM_001133924
cd PDZ_signaling 84aa 2e-14
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
cd PDZ_signaling 82aa 1e-13
in ref transcript
cd PDZ_signaling 85aa 8e-12
in ref transcript
cd PDZ_signaling 81aa 2e-11
in ref transcript
cd PDZ_signaling 71aa 1e-10
in ref transcript
cd PDZ_signaling 84aa 3e-10
in ref transcript
cd PDZ_signaling 81aa 2e-09
in ref transcript
smart PDZ 89aa 2e-16
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
Exon skipping and alternative 3-prime or 5-prime, size difference: 153
Inclusion in the protein (no stop codon or frameshift), Exclusion in the protein (no frameshift)
Reference transcript: XM_001133924
cd PDZ_signaling 84aa 2e-14
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
cd PDZ_signaling 82aa 1e-13
in ref transcript
cd PDZ_signaling 85aa 8e-12
in ref transcript
cd PDZ_signaling 81aa 2e-11
in ref transcript
cd PDZ_signaling 71aa 1e-10
in ref transcript
cd PDZ_signaling 84aa 3e-10
in ref transcript
cd PDZ_signaling 81aa 2e-09
in ref transcript
smart PDZ 89aa 2e-16
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
Exon skipping and alternative 3-prime or 5-prime, size difference: 159
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_020137
GRIPAP1
refseq_GRIPAP1.F3 refseq_GRIPAP1.R3 102 180
NCBIGene 36.3 56850
Single exon skipping, size difference: 78
Exclusion in the protein (no frameshift)
Reference transcript: NM_020137
GRK4
refseq_GRK4.F1 refseq_GRK4.R1 234 372
NCBIGene 36.3 2868
Single exon skipping, size difference: 138
Exclusion in the protein (no frameshift)
Reference transcript: NM_182982
cd STKc_GRK4 285aa 1e-144
in ref transcript
STKc_GRK4: Serine/Threonine Kinases (STKs), G protein-coupled Receptor Kinase (GRK) subfamily, GRK4 isoform, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The GRK subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. GRKs phosphorylate and regulate G protein-coupled receptors (GPCRs), the largest superfamily of cell surface receptors which regulate some part of nearly all physiological functions. Phosphorylated GPCRs bind to arrestins, which prevents further G protein signaling despite the presence of activating ligand. There are seven types of GRKs, named GRK1 to GRK7. GRK4 has a limited tissue distribution. It is mainly found in the testis, but is also present in the cerebellum and kidney. It is expressed as multiple splice variants with different domain architectures. It is post-translationally palmitoylated and localized in the membrane. GRK4 polymorphisms are associated with hypertension and salt sensitivity, as they cause hyperphosphorylation, desensitization, and internalization of the dopamine 1 (D1) receptor while increasing the expression of the angiotensin II type 1 receptor. GRK4 plays a crucial role in the D1 receptor regulation of sodium excretion and blood pressure.
smart S_TKc 253aa 6e-60
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
smart RGS 120aa 2e-20
in ref transcript
Regulator of G protein signalling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits.
PTZ PTZ00263 273aa 4e-40
in ref transcript
protein kinase A catalytic subunit; Provisional.
GRK4
refseq_GRK4.F3 refseq_GRK4.R3 276 372
NCBIGene 36.3 2868
Single exon skipping, size difference: 96
Exclusion in the protein (no frameshift)
Reference transcript: NM_182982
cd STKc_GRK4 285aa 1e-144
in ref transcript
STKc_GRK4: Serine/Threonine Kinases (STKs), G protein-coupled Receptor Kinase (GRK) subfamily, GRK4 isoform, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The GRK subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. GRKs phosphorylate and regulate G protein-coupled receptors (GPCRs), the largest superfamily of cell surface receptors which regulate some part of nearly all physiological functions. Phosphorylated GPCRs bind to arrestins, which prevents further G protein signaling despite the presence of activating ligand. There are seven types of GRKs, named GRK1 to GRK7. GRK4 has a limited tissue distribution. It is mainly found in the testis, but is also present in the cerebellum and kidney. It is expressed as multiple splice variants with different domain architectures. It is post-translationally palmitoylated and localized in the membrane. GRK4 polymorphisms are associated with hypertension and salt sensitivity, as they cause hyperphosphorylation, desensitization, and internalization of the dopamine 1 (D1) receptor while increasing the expression of the angiotensin II type 1 receptor. GRK4 plays a crucial role in the D1 receptor regulation of sodium excretion and blood pressure.
smart S_TKc 253aa 6e-60
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
smart RGS 120aa 2e-20
in ref transcript
Regulator of G protein signalling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits.
PTZ PTZ00263 273aa 4e-40
in ref transcript
protein kinase A catalytic subunit; Provisional.
GRM7
refseq_GRM7.F2 refseq_GRM7.R2 256 348
NCBIGene 36.3 2917
Single exon skipping, size difference: 92
Exclusion of the stop codon
Reference transcript: NM_181874
cd PBP1_mGluR_groupIII 465aa 0.0
in ref transcript
Ligand-binding domain of the group III metabotropic glutamate receptor, a family which contains mGlu4R, mGluR6R, mGluR7, and mGluR8; all of which inhibit adenylyl cyclase. The metabotropic glutamate receptor is a member of the family C of G-protein-coupled receptors that transduce extracellular signals into G-protein activation and ultimately into intracellular responses. The mGluRs are classified into three groups which comprise eight subtypes.
pfam 7tm_3 262aa 3e-96
in ref transcript
7 transmembrane sweet-taste receptor of 3 GCPR. This is a domain of seven transmembrane regions that forms the C-terminus of some subclass 3 G-coupled-protein receptors. It is often associated with a downstream cysteine-rich linker domain, NCD3G pfam07562, which is the human sweet-taste receptor, and the N-terminal domain, ANF_receptor pfam01094. The seven TM regions assemble in such a way as to produce a docking pocket into which such molecules as cyclamate and lactisole have been found to bind and consequently confer the taste of sweetness.
pfam ANF_receptor 408aa 2e-67
in ref transcript
Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure.
pfam NCD3G 52aa 1e-17
in ref transcript
Nine Cysteines Domain of family 3 GPCR. This conserved sequence contains several highly-conserved Cys residues that are predicted to form disulphide bridges. It is predicted to lie outside the cell membrane, tethered to the pfam00003 in several receptor proteins.
COG LivK 310aa 9e-12
in ref transcript
ABC-type branched-chain amino acid transport systems, periplasmic component [Amino acid transport and metabolism].
GRP
refseq_GRP.F1 refseq_GRP.R1 99 120
NCBIGene 36.3 2922
Alternative 3-prime, size difference: 21
Exclusion in the protein (no frameshift)
Reference transcript: NM_002091
GSG1
refseq_GSG1.F2 refseq_GSG1.R2 184 308
NCBIGene 36.3 83445
Alternative 5-prime, size difference: 124
Inclusion in the protein causing a frameshift
Reference transcript: NM_153823
Changed!
pfam GSG-1 117aa 6e-47
in ref transcript
GSG1-like protein. This family contains sequences bearing similarity to a region of GSG1, a protein specifically expressed in testicular germ cells. It is possible that overexpression of the human homolog may be involved in tumourigenesis of human testicular germ cell tumours. The region in question has four highly-conserved cysteine residues.
Changed!
pfam GSG-1 118aa 3e-46
in modified transcript
GSTZ1
refseq_GSTZ1.F2 refseq_GSTZ1.R2 148 274
NCBIGene 36.3 2954
Single exon skipping, size difference: 126
Exclusion in the protein (no frameshift)
Reference transcript: NM_145870
Changed!
cd GST_C_Zeta 115aa 2e-42
in ref transcript
GST_C family, Class Zeta subfamily; GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins, and products of oxidative stress. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. Class Zeta GSTs, also known as maleylacetoacetate (MAA) isomerases, catalyze the isomerization of MAA to fumarylacetoacetate, the penultimate step in tyrosine/phenylalanine catabolism, using GSH as a cofactor. They show little GSH-conjugating activity towards traditional GST substrates, but display modest GSH peroxidase activity. They are also implicated in the detoxification of the carcinogen dichloroacetic acid by catalyzing its dechlorination to glyoxylic acid.
Changed!
cd GST_N_Zeta 75aa 1e-31
in ref transcript
GST_N family, Class Zeta subfamily; GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. The GST fold contains an N-terminal TRX-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. Class Zeta GSTs, also known as maleylacetoacetate (MAA) isomerases, catalyze the isomerization of MAA to fumarylacetoacetate, the penultimate step in tyrosine/phenylalanine catabolism, using GSH as a cofactor. They show little GSH-conjugating activity towards traditional GST substrates but display modest GSH peroxidase activity. They are also implicated in the detoxification of the carcinogen dichloroacetic acid by catalyzing its dechlorination to glyoxylic acid.
Changed!
TIGR maiA 205aa 4e-96
in ref transcript
Maleylacetoacetate isomerase is an enzyme of tyrosine and phenylalanine catabolism. It requires glutathione and belongs by homology to the zeta family of glutathione S-transferases. The enzyme (EC 5.2.1.2) is described as active also on maleylpyruvate, and the example from a Ralstonia sp. catabolic plasmid is described as a maleylpyruvate isomerase involved in gentisate catabolism.
Changed!
COG Gst 195aa 8e-32
in ref transcript
Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones].
Changed!
cd GST_C_Zeta 93aa 2e-33
in modified transcript
Changed!
cd GST_N_Zeta 67aa 5e-27
in modified transcript
Changed!
TIGR maiA 163aa 2e-69
in modified transcript
Changed!
COG Gst 153aa 2e-17
in modified transcript
GTDC1
refseq_GTDC1.F1 refseq_GTDC1.R1 152 407
NCBIGene 36.3 79712
Multiple exon skipping, size difference: 255
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_001006636
Changed!
cd GT1_cap1E_like 90aa 1e-04
in ref transcript
This family is most closely related to the GT1 family of glycosyltransferases. cap1E in Streptococcus pneumoniae is required for the synthesis of type 1 capsular polysaccharides.
Changed!
pfam Glycos_transf_1 110aa 0.001
in ref transcript
Glycosyl transferases group 1. Mutations in this domain of subunit A of phosphatidylinositol N-acetylglucosaminyltransferase lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysaccharides. Members of this family transfer UDP, ADP, GDP or CMP linked sugars. The eukaryotic glycogen synthases may be distant members of this family.
GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain.
pfam GTF2I 73aa 1e-30
in ref transcript
pfam GTF2I 73aa 4e-30
in ref transcript
pfam GTF2I 73aa 1e-29
in ref transcript
pfam GTF2I 73aa 3e-26
in ref transcript
pfam GTF2I 73aa 3e-26
in ref transcript
GTF2IRD1
refseq_GTF2IRD1.F2 refseq_GTF2IRD1.R2 138 183
NCBIGene 36.3 9569
Alternative 3-prime, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_016328
pfam GTF2I 73aa 2e-29
in ref transcript
GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain.
pfam GTF2I 73aa 3e-29
in ref transcript
pfam GTF2I 73aa 3e-29
in ref transcript
pfam GTF2I 73aa 5e-28
in ref transcript
pfam GTF2I 73aa 2e-25
in ref transcript
GTF3C2
refseq_GTF3C2.F2 refseq_GTF3C2.R2 195 271
NCBIGene 36.3 2976
Single exon skipping, size difference: 76
Exclusion in 5'UTR
Reference transcript: NM_001521
cd WD40 90aa 1e-05
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
smart WD40 38aa 0.002
in ref transcript
WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
GTPBP8
refseq_GTPBP8.F1 refseq_GTPBP8.R1 117 248
NCBIGene 36.2 29083
Single exon skipping, size difference: 131
Exclusion in the protein causing a frameshift
Reference transcript: NM_014170
Changed!
cd YihA_EngB 169aa 1e-50
in ref transcript
The YihA (EngB) subfamily. This subfamily of GTPases is typified by the E. coli YihA, an essential protein involved in cell division control. YihA and its orthologs are small proteins that typically contain less than 200 amino acid residues and consists of the GTPase domain only (some of the eukaryotic homologs contain an N-terminal extension of about 120 residues that might be involved in organellar targeting). Homologs of yihA are found in most Gram-positive and Gram-negative pathogenic bacteria, with the exception of Mycobacterium tuberculosis. The broad-spectrum nature of YihA and its essentiality for cell viability in bacteria make it an attractive antibacterial target.
Changed!
TIGR GTPase_YsxC 176aa 2e-47
in ref transcript
Members of this protein family are a GTPase associated with ribosome biogenesis, typified by YsxC from Bacillus subutilis. The family is widely but not universally distributed among bacteria. Members commonly are called EngB based on homology to EngA, one of several other GTPases of ribosome biogenesis. Cutoffs as set find essentially all bacterial members, but also identify large numbers of eukaryotic (probably organellar) sequences. This protein is found in about 80 percent of bacterial genomes.
Changed!
PRK engB 188aa 6e-50
in ref transcript
GTPase EngB; Reviewed.
Changed!
cd YihA_EngB 34aa 1e-09
in modified transcript
Changed!
TIGR GTPase_YsxC 51aa 7e-13
in modified transcript
Changed!
PRK engB 52aa 6e-14
in modified transcript
GTPBP8
refseq_GTPBP8.F3 refseq_GTPBP8.R3 129 228
NCBIGene 36.3 29083
Single exon skipping, size difference: 99
Exclusion in the protein (no frameshift)
Reference transcript: NM_014170
Changed!
cd YihA_EngB 169aa 1e-50
in ref transcript
The YihA (EngB) subfamily. This subfamily of GTPases is typified by the E. coli YihA, an essential protein involved in cell division control. YihA and its orthologs are small proteins that typically contain less than 200 amino acid residues and consists of the GTPase domain only (some of the eukaryotic homologs contain an N-terminal extension of about 120 residues that might be involved in organellar targeting). Homologs of yihA are found in most Gram-positive and Gram-negative pathogenic bacteria, with the exception of Mycobacterium tuberculosis. The broad-spectrum nature of YihA and its essentiality for cell viability in bacteria make it an attractive antibacterial target.
Changed!
TIGR GTPase_YsxC 176aa 2e-47
in ref transcript
Members of this protein family are a GTPase associated with ribosome biogenesis, typified by YsxC from Bacillus subutilis. The family is widely but not universally distributed among bacteria. Members commonly are called EngB based on homology to EngA, one of several other GTPases of ribosome biogenesis. Cutoffs as set find essentially all bacterial members, but also identify large numbers of eukaryotic (probably organellar) sequences. This protein is found in about 80 percent of bacterial genomes.
Changed!
PRK engB 188aa 6e-50
in ref transcript
GTPase EngB; Reviewed.
Changed!
cd YihA_EngB 135aa 4e-35
in modified transcript
Changed!
TIGR GTPase_YsxC 125aa 2e-28
in modified transcript
Changed!
PRK engB 155aa 1e-30
in modified transcript
GUSBL2
refseq_GUSBL2.F1 refseq_GUSBL2.R1 163 316
NCBIGene 36.2 375513
Single exon skipping, size difference: 153
Exclusion in the protein (no frameshift)
Reference transcript: NM_206910
GYPC
refseq_GYPC.F1 refseq_GYPC.R1 125 182
NCBIGene 36.3 2995
Single exon skipping, size difference: 57
Exclusion in the protein (no frameshift)
Reference transcript: NM_002101
smart 4.1m 19aa 6e-04
in ref transcript
putative band 4.1 homologues' binding motif.
Gcom1
refseq_Gcom1.F1 refseq_Gcom1.R1 113 206
NCBIGene 36.2 145781
Single exon skipping, size difference: 93
Exclusion in the protein (no frameshift)
Reference transcript: NM_001018090
Changed!
TIGR SMC_prok_B 250aa 2e-08
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
Changed!
COG Smc 266aa 4e-05
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
Changed!
TIGR SMC_prok_B 235aa 2e-05
in modified transcript
Changed!
COG Smc 206aa 4e-04
in modified transcript
Gcom1
refseq_Gcom1.F3 refseq_Gcom1.R3 138 222
NCBIGene 36.3 145781
Single exon skipping, size difference: 84
Exclusion in the protein (no frameshift)
Reference transcript: NM_001018090
Changed!
TIGR SMC_prok_B 250aa 2e-08
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
Changed!
COG Smc 266aa 4e-05
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
Changed!
TIGR SMC_prok_B 294aa 2e-09
in modified transcript
Changed!
COG Smc 285aa 3e-06
in modified transcript
H2AFV
refseq_H2AFV.F1 refseq_H2AFV.R1 226 304
NCBIGene 36.3 94239
Single exon skipping, size difference: 78
Exclusion in the protein (no frameshift)
Reference transcript: NM_012412
Changed!
cd H2A 105aa 1e-37
in ref transcript
Histone 2A; H2A is a subunit of the nucleosome. The nucleosome is an octamer containing two H2A, H2B, H3, and H4 subunits. The H2A subunit performs essential roles in maintaining structural integrity of the nucleosome, chromatin condensation, and binding of specific chromatin-associated proteins.
Changed!
smart H2A 106aa 2e-43
in ref transcript
Histone 2A.
Changed!
PTZ PTZ00017 110aa 1e-46
in ref transcript
histone H2A; Provisional.
Changed!
cd H2A 95aa 1e-33
in modified transcript
Changed!
smart H2A 97aa 9e-39
in modified transcript
Changed!
PTZ PTZ00017 101aa 1e-42
in modified transcript
H2AFV
refseq_H2AFV.F3 refseq_H2AFV.R3 121 251
NCBIGene 36.3 94239
Single exon skipping, size difference: 130
Exclusion in the protein causing a frameshift
Reference transcript: NM_012412
Changed!
cd H2A 105aa 1e-37
in ref transcript
Histone 2A; H2A is a subunit of the nucleosome. The nucleosome is an octamer containing two H2A, H2B, H3, and H4 subunits. The H2A subunit performs essential roles in maintaining structural integrity of the nucleosome, chromatin condensation, and binding of specific chromatin-associated proteins.
Changed!
smart H2A 106aa 2e-43
in ref transcript
Histone 2A.
Changed!
PTZ PTZ00017 110aa 1e-46
in ref transcript
histone H2A; Provisional.
Changed!
cd H2A 49aa 8e-14
in modified transcript
Changed!
smart H2A 48aa 1e-17
in modified transcript
Changed!
PTZ PTZ00017 48aa 8e-17
in modified transcript
H2AFV
refseq_H2AFV.F5 refseq_H2AFV.R5 134 248
NCBIGene 36.3 94239
Single exon skipping, size difference: 114
Exclusion in the protein (no frameshift)
Reference transcript: NM_012412
Changed!
cd H2A 105aa 1e-37
in ref transcript
Histone 2A; H2A is a subunit of the nucleosome. The nucleosome is an octamer containing two H2A, H2B, H3, and H4 subunits. The H2A subunit performs essential roles in maintaining structural integrity of the nucleosome, chromatin condensation, and binding of specific chromatin-associated proteins.
Changed!
smart H2A 106aa 2e-43
in ref transcript
Histone 2A.
Changed!
PTZ PTZ00017 110aa 1e-46
in ref transcript
histone H2A; Provisional.
Changed!
cd H2A 58aa 5e-18
in modified transcript
Changed!
smart H2A 60aa 2e-19
in modified transcript
Changed!
PTZ PTZ00017 64aa 2e-24
in modified transcript
HSD17B10
refseq_HADH2.F1 refseq_HADH2.R1 135 162
NCBIGene 36.3 3028
Alternative 5-prime, size difference: 27
Exclusion in the protein (no frameshift)
Reference transcript: NM_004493
Changed!
TIGR 3oxo_ACP_reduc 210aa 2e-32
in ref transcript
This model represents 3-oxoacyl-[ACP] reductase, also called 3-ketoacyl-acyl carrier protein reductase, an enzyme of fatty acid biosynthesis.
Changed!
TIGR 3oxo_ACP_reduc 201aa 2e-26
in modified transcript
Changed!
PRK fabG 245aa 4e-37
in modified transcript
HAGH
refseq_HAGH.F2 refseq_HAGH.R2 292 388
NCBIGene 36.3 3029
Single exon skipping, size difference: 96
Inclusion in the protein causing a new stop codon
Reference transcript: NM_005326
Changed!
TIGR GSH_gloB 250aa 5e-70
in ref transcript
Members of this protein family are hydroxyacylglutathione hydrolase, a detoxification enzyme known as glyoxalase II. It follows lactoylglutathione lyase, or glyoxalase I, and acts to remove the toxic metabolite methylglyoxal and related compounds. This protein belongs to the broader metallo-beta-lactamase family (pfam00753).
Changed!
PRK PRK10241 255aa 9e-33
in ref transcript
hydroxyacylglutathione hydrolase; Provisional.
HAO2
refseq_HAO2.F1 refseq_HAO2.R1 162 382
NCBIGene 36.3 51179
Single exon skipping, size difference: 220
Exclusion of the protein initiation site
Reference transcript: NM_001005783
cd alpha_hydroxyacid_oxid_FMN 336aa 1e-120
in ref transcript
Family of homologous FMN-dependent alpha-hydroxyacid oxidizing enzymes. This family occurs in both prokaryotes and eukaryotes. Members of this family include flavocytochrome b2 (FCB2), glycolate oxidase (GOX), lactate monooxygenase (LMO), mandelate dehydrogenase (MDH), and long chain hydroxyacid oxidase (LCHAO). In green plants, glycolate oxidase is one of the key enzymes in photorespiration where it oxidizes glycolate to glyoxylate. LMO catalyzes the oxidation of L-lactate to acetate and carbon dioxide. MDH oxidizes (S)-mandelate to phenylglyoxalate. It is an enzyme in the mandelate pathway that occurs in several strains of Pseudomonas which converts (R)-mandelate to benzoate.
pfam FMN_dh 336aa 1e-129
in ref transcript
FMN-dependent dehydrogenase.
COG LldD 344aa 1e-81
in ref transcript
L-lactate dehydrogenase (FMN-dependent) and related alpha-hydroxy acid dehydrogenases [Energy production and conversion].
HAP1
refseq_HAP1.F1 refseq_HAP1.R1 187 343
NCBIGene 36.3 9001
Single exon skipping, size difference: 156
Exclusion in the protein (no frameshift)
Reference transcript: NM_003949
Changed!
pfam HAP1_N 354aa 1e-71
in ref transcript
HAP1 N-terminal conserved region. This family represents an N-terminal conserved region found in several huntingtin-associated protein 1 (HAP1) homologues. HAP1 binds to huntingtin in a polyglutamine repeat-length-dependent manner. However, its possible role in the pathogenesis of Huntington's disease is unclear. This family also includes a similar N-terminal conserved region from hypothetical protein products of ALS2CR3 genes found in the human juvenile amyotrophic lateral sclerosis critical region 2q33-2q34.
Changed!
COG Smc 148aa 4e-07
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
Changed!
pfam HAP1_N 320aa 2e-65
in modified transcript
Changed!
COG Smc 133aa 4e-05
in modified transcript
HAT1
refseq_HAT1.F2 refseq_HAT1.R2 315 391
NCBIGene 36.3 8520
Single exon skipping, size difference: 76
Exclusion in the protein causing a frameshift
Reference transcript: NM_003642
Changed!
pfam Hat1_N 161aa 4e-50
in ref transcript
Histone acetyl transferase HAT1 N-terminus. This domain is the N-terminal half of the structure of histone acetyl transferase HAT1. It is often found in association with the C-terminal part of the GNAT Acetyltransf_1 (pfam00583) domain. It seems to be motifs C and D of the structure. Histone acetyltransferases (HATs) catalyse the transfer of an acetyl group from acetyl-CoA to the lysine E-amino groups on the N-terminal tails of histones. HATs are involved in transcription since histones tend to be hyper-acetylated in actively transcribed regions of chromatin, whereas in transcriptionally silent regions histones are hypo-acetylated.
HAX1
refseq_HAX1.F1 refseq_HAX1.R1 232 376
NCBIGene 36.3 10456
Alternative 3-prime, size difference: 144
Exclusion in the protein (no frameshift)
Reference transcript: NM_006118
HCFC1R1
refseq_HCFC1R1.F1 refseq_HCFC1R1.R1 139 196
NCBIGene 36.3 54985
Single exon skipping, size difference: 57
Exclusion in the protein (no frameshift)
Reference transcript: NM_017885
HDAC7A
refseq_HDAC7A.F2 refseq_HDAC7A.R2 117 237
NCBIGene 36.3 51564
Single exon skipping, size difference: 111
Exclusion in the protein (no frameshift)
Reference transcript: NM_001098415
pfam Hist_deacetyl 330aa 3e-82
in ref transcript
Histone deacetylase domain. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related to other proteins.
COG AcuC 309aa 3e-57
in ref transcript
Deacetylases, including yeast histone deacetylase and acetoin utilization protein [Chromatin structure and dynamics / Secondary metabolites biosynthesis, transport, and catabolism].
N6AMT1
refseq_HEMK2.F1 refseq_HEMK2.R1 121 205
NCBIGene 36.3 29104
Single exon skipping, size difference: 84
Exclusion in the protein (no frameshift)
Reference transcript: NM_013240
Changed!
cd AdoMet_MTases 126aa 3e-04
in ref transcript
S-adenosylmethionine-dependent methyltransferases (SAM or AdoMet-MTase), class I; AdoMet-MTases are enzymes that use S-adenosyl-L-methionine (SAM or AdoMet) as a substrate for methyltransfer, creating the product S-adenosyl-L-homocysteine (AdoHcy). There are at least five structurally distinct families of AdoMet-MTases, class I being the largest and most diverse. Within this class enzymes can be classified by different substrate specificities (small molecules, lipids, nucleic acids, etc.) and different target atoms for methylation (nitrogen, oxygen, carbon, sulfur, etc.).
Changed!
TIGR hemK_rel_arch 189aa 1e-23
in ref transcript
The gene hemK from E. coli was found to contribute to heme biosynthesis and originally suggested to be protoporphyrinogen oxidase. Functional analysis of the nearest homolog in Saccharomyces cerevisiae, YNL063w, finds it is not protoporphyrinogen oxidase and sequence analysis suggests that HemK homologs have S-adenosyl-methionine-dependent methyltransferase activity. Homologs are found, usually in a single copy, in nearly all completed genomes, but varying somewhat in apparent domain architecture. This model represents an archaeal and eukaryotic protein family that lacks an N-terminal domain found in HemK and its eubacterial homologs. It is found in a single copy in the first six completed archaeal and eukaryotic genomes.
Changed!
COG HemK 171aa 1e-23
in ref transcript
Methylase of polypeptide chain release factors [Translation, ribosomal structure and biogenesis].
Changed!
TIGR hemK_rel_arch 161aa 5e-15
in modified transcript
Changed!
COG HemK 143aa 4e-14
in modified transcript
HERC4
refseq_HERC4.F2 refseq_HERC4.R2 182 342
NCBIGene 36.2 26091
Single exon skipping, size difference: 160
Exclusion in the protein causing a frameshift
Reference transcript: NM_022079
Changed!
cd HECTc 347aa 1e-108
in ref transcript
HECT domain; C-terminal catalytic domain of a subclass of Ubiquitin-protein ligase (E3). It binds specific ubiquitin-conjugating enzymes (E2), accepts ubiquitin from E2, transfers ubiquitin to substrate lysine side chains, and transfers additional ubiquitin molecules to the end of growing ubiquitin chains.
Changed!
pfam HECT 298aa 5e-93
in ref transcript
HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus.
Changed!
pfam RCC1 50aa 2e-07
in ref transcript
Regulator of chromosome condensation (RCC1).
Changed!
pfam RCC1 48aa 1e-05
in ref transcript
Changed!
pfam RCC1 48aa 0.004
in ref transcript
Changed!
pfam RCC1 50aa 0.006
in ref transcript
Changed!
COG HUL4 398aa 6e-78
in ref transcript
Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
Changed!
COG ATS1 251aa 3e-35
in ref transcript
Alpha-tubulin suppressor and related RCC1 domain-containing proteins [Cell division and chromosome partitioning / Cytoskeleton].
Changed!
COG ATS1 69aa 1e-09
in modified transcript
HERC4
refseq_HERC4.F3 refseq_HERC4.R3 123 147
NCBIGene 36.3 26091
Single exon skipping, size difference: 24
Exclusion in the protein (no frameshift)
Reference transcript: NM_022079
cd HECTc 347aa 1e-108
in ref transcript
HECT domain; C-terminal catalytic domain of a subclass of Ubiquitin-protein ligase (E3). It binds specific ubiquitin-conjugating enzymes (E2), accepts ubiquitin from E2, transfers ubiquitin to substrate lysine side chains, and transfers additional ubiquitin molecules to the end of growing ubiquitin chains.
pfam HECT 298aa 5e-93
in ref transcript
HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus.
pfam RCC1 50aa 2e-07
in ref transcript
Regulator of chromosome condensation (RCC1).
pfam RCC1 48aa 1e-05
in ref transcript
pfam RCC1 48aa 0.004
in ref transcript
pfam RCC1 50aa 0.006
in ref transcript
COG HUL4 398aa 6e-78
in ref transcript
Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
COG ATS1 251aa 3e-35
in ref transcript
Alpha-tubulin suppressor and related RCC1 domain-containing proteins [Cell division and chromosome partitioning / Cytoskeleton].
HERC4
refseq_HERC4.F4 refseq_HERC4.R4 204 296
NCBIGene 36.2 26091
Single exon skipping, size difference: 92
Exclusion in the protein causing a frameshift
Reference transcript: NM_022079
Changed!
cd HECTc 347aa 1e-108
in ref transcript
HECT domain; C-terminal catalytic domain of a subclass of Ubiquitin-protein ligase (E3). It binds specific ubiquitin-conjugating enzymes (E2), accepts ubiquitin from E2, transfers ubiquitin to substrate lysine side chains, and transfers additional ubiquitin molecules to the end of growing ubiquitin chains.
Changed!
pfam HECT 298aa 5e-93
in ref transcript
HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus.
pfam RCC1 50aa 2e-07
in ref transcript
Regulator of chromosome condensation (RCC1).
pfam RCC1 48aa 1e-05
in ref transcript
Changed!
pfam RCC1 48aa 0.004
in ref transcript
Changed!
pfam RCC1 50aa 0.006
in ref transcript
Changed!
COG HUL4 398aa 6e-78
in ref transcript
Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
Changed!
COG ATS1 251aa 3e-35
in ref transcript
Alpha-tubulin suppressor and related RCC1 domain-containing proteins [Cell division and chromosome partitioning / Cytoskeleton].
Changed!
COG ATS1 204aa 2e-30
in modified transcript
HERC6
refseq_HERC6.F1 refseq_HERC6.R1 215 297
NCBIGene 36.2 55008
Single exon skipping, size difference: 82
Inclusion in the protein causing a frameshift
Reference transcript: NM_017912
Changed!
cd HECTc 343aa 1e-101
in ref transcript
HECT domain; C-terminal catalytic domain of a subclass of Ubiquitin-protein ligase (E3). It binds specific ubiquitin-conjugating enzymes (E2), accepts ubiquitin from E2, transfers ubiquitin to substrate lysine side chains, and transfers additional ubiquitin molecules to the end of growing ubiquitin chains.
Changed!
pfam HECT 288aa 2e-65
in ref transcript
HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus.
pfam RCC1 51aa 4e-05
in ref transcript
Regulator of chromosome condensation (RCC1).
pfam RCC1 48aa 6e-04
in ref transcript
Changed!
COG HUL4 608aa 1e-58
in ref transcript
Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
COG ATS1 241aa 1e-23
in ref transcript
Alpha-tubulin suppressor and related RCC1 domain-containing proteins [Cell division and chromosome partitioning / Cytoskeleton].
HERC6
refseq_HERC6.F2 refseq_HERC6.R1 282 390
NCBIGene 36.2 55008
Single exon skipping, size difference: 108
Exclusion in the protein (no frameshift)
Reference transcript: NM_017912
cd HECTc 343aa 1e-101
in ref transcript
HECT domain; C-terminal catalytic domain of a subclass of Ubiquitin-protein ligase (E3). It binds specific ubiquitin-conjugating enzymes (E2), accepts ubiquitin from E2, transfers ubiquitin to substrate lysine side chains, and transfers additional ubiquitin molecules to the end of growing ubiquitin chains.
pfam HECT 288aa 2e-65
in ref transcript
HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus.
pfam RCC1 51aa 4e-05
in ref transcript
Regulator of chromosome condensation (RCC1).
pfam RCC1 48aa 6e-04
in ref transcript
Changed!
COG HUL4 608aa 1e-58
in ref transcript
Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
COG ATS1 241aa 1e-23
in ref transcript
Alpha-tubulin suppressor and related RCC1 domain-containing proteins [Cell division and chromosome partitioning / Cytoskeleton].
Changed!
COG HUL4 572aa 1e-62
in modified transcript
HERC6
refseq_HERC6.F4 refseq_HERC6.R3 209 364
NCBIGene 36.2 55008
Single exon skipping, size difference: 155
Exclusion in the protein causing a frameshift
Reference transcript: NM_017912
Changed!
cd HECTc 343aa 1e-101
in ref transcript
HECT domain; C-terminal catalytic domain of a subclass of Ubiquitin-protein ligase (E3). It binds specific ubiquitin-conjugating enzymes (E2), accepts ubiquitin from E2, transfers ubiquitin to substrate lysine side chains, and transfers additional ubiquitin molecules to the end of growing ubiquitin chains.
Changed!
pfam HECT 288aa 2e-65
in ref transcript
HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus.
pfam RCC1 51aa 4e-05
in ref transcript
Regulator of chromosome condensation (RCC1).
pfam RCC1 48aa 6e-04
in ref transcript
Changed!
COG HUL4 608aa 1e-58
in ref transcript
Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
COG ATS1 241aa 1e-23
in ref transcript
Alpha-tubulin suppressor and related RCC1 domain-containing proteins [Cell division and chromosome partitioning / Cytoskeleton].
HERC6
refseq_HERC6.F5 refseq_HERC6.R4 224 346
NCBIGene 36.2 55008
Single exon skipping, size difference: 122
Exclusion in the protein causing a frameshift
Reference transcript: NM_017912
Changed!
cd HECTc 343aa 1e-101
in ref transcript
HECT domain; C-terminal catalytic domain of a subclass of Ubiquitin-protein ligase (E3). It binds specific ubiquitin-conjugating enzymes (E2), accepts ubiquitin from E2, transfers ubiquitin to substrate lysine side chains, and transfers additional ubiquitin molecules to the end of growing ubiquitin chains.
Changed!
pfam HECT 288aa 2e-65
in ref transcript
HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus.
pfam RCC1 51aa 4e-05
in ref transcript
Regulator of chromosome condensation (RCC1).
pfam RCC1 48aa 6e-04
in ref transcript
Changed!
COG HUL4 608aa 1e-58
in ref transcript
Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
COG ATS1 241aa 1e-23
in ref transcript
Alpha-tubulin suppressor and related RCC1 domain-containing proteins [Cell division and chromosome partitioning / Cytoskeleton].
HFE
refseq_HFE.F1 refseq_HFE.R1 284 353
NCBIGene 36.3 3077
Alternative 3-prime, size difference: 69
Exclusion in the protein (no frameshift)
Reference transcript: NM_000410
cd IGc 76aa 2e-12
in ref transcript
Immunoglobulin domain constant region subfamily; members of the IGc subfamily are components of immunoglobulins, T-cell receptors, CD1 cell surface glycoproteins, secretory glycoproteins A/C, and Major Histocompatibility Complex (MHC) class I/II molecules. In immunoglobulins, each chain is composed of one variable domain (IGv) and one or more constant domains (IGc); these names reflect the fact that the variability in sequences is higher in the variable domain than in the constant domain. T-cell receptors form heterodimers, pairing two chains (alpha/beta or gamma/delta), each with a IGv and IGc domain. MHCs form heterodimers pairing two chains (alpha/beta or delta/epsilon), each with a MHC and IGc domain. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
Changed!
pfam MHC_I 176aa 3e-31
in ref transcript
Class I Histocompatibility antigen, domains alpha 1 and 2.
pfam C1-set 72aa 2e-15
in ref transcript
Immunoglobulin C1-set domain.
Changed!
pfam MHC_I 157aa 6e-27
in modified transcript
HFE
refseq_HFE.F3 refseq_HFE.R3 202 244
NCBIGene 36.3 3077
Alternative 3-prime, size difference: 42
Exclusion in the protein (no frameshift)
Reference transcript: NM_000410
Changed!
cd IGc 76aa 2e-12
in ref transcript
Immunoglobulin domain constant region subfamily; members of the IGc subfamily are components of immunoglobulins, T-cell receptors, CD1 cell surface glycoproteins, secretory glycoproteins A/C, and Major Histocompatibility Complex (MHC) class I/II molecules. In immunoglobulins, each chain is composed of one variable domain (IGv) and one or more constant domains (IGc); these names reflect the fact that the variability in sequences is higher in the variable domain than in the constant domain. T-cell receptors form heterodimers, pairing two chains (alpha/beta or gamma/delta), each with a IGv and IGc domain. MHCs form heterodimers pairing two chains (alpha/beta or delta/epsilon), each with a MHC and IGc domain. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
pfam MHC_I 176aa 3e-31
in ref transcript
Class I Histocompatibility antigen, domains alpha 1 and 2.
Changed!
pfam C1-set 72aa 2e-15
in ref transcript
Immunoglobulin C1-set domain.
Changed!
cd IGc 92aa 3e-13
in modified transcript
Changed!
pfam C1-set 73aa 6e-16
in modified transcript
HFE
refseq_HFE.F5 refseq_HFE.R3 130 406
NCBIGene 36.3 3077
Single exon skipping, size difference: 276
Exclusion in the protein (no frameshift)
Reference transcript: NM_000410
cd IGc 76aa 2e-12
in ref transcript
Immunoglobulin domain constant region subfamily; members of the IGc subfamily are components of immunoglobulins, T-cell receptors, CD1 cell surface glycoproteins, secretory glycoproteins A/C, and Major Histocompatibility Complex (MHC) class I/II molecules. In immunoglobulins, each chain is composed of one variable domain (IGv) and one or more constant domains (IGc); these names reflect the fact that the variability in sequences is higher in the variable domain than in the constant domain. T-cell receptors form heterodimers, pairing two chains (alpha/beta or gamma/delta), each with a IGv and IGc domain. MHCs form heterodimers pairing two chains (alpha/beta or delta/epsilon), each with a MHC and IGc domain. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
Changed!
pfam MHC_I 176aa 3e-31
in ref transcript
Class I Histocompatibility antigen, domains alpha 1 and 2.
pfam C1-set 72aa 2e-15
in ref transcript
Immunoglobulin C1-set domain.
Changed!
pfam MHC_I 87aa 4e-08
in modified transcript
HFE2
refseq_HFE2.F2 refseq_HFE2.R2 103 289
NCBIGene 36.3 148738
Single exon skipping, size difference: 186
Exclusion of the protein initiation site
Reference transcript: NM_213653
pfam RGM_C 197aa 4e-81
in ref transcript
Repulsive guidance molecule (RGM) C-terminus. This family consists of several mammalian and one bird sequence from Gallus gallus (Chicken). This family represents the C-terminal region of several sequences but in others it represents the full protein. All of the mammalian proteins are hypothetical and have no known function but the member from chicken is annotated as being a repulsive guidance molecule (RGM). RGM is a GPI-linked axon guidance molecule of the retinotectal system. RGM is repulsive for a subset of axons, those from the temporal half of the retina. Temporal retinal axons invade the anterior optic tectum in a superficial layer, and encounter RGM expressed in a gradient with increasing concentration along the anterior-posterior axis. Temporal axons are able to receive posterior-dependent information by sensing gradients or concentrations of guidance cues. Thus, RGM is likely to provide positional information for temporal axons invading the optic tectum in the stratum opticum.
Changed!
pfam RGM_N 195aa 3e-57
in ref transcript
Repulsive guidance molecule (RGM) N-terminus. This family consists of the N-terminal region of several mammalian and one bird sequence from Gallus gallus (Chicken). All of the mammalian proteins are hypothetical and have no known function but the member from chicken is annotated as being a repulsive guidance molecule (RGM). RGM is a GPI-linked axon guidance molecule of the retinotectal system. RGM is repulsive for a subset of axons, those from the temporal half of the retina. Temporal retinal axons invade the anterior optic tectum in a superficial layer, and encounter RGM expressed in a gradient with increasing concentration along the anterior-posterior axis. Temporal axons are able to receive posterior-dependent information by sensing gradients or concentrations of guidance cues. Thus, RGM is likely to provide positional information for temporal axons invading the optic tectum in the stratum opticum.
Changed!
pfam RGM_N 114aa 7e-40
in modified transcript
HGD
refseq_HGD.F2 refseq_HGD.R2 143 366
NCBIGene 36.2 3081
Alternative 5-prime and 3-prime, size difference: 223
Exclusion in the protein (no frameshift), Exclusion in the protein causing a frameshift
Reference transcript: XM_001125869
Changed!
TIGR hmgA 432aa 0.0
in ref transcript
Missing in human disease alkaptonuria.
Changed!
PRK PRK05341 439aa 0.0
in ref transcript
homogentisate 1,2-dioxygenase; Provisional.
Changed!
TIGR hmgA 301aa 1e-170
in modified transcript
Changed!
PRK PRK05341 303aa 1e-132
in modified transcript
HHLA3
refseq_HHLA3.F2 refseq_HHLA3.R2 262 314
NCBIGene 36.3 11147
Single exon skipping, size difference: 52
Inclusion in the protein causing a frameshift
Reference transcript: NM_001031693
HIBCH
refseq_HIBCH.F1 refseq_HIBCH.R1 191 225
NCBIGene 36.3 26275
Single exon skipping, size difference: 34
Exclusion in the protein causing a frameshift
Reference transcript: NM_014362
cd crotonase-like 185aa 2e-39
in ref transcript
Crotonase/Enoyl-Coenzyme A (CoA) hydratase superfamily. This superfamily contains a diverse set of enzymes including enoyl-CoA hydratase, napthoate synthase, methylmalonyl-CoA decarboxylase, 3-hydoxybutyryl-CoA dehydratase, and dienoyl-CoA isomerase. Many of these play important roles in fatty acid metabolism. In addition to a conserved structural core and the formation of trimers (or dimers of trimers), a common feature in this superfamily is the stabilization of an enolate anion intermediate derived from an acyl-CoA substrate. This is accomplished by two conserved backbone NH groups in active sites that form an oxyanion hole.
pfam ECH 170aa 4e-15
in ref transcript
Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase.
Changed!
PRK PRK05617 355aa 1e-116
in ref transcript
enoyl-CoA hydratase; Provisional.
Changed!
PRK PRK05617 306aa 9e-93
in modified transcript
NFU1
refseq_HIRIP5.F2 refseq_HIRIP5.R2 307 443
NCBIGene 36.3 27247
Single exon skipping, size difference: 136
Exclusion in the protein causing a frameshift
Reference transcript: NM_015700
Changed!
pfam Nfu_N 88aa 3e-38
in ref transcript
Scaffold protein Nfu/NifU N terminal. This domain is found at the N terminus of NifU and NifU related proteins, and in the human Nfu protein. Both of these proteins are thought to be involved in the the assembly of iron-sulphur clusters.
Changed!
pfam NifU 69aa 7e-20
in ref transcript
NifU-like domain. This is an alignment of the carboxy-terminal domain. This is the only common region between the NifU protein from nitrogen-fixing bacteria and rhodobacterial species. The biochemical function of NifU is unknown.
Changed!
COG COG0694 85aa 4e-22
in ref transcript
Thioredoxin-like proteins and domains [Posttranslational modification, protein turnover, chaperones].
NFU1
refseq_HIRIP5.F3 refseq_HIRIP5.R3 184 274
NCBIGene 36.3 27247
Alternative 5-prime, size difference: 90
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001002755
Changed!
pfam Nfu_N 88aa 2e-38
in ref transcript
Scaffold protein Nfu/NifU N terminal. This domain is found at the N terminus of NifU and NifU related proteins, and in the human Nfu protein. Both of these proteins are thought to be involved in the the assembly of iron-sulphur clusters.
Changed!
pfam NifU 69aa 7e-20
in ref transcript
NifU-like domain. This is an alignment of the carboxy-terminal domain. This is the only common region between the NifU protein from nitrogen-fixing bacteria and rhodobacterial species. The biochemical function of NifU is unknown.
Changed!
COG COG0694 85aa 4e-22
in ref transcript
Thioredoxin-like proteins and domains [Posttranslational modification, protein turnover, chaperones].
HISPPD2A
refseq_HISPPD2A.F1 refseq_HISPPD2A.R1 98 500
NCBIGene 36.3 9677
Exon skipping and alternative 3-prime or 5-prime, size difference: 402
Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift), Exclusion in the protein causing a frameshift
Reference transcript: NM_014659
cd HP_HAP_like 80aa 4e-16
in ref transcript
Histidine phosphatase domain found in histidine acid phosphatases and phytases; contains a His residue which is phosphorylated during the reaction. Catalytic domain of HAP (histidine acid phosphatases) and phytases (myo-inositol hexakisphosphate phosphohydrolases). The conserved catalytic core of this domain contains a His residue which is phosphorylated in the reaction. Functions in this subgroup include roles in metabolism, signaling, or regulation, for example Escherichia coli glucose-1-phosphatase functions to scavenge glucose from glucose-1-phosphate and the signaling molecules inositol 1,3,4,5,6-pentakisphosphate (InsP5) and inositol hexakisphosphate (InsP6) are in vivo substrates for eukaryotic multiple inositol polyphosphate phosphatase 1 (Minpp1). Phytases scavenge phosphate from extracellular sources and are added to animal feed while prostatic acid phosphatase (PAP) has been used for many years as a serum marker for prostate cancer. Recently PAP has been shown in mouse models to suppress pain by functioning as an ecto-5prime-nucleotidase. In vivo it dephosphorylates extracellular adenosine monophosphate (AMP) generating adenosine,and leading to the activation of A1-adenosine receptors in dorsal spinal cord.
cd HP_HAP_like 139aa 8e-09
in ref transcript
pfam Acid_phosphat_A 407aa 5e-37
in ref transcript
Histidine acid phosphatase.
pfam Acid_phosphat_A 47aa 3e-07
in ref transcript
HISPPD2A
refseq_HISPPD2A.F4 refseq_HISPPD2A.R4 113 392
NCBIGene 36.3 9677
Exon skipping and alternative 3-prime or 5-prime, size difference: 279
Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift), Exclusion in the protein causing a frameshift
Reference transcript: NM_014659
cd HP_HAP_like 80aa 4e-16
in ref transcript
Histidine phosphatase domain found in histidine acid phosphatases and phytases; contains a His residue which is phosphorylated during the reaction. Catalytic domain of HAP (histidine acid phosphatases) and phytases (myo-inositol hexakisphosphate phosphohydrolases). The conserved catalytic core of this domain contains a His residue which is phosphorylated in the reaction. Functions in this subgroup include roles in metabolism, signaling, or regulation, for example Escherichia coli glucose-1-phosphatase functions to scavenge glucose from glucose-1-phosphate and the signaling molecules inositol 1,3,4,5,6-pentakisphosphate (InsP5) and inositol hexakisphosphate (InsP6) are in vivo substrates for eukaryotic multiple inositol polyphosphate phosphatase 1 (Minpp1). Phytases scavenge phosphate from extracellular sources and are added to animal feed while prostatic acid phosphatase (PAP) has been used for many years as a serum marker for prostate cancer. Recently PAP has been shown in mouse models to suppress pain by functioning as an ecto-5prime-nucleotidase. In vivo it dephosphorylates extracellular adenosine monophosphate (AMP) generating adenosine,and leading to the activation of A1-adenosine receptors in dorsal spinal cord.
Changed!
cd HP_HAP_like 139aa 8e-09
in ref transcript
Changed!
pfam Acid_phosphat_A 407aa 5e-37
in ref transcript
Histidine acid phosphatase.
pfam Acid_phosphat_A 47aa 3e-07
in ref transcript
Changed!
cd HP_HAP_like 108aa 8e-05
in modified transcript
Changed!
pfam Acid_phosphat_A 372aa 2e-32
in modified transcript
HISPPD2A
refseq_HISPPD2A.F5 refseq_HISPPD2A.R5 130 193
NCBIGene 36.3 9677
Single exon skipping, size difference: 63
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_014659
cd HP_HAP_like 80aa 4e-16
in ref transcript
Histidine phosphatase domain found in histidine acid phosphatases and phytases; contains a His residue which is phosphorylated during the reaction. Catalytic domain of HAP (histidine acid phosphatases) and phytases (myo-inositol hexakisphosphate phosphohydrolases). The conserved catalytic core of this domain contains a His residue which is phosphorylated in the reaction. Functions in this subgroup include roles in metabolism, signaling, or regulation, for example Escherichia coli glucose-1-phosphatase functions to scavenge glucose from glucose-1-phosphate and the signaling molecules inositol 1,3,4,5,6-pentakisphosphate (InsP5) and inositol hexakisphosphate (InsP6) are in vivo substrates for eukaryotic multiple inositol polyphosphate phosphatase 1 (Minpp1). Phytases scavenge phosphate from extracellular sources and are added to animal feed while prostatic acid phosphatase (PAP) has been used for many years as a serum marker for prostate cancer. Recently PAP has been shown in mouse models to suppress pain by functioning as an ecto-5prime-nucleotidase. In vivo it dephosphorylates extracellular adenosine monophosphate (AMP) generating adenosine,and leading to the activation of A1-adenosine receptors in dorsal spinal cord.
cd HP_HAP_like 139aa 8e-09
in ref transcript
pfam Acid_phosphat_A 407aa 5e-37
in ref transcript
Histidine acid phosphatase.
pfam Acid_phosphat_A 47aa 3e-07
in ref transcript
HISPPD2A
refseq_HISPPD2A.F7 refseq_HISPPD2A.R7 111 135
NCBIGene 36.3 9677
Alternative 5-prime, size difference: 24
Exclusion in the protein (no frameshift)
Reference transcript: NM_014659
cd HP_HAP_like 80aa 4e-16
in ref transcript
Histidine phosphatase domain found in histidine acid phosphatases and phytases; contains a His residue which is phosphorylated during the reaction. Catalytic domain of HAP (histidine acid phosphatases) and phytases (myo-inositol hexakisphosphate phosphohydrolases). The conserved catalytic core of this domain contains a His residue which is phosphorylated in the reaction. Functions in this subgroup include roles in metabolism, signaling, or regulation, for example Escherichia coli glucose-1-phosphatase functions to scavenge glucose from glucose-1-phosphate and the signaling molecules inositol 1,3,4,5,6-pentakisphosphate (InsP5) and inositol hexakisphosphate (InsP6) are in vivo substrates for eukaryotic multiple inositol polyphosphate phosphatase 1 (Minpp1). Phytases scavenge phosphate from extracellular sources and are added to animal feed while prostatic acid phosphatase (PAP) has been used for many years as a serum marker for prostate cancer. Recently PAP has been shown in mouse models to suppress pain by functioning as an ecto-5prime-nucleotidase. In vivo it dephosphorylates extracellular adenosine monophosphate (AMP) generating adenosine,and leading to the activation of A1-adenosine receptors in dorsal spinal cord.
Changed!
cd HP_HAP_like 139aa 8e-09
in ref transcript
Changed!
pfam Acid_phosphat_A 407aa 5e-37
in ref transcript
Histidine acid phosphatase.
pfam Acid_phosphat_A 47aa 3e-07
in ref transcript
Changed!
cd HP_HAP_like 102aa 5e-07
in modified transcript
Changed!
pfam Acid_phosphat_A 399aa 6e-38
in modified transcript
HK1
refseq_HK1.F2 refseq_HK1.R2 102 156
NCBIGene 36.3 3098
Alternative 5-prime, size difference: 54
Exclusion in 5'UTR
Reference transcript: NM_033498
pfam Hexokinase_2 237aa 1e-113
in ref transcript
Hexokinase. Hexokinase (EC:2.7.1.1) contains two structurally similar domains represented by this family and pfam00349. Some members of the family have two copies of each of these domains.
pfam Hexokinase_2 238aa 1e-108
in ref transcript
pfam Hexokinase_1 206aa 6e-94
in ref transcript
Hexokinase. Hexokinase (EC:2.7.1.1) contains two structurally similar domains represented by this family and pfam03727. Some members of the family have two copies of each of these domains.
pfam Hexokinase_1 201aa 8e-90
in ref transcript
COG COG5026 443aa 5e-84
in ref transcript
Hexokinase [Carbohydrate transport and metabolism].
COG COG5026 442aa 1e-74
in ref transcript
HK1
refseq_HK1.F3 refseq_HK1.R3 113 206
NCBIGene 36.3 3098
Single exon skipping, size difference: 93
Inclusion in the protein causing a new stop codon
Reference transcript: NM_033497
Changed!
pfam Hexokinase_2 237aa 1e-113
in ref transcript
Hexokinase. Hexokinase (EC:2.7.1.1) contains two structurally similar domains represented by this family and pfam00349. Some members of the family have two copies of each of these domains.
Changed!
pfam Hexokinase_2 238aa 1e-108
in ref transcript
Changed!
pfam Hexokinase_1 206aa 6e-94
in ref transcript
Hexokinase. Hexokinase (EC:2.7.1.1) contains two structurally similar domains represented by this family and pfam03727. Some members of the family have two copies of each of these domains.
Changed!
pfam Hexokinase_1 201aa 8e-90
in ref transcript
Changed!
COG COG5026 443aa 5e-84
in ref transcript
Hexokinase [Carbohydrate transport and metabolism].
Changed!
COG COG5026 442aa 1e-74
in ref transcript
HM13
refseq_HM13.F2 refseq_HM13.R2 162 218
NCBIGene 36.3 81502
Alternative 5-prime, size difference: 56
Inclusion in the protein causing a new stop codon
Reference transcript: NM_178581
pfam Peptidase_A22B 286aa 1e-89
in ref transcript
Signal peptide peptidase. The members of this family are membrane proteins. In some proteins this region is found associated with pfam02225. This family corresponds with Merops subfamily A22B, the type example of which is signal peptide peptidase. There is a sequence-similarity relationship with pfam01080.
COG COG3389 176aa 3e-05
in ref transcript
Uncharacterized protein conserved in archaea [Function unknown].
HMG2L1
refseq_HMG2L1.F1 refseq_HMG2L1.R1 248 363
NCBIGene 36.3 10042
Single exon skipping, size difference: 115
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001003681
Changed!
cd HMG-box 60aa 5e-11
in ref transcript
High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
Changed!
pfam HMG_box 61aa 5e-11
in ref transcript
HMG (high mobility group) box.
Changed!
COG NHP6B 72aa 9e-04
in ref transcript
Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics].
HMG2L1
refseq_HMG2L1.F3 refseq_HMG2L1.R3 100 199
NCBIGene 36.3 10042
Single exon skipping, size difference: 99
Exclusion of the protein initiation site
Reference transcript: NM_001003681
cd HMG-box 60aa 5e-11
in ref transcript
High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
Changed!
pfam HMG_box 61aa 5e-11
in ref transcript
HMG (high mobility group) box.
COG NHP6B 72aa 9e-04
in ref transcript
Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics].
HMGN3
refseq_HMGN3.F1 refseq_HMGN3.R1 111 152
NCBIGene 36.3 9324
Alternative 5-prime, size difference: 41
Exclusion in the protein causing a frameshift
Reference transcript: NM_004242
HN1
refseq_HN1.F2 refseq_HN1.R2 139 234
NCBIGene 36.3 51155
Single exon skipping, size difference: 95
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001002032
HNRNPA2B1
refseq_HNRPA2B1.F2 refseq_HNRPA2B1.R2 180 216
NCBIGene 36.3 3181
Single exon skipping, size difference: 36
Exclusion in the protein (no frameshift)
Reference transcript: NM_031243
Changed!
cd RRM 73aa 3e-13
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
cd RRM 74aa 8e-11
in ref transcript
Changed!
TIGR SF-CC1 140aa 9e-15
in ref transcript
A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.
Changed!
COG COG0724 155aa 3e-10
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
Changed!
cd RRM 74aa 3e-13
in modified transcript
Changed!
TIGR SF-CC1 150aa 2e-15
in modified transcript
Changed!
COG COG0724 164aa 1e-09
in modified transcript
HNRPAB
refseq_HNRPAB.F2 refseq_HNRPAB.R2 129 270
NCBIGene 36.3 3182
Single exon skipping, size difference: 141
Exclusion in the protein (no frameshift)
Reference transcript: NM_031266
cd RRM 70aa 6e-14
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
cd RRM 48aa 4e-10
in ref transcript
TIGR SF-CC1 137aa 2e-18
in ref transcript
A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.
pfam CBFNT 71aa 6e-06
in ref transcript
CBFNT (NUC161) domain. This N terminal domain is found in proteins of CARG-binding factor A-like proteins.
COG COG0724 142aa 6e-08
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
HNRPD
refseq_HNRPD.F2 refseq_HNRPD.R2 160 217
NCBIGene 36.3 3184
Single exon skipping, size difference: 57
Exclusion in the protein (no frameshift)
Reference transcript: NM_031370
cd RRM 79aa 2e-14
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
cd RRM 74aa 9e-14
in ref transcript
Changed!
TIGR PABP-1234 143aa 2e-19
in ref transcript
There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, and of unknown tissue range.
Changed!
COG COG0724 179aa 2e-07
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
Changed!
TIGR PABP-1234 155aa 2e-20
in modified transcript
Changed!
COG COG0724 162aa 5e-07
in modified transcript
HNRPD
refseq_HNRPD.F3 refseq_HNRPD.R3 189 336
NCBIGene 36.3 3184
Single exon skipping, size difference: 147
Exclusion in the protein (no frameshift)
Reference transcript: NM_031370
cd RRM 79aa 2e-14
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
cd RRM 74aa 9e-14
in ref transcript
TIGR PABP-1234 143aa 2e-19
in ref transcript
There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, and of unknown tissue range.
COG COG0724 179aa 2e-07
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
HNRPDL
refseq_HNRPDL.F1 refseq_HNRPDL.R1 133 238
NCBIGene 36.3 9987
Single exon skipping, size difference: 105
Exclusion in 3'UTR
Reference transcript: NR_003249
cd RRM 74aa 1e-14
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
cd RRM 70aa 1e-14
in ref transcript
TIGR SF-CC1 149aa 1e-18
in ref transcript
A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.
COG COG0724 170aa 3e-08
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
HNRPH3
refseq_HNRPH3.F1 refseq_HNRPH3.R1 120 165
NCBIGene 36.3 3189
Alternative 5-prime, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_012207
cd RRM 74aa 5e-08
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
cd RRM 71aa 1e-07
in ref transcript
smart RRM_2 69aa 4e-08
in ref transcript
RNA recognition motif.
smart RRM_2 72aa 5e-08
in ref transcript
HNRPK
refseq_HNRPK.F1 refseq_HNRPK.R1 283 343
NCBIGene 36.3 3190
Alternative 3-prime, size difference: 60
Exclusion of the stop codon
Reference transcript: NM_002140
cd PCBP_like_KH 62aa 1e-11
in ref transcript
K homology RNA-binding domain, PCBP_like. Members of this group possess KH domains in a tandem arrangement. Most members, similar to the poly(C) binding proteins (PCBPs) and Nova, containing three KH domains, with the first and second domains, which are represented here, in tandem arrangement, followed by a large spacer region, with the third domain near the C-terminal end of the protein. The poly(C) binding proteins (PCBPs) can be divided into two groups, hnRNPs K/J and the alphaCPs, which share a triple KH domain configuration and poly(C) binding specificity. They play roles in mRNA stabilization, translational activation, and translational silencing. Nova-1 and Nova-2 are nuclear RNA-binding proteins that regulate splicing. This group also contains plant proteins that seem to have two tandem repeat arrrangements, like Hen4, a protein that plays a role in AGAMOUS (AG) pre-mRNA processing and important step in plant development. In general, KH binds single-stranded RNA or DNA. It is found in a wide variety of proteins including ribosomal proteins, transcription factors and post-transcriptional modifiers of mRNA.
cd PCBP_like_KH 65aa 1e-07
in ref transcript
pfam ROKNT 43aa 3e-12
in ref transcript
ROKNT (NUC014) domain. This presumed domain is found at the N-terminus of RNP K-like proteins that also contains KH domains pfam00013.
pfam KH_1 61aa 7e-10
in ref transcript
KH domain. KH motifs can bind RNA in vitro. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia.
SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
TIGR PNK-3'Pase 111aa 0.003
in ref transcript
Note that the EC number for the kinase function is: 2.7.1.78.
COG COG4639 151aa 0.005
in ref transcript
Predicted kinase [General function prediction only].
HNRPUL1
refseq_HNRPUL1.F2 refseq_HNRPUL1.R2 231 387
NCBIGene 36.2 11100
Alternative 3-prime, size difference: 156
Exclusion in the protein (no frameshift)
Reference transcript: NM_007040
pfam SPRY 133aa 2e-25
in ref transcript
SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
TIGR selen_PSTK 146aa 0.005
in ref transcript
Members of this protein are L-seryl-tRNA(Sec) kinase. This enzyme is part of a two-step pathway in Eukaryota and Archaea for performing selenocysteine biosynthesis by changing serine misacylated on selenocysteine-tRNA to selenocysteine. This enzyme performs the first step, phosphorylation of the OH group of the serine side chain. This family represents archaeal proteins with this activity.
COG COG4639 131aa 0.009
in ref transcript
Predicted kinase [General function prediction only].
HOMER2
refseq_HOMER2.F1 refseq_HOMER2.R1 131 164
NCBIGene 36.3 9455
Alternative 3-prime, size difference: 33
Exclusion in the protein (no frameshift)
Reference transcript: NM_199330
cd Homer 111aa 6e-54
in ref transcript
Homer type EVH1 domain. Homer is a synaptic scaffolding protein, involved in neuronal signaling. It contains an EVH1 domain, which binds to both neurotransmitter receptors, such as the metabotropic glutamate receptor (mGluR) and to other scaffolding proteins via PPXXF motifs, in order to target them to the synaptic junction. It has a PH-like fold, despite having minimal sequence similarity to PH or PTB domains.
pfam WH1 104aa 6e-32
in ref transcript
WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mGluR5 bind a protein called homer, which is a WH1 domain homologue. A subset of WH1 domains has been termed a "EVH1" domain and appear to bind a polyproline motif.
TIGR SMC_prok_A 190aa 8e-05
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
Changed!
COG Smc 188aa 0.008
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
Changed!
COG SbcC 237aa 0.001
in modified transcript
ATPase involved in DNA repair [DNA replication, recombination, and repair].
HOMER2
refseq_HOMER2.F4 refseq_HOMER2.R4 281 446
NCBIGene 36.3 9455
Alternative 5-prime and 3-prime, size difference: 165
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_199330
cd Homer 111aa 6e-54
in ref transcript
Homer type EVH1 domain. Homer is a synaptic scaffolding protein, involved in neuronal signaling. It contains an EVH1 domain, which binds to both neurotransmitter receptors, such as the metabotropic glutamate receptor (mGluR) and to other scaffolding proteins via PPXXF motifs, in order to target them to the synaptic junction. It has a PH-like fold, despite having minimal sequence similarity to PH or PTB domains.
pfam WH1 104aa 6e-32
in ref transcript
WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mGluR5 bind a protein called homer, which is a WH1 domain homologue. A subset of WH1 domains has been termed a "EVH1" domain and appear to bind a polyproline motif.
Changed!
TIGR SMC_prok_A 190aa 8e-05
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
Changed!
COG Smc 188aa 0.008
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
HOPX
refseq_HOP.F1 refseq_HOP.R1 237 362
NCBIGene 36.3 84525
Single exon skipping, size difference: 125
Exclusion of the protein initiation site
Reference transcript: NM_032495
cd homeodomain 53aa 1e-06
in ref transcript
Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
pfam Homeobox 52aa 4e-05
in ref transcript
Homeobox domain.
HPCAL1
refseq_HPCAL1.F1 refseq_HPCAL1.R1 178 351
NCBIGene 36.3 3241
Single exon skipping, size difference: 173
Exclusion in 5'UTR
Reference transcript: NM_134421
cd EFh 62aa 3e-11
in ref transcript
EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
cd EFh 75aa 2e-10
in ref transcript
cd EFh 61aa 0.008
in ref transcript
smart EFh 29aa 8e-04
in ref transcript
EF-hand, calcium binding motif. EF-hands are calcium-binding motifs that occur at least in pairs. Links between disease states and genes encoding EF-hands, particularly the S100 subclass, are emerging. Each motif consists of a 12 residue loop flanked on either side by a 12 residue alpha-helix. EF-hands undergo a conformational change unpon binding calcium ions.
COG FRQ1 167aa 8e-27
in ref transcript
Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].
HPS1
refseq_HPS1.F1 refseq_HPS1.R1 106 149
NCBIGene 36.2 3257
Alternative 5-prime, size difference: 43
Inclusion in the protein causing a frameshift
Reference transcript: NM_000195
HPS1
refseq_HPS1.F2 refseq_HPS1.R2 149 292
NCBIGene 36.2 3257
Single exon skipping, size difference: 143
Exclusion in the protein causing a frameshift
Reference transcript: NM_000195
HPS1
refseq_HPS1.F4 refseq_HPS1.R4 140 239
NCBIGene 36.2 3257
Single exon skipping, size difference: 99
Exclusion in the protein (no frameshift)
Reference transcript: NM_000195
HPS4
refseq_HPS4.F2 refseq_HPS4.R2 144 198
NCBIGene 36.2 89781
Single exon skipping, size difference: 54
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_022081
HPS5
refseq_HPS5.F2 refseq_HPS5.R2 222 379
NCBIGene 36.3 11234
Single exon skipping, size difference: 157
Exclusion of the protein initiation site
Reference transcript: NM_181507
HR
refseq_HR.F1 refseq_HR.R1 243 408
NCBIGene 36.3 55806
Single exon skipping, size difference: 165
Exclusion in the protein (no frameshift)
Reference transcript: NM_005144
Changed!
pfam JmjC 41aa 4e-06
in ref transcript
JmjC domain. The JmjC domain belongs to the Cupin superfamily. JmjC-domain proteins may be protein hydroxylases that catalyse a novel histone modification.
HSD11B1L
refseq_HSD11B1L.F1 refseq_HSD11B1L.R1 189 252
NCBIGene 36.3 374875
Exon skipping and alternative 3-prime or 5-prime, size difference: 63
Inclusion in the protein (no stop codon or frameshift), Exclusion of the protein initiation site
Reference transcript: NM_198533
Changed!
pfam adh_short 165aa 4e-13
in ref transcript
short chain dehydrogenase. This family contains a wide variety of dehydrogenases.
Changed!
PRK PRK06181 234aa 7e-28
in ref transcript
short chain dehydrogenase; Provisional.
Changed!
pfam adh_short 109aa 9e-04
in modified transcript
Changed!
PRK PRK06181 177aa 6e-14
in modified transcript
HSD11B1L
refseq_HSD11B1L.F3 refseq_HSD11B1L.R3 134 246
NCBIGene 36.3 374875
Single exon skipping, size difference: 112
Exclusion in the protein causing a frameshift
Reference transcript: NM_198533
Changed!
pfam adh_short 165aa 4e-13
in ref transcript
short chain dehydrogenase. This family contains a wide variety of dehydrogenases.
Changed!
PRK PRK06181 234aa 7e-28
in ref transcript
short chain dehydrogenase; Provisional.
Changed!
PRK PRK05866 44aa 5e-07
in modified transcript
short chain dehydrogenase; Provisional.
HSD11B1L
refseq_HSD11B1L.F6 refseq_HSD11B1L.R6 223 386
NCBIGene 36.3 374875
Alternative 3-prime, size difference: 163
Inclusion in the protein causing a frameshift
Reference transcript: NM_198533
pfam adh_short 165aa 4e-13
in ref transcript
short chain dehydrogenase. This family contains a wide variety of dehydrogenases.
Changed!
PRK PRK06181 234aa 7e-28
in ref transcript
short chain dehydrogenase; Provisional.
Changed!
PRK PRK06181 216aa 9e-26
in modified transcript
HTATIP
refseq_HTATIP.F1 refseq_HTATIP.R1 105 261
NCBIGene 36.3 10524
Single exon skipping, size difference: 156
Exclusion in the protein (no frameshift)
Reference transcript: NM_182710
cd CHROMO 53aa 0.003
in ref transcript
Chromatin organization modifier (chromo) domain is a conserved region of around 50 amino acids found in a variety of chromosomal proteins, which appear to play a role in the functional organization of the eukaryotic nucleus. Experimental evidence implicates the chromo domain in the binding activity of these proteins to methylated histone tails and maybe RNA. May occur as single instance, in a tandem arrangement or followd by a related "chromo shadow" domain.
pfam MOZ_SAS 193aa 1e-95
in ref transcript
MOZ/SAS family. This region of these proteins has been suggested to be homologous to acetyltransferases.
smart CHROMO 52aa 2e-04
in ref transcript
Chromatin organization modifier domain.
Changed!
COG SAS2 275aa 6e-96
in ref transcript
Histone acetyltransferase (MYST family) [Chromatin structure and dynamics].
Changed!
COG SAS2 72aa 2e-07
in ref transcript
Changed!
COG SAS2 443aa 1e-102
in modified transcript
HTR4
refseq_HTR4.F1 refseq_HTR4.R1 235 355
NCBIGene 36.3 3360
Single exon skipping, size difference: 120
Exclusion in the protein (no frameshift)
Reference transcript: NM_001040173
pfam 7tm_1 271aa 5e-51
in ref transcript
7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
HTR4
refseq_HTR4.F3 refseq_HTR4.R3 223 265
NCBIGene 36.3 3360
Single exon skipping, size difference: 42
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_001040173
Changed!
pfam 7tm_1 271aa 5e-51
in ref transcript
7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
Changed!
pfam 7tm_1 285aa 3e-50
in modified transcript
HTR4
refseq_HTR4.F5 refseq_HTR4.R6 247 323
NCBIGene 36.3 3360
Single exon skipping, size difference: 76
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001040169
pfam 7tm_1 271aa 6e-50
in ref transcript
7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
HTRA2
refseq_HTRA2.F1 refseq_HTRA2.R1 209 305
NCBIGene 36.3 27429
Single exon skipping, size difference: 96
Exclusion in the protein (no frameshift)
Reference transcript: NM_013247
Changed!
cd PDZ_serine_protease 93aa 1e-12
in ref transcript
PDZ domain of tryspin-like serine proteases, such as DegP/HtrA, which are oligomeric proteins involved in heat-shock response, chaperone function, and apoptosis. May be responsible for substrate recognition and/or binding, as most PDZ domains bind C-terminal polypeptides, though binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of protease-associated PDZ domains a C-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in Eumetazoan signaling proteins.
Changed!
TIGR degP_htrA_DO 304aa 6e-63
in ref transcript
This family consists of a set proteins various designated DegP, heat shock protein HtrA, and protease DO. The ortholog in Pseudomonas aeruginosa is designated MucD and is found in an operon that controls mucoid phenotype. This family also includes the DegQ (HhoA) paralog in E. coli which can rescue a DegP mutant, but not the smaller DegS paralog, which cannot. Members of this family are located in the periplasm and have separable functions as both protease and chaperone. Members have a trypsin domain and two copies of a PDZ domain. This protein protects bacteria from thermal and other stresses and may be important for the survival of bacterial pathogens.// The chaperone function is dominant at low temperatures, whereas the proteolytic activity is turned on at elevated temperatures.
Changed!
COG DegQ 312aa 4e-46
in ref transcript
Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain [Posttranslational modification, protein turnover, chaperones].
Changed!
cd PDZ_metalloprotease 51aa 1e-05
in modified transcript
PDZ domain of bacterial and plant zinc metalloprotases, presumably membrane-associated or integral membrane proteases, which may be involved in signalling and regulatory mechanisms. May be responsible for substrate recognition and/or binding, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of protease-associated PDZ domains a C-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in Eumetazoan signaling proteins.
Changed!
TIGR degP_htrA_DO 272aa 5e-53
in modified transcript
Changed!
COG DegQ 280aa 2e-40
in modified transcript
HTRA2
refseq_HTRA2.F3 refseq_HTRA2.R3 110 305
NCBIGene 36.3 27429
Single exon skipping, size difference: 195
Exclusion in the protein (no frameshift)
Reference transcript: NM_013247
cd PDZ_serine_protease 93aa 1e-12
in ref transcript
PDZ domain of tryspin-like serine proteases, such as DegP/HtrA, which are oligomeric proteins involved in heat-shock response, chaperone function, and apoptosis. May be responsible for substrate recognition and/or binding, as most PDZ domains bind C-terminal polypeptides, though binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of protease-associated PDZ domains a C-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in Eumetazoan signaling proteins.
Changed!
TIGR degP_htrA_DO 304aa 6e-63
in ref transcript
This family consists of a set proteins various designated DegP, heat shock protein HtrA, and protease DO. The ortholog in Pseudomonas aeruginosa is designated MucD and is found in an operon that controls mucoid phenotype. This family also includes the DegQ (HhoA) paralog in E. coli which can rescue a DegP mutant, but not the smaller DegS paralog, which cannot. Members of this family are located in the periplasm and have separable functions as both protease and chaperone. Members have a trypsin domain and two copies of a PDZ domain. This protein protects bacteria from thermal and other stresses and may be important for the survival of bacterial pathogens.// The chaperone function is dominant at low temperatures, whereas the proteolytic activity is turned on at elevated temperatures.
Changed!
COG DegQ 312aa 4e-46
in ref transcript
Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain [Posttranslational modification, protein turnover, chaperones].
Changed!
TIGR degP_htrA_DO 239aa 5e-30
in modified transcript
Changed!
COG DegQ 157aa 9e-18
in modified transcript
Changed!
PRK PRK10898 81aa 1e-04
in modified transcript
serine endoprotease; Provisional.
HYAL1
refseq_HYAL1.F1 refseq_HYAL1.R1 101 191
NCBIGene 36.3 3373
Single exon skipping, size difference: 90
Exclusion in the protein (no frameshift)
Reference transcript: NM_153281
Changed!
pfam Glyco_hydro_56 335aa 1e-146
in ref transcript
Hyaluronidase.
Changed!
pfam Glyco_hydro_56 305aa 1e-127
in modified transcript
ICAM4
refseq_ICAM4.F2 refseq_ICAM4.R2 237 314
NCBIGene 36.3 3386
Alternative 3-prime, size difference: 77
Inclusion in the protein causing a frameshift
Reference transcript: NM_001039132
Changed!
pfam ICAM_N 87aa 2e-11
in ref transcript
Intercellular adhesion molecule (ICAM), N-terminal domain. ICAMs normally functions to promote intercellular adhesion and signalling. However, The N-terminal domain of the receptor binds to the rhinovirus 'canyon' surrounding the icosahedral 5-fold axes, during the viral attachment process. This family is a family that is part of the Ig superfamily and is therefore related to the family ig (pfam00047).
Changed!
pfam ICAM_N 89aa 3e-12
in modified transcript
ICK
refseq_ICK.F2 refseq_ICK.R2 239 350
NCBIGene 36.3 22858
Single exon skipping, size difference: 111
Exclusion in 5'UTR
Reference transcript: NM_016513
cd S_TKc 282aa 3e-74
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
pfam Pkinase 281aa 1e-79
in ref transcript
Protein kinase domain.
PTZ PTZ00024 275aa 2e-43
in ref transcript
cyclin-dependent protein kinase; Provisional.
IDH3B
refseq_IDH3B.F2 refseq_IDH3B.R2 101 416
NCBIGene 36.3 3420
Alternative 3-prime, size difference: 315
Exclusion of the stop codon
Reference transcript: NM_006899
Changed!
TIGR mito_nad_idh 334aa 1e-133
in ref transcript
The NADP-dependent IDH of Thermus aquaticus thermophilus strain HB8 resembles these NAD-dependent IDH, except for the residues involved in cofactor specificity, much more closely than it resembles other prokaryotic NADP-dependent IDH, including that of Thermus aquaticus strain YT1.
Changed!
COG LeuB 336aa 1e-81
in ref transcript
Isocitrate/isopropylmalate dehydrogenase [Amino acid transport and metabolism].
Changed!
TIGR mito_nad_idh 334aa 1e-133
in modified transcript
Changed!
COG LeuB 334aa 1e-82
in modified transcript
IFIT1
refseq_IFIT1.F2 refseq_IFIT1.R2 201 310
NCBIGene 36.2 3434
Single exon skipping, size difference: 109
Inclusion in the protein causing a frameshift
Reference transcript: NM_001001887
Changed!
cd TPR 85aa 8e-05
in ref transcript
Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
Changed!
cd TPR 91aa 0.001
in ref transcript
Changed!
TIGR PEP_TPR_lipo 412aa 1e-08
in ref transcript
This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
Changed!
COG PilF 144aa 0.002
in ref transcript
Tfp pilus assembly protein PilF [Cell motility and secretion / Intracellular trafficking and secretion].
IFNAR2
refseq_IFNAR2.F1 refseq_IFNAR2.R1 186 232
NCBIGene 36.3 3455
Alternative 3-prime, size difference: 46
Inclusion in 5'UTR
Reference transcript: NM_207585
pfam Interfer-bind 101aa 1e-48
in ref transcript
Interferon-alpha/beta receptor, fibronectin type III. Members of this family adopt a secondary structure consisting of seven beta-strands arranged in an immunoglobulin-like beta-sandwich, in a Greek-key topology. They are required for binding to interferon-alpha.
IFT122
refseq_IFT122.F2 refseq_IFT122.R2 218 395
NCBIGene 36.3 55764
Single exon skipping, size difference: 177
Exclusion in the protein (no frameshift)
Reference transcript: NM_052985
Changed!
cd WD40 230aa 2e-14
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Changed!
cd WD40 259aa 6e-08
in ref transcript
smart WD40 39aa 2e-06
in ref transcript
WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
TIGR PEP_TPR_lipo 209aa 0.002
in ref transcript
This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
Changed!
COG COG2319 229aa 6e-08
in ref transcript
FOG: WD40 repeat [General function prediction only].
Changed!
COG COG2319 269aa 3e-04
in ref transcript
Changed!
cd WD40 293aa 3e-16
in modified transcript
Changed!
COG COG2319 329aa 4e-13
in modified transcript
IFT122
refseq_IFT122.F3 refseq_IFT122.R3 119 272
NCBIGene 36.3 55764
Single exon skipping, size difference: 153
Exclusion in the protein (no frameshift)
Reference transcript: NM_052985
Changed!
cd WD40 230aa 2e-14
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Changed!
cd WD40 259aa 6e-08
in ref transcript
smart WD40 39aa 2e-06
in ref transcript
WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
TIGR PEP_TPR_lipo 209aa 0.002
in ref transcript
This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
Changed!
COG COG2319 229aa 6e-08
in ref transcript
FOG: WD40 repeat [General function prediction only].
Changed!
COG COG2319 269aa 3e-04
in ref transcript
Changed!
cd WD40 176aa 1e-16
in modified transcript
Changed!
cd WD40 302aa 9e-14
in modified transcript
Changed!
COG COG2319 345aa 1e-11
in modified transcript
IFT88
refseq_IFT88.F1 refseq_IFT88.R1 115 344
NCBIGene 36.3 8100
Multiple exon skipping, size difference: 229
Exclusion in 5'UTR, Exclusion of the protein initiation site
Reference transcript: NM_175605
cd TPR 99aa 6e-12
in ref transcript
Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
cd TPR 100aa 9e-08
in ref transcript
cd TPR 95aa 4e-07
in ref transcript
cd TPR 98aa 8e-06
in ref transcript
cd TPR 93aa 3e-05
in ref transcript
TIGR PEP_TPR_lipo 270aa 1e-14
in ref transcript
This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
TIGR PEP_TPR_lipo 135aa 0.005
in ref transcript
Changed!
COG NrfG 259aa 3e-05
in ref transcript
FOG: TPR repeat [General function prediction only].
Changed!
COG PilF 213aa 4e-05
in modified transcript
Tfp pilus assembly protein PilF [Cell motility and secretion / Intracellular trafficking and secretion].
Changed!
COG COG4235 125aa 8e-04
in modified transcript
Cytochrome c biogenesis factor [Posttranslational modification, protein turnover, chaperones].
IGF2BP2
refseq_IGF2BP2.F1 refseq_IGF2BP2.R1 224 353
NCBIGene 36.3 10644
Single exon skipping, size difference: 129
Exclusion in the protein (no frameshift)
Reference transcript: NM_006548
cd PCBP_like_KH 64aa 1e-09
in ref transcript
K homology RNA-binding domain, PCBP_like. Members of this group possess KH domains in a tandem arrangement. Most members, similar to the poly(C) binding proteins (PCBPs) and Nova, containing three KH domains, with the first and second domains, which are represented here, in tandem arrangement, followed by a large spacer region, with the third domain near the C-terminal end of the protein. The poly(C) binding proteins (PCBPs) can be divided into two groups, hnRNPs K/J and the alphaCPs, which share a triple KH domain configuration and poly(C) binding specificity. They play roles in mRNA stabilization, translational activation, and translational silencing. Nova-1 and Nova-2 are nuclear RNA-binding proteins that regulate splicing. This group also contains plant proteins that seem to have two tandem repeat arrrangements, like Hen4, a protein that plays a role in AGAMOUS (AG) pre-mRNA processing and important step in plant development. In general, KH binds single-stranded RNA or DNA. It is found in a wide variety of proteins including ribosomal proteins, transcription factors and post-transcriptional modifiers of mRNA.
cd RRM 70aa 8e-09
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
cd KH-I 64aa 2e-08
in ref transcript
K homology RNA-binding domain, type I. KH binds single-stranded RNA or DNA. It is found in a wide variety of proteins including ribosomal proteins, transcription factors and post-transcriptional modifiers of mRNA. There are two different KH domains that belong to different protein folds, but they share a single KH motif. The KH motif is folded into a beta alpha alpha beta unit. In addition to the core, type II KH domains (e.g. ribosomal protein S3) include N-terminal extension and type I KH domains (e.g. hnRNP K) contain C-terminal extension.
cd KH-I 66aa 3e-08
in ref transcript
cd RRM 72aa 6e-07
in ref transcript
cd KH-I 63aa 5e-06
in ref transcript
smart KH 74aa 2e-08
in ref transcript
K homology RNA-binding domain.
smart KH 68aa 3e-08
in ref transcript
pfam KH_1 64aa 1e-07
in ref transcript
KH domain. KH motifs can bind RNA in vitro. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia.
pfam KH_1 63aa 4e-07
in ref transcript
smart RRM 66aa 5e-07
in ref transcript
RNA recognition motif.
TIGR PABP-1234 107aa 2e-06
in ref transcript
There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, and of unknown tissue range.
TIGR arCOG04150 132aa 2e-04
in ref transcript
This family of proteins is universal among the 41 archaeal genomes analyzed in and is not observed outside of the archaea. The proteins contain a single KH domain (pfam00013) which is likely to confer the ability to bind RNA.
PRK PRK13763 140aa 8e-05
in ref transcript
putative RNA-processing protein; Provisional.
COG COG0724 76aa 1e-04
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
PRK PRK12705 123aa 9e-04
in ref transcript
hypothetical protein; Provisional.
Changed!
cd PCBP_like_KH 56aa 8e-06
in modified transcript
IGLL1
refseq_IGLL1.F2 refseq_IGLL1.R2 225 341
NCBIGene 36.3 3543
Single exon skipping, size difference: 116
Exclusion in the protein causing a frameshift
Reference transcript: NM_020070
Changed!
cd IGc 95aa 5e-22
in ref transcript
Immunoglobulin domain constant region subfamily; members of the IGc subfamily are components of immunoglobulins, T-cell receptors, CD1 cell surface glycoproteins, secretory glycoproteins A/C, and Major Histocompatibility Complex (MHC) class I/II molecules. In immunoglobulins, each chain is composed of one variable domain (IGv) and one or more constant domains (IGc); these names reflect the fact that the variability in sequences is higher in the variable domain than in the constant domain. T-cell receptors form heterodimers, pairing two chains (alpha/beta or gamma/delta), each with a IGv and IGc domain. MHCs form heterodimers pairing two chains (alpha/beta or delta/epsilon), each with a MHC and IGc domain. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
Changed!
smart IGc1 75aa 2e-21
in ref transcript
Immunoglobulin C-Type.
IGSF3
refseq_IGSF3.F2 refseq_IGSF3.R2 334 394
NCBIGene 36.3 3321
Single exon skipping, size difference: 60
Exclusion in the protein (no frameshift)
Reference transcript: NM_001542
cd IGv 94aa 8e-04
in ref transcript
Immunoglobulin domain variable region (v) subfamily; members of the IGv subfamily are components of immunoglobulins and T-cell receptors. In immunoglobulins, each chain is composed of one variable domain (IGv) and one or more constant domains (IGc); these names reflect the fact that the variability in sequences is higher in the variable domain than in the constant domain. Within the variable domain, there are regions of even more variability called the hypervariable or complementarity-determining regions (CDRs) which are responsible for antigen binding. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IGv 86aa 8e-04
in ref transcript
pfam V-set 105aa 8e-08
in ref transcript
Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
pfam V-set 102aa 4e-05
in ref transcript
pfam V-set 102aa 8e-05
in ref transcript
pfam V-set 111aa 8e-05
in ref transcript
IHPK2
refseq_IHPK2.F1 refseq_IHPK2.R1 268 399
NCBIGene 36.2 51447
Single exon skipping, size difference: 131
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001005910
IKIP
refseq_IKIP.F1 refseq_IKIP.R1 253 371
NCBIGene 36.3 121457
Single exon skipping, size difference: 118
Exclusion in the protein causing a frameshift
Reference transcript: NM_201612
Changed!
TIGR SMC_prok_B 198aa 3e-06
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
Changed!
COG Smc 267aa 2e-04
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
IL17RC
refseq_IL17RC.F2 refseq_IL17RC.R2 138 183
NCBIGene 36.3 84818
Single exon skipping, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_153461
pfam SEFIR 145aa 1e-33
in ref transcript
SEFIR domain. This family comprises IL17 receptors (IL17Rs) and SEF proteins. The latter are feedback inhibitors of FGF signalling and are also thought to be receptors. Due to its similarity to the TIR domain (pfam01582), the SEFIR region is thought to be involved in homotypic interactions with other SEFIR/TIR-domain-containing proteins. Thus, SEFs and IL17Rs may be involved in TOLL/IL1R-like signalling pathways.
IL17RC
refseq_IL17RC.F3 refseq_IL17RC.R3 170 383
NCBIGene 36.3 84818
Alternative 5-prime, size difference: 213
Exclusion in the protein (no frameshift)
Reference transcript: NM_153461
pfam SEFIR 145aa 1e-33
in ref transcript
SEFIR domain. This family comprises IL17 receptors (IL17Rs) and SEF proteins. The latter are feedback inhibitors of FGF signalling and are also thought to be receptors. Due to its similarity to the TIR domain (pfam01582), the SEFIR region is thought to be involved in homotypic interactions with other SEFIR/TIR-domain-containing proteins. Thus, SEFs and IL17Rs may be involved in TOLL/IL1R-like signalling pathways.
IL17RE
refseq_IL17RE.F2 refseq_IL17RE.R2 211 284
NCBIGene 36.2 132014
Alternative 3-prime, size difference: 73
Inclusion in the protein causing a frameshift
Reference transcript: NM_144640
IL18BP
refseq_IL18BP.F1 refseq_IL18BP.R1 144 256
NCBIGene 36.3 10068
Single exon skipping, size difference: 112
Exclusion in 5'UTR
Reference transcript: NM_001039659
pfam Pox_vIL-18BP 100aa 2e-04
in ref transcript
Orthopoxvirus interleukin 18 binding protein. Interleukin-18 (IL-18) is a proinflammatory cytokine that plays a key role in the activation of natural killer and T helper 1 cell responses principally by inducing interferon-gamma (IFN-gamma). Several poxvirus genes encode proteins with sequence similarity to IL-18BPs. It has been shown that vaccinia, ectromelia and cowpox viruses secrete from infected cells a soluble IL-18BP (vIL-18BP) that may modulate the host antiviral response. The expression of vIL-18BPs by distinct poxvirus genera that cause local or general viral dissemination, or persistent or acute infections in the host, emphasises the importance of IL-18 in response to viral infections.
IL1F7
refseq_IL1F7.F1 refseq_IL1F7.R1 116 236
NCBIGene 36.3 27178
Single exon skipping, size difference: 120
Exclusion in the protein (no frameshift)
Reference transcript: NM_014439
Changed!
cd IL1 146aa 3e-11
in ref transcript
Interleukin-1 homologes; Cytokines with various biological functions. Interleukin 1 alpha and beta are also known as hematopoietin and catabolin. This family also contains interleukin-1 receptor antagonists (inhibitors).
Changed!
smart IL1 146aa 8e-12
in ref transcript
Interleukin-1 homologues. Cytokines with various biological functions. Interluekin 1 alpha and beta are also known as hematopoietin and catabolin.
Changed!
cd IL1 97aa 3e-11
in modified transcript
Changed!
smart IL1 97aa 4e-11
in modified transcript
IL1F7
refseq_IL1F7.F3 refseq_IL1F7.R3 195 258
NCBIGene 36.3 27178
Single exon skipping, size difference: 63
Exclusion in the protein (no frameshift)
Reference transcript: NM_014439
cd IL1 146aa 3e-11
in ref transcript
Interleukin-1 homologes; Cytokines with various biological functions. Interleukin 1 alpha and beta are also known as hematopoietin and catabolin. This family also contains interleukin-1 receptor antagonists (inhibitors).
smart IL1 146aa 8e-12
in ref transcript
Interleukin-1 homologues. Cytokines with various biological functions. Interluekin 1 alpha and beta are also known as hematopoietin and catabolin.
IL22RA2
refseq_IL22RA2.F1 refseq_IL22RA2.R1 154 324
NCBIGene 36.3 116379
Single exon skipping, size difference: 170
Exclusion in the protein causing a frameshift
Reference transcript: NM_052962
Changed!
pfam IL10Ra-bind 236aa 6e-15
in ref transcript
Interleukin-10 receptor alpha chain. IL10Ra-bind is the cell-membrane bound receptor for interleukin-10, a pleiotropic cytokine produced by subsets of activated T cells, B cells, and macrophages, with a broad spectrum of immunoregulation and anti-inflammatory effects. The activity of IL-10 is mediated through its binding with IL10Ra-bind.
Changed!
pfam IL10Ra-bind 128aa 4e-12
in modified transcript
IL22RA2
refseq_IL22RA2.F2 refseq_IL22RA2.R2 256 352
NCBIGene 36.3 116379
Single exon skipping, size difference: 96
Exclusion in the protein (no frameshift)
Reference transcript: NM_052962
Changed!
pfam IL10Ra-bind 236aa 6e-15
in ref transcript
Interleukin-10 receptor alpha chain. IL10Ra-bind is the cell-membrane bound receptor for interleukin-10, a pleiotropic cytokine produced by subsets of activated T cells, B cells, and macrophages, with a broad spectrum of immunoregulation and anti-inflammatory effects. The activity of IL-10 is mediated through its binding with IL10Ra-bind.
Changed!
pfam IL10Ra-bind 204aa 8e-20
in modified transcript
IL24
refseq_IL24.F1 refseq_IL24.R1 109 451
NCBIGene 36.3 11009
Multiple exon skipping, size difference: 342
Exclusion of the protein initiation site, Exclusion in the protein causing a frameshift
Reference transcript: NM_006850
IL28RA
refseq_IL28RA.F1 refseq_IL28RA.R1 255 386
NCBIGene 36.3 163702
Single exon skipping, size difference: 131
Exclusion in the protein causing a frameshift
Reference transcript: NM_170743
IL28RA
refseq_IL28RA.F4 refseq_IL28RA.R3 138 225
NCBIGene 36.3 163702
Alternative 3-prime, size difference: 87
Exclusion in the protein (no frameshift)
Reference transcript: NM_170743
IL32
refseq_IL32.F1 refseq_IL32.R1 110 137
NCBIGene 36.3 9235
Single exon skipping, size difference: 27
Exclusion in the protein (no frameshift)
Reference transcript: NM_004221
IL32
refseq_IL32.F3 refseq_IL32.R3 100 160
NCBIGene 36.3 9235
Single exon skipping, size difference: 60
Exclusion in the protein (no frameshift)
Reference transcript: NM_004221
IL4
refseq_IL4.F2 refseq_IL4.R2 232 280
NCBIGene 36.3 3565
Single exon skipping, size difference: 48
Exclusion in the protein (no frameshift)
Reference transcript: NM_000589
Changed!
pfam IL4 148aa 4e-53
in ref transcript
Interleukin 4.
Changed!
pfam IL4 132aa 2e-43
in modified transcript
IL5RA
refseq_IL5RA.F2 refseq_IL5RA.R2 140 209
NCBIGene 36.3 3568
Single exon skipping, size difference: 69
Exclusion in 5'UTR
Reference transcript: NM_000564
pfam Haemat_rec_S_F2 30aa 3e-08
in ref transcript
Haematopoeitin receptor binding domain short. This domain is a common signature for a number of receptors for lymphokines, haematopoietic growth factors and growth hormone-related molecules.
IL6R
refseq_IL6R.F2 refseq_IL6R.R2 277 371
NCBIGene 36.3 3570
Single exon skipping, size difference: 94
Exclusion in the protein causing a frameshift
Reference transcript: NM_000565
cd FN3 93aa 3e-05
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
pfam IL6Ra-bind 102aa 1e-45
in ref transcript
Interleukin-6 receptor alpha chain, binding. Members of this family adopt a structure consisting of an immunoglobulin-like beta-sandwich, with seven strands in two beta-sheets, in a Greek-key topology. They are required for binding to the cytokine Interleukin-6.
smart IGc2 59aa 4e-10
in ref transcript
Immunoglobulin C-2 Type.
smart FN3 86aa 1e-05
in ref transcript
Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
IL6ST
refseq_IL6ST.F1 refseq_IL6ST.R1 313 396
NCBIGene 36.3 3572
Single exon skipping, size difference: 83
Exclusion in the protein causing a frameshift
Reference transcript: NM_002184
cd FN3 100aa 8e-12
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Changed!
cd FN3 92aa 8e-06
in ref transcript
pfam Lep_receptor_Ig 91aa 1e-30
in ref transcript
Ig-like C2-type domain. This domain is a ligand-binding immunoglobulin-like domain. The two cysteine residues form a disulphide bridge.
Changed!
smart FN3 81aa 3e-09
in ref transcript
Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
pfam fn3 90aa 1e-08
in ref transcript
Fibronectin type III domain.
IMPDH1
refseq_IMPDH1.F1 refseq_IMPDH1.R1 159 267
NCBIGene 36.3 3614
Multiple exon skipping, size difference: 108
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_000883
cd IMPDH 242aa 1e-95
in ref transcript
IMPDH: The catalytic domain of the inosine monophosphate dehydrogenase. IMPDH catalyzes the NAD-dependent oxidation of inosine 5'-monophosphate (IMP) to xanthosine 5' monophosphate (XMP). It is a rate-limiting step in the de novo synthesis of the guanine nucleotides. There is often a CBS domain inserted in the middle of this domain, which is proposed to play a regulatory role. IMPDH is a key enzyme in the regulation of cell proliferation and differentiation. It has been identified as an attractive target for developing chemotherapeutic agents.
cd CBS_pair_IMPDH_2 116aa 4e-45
in ref transcript
This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the inosine 5' monophosphate dehydrogenase (IMPDH) protein. IMPDH is an essential enzyme that catalyzes the first step unique to GTP synthesis, playing a key role in the regulation of cell proliferation and differentiation. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain in IMPDH have been associated with retinitis pigmentosa.
cd IMPDH 81aa 3e-30
in ref transcript
pfam IMPDH 477aa 0.0
in ref transcript
IMP dehydrogenase / GMP reductase domain. This family is involved in biosynthesis of guanosine nucleotide. Members of this family contain a TIM barrel structure. In the inosine monophosphate dehydrogenases 2 CBS domains pfam00571 are inserted in the TIM barrel. This family is a member of the common phosphate binding site TIM barrel family.
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_198336
Changed!
pfam INSIG 184aa 3e-78
in ref transcript
Insulin-induced protein (INSIG). This family contains a number of eukaryotic Insulin-induced proteins (INSIG-1 and INSIG-2) approximately 200 residues long. INSIG-1 and INSIG-2 are found in the endoplasmic reticulum and bind the sterol-sensing domain of SREBP cleavage-activating protein (SCAP), preventing it from escorting SREBPs to the Golgi. Their combined action permits feedback regulation of cholesterol synthesis over a wide range of sterol concentrations.
Changed!
pfam INSIG 56aa 5e-14
in modified transcript
INSIG1
refseq_INSIG1.F4 refseq_INSIG1.R4 202 273
NCBIGene 36.3 3638
Single exon skipping, size difference: 71
Exclusion in the protein causing a frameshift
Reference transcript: NM_198336
pfam INSIG 184aa 3e-78
in ref transcript
Insulin-induced protein (INSIG). This family contains a number of eukaryotic Insulin-induced proteins (INSIG-1 and INSIG-2) approximately 200 residues long. INSIG-1 and INSIG-2 are found in the endoplasmic reticulum and bind the sterol-sensing domain of SREBP cleavage-activating protein (SCAP), preventing it from escorting SREBPs to the Golgi. Their combined action permits feedback regulation of cholesterol synthesis over a wide range of sterol concentrations.
IQCH
refseq_IQCH.F1 refseq_IQCH.R1 177 239
NCBIGene 36.3 64799
Single exon skipping, size difference: 62
Exclusion in the protein causing a frameshift
Reference transcript: NM_001031715
IQCH
refseq_IQCH.F3 refseq_IQCH.R3 223 329
NCBIGene 36.3 64799
Single exon skipping, size difference: 106
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001031715
IQCH
refseq_IQCH.F5 refseq_IQCH.R5 104 438
NCBIGene 36.3 64799
Multiple exon skipping, size difference: 334
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_001031715
IQWD1
refseq_IQWD1.F2 refseq_IQWD1.R2 281 341
NCBIGene 36.3 55827
Single exon skipping, size difference: 60
Exclusion in the protein (no frameshift)
Reference transcript: NM_018442
cd WD40 244aa 4e-19
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
cd WD40 78aa 7e-05
in ref transcript
smart WD40 36aa 0.002
in ref transcript
WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
COG COG2319 274aa 4e-09
in ref transcript
FOG: WD40 repeat [General function prediction only].
IRF7
refseq_IRF7.F1 refseq_IRF7.R1 114 201
NCBIGene 36.3 3665
Single exon skipping, size difference: 87
Exclusion in the protein (no frameshift)
Reference transcript: NM_004031
cd IRF 114aa 2e-25
in ref transcript
Interferon Regulatory Factor (IRF); also known as tryptophan pentad repeat. The family of IRF transcription factors is important in the regulation of interferons in response to infection by virus and in the regulation of interferon-inducible genes. The IRF family is characterized by a unique 'tryptophan cluster' DNA-binding region. Viral IRFs bind to cellular IRFs; block type I and II interferons and host IRF-mediated transcriptional activation.
pfam IRF-3 181aa 5e-50
in ref transcript
Interferon-regulatory factor 3. This is the interferon-regulatory factor 3 chain of the hetero-dimeric structure which also contains the shorter chain CREB-binding protein. These two subunits make up the DRAF1 (double-stranded RNA-activated factor 1). Viral dsRNA produced during viral transcription or replication leads to the activation of DRAF1. The DNA-binding specificity of DRAF1 correlates with transcriptional induction of ISG (interferon-alpha,beta-stimulated gene). IRF-3 preexists in the cytoplasm of uninfected cells and translocates to the nucleus following viral infection. Translocation of IRF-3 is accompanied by an increase in serine and threonine phosphorylation, and association with the CREB coactivator occurs only after infection.
smart IRF 114aa 2e-29
in ref transcript
interferon regulatory factor. interferon regulatory factor, also known as trytophan pentad repeat.
ITGA3
refseq_ITGA3.F1 refseq_ITGA3.R1 174 316
NCBIGene 36.3 3675
Single exon skipping, size difference: 142
Inclusion in the protein causing a new stop codon
Reference transcript: NM_005501
pfam Integrin_alpha2 455aa 7e-78
in ref transcript
Integrin alpha. This domain is found in integrin alpha and integrin alpha precursors to the C terminus of a number of pfam01839 repeats and to the N-terminus of the pfam00357 cytoplasmic region.
smart Int_alpha 53aa 4e-11
in ref transcript
Integrin alpha (beta-propellor repeats). Integrins are cell adhesion molecules that mediate cell-extracellular matrix and cell-cell interactions. They contain both alpha and beta subunits. Alpha integrins are proposed to contain a domain containing a 7-fold repeat that adopts a beta-propellor fold. Some of these domains contain an inserted von Willebrand factor type-A domain. Some repeats contain putative calcium-binding sites. The 7-fold repeat domain is homologous to a similar domain in phosphatidylinositol-glycan-specific phospholipase D.
smart Int_alpha 49aa 1e-08
in ref transcript
smart Int_alpha 61aa 1e-05
in ref transcript
ITGB1BP1
refseq_ITGB1BP1.F2 refseq_ITGB1BP1.R2 251 401
NCBIGene 36.3 9270
Single exon skipping, size difference: 150
Exclusion in the protein (no frameshift)
Reference transcript: NM_004763
Changed!
cd PTB 130aa 0.003
in ref transcript
Phosphotyrosine-binding (PTB) domain; PTB domains have a PH-like fold and are found in various eukaryotic signaling molecules. They were initially identified based upon their ability to recognize phosphorylated tyrosine residues. In contrast to SH2 domains, which recognize phosphotyrosine and adjacent carboxy-terminal residues, PTB-domain binding specificity is conferred by residues amino-terminal to the phosphotyrosine. The PTB domain of SHC binds to a NPXpY sequence. More recent studies have found that some types of PTB domains such as the neuronal protein X11 and in the cell-fate determinant protein Numb can bind to peptides which are not tyrosine phosphorylated; whereas, other PTB domains can bind motifs lacking tyrosine residues altogether.
Changed!
pfam ICAP-1_inte_bdg 171aa 2e-81
in ref transcript
Beta-1 integrin binding protein. ICAP-1 is a serine/threonine-rich protein that binds to the cytoplasmic domains of beta-1 integrins in a highly specific manner, binding to a NPXY sequence motif on the beta-1 integrin. The cytoplasmic domains of integrins are essential for cell adhesion, and the fact that phosphorylation of ICAP-1 by interaction with the cell-matrix implies an important role of ICAP-1 during integrin-dependent cell adhesion. Overexpression of ICAP-1 strongly reduces the integrin-mediated cell spreading on extracellular matrix and inhibits both Cdc42 and Rac1. In addition, ICAP-1 induces release of Cdc42 from cellular membranes and prevents the dissociation of GDP from this GTPase. An additional function of ICAP-1 is to promote differentiation of osteoprogenitors by supporting their condensation through modulating the integrin high affinity state,.
Changed!
pfam ICAP-1_inte_bdg 121aa 1e-51
in modified transcript
ITGB1BP3
refseq_ITGB1BP3.F1 refseq_ITGB1BP3.R1 222 354
NCBIGene 36.2 27231
Alternative 5-prime and 3-prime, size difference: 132
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_170678
Changed!
cd NRK1 161aa 9e-69
in ref transcript
Nicotinamide riboside kinase (NRK) is an enzyme involved in the metabolism of nicotinamide adenine dinucleotide (NAD+). This enzyme catalyzes the phosphorylation of nicotinamide riboside (NR) to form nicotinamide mononucleotide (NMN). It defines the NR salvage pathway of NAD+ biosynthesis in addition to the pathways through nicotinic acid mononucleotide (NaMN). This enzyme can also phosphorylate the anticancer drug tiazofurin, which is an analog of nicotinamide riboside.
Changed!
COG Udk 153aa 3e-15
in ref transcript
Uridine kinase [Nucleotide transport and metabolism].
Changed!
cd NRK1 177aa 7e-48
in modified transcript
Changed!
TIGR udk 45aa 0.009
in modified transcript
Model contains a number of longer eukaryotic proteins and starts bringing in phosphoribulokinase hits at scores of 160 and below.
Changed!
COG Udk 141aa 6e-10
in modified transcript
ITGB4
refseq_ITGB4.F1 refseq_ITGB4.R1 203 362
NCBIGene 36.3 3691
Single exon skipping, size difference: 159
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_000213
cd FN3 91aa 2e-13
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd FN3 94aa 1e-11
in ref transcript
cd FN3 94aa 6e-10
in ref transcript
cd FN3 88aa 6e-09
in ref transcript
pfam Integrin_beta 417aa 1e-180
in ref transcript
Integrin, beta chain. Integrins have been found in animals and their homologues have also been found in cyanobacteria, probably due to horizontal gene transfer. The sequences repeats have been trimmed due to an overlap with EGF.
pfam Integrin_B_tail 86aa 7e-21
in ref transcript
Integrin beta tail domain. This is the beta tail domain of the Integrin protein. Integrins are receptors which are involved in cell-cell and cell-extracellular matrix interactions.
pfam fn3 84aa 5e-17
in ref transcript
Fibronectin type III domain.
pfam fn3 87aa 8e-16
in ref transcript
smart Calx_beta 64aa 2e-15
in ref transcript
Domains in Na-Ca exchangers and integrin-beta4. Domain in Na-Ca exchangers and integrin subunit beta4 (and some cyanobacterial proteins).
pfam fn3 82aa 5e-12
in ref transcript
pfam fn3 90aa 1e-10
in ref transcript
ITGB4
refseq_ITGB4.F3 refseq_ITGB4.R3 131 341
NCBIGene 36.3 3691
Single exon skipping, size difference: 210
Exclusion in the protein (no frameshift)
Reference transcript: NM_000213
cd FN3 91aa 2e-13
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd FN3 94aa 1e-11
in ref transcript
cd FN3 94aa 6e-10
in ref transcript
cd FN3 88aa 6e-09
in ref transcript
pfam Integrin_beta 417aa 1e-180
in ref transcript
Integrin, beta chain. Integrins have been found in animals and their homologues have also been found in cyanobacteria, probably due to horizontal gene transfer. The sequences repeats have been trimmed due to an overlap with EGF.
pfam Integrin_B_tail 86aa 7e-21
in ref transcript
Integrin beta tail domain. This is the beta tail domain of the Integrin protein. Integrins are receptors which are involved in cell-cell and cell-extracellular matrix interactions.
pfam fn3 84aa 5e-17
in ref transcript
Fibronectin type III domain.
pfam fn3 87aa 8e-16
in ref transcript
smart Calx_beta 64aa 2e-15
in ref transcript
Domains in Na-Ca exchangers and integrin-beta4. Domain in Na-Ca exchangers and integrin subunit beta4 (and some cyanobacterial proteins).
pfam fn3 82aa 5e-12
in ref transcript
pfam fn3 90aa 1e-10
in ref transcript
EIF6
refseq_ITGB4BP.F1 refseq_ITGB4BP.R1 215 391
NCBIGene 36.3 3692
Single exon skipping, size difference: 176
Exclusion in the protein causing a frameshift
Reference transcript: NM_002212
Changed!
cd IF6 222aa 1e-90
in ref transcript
Ribosome anti-association factor IF6 binds the large ribosomal subunit and prevents the two subunits from associating during translation initiation. IF6 comprises a family of translation factors that includes both eukaryotic (eIF6) and archeal (aIF6) members. All members of this family have a conserved pentameric fold referred to as a beta/alpha propeller. The eukaryotic IF6 members have a moderately conserved C-terminal extension which is not required for ribosomal binding, and may have an alternative function.
Changed!
smart eIF6 201aa 9e-90
in ref transcript
translation initiation factor 6.
Changed!
PTZ PTZ00136 244aa 1e-118
in ref transcript
Changed!
smart eIF6 65aa 2e-22
in modified transcript
Changed!
PTZ PTZ00136 68aa 6e-27
in modified transcript
ITM2C
refseq_ITM2C.F1 refseq_ITM2C.R1 248 359
NCBIGene 36.3 81618
Single exon skipping, size difference: 111
Exclusion in the protein (no frameshift)
Reference transcript: NM_030926
Changed!
pfam BRICHOS 93aa 6e-28
in ref transcript
BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer. Its exact function is unknown; roles that have been proposed for it include (a) in targeting of the protein to the secretory pathway, (b) intramolecular chaperone-like function, and (c) assisting the specialised intracellular protease processing system.
Changed!
pfam BRICHOS 56aa 5e-08
in modified transcript
ITM2C
refseq_ITM2C.F3 refseq_ITM2C.R3 149 290
NCBIGene 36.3 81618
Single exon skipping, size difference: 141
Exclusion in the protein (no frameshift)
Reference transcript: NM_030926
pfam BRICHOS 93aa 6e-28
in ref transcript
BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer. Its exact function is unknown; roles that have been proposed for it include (a) in targeting of the protein to the secretory pathway, (b) intramolecular chaperone-like function, and (c) assisting the specialised intracellular protease processing system.
ITPA
refseq_ITPA.F1 refseq_ITPA.R1 146 197
NCBIGene 36.3 3704
Alternative 5-prime, size difference: 51
Exclusion in the protein (no frameshift)
Reference transcript: NM_033453
Changed!
cd HAM1 179aa 1e-56
in ref transcript
NTPase/HAM1. This family consists of the HAM1 protein and pyrophosphate-releasing xanthosine/ inosine triphosphatase. HAM1 protects the cell against mutagenesis by the base analog 6-N-hydroxylaminopurine (HAP) in E. Coli and S. cerevisiae. A Ham1-related protein from Methanococcus jannaschii is a novel NTPase that has been shown to hydrolyze nonstandard nucleotides such as XTP to XMP and ITP to IMP, but not the standard nucleotides, in the presence of Mg or Mn ions. The enzyme exists as a homodimer. The HAM1 protein may be acting as an NTPase by hydrolyzing the HAP triphosphate.
Changed!
pfam Ham1p_like 179aa 2e-39
in ref transcript
Ham1 family. This family consists of the HAM1 protein and hypothetical archaeal bacterial and Caenorhabditis elegans proteins. HAM1 controls 6-N-hydroxylaminopurine (HAP) sensitivity and mutagenesis in Saccharomyces cerevisiae HAM1. The HAM1 protein protects the cell from HAP, either on the level of deoxynucleoside triphosphate or the DNA level by a yet unidentified set of reactions.
Changed!
COG COG0127 187aa 1e-42
in ref transcript
Xanthosine triphosphate pyrophosphatase [Nucleotide transport and metabolism].
Changed!
cd HAM1 166aa 2e-50
in modified transcript
Changed!
pfam Ham1p_like 165aa 8e-35
in modified transcript
Changed!
COG COG0127 170aa 3e-37
in modified transcript
IVNS1ABP
refseq_IVNS1ABP.F2 refseq_IVNS1ABP.R2 110 251
NCBIGene 36.2 10625
Single exon skipping, size difference: 141
Inclusion in the protein causing a new stop codon
Reference transcript: NM_006469
pfam BTB 105aa 2e-22
in ref transcript
BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of pox virus proteins. The BTB/POZ domain mediates homomeric dimerisation and in some instances heteromeric dimerisation. The structure of the dimerised PLZF BTB/POZ domain has been solved and consists of a tightly intertwined homodimer. The central scaffolding of the protein is made up of a cluster of alpha-helices flanked by short beta-sheets at both the top and bottom of the molecule. POZ domains from several zinc finger proteins have been shown to mediate transcriptional repression and to interact with components of histone deacetylase co-repressor complexes including N-CoR and SMRT. The POZ or BTB domain is also known as BR-C/Ttk or ZiN.
Changed!
smart BACK 87aa 2e-13
in ref transcript
BTB And C-terminal Kelch. The BACK domain is found juxtaposed to the BTB domain; they are separated by as little as two residues.
Changed!
smart Kelch 46aa 3e-11
in ref transcript
Kelch domain.
Changed!
smart Kelch 47aa 5e-11
in ref transcript
Changed!
pfam Kelch_1 41aa 4e-10
in ref transcript
Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that the Drosophila ring canal kelch protein is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.
Changed!
smart Kelch 47aa 1e-09
in ref transcript
Changed!
pfam Kelch_1 47aa 8e-08
in ref transcript
Changed!
smart Kelch 48aa 2e-07
in ref transcript
Changed!
TIGR muta_rot_YjhT 179aa 9e-06
in ref transcript
Members of this protein family contain multiple copies of the beta-propeller-forming Kelch repeat. All are full-length homologs to YjhT of Escherichia coli, which has been identified as a mutarotase for sialic acid. This protein improves bacterial ability to obtain host sialic acid, and thus serves as a virulence factor. Some bacteria carry what appears to be a cyclically permuted homolog of this protein.
Changed!
COG COG3055 81aa 6e-05
in ref transcript
Uncharacterized protein conserved in bacteria [Function unknown].
Changed!
COG COG3055 171aa 0.001
in ref transcript
Changed!
smart BACK 87aa 1e-11
in modified transcript
JAG2
refseq_JAG2.F2 refseq_JAG2.R2 100 214
NCBIGene 36.3 3714
Single exon skipping, size difference: 114
Exclusion in the protein (no frameshift)
Reference transcript: NM_002226
cd EGF_CA 39aa 5e-04
in ref transcript
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
cd EGF_CA 37aa 0.002
in ref transcript
cd EGF_CA 37aa 0.003
in ref transcript
Changed!
cd EGF_CA 36aa 0.005
in ref transcript
pfam MNNL 81aa 2e-22
in ref transcript
N terminus of Notch ligand. This entry represents a region of conserved sequence at the N terminus of several Notch ligand proteins.
pfam DSL 63aa 4e-21
in ref transcript
Delta serrate ligand.
smart VWC_out 72aa 1e-12
in ref transcript
von Willebrand factor (vWF) type C domain.
smart EGF_CA 39aa 4e-04
in ref transcript
Calcium-binding EGF-like domain.
smart EGF_CA 37aa 6e-04
in ref transcript
Changed!
smart EGF_CA 36aa 0.002
in ref transcript
smart EGF_CA 37aa 0.004
in ref transcript
Changed!
cd EGF_CA 36aa 0.002
in modified transcript
Changed!
smart EGF_CA 36aa 6e-04
in modified transcript
KLHDC9
refseq_KARCA1.F1 refseq_KARCA1.R1 100 119
NCBIGene 36.3 126823
Alternative 3-prime, size difference: 19
Inclusion in the protein causing a frameshift
Reference transcript: NM_152366
CCBL2
refseq_KAT3.F2 refseq_KAT3.R2 164 264
NCBIGene 36.3 56267
Single exon skipping, size difference: 100
Exclusion of the protein initiation site
Reference transcript: NM_001008661
cd AAT_like 383aa 6e-78
in ref transcript
Aspartate aminotransferase family. This family belongs to pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I). Pyridoxal phosphate combines with an alpha-amino acid to form a compound called a Schiff base or aldimine intermediate, which depending on the reaction, is the substrate in four kinds of reactions (1) transamination (movement of amino groups), (2) racemization (redistribution of enantiomers), (3) decarboxylation (removing COOH groups), and (4) various side-chain reactions depending on the enzyme involved. Pyridoxal phosphate (PLP) dependent enzymes were previously classified into alpha, beta and gamma classes, based on the chemical characteristics (carbon atom involved) of the reaction they catalyzed. The availability of several structures allowed a comprehensive analysis of the evolutionary classification of PLP dependent enzymes, and it was found that the functional classification did not always agree with the evolutionary history of these enzymes. The major groups in this CD corresponds to Aspartate aminotransferase a, b and c, Tyrosine, Alanine, Aromatic-amino-acid, Glutamine phenylpyruvate, 1-Aminocyclopropane-1-carboxylate synthase, Histidinol-phosphate, gene products of malY and cobC, Valine-pyruvate aminotransferase and Rhizopine catabolism regulatory protein.
pfam Aminotran_1_2 368aa 3e-44
in ref transcript
Aminotransferase class I and II.
PRK PRK08912 391aa 2e-97
in ref transcript
hypothetical protein; Provisional.
KBTBD3
refseq_KBTBD3.F2 refseq_KBTBD3.R2 125 326
NCBIGene 36.3 143879
Single exon skipping, size difference: 201
Exclusion in 5'UTR
Reference transcript: NM_198439
pfam BACK 102aa 1e-25
in ref transcript
BTB And C-terminal Kelch. This domain is found associated with pfam00651 and pfam01344. The BACK domain is found juxtaposed to the BTB domain; they are separated by as little as two residues. This family appears to be closely related to the BTB domain (Finn RD, personal observation).
pfam BTB 108aa 3e-17
in ref transcript
BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of pox virus proteins. The BTB/POZ domain mediates homomeric dimerisation and in some instances heteromeric dimerisation. The structure of the dimerised PLZF BTB/POZ domain has been solved and consists of a tightly intertwined homodimer. The central scaffolding of the protein is made up of a cluster of alpha-helices flanked by short beta-sheets at both the top and bottom of the molecule. POZ domains from several zinc finger proteins have been shown to mediate transcriptional repression and to interact with components of histone deacetylase co-repressor complexes including N-CoR and SMRT. The POZ or BTB domain is also known as BR-C/Ttk or ZiN.
pfam Kelch_1 50aa 3e-04
in ref transcript
Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that the Drosophila ring canal kelch protein is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.
pfam Kelch_1 44aa 0.002
in ref transcript
KBTBD4
refseq_KBTBD4.F1 refseq_KBTBD4.R1 274 383
NCBIGene 36.3 55709
Alternative 5-prime and 3-prime, size difference: 109
Exclusion in 5'UTR, Exclusion in 5'UTR
Reference transcript: NM_016506
pfam BTB 104aa 3e-22
in ref transcript
BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of pox virus proteins. The BTB/POZ domain mediates homomeric dimerisation and in some instances heteromeric dimerisation. The structure of the dimerised PLZF BTB/POZ domain has been solved and consists of a tightly intertwined homodimer. The central scaffolding of the protein is made up of a cluster of alpha-helices flanked by short beta-sheets at both the top and bottom of the molecule. POZ domains from several zinc finger proteins have been shown to mediate transcriptional repression and to interact with components of histone deacetylase co-repressor complexes including N-CoR and SMRT. The POZ or BTB domain is also known as BR-C/Ttk or ZiN.
smart BACK 83aa 6e-12
in ref transcript
BTB And C-terminal Kelch. The BACK domain is found juxtaposed to the BTB domain; they are separated by as little as two residues.
KCNAB2
refseq_KCNAB2.F2 refseq_KCNAB2.R2 100 142
NCBIGene 36.3 8514
Single exon skipping, size difference: 42
Exclusion in the protein (no frameshift)
Reference transcript: NM_003636
Changed!
cd Aldo_ket_red 316aa 8e-76
in ref transcript
Aldo-keto reductases (AKRs) are a superfamily of soluble NAD(P)(H) oxidoreductases whose chief purpose is to reduce aldehydes and ketones to primary and secondary alcohols. AKRs are present in all phyla and are of importance to both health and industrial applications. Members have very distinct functions and include the prokaryotic 2,5-diketo-D-gluconic acid reductases and beta-keto ester reductases, the eukaryotic aldose reductases, aldehyde reductases, hydroxysteroid dehydrogenases, steroid 5beta-reductases, potassium channel beta-subunits and aflatoxin aldehyde reductases, among others.
Changed!
TIGR Kv_beta 317aa 1e-179
in ref transcript
Plant beta subunits and their closely related bacterial homologs (in Deinococcus radiudurans, Xylella fastidiosa, etc.) appear more closely related to each other than to animal forms. However, the bacterial species lack convincing counterparts the Kv alpha subunit and the Kv beta homolog may serve as an enzyme. Cutoffs are set for this model such that yeast and plant forms and bacterial close homologs score between trusted and noise cutoffs.
Changed!
COG Tas 321aa 2e-67
in ref transcript
Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) [Energy production and conversion].
Changed!
cd Aldo_ket_red 316aa 1e-75
in modified transcript
Changed!
TIGR Kv_beta 317aa 1e-179
in modified transcript
Changed!
COG Tas 323aa 8e-68
in modified transcript
KCNC2
refseq_KCNC2.F2 refseq_KCNC2.R2 125 290
NCBIGene 36.3 3747
Single exon skipping, size difference: 165
Exclusion in the protein (no frameshift)
Reference transcript: NM_139136
pfam K_tetra 51aa 6e-19
in ref transcript
K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerisation domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly related to the BTB/POZ domain pfam00651.
pfam Ion_trans 189aa 2e-16
in ref transcript
Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
smart BTB 143aa 5e-07
in ref transcript
Broad-Complex, Tramtrack and Bric a brac. Domain in Broad-Complex, Tramtrack and Bric a brac. Also known as POZ (poxvirus and zinc finger) domain. Known to be a protein-protein interaction motif found at the N-termini of several C2H2-type transcription factors as well as Shaw-type potassium channels. Known structure reveals a tightly intertwined dimer formed via interactions between N-terminal strand and helix structures. However in a subset of BTB/POZ domains, these two secondary structures appear to be missing. Be aware SMART predicts BTB/POZ domains without the beta1- and alpha1-secondary structures.
PRK PRK10537 25aa 0.001
in ref transcript
voltage-gated potassium channel; Provisional.
KCND3
refseq_KCND3.F1 refseq_KCND3.R1 204 261
NCBIGene 36.3 3752
Single exon skipping, size difference: 57
Exclusion in the protein (no frameshift)
Reference transcript: NM_004980
pfam K_tetra 90aa 5e-33
in ref transcript
K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerisation domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly related to the BTB/POZ domain pfam00651.
pfam Ion_trans 162aa 6e-27
in ref transcript
Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
PRK PRK10537 87aa 6e-09
in ref transcript
voltage-gated potassium channel; Provisional.
KCNG3
refseq_KCNG3.F1 refseq_KCNG3.R1 134 167
NCBIGene 36.3 170850
Alternative 5-prime, size difference: 33
Exclusion in the protein (no frameshift)
Reference transcript: NM_133329
pfam Ion_trans 186aa 2e-25
in ref transcript
Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
pfam K_tetra 101aa 4e-19
in ref transcript
K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerisation domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly related to the BTB/POZ domain pfam00651.
PRK PRK10537 51aa 5e-06
in ref transcript
voltage-gated potassium channel; Provisional.
KCNH1
refseq_KCNH1.F1 refseq_KCNH1.R1 290 371
NCBIGene 36.3 3756
Alternative 5-prime, size difference: 81
Exclusion in the protein (no frameshift)
Reference transcript: NM_172362
cd CAP_ED 110aa 5e-17
in ref transcript
effector domain of the CAP family of transcription factors; members include CAP (or cAMP receptor protein (CRP)), which binds cAMP, FNR (fumarate and nitrate reduction), which uses an iron-sulfur cluster to sense oxygen) and CooA, a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. Cyclic nucleotide-binding domain similar to CAP are also present in cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) and vertebrate cyclic nucleotide-gated ion-channels. Cyclic nucleotide-monophosphate binding domain; proteins that bind cyclic nucleotides (cAMP or cGMP) share a structural domain of about 120 residues; the best studied is the prokaryotic catabolite gene activator, CAP, where such a domain is known to be composed of three alpha-helices and a distinctive eight-stranded, antiparallel beta-barrel structure; three conserved glycine residues are thought to be essential for maintenance of the structural integrity of the beta-barrel; CooA is a homodimeric transcription factor that belongs to CAP family; cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) contain two tandem copies of the cyclic nucleotide-binding domain; cAPK's are composed of two different subunits, a catalytic chain and a regulatory chain, which contains both copies of the domain; cGPK's are single chain enzymes that include the two copies of the domain in their N-terminal section; also found in vertebrate cyclic nucleotide-gated ion-channels.
cd PAS 95aa 8e-07
in ref transcript
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.
smart cNMP 115aa 1e-15
in ref transcript
Cyclic nucleotide-monophosphate binding domain. Catabolite gene activator protein (CAP) is a prokaryotic homologue of eukaryotic cNMP-binding domains, present in ion channels, and cNMP-dependent kinases.
Changed!
pfam Ion_trans_2 60aa 3e-09
in ref transcript
Ion channel. This family includes the two membrane helix type ion channels found in bacteria.
Changed!
pfam Ion_trans 120aa 4e-07
in ref transcript
Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
smart PAC 42aa 9e-06
in ref transcript
Motif C-terminal to PAS motifs (likely to contribute to PAS structural domain). PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold.
TIGR sensory_box 123aa 2e-05
in ref transcript
The PAS domain was previously described. This sensory box, or S-box domain occupies the central portion of the PAS domain but is more widely distributed. It is often tandemly repeated. Known prosthetic groups bound in the S-box domain include heme in the oxygen sensor FixL, FAD in the redox potential sensor NifL, and a 4-hydroxycinnamyl chromophore in photoactive yellow protein. Proteins containing the domain often contain other regulatory domains such as response regulator or sensor histidine kinase domains. Other S-box proteins include phytochromes and the aryl hydrocarbon receptor nuclear translocator.
PRK PRK13558 100aa 7e-13
in ref transcript
bacterio-opsin activator; Provisional.
COG Crp 121aa 5e-09
in ref transcript
cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms].
PRK PRK10537 49aa 0.001
in ref transcript
voltage-gated potassium channel; Provisional.
Changed!
pfam Ion_trans 185aa 4e-10
in modified transcript
KCNH5
refseq_KCNH5.F1 refseq_KCNH5.R1 205 402
NCBIGene 36.3 27133
Single exon skipping, size difference: 197
Exclusion in the protein causing a frameshift
Reference transcript: NM_139318
Changed!
cd CAP_ED 110aa 2e-16
in ref transcript
effector domain of the CAP family of transcription factors; members include CAP (or cAMP receptor protein (CRP)), which binds cAMP, FNR (fumarate and nitrate reduction), which uses an iron-sulfur cluster to sense oxygen) and CooA, a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. Cyclic nucleotide-binding domain similar to CAP are also present in cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) and vertebrate cyclic nucleotide-gated ion-channels. Cyclic nucleotide-monophosphate binding domain; proteins that bind cyclic nucleotides (cAMP or cGMP) share a structural domain of about 120 residues; the best studied is the prokaryotic catabolite gene activator, CAP, where such a domain is known to be composed of three alpha-helices and a distinctive eight-stranded, antiparallel beta-barrel structure; three conserved glycine residues are thought to be essential for maintenance of the structural integrity of the beta-barrel; CooA is a homodimeric transcription factor that belongs to CAP family; cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) contain two tandem copies of the cyclic nucleotide-binding domain; cAPK's are composed of two different subunits, a catalytic chain and a regulatory chain, which contains both copies of the domain; cGPK's are single chain enzymes that include the two copies of the domain in their N-terminal section; also found in vertebrate cyclic nucleotide-gated ion-channels.
cd PAS 95aa 9e-06
in ref transcript
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.
Changed!
smart cNMP 100aa 4e-13
in ref transcript
Cyclic nucleotide-monophosphate binding domain. Catabolite gene activator protein (CAP) is a prokaryotic homologue of eukaryotic cNMP-binding domains, present in ion channels, and cNMP-dependent kinases.
Changed!
pfam Ion_trans_2 75aa 7e-10
in ref transcript
Ion channel. This family includes the two membrane helix type ion channels found in bacteria.
pfam Ion_trans 184aa 7e-07
in ref transcript
Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
pfam PAS 92aa 1e-05
in ref transcript
PAS fold. The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs. The PAS fold appears in archaea, eubacteria and eukarya.
PRK PRK13558 100aa 4e-12
in ref transcript
bacterio-opsin activator; Provisional.
Changed!
COG Crp 215aa 2e-08
in ref transcript
cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms].
PRK PRK10537 52aa 1e-04
in ref transcript
voltage-gated potassium channel; Provisional.
Changed!
cd CAP_ED 52aa 2e-06
in modified transcript
Changed!
pfam Ion_trans_2 77aa 3e-08
in modified transcript
Changed!
smart cNMP 57aa 3e-07
in modified transcript
KCNH6
refseq_KCNH6.F1 refseq_KCNH6.R1 189 297
NCBIGene 36.3 81033
Single exon skipping, size difference: 108
Exclusion in the protein (no frameshift)
Reference transcript: NM_030779
cd CAP_ED 111aa 4e-17
in ref transcript
effector domain of the CAP family of transcription factors; members include CAP (or cAMP receptor protein (CRP)), which binds cAMP, FNR (fumarate and nitrate reduction), which uses an iron-sulfur cluster to sense oxygen) and CooA, a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. Cyclic nucleotide-binding domain similar to CAP are also present in cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) and vertebrate cyclic nucleotide-gated ion-channels. Cyclic nucleotide-monophosphate binding domain; proteins that bind cyclic nucleotides (cAMP or cGMP) share a structural domain of about 120 residues; the best studied is the prokaryotic catabolite gene activator, CAP, where such a domain is known to be composed of three alpha-helices and a distinctive eight-stranded, antiparallel beta-barrel structure; three conserved glycine residues are thought to be essential for maintenance of the structural integrity of the beta-barrel; CooA is a homodimeric transcription factor that belongs to CAP family; cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) contain two tandem copies of the cyclic nucleotide-binding domain; cAPK's are composed of two different subunits, a catalytic chain and a regulatory chain, which contains both copies of the domain; cGPK's are single chain enzymes that include the two copies of the domain in their N-terminal section; also found in vertebrate cyclic nucleotide-gated ion-channels.
cd PAS 92aa 2e-06
in ref transcript
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.
smart cNMP 100aa 2e-16
in ref transcript
Cyclic nucleotide-monophosphate binding domain. Catabolite gene activator protein (CAP) is a prokaryotic homologue of eukaryotic cNMP-binding domains, present in ion channels, and cNMP-dependent kinases.
pfam Ion_trans 177aa 2e-10
in ref transcript
Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
pfam PAS 92aa 5e-07
in ref transcript
PAS fold. The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs. The PAS fold appears in archaea, eubacteria and eukarya.
PRK PRK13559 95aa 5e-14
in ref transcript
hypothetical protein; Provisional.
COG Crp 121aa 2e-09
in ref transcript
cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms].
PRK PRK10537 73aa 1e-04
in ref transcript
voltage-gated potassium channel; Provisional.
KCNH7
refseq_KCNH7.F2 refseq_KCNH7.R2 119 140
NCBIGene 36.3 90134
Single exon skipping, size difference: 21
Exclusion in the protein (no frameshift)
Reference transcript: NM_033272
cd CAP_ED 111aa 1e-17
in ref transcript
effector domain of the CAP family of transcription factors; members include CAP (or cAMP receptor protein (CRP)), which binds cAMP, FNR (fumarate and nitrate reduction), which uses an iron-sulfur cluster to sense oxygen) and CooA, a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. Cyclic nucleotide-binding domain similar to CAP are also present in cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) and vertebrate cyclic nucleotide-gated ion-channels. Cyclic nucleotide-monophosphate binding domain; proteins that bind cyclic nucleotides (cAMP or cGMP) share a structural domain of about 120 residues; the best studied is the prokaryotic catabolite gene activator, CAP, where such a domain is known to be composed of three alpha-helices and a distinctive eight-stranded, antiparallel beta-barrel structure; three conserved glycine residues are thought to be essential for maintenance of the structural integrity of the beta-barrel; CooA is a homodimeric transcription factor that belongs to CAP family; cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) contain two tandem copies of the cyclic nucleotide-binding domain; cAPK's are composed of two different subunits, a catalytic chain and a regulatory chain, which contains both copies of the domain; cGPK's are single chain enzymes that include the two copies of the domain in their N-terminal section; also found in vertebrate cyclic nucleotide-gated ion-channels.
cd PAS 90aa 3e-07
in ref transcript
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.
smart cNMP 115aa 3e-18
in ref transcript
Cyclic nucleotide-monophosphate binding domain. Catabolite gene activator protein (CAP) is a prokaryotic homologue of eukaryotic cNMP-binding domains, present in ion channels, and cNMP-dependent kinases.
pfam Ion_trans 177aa 2e-11
in ref transcript
Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
pfam PAS 89aa 7e-05
in ref transcript
PAS fold. The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs. The PAS fold appears in archaea, eubacteria and eukarya.
PRK PRK13557 80aa 8e-11
in ref transcript
histidine kinase; Provisional.
COG Crp 144aa 1e-09
in ref transcript
cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms].
PRK PRK10537 80aa 8e-05
in ref transcript
voltage-gated potassium channel; Provisional.
KCNIP1
refseq_KCNIP1.F2 refseq_KCNIP1.R2 172 205
NCBIGene 36.3 30820
Single exon skipping, size difference: 33
Exclusion in the protein (no frameshift)
Reference transcript: NM_001034837
cd EFh 72aa 3e-08
in ref transcript
EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
cd EFh 63aa 6e-06
in ref transcript
smart EFh 29aa 0.002
in ref transcript
EF-hand, calcium binding motif. EF-hands are calcium-binding motifs that occur at least in pairs. Links between disease states and genes encoding EF-hands, particularly the S100 subclass, are emerging. Each motif consists of a 12 residue loop flanked on either side by a 12 residue alpha-helix. EF-hands undergo a conformational change unpon binding calcium ions.
COG FRQ1 167aa 5e-19
in ref transcript
Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].
KCNIP2
refseq_KCNIP2.F1 refseq_KCNIP2.R1 121 229
NCBIGene 36.3 30819
Single exon skipping, size difference: 108
Exclusion in the protein (no frameshift)
Reference transcript: NM_014591
Changed!
cd EFh 63aa 6e-08
in ref transcript
EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
Changed!
cd EFh 72aa 1e-07
in ref transcript
Changed!
smart EH 87aa 7e-04
in ref transcript
Eps15 homology domain. Pair of EF hand motifs that recognise proteins containing Asn-Pro-Phe (NPF) sequences.
Changed!
COG FRQ1 167aa 3e-18
in ref transcript
Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].
Changed!
cd EFh 72aa 1e-06
in modified transcript
Changed!
COG FRQ1 104aa 2e-08
in modified transcript
KCNJ1
refseq_KCNJ1.F1 refseq_KCNJ1.R1 229 399
NCBIGene 36.3 3758
Single exon skipping, size difference: 170
Inclusion in the protein causing a new stop codon
Reference transcript: NM_153765
Changed!
pfam IRK 338aa 1e-153
in ref transcript
Inward rectifier potassium channel.
KCNJ1
refseq_KCNJ1.F2 refseq_KCNJ1.R2 258 391
NCBIGene 36.3 3758
Single exon skipping, size difference: 133
Exclusion in 5'UTR
Reference transcript: NM_153767
pfam IRK 338aa 1e-153
in ref transcript
Inward rectifier potassium channel.
KCNQ1
refseq_KCNQ1.F1 refseq_KCNQ1.R1 122 214
NCBIGene 36.2 3784
Single exon skipping, size difference: 92
Inclusion in the protein causing a new stop codon
Reference transcript: NM_000218
Changed!
pfam KCNQ_channel 163aa 5e-62
in ref transcript
KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels.
Changed!
pfam Ion_trans 190aa 4e-18
in ref transcript
Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
KCNQ2
refseq_KCNQ2.F1 refseq_KCNQ2.R1 153 183
NCBIGene 36.3 3785
Single exon skipping, size difference: 30
Exclusion in the protein (no frameshift)
Reference transcript: NM_172107
pfam KCNQ_channel 201aa 2e-70
in ref transcript
KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels.
pfam Ion_trans 185aa 8e-25
in ref transcript
Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
PRK PRK10537 40aa 0.004
in ref transcript
voltage-gated potassium channel; Provisional.
KCNQ4
refseq_KCNQ4.F1 refseq_KCNQ4.R1 105 267
NCBIGene 36.3 9132
Single exon skipping, size difference: 162
Exclusion in the protein (no frameshift)
Reference transcript: NM_004700
pfam KCNQ_channel 197aa 5e-72
in ref transcript
KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels.
pfam Ion_trans 185aa 1e-18
in ref transcript
Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
PRK PRK10537 69aa 0.002
in ref transcript
voltage-gated potassium channel; Provisional.
KCNRG
refseq_KCNRG.F2 refseq_KCNRG.R2 267 365
NCBIGene 36.3 283518
Single exon skipping, size difference: 98
Inclusion in the protein causing a frameshift
Reference transcript: NM_173605
pfam K_tetra 89aa 4e-12
in ref transcript
K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerisation domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly related to the BTB/POZ domain pfam00651.
KIAA0101
refseq_KIAA0101.F1 refseq_KIAA0101.R1 129 292
NCBIGene 36.3 9768
Single exon skipping, size difference: 163
Exclusion in the protein causing a frameshift
Reference transcript: NM_014736
KIAA0841
refseq_KIAA0841.F2 refseq_KIAA0841.R2 137 198
NCBIGene 36.2 23354
Alternative 3-prime, size difference: 61
Exclusion of the protein initiation site
Reference transcript: XM_932194
KIAA0859
refseq_KIAA0859.F2 refseq_KIAA0859.R2 101 426
NCBIGene 36.3 51603
Alternative 5-prime, size difference: 325
Exclusion of the protein initiation site
Reference transcript: NM_015935
Changed!
cd AdoMet_MTases 109aa 8e-08
in ref transcript
S-adenosylmethionine-dependent methyltransferases (SAM or AdoMet-MTase), class I; AdoMet-MTases are enzymes that use S-adenosyl-L-methionine (SAM or AdoMet) as a substrate for methyltransfer, creating the product S-adenosyl-L-homocysteine (AdoHcy). There are at least five structurally distinct families of AdoMet-MTases, class I being the largest and most diverse. Within this class enzymes can be classified by different substrate specificities (small molecules, lipids, nucleic acids, etc.) and different target atoms for methylation (nitrogen, oxygen, carbon, sulfur, etc.).
Changed!
pfam Methyltransf_11 106aa 6e-09
in ref transcript
Methyltransferase domain. Members of this family are SAM dependent methyltransferases.
pfam Spermine_synth 94aa 3e-06
in ref transcript
Spermine/spermidine synthase. Spermine and spermidine are polyamines. This family includes spermidine synthase that catalyses the fifth (last) step in the biosynthesis of spermidine from arginine, and spermine synthase.
COG SpeE 169aa 1e-11
in ref transcript
Spermidine synthase [Amino acid transport and metabolism].
Changed!
PRK PRK08317 221aa 5e-09
in ref transcript
hypothetical protein; Provisional.
Changed!
PRK PRK08317 64aa 7e-04
in modified transcript
KIAA1462
refseq_KIAA1462.F2 refseq_KIAA1462.R2 147 187
NCBIGene 36.2 57608
Single exon skipping, size difference: 40
Inclusion in the protein causing a new stop codon
Reference transcript: XM_929312
FNIP1
refseq_KIAA1961.F1 refseq_KIAA1961.R1 149 233
NCBIGene 36.3 96459
Single exon skipping, size difference: 84
Exclusion in the protein (no frameshift)
Reference transcript: NM_133372
KIF23
refseq_KIF23.F1 refseq_KIF23.R1 104 416
NCBIGene 36.3 9493
Single exon skipping, size difference: 312
Exclusion in the protein (no frameshift)
Reference transcript: NM_138555
cd KISc_KIF23_like 219aa 1e-102
in ref transcript
Kinesin motor domain, KIF23-like subgroup. Members of this group may play a role in mitosis. This catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Kinesins are microtubule-dependent molecular motors that play important roles in intracellular transport and in cell division. In most kinesins, the motor domain is found at the N-terminus (N-type). N-type kinesins are (+) end-directed motors, i.e. they transport cargo towards the (+) end of the microtubule. Kinesin motor domains hydrolyze ATP at a rate of about 80 per second, and move along the microtubule at a speed of about 6400 Angstroms per second. To achieve that, kinesin head groups work in pairs. Upon replacing ADP with ATP, a kinesin motor domain increases its affinity for microtubule binding and locks in place. Also, the neck linker binds to the motor domain, which repositions the other head domain through the coiled-coil domain close to a second tubulin dimer, about 80 Angstroms along the microtubule. Meanwhile, ATP hydrolysis takes place, and when the second head domain binds to the microtubule, the first domain again replaces ADP with ATP, triggering a conformational change that pulls the first domain forward.
cd KISc_KIF23_like 123aa 9e-50
in ref transcript
pfam Kinesin 228aa 1e-74
in ref transcript
Kinesin motor domain.
smart KISc 126aa 3e-30
in ref transcript
Kinesin motor, catalytic domain. ATPase. Microtubule-dependent molecular motors that play important roles in intracellular transport of organelles and in cell division.
pfam SMC_N 168aa 4e-04
in ref transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
COG KIP1 236aa 2e-36
in ref transcript
Kinesin-like protein [Cytoskeleton].
COG KIP1 75aa 1e-11
in ref transcript
COG Smc 184aa 2e-04
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
KIF25
refseq_KIF25.F1 refseq_KIF25.R1 145 301
NCBIGene 36.3 3834
Single exon skipping, size difference: 156
Exclusion in the protein (no frameshift)
Reference transcript: NM_030615
Changed!
cd KISc_C_terminal 336aa 5e-63
in ref transcript
Kinesin motor domain, KIFC2/KIFC3/ncd-like carboxy-terminal kinesins. Ncd is a spindle motor protein necessary for chromosome segregation in meiosis. KIFC2/KIFC3-like kinesins have been implicated in motility of the Golgi apparatus as well as dentritic and axonal transport in neurons. This catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Kinesins are microtubule-dependent molecular motors that play important roles in intracellular transport and in cell division. In this subgroup the motor domain is found at the C-terminus (C-type). C-type kinesins are (-) end-directed motors, i.e. they transport cargo towards the (-) end of the microtubule. Kinesin motor domains hydrolyze ATP at a rate of about 80 per second, and move along the microtubule at a speed of about 6400 Angstroms per second. To achieve that, kinesin head groups work in pairs. Upon replacing ADP with ATP, a kinesin motor domain increases its affinity for microtubule binding and locks in place. Also, the neck linker binds to the motor domain, which repositions the other head domain through the coiled-coil domain close to a second tubulin dimer, about 80 Angstroms along the microtubule. Meanwhile, ATP hydrolysis takes place, and when the second head domain binds to the microtubule, the first domain again replaces ADP with ATP, triggering a conformational change that pulls the first domain forward.
Changed!
pfam Kinesin 333aa 1e-65
in ref transcript
Kinesin motor domain.
Changed!
COG KIP1 333aa 6e-38
in ref transcript
Kinesin-like protein [Cytoskeleton].
Changed!
cd KISc_C_terminal 284aa 7e-43
in modified transcript
Changed!
smart KISc 287aa 5e-46
in modified transcript
Kinesin motor, catalytic domain. ATPase. Microtubule-dependent molecular motors that play important roles in intracellular transport of organelles and in cell division.
Changed!
COG KIP1 281aa 9e-28
in modified transcript
KIF9
refseq_KIF9.F1 refseq_KIF9.R1 201 396
NCBIGene 36.3 64147
Single exon skipping, size difference: 195
Exclusion in the protein (no frameshift)
Reference transcript: NM_182902
cd KISc_KIF9_like 333aa 1e-144
in ref transcript
Kinesin motor domain, KIF9-like subgroup; might play a role in cell shape remodeling. This catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Kinesins are microtubule-dependent molecular motors that play important roles in intracellular transport and in cell division. In most kinesins, the motor domain is found at the N-terminus (N-type). N-type kinesins are (+) end-directed motors, i.e. they transport cargo towards the (+) end of the microtubule. Kinesin motor domains hydrolyze ATP at a rate of about 80 per second, and move along the microtubule at a speed of about 6400 Angstroms per second. To achieve that, kinesin head groups work in pairs. Upon replacing ADP with ATP, a kinesin motor domain increases its affinity for microtubule binding and locks in place. Also, the neck linker binds to the motor domain, which repositions the other head domain through the coiled-coil domain close to a second tubulin dimer, about 80 Angstroms along the microtubule. Meanwhile, ATP hydrolysis takes place, and when the second head domain binds to the microtubule, the first domain again replaces ADP with ATP, triggering a conformational change that pulls the first domain forward.
pfam Kinesin 329aa 1e-107
in ref transcript
Kinesin motor domain.
COG KIP1 300aa 2e-54
in ref transcript
Kinesin-like protein [Cytoskeleton].
KIRREL2
refseq_KIRREL2.F1 refseq_KIRREL2.R1 202 352
NCBIGene 36.3 84063
Single exon skipping, size difference: 150
Exclusion in the protein (no frameshift)
Reference transcript: NM_199180
cd IGcam 82aa 8e-10
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IGcam 79aa 2e-05
in ref transcript
Changed!
cd IGcam 95aa 0.001
in ref transcript
cd IGcam 73aa 0.002
in ref transcript
pfam I-set 80aa 9e-12
in ref transcript
Immunoglobulin I-set domain.
pfam C2-set_2 90aa 3e-07
in ref transcript
CD80-like C2-set immunoglobulin domain. These domains belong to the immunoglobulin superfamily.
Changed!
pfam I-set 83aa 7e-07
in ref transcript
pfam I-set 74aa 3e-04
in ref transcript
Changed!
smart IG_like 44aa 0.005
in modified transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
KL
refseq_KL.F2 refseq_KL.R2 209 255
NCBIGene 36.2 9365
Alternative 5-prime and 3-prime, size difference: 46
Inclusion in the protein causing a new stop codon, Exclusion in the protein causing a frameshift
Reference transcript: NM_004795
pfam Glyco_hydro_1 449aa 1e-115
in ref transcript
Glycosyl hydrolase family 1.
Changed!
pfam Glyco_hydro_1 372aa 6e-78
in ref transcript
COG BglB 446aa 4e-79
in ref transcript
Beta-glucosidase/6-phospho-beta-glucosidase/beta- galactosidase [Carbohydrate transport and metabolism].
Changed!
COG BglB 371aa 5e-49
in ref transcript
KLHL5
refseq_KLHL5.F1 refseq_KLHL5.R1 144 327
NCBIGene 36.3 51088
Single exon skipping, size difference: 183
Exclusion in the protein (no frameshift)
Reference transcript: NM_015990
pfam BACK 100aa 4e-35
in ref transcript
BTB And C-terminal Kelch. This domain is found associated with pfam00651 and pfam01344. The BACK domain is found juxtaposed to the BTB domain; they are separated by as little as two residues. This family appears to be closely related to the BTB domain (Finn RD, personal observation).
Changed!
pfam BTB 108aa 2e-26
in ref transcript
BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of pox virus proteins. The BTB/POZ domain mediates homomeric dimerisation and in some instances heteromeric dimerisation. The structure of the dimerised PLZF BTB/POZ domain has been solved and consists of a tightly intertwined homodimer. The central scaffolding of the protein is made up of a cluster of alpha-helices flanked by short beta-sheets at both the top and bottom of the molecule. POZ domains from several zinc finger proteins have been shown to mediate transcriptional repression and to interact with components of histone deacetylase co-repressor complexes including N-CoR and SMRT. The POZ or BTB domain is also known as BR-C/Ttk or ZiN.
pfam Kelch_1 46aa 1e-12
in ref transcript
Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that the Drosophila ring canal kelch protein is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.
smart Kelch 47aa 3e-12
in ref transcript
Kelch domain.
pfam Kelch_1 44aa 2e-10
in ref transcript
smart Kelch 45aa 9e-10
in ref transcript
smart Kelch 46aa 1e-08
in ref transcript
pfam Kelch_1 52aa 2e-07
in ref transcript
TIGR mutarot_permut 200aa 6e-06
in ref transcript
Members of this protein family show essentially full-length homology, cyclically permuted, to YjhT from Escherichia coli. YjhT was shown to act as a mutarotase for sialic acid, and by this ability to be able to act as a virulence factor. Members of the YjhT family (TIGR03547) and this cyclically-permuted family have multiple repeats of the beta-propeller-forming Kelch repeat.
TIGR muta_rot_YjhT 119aa 5e-05
in ref transcript
Members of this protein family contain multiple copies of the beta-propeller-forming Kelch repeat. All are full-length homologs to YjhT of Escherichia coli, which has been identified as a mutarotase for sialic acid. This protein improves bacterial ability to obtain host sialic acid, and thus serves as a virulence factor. Some bacteria carry what appears to be a cyclically permuted homolog of this protein.
Changed!
pfam BTB 74aa 8e-13
in modified transcript
KLHL7
refseq_KLHL7.F1 refseq_KLHL7.R1 240 371
NCBIGene 36.3 55975
Single exon skipping, size difference: 131
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001031710
Changed!
pfam BACK 103aa 2e-32
in ref transcript
BTB And C-terminal Kelch. This domain is found associated with pfam00651 and pfam01344. The BACK domain is found juxtaposed to the BTB domain; they are separated by as little as two residues. This family appears to be closely related to the BTB domain (Finn RD, personal observation).
Changed!
pfam BTB 107aa 5e-29
in ref transcript
BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of pox virus proteins. The BTB/POZ domain mediates homomeric dimerisation and in some instances heteromeric dimerisation. The structure of the dimerised PLZF BTB/POZ domain has been solved and consists of a tightly intertwined homodimer. The central scaffolding of the protein is made up of a cluster of alpha-helices flanked by short beta-sheets at both the top and bottom of the molecule. POZ domains from several zinc finger proteins have been shown to mediate transcriptional repression and to interact with components of histone deacetylase co-repressor complexes including N-CoR and SMRT. The POZ or BTB domain is also known as BR-C/Ttk or ZiN.
Changed!
pfam Kelch_1 46aa 6e-09
in ref transcript
Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that the Drosophila ring canal kelch protein is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.
Changed!
pfam Kelch_1 48aa 7e-08
in ref transcript
Changed!
smart Kelch 48aa 2e-05
in ref transcript
Kelch domain.
Changed!
TIGR muta_rot_YjhT 65aa 0.002
in ref transcript
Members of this protein family contain multiple copies of the beta-propeller-forming Kelch repeat. All are full-length homologs to YjhT of Escherichia coli, which has been identified as a mutarotase for sialic acid. This protein improves bacterial ability to obtain host sialic acid, and thus serves as a virulence factor. Some bacteria carry what appears to be a cyclically permuted homolog of this protein.
KLK12
refseq_KLK12.F1 refseq_KLK12.R1 131 391
NCBIGene 36.3 43849
Single exon skipping, size difference: 260
Exclusion in the protein causing a frameshift
Reference transcript: NM_019598
Changed!
cd Tryp_SPc 215aa 8e-50
in ref transcript
Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.
Changed!
smart Tryp_SPc 216aa 9e-56
in ref transcript
Trypsin-like serine protease. Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.
Changed!
COG COG5640 236aa 2e-15
in ref transcript
Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones].
Changed!
cd Tryp_SPc 50aa 9e-11
in modified transcript
Changed!
smart Tryp_SPc 43aa 2e-12
in modified transcript
Changed!
COG COG5640 63aa 9e-06
in modified transcript
KLK3
refseq_KLK3.F2 refseq_KLK3.R2 247 376
NCBIGene 36.3 354
Alternative 3-prime, size difference: 129
Exclusion in the protein (no frameshift)
Reference transcript: NM_001648
Changed!
cd Tryp_SPc 232aa 3e-64
in ref transcript
Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.
Changed!
smart Tryp_SPc 230aa 5e-71
in ref transcript
Trypsin-like serine protease. Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.
Changed!
COG COG5640 250aa 1e-21
in ref transcript
Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones].
Changed!
cd Tryp_SPc 189aa 1e-52
in modified transcript
Changed!
smart Tryp_SPc 187aa 2e-60
in modified transcript
Changed!
COG COG5640 207aa 8e-19
in modified transcript
KLRC1
refseq_KLRC1.F1 refseq_KLRC1.R1 215 269
NCBIGene 36.3 3821
Single exon skipping, size difference: 54
Exclusion in the protein (no frameshift)
Reference transcript: NM_213658
cd CLECT_NK_receptors_like 113aa 4e-26
in ref transcript
CLECT_NK_receptors_like: C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs), including proteins similar to oxidized low density lipoprotein (OxLDL) receptor (LOX-1), CD94, CD69, NKG2-A and -D, osteoclast inhibitory lectin (OCIL), dendritic cell-associated C-type lectin-1 (dectin-1), human myeloid inhibitory C-type lectin-like receptor (MICL), mast cell-associated functional antigen (MAFA), killer cell lectin-like receptors: subfamily F, member 1 (KLRF1) and subfamily B, member 1 (KLRB1), and lys49 receptors. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. NKRs are variously associated with activation or inhibition of natural killer (NK) cells. Activating NKRs stimulate cytolysis by NK cells of virally infected or transformed cells; inhibitory NKRs block cytolysis upon recognition of markers of healthy self cells. Most Lys49 receptors are inhibitory; some are stimulatory. OCIL inhibits NK cell function via binding to the receptor NKRP1D. Murine OCIL in addition to inhibiting NK cell function inhibits osteoclast differentiation. MAFA clusters with the type I Fc epsilon receptor (FcepsilonRI) and inhibits the mast cells secretory response to FcepsilonRI stimulus. CD72 is a negative regulator of B cell receptor signaling. NKG2D is an activating receptor for stress-induced antigens; human NKG2D ligands include the stress induced MHC-I homologs, MICA, MICB, and ULBP family of glycoproteins Several NKRs have a carbohydrate-binding capacity which is not mediated through calcium ions (e.g. OCIL binds a range of high molecular weight sulfated glycosaminoglycans including dextran sulfate, fucoidan, and gamma-carrageenan sugars). Dectin-1 binds fungal beta-glucans and in involved in the innate immune responses to fungal pathogens. MAFA binds saccharides having terminal alpha-D mannose residues in a calcium-dependent manner. LOX-1 is the major receptor for OxLDL in endothelial cells and thought to play a role in the pathology of atherosclerosis. Some NKRs exist as homodimers (e.g.Lys49, NKG2D, CD69, LOX-1) and some as heterodimers (e.g. CD94/NKG2A). Dectin-1 can function as a monomer in vitro.
smart CLECT 112aa 4e-17
in ref transcript
C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.
KLRD1
refseq_KLRD1.F1 refseq_KLRD1.R1 135 228
NCBIGene 36.3 3824
Single exon skipping, size difference: 93
Exclusion in the protein (no frameshift)
Reference transcript: NM_002262
cd CLECT_NK_receptors_like 116aa 3e-26
in ref transcript
CLECT_NK_receptors_like: C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs), including proteins similar to oxidized low density lipoprotein (OxLDL) receptor (LOX-1), CD94, CD69, NKG2-A and -D, osteoclast inhibitory lectin (OCIL), dendritic cell-associated C-type lectin-1 (dectin-1), human myeloid inhibitory C-type lectin-like receptor (MICL), mast cell-associated functional antigen (MAFA), killer cell lectin-like receptors: subfamily F, member 1 (KLRF1) and subfamily B, member 1 (KLRB1), and lys49 receptors. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. NKRs are variously associated with activation or inhibition of natural killer (NK) cells. Activating NKRs stimulate cytolysis by NK cells of virally infected or transformed cells; inhibitory NKRs block cytolysis upon recognition of markers of healthy self cells. Most Lys49 receptors are inhibitory; some are stimulatory. OCIL inhibits NK cell function via binding to the receptor NKRP1D. Murine OCIL in addition to inhibiting NK cell function inhibits osteoclast differentiation. MAFA clusters with the type I Fc epsilon receptor (FcepsilonRI) and inhibits the mast cells secretory response to FcepsilonRI stimulus. CD72 is a negative regulator of B cell receptor signaling. NKG2D is an activating receptor for stress-induced antigens; human NKG2D ligands include the stress induced MHC-I homologs, MICA, MICB, and ULBP family of glycoproteins Several NKRs have a carbohydrate-binding capacity which is not mediated through calcium ions (e.g. OCIL binds a range of high molecular weight sulfated glycosaminoglycans including dextran sulfate, fucoidan, and gamma-carrageenan sugars). Dectin-1 binds fungal beta-glucans and in involved in the innate immune responses to fungal pathogens. MAFA binds saccharides having terminal alpha-D mannose residues in a calcium-dependent manner. LOX-1 is the major receptor for OxLDL in endothelial cells and thought to play a role in the pathology of atherosclerosis. Some NKRs exist as homodimers (e.g.Lys49, NKG2D, CD69, LOX-1) and some as heterodimers (e.g. CD94/NKG2A). Dectin-1 can function as a monomer in vitro.
smart CLECT 115aa 9e-21
in ref transcript
C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.
KREMEN1
refseq_KREMEN1.F2 refseq_KREMEN1.R2 186 237
NCBIGene 36.3 83999
Alternative 5-prime, size difference: 51
Exclusion in the protein (no frameshift)
Reference transcript: NM_001039570
cd KR 84aa 9e-18
in ref transcript
Kringle domain; Kringle domains are believed to play a role in binding mediators, such as peptides, other proteins, membranes, or phospholipids. They are autonomous structural domains, found in a varying number of copies, in blood clotting and fibrinolytic proteins, some serine proteases and plasma proteins. Plasminogen-like kringles possess affinity for free lysine and lysine-containing peptides.
cd CUB 107aa 1e-15
in ref transcript
CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
pfam WSC 81aa 3e-19
in ref transcript
WSC domain. This domain may be involved in carbohydrate binding.
smart KR 84aa 3e-18
in ref transcript
Kringle domain. Named after a Danish pastry. Found in several serine proteases and in ROR-like receptors. Can occur in up to 38 copies (in apolipoprotein(a)). Plasminogen-like kringles possess affinity for free lysine and lysine- containing peptides.
pfam CUB 105aa 1e-14
in ref transcript
CUB domain.
KREMEN2
refseq_KREMEN2.F1 refseq_KREMEN2.R1 266 345
NCBIGene 36.3 79412
Single exon skipping, size difference: 79
Exclusion in the protein causing a frameshift
Reference transcript: NM_172229
cd CUB 106aa 2e-15
in ref transcript
CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
cd KR 86aa 2e-15
in ref transcript
Kringle domain; Kringle domains are believed to play a role in binding mediators, such as peptides, other proteins, membranes, or phospholipids. They are autonomous structural domains, found in a varying number of copies, in blood clotting and fibrinolytic proteins, some serine proteases and plasma proteins. Plasminogen-like kringles possess affinity for free lysine and lysine-containing peptides.
pfam WSC 82aa 7e-17
in ref transcript
WSC domain. This domain may be involved in carbohydrate binding.
smart KR 86aa 3e-16
in ref transcript
Kringle domain. Named after a Danish pastry. Found in several serine proteases and in ROR-like receptors. Can occur in up to 38 copies (in apolipoprotein(a)). Plasminogen-like kringles possess affinity for free lysine and lysine- containing peptides.
pfam CUB 105aa 9e-14
in ref transcript
CUB domain.
KRIT1
refseq_KRIT1.F2 refseq_KRIT1.R2 153 301
NCBIGene 36.3 889
Single exon skipping, size difference: 148
Exclusion in 5'UTR
Reference transcript: NM_194456
cd ANK 121aa 1e-14
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
smart B41 220aa 1e-32
in ref transcript
Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
COG Arp 127aa 3e-06
in ref transcript
FOG: Ankyrin repeat [General function prediction only].
KRIT1
refseq_KRIT1.F4 refseq_KRIT1.R4 231 306
NCBIGene 36.3 889
Alternative 5-prime and 3-prime, size difference: 75
Exclusion in 5'UTR, Exclusion in 5'UTR
Reference transcript: NM_194456
cd ANK 121aa 1e-14
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
smart B41 220aa 1e-32
in ref transcript
Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
COG Arp 127aa 3e-06
in ref transcript
FOG: Ankyrin repeat [General function prediction only].
KRIT1
refseq_KRIT1.F5 refseq_KRIT1.R5 103 466
NCBIGene 36.3 889
Alternative 5-prime, size difference: 363
Exclusion in 5'UTR
Reference transcript: NM_004912
cd ANK 121aa 1e-14
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
smart B41 220aa 1e-32
in ref transcript
Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
COG Arp 127aa 3e-06
in ref transcript
FOG: Ankyrin repeat [General function prediction only].
KRIT1
refseq_KRIT1.F8 refseq_KRIT1.R8 135 279
NCBIGene 36.3 889
Single exon skipping, size difference: 144
Exclusion in the protein (no frameshift)
Reference transcript: NM_194456
Changed!
cd ANK 121aa 1e-14
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
smart B41 220aa 1e-32
in ref transcript
Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
Changed!
COG Arp 127aa 3e-06
in ref transcript
FOG: Ankyrin repeat [General function prediction only].
Changed!
cd ANK 85aa 1e-06
in modified transcript
Changed!
PTZ PTZ00322 74aa 2e-04
in modified transcript
Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. These proteins are involved in transcriptional regulation.
smart MBT 93aa 2e-36
in ref transcript
smart MBT 98aa 1e-35
in ref transcript
pfam zf-C2HC 29aa 4e-10
in ref transcript
Zinc finger, C2HC type. This is a DNA binding zinc finger domain.
L3MBTL2
refseq_L3MBTL2.F2 refseq_L3MBTL2.R2 176 281
NCBIGene 36.2 83746
Single exon skipping, size difference: 105
Inclusion in the protein causing a new stop codon
Reference transcript: NM_031488
smart MBT 94aa 6e-38
in ref transcript
Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. These proteins are involved in transcriptional regulation.
smart MBT 97aa 3e-28
in ref transcript
smart MBT 101aa 6e-19
in ref transcript
smart MBT 93aa 2e-17
in ref transcript
L3MBTL3
refseq_L3MBTL3.F2 refseq_L3MBTL3.R2 204 279
NCBIGene 36.3 84456
Single exon skipping, size difference: 75
Exclusion in the protein (no frameshift)
Reference transcript: NM_032438
cd SAM 64aa 4e-08
in ref transcript
Sterile alpha motif.; Widespread domain in signalling and nuclear proteins. In EPH-related tyrosine kinases, appears to mediate cell-cell initiated signal transduction via the binding of SH2-containing proteins to a conserved tyrosine that is phosphorylated. In many cases mediates homodimerization.
smart MBT 97aa 1e-33
in ref transcript
Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. These proteins are involved in transcriptional regulation.
smart MBT 93aa 2e-33
in ref transcript
smart MBT 95aa 7e-30
in ref transcript
pfam SAM_1 63aa 5e-11
in ref transcript
SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as a protein interaction module through its ability to homo- and heterooligomerise with other SAM domains.
LAIR1
refseq_LAIR1.F1 refseq_LAIR1.R1 265 316
NCBIGene 36.3 3903
Single exon skipping, size difference: 51
Exclusion in the protein (no frameshift)
Reference transcript: NM_002287
LAIR2
refseq_LAIR2.F2 refseq_LAIR2.R2 280 331
NCBIGene 36.3 3904
Single exon skipping, size difference: 51
Exclusion in the protein (no frameshift)
Reference transcript: NM_002288
LARGE
refseq_LARGE.F1 refseq_LARGE.R1 174 237
NCBIGene 36.3 9215
Single exon skipping, size difference: 63
Exclusion in 5'UTR
Reference transcript: NM_004737
cd GT8_LARGE_C 280aa 1e-165
in ref transcript
LARGE catalytic domain has closest homology to GT8 glycosyltransferase involved in lipooligosaccharide synthesis. The catalytic domain of LARGE is a putative glycosyltransferase. Mutations of LARGE in mouse and human cause dystroglycanopathies, a disease associated with hypoglycosylation of the membrane protein alpha-dystroglycan (alpha-DG) and consequent loss of extracellular ligand binding. LARGE needs to both physically interact with alpha-dystroglycan and function as a glycosyltransferase in order to stimulate alpha-dystroglycan hyperglycosylation. LARGE localizes to the Golgi apparatus and contains three conserved DxD motifs. While two of the motifs are indispensible for glycosylation function, one is important for localization of th eenzyme. LARGE was originally named because it covers approximately large trunck of genomic DNA, more than 600bp long. The predicted protein structure contains an N-terminal cytoplasmic domain, a transmembrane region, a coiled-coil motif, and two putative catalytic domains. This catalytic domain has closest homology to GT8 glycosyltransferase involved in lipooligosaccharide synthesis.
pfam Glyco_transf_8 245aa 3e-17
in ref transcript
Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosyltransferase, lipopolysaccharide glucosyltransferase 1, and glycogenin glucosyltransferase.
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
smart LA 79aa 5e-29
in ref transcript
Domain in the RNA-binding Lupus La protein; unknown function.
LASS2
refseq_LASS2.F2 refseq_LASS2.R2 224 264
NCBIGene 36.3 29956
Alternative 3-prime, size difference: 40
Inclusion in 5'UTR
Reference transcript: NM_022075
cd homeodomain 58aa 0.001
in ref transcript
Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
smart TLC 202aa 1e-42
in ref transcript
TRAM, LAG1 and CLN8 homology domains. Protein domain with at least 5 transmembrane alpha-helices. Lag1p and Lac1p are essential for acyl-CoA-dependent ceramide synthesis, TRAM is a subunit of the translocon and the CLN8 gene is mutated in Northern epilepsy syndrome. The family may possess multiple functions such as lipid trafficking, metabolism, or sensing. Trh homologues possess additional homeobox domains.
COG LAG1 220aa 3e-22
in ref transcript
Protein transporter of the TRAM (translocating chain-associating membrane) superfamily, longevity assurance factor [Intracellular trafficking and secretion].
LAT2
refseq_LAT2.F1 refseq_LAT2.R1 117 187
NCBIGene 36.3 7462
Alternative 5-prime, size difference: 70
Exclusion in 5'UTR
Reference transcript: NM_032464
LCMT1
refseq_LCMT1.F1 refseq_LCMT1.R1 116 281
NCBIGene 36.3 51451
Multiple exon skipping, size difference: 165
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_016309
Changed!
pfam LCM 299aa 1e-127
in ref transcript
Leucine carboxyl methyltransferase. Family of leucine carboxyl methyltransferases EC:2.1.1.-. This family may need divides a the full alignment contains a significantly shorter mouse sequence.
Changed!
COG COG3315 195aa 6e-14
in ref transcript
O-Methyltransferase involved in polyketide biosynthesis [Secondary metabolites biosynthesis, transport, and catabolism].
Changed!
pfam LCM 244aa 1e-101
in modified transcript
Changed!
COG COG3315 140aa 7e-09
in modified transcript
LDHC
refseq_LDHC.F1 refseq_LDHC.R1 121 175
NCBIGene 36.3 3948
Alternative 5-prime, size difference: 54
Exclusion in 5'UTR
Reference transcript: NM_002301
cd LDH_1 312aa 1e-145
in ref transcript
A subgroup of L-lactate dehydrogenases. L-lactate dehydrogenases (LDH) are tetrameric enzymes catalyzing the last step of glycolysis in which pyruvate is converted to L-lactate. This subgroup is composed of eukaryotic LDHs. Vertebrate LDHs are non-allosteric. This is in contrast to some bacterial LDHs that are activated by an allosteric effector such as fructose-1,6-bisphosphate. LDHs are part of the NAD(P)-binding Rossmann fold superfamily, which includes a wide variety of protein families including the NAD(P)-binding domains of alcohol dehydrogenases, tyrosine-dependent oxidoreductases, glyceraldehyde-3-phosphate dehydrogenases, formate/glycerate dehydrogenases, siroheme synthases, 6-phosphogluconate dehydrogenases, aminoacid dehydrogenases, repressor rex, and NAD-binding potassium channel domains, among others.
TIGR L-LDH-NAD 301aa 1e-110
in ref transcript
This model represents the NAD-dependent L-lactate dehydrogenases from bacteria and eukaryotes. This enzyme function as as the final step in anaerobic glycolysis. Although lactate dehydrogenases have in some cases been mistaken for malate dehydrogenases due to the similarity of these two substrates and the apparent ease with which evolution can toggle these activities, critical residues have been identified which can discriminate between the two activities. At the time of the creation of this model no hits above the trusted cutoff contained critical residues typical of malate dehydrogenases.
PRK ldh 309aa 8e-87
in ref transcript
L-lactate dehydrogenase; Reviewed.
LDHD
refseq_LDHD.F1 refseq_LDHD.R1 225 294
NCBIGene 36.3 197257
Alternative 3-prime, size difference: 69
Exclusion in the protein (no frameshift)
Reference transcript: NM_153486
Changed!
TIGR glcD 437aa 2e-84
in ref transcript
This protein, the glycolate oxidase GlcD subunit, is similar in sequence to that of several D-lactate dehydrogenases, including that of E. coli. The glycolate oxidase has been found to have some D-lactate dehydrogenase activity.
Changed!
COG GlcD 474aa 2e-80
in ref transcript
FAD/FMN-containing dehydrogenases [Energy production and conversion].
Changed!
TIGR glcD 414aa 3e-86
in modified transcript
Changed!
COG GlcD 451aa 5e-83
in modified transcript
LETMD1
refseq_LETMD1.F1 refseq_LETMD1.R1 104 220
NCBIGene 36.2 25875
Single exon skipping, size difference: 116
Exclusion in the protein causing a frameshift
Reference transcript: NM_015416
Changed!
pfam LETM1 255aa 4e-32
in ref transcript
LETM1-like protein. Members of this family are inner mitochondrial membrane proteins which play a role in potassium and hydrogen ion exchange. Deletion of LETM1 is thought to be involved in the development of Wolf-Hirschhorn syndrome in humans.
LETMD1
refseq_LETMD1.F4 refseq_LETMD1.R4 166 273
NCBIGene 36.3 25875
Alternative 5-prime, size difference: 107
Exclusion in the protein causing a frameshift
Reference transcript: NM_015416
Changed!
pfam LETM1 255aa 4e-32
in ref transcript
LETM1-like protein. Members of this family are inner mitochondrial membrane proteins which play a role in potassium and hydrogen ion exchange. Deletion of LETM1 is thought to be involved in the development of Wolf-Hirschhorn syndrome in humans.
LGALS14
refseq_LGALS14.F1 refseq_LGALS14.R1 102 407
NCBIGene 36.3 56891
Single exon skipping, size difference: 305
Exclusion of the protein initiation site
Reference transcript: NM_203471
Changed!
cd GLECT 130aa 4e-28
in ref transcript
Galectin/galactose-binding lectin. This domain exclusively binds beta-galactosides, such as lactose, and does not require metal ions for activity. GLECT domains occur as homodimers or tandemly repeated domains. They are developmentally regulated and may be involved in differentiation, cell-cell interaction and cellular regulation.
Changed!
pfam Gal-bind_lectin 129aa 8e-31
in ref transcript
Galactoside-binding lectin. This family contains galactoside binding lectins. The family also includes enzymes such as human eosinophil lysophospholipase (EC:3.1.1.5).
Changed!
cd GLECT 131aa 1e-28
in modified transcript
Changed!
pfam Gal-bind_lectin 131aa 3e-31
in modified transcript
LGALS8
refseq_LGALS8.F2 refseq_LGALS8.R2 206 332
NCBIGene 36.3 3964
Single exon skipping, size difference: 126
Exclusion in the protein (no frameshift)
Reference transcript: NM_201545
cd GLECT 132aa 5e-34
in ref transcript
Galectin/galactose-binding lectin. This domain exclusively binds beta-galactosides, such as lactose, and does not require metal ions for activity. GLECT domains occur as homodimers or tandemly repeated domains. They are developmentally regulated and may be involved in differentiation, cell-cell interaction and cellular regulation.
cd GLECT 129aa 2e-32
in ref transcript
pfam Gal-bind_lectin 132aa 7e-39
in ref transcript
Galactoside-binding lectin. This family contains galactoside binding lectins. The family also includes enzymes such as human eosinophil lysophospholipase (EC:3.1.1.5).
pfam Gal-bind_lectin 129aa 4e-37
in ref transcript
LGMN
refseq_LGMN.F2 refseq_LGMN.R2 171 289
NCBIGene 36.3 5641
Single exon skipping, size difference: 118
Exclusion in 5'UTR
Reference transcript: NM_001008530
pfam Peptidase_C13 257aa 1e-125
in ref transcript
Peptidase C13 family. Members of this family are asparaginyl peptidases. The blood fluke parasite Schistosoma mansoni has at least five Clan CA cysteine peptidases in its digestive tract including cathepsins B (2 isoforms), C, F and L. All have been recombinantly expressed as active enzymes, albeit in various stages of activation. In addition, a Clan CD peptidase, termed asparaginyl endopeptidase or 'legumain' has been identified. This has formerly been characterised as a 'haemoglobinase', but this term is probably incorrect. Two cDNAs have been described for Schistosoma mansoni legumain; one encodes an active enzyme whereas the active site cysteine residue encoded by the second cDNA is substituted by an asparagine residue. Both forms have been recombinantly expressed.
COG GPI8 194aa 2e-26
in ref transcript
Glycosylphosphatidylinositol transamidase (GPIT), subunit GPI8 [Posttranslational modification, protein turnover, chaperones].
LGR6
refseq_LGR6.F1 refseq_LGR6.R1 100 388
NCBIGene 36.3 59352
Multiple exon skipping, size difference: 288
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_001017403
Changed!
cd LRR_RI 262aa 6e-08
in ref transcript
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
Changed!
cd LRR_RI 143aa 1e-06
in ref transcript
Changed!
COG COG4886 215aa 1e-17
in ref transcript
Leucine-rich repeat (LRR) protein [Function unknown].
Changed!
COG COG4886 253aa 2e-14
in ref transcript
Changed!
cd LRR_RI 226aa 9e-09
in modified transcript
Changed!
COG COG4886 241aa 9e-16
in modified transcript
LHX6
refseq_LHX6.F2 refseq_LHX6.R2 287 391
NCBIGene 36.3 26468
Single exon skipping, size difference: 104
Exclusion in the protein causing a frameshift
Reference transcript: NM_014368
cd homeodomain 58aa 6e-14
in ref transcript
Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
pfam Homeobox 57aa 1e-17
in ref transcript
Homeobox domain.
pfam LIM 58aa 1e-09
in ref transcript
LIM domain. This family represents two copies of the LIM structural domain.
Changed!
cd Radical_SAM 197aa 7e-07
in ref transcript
Radical SAM superfamily. Enzymes of this family generate radicals by combining a 4Fe-4S cluster and S-adenosylmethionine (SAM) in close proximity. They are characterized by a conserved CxxxCxxC motif, which coordinates the conserved iron-sulfur cluster. Mechanistically, they share the transfer of a single electron from the iron-sulfur cluster to SAM, which leads to its reductive cleavage to methionine and a 5'-deoxyadenosyl radical, which, in turn, abstracts a hydrogen from the appropriately positioned carbon atom. Depending on the enzyme, SAM is consumed during this process or it is restored and reused. Radical SAM enzymes catalyze steps in metabolism, DNA repair, the biosynthesis of vitamins and coenzymes, and the biosynthesis of many antibiotics. Examples are biotin synthase (BioB), lipoyl synthase (LipA), pyruvate formate-lyase (PFL), coproporphyrinogen oxidase (HemN), lysine 2,3-aminomutase (LAM), anaerobic ribonucleotide reductase (ARR), and MoaA, an enzyme of the biosynthesis of molybdopterin.
Changed!
TIGR lipA 302aa 1e-104
in ref transcript
The family shows strong sequence conservation.
Changed!
PRK PRK05481 293aa 1e-126
in ref transcript
lipoyl synthase; Provisional.
Changed!
cd Radical_SAM 177aa 6e-07
in modified transcript
Changed!
TIGR lipA 255aa 7e-86
in modified transcript
Changed!
PRK PRK05481 248aa 1e-105
in modified transcript
LILRA5
refseq_LILRA5.F2 refseq_LILRA5.R2 167 203
NCBIGene 36.3 353514
Single exon skipping, size difference: 36
Exclusion in the protein (no frameshift)
Reference transcript: NM_021250
LLGL2
refseq_LLGL2.F1 refseq_LLGL2.R1 101 144
NCBIGene 36.3 3993
Single exon skipping, size difference: 43
Exclusion in the protein causing a frameshift
Reference transcript: NM_001031803
cd WD40 233aa 6e-05
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
pfam LLGL 105aa 6e-39
in ref transcript
LLGL2. This domain is found in lethal giant larvae homolog 2 (LLGL2) proteins and syntaxin-binding proteins like tomosyn. It has been identified in eukaryotes and tends to be found together with WD repeats (pfam00400).
COG COG2319 227aa 3e-05
in ref transcript
FOG: WD40 repeat [General function prediction only].
LMNA
refseq_LMNA.F1 refseq_LMNA.R1 218 308
NCBIGene 36.3 4000
Single exon skipping, size difference: 90
Exclusion in the protein (no frameshift)
Reference transcript: NM_170707
pfam Filament 349aa 3e-52
in ref transcript
Intermediate filament protein.
Changed!
pfam IF_tail 111aa 1e-47
in ref transcript
Intermediate filament tail domain. The Prosite motif does not correspond to this Pfam entry.
COG Smc 290aa 4e-05
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
Changed!
pfam IF_tail 102aa 3e-44
in modified transcript
LMX1A
refseq_LMX1A.F1 refseq_LMX1A.R1 109 242
NCBIGene 36.3 4009
Single exon skipping, size difference: 133
Inclusion in the protein causing a new stop codon
Reference transcript: NM_177398
cd homeodomain 57aa 4e-13
in ref transcript
Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
pfam Homeobox 57aa 6e-17
in ref transcript
Homeobox domain.
pfam LIM 56aa 1e-12
in ref transcript
LIM domain. This family represents two copies of the LIM structural domain.
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
cd RRM 73aa 2e-13
in ref transcript
smart RRM 70aa 6e-16
in ref transcript
RNA recognition motif.
TIGR SF-CC1 142aa 5e-15
in ref transcript
A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.
COG COG0724 171aa 1e-10
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
LOC150223
refseq_LOC150223.F1 refseq_LOC150223.R1 100 159
NCBIGene 36.2 150223
Alternative 5-prime, size difference: 59
Inclusion in the protein causing a frameshift
Reference transcript: NM_001017964
Changed!
pfam YdjC 283aa 8e-60
in ref transcript
YdjC-like protein. Family of YdjC-like proteins. This region is possibly involved in the the cleavage of cellobiose-phosphate.
Changed!
PRK PRK02134 294aa 3e-34
in ref transcript
hypothetical protein; Provisional.
Changed!
pfam YdjC 105aa 6e-27
in modified transcript
Changed!
PRK PRK02134 104aa 8e-15
in modified transcript
LOC150383
refseq_LOC150383.F1 refseq_LOC150383.R1 140 263
NCBIGene 36.2 150383
Single exon skipping, size difference: 123
Exclusion in 5'UTR
Reference transcript: NM_001008917
LOC219854
refseq_LOC219854.F2 refseq_LOC219854.R2 103 162
NCBIGene 36.2 219854
Single exon skipping, size difference: 59
Exclusion in 5'UTR
Reference transcript: XM_933911
LOC219854
refseq_LOC219854.F4 refseq_LOC219854.R4 114 162
NCBIGene 36.2 219854
Alternative 5-prime, size difference: 48
Exclusion in 5'UTR
Reference transcript: XM_933911
LOC219854
refseq_LOC219854.F6 refseq_LOC219854.R6 214 292
NCBIGene 36.2 219854
Single exon skipping, size difference: 78
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: XM_933894
LOC255411
refseq_LOC255411.F2 refseq_LOC255411.R2 218 303
NCBIGene 36.3 255411
Single exon skipping, size difference: 85
Inclusion in the protein causing a new stop codon
Reference transcript: XM_170708
C19orf63
refseq_LOC284361.F1 refseq_LOC284361.R1 241 328
NCBIGene 36.3 284361
Single exon skipping, size difference: 87
Inclusion in the protein causing a new stop codon
Reference transcript: NM_206538
LOC339287
refseq_LOC339287.F1 refseq_LOC339287.R1 109 157
NCBIGene 36.2 339287
Single exon skipping, size difference: 48
Exclusion in the protein (no frameshift)
Reference transcript: XM_932097
C16orf84
refseq_LOC348180.F2 refseq_LOC348180.R2 224 283
NCBIGene 36.3 348180
Single exon skipping, size difference: 59
Exclusion in the protein causing a frameshift
Reference transcript: NM_001012759
pfam DUF2392 101aa 5e-37
in ref transcript
Protein of unknown function (DUF2392). This is a family of proteins conserved from plants to humans. The function is not known. It carries a characteristic GRG sequence motif.
COG MesJ 153aa 0.001
in ref transcript
Predicted ATPase of the PP-loop superfamily implicated in cell cycle control [Cell division and chromosome partitioning].
LOC348801
refseq_LOC348801.F1 refseq_LOC348801.R1 220 259
NCBIGene 36.2 348801
Alternative 3-prime, size difference: 39
Exclusion in the protein (no frameshift)
Reference transcript: XM_931908
LOC352909
refseq_LOC352909.F1 refseq_LOC352909.R1 112 188
NCBIGene 36.2 352909
Alternative 5-prime, size difference: 76
Inclusion in the protein causing a new stop codon
Reference transcript: NM_178837
LOC374569
refseq_LOC374569.F1 refseq_LOC374569.R1 101 119
NCBIGene 36.2 374569
Alternative 3-prime, size difference: 18
Exclusion in the protein (no frameshift)
Reference transcript: XM_370777
cd Asparaginase 340aa 8e-93
in ref transcript
Asparaginase (amidohydrolase): Asparaginases are tetrameric enzymes that catalyze the hydrolysis of asparagine to aspartic acid and ammonia. In bacteria, there are two classes of amidohydrolases, one highly specific for asparagine and localised to the periplasm, and a second (asparaginase- glutaminase) present in the cytosol that hydrolyzises both asparagine and glutamine with similar specificities.
cd ANK 124aa 2e-17
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
smart Asparaginase 339aa 1e-82
in ref transcript
Asparaginase, which is found in various plant, animal and bacterial cells, catalyses the deamination of asparagine to yield aspartic acid and an ammonium ion, resulting in a depletion of free circulatory asparagine in plasma PUBMED:3026924. The enzyme is effective in the treatment of human malignant lymphomas, which have a diminished capacity to produce asparagine synthetase: in order to survive, such cells absorb asparagine from blood plasma PUBMED:2407723, PUBMED:3379033 - if Asn levels have been depleted by injection of asparaginase, the lymphoma cells die.
pfam Ank 32aa 3e-06
in ref transcript
Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
TIGR trp 76aa 1e-04
in ref transcript
after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
FOG: Ankyrin repeat [General function prediction only].
NHLRC3
refseq_LOC387921.F1 refseq_LOC387921.R1 129 330
NCBIGene 36.3 387921
Single exon skipping, size difference: 201
Exclusion in the protein (no frameshift)
Reference transcript: NM_001012754
pfam NHL 28aa 1e-05
in ref transcript
NHL repeat. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies. It is about 40 residues long and resembles the WD repeat pfam00400. The repeats have a catalytic activity in the peptidyl-glycine alpha-amidating monooxygenase (PAM), proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localised to the repeats. The human tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is me diated by the NHL repeats.
Changed!
COG COG3391 243aa 1e-05
in ref transcript
Uncharacterized conserved protein [Function unknown].
Changed!
COG COG3391 153aa 9e-07
in modified transcript
LOC388276
refseq_LOC388276.F1 refseq_LOC388276.R1 106 234
NCBIGene 36.2 388276
Multiple exon skipping, size difference: 128
Inclusion in the protein causing a new stop codon, Inclusion in the protein causing a frameshift
Reference transcript: XM_931511
LOC389073
refseq_LOC389073.F1 refseq_LOC389073.R1 119 238
NCBIGene 36.2 389073
Single exon skipping, size difference: 119
Exclusion in the protein causing a frameshift
Reference transcript: XM_371592
LOC441155
refseq_LOC441155.F1 refseq_LOC441155.R1 108 390
NCBIGene 36.2 441155
Alternative 3-prime, size difference: 282
Inclusion in 5'UTR
Reference transcript: XM_930997
LOC51149
refseq_LOC51149.F1 refseq_LOC51149.R1 155 235
NCBIGene 36.2 51149
Single exon skipping, size difference: 80
Inclusion in the protein causing a frameshift
Reference transcript: NM_016175
LOC643446
refseq_LOC643446.F1 refseq_LOC643446.R1 149 237
NCBIGene 36.3 643446
Alternative 5-prime, size difference: 88
Exclusion in 5'UTR
Reference transcript: XM_001130637
cd RRM 73aa 3e-15
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
smart RRM 70aa 8e-14
in ref transcript
RNA recognition motif.
COG COG0724 76aa 3e-05
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
LOC643446
refseq_LOC643446.F3 refseq_LOC643446.R3 149 337
NCBIGene 36.3 643446
Alternative 3-prime, size difference: 188
Exclusion of the protein initiation site
Reference transcript: XM_001130681
cd RRM 73aa 3e-15
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
smart RRM 70aa 8e-14
in ref transcript
RNA recognition motif.
COG COG0724 76aa 3e-05
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
LOC643847
refseq_LOC643847.F1 refseq_LOC643847.R1 101 131
NCBIGene 36.2 643847
Alternative 3-prime, size difference: 30
Exclusion in the protein (no frameshift)
Reference transcript: XM_001129508
cd pepsin_A 323aa 1e-173
in ref transcript
Pepsin A, aspartic protease produced in gastric mucosa of mammals. Pepsin, a well-known aspartic protease, is produced by the human gastric mucosa in seven different zymogen isoforms, subdivided into two types: pepsinogen A and pepsinogen C. The prosequence of the zymogens are self cleaved under acidic pH. The mature enzymes are called pepsin A and pepsin C, correspondingly. The well researched porcine pepsin is also in this pepsin A family. Pepsins play an integral role in the digestion process of vertebrates. Pepsins are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. More recently evolved enzymes have similar three-dimensional structures, however their amino acid sequences are more divergent except for the conserved catalytic site motif. Pepsins specifically cleave bonds in peptides which have at least six residues in length with hydrophobic residues in both the P1 and P1' positions. The active site is located at the groove formed by the two lobes, with an extended loop projecting over the cleft to form an 11-residue flap, which encloses substrates and inhibitors in the active site. Specificity is determined by nearest-neighbor hydrophobic residues surrounding the catalytic aspartates, and by three residues in the flap. This family of aspartate proteases is classified by MEROPS as the peptidase family A1 (pepsin A, clan AA).
pfam Asp 315aa 1e-129
in ref transcript
Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), which are much smaller and appear to be homologous to a single domain of the eukaryotic asp proteases.
Changed!
pfam A1_Propeptide 29aa 7e-09
in ref transcript
A1 Propeptide. Most eukaryotic endopeptidases (Merops Family A1) are synthesised with signal and propeptides. The animal pepsin-like endopeptidase propeptides form a distinct family of propeptides, which contain a conserved motif approximately 30 residues long. In pepsinogen A, the first 11 residues of the mature pepsin sequence are displaced by residues of the propeptide. The propeptide contains two helices that block the active site cleft, in particular the conserved Asp11 residue, in pepsin, hydrogen bonds to a conserved Arg residues in the propeptide. This hydrogen bond stabilises the propeptide conformation and is probably responsible for triggering the conversion of pepsinogen to pepsin under acidic conditions.
Changed!
PTZ PTZ00165 367aa 2e-63
in ref transcript
aspartyl protease; Provisional.
Changed!
PTZ PTZ00165 327aa 2e-63
in modified transcript
LOC645687
refseq_LOC645687.F1 refseq_LOC645687.R1 150 356
NCBIGene 36.2 645687
Alternative 3-prime, size difference: 206
Exclusion of the stop codon
Reference transcript: XM_001133543
LOC653423
refseq_LOC653423.F1 refseq_LOC653423.R1 319 397
NCBIGene 36.2 653423
Single exon skipping, size difference: 78
Exclusion in the protein (no frameshift)
Reference transcript: XM_933397
pfam Sperm_Ag_HE2 70aa 2e-28
in ref transcript
Sperm antigen HE2. This family consists of several variants of the human and chimpanzee sperm antigen proteins (HE2 and EP2 respectively). The EP2 gene codes for a family of androgen-dependent, epididymis-specific secretory proteins.The EP2 gene uses alternative promoters and differential splicing to produce a family of variant messages. The translated putative protein variants differ significantly from each other. Some of these putative proteins have similarity to beta-defensins, a family of antimicrobial peptides.
LOC653423
refseq_LOC653423.F2 refseq_LOC653423.R2 148 224
NCBIGene 36.2 653423
Single exon skipping, size difference: 76
Inclusion in the protein causing a frameshift
Reference transcript: XM_929074
Changed!
pfam Sperm_Ag_HE2 70aa 2e-30
in ref transcript
Sperm antigen HE2. This family consists of several variants of the human and chimpanzee sperm antigen proteins (HE2 and EP2 respectively). The EP2 gene codes for a family of androgen-dependent, epididymis-specific secretory proteins.The EP2 gene uses alternative promoters and differential splicing to produce a family of variant messages. The translated putative protein variants differ significantly from each other. Some of these putative proteins have similarity to beta-defensins, a family of antimicrobial peptides.
Changed!
pfam Defensin_beta 36aa 3e-05
in ref transcript
Beta defensin. The beta defensins are antimicrobial peptides implicated in the resistance of epithelial surfaces to microbial colonisation.
Changed!
pfam Sperm_Ag_HE2 74aa 8e-30
in modified transcript
LOC653423
refseq_LOC653423.F4 refseq_LOC653423.R4 263 369
NCBIGene 36.2 653423
Multiple exon skipping, size difference: 106
Inclusion in the protein causing a frameshift, Inclusion in the protein causing a new stop codon
Reference transcript: XM_933397
pfam Sperm_Ag_HE2 70aa 2e-28
in ref transcript
Sperm antigen HE2. This family consists of several variants of the human and chimpanzee sperm antigen proteins (HE2 and EP2 respectively). The EP2 gene codes for a family of androgen-dependent, epididymis-specific secretory proteins.The EP2 gene uses alternative promoters and differential splicing to produce a family of variant messages. The translated putative protein variants differ significantly from each other. Some of these putative proteins have similarity to beta-defensins, a family of antimicrobial peptides.
LOC653489
refseq_LOC653489.F1 refseq_LOC653489.R1 102 120
NCBIGene 36.2 653489
Alternative 5-prime and 3-prime, size difference: 18
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: XM_934115
cd RanBD 121aa 2e-35
in ref transcript
Ran-binding domain; This domain of approximately 150 residues shares structural similarity to the PH domain, but lacks detectable sequence similarity. Ran is a Ras-like nuclear small GTPase, which regulates receptor-mediated transport between the nucleus and the cytoplasm. RanGTP hydrolysis is stimulated by RanGAP together with the Ran-binding domain containing acessory proteins RanBP1 and RanBP2. These accessory proteins stabilize the active GTP-bound form of Ran . The Ran-binding domain is found in multiple copies in Nuclear pore complex proteins.
cd RanBD 117aa 1e-34
in ref transcript
cd TPR 97aa 1e-05
in ref transcript
Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
pfam Ran_BP1 122aa 1e-50
in ref transcript
RanBP1 domain.
pfam Ran_BP1 120aa 3e-50
in ref transcript
pfam GRIP 45aa 3e-09
in ref transcript
GRIP domain. The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi. The GRIP domain contains a completely conserved tyrosine residue. At least some of these domains have been shown to bind to GTPase Arl1, see structures.
TIGR type_IV_pilW 152aa 0.008
in ref transcript
Members of this family are designated PilF in ref (PMID:8973346) and PilW in ref (PMID:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
COG YRB1 160aa 2e-31
in ref transcript
Ran GTPase-activating protein (Ran-binding protein) [Intracellular trafficking and secretion].
COG YRB1 160aa 2e-23
in ref transcript
LOC653689
refseq_LOC653689.F1 refseq_LOC653689.R1 124 166
NCBIGene 36.2 653689
Alternative 5-prime, size difference: 42
Exclusion in the protein (no frameshift)
Reference transcript: XM_928923
Changed!
cd GST_C_Theta 126aa 4e-39
in ref transcript
GST_C family, Class Theta subfamily; composed of eukaryotic class Theta GSTs and bacterial dichloromethane (DCM) dehalogenase. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. Mammalian class Theta GSTs show poor GSH conjugating activity towards the standard substrates, CDNB and ethacrynic acid, differentiating them from other mammalian GSTs. GSTT1-1 shows similar cataytic activity as bacterial DCM dehalogenase, catalyzing the GSH-dependent hydrolytic dehalogenation of dihalomethanes. This is an essential process in methylotrophic bacteria to enable them to use chloromethane and DCM as sole carbon and energy sources. The presence of polymorphisms in human GSTT1-1 and its relationship to the onset of diseases including cancer is subject of many studies. Human GSTT2-2 exhibits a highly specific sulfatase activity, catalyzing the cleavage of sulfate ions from aralkyl sufate esters, but not from the aryl or alkyl sulfate esters.
cd GST_N_Theta 76aa 3e-27
in ref transcript
GST_N family, Class Theta subfamily; composed of eukaryotic class Theta GSTs and bacterial dichloromethane (DCM) dehalogenase. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. The GST fold contains an N-terminal TRX-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. Mammalian class Theta GSTs show poor GSH conjugating activity towards the standard substrates, CDNB and ethacrynic acid, differentiating them from other mammalian GSTs. GSTT1-1 shows similar cataytic activity as bacterial DCM dehalogenase, catalyzing the GSH-dependent hydrolytic dehalogenation of dihalomethanes. This is an essential process in methylotrophic bacteria to enable them to use chloromethane and DCM as sole carbon and energy sources. The presence of polymorphisms in human GSTT1-1 and its relationship to the onset of diseases including cancer is subject of many studies. Human GSTT2-2 exhibits a highly specific sulfatase activity, catalyzing the cleavage of sulfate ions from aralkyl sufate esters, but not from aryl or alkyl sulfate esters.
Changed!
TIGR maiA 143aa 2e-11
in ref transcript
Maleylacetoacetate isomerase is an enzyme of tyrosine and phenylalanine catabolism. It requires glutathione and belongs by homology to the zeta family of glutathione S-transferases. The enzyme (EC 5.2.1.2) is described as active also on maleylpyruvate, and the example from a Ralstonia sp. catabolic plasmid is described as a maleylpyruvate isomerase involved in gentisate catabolism.
Changed!
COG Gst 208aa 3e-19
in ref transcript
Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones].
Changed!
cd GST_C_Theta 112aa 4e-30
in modified transcript
Changed!
TIGR maiA 134aa 7e-10
in modified transcript
Changed!
COG Gst 194aa 2e-16
in modified transcript
LOC653820
refseq_LOC653820.F1 refseq_LOC653820.R1 278 398
NCBIGene 36.2 653820
Alternative 5-prime, size difference: 120
Exclusion in the protein (no frameshift)
Reference transcript: XM_930579
LOC727761
refseq_LOC727761.F2 refseq_LOC727761.R2 139 211
NCBIGene 36.3 727761
Alternative 3-prime, size difference: 72
Exclusion in the protein (no frameshift)
Reference transcript: XM_001126211
Changed!
cd TMPK 185aa 3e-23
in ref transcript
Thymidine monophosphate kinase (TMPK), also known as thymidylate kinase, catalyzes the phosphorylation of thymidine monophosphate (TMP) to thymidine diphosphate (TDP) utilizing ATP as its preferred phophoryl donor. TMPK represents the rate-limiting step in either de novo or salvage biosynthesis of thymidine triphosphate (TTP).
Changed!
pfam Thymidylate_kin 181aa 3e-54
in ref transcript
Thymidylate kinase.
Changed!
COG Tmk 188aa 2e-32
in ref transcript
Thymidylate kinase [Nucleotide transport and metabolism].
Changed!
cd TMPK 161aa 4e-17
in modified transcript
Changed!
pfam Thymidylate_kin 157aa 7e-41
in modified transcript
Changed!
COG Tmk 164aa 2e-23
in modified transcript
LOC727761
refseq_LOC727761.F4 refseq_LOC727761.R3 98 227
NCBIGene 36.3 727761
Exon skipping and alternative 3-prime or 5-prime, size difference: 129
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: XM_001126211
Changed!
cd TMPK 185aa 3e-23
in ref transcript
Thymidine monophosphate kinase (TMPK), also known as thymidylate kinase, catalyzes the phosphorylation of thymidine monophosphate (TMP) to thymidine diphosphate (TDP) utilizing ATP as its preferred phophoryl donor. TMPK represents the rate-limiting step in either de novo or salvage biosynthesis of thymidine triphosphate (TTP).
Changed!
pfam Thymidylate_kin 181aa 3e-54
in ref transcript
Thymidylate kinase.
Changed!
COG Tmk 188aa 2e-32
in ref transcript
Thymidylate kinase [Nucleotide transport and metabolism].
Changed!
cd TMPK 142aa 2e-11
in modified transcript
Changed!
pfam Thymidylate_kin 138aa 2e-31
in modified transcript
Changed!
COG Tmk 145aa 4e-16
in modified transcript
LOC727778
refseq_LOC727778.F2 refseq_LOC727778.R2 273 334
NCBIGene 36.2 727778
Alternative 5-prime, size difference: 61
Exclusion in the protein causing a frameshift
Reference transcript: XM_001125727
Changed!
pfam DUF1242 36aa 3e-05
in ref transcript
Protein of unknown function (DUF1242). This family consists of a number of eukaryotic proteins of around 72 residues in length. The function of this family is unknown.
LOC728005
refseq_LOC728005.F1 refseq_LOC728005.R1 150 315
NCBIGene 36.2 728005
Multiple exon skipping, size difference: 165
Inclusion in 5'UTR, Inclusion in 5'UTR
Reference transcript: XM_001127140
cd PH_centaurin 61aa 2e-11
in ref transcript
Centaurin Pleckstrin homology (PH) domain. Centaurin beta and gamma consist of a PH domain, an ArfGAP domain and three ankyrin repeats. Centaurain gamma also has an N-terminal Ras homology domain. Centaurin alpha has a different domain architecture and its PH domain is in a different subfamily. Centaurin can bind to phosphatidlyinositol (3,4,5)P3. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
cd ANK 90aa 3e-11
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
cd PH_centaurin 32aa 2e-04
in ref transcript
pfam ArfGap 117aa 6e-34
in ref transcript
Putative GTPase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs.
smart PH 64aa 8e-08
in ref transcript
Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
pfam PH 160aa 1e-06
in ref transcript
PH domain. PH stands for pleckstrin homology.
pfam Ank 32aa 8e-06
in ref transcript
Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
COG COG5347 117aa 4e-23
in ref transcript
GTPase-activating protein that regulates ARFs (ADP-ribosylation factors), involved in ARF-mediated vesicular transport [Intracellular trafficking and secretion].
Exon skipping and alternative 3-prime or 5-prime, size difference: 30
Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: XM_001128423
LOC728239
refseq_LOC728239.F2 refseq_LOC728239.R2 215 263
NCBIGene 36.2 728239
Single exon skipping, size difference: 48
Exclusion in the protein (no frameshift)
Reference transcript: XM_001127104
Changed!
pfam MAGE 186aa 1e-71
in ref transcript
MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.
PRK PRK07003 168aa 0.002
in ref transcript
DNA polymerase III subunits gamma and tau; Validated.
Changed!
pfam MAGE 170aa 8e-75
in modified transcript
LOC728461
refseq_LOC728461.F2 refseq_LOC728461.R2 198 249
NCBIGene 36.2 728461
Alternative 5-prime, size difference: 51
Exclusion in the protein (no frameshift)
Reference transcript: XM_001126956
LOC728635
refseq_LOC728635.F2 refseq_LOC728635.R2 239 350
NCBIGene 36.2 728635
Multiple exon skipping, size difference: 111
Exclusion in the protein (no frameshift), Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: XM_001131318
Changed!
TIGR 3oxo_ACP_reduc 242aa 1e-45
in ref transcript
This model represents 3-oxoacyl-[ACP] reductase, also called 3-ketoacyl-acyl carrier protein reductase, an enzyme of fatty acid biosynthesis.
Changed!
TIGR 3oxo_ACP_reduc 205aa 3e-37
in modified transcript
Changed!
PRK fabG 210aa 2e-44
in modified transcript
LOC728742
refseq_LOC728742.F1 refseq_LOC728742.R1 148 246
NCBIGene 36.2 728742
Single exon skipping, size difference: 98
Inclusion in the protein causing a new stop codon
Reference transcript: XM_001128742
Changed!
cd CobW_like 165aa 7e-44
in ref transcript
The function of this protein family is unkown. The amino acid sequence of YjiA protein in E. coli contains several conserved motifs that characterizes it as a P-loop GTPase. YijA gene is among the genes significantly induced in response to DNA-damage caused by mitomycin. YijA gene is a homologue of the CobW gene which encodes the cobalamin synthesis protein/P47K.
Changed!
TIGR CobW 328aa 9e-39
in ref transcript
A broader CobW family is delineated by two PFAM models which identify the N- and C-terminal domains (pfam02492 and pfam07683).
Changed!
COG COG0523 335aa 8e-67
in ref transcript
Putative GTPases (G3E family) [General function prediction only].
LOC728835
refseq_LOC728835.F1 refseq_LOC728835.R1 165 206
NCBIGene 36.2 728835
Alternative 3-prime, size difference: 41
Inclusion in the protein causing a new stop codon
Reference transcript: XM_001133183
cd Chemokine_CC 31aa 6e-06
in ref transcript
Chemokine_CC: 1 of 4 subgroup designations based on the arrangement of the two N-terminal cysteine residues; includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity; some members (e.g. 2HCC) contain an additional disulfide bond which is thought to compensate for the highly conserved Trp missing in these; chemotatic for monocytes, macrophages, eosinophils, basophils, and T cells, but not neutrophils; exist as monomers and dimers, but are believed to be functional as monomers; found only in vertebrates and a few viruses; a subgroup of CC, identified by an N-terminal DCCL motif (Exodus-1, Exodus-2, and Exodus-3), has been shown to inhibit specific types of human cancer cell growth in a mouse model. See CDs: Chemokine (cd00169) for the general alignment of chemokines, or Chemokine_CXC (cd00273), Chemokine_C (cd00271), and Chemokine_CX3C (cd00274) for the additional chemokine subgroups, and Chemokine_CC_DCCL for the DCCL subgroup of this CD.
Changed!
pfam IL8 41aa 2e-08
in ref transcript
Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds.
Changed!
pfam IL8 40aa 1e-07
in modified transcript
LOC728860
refseq_LOC728860.F1 refseq_LOC728860.R1 100 128
NCBIGene 36.2 728860
Alternative 3-prime, size difference: 28
Inclusion in the protein causing a frameshift
Reference transcript: XM_001133267
Changed!
cd ARM 113aa 8e-22
in ref transcript
Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
Changed!
cd ARM 121aa 7e-21
in ref transcript
Changed!
cd ARM 126aa 1e-18
in ref transcript
Changed!
pfam IBB 95aa 8e-29
in ref transcript
Importin beta binding domain. This family consists of the importin alpha (karyopherin alpha), importin beta (karyopherin beta) binding domain. The domain mediates formation of the importin alpha beta complex; required for classical NLS import of proteins into the nucleus, through the nuclear pore complex and across the nuclear envelope. Also in the alignment is the NLS of importin alpha which overlaps with the IBB domain.
Changed!
pfam Arm 41aa 4e-07
in ref transcript
Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
Changed!
pfam Arm 40aa 5e-07
in ref transcript
Changed!
smart ARM 41aa 2e-05
in ref transcript
Armadillo/beta-catenin-like repeats. Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
Changed!
pfam Arm 39aa 2e-04
in ref transcript
Changed!
pfam Arm 39aa 4e-04
in ref transcript
Changed!
pfam Arm 41aa 9e-04
in ref transcript
Changed!
smart ARM 40aa 0.001
in ref transcript
Changed!
COG SRP1 519aa 1e-137
in ref transcript
Karyopherin (importin) alpha [Intracellular trafficking and secretion].
LOC728860
refseq_LOC728860.F3 refseq_LOC728860.R3 157 187
NCBIGene 36.2 728860
Alternative 3-prime, size difference: 30
Inclusion in 5'UTR
Reference transcript: XM_001133267
cd ARM 113aa 8e-22
in ref transcript
Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
cd ARM 121aa 7e-21
in ref transcript
cd ARM 126aa 1e-18
in ref transcript
pfam IBB 95aa 8e-29
in ref transcript
Importin beta binding domain. This family consists of the importin alpha (karyopherin alpha), importin beta (karyopherin beta) binding domain. The domain mediates formation of the importin alpha beta complex; required for classical NLS import of proteins into the nucleus, through the nuclear pore complex and across the nuclear envelope. Also in the alignment is the NLS of importin alpha which overlaps with the IBB domain.
pfam Arm 41aa 4e-07
in ref transcript
Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
pfam Arm 40aa 5e-07
in ref transcript
smart ARM 41aa 2e-05
in ref transcript
Armadillo/beta-catenin-like repeats. Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
pfam Arm 39aa 2e-04
in ref transcript
pfam Arm 39aa 4e-04
in ref transcript
pfam Arm 41aa 9e-04
in ref transcript
smart ARM 40aa 0.001
in ref transcript
COG SRP1 519aa 1e-137
in ref transcript
Karyopherin (importin) alpha [Intracellular trafficking and secretion].
ANAPC11
refseq_LOC728919.F1 refseq_ANAPC11.R1 94 102
NCBIGene 36.3 51529
Alternative 5-prime, size difference: 8
Exclusion in 5'UTR
Reference transcript: NM_001002248
LOC728919
refseq_LOC728919.F1 refseq_LOC728919.R1 101 123
NCBIGene 36.2 728919
Alternative 5-prime, size difference: 22
Exclusion in 5'UTR
Reference transcript: XM_001133292
COG APC11 88aa 2e-19
in ref transcript
Component of SCF ubiquitin ligase and anaphase-promoting complex [Posttranslational modification, protein turnover, chaperones / Cell division and chromosome partitioning].
LOC729059
refseq_LOC729059.F2 refseq_LOC729059.R2 301 400
NCBIGene 36.2 729059
Alternative 3-prime, size difference: 99
Exclusion in the protein (no frameshift)
Reference transcript: XM_001133127
LOC729264
refseq_LOC729264.F1 refseq_LOC729264.R1 156 280
NCBIGene 36.3 729264
Alternative 5-prime, size difference: 124
Exclusion in the protein causing a frameshift
Reference transcript: XM_001133674
LOC729355
refseq_LOC729264.F1 refseq_LOC729355.R1 154 278
NCBIGene 36.2 729355
Alternative 5-prime, size difference: 124
Exclusion in the protein causing a frameshift
Reference transcript: XM_001133681
LOC730258
refseq_LOC730258.F1 refseq_LOC730258.R1 209 327
NCBIGene 36.2 730258
Single exon skipping, size difference: 118
Exclusion in 5'UTR
Reference transcript: XM_001134401
LONRF3
refseq_LONRF3.F2 refseq_LONRF3.R2 257 380
NCBIGene 36.3 79836
Single exon skipping, size difference: 123
Exclusion in the protein (no frameshift)
Reference transcript: NM_001031855
Changed!
cd TPR 96aa 3e-10
in ref transcript
Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
cd RING 40aa 4e-06
in ref transcript
RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
pfam LON 172aa 4e-19
in ref transcript
ATP-dependent protease La (LON) domain.
TIGR rad18 86aa 1e-09
in ref transcript
This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Changed!
TIGR 3a0801s09 115aa 1e-07
in ref transcript
TIGR rad18 40aa 7e-06
in ref transcript
COG COG2802 197aa 1e-15
in ref transcript
Uncharacterized protein, similar to the N-terminal domain of Lon protease [General function prediction only].
Changed!
TIGR 3a0801s09 66aa 0.001
in modified transcript
LRDD
refseq_LRDD.F1 LRDD.u.r.22 121 154
NCBIGene 36.3 55367
Alternative 5-prime, size difference: 33
Exclusion in the protein (no frameshift)
Reference transcript: NM_145886
pfam Death 79aa 3e-09
in ref transcript
Death domain.
pfam Peptidase_S68 34aa 2e-08
in ref transcript
Peptidase S68. This family of serine peptidases contains PIDD proteins. PIDD forms a complex with RAIDD and procaspase-2 that is known as the 'PIDDosome'. The PIDDosome forms when DNA damage occurs and either activates NF-kappaB, leading to cell survival, or caspase-2, which leads to apoptosis.
smart ZU5 63aa 9e-04
in ref transcript
Domain present in ZO-1 and Unc5-like netrin receptors. Domain of unknown function.
COG COG4886 96aa 3e-07
in ref transcript
Leucine-rich repeat (LRR) protein [Function unknown].
LRDD
refseq_LRDD.F4 refseq_LRDD.R4 260 320
NCBIGene 36.3 55367
Alternative 3-prime, size difference: 60
Inclusion in the protein causing a new stop codon
Reference transcript: NM_145886
Changed!
pfam Death 79aa 3e-09
in ref transcript
Death domain.
Changed!
pfam Peptidase_S68 34aa 2e-08
in ref transcript
Peptidase S68. This family of serine peptidases contains PIDD proteins. PIDD forms a complex with RAIDD and procaspase-2 that is known as the 'PIDDosome'. The PIDDosome forms when DNA damage occurs and either activates NF-kappaB, leading to cell survival, or caspase-2, which leads to apoptosis.
Changed!
smart ZU5 63aa 9e-04
in ref transcript
Domain present in ZO-1 and Unc5-like netrin receptors. Domain of unknown function.
Changed!
COG COG4886 96aa 3e-07
in ref transcript
Leucine-rich repeat (LRR) protein [Function unknown].
LRDD
refseq_LRDD.F6 refseq_LRDD.R6 145 196
NCBIGene 36.3 55367
Alternative 5-prime, size difference: 51
Exclusion in the protein (no frameshift)
Reference transcript: NM_145886
pfam Death 79aa 3e-09
in ref transcript
Death domain.
pfam Peptidase_S68 34aa 2e-08
in ref transcript
Peptidase S68. This family of serine peptidases contains PIDD proteins. PIDD forms a complex with RAIDD and procaspase-2 that is known as the 'PIDDosome'. The PIDDosome forms when DNA damage occurs and either activates NF-kappaB, leading to cell survival, or caspase-2, which leads to apoptosis.
smart ZU5 63aa 9e-04
in ref transcript
Domain present in ZO-1 and Unc5-like netrin receptors. Domain of unknown function.
COG COG4886 96aa 3e-07
in ref transcript
Leucine-rich repeat (LRR) protein [Function unknown].
LRP8
refseq_LRP8.F2 refseq_LRP8.R2 167 392
NCBIGene 36.3 7804
Single exon skipping, size difference: 225
Exclusion in the protein (no frameshift)
Reference transcript: NM_004631
cd LDLa 32aa 7e-06
in ref transcript
Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure.
cd LDLa 35aa 1e-05
in ref transcript
cd LDLa 36aa 2e-04
in ref transcript
cd LDLa 37aa 3e-04
in ref transcript
cd LDLa 27aa 0.002
in ref transcript
pfam Ldl_recept_b 41aa 2e-11
in ref transcript
Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure.
smart LY 43aa 1e-10
in ref transcript
Low-density lipoprotein-receptor YWTD domain. Type "B" repeats in low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.
smart LY 43aa 3e-08
in ref transcript
smart LDLa 33aa 9e-07
in ref transcript
Low-density lipoprotein receptor domain class A. Cysteine-rich repeat in the low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. The N-terminal type A repeats in LDL receptor bind the lipoproteins. Other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement. Mutations in the LDL receptor gene cause familial hypercholesterolemia.
pfam Ldl_recept_b 40aa 1e-06
in ref transcript
pfam Ldl_recept_a 37aa 4e-06
in ref transcript
Low-density lipoprotein receptor domain class A.
pfam Ldl_recept_a 38aa 8e-06
in ref transcript
pfam Ldl_recept_a 27aa 3e-05
in ref transcript
smart EGF_CA 31aa 8e-04
in ref transcript
Calcium-binding EGF-like domain.
smart LY 36aa 0.002
in ref transcript
smart LDLa 34aa 0.003
in ref transcript
pfam Ldl_recept_b 44aa 0.005
in ref transcript
Changed!
COG COG3391 189aa 3e-05
in ref transcript
Uncharacterized conserved protein [Function unknown].
Changed!
COG COG3391 205aa 1e-05
in modified transcript
LRP8
refseq_LRP8.F3 refseq_LRP8.R3 100 487
NCBIGene 36.3 7804
Single exon skipping, size difference: 387
Exclusion in the protein (no frameshift)
Reference transcript: NM_004631
cd LDLa 32aa 7e-06
in ref transcript
Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure.
cd LDLa 35aa 1e-05
in ref transcript
Changed!
cd LDLa 36aa 2e-04
in ref transcript
cd LDLa 37aa 3e-04
in ref transcript
Changed!
cd LDLa 27aa 0.002
in ref transcript
pfam Ldl_recept_b 41aa 2e-11
in ref transcript
Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure.
smart LY 43aa 1e-10
in ref transcript
Low-density lipoprotein-receptor YWTD domain. Type "B" repeats in low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.
smart LY 43aa 3e-08
in ref transcript
smart LDLa 33aa 9e-07
in ref transcript
Low-density lipoprotein receptor domain class A. Cysteine-rich repeat in the low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. The N-terminal type A repeats in LDL receptor bind the lipoproteins. Other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement. Mutations in the LDL receptor gene cause familial hypercholesterolemia.
pfam Ldl_recept_b 40aa 1e-06
in ref transcript
pfam Ldl_recept_a 37aa 4e-06
in ref transcript
Low-density lipoprotein receptor domain class A.
Changed!
pfam Ldl_recept_a 38aa 8e-06
in ref transcript
Changed!
pfam Ldl_recept_a 27aa 3e-05
in ref transcript
smart EGF_CA 31aa 8e-04
in ref transcript
Calcium-binding EGF-like domain.
smart LY 36aa 0.002
in ref transcript
smart LDLa 34aa 0.003
in ref transcript
pfam Ldl_recept_b 44aa 0.005
in ref transcript
COG COG3391 189aa 3e-05
in ref transcript
Uncharacterized conserved protein [Function unknown].
LRP8
refseq_LRP8.F6 refseq_LRP8.R6 136 313
NCBIGene 36.3 7804
Single exon skipping, size difference: 177
Exclusion in the protein (no frameshift)
Reference transcript: NM_004631
cd LDLa 32aa 7e-06
in ref transcript
Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure.
cd LDLa 35aa 1e-05
in ref transcript
cd LDLa 36aa 2e-04
in ref transcript
cd LDLa 37aa 3e-04
in ref transcript
cd LDLa 27aa 0.002
in ref transcript
pfam Ldl_recept_b 41aa 2e-11
in ref transcript
Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure.
smart LY 43aa 1e-10
in ref transcript
Low-density lipoprotein-receptor YWTD domain. Type "B" repeats in low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.
smart LY 43aa 3e-08
in ref transcript
smart LDLa 33aa 9e-07
in ref transcript
Low-density lipoprotein receptor domain class A. Cysteine-rich repeat in the low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. The N-terminal type A repeats in LDL receptor bind the lipoproteins. Other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement. Mutations in the LDL receptor gene cause familial hypercholesterolemia.
pfam Ldl_recept_b 40aa 1e-06
in ref transcript
pfam Ldl_recept_a 37aa 4e-06
in ref transcript
Low-density lipoprotein receptor domain class A.
pfam Ldl_recept_a 38aa 8e-06
in ref transcript
pfam Ldl_recept_a 27aa 3e-05
in ref transcript
smart EGF_CA 31aa 8e-04
in ref transcript
Calcium-binding EGF-like domain.
smart LY 36aa 0.002
in ref transcript
smart LDLa 34aa 0.003
in ref transcript
pfam Ldl_recept_b 44aa 0.005
in ref transcript
COG COG3391 189aa 3e-05
in ref transcript
Uncharacterized conserved protein [Function unknown].
LRRC17
refseq_LRRC17.F1 refseq_LRRC17.R1 198 251
NCBIGene 36.3 10234
Single exon skipping, size difference: 53
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001031692
Changed!
cd LRR_RI 93aa 0.008
in ref transcript
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
Changed!
COG COG4886 163aa 1e-05
in ref transcript
Leucine-rich repeat (LRR) protein [Function unknown].
LRRC20
refseq_LRRC20.F1 refseq_LRRC20.R1 127 295
NCBIGene 36.3 55222
Single exon skipping, size difference: 168
Exclusion in the protein (no frameshift)
Reference transcript: NM_207119
Changed!
cd LRR_RI 84aa 0.003
in ref transcript
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
Changed!
COG COG4886 126aa 4e-04
in ref transcript
Leucine-rich repeat (LRR) protein [Function unknown].
LRRC20
refseq_LRRC20.F3 refseq_LRRC20.R3 187 337
NCBIGene 36.3 55222
Single exon skipping, size difference: 150
Exclusion in the protein (no frameshift)
Reference transcript: NM_207119
Changed!
cd LRR_RI 84aa 0.003
in ref transcript
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
Changed!
COG COG4886 126aa 4e-04
in ref transcript
Leucine-rich repeat (LRR) protein [Function unknown].
Changed!
cd LRR_RI 74aa 0.009
in modified transcript
Changed!
COG COG4886 84aa 0.001
in modified transcript
LRRC23
refseq_LRRC23.F1 refseq_LRRC23.R1 100 398
NCBIGene 36.3 10233
Single exon skipping, size difference: 298
Exclusion of the stop codon
Reference transcript: NM_201650
Changed!
COG COG4886 163aa 0.008
in ref transcript
Leucine-rich repeat (LRR) protein [Function unknown].
Changed!
COG COG4886 169aa 3e-05
in modified transcript
LRRFIP2
refseq_LRRFIP2.F1 refseq_LRRFIP2.R1 254 326
NCBIGene 36.3 9209
Single exon skipping, size difference: 72
Exclusion in the protein (no frameshift)
Reference transcript: NM_006309
Changed!
pfam DUF2051 354aa 1e-54
in ref transcript
Double stranded RNA binding protein (DUF2051). This is a novel protein identified as interacting with the leucine-rich repeat domain of human flightless-I, FliI protein.
COG SbcC 333aa 3e-07
in ref transcript
ATPase involved in DNA repair [DNA replication, recombination, and repair].
Changed!
pfam DUF2051 330aa 1e-55
in modified transcript
LRRN2
refseq_LRRN5.F1 refseq_LRRN5.R1 270 398
NCBIGene 36.3 10446
Single exon skipping, size difference: 128
Exclusion in 5'UTR
Reference transcript: NM_006338
cd IGcam 92aa 6e-14
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd LRR_RI 174aa 6e-05
in ref transcript
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
cd LRR_RI 236aa 1e-04
in ref transcript
pfam I-set 85aa 2e-13
in ref transcript
Immunoglobulin I-set domain.
TIGR PCC 79aa 3e-09
in ref transcript
Note: this model is restricted to the amino half because a full-length model is incompatible with the HMM software package.
COG COG4886 241aa 1e-10
in ref transcript
Leucine-rich repeat (LRR) protein [Function unknown].
LSR
refseq_LSR.F2 refseq_LSR.R2 205 262
NCBIGene 36.3 51599
Single exon skipping, size difference: 57
Exclusion in the protein (no frameshift)
Reference transcript: NM_205834
LST1
refseq_LST1.F2 refseq_LST1.R2 180 225
NCBIGene 36.3 7940
Alternative 3-prime, size difference: 45
Exclusion of the protein initiation site
Reference transcript: NM_205839
Changed!
pfam LST1 39aa 3e-08
in modified transcript
LST-1 protein. B144/LST1 is a gene encoded in the human major histocompatibility complex that produces multiple forms of alternatively spliced mRNA and encodes peptides fewer than 100 amino acids in length. B144/LST1 is strongly expressed in dendritic cells. Transfection of B144/LST1 into a variety of cells induces morphologic changes including the production of long, thin filopodia.
LTB
refseq_LTB.F1 refseq_LTB.R1 160 206
NCBIGene 36.3 4050
Single exon skipping, size difference: 46
Exclusion in the protein causing a frameshift
Reference transcript: NM_002341
Changed!
cd TNF 154aa 1e-27
in ref transcript
Tumor Necrosis Factor; TNF superfamily members include the cytokines: TNF (TNF-alpha), LT (lymphotoxin-alpha, TNF-beta), CD40 ligand, Apo2L (TRAIL), Fas ligand, and osteoprotegerin (OPG) ligand. These proteins generally have an intracellular N-terminal domain, a short transmembrane segment, an extracellular stalk, and a globular TNF-like extracellular domain of about 150 residues. They initiate apoptosis by binding to related receptors, some of which have intracellular death domains. They generally form homo- or hetero- trimeric complexes.TNF cytokines bind one elongated receptor molecule along each of three clefts formed by neighboring monomers of the trimer with ligand trimerization a requiste for receptor binding.
Changed!
smart TNF 142aa 3e-29
in ref transcript
Tumour necrosis factor family. Family of cytokines that form homotrimeric or heterotrimeric complexes. TNF mediates mature T-cell receptor-induced apoptosis through the p75 TNF receptor.
LTK
refseq_LTK.F1 refseq_LTK.R1 129 312
NCBIGene 36.3 4058
Single exon skipping, size difference: 183
Exclusion in the protein (no frameshift)
Reference transcript: NM_002344
cd PTKc_ALK_LTK 277aa 1e-160
in ref transcript
Catalytic Domain of the Protein Tyrosine Kinases, Anaplastic Lymphoma Kinase and Leukocyte Tyrosine Kinase. Protein Tyrosine Kinase (PTK) family; Anaplastic lymphoma kinase (ALK) and Leukocyte tyrosine (tyr) kinase (LTK); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyr residues in protein substrates. ALK and LTK are orphan receptor tyr kinases (RTKs) whose ligands are not yet well-defined. RTKs contain an extracellular ligand-binding domain, a transmembrane region, and an intracellular tyr kinase domain. They are usually activated through ligand binding, which causes dimerization and autophosphorylation of the intracellular tyr kinase catalytic domain. ALK appears to play an important role in mammalian neural development as well as visceral muscle differentiation in Drosophila. ALK is aberrantly expressed as fusion proteins, due to chromosomal translocations, in about 60% of anaplastic large cell lymphomas (ALCLs). ALK fusion proteins are also found in rare cases of diffuse large B cell lymphomas (DLBCLs). LTK is mainly expressed in B lymphocytes and neuronal tissues. It is important in cell proliferation and survival. Transgenic mice expressing TLK display retarded growth and high mortality rate. In addition, a polymorphism in mouse and human LTK is implicated in the pathogenesis of systemic lupus erythematosus.
pfam Pkinase_Tyr 268aa 1e-104
in ref transcript
Protein tyrosine kinase.
COG SPS1 269aa 1e-20
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
LY6G5C
refseq_LY6G5C.F2 refseq_LY6G5C.R2 123 217
NCBIGene 36.2 80741
Alternative 5-prime, size difference: 94
Inclusion in the protein causing a frameshift
Reference transcript: NM_025262
LYCAT
refseq_LYCAT.F2 refseq_LYCAT.R2 208 400
NCBIGene 36.3 253558
Exon skipping and alternative 3-prime or 5-prime, size difference: 192
Exclusion in 5'UTR, Exclusion of the protein initiation site
Reference transcript: NM_182551
smart PlsC 122aa 1e-13
in ref transcript
Phosphate acyltransferases. Function in phospholipid biosynthesis and have either glycerolphosphate, 1-acylglycerolphosphate, or 2-acylglycerolphosphoethanolamine acyltransferase activities. Tafazzin, the product of the gene mutated in patients with Barth syndrome, is a member of this family.
Changed!
cd STKc_OSR1_SPAK 311aa 2e-57
in ref transcript
Serine/threonine kinases (STKs), oxidative stress response kinase (OSR1) and Ste20-related proline alanine-rich kinase (SPAK) subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The OSR1 and SPAK subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. SPAK is also referred to as STK39 or PASK (proline-alanine-rich STE20-related kinase). OSR1 and SPAK regulate the activity of cation-chloride cotransporters through direct interaction and phosphorylation. They are also implicated in cytoskeletal rearrangement, cell differentiation, transformation and proliferation. OSR1 and SPAK contain a conserved C-terminal (CCT) domain, which recognizes a unique motif ([RK]FX[VI]) present in their activating kinases (WNK1/WNK4) and their substrates.
Changed!
smart S_TKc 296aa 2e-41
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
Changed!
COG SPS1 323aa 4e-19
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
LYST
refseq_LYST.F1 refseq_LYST.R1 125 285
NCBIGene 36.2 1130
Alternative 5-prime, size difference: 160
Exclusion in the protein causing a frameshift
Reference transcript: NM_000081
Changed!
cd Beach 291aa 1e-109
in ref transcript
BEACH (Beige and Chediak-Higashi) domains, implicated in membrane trafficking, are present in a family of proteins conserved throughout eukaryotes. This group contains human lysosomal trafficking regulator (LYST), LPS-responsive and beige-like anchor (LRBA) and neurobeachin. Disruption of LYST leads to Chediak-Higashi syndrome, characterized by severe immunodeficiency, albinism, poor blood coagulation and neurologic problems. Neurobeachin is a candidate gene linked to autism. LBRA seems to be upregulated in several cancer types. It has been shown that the BEACH domain itself is important for the function of these proteins.
Changed!
cd WD40 302aa 2e-22
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Changed!
pfam Beach 291aa 1e-114
in ref transcript
Beige/BEACH domain.
Changed!
COG COG2319 253aa 7e-11
in ref transcript
FOG: WD40 repeat [General function prediction only].
M-RIP
refseq_M-RIP.F1 refseq_M-RIP.R1 114 177
NCBIGene 36.3 23164
Single exon skipping, size difference: 63
Inclusion in the protein causing a new stop codon
Reference transcript: NM_015134
cd PH_outspread 104aa 5e-48
in ref transcript
Outspread Pleckstrin homology (PH) domain. Outspread contains two PH domains and a C-terminal coiled-coil region. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinsases, regulators of G-proteins, endocytotic GTPAses, adaptors, a well as cytoskeletal associated molecules and in lipid associated enzymes.
cd PH 92aa 1e-09
in ref transcript
Pleckstrin homology (PH) domain. PH domains are only found in eukaryotes. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.
smart PH 94aa 7e-13
in ref transcript
Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
pfam PH 102aa 1e-06
in ref transcript
PH domain. PH stands for pleckstrin homology.
MACF1
refseq_MACF1.F1 refseq_MACF1.R1 102 120
NCBIGene 36.3 23499
Single exon skipping, size difference: 18
Exclusion in the protein (no frameshift)
Reference transcript: NM_033044
cd SPEC 210aa 1e-20
in ref transcript
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here.
cd SPEC 215aa 3e-19
in ref transcript
cd SPEC 217aa 5e-17
in ref transcript
cd SPEC 218aa 1e-16
in ref transcript
cd SPEC 214aa 1e-15
in ref transcript
cd SPEC 214aa 1e-12
in ref transcript
cd SPEC 213aa 8e-12
in ref transcript
cd SPEC 216aa 1e-11
in ref transcript
cd SPEC 217aa 4e-11
in ref transcript
cd SPEC 229aa 2e-10
in ref transcript
cd SPEC 222aa 6e-10
in ref transcript
cd SPEC 245aa 2e-09
in ref transcript
cd SPEC 218aa 6e-09
in ref transcript
cd SPEC 248aa 7e-08
in ref transcript
cd SPEC 216aa 9e-08
in ref transcript
cd SPEC 213aa 1e-07
in ref transcript
cd SPEC 216aa 1e-06
in ref transcript
cd EFh 63aa 1e-06
in ref transcript
EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
cd SPEC 214aa 4e-06
in ref transcript
cd SPEC 238aa 3e-05
in ref transcript
cd SPEC 198aa 1e-04
in ref transcript
cd SPEC 214aa 0.005
in ref transcript
Changed!
smart GAS2 79aa 5e-32
in ref transcript
Growth-Arrest-Specific Protein 2 Domain. GROWTH-ARREST-SPECIFIC PROTEIN 2 Domain.
pfam SMC_N 882aa 4e-13
in ref transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
smart SPEC 98aa 8e-11
in ref transcript
Spectrin repeats.
smart SPEC 101aa 2e-10
in ref transcript
smart SPEC 103aa 8e-10
in ref transcript
pfam Plectin 42aa 7e-09
in ref transcript
Plectin repeat. This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
pfam Plectin 45aa 9e-08
in ref transcript
smart SPEC 102aa 1e-07
in ref transcript
smart SPEC 103aa 2e-07
in ref transcript
pfam Plectin 45aa 2e-07
in ref transcript
smart SPEC 102aa 3e-07
in ref transcript
pfam Plectin 45aa 3e-07
in ref transcript
TIGR SMC_prok_B 802aa 5e-07
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
pfam Plectin 43aa 2e-06
in ref transcript
pfam Plectin 45aa 3e-06
in ref transcript
pfam Plectin 43aa 4e-06
in ref transcript
pfam Plectin 45aa 6e-06
in ref transcript
smart SPEC 101aa 7e-06
in ref transcript
smart SPEC 103aa 8e-06
in ref transcript
TIGR SMC_prok_B 408aa 3e-05
in ref transcript
smart SPEC 102aa 6e-05
in ref transcript
pfam Spectrin 109aa 3e-04
in ref transcript
Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectrin repeat forms a three helix bundle. The second helix is interrupted by proline in some sequences. The repeats are defined by a characteristic tryptophan (W) residue at position 17 in helix A and a leucine (L) at 2 residues from the carboxyl end of helix C.
TIGR SMC_prok_B 308aa 4e-04
in ref transcript
smart SPEC 101aa 0.006
in ref transcript
smart SPEC 111aa 0.006
in ref transcript
smart SPEC 119aa 0.009
in ref transcript
COG Smc 839aa 3e-19
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
COG Smc 828aa 7e-09
in ref transcript
COG FRQ1 56aa 0.001
in ref transcript
Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].
PRK mukB 199aa 0.002
in ref transcript
cell division protein MukB; Provisional.
Changed!
COG SbcC 571aa 0.010
in ref transcript
ATPase involved in DNA repair [DNA replication, recombination, and repair].
Changed!
smart GAS2 73aa 5e-34
in modified transcript
MAD1L1
refseq_MAD1L1.F2 refseq_MAD1L1.R2 162 202
NCBIGene 36.3 8379
Alternative 3-prime, size difference: 40
Inclusion in 5'UTR
Reference transcript: NM_001013837
pfam MAD 705aa 1e-154
in ref transcript
Mitotic checkpoint protein. This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.
COG Smc 213aa 0.006
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
MADCAM1
refseq_MADCAM1.F1 refseq_MADCAM1.R1 115 376
NCBIGene 36.3 8174
Single exon skipping, size difference: 261
Exclusion in the protein (no frameshift)
Reference transcript: NM_130760
Changed!
pfam Adhes-Ig_like 112aa 5e-40
in ref transcript
Adhesion molecule, immunoglobulin-like. Members of this family are found in a set of mucosal cellular adhesion proteins and adopt an immunoglobulin-like beta-sandwich structure, with seven strands arranged in two beta-sheets in a Greek-key topology. They are essential for recruitment of lymphocytes to specific tissues.
Changed!
pfam Adhes-Ig_like 110aa 2e-39
in modified transcript
MADD
refseq_MADD.F10 refseq_MADD.R10 118 178
NCBIGene 36.3 8567
Single exon skipping, size difference: 60
Exclusion in the protein (no frameshift)
Reference transcript: NM_003682
pfam DENN 230aa 3e-62
in ref transcript
DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways.
pfam uDENN 91aa 2e-20
in ref transcript
uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain.
pfam dDENN 74aa 2e-11
in ref transcript
dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be similar to members of the Rho/Rac/Cdc42 GEF family.
MADD
refseq_MADD.F2 refseq_MADD.R2 181 310
NCBIGene 36.3 8567
Alternative 5-prime, size difference: 129
Exclusion in the protein (no frameshift)
Reference transcript: NM_003682
pfam DENN 230aa 3e-62
in ref transcript
DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways.
pfam uDENN 91aa 2e-20
in ref transcript
uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain.
pfam dDENN 74aa 2e-11
in ref transcript
dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be similar to members of the Rho/Rac/Cdc42 GEF family.
MADD
refseq_MADD.F3 refseq_MADD.R3 116 186
NCBIGene 36.3 8567
Single exon skipping, size difference: 70
Exclusion in the protein causing a frameshift
Reference transcript: NM_003682
pfam DENN 230aa 3e-62
in ref transcript
DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways.
pfam uDENN 91aa 2e-20
in ref transcript
uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain.
pfam dDENN 74aa 2e-11
in ref transcript
dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be similar to members of the Rho/Rac/Cdc42 GEF family.
MADD
refseq_MADD.F5 refseq_MADD.R5 152 215
NCBIGene 36.3 8567
Single exon skipping, size difference: 63
Exclusion in the protein (no frameshift)
Reference transcript: NM_003682
pfam DENN 230aa 3e-62
in ref transcript
DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways.
pfam uDENN 91aa 2e-20
in ref transcript
uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain.
pfam dDENN 74aa 2e-11
in ref transcript
dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be similar to members of the Rho/Rac/Cdc42 GEF family.
MADD
refseq_MADD.F8 refseq_MADD.R8 175 229
NCBIGene 36.3 8567
Single exon skipping, size difference: 54
Exclusion in the protein (no frameshift)
Reference transcript: NM_003682
pfam DENN 230aa 3e-62
in ref transcript
DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways.
pfam uDENN 91aa 2e-20
in ref transcript
uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain.
pfam dDENN 74aa 2e-11
in ref transcript
dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be similar to members of the Rho/Rac/Cdc42 GEF family.
Changed!
pfam HPC2 208aa 0.007
in modified transcript
Histone promoter control 2 (HPC2). HPC2 is required for cell-cycle regulation of histone transcription. It regulates transcription of the histone genes during the S-phase of the cell cycle by repressing transcription at other cell cycle stages. HPC2 mutants display synthetic interactions with FACT complex which allows RNA Pol II to elongate through nucleosomes.
C-terminal to LisH motif. Alpha-helical motif of unknown function.
Changed!
smart LisH 34aa 2e-04
in ref transcript
Lissencephaly type-1-like homology motif. Alpha-helical motif present in Lis1, treacle, Nopp140, some katanin p60 subunits, muskelin, tonneau, LEUNIG and numerous WD40 repeat-containing proteins. It is suggested that LisH motifs contribute to the regulation of microtubule dynamics, either by mediating dimerisation, or else by binding cytoplasmic dynein heavy chain or microtubules directly.
Changed!
smart CTLH 51aa 8e-06
in modified transcript
MAG
refseq_MAG.F1 refseq_MAG.R1 187 232
NCBIGene 36.3 4099
Single exon skipping, size difference: 45
Inclusion in the protein causing a new stop codon
Reference transcript: NM_002361
cd IGcam 81aa 2e-07
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IGcam 80aa 1e-04
in ref transcript
smart IG_like 78aa 3e-09
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
pfam I-set 83aa 5e-06
in ref transcript
Immunoglobulin I-set domain.
pfam C2-set_2 82aa 3e-05
in ref transcript
CD80-like C2-set immunoglobulin domain. These domains belong to the immunoglobulin superfamily.
MAGEA10
refseq_MAGEA10.F1 refseq_MAGEA10.R1 293 384
NCBIGene 36.3 4109
Single exon skipping, size difference: 91
Exclusion in 5'UTR
Reference transcript: NM_001011543
pfam MAGE 171aa 6e-79
in ref transcript
MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.
MAGEA2
refseq_MAGEA2.F1 refseq_MAGEA2.R1 110 192
NCBIGene 36.3 4101
Single exon skipping, size difference: 82
Exclusion in 5'UTR
Reference transcript: NM_175742
pfam MAGE 171aa 1e-64
in ref transcript
MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.
MAGEA2
refseq_MAGEA2.F3 refseq_MAGEA2.R3 148 246
NCBIGene 36.3 4101
Single exon skipping, size difference: 98
Exclusion in 5'UTR
Reference transcript: NM_175743
pfam MAGE 171aa 1e-64
in ref transcript
MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.
MAGEB1
refseq_MAGEB1.F1 refseq_MAGEB1.R1 114 197
NCBIGene 36.3 4112
Single exon skipping, size difference: 83
Exclusion in 5'UTR
Reference transcript: NM_002363
pfam MAGE 171aa 4e-78
in ref transcript
MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.
MAGED1
refseq_MAGED1.F2 refseq_MAGED1.R2 111 279
NCBIGene 36.3 9500
Single exon skipping, size difference: 168
Exclusion in the protein (no frameshift)
Reference transcript: NM_001005333
pfam MAGE 170aa 1e-70
in ref transcript
MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.
MAGED4B
refseq_MAGED4.F1 refseq_MAGED4.R1 131 161
NCBIGene 36.3 81557
Alternative 3-prime, size difference: 30
Inclusion in 5'UTR
Reference transcript: NM_177535
pfam MAGE 170aa 8e-75
in ref transcript
MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.
PRK PRK07003 168aa 0.002
in ref transcript
DNA polymerase III subunits gamma and tau; Validated.
MAGI1
refseq_MAGI1.F1 refseq_MAGI1.R1 194 283
NCBIGene 36.3 9223
Single exon skipping, size difference: 89
Inclusion in the protein causing a frameshift
Reference transcript: NM_001033057
cd PDZ_signaling 95aa 7e-17
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
cd PDZ_signaling 78aa 8e-16
in ref transcript
cd PDZ_signaling 76aa 3e-13
in ref transcript
cd GMPK 70aa 4e-12
in ref transcript
Guanosine monophosphate kinase (GMPK, EC 2.7.4.8), also known as guanylate kinase (GKase), catalyzes the reversible phosphoryl transfer from adenosine triphosphate (ATP) to guanosine monophosphate (GMP) to yield adenosine diphosphate (ADP) and guanosine diphosphate (GDP). It plays an essential role in the biosynthesis of guanosine triphosphate (GTP). This enzyme is also important for the activation of some antiviral and anticancer agents, such as acyclovir, ganciclovir, carbovir, and thiopurines.
cd PDZ_signaling 68aa 1e-10
in ref transcript
cd PDZ_signaling 80aa 6e-10
in ref transcript
cd WW 30aa 3e-06
in ref transcript
Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other peptide recognition proteins such as antibodies and receptors; WW domains have a single groove formed by a conserved Trp and Tyr which recognizes a pair of residues of the sequence X-Pro; variable loops and neighboring domains confer specificity in this domain; there are five distinct groups based on binding: 1) PPXY motifs 2) the PPLP motif; 3) PGM motifs; 4) PSP or PTP motifs; 5) PR motifs.
cd WW 30aa 0.001
in ref transcript
smart GuKc 183aa 3e-31
in ref transcript
Guanylate kinase homologues. Active enzymes catalyze ATP-dependent phosphorylation of GMP to GDP. Structure resembles that of adenylate kinase. So-called membrane-associated guanylate kinase homologues (MAGUKs) do not possess guanylate kinase activities; instead at least some possess protein-binding functions.
smart PDZ 83aa 1e-18
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
smart PDZ 99aa 3e-17
in ref transcript
smart PDZ 87aa 3e-14
in ref transcript
smart PDZ 87aa 4e-14
in ref transcript
smart PDZ 84aa 8e-14
in ref transcript
TIGR degP_htrA_DO 235aa 6e-11
in ref transcript
This family consists of a set proteins various designated DegP, heat shock protein HtrA, and protease DO. The ortholog in Pseudomonas aeruginosa is designated MucD and is found in an operon that controls mucoid phenotype. This family also includes the DegQ (HhoA) paralog in E. coli which can rescue a DegP mutant, but not the smaller DegS paralog, which cannot. Members of this family are located in the periplasm and have separable functions as both protease and chaperone. Members have a trypsin domain and two copies of a PDZ domain. This protein protects bacteria from thermal and other stresses and may be important for the survival of bacterial pathogens.// The chaperone function is dominant at low temperatures, whereas the proteolytic activity is turned on at elevated temperatures.
pfam WW 30aa 3e-07
in ref transcript
WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro.
pfam WW 30aa 1e-05
in ref transcript
COG Gmk 62aa 5e-14
in ref transcript
Guanylate kinase [Nucleotide transport and metabolism].
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_001033057
cd PDZ_signaling 95aa 7e-17
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
cd PDZ_signaling 78aa 8e-16
in ref transcript
cd PDZ_signaling 76aa 3e-13
in ref transcript
cd GMPK 70aa 4e-12
in ref transcript
Guanosine monophosphate kinase (GMPK, EC 2.7.4.8), also known as guanylate kinase (GKase), catalyzes the reversible phosphoryl transfer from adenosine triphosphate (ATP) to guanosine monophosphate (GMP) to yield adenosine diphosphate (ADP) and guanosine diphosphate (GDP). It plays an essential role in the biosynthesis of guanosine triphosphate (GTP). This enzyme is also important for the activation of some antiviral and anticancer agents, such as acyclovir, ganciclovir, carbovir, and thiopurines.
cd PDZ_signaling 68aa 1e-10
in ref transcript
cd PDZ_signaling 80aa 6e-10
in ref transcript
cd WW 30aa 3e-06
in ref transcript
Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other peptide recognition proteins such as antibodies and receptors; WW domains have a single groove formed by a conserved Trp and Tyr which recognizes a pair of residues of the sequence X-Pro; variable loops and neighboring domains confer specificity in this domain; there are five distinct groups based on binding: 1) PPXY motifs 2) the PPLP motif; 3) PGM motifs; 4) PSP or PTP motifs; 5) PR motifs.
cd WW 30aa 0.001
in ref transcript
smart GuKc 183aa 3e-31
in ref transcript
Guanylate kinase homologues. Active enzymes catalyze ATP-dependent phosphorylation of GMP to GDP. Structure resembles that of adenylate kinase. So-called membrane-associated guanylate kinase homologues (MAGUKs) do not possess guanylate kinase activities; instead at least some possess protein-binding functions.
smart PDZ 83aa 1e-18
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
smart PDZ 99aa 3e-17
in ref transcript
smart PDZ 87aa 3e-14
in ref transcript
smart PDZ 87aa 4e-14
in ref transcript
smart PDZ 84aa 8e-14
in ref transcript
TIGR degP_htrA_DO 235aa 6e-11
in ref transcript
This family consists of a set proteins various designated DegP, heat shock protein HtrA, and protease DO. The ortholog in Pseudomonas aeruginosa is designated MucD and is found in an operon that controls mucoid phenotype. This family also includes the DegQ (HhoA) paralog in E. coli which can rescue a DegP mutant, but not the smaller DegS paralog, which cannot. Members of this family are located in the periplasm and have separable functions as both protease and chaperone. Members have a trypsin domain and two copies of a PDZ domain. This protein protects bacteria from thermal and other stresses and may be important for the survival of bacterial pathogens.// The chaperone function is dominant at low temperatures, whereas the proteolytic activity is turned on at elevated temperatures.
pfam WW 30aa 3e-07
in ref transcript
WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro.
pfam WW 30aa 1e-05
in ref transcript
COG Gmk 62aa 5e-14
in ref transcript
Guanylate kinase [Nucleotide transport and metabolism].
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
cd GMPK 101aa 2e-15
in ref transcript
Guanosine monophosphate kinase (GMPK, EC 2.7.4.8), also known as guanylate kinase (GKase), catalyzes the reversible phosphoryl transfer from adenosine triphosphate (ATP) to guanosine monophosphate (GMP) to yield adenosine diphosphate (ADP) and guanosine diphosphate (GDP). It plays an essential role in the biosynthesis of guanosine triphosphate (GTP). This enzyme is also important for the activation of some antiviral and anticancer agents, such as acyclovir, ganciclovir, carbovir, and thiopurines.
cd PDZ_signaling 87aa 3e-15
in ref transcript
cd PDZ_signaling 78aa 4e-11
in ref transcript
cd PDZ_signaling 65aa 2e-10
in ref transcript
cd WW 30aa 3e-06
in ref transcript
Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other peptide recognition proteins such as antibodies and receptors; WW domains have a single groove formed by a conserved Trp and Tyr which recognizes a pair of residues of the sequence X-Pro; variable loops and neighboring domains confer specificity in this domain; there are five distinct groups based on binding: 1) PPXY motifs 2) the PPLP motif; 3) PGM motifs; 4) PSP or PTP motifs; 5) PR motifs.
cd PDZ_signaling 68aa 3e-04
in ref transcript
cd PDZ_signaling 35aa 0.002
in ref transcript
smart GuKc 86aa 2e-20
in ref transcript
Guanylate kinase homologues. Active enzymes catalyze ATP-dependent phosphorylation of GMP to GDP. Structure resembles that of adenylate kinase. So-called membrane-associated guanylate kinase homologues (MAGUKs) do not possess guanylate kinase activities; instead at least some possess protein-binding functions.
smart PDZ 81aa 4e-18
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
smart PDZ 88aa 9e-15
in ref transcript
smart PDZ 81aa 3e-13
in ref transcript
smart PDZ 84aa 5e-13
in ref transcript
smart PDZ 82aa 1e-07
in ref transcript
pfam WW 30aa 4e-07
in ref transcript
WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro.
Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
Changed!
cd WW 31aa 2e-04
in modified transcript
Changed!
pfam WW 30aa 9e-06
in modified transcript
Changed!
COG HUL4 76aa 2e-04
in modified transcript
MAL
refseq_MAL.F2 refseq_MAL.R2 191 317
NCBIGene 36.3 4118
Single exon skipping, size difference: 126
Exclusion in the protein (no frameshift)
Reference transcript: NM_002371
Changed!
pfam MARVEL 115aa 5e-14
in ref transcript
Membrane-associating domain. MARVEL domain-containing proteins are often found in lipid-associating proteins - such as Occludin and MAL family proteins. It may be part of the machinery of membrane apposition events, such as transport vesicle biogenesis.
Changed!
pfam MARVEL 75aa 0.002
in modified transcript
MAL
refseq_MAL.F3 refseq_MAL.R3 111 279
NCBIGene 36.3 4118
Single exon skipping, size difference: 168
Exclusion in the protein (no frameshift)
Reference transcript: NM_002371
Changed!
pfam MARVEL 115aa 5e-14
in ref transcript
Membrane-associating domain. MARVEL domain-containing proteins are often found in lipid-associating proteins - such as Occludin and MAL family proteins. It may be part of the machinery of membrane apposition events, such as transport vesicle biogenesis.
Changed!
pfam MARVEL 65aa 4e-06
in modified transcript
MALT1
refseq_MALT1.F1 refseq_MALT1.R1 101 134
NCBIGene 36.3 10892
Single exon skipping, size difference: 33
Exclusion in the protein (no frameshift)
Reference transcript: NM_006785
cd IGcam 67aa 4e-07
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IGcam 65aa 5e-05
in ref transcript
pfam Peptidase_C14 218aa 3e-21
in ref transcript
Caspase domain.
smart IG_like 70aa 1e-07
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
smart IG_like 76aa 2e-06
in ref transcript
COG COG4249 234aa 8e-07
in ref transcript
Uncharacterized protein containing caspase domain [General function prediction only].
MANBAL
refseq_MANBAL.F1 refseq_MANBAL.R1 232 349
NCBIGene 36.3 63905
Single exon skipping, size difference: 117
Exclusion in 5'UTR
Reference transcript: NM_022077
MANEAL
refseq_MANEAL.F1 refseq_MANEAL.R1 156 356
NCBIGene 36.3 149175
Exon skipping and alternative 3-prime or 5-prime, size difference: 200
Inclusion in the protein causing a new stop codon, Exclusion in the protein causing a frameshift
Reference transcript: NM_001031740
MAP2
refseq_MAP2.F1 refseq_MAP2.R1 119 290
NCBIGene 36.3 4133
Single exon skipping, size difference: 171
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_002374
pfam MAP2_projctn 1129aa 0.0
in ref transcript
MAP2/Tau projection domain. This domain is found in the MAP2/Tau family of proteins which includes MAP2, MAP4, Tau, and their homologs. All isoforms contain a conserved C-terminal domain containing tubulin-binding repeats (pfam00418), and a N-terminal projection domain of varying size. This domain has a net negative charge and exerts a long-range repulsive force. This provides a mechanism that can regulate microtubule spacing which might facilitate efficient organelle transport.
pfam Tubulin-binding 31aa 5e-09
in ref transcript
Tau and MAP protein, tubulin-binding repeat. This family includes the vertebrate proteins MAP2, MAP4 and Tau, as well as other animal homologs. MAP4 is present in many tissues but is usually absent from neurons; MAP2 and Tau are mainly neuronal. Members of this family have the ability to bind to and stabilise microtubules. As a result, they are involved in neuronal migration, supporting dendrite elongation, and regulating microtubules during mitotic metaphase. Note that Tau is involved in neurofibrillary tangle formation in Alzheimer's disease and some other dementias. This family features a C-terminal microtubule binding repeat that contains a conserved KXGS motif.
pfam Tubulin-binding 32aa 1e-06
in ref transcript
pfam Tubulin-binding 32aa 6e-06
in ref transcript
MAP2
refseq_MAP2.F3 refseq_MAP2.R3 300 393
NCBIGene 36.3 4133
Single exon skipping, size difference: 93
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_002374
pfam MAP2_projctn 1129aa 0.0
in ref transcript
MAP2/Tau projection domain. This domain is found in the MAP2/Tau family of proteins which includes MAP2, MAP4, Tau, and their homologs. All isoforms contain a conserved C-terminal domain containing tubulin-binding repeats (pfam00418), and a N-terminal projection domain of varying size. This domain has a net negative charge and exerts a long-range repulsive force. This provides a mechanism that can regulate microtubule spacing which might facilitate efficient organelle transport.
pfam Tubulin-binding 31aa 5e-09
in ref transcript
Tau and MAP protein, tubulin-binding repeat. This family includes the vertebrate proteins MAP2, MAP4 and Tau, as well as other animal homologs. MAP4 is present in many tissues but is usually absent from neurons; MAP2 and Tau are mainly neuronal. Members of this family have the ability to bind to and stabilise microtubules. As a result, they are involved in neuronal migration, supporting dendrite elongation, and regulating microtubules during mitotic metaphase. Note that Tau is involved in neurofibrillary tangle formation in Alzheimer's disease and some other dementias. This family features a C-terminal microtubule binding repeat that contains a conserved KXGS motif.
pfam Tubulin-binding 32aa 1e-06
in ref transcript
pfam Tubulin-binding 32aa 6e-06
in ref transcript
Changed!
pfam Tubulin-binding 31aa 6e-09
in modified transcript
MAP3K3
refseq_MAP3K3.F2 refseq_MAP3K3.R2 133 226
NCBIGene 36.3 4215
Single exon skipping, size difference: 93
Exclusion in the protein (no frameshift)
Reference transcript: NM_203351
cd STKc_MEKK3 266aa 1e-158
in ref transcript
Serine/threonine kinases (STKs), MAP/ERK kinase kinase 3 (MEKK3) subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The MEKK3 subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. MEKK3 is a mitogen-activated protein kinase (MAPK) kinase kinase (MAPKKK or MKKK or MAP3K), that phosphorylates and activates the MAPK kinase MEK5 (or MKK5), which in turn phosphorylates and activates extracellular signal-regulated kinase 5 (ERK5). The ERK5 cascade plays roles in promoting cell proliferation, differentiation, neuronal survival, and neuroprotection. MEKK3 plays an essential role in embryonic angiogenesis and early heart development. In addition, MEKK3 is involved in interleukin-1 receptor and Toll-like receptor 4 signaling. It is also a specific regulator of the proinflammatory cytokines IL-6 and GM-CSF in some immune cells. MEKK3 also regulates calcineurin, which plays a critical role in T cell activation, apoptosis, skeletal myocyte differentiation, and cardiac hypertrophy.
cd PB1_Mekk2_3 79aa 1e-34
in ref transcript
The PB1 domain is present in the two mitogen-activated protein kinase kinases MEKK2 and MEKK3 which are two members of the signaling kinase cascade involved in angiogenesis and early cardiovascular development. The PB1 domain of MEKK2 (and/or MEKK3) interacts with the PB1 domain of another member of the kinase cascade Map2k5. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domains, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. Interactions of PB1 domains with other protein domains have been described as noncanonical PB1-interactions. The PB1 domain module is conserved in amoebas, fungi, animals, and plants. The MEKK2 and MEKK3 proteins contain a type II PB1 domain.
smart S_TKc 250aa 3e-79
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
smart PB1 77aa 6e-12
in ref transcript
PB1 domain. Phox and Bem1p domain, present in many eukaryotic cytoplasmic signalling proteins. The domain adopts a beta-grasp fold, similar to that found in ubiquitin and Ras-binding domains. A motif, variously termed OPR, PC and AID, represents the most conserved region of the majority of PB1 domains, and is necessary for PB1 domain function. This function is the formation of PB1 domain heterodimers, although not all PB1 domain pairs associate.
COG SPS1 263aa 4e-30
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
MAP3K4
refseq_MAP3K4.F1 refseq_MAP3K4.R1 139 289
NCBIGene 36.3 4216
Single exon skipping, size difference: 150
Exclusion in the protein (no frameshift)
Reference transcript: NM_005922
cd STKc_MEKK4 260aa 1e-134
in ref transcript
Serine/threonine kinases (STKs), MAP/ERK kinase kinase 4 (MEKK4) subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The MEKK4 subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. MEKK4 is a mitogen-activated protein kinase (MAPK) kinase kinase (MAPKKK or MKKK or MAP3K), that phosphorylates and activates MAPK kinases (MAPKKs or MKKs or MAP2Ks), which in turn phosphorylate and activate MAPKs during signaling cascades that are important in mediating cellular responses to extracellular signals. MEKK4 activates the c-Jun N-terminal kinase (JNK) and p38 MAPK signaling pathways by directly activating their respective MAPKKs, MKK4/MKK7 and MKK3/MKK6. JNK and p38 are collectively known as stress-activated MAPKs, as they are activated in response to a variety of environmental stresses and pro-inflammatory cytokines. MEKK4 also plays roles in the re-polarization of the actin cytoskeleton in response to osmotic stress, in the proper closure of the neural tube, in cardiovascular development, and in immune responses.
smart S_TKc 248aa 7e-73
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
COG SPS1 267aa 3e-35
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
MAP3K7
refseq_MAP3K7.F1 refseq_MAP3K7.R1 207 323
NCBIGene 36.3 6885
Single exon skipping, size difference: 116
Exclusion in the protein causing a frameshift
Reference transcript: NM_145331
cd S_TKc 240aa 4e-61
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
Mnd1 family. This family of proteins includes MND1 from Saccharomyces cerevisiae. The mnd1 protein forms a complex with hop2 to promote homologous chromosome pairing and meiotic double-strand break repair.
COG SPS1 318aa 1e-26
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
MAP3K7
refseq_MAP3K7.F3 refseq_MAP3K7.R3 152 233
NCBIGene 36.3 6885
Single exon skipping, size difference: 81
Exclusion in the protein (no frameshift)
Reference transcript: NM_145331
cd S_TKc 240aa 4e-61
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
Mnd1 family. This family of proteins includes MND1 from Saccharomyces cerevisiae. The mnd1 protein forms a complex with hop2 to promote homologous chromosome pairing and meiotic double-strand break repair.
COG SPS1 318aa 1e-26
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
MAP4K4
refseq_MAP4K4.F1 refseq_MAP4K4.R1 146 377
NCBIGene 36.3 9448
Single exon skipping, size difference: 231
Exclusion in the protein (no frameshift)
Reference transcript: NM_145686
cd STKc_MAP4K4_6 282aa 1e-149
in ref transcript
Serine/threonine kinases (STKs), mitogen-activated protein kinase (MAPK) kinase kinase kinase 4 (MAPKKKK4 or MAP4K4) and MAPKKKK6 (or MAP4K6) subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The MAP4K4/MAP4K6 subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. Members of this subfamily contain an N-terminal catalytic domain and a C-terminal citron homology (CNH) regulatory domain. MAP4Ks (or MAPKKKKs) are involved in MAPK signaling pathways that are important in mediating cellular responses to extracellular signals by activating a MAPK kinase kinase (MAPKKK or MAP3K or MKKK). Each MAPK cascade is activated either by a small GTP-binding protein or by an adaptor protein, which transmits the signal either directly to a MAP3K to start the triple kinase core cascade or indirectly through a mediator kinase, a MAP4K. MAP4K4 is also called Nck Interacting kinase (NIK). It facilitates the activation of the MAPKs, extracellular signal-regulated kinase (ERK) 1, ERK2, and c-Jun N-terminal kinase (JNK), by phosphorylating and activating MEKK1. MAP4K4 plays a role in tumor necrosis factor (TNF) alpha-induced insulin resistance. MAP4K4 silencing in skeletal muscle cells from type II diabetic patients restores insulin-mediated glucose uptake. MAP4K4, through JNK, also plays a broad role in cell motility, which impacts inflammation, homeostasis, as well as the invasion and spread of cancer. MAP4K4 is found to be highly expressed in most tumor cell lines relative to normal tissue. MAP4K6 (also called MINK for Misshapen/NIKs-related kinase) is activated after Ras induction and mediates activation of p38 MAPK. MAP4K6 plays a role in cell cycle arrest, cytoskeleton organization, cell adhesion, and cell motility.
smart CNH 299aa 2e-87
in ref transcript
Domain found in NIK1-like kinases, mouse citron and yeast ROM1, ROM2. Unpublished observations.
smart S_TKc 255aa 2e-61
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
COG SPS1 293aa 5e-27
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
Serine/threonine kinases (STKs), mitogen-activated protein kinase (MAPK) kinase kinase kinase 4 (MAPKKKK4 or MAP4K4) and MAPKKKK6 (or MAP4K6) subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The MAP4K4/MAP4K6 subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. Members of this subfamily contain an N-terminal catalytic domain and a C-terminal citron homology (CNH) regulatory domain. MAP4Ks (or MAPKKKKs) are involved in MAPK signaling pathways that are important in mediating cellular responses to extracellular signals by activating a MAPK kinase kinase (MAPKKK or MAP3K or MKKK). Each MAPK cascade is activated either by a small GTP-binding protein or by an adaptor protein, which transmits the signal either directly to a MAP3K to start the triple kinase core cascade or indirectly through a mediator kinase, a MAP4K. MAP4K4 is also called Nck Interacting kinase (NIK). It facilitates the activation of the MAPKs, extracellular signal-regulated kinase (ERK) 1, ERK2, and c-Jun N-terminal kinase (JNK), by phosphorylating and activating MEKK1. MAP4K4 plays a role in tumor necrosis factor (TNF) alpha-induced insulin resistance. MAP4K4 silencing in skeletal muscle cells from type II diabetic patients restores insulin-mediated glucose uptake. MAP4K4, through JNK, also plays a broad role in cell motility, which impacts inflammation, homeostasis, as well as the invasion and spread of cancer. MAP4K4 is found to be highly expressed in most tumor cell lines relative to normal tissue. MAP4K6 (also called MINK for Misshapen/NIKs-related kinase) is activated after Ras induction and mediates activation of p38 MAPK. MAP4K6 plays a role in cell cycle arrest, cytoskeleton organization, cell adhesion, and cell motility.
smart CNH 299aa 2e-87
in ref transcript
Domain found in NIK1-like kinases, mouse citron and yeast ROM1, ROM2. Unpublished observations.
smart S_TKc 255aa 2e-61
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
COG SPS1 293aa 5e-27
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_145686
cd STKc_MAP4K4_6 282aa 1e-149
in ref transcript
Serine/threonine kinases (STKs), mitogen-activated protein kinase (MAPK) kinase kinase kinase 4 (MAPKKKK4 or MAP4K4) and MAPKKKK6 (or MAP4K6) subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The MAP4K4/MAP4K6 subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. Members of this subfamily contain an N-terminal catalytic domain and a C-terminal citron homology (CNH) regulatory domain. MAP4Ks (or MAPKKKKs) are involved in MAPK signaling pathways that are important in mediating cellular responses to extracellular signals by activating a MAPK kinase kinase (MAPKKK or MAP3K or MKKK). Each MAPK cascade is activated either by a small GTP-binding protein or by an adaptor protein, which transmits the signal either directly to a MAP3K to start the triple kinase core cascade or indirectly through a mediator kinase, a MAP4K. MAP4K4 is also called Nck Interacting kinase (NIK). It facilitates the activation of the MAPKs, extracellular signal-regulated kinase (ERK) 1, ERK2, and c-Jun N-terminal kinase (JNK), by phosphorylating and activating MEKK1. MAP4K4 plays a role in tumor necrosis factor (TNF) alpha-induced insulin resistance. MAP4K4 silencing in skeletal muscle cells from type II diabetic patients restores insulin-mediated glucose uptake. MAP4K4, through JNK, also plays a broad role in cell motility, which impacts inflammation, homeostasis, as well as the invasion and spread of cancer. MAP4K4 is found to be highly expressed in most tumor cell lines relative to normal tissue. MAP4K6 (also called MINK for Misshapen/NIKs-related kinase) is activated after Ras induction and mediates activation of p38 MAPK. MAP4K6 plays a role in cell cycle arrest, cytoskeleton organization, cell adhesion, and cell motility.
Changed!
smart CNH 299aa 2e-87
in ref transcript
Domain found in NIK1-like kinases, mouse citron and yeast ROM1, ROM2. Unpublished observations.
smart S_TKc 255aa 2e-61
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
COG SPS1 293aa 5e-27
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
Changed!
smart CNH 307aa 4e-86
in modified transcript
Changed!
COG ROM1 294aa 2e-06
in modified transcript
MAP4K5
refseq_MAP4K5.F1 refseq_MAP4K5.R1 256 304
NCBIGene 36.3 11183
Alternative 5-prime, size difference: 48
Exclusion in 5'UTR
Reference transcript: NM_198794
cd STKc_MAP4K5 267aa 1e-159
in ref transcript
Serine/threonine kinases (STKs), mitogen-activated protein kinase (MAPK) kinase kinase kinase 5 (MAPKKKK5 or MAP4K5) subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The MAP4K5 subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. Members of this subfamily contain an N-terminal catalytic domain and a C-terminal citron homology (CNH) regulatory domain, similar to MAP4K4/6. MAP4Ks are involved in some MAPK signaling pathways that are important in mediating cellular responses to extracellular signals by activating a MAPK kinase kinase (MAPKKK or MAP3K or MKKK). Each MAPK cascade is activated either by a small GTP-binding protein or by an adaptor protein, which transmits the signal either directly to a MAP3K to start the triple kinase core cascade or indirectly through a mediator kinase, a MAP4K. MAP4K5, also called germinal center kinase-related enzyme (GCKR), has been shown to activate the MAPK c-Jun N-terminal kinase (JNK). MAP4K5 also facilitates Wnt signaling in B cells, and may therefore be implicated in the control of cell fate, proliferation, and polarity.
smart CNH 316aa 2e-81
in ref transcript
Domain found in NIK1-like kinases, mouse citron and yeast ROM1, ROM2. Unpublished observations.
smart S_TKc 248aa 3e-66
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
COG SPS1 349aa 3e-33
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
MAPK10
refseq_MAPK10.F1 refseq_MAPK10.R1 138 197
NCBIGene 36.3 5602
Single exon skipping, size difference: 59
Exclusion in the protein causing a frameshift
Reference transcript: NM_138982
Changed!
cd S_TKc 297aa 1e-66
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
Changed!
smart S_TKc 284aa 1e-67
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
Changed!
PTZ PTZ00024 292aa 1e-34
in ref transcript
cyclin-dependent protein kinase; Provisional.
Changed!
cd S_TKc 62aa 1e-10
in modified transcript
Changed!
smart STYKc 61aa 3e-09
in modified transcript
Protein kinase; unclassified specificity. Phosphotransferases. The specificity of this class of kinases can not be predicted. Possible dual-specificity Ser/Thr/Tyr kinase.
Changed!
COG SPS1 59aa 0.003
in modified transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
MAPK7
refseq_MAPK7.F1 refseq_MAPK7.R1 97 500
NCBIGene 36.3 5598
Multiple exon skipping, size difference: 403
Exclusion of the protein initiation site, Exclusion in the protein causing a frameshift
Reference transcript: NM_139033
Changed!
cd S_TKc 294aa 2e-74
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
Changed!
smart S_TKc 283aa 8e-72
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
Changed!
PTZ PTZ00024 293aa 9e-45
in ref transcript
cyclin-dependent protein kinase; Provisional.
Changed!
cd S_TKc 205aa 2e-51
in modified transcript
Changed!
smart S_TKc 205aa 2e-53
in modified transcript
Changed!
PTZ PTZ00024 214aa 4e-34
in modified transcript
MAPK7
refseq_MAPK7.F4 refseq_MAPK7.R4 157 245
NCBIGene 36.3 5598
Alternative 5-prime, size difference: 88
Exclusion in 5'UTR
Reference transcript: NM_002749
cd S_TKc 294aa 2e-74
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
smart S_TKc 283aa 8e-72
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
PTZ PTZ00024 293aa 9e-45
in ref transcript
cyclin-dependent protein kinase; Provisional.
MAPK8IP3
refseq_MAPK8IP3.F1 refseq_MAPK8IP3.R1 102 120
NCBIGene 36.3 23162
Single exon skipping, size difference: 18
Exclusion in the protein (no frameshift)
Reference transcript: NM_015133
pfam Jnk-SapK_ap_N 158aa 1e-69
in ref transcript
JNK_SAPK-associated protein-1. This is the N-terminal 200 residues of a set of proteins conserved from yeasts to humans. Most of the proteins in this entry have an RhoGEF pfam00621 domain at their C-terminal end.
pfam Trypan_PARP 70aa 2e-04
in ref transcript
Procyclic acidic repetitive protein (PARP). This family consists of several Trypanosoma brucei procyclic acidic repetitive protein (PARP) like sequences. The procyclic acidic repetitive protein (parp) genes of Trypanosoma brucei encode a small family of abundant surface proteins whose expression is restricted to the procyclic form of the parasite. They are found at two unlinked loci, parpA and parpB; transcription of both loci is developmentally regulated.
Changed!
TIGR SMC_prok_B 127aa 3e-04
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
COG SbcC 155aa 1e-05
in ref transcript
ATPase involved in DNA repair [DNA replication, recombination, and repair].
Changed!
COG Smc 158aa 5e-04
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
Changed!
TIGR SMC_prok_B 185aa 7e-05
in modified transcript
Changed!
COG Smc 176aa 5e-05
in modified transcript
MAPKAP1
refseq_MAPKAP1.F1 refseq_MAPKAP1.R1 129 270
NCBIGene 36.3 79109
Single exon skipping, size difference: 141
Exclusion in the protein (no frameshift)
Reference transcript: NM_001006617
Changed!
pfam SIN1 464aa 1e-140
in ref transcript
Stress-activated map kinase interacting protein 1 (SIN1). This family consists of several stress-activated map kinase interacting protein 1 (MAPKAP1 OR SIN1) sequences. The fission yeast Sty1/Spc1 mitogen-activated protein (MAP) kinase is a member of the eukaryotic stress-activated MAP kinase (SAPK) family. Sin1 interacts with Sty1/Spc1. Cells lacking Sin1 display many, but not all, of the phenotypes of cells lacking the Sty1/Spc1 MAP kinase including sterility, multiple stress sensitivity and a cell-cycle delay. Sin1 is phosphorylated after stress but this is not Sty1/Spc1-dependent.
Changed!
pfam SIN1 417aa 1e-107
in modified transcript
MAPKAP1
refseq_MAPKAP1.F3 refseq_MAPKAP1.R3 226 334
NCBIGene 36.3 79109
Single exon skipping, size difference: 108
Exclusion in the protein (no frameshift)
Reference transcript: NM_001006617
Changed!
pfam SIN1 464aa 1e-140
in ref transcript
Stress-activated map kinase interacting protein 1 (SIN1). This family consists of several stress-activated map kinase interacting protein 1 (MAPKAP1 OR SIN1) sequences. The fission yeast Sty1/Spc1 mitogen-activated protein (MAP) kinase is a member of the eukaryotic stress-activated MAP kinase (SAPK) family. Sin1 interacts with Sty1/Spc1. Cells lacking Sin1 display many, but not all, of the phenotypes of cells lacking the Sty1/Spc1 MAP kinase including sterility, multiple stress sensitivity and a cell-cycle delay. Sin1 is phosphorylated after stress but this is not Sty1/Spc1-dependent.
Changed!
pfam SIN1 428aa 1e-131
in modified transcript
MAPKAP1
refseq_MAPKAP1.F5 refseq_MAPKAP1.R5 135 463
NCBIGene 36.3 79109
Single exon skipping, size difference: 328
Exclusion of the protein initiation site
Reference transcript: NM_001006617
Changed!
pfam SIN1 464aa 1e-140
in ref transcript
Stress-activated map kinase interacting protein 1 (SIN1). This family consists of several stress-activated map kinase interacting protein 1 (MAPKAP1 OR SIN1) sequences. The fission yeast Sty1/Spc1 mitogen-activated protein (MAP) kinase is a member of the eukaryotic stress-activated MAP kinase (SAPK) family. Sin1 interacts with Sty1/Spc1. Cells lacking Sin1 display many, but not all, of the phenotypes of cells lacking the Sty1/Spc1 MAP kinase including sterility, multiple stress sensitivity and a cell-cycle delay. Sin1 is phosphorylated after stress but this is not Sty1/Spc1-dependent.
Changed!
pfam SIN1 289aa 4e-87
in modified transcript
MAPT
refseq_MAPT.F2 refseq_MAPT.R2 128 221
NCBIGene 36.3 4137
Single exon skipping, size difference: 93
Exclusion in the protein (no frameshift)
Reference transcript: NM_016835
Changed!
pfam Tubulin-binding 31aa 8e-10
in ref transcript
Tau and MAP protein, tubulin-binding repeat. This family includes the vertebrate proteins MAP2, MAP4 and Tau, as well as other animal homologs. MAP4 is present in many tissues but is usually absent from neurons; MAP2 and Tau are mainly neuronal. Members of this family have the ability to bind to and stabilise microtubules. As a result, they are involved in neuronal migration, supporting dendrite elongation, and regulating microtubules during mitotic metaphase. Note that Tau is involved in neurofibrillary tangle formation in Alzheimer's disease and some other dementias. This family features a C-terminal microtubule binding repeat that contains a conserved KXGS motif.
pfam Tubulin-binding 31aa 2e-08
in ref transcript
pfam Tubulin-binding 32aa 2e-06
in ref transcript
pfam Tubulin-binding 32aa 3e-06
in ref transcript
pfam Hid1 123aa 3e-04
in ref transcript
High-temperature-induced dauer-formation protein. Hid1 (high-temperature-induced dauer-formation protein 1) represents proteins of approximately 800 residues long and is conserved from fungi to humans. It contains up to seven potential transmembrane domains separated by regions of low complexity. Functionally it might be involved in vesicle secretion or be an inter-cellular signalling protein or be a novel insulin receptor.
MAPT
refseq_MAPT.F3 refseq_MAPT.R3 181 379
NCBIGene 36.3 4137
Single exon skipping, size difference: 198
Exclusion in the protein (no frameshift)
Reference transcript: NM_016835
pfam Tubulin-binding 31aa 8e-10
in ref transcript
Tau and MAP protein, tubulin-binding repeat. This family includes the vertebrate proteins MAP2, MAP4 and Tau, as well as other animal homologs. MAP4 is present in many tissues but is usually absent from neurons; MAP2 and Tau are mainly neuronal. Members of this family have the ability to bind to and stabilise microtubules. As a result, they are involved in neuronal migration, supporting dendrite elongation, and regulating microtubules during mitotic metaphase. Note that Tau is involved in neurofibrillary tangle formation in Alzheimer's disease and some other dementias. This family features a C-terminal microtubule binding repeat that contains a conserved KXGS motif.
pfam Tubulin-binding 31aa 2e-08
in ref transcript
pfam Tubulin-binding 32aa 2e-06
in ref transcript
pfam Tubulin-binding 32aa 3e-06
in ref transcript
pfam Hid1 123aa 3e-04
in ref transcript
High-temperature-induced dauer-formation protein. Hid1 (high-temperature-induced dauer-formation protein 1) represents proteins of approximately 800 residues long and is conserved from fungi to humans. It contains up to seven potential transmembrane domains separated by regions of low complexity. Functionally it might be involved in vesicle secretion or be an inter-cellular signalling protein or be a novel insulin receptor.
MAPT
refseq_MAPT.F5 refseq_MAPT.R5 109 283
NCBIGene 36.3 4137
Multiple exon skipping, size difference: 174
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_016835
pfam Tubulin-binding 31aa 8e-10
in ref transcript
Tau and MAP protein, tubulin-binding repeat. This family includes the vertebrate proteins MAP2, MAP4 and Tau, as well as other animal homologs. MAP4 is present in many tissues but is usually absent from neurons; MAP2 and Tau are mainly neuronal. Members of this family have the ability to bind to and stabilise microtubules. As a result, they are involved in neuronal migration, supporting dendrite elongation, and regulating microtubules during mitotic metaphase. Note that Tau is involved in neurofibrillary tangle formation in Alzheimer's disease and some other dementias. This family features a C-terminal microtubule binding repeat that contains a conserved KXGS motif.
pfam Tubulin-binding 31aa 2e-08
in ref transcript
pfam Tubulin-binding 32aa 2e-06
in ref transcript
pfam Tubulin-binding 32aa 3e-06
in ref transcript
Changed!
pfam Hid1 123aa 3e-04
in ref transcript
High-temperature-induced dauer-formation protein. Hid1 (high-temperature-induced dauer-formation protein 1) represents proteins of approximately 800 residues long and is conserved from fungi to humans. It contains up to seven potential transmembrane domains separated by regions of low complexity. Functionally it might be involved in vesicle secretion or be an inter-cellular signalling protein or be a novel insulin receptor.
MARCH2
refseq_MARCH2.F1 refseq_MARCH2.R1 123 394
NCBIGene 36.3 51257
Single exon skipping, size difference: 271
Exclusion in 5'UTR
Reference transcript: NM_016496
smart RINGv 48aa 1e-12
in ref transcript
The RING-variant domain is a C4HC3 zinc-finger like motif found in a number of cellular and viral proteins. Some of these proteins have been shown both in vivo and in vitro to have ubiquitin E3 ligase activity. The RING-variant domain is reminiscent of both the RING and the PHD domains and may represent an evolutionary intermediate. To describe this domain the term PHD/LAP domain has been used in the past. Extended description: The RING-variant (RINGv) domain contains a C4HC3 zinc-finger-like motif similar to the PHD domain, while some of the spacing between the Cys/His residues follow a pattern somewhat closer to that found in the RING domain. The RINGv domain, similar to the RING, PHD and LIM domains, is thought to bind two zinc ions co-ordinated by the highly conserved Cys and His residues. RING variant domain: C-x (2) -C-x(10-45)-C-x (1) -C-x (7) -H-x(2)-C-x(11-25)-C-x(2)-C As opposed to a PHD: C-x(1-2) -C-x (7-13)-C-x(2-4)-C-x(4-5)-H-x(2)-C-x(10-21)-C-x(2)-C Classical RING domain: C-x (2) -C-x (9-39)-C-x(1-3)-H-x(2-3)-C-x(2)-C-x(4-48) -C-x(2)-C.
pfam Vpu 52aa 0.002
in ref transcript
Vpu protein. The Vpu protein contains an N-terminal transmembrane spanning region and a C-terminal cytoplasmic region. The HIV-1 Vpu protein stimulates virus production by enhancing the release of viral particles from infected cells. The VPU protein binds specifically to CD4.
COG SSM4 58aa 3e-12
in ref transcript
Protein involved in mRNA turnover and stability [RNA processing and modification].
MARCH2
refseq_MARCH2.F3 refseq_MARCH2.R3 159 369
NCBIGene 36.3 51257
Single exon skipping, size difference: 210
Exclusion in the protein (no frameshift)
Reference transcript: NM_001005415
smart RINGv 48aa 1e-12
in ref transcript
The RING-variant domain is a C4HC3 zinc-finger like motif found in a number of cellular and viral proteins. Some of these proteins have been shown both in vivo and in vitro to have ubiquitin E3 ligase activity. The RING-variant domain is reminiscent of both the RING and the PHD domains and may represent an evolutionary intermediate. To describe this domain the term PHD/LAP domain has been used in the past. Extended description: The RING-variant (RINGv) domain contains a C4HC3 zinc-finger-like motif similar to the PHD domain, while some of the spacing between the Cys/His residues follow a pattern somewhat closer to that found in the RING domain. The RINGv domain, similar to the RING, PHD and LIM domains, is thought to bind two zinc ions co-ordinated by the highly conserved Cys and His residues. RING variant domain: C-x (2) -C-x(10-45)-C-x (1) -C-x (7) -H-x(2)-C-x(11-25)-C-x(2)-C As opposed to a PHD: C-x(1-2) -C-x (7-13)-C-x(2-4)-C-x(4-5)-H-x(2)-C-x(10-21)-C-x(2)-C Classical RING domain: C-x (2) -C-x (9-39)-C-x(1-3)-H-x(2-3)-C-x(2)-C-x(4-48) -C-x(2)-C.
Changed!
pfam Vpu 52aa 0.002
in ref transcript
Vpu protein. The Vpu protein contains an N-terminal transmembrane spanning region and a C-terminal cytoplasmic region. The HIV-1 Vpu protein stimulates virus production by enhancing the release of viral particles from infected cells. The VPU protein binds specifically to CD4.
COG SSM4 58aa 3e-12
in ref transcript
Protein involved in mRNA turnover and stability [RNA processing and modification].
MARCH8
refseq_MARCH8.F1 refseq_MARCH8.R1 124 217
NCBIGene 36.3 220972
Single exon skipping, size difference: 93
Exclusion in 5'UTR
Reference transcript: NM_001002265
smart RINGv 49aa 1e-15
in ref transcript
The RING-variant domain is a C4HC3 zinc-finger like motif found in a number of cellular and viral proteins. Some of these proteins have been shown both in vivo and in vitro to have ubiquitin E3 ligase activity. The RING-variant domain is reminiscent of both the RING and the PHD domains and may represent an evolutionary intermediate. To describe this domain the term PHD/LAP domain has been used in the past. Extended description: The RING-variant (RINGv) domain contains a C4HC3 zinc-finger-like motif similar to the PHD domain, while some of the spacing between the Cys/His residues follow a pattern somewhat closer to that found in the RING domain. The RINGv domain, similar to the RING, PHD and LIM domains, is thought to bind two zinc ions co-ordinated by the highly conserved Cys and His residues. RING variant domain: C-x (2) -C-x(10-45)-C-x (1) -C-x (7) -H-x(2)-C-x(11-25)-C-x(2)-C As opposed to a PHD: C-x(1-2) -C-x (7-13)-C-x(2-4)-C-x(4-5)-H-x(2)-C-x(10-21)-C-x(2)-C Classical RING domain: C-x (2) -C-x (9-39)-C-x(1-3)-H-x(2-3)-C-x(2)-C-x(4-48) -C-x(2)-C.
COG SSM4 60aa 8e-16
in ref transcript
Protein involved in mRNA turnover and stability [RNA processing and modification].
MARK2
refseq_MARK2.F2 refseq_MARK2.R2 106 133
NCBIGene 36.3 2011
Single exon skipping, size difference: 27
Exclusion in the protein (no frameshift)
Reference transcript: NM_001039469
cd S_TKc 253aa 4e-87
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
cd UBA 38aa 0.001
in ref transcript
Ubiquitin Associated domain. The UBA domain is a commonly occurring sequence motif in some members of the ubiquitination pathway, UV excision repair proteins, and certain protein kinases. Although its specific role is so far unknown, it has been suggested that UBA domains are involved in conferring protein target specificity. The domain, a compact three helix bundle, has a conserved GFP-loop and the proline is thought to be critical for binding. The UBA domain is distinct from the conserved three helical domain seen in the N-terminus of EF-TS and eukaryotic NAC proteins.
smart S_TKc 241aa 8e-89
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
pfam KA1 43aa 7e-14
in ref transcript
Kinase associated domain 1.
smart UBA 37aa 3e-05
in ref transcript
Ubiquitin associated domain. Present in Rad23, SNF1-like kinases. The newly-found UBA in p62 is known to bind ubiquitin.
PTZ PTZ00263 250aa 3e-46
in ref transcript
protein kinase A catalytic subunit; Provisional.
COG RER1 45aa 0.002
in ref transcript
Golgi protein involved in Golgi-to-ER retrieval [Intracellular trafficking and secretion].
MARK2
refseq_MARK2.F4 refseq_MARK2.R4 112 274
NCBIGene 36.3 2011
Single exon skipping, size difference: 162
Exclusion in the protein (no frameshift)
Reference transcript: NM_001039469
cd S_TKc 253aa 4e-87
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
cd UBA 38aa 0.001
in ref transcript
Ubiquitin Associated domain. The UBA domain is a commonly occurring sequence motif in some members of the ubiquitination pathway, UV excision repair proteins, and certain protein kinases. Although its specific role is so far unknown, it has been suggested that UBA domains are involved in conferring protein target specificity. The domain, a compact three helix bundle, has a conserved GFP-loop and the proline is thought to be critical for binding. The UBA domain is distinct from the conserved three helical domain seen in the N-terminus of EF-TS and eukaryotic NAC proteins.
smart S_TKc 241aa 8e-89
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
pfam KA1 43aa 7e-14
in ref transcript
Kinase associated domain 1.
smart UBA 37aa 3e-05
in ref transcript
Ubiquitin associated domain. Present in Rad23, SNF1-like kinases. The newly-found UBA in p62 is known to bind ubiquitin.
PTZ PTZ00263 250aa 3e-46
in ref transcript
protein kinase A catalytic subunit; Provisional.
COG RER1 45aa 0.002
in ref transcript
Golgi protein involved in Golgi-to-ER retrieval [Intracellular trafficking and secretion].
MATN2
refseq_MATN2.F1 refseq_MATN2.R1 139 196
NCBIGene 36.3 4147
Alternative 3-prime, size difference: 57
Exclusion in the protein (no frameshift)
Reference transcript: NM_002380
cd vWA_Matrilin 223aa 7e-99
in ref transcript
VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Changed!
cd vWA_Matrilin 241aa 5e-92
in ref transcript
cd vWA_Matrilin 44aa 5e-09
in ref transcript
cd vWA_Matrilin 44aa 2e-06
in ref transcript
cd vWA_Matrilin 44aa 2e-06
in ref transcript
cd vWA_Matrilin 43aa 5e-06
in ref transcript
cd vWA_Matrilin 52aa 5e-06
in ref transcript
cd vWA_Matrilin 43aa 7e-06
in ref transcript
cd vWA_Matrilin 50aa 5e-04
in ref transcript
pfam VWA 176aa 4e-49
in ref transcript
von Willebrand factor type A domain.
pfam VWA 176aa 2e-48
in ref transcript
pfam Matrilin_ccoil 44aa 1e-11
in ref transcript
Trimeric coiled-coil oligomerisation domain of matrilin. This short domain is a coiled coil structure and has a single cysteine residue at the start which is likely to form a di-sulfide bridge with a corresponding cysteine in an upstream EGF (pfam00008) domain thereby spanning a VWA (pfam00092) domain. All three domains can be associated together as in the cartilage matrix protein matrilin, where this domain is likely to be responsible for oligomerisation.
smart EGF_CA 39aa 0.008
in ref transcript
Calcium-binding EGF-like domain.
Changed!
cd vWA_Matrilin 222aa 6e-89
in modified transcript
MATN4
refseq_MATN4.F2 refseq_MATN4.R2 256 379
NCBIGene 36.3 8785
Single exon skipping, size difference: 123
Exclusion in the protein (no frameshift)
Reference transcript: NM_003833
cd vWA_Matrilin 237aa 3e-88
in ref transcript
VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
Changed!
cd vWA_Matrilin 223aa 6e-86
in ref transcript
Changed!
cd vWA_Matrilin 44aa 0.001
in ref transcript
pfam VWA 176aa 2e-47
in ref transcript
von Willebrand factor type A domain.
pfam VWA 172aa 2e-41
in ref transcript
pfam Matrilin_ccoil 44aa 2e-08
in ref transcript
Trimeric coiled-coil oligomerisation domain of matrilin. This short domain is a coiled coil structure and has a single cysteine residue at the start which is likely to form a di-sulfide bridge with a corresponding cysteine in an upstream EGF (pfam00008) domain thereby spanning a VWA (pfam00092) domain. All three domains can be associated together as in the cartilage matrix protein matrilin, where this domain is likely to be responsible for oligomerisation.
Changed!
smart EGF_CA 39aa 5e-04
in ref transcript
Calcium-binding EGF-like domain.
Changed!
cd vWA_Matrilin 223aa 3e-82
in modified transcript
Changed!
smart EGF_CA 40aa 0.006
in modified transcript
MAX
refseq_MAX.F2 refseq_MAX.R2 125 226
NCBIGene 36.3 4149
Single exon skipping, size difference: 101
Inclusion in the protein causing a new stop codon
Reference transcript: NM_002382
cd HLH 57aa 6e-07
in ref transcript
Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
pfam HLH 52aa 5e-12
in ref transcript
Helix-loop-helix DNA-binding domain.
MAX
refseq_MAX.F3 refseq_MAX.R3 141 168
NCBIGene 36.3 4149
Single exon skipping, size difference: 27
Exclusion in the protein (no frameshift)
Reference transcript: NM_002382
Changed!
cd HLH 57aa 6e-07
in ref transcript
Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
pfam HLH 52aa 5e-12
in ref transcript
Helix-loop-helix DNA-binding domain.
Changed!
cd HLH 59aa 4e-07
in modified transcript
MBD1
refseq_MBD1.F1 refseq_MBD1.R1 123 261
NCBIGene 36.3 4152
Single exon skipping, size difference: 138
Exclusion in the protein (no frameshift)
Reference transcript: NM_015846
cd MeCP2_MBD 62aa 1e-15
in ref transcript
MeCP2, MBD1, MBD2, MBD3, and MBD4 are members of a protein family that share the methyl-CpG-binding domain (MBD). The MBD, consists of about 70 residues and is defined as the minimal region required for binding to methylated DNA by a methyl-CpG-binding protein which binds specifically to methylated DNA. The MBD can recognize a single symmetrically methylated CpG either as naked DNA or within chromatin. MeCP2, MBD1 and MBD2 (and likely MBD3) form complexes with histone deacetylase and are involved in histone deacetylase-dependent repression of transcription. MBD4 is an endonuclease that forms a complex with the DNA mismatch-repair protein MLH1.
smart MBD 74aa 7e-20
in ref transcript
Methyl-CpG binding domain. Methyl-CpG binding domain, also known as the TAM (TTF-IIP5, ARBP, MeCP1) domain.
pfam zf-CXXC 47aa 1e-11
in ref transcript
CXXC zinc finger domain. This domain contains eight conserved cysteine residues that bind to two zinc ions. The CXXC domain is found in a variety of chromatin-associated proteins. This domain binds to nonmethyl-CpG dinucleotides. The domain is characterised by two CGXCXXC repeats. The RecQ helicase has a single repeat that also binds to zinc, but this has not been included in this family. The DNA binding interface has been identified by NMR.
pfam zf-CXXC 48aa 8e-08
in ref transcript
pfam zf-CXXC 46aa 1e-06
in ref transcript
MBD1
refseq_MBD1.F4 refseq_MBD1.R4 135 204
NCBIGene 36.3 4152
Single exon skipping, size difference: 69
Exclusion in the protein (no frameshift)
Reference transcript: NM_015846
cd MeCP2_MBD 62aa 1e-15
in ref transcript
MeCP2, MBD1, MBD2, MBD3, and MBD4 are members of a protein family that share the methyl-CpG-binding domain (MBD). The MBD, consists of about 70 residues and is defined as the minimal region required for binding to methylated DNA by a methyl-CpG-binding protein which binds specifically to methylated DNA. The MBD can recognize a single symmetrically methylated CpG either as naked DNA or within chromatin. MeCP2, MBD1 and MBD2 (and likely MBD3) form complexes with histone deacetylase and are involved in histone deacetylase-dependent repression of transcription. MBD4 is an endonuclease that forms a complex with the DNA mismatch-repair protein MLH1.
smart MBD 74aa 7e-20
in ref transcript
Methyl-CpG binding domain. Methyl-CpG binding domain, also known as the TAM (TTF-IIP5, ARBP, MeCP1) domain.
pfam zf-CXXC 47aa 1e-11
in ref transcript
CXXC zinc finger domain. This domain contains eight conserved cysteine residues that bind to two zinc ions. The CXXC domain is found in a variety of chromatin-associated proteins. This domain binds to nonmethyl-CpG dinucleotides. The domain is characterised by two CGXCXXC repeats. The RecQ helicase has a single repeat that also binds to zinc, but this has not been included in this family. The DNA binding interface has been identified by NMR.
pfam zf-CXXC 48aa 8e-08
in ref transcript
pfam zf-CXXC 46aa 1e-06
in ref transcript
MBD1
refseq_MBD1.F5 refseq_MBD1.R5 163 235
NCBIGene 36.3 4152
Single exon skipping, size difference: 72
Exclusion of the stop codon
Reference transcript: NM_015846
cd MeCP2_MBD 62aa 1e-15
in ref transcript
MeCP2, MBD1, MBD2, MBD3, and MBD4 are members of a protein family that share the methyl-CpG-binding domain (MBD). The MBD, consists of about 70 residues and is defined as the minimal region required for binding to methylated DNA by a methyl-CpG-binding protein which binds specifically to methylated DNA. The MBD can recognize a single symmetrically methylated CpG either as naked DNA or within chromatin. MeCP2, MBD1 and MBD2 (and likely MBD3) form complexes with histone deacetylase and are involved in histone deacetylase-dependent repression of transcription. MBD4 is an endonuclease that forms a complex with the DNA mismatch-repair protein MLH1.
smart MBD 74aa 7e-20
in ref transcript
Methyl-CpG binding domain. Methyl-CpG binding domain, also known as the TAM (TTF-IIP5, ARBP, MeCP1) domain.
pfam zf-CXXC 47aa 1e-11
in ref transcript
CXXC zinc finger domain. This domain contains eight conserved cysteine residues that bind to two zinc ions. The CXXC domain is found in a variety of chromatin-associated proteins. This domain binds to nonmethyl-CpG dinucleotides. The domain is characterised by two CGXCXXC repeats. The RecQ helicase has a single repeat that also binds to zinc, but this has not been included in this family. The DNA binding interface has been identified by NMR.
pfam zf-CXXC 48aa 8e-08
in ref transcript
pfam zf-CXXC 46aa 1e-06
in ref transcript
MBD1
refseq_MBD1.F7 refseq_MBD1.R7 156 303
NCBIGene 36.3 4152
Single exon skipping, size difference: 147
Exclusion in the protein (no frameshift)
Reference transcript: NM_015846
cd MeCP2_MBD 62aa 1e-15
in ref transcript
MeCP2, MBD1, MBD2, MBD3, and MBD4 are members of a protein family that share the methyl-CpG-binding domain (MBD). The MBD, consists of about 70 residues and is defined as the minimal region required for binding to methylated DNA by a methyl-CpG-binding protein which binds specifically to methylated DNA. The MBD can recognize a single symmetrically methylated CpG either as naked DNA or within chromatin. MeCP2, MBD1 and MBD2 (and likely MBD3) form complexes with histone deacetylase and are involved in histone deacetylase-dependent repression of transcription. MBD4 is an endonuclease that forms a complex with the DNA mismatch-repair protein MLH1.
smart MBD 74aa 7e-20
in ref transcript
Methyl-CpG binding domain. Methyl-CpG binding domain, also known as the TAM (TTF-IIP5, ARBP, MeCP1) domain.
pfam zf-CXXC 47aa 1e-11
in ref transcript
CXXC zinc finger domain. This domain contains eight conserved cysteine residues that bind to two zinc ions. The CXXC domain is found in a variety of chromatin-associated proteins. This domain binds to nonmethyl-CpG dinucleotides. The domain is characterised by two CGXCXXC repeats. The RecQ helicase has a single repeat that also binds to zinc, but this has not been included in this family. The DNA binding interface has been identified by NMR.
Changed!
pfam zf-CXXC 48aa 8e-08
in ref transcript
Changed!
pfam zf-CXXC 46aa 1e-06
in ref transcript
Changed!
pfam zf-CXXC 46aa 9e-08
in modified transcript
MBD1
refseq_MBD1.F9 refseq_MBD1.R9 144 312
NCBIGene 36.3 4152
Single exon skipping, size difference: 168
Exclusion in the protein (no frameshift)
Reference transcript: NM_015846
cd MeCP2_MBD 62aa 1e-15
in ref transcript
MeCP2, MBD1, MBD2, MBD3, and MBD4 are members of a protein family that share the methyl-CpG-binding domain (MBD). The MBD, consists of about 70 residues and is defined as the minimal region required for binding to methylated DNA by a methyl-CpG-binding protein which binds specifically to methylated DNA. The MBD can recognize a single symmetrically methylated CpG either as naked DNA or within chromatin. MeCP2, MBD1 and MBD2 (and likely MBD3) form complexes with histone deacetylase and are involved in histone deacetylase-dependent repression of transcription. MBD4 is an endonuclease that forms a complex with the DNA mismatch-repair protein MLH1.
smart MBD 74aa 7e-20
in ref transcript
Methyl-CpG binding domain. Methyl-CpG binding domain, also known as the TAM (TTF-IIP5, ARBP, MeCP1) domain.
Changed!
pfam zf-CXXC 47aa 1e-11
in ref transcript
CXXC zinc finger domain. This domain contains eight conserved cysteine residues that bind to two zinc ions. The CXXC domain is found in a variety of chromatin-associated proteins. This domain binds to nonmethyl-CpG dinucleotides. The domain is characterised by two CGXCXXC repeats. The RecQ helicase has a single repeat that also binds to zinc, but this has not been included in this family. The DNA binding interface has been identified by NMR.
pfam zf-CXXC 48aa 8e-08
in ref transcript
pfam zf-CXXC 46aa 1e-06
in ref transcript
MBNL1
refseq_MBNL1.F2 refseq_MBNL1.R2 153 189
NCBIGene 36.3 4154
Single exon skipping, size difference: 36
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_207293
smart ZnF_C3H1 25aa 0.004
in ref transcript
zinc finger.
MBNL1
refseq_MBNL1.F3 refseq_MBNL1.R3 298 352
NCBIGene 36.3 4154
Single exon skipping, size difference: 54
Exclusion in the protein (no frameshift)
Reference transcript: NM_207293
smart ZnF_C3H1 25aa 0.004
in ref transcript
zinc finger.
MBNL1
refseq_MBNL1.F5 refseq_MBNL1.R5 108 312
NCBIGene 36.3 4154
Single exon skipping, size difference: 204
Exclusion in the protein (no frameshift)
Reference transcript: NM_207293
Changed!
smart ZnF_C3H1 25aa 0.004
in ref transcript
zinc finger.
Changed!
smart ZnF_C3H1 25aa 0.008
in modified transcript
MBNL1
refseq_MBNL1.F7 refseq_MBNL1.R7 159 254
NCBIGene 36.3 4154
Single exon skipping, size difference: 95
Exclusion in the protein causing a frameshift
Reference transcript: NM_021038
smart ZnF_C3H1 25aa 0.005
in ref transcript
zinc finger.
MBNL1
refseq_MBNL1.F8 refseq_MBNL1.R8 115 179
NCBIGene 36.3 4154
Single exon skipping, size difference: 64
Inclusion in the protein causing a new stop codon
Reference transcript: NM_207293
smart ZnF_C3H1 25aa 0.004
in ref transcript
zinc finger.
MBNL2
refseq_MBNL2.F1 refseq_MBNL2.R1 205 336
NCBIGene 36.3 10150
Multiple exon skipping, size difference: 131
Exclusion in the protein (no frameshift), Exclusion in the protein causing a frameshift
Reference transcript: NM_144778
MBNL3
refseq_MBNL3.F2 refseq_MBNL3.R2 156 230
NCBIGene 36.3 55796
Single exon skipping, size difference: 74
Inclusion in the protein causing a new stop codon
Reference transcript: NM_018388
smart ZnF_C3H1 27aa 0.005
in ref transcript
zinc finger.
MBNL3
refseq_MBNL3.F4 refseq_MBNL3.R4 269 374
NCBIGene 36.3 55796
Alternative 3-prime, size difference: 105
Exclusion in the protein (no frameshift)
Reference transcript: NM_018388
smart ZnF_C3H1 27aa 0.005
in ref transcript
zinc finger.
MBP
refseq_MBP.F2 refseq_MBP.R2 134 212
NCBIGene 36.3 4155
Single exon skipping, size difference: 78
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_001025101
Changed!
pfam Myelin_MBP 154aa 1e-59
in ref transcript
Myelin basic protein.
Changed!
pfam Myelin_MBP 180aa 7e-56
in modified transcript
MCHR2
refseq_MCHR2.F1 refseq_MCHR2.R1 219 458
NCBIGene 36.3 84539
Alternative 5-prime, size difference: 239
Exclusion in 5'UTR
Reference transcript: NM_001040179
pfam 7tm_1 250aa 7e-24
in ref transcript
7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
MCM8
refseq_MCM8.F2 refseq_MCM8.R2 269 323
NCBIGene 36.3 84515
Alternative 5-prime, size difference: 54
Inclusion in 5'UTR
Reference transcript: NM_032485
cd AAA 159aa 1e-04
in ref transcript
The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
smart MCM 543aa 1e-121
in ref transcript
minichromosome maintenance proteins.
COG MCM2 699aa 1e-120
in ref transcript
Predicted ATPase involved in replication control, Cdc46/Mcm family [DNA replication, recombination, and repair].
MCM8
refseq_MCM8.F3 refseq_MCM8.R3 105 153
NCBIGene 36.3 84515
Alternative 3-prime, size difference: 48
Exclusion in the protein (no frameshift)
Reference transcript: NM_032485
cd AAA 159aa 1e-04
in ref transcript
The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
Changed!
smart MCM 543aa 1e-121
in ref transcript
minichromosome maintenance proteins.
Changed!
COG MCM2 699aa 1e-120
in ref transcript
Predicted ATPase involved in replication control, Cdc46/Mcm family [DNA replication, recombination, and repair].
Changed!
smart MCM 527aa 1e-116
in modified transcript
Changed!
COG MCM2 683aa 1e-116
in modified transcript
MECR
refseq_MECR.F1 refseq_MECR.R1 233 360
NCBIGene 36.3 51102
Alternative 3-prime, size difference: 127
Inclusion in the protein causing a new stop codon
Reference transcript: NM_016011
Changed!
cd CBS_like 75aa 2e-05
in ref transcript
CBS_like: This subgroup includes Cystathionine beta-synthase (CBS) and Cysteine synthase. CBS is a unique heme-containing enzyme that catalyzes a pyridoxal 5'-phosphate (PLP)-dependent condensation of serine and homocysteine to give cystathionine. Deficiency of CBS leads to homocystinuria, an inherited disease of sulfur metabolism characterized by increased levels of the toxic metabolite homocysteine. Cysteine synthase on the other hand catalyzes the last step of cysteine biosynthesis. This subgroup also includes an O-Phosphoserine sulfhydrylase found in hyperthermophilic archaea which produces L-cysteine from sulfide and the more thermostable O-phospho-L-serine.
Changed!
smart PKS_ER 285aa 3e-26
in ref transcript
Enoylreductase. Enoylreductase in Polyketide synthases.
Changed!
COG Qor 330aa 6e-50
in ref transcript
NADPH:quinone reductase and related Zn-dependent oxidoreductases [Energy production and conversion / General function prediction only].
MED8
refseq_MED8.F1 refseq_MED8.R1 106 167
NCBIGene 36.2 112950
Alternative 3-prime, size difference: 61
Inclusion in the protein causing a frameshift
Reference transcript: NM_001001651
Changed!
pfam Arc32 178aa 3e-55
in ref transcript
Mediator of RNA polymerase II transcription. Arc32, or Med8, is one of the subunits of the Mediator complex of RNA polymerase II. The region conserved contains two alpha helices putatively necessary for binding to other subunits within the core of the Mediator complex.
MED8
refseq_MED8.F3 refseq_MED8.R3 102 120
NCBIGene 36.2 112950
Alternative 3-prime, size difference: 18
Exclusion in the protein (no frameshift)
Reference transcript: NM_001001651
pfam Arc32 178aa 3e-55
in ref transcript
Mediator of RNA polymerase II transcription. Arc32, or Med8, is one of the subunits of the Mediator complex of RNA polymerase II. The region conserved contains two alpha helices putatively necessary for binding to other subunits within the core of the Mediator complex.
MEIS2
refseq_MEIS2.F2 refseq_MEIS2.R2 101 122
NCBIGene 36.3 4212
Alternative 3-prime, size difference: 21
Exclusion in the protein (no frameshift)
Reference transcript: NM_170675
cd homeodomain 54aa 2e-10
in ref transcript
Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
smart HOX 54aa 2e-11
in ref transcript
Homeodomain. DNA-binding factors that are involved in the transcriptional regulation of key developmental processes.
MEIS2
refseq_MEIS2.F4 refseq_MEIS2.R4 299 395
NCBIGene 36.3 4212
Single exon skipping, size difference: 96
Inclusion in the protein causing a new stop codon
Reference transcript: NM_170675
cd homeodomain 54aa 2e-10
in ref transcript
Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
smart HOX 54aa 2e-11
in ref transcript
Homeodomain. DNA-binding factors that are involved in the transcriptional regulation of key developmental processes.
MEIS2
refseq_MEIS2.F5 refseq_MEIS2.R5 130 153
NCBIGene 36.3 4212
Alternative 3-prime, size difference: 23
Exclusion in the protein causing a frameshift
Reference transcript: NM_170675
Changed!
cd homeodomain 54aa 2e-10
in ref transcript
Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
Changed!
smart HOX 54aa 2e-11
in ref transcript
Homeodomain. DNA-binding factors that are involved in the transcriptional regulation of key developmental processes.
MEIS2
refseq_MEIS2.F7 refseq_MEIS2.R7 174 251
NCBIGene 36.2 4212
Single exon skipping, size difference: 77
Exclusion in the protein causing a frameshift
Reference transcript: NM_170675
Changed!
cd homeodomain 54aa 2e-10
in ref transcript
Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
Changed!
smart HOX 54aa 2e-11
in ref transcript
Homeodomain. DNA-binding factors that are involved in the transcriptional regulation of key developmental processes.
MEIS3
refseq_MEIS3.F2 refseq_MEIS3.R2 169 307
NCBIGene 36.3 56917
Alternative 3-prime, size difference: 138
Exclusion in the protein (no frameshift)
Reference transcript: NM_020160
Changed!
cd homeodomain 37aa 2e-08
in ref transcript
Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
Changed!
smart HOX 37aa 9e-09
in ref transcript
Homeodomain. DNA-binding factors that are involved in the transcriptional regulation of key developmental processes.
Changed!
cd homeodomain 56aa 1e-09
in modified transcript
Changed!
smart HOX 56aa 8e-11
in modified transcript
MEIS3
refseq_MEIS3.F4 refseq_MEIS3.R4 141 192
NCBIGene 36.3 56917
Alternative 3-prime, size difference: 51
Exclusion in the protein (no frameshift)
Reference transcript: NM_020160
cd homeodomain 37aa 2e-08
in ref transcript
Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
smart HOX 37aa 9e-09
in ref transcript
Homeodomain. DNA-binding factors that are involved in the transcriptional regulation of key developmental processes.
MEN1
refseq_MEN1.F1 refseq_MEN1.R1 109 355
NCBIGene 36.3 4221
Alternative 5-prime, size difference: 246
Exclusion in 5'UTR
Reference transcript: NM_130802
pfam Menin 615aa 0.0
in ref transcript
Menin. MEN1, the gene responsible for multiple endocrine neoplasia type 1, is a tumour suppressor gene that encodes a protein called Menin which may be an atypical GTPase stimulated by nm23.
MEOX1
refseq_MEOX1.F1 refseq_MEOX1.R1 227 400
NCBIGene 36.3 4222
Single exon skipping, size difference: 173
Exclusion in the protein causing a frameshift
Reference transcript: NM_004527
Changed!
cd homeodomain 59aa 5e-18
in ref transcript
Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
Changed!
pfam Homeobox 57aa 8e-24
in ref transcript
Mitochondrial small ribosomal subunit Rsm22. Rsm22 has been identified as a mitochondrial small ribosomal subunit and is a methyltransferase. In Schizosaccharomyces pombe, Rsm22 is tandemly fused to Cox11 (a factor required for copper insertion into cytochrome oxidase) and the two proteins are proteolytically cleaved after import into the mitochondria.
Changed!
COG COG5459 263aa 1e-26
in ref transcript
Predicted rRNA methylase [Translation, ribosomal structure and biogenesis].
METTL1
refseq_METTL1.F1 refseq_METTL1.R1 163 348
NCBIGene 36.3 4234
Single exon skipping, size difference: 185
Exclusion in the protein causing a frameshift
Reference transcript: NM_005371
Changed!
pfam Methyltransf_4 200aa 2e-62
in ref transcript
Putative methyltransferase. This is a family of putative methyltransferases. The aligned region contains the GXGXG S-AdoMet binding site suggesting a putative methyltransferase activity.
Changed!
COG COG0220 243aa 3e-39
in ref transcript
Predicted S-adenosylmethionine-dependent methyltransferase [General function prediction only].
Changed!
COG COG0220 78aa 0.002
in modified transcript
MGA
refseq_MGA.F1 refseq_MGA.R1 100 247
NCBIGene 36.2 23269
Alternative 3-prime, size difference: 147
Exclusion in the protein (no frameshift)
Reference transcript: XM_932720
cd TBOX 191aa 9e-87
in ref transcript
T-box DNA binding domain of the T-box family of transcriptional regulators. The T-box family is an ancient group that appears to play a critical role in development in all animal species. These genes were uncovered on the basis of similarity to the DNA binding domain of murine Brachyury (T) gene product, the defining feature of the family. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development and conserved expression patterns, most of the known genes in all species being expressed in mesoderm or mesoderm precursors.
pfam T-box 186aa 2e-80
in ref transcript
T-box. The T-box encodes a 180 amino acid domain that binds to DNA. Genes encoding T-box proteins are found in a wide range of animals, but not in other kingdoms such as plants. Family members are all thought to bind to the DNA consensus sequence TCACACCT. they are found exclusively in the nucleus, and perform DNA-binding and transcriptional activation/repression roles. They are generally required for development of the specific tissues they are expressed in, and mutations in T-box genes are implicated in human conditions such as DiGeorge syndrome and X-linked cleft palate, which feature malformations.
C16orf13
refseq_MGC13114.F1 refseq_MGC13114.R1 237 400
NCBIGene 36.3 84326
Single exon skipping, size difference: 163
Exclusion in the protein causing a frameshift
Reference transcript: NM_001040160
Changed!
pfam DUF938 137aa 6e-48
in ref transcript
Protein of unknown function (DUF938). This family consists of several hypothetical proteins from both prokaryotes and eukaryotes. The function of this family is unknown.
Changed!
pfam DUF938 62aa 8e-21
in modified transcript
Changed!
pfam DUF938 40aa 2e-12
in modified transcript
C17orf91
refseq_MGC14376.F1 refseq_MGC14376.R1 267 350
NCBIGene 36.3 84981
Single exon skipping, size difference: 83
Exclusion in 5'UTR
Reference transcript: NM_032895
MGC15875
refseq_MGC15875.F2 refseq_MGC15875.R2 130 218
NCBIGene 36.2 85007
Alternative 3-prime, size difference: 88
Exclusion in the protein causing a frameshift
Reference transcript: NM_153373
Changed!
cd OAT_like 404aa 1e-105
in ref transcript
Acetyl ornithine aminotransferase family. This family belongs to pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I). The major groups in this CD correspond to ornithine aminotransferase, acetylornithine aminotransferase, alanine-glyoxylate aminotransferase, dialkylglycine decarboxylase, 4-aminobutyrate aminotransferase, beta-alanine-pyruvate aminotransferase, adenosylmethionine-8-amino-7-oxononanoate aminotransferase, and glutamate-1-semialdehyde 2,1-aminomutase. All the enzymes belonging to this family act on basic amino acids and their derivatives are involved in transamination or decarboxylation.
Changed!
pfam Aminotran_3 340aa 8e-65
in ref transcript
Aminotransferase class-III.
Changed!
PRK PRK06148 429aa 1e-153
in ref transcript
hypothetical protein; Provisional.
Changed!
cd OAT_like 205aa 1e-46
in modified transcript
Changed!
pfam Aminotran_3 201aa 2e-30
in modified transcript
Changed!
PRK PRK06148 227aa 2e-80
in modified transcript
MGC15875
refseq_MGC15875.F3 refseq_MGC15875.R3 143 378
NCBIGene 36.2 85007
Multiple exon skipping, size difference: 235
Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
Reference transcript: NM_153373
Changed!
cd OAT_like 404aa 1e-105
in ref transcript
Acetyl ornithine aminotransferase family. This family belongs to pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I). The major groups in this CD correspond to ornithine aminotransferase, acetylornithine aminotransferase, alanine-glyoxylate aminotransferase, dialkylglycine decarboxylase, 4-aminobutyrate aminotransferase, beta-alanine-pyruvate aminotransferase, adenosylmethionine-8-amino-7-oxononanoate aminotransferase, and glutamate-1-semialdehyde 2,1-aminomutase. All the enzymes belonging to this family act on basic amino acids and their derivatives are involved in transamination or decarboxylation.
Changed!
pfam Aminotran_3 340aa 8e-65
in ref transcript
Aminotransferase class-III.
Changed!
PRK PRK06148 429aa 1e-153
in ref transcript
hypothetical protein; Provisional.
Changed!
cd OAT_like 31aa 5e-06
in modified transcript
Changed!
TIGR argD 28aa 6e-05
in modified transcript
Members of this family may also act on ornithine, like ornithine aminotransferase (EC 2.6.1.13) and on succinyldiaminopimelate, like N-succinyldiaminopmelate-aminotransferase (EC 2.6.1.17, DapC, an enzyme of lysine biosynthesis).
Changed!
PRK PRK06148 55aa 4e-09
in modified transcript
MGC15875
refseq_MGC15875.F5 refseq_MGC15875.R5 106 197
NCBIGene 36.2 85007
Exon skipping and alternative 3-prime or 5-prime, size difference: 91
Inclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
Reference transcript: NM_153373
Changed!
cd OAT_like 404aa 1e-105
in ref transcript
Acetyl ornithine aminotransferase family. This family belongs to pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I). The major groups in this CD correspond to ornithine aminotransferase, acetylornithine aminotransferase, alanine-glyoxylate aminotransferase, dialkylglycine decarboxylase, 4-aminobutyrate aminotransferase, beta-alanine-pyruvate aminotransferase, adenosylmethionine-8-amino-7-oxononanoate aminotransferase, and glutamate-1-semialdehyde 2,1-aminomutase. All the enzymes belonging to this family act on basic amino acids and their derivatives are involved in transamination or decarboxylation.
Changed!
pfam Aminotran_3 340aa 8e-65
in ref transcript
Aminotransferase class-III.
Changed!
PRK PRK06148 429aa 1e-153
in ref transcript
hypothetical protein; Provisional.
Changed!
cd OAT_like 126aa 7e-35
in modified transcript
Changed!
pfam Aminotran_3 128aa 9e-22
in modified transcript
Changed!
PRK PRK06148 163aa 7e-60
in modified transcript
MGC19604
refseq_MGC19604.F1 refseq_MGC19604.R1 181 218
NCBIGene 36.2 112812
Alternative 3-prime, size difference: 37
Exclusion in the protein causing a frameshift
Reference transcript: NM_001031734
Changed!
cd fer2 90aa 7e-08
in ref transcript
2Fe-2S iron-sulfur cluster binding domain. Iron-sulfur proteins play an important role in electron transfer processes and in various enzymatic reactions. The family includes plant and algal ferredoxins, which act as electron carriers in photosynthesis and ferredoxins, which participate in redox chains (from bacteria to mammals). Fold is ismilar to thioredoxin.
Changed!
TIGR fdx_isc 86aa 2e-16
in ref transcript
This family consists of proteobacterial ferredoxins associated with and essential to the ISC system of 2Fe-2S cluster assembly. This family is closely related to (but excludes) eukaryotic (mitochondrial) adrenodoxins, which are ferredoxins involved in electron transfer to P450 cytochromes.
Changed!
COG Fdx 102aa 2e-15
in ref transcript
Ferredoxin [Energy production and conversion].
Changed!
cd fer2 68aa 4e-05
in modified transcript
Changed!
TIGR fdx_isc 46aa 1e-07
in modified transcript
Changed!
COG Fdx 71aa 7e-07
in modified transcript
PRELID2
refseq_MGC21644.F1 refseq_MGC21644.R1 140 343
NCBIGene 36.3 153768
Single exon skipping, size difference: 203
Inclusion in the protein causing a new stop codon
Reference transcript: NM_182960
Changed!
pfam PRELI 170aa 4e-39
in ref transcript
PRELI-like family. This family includes a conserved region found in the PRELI protein and yeast YLR168C gene MSF1 product. The function of this protein is unknown, though it is thought to be involved in intra-mitochondrial protein sorting. This region is also found in a number of other eukaryotic proteins.
PRELID2
refseq_MGC21644.F2 refseq_MGC21644.R2 100 136
NCBIGene 36.3 153768
Single exon skipping, size difference: 36
Exclusion in the protein (no frameshift)
Reference transcript: NM_182960
Changed!
pfam PRELI 170aa 4e-39
in ref transcript
PRELI-like family. This family includes a conserved region found in the PRELI protein and yeast YLR168C gene MSF1 product. The function of this protein is unknown, though it is thought to be involved in intra-mitochondrial protein sorting. This region is also found in a number of other eukaryotic proteins.
Changed!
pfam PRELI 158aa 1e-40
in modified transcript
MGC3207
refseq_MGC3207.F1 refseq_MGC3207.R1 247 388
NCBIGene 36.3 84245
Exon skipping and alternative 3-prime or 5-prime, size difference: 141
Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
Reference transcript: NM_001031727
Changed!
TIGR salvage_mtnA 343aa 1e-122
in ref transcript
The delineation of this family was based in part on a discussion and neighbor-joining phylogenetic study, by Kyrpides and Woese, of archaeal and other proteins homologous to the alpha, beta, and delta subunits of eukaryotic initiation factor 2B (eIF-2B), a five-subunit molecule that catalyzes GTP recycling for eIF-2. This clade is now recognized to include the methionine salvage pathway enzyme MtnA.
Changed!
PRK mtnA 351aa 2e-84
in ref transcript
methylthioribose-1-phosphate isomerase; Reviewed.
Changed!
TIGR salvage_mtnA 146aa 1e-67
in modified transcript
Changed!
TIGR salvage_mtnA 93aa 5e-27
in modified transcript
Changed!
PRK mtnA 154aa 8e-58
in modified transcript
Changed!
COG COG0182 96aa 9e-13
in modified transcript
Predicted translation initiation factor 2B subunit, eIF-2B alpha/beta/delta family [Translation, ribosomal structure and biogenesis].
FAM133B
refseq_MGC40405.F1 refseq_MGC40405.R1 160 259
NCBIGene 36.3 257415
Single exon skipping, size difference: 99
Inclusion in the protein causing a new stop codon
Reference transcript: NM_152789
MID1
refseq_MID1.F2 refseq_MID1.R2 213 313
NCBIGene 36.2 4281
Alternative 5-prime and 3-prime, size difference: 100
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_000381
cd FN3 100aa 9e-07
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd RING 55aa 2e-04
in ref transcript
RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
cd BBOX 39aa 4e-04
in ref transcript
B-Box-type zinc finger; zinc binding domain (CHC3H2); often present in combination with other motifs, like RING zinc finger, NHL motif, coiled-coil or RFP domain in functionally unrelated proteins, most likely mediating protein-protein interaction.
Changed!
smart SPRY 119aa 2e-25
in ref transcript
Domain in SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
smart BBC 126aa 1e-13
in ref transcript
B-Box C-terminal domain. Coiled coil region C-terminal to (some) B-Box domains.
smart FN3 90aa 4e-08
in ref transcript
Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
smart BBOX 42aa 3e-06
in ref transcript
B-Box-type zinc finger.
smart RING 50aa 3e-05
in ref transcript
Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
smart PRY 38aa 0.002
in ref transcript
associated with SPRY domains.
MID2
refseq_MID2.F1 refseq_MID2.R1 205 295
NCBIGene 36.3 11043
Alternative 5-prime, size difference: 90
Exclusion in the protein (no frameshift)
Reference transcript: NM_012216
cd RING 55aa 2e-04
in ref transcript
RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
cd BBOX 38aa 5e-04
in ref transcript
B-Box-type zinc finger; zinc binding domain (CHC3H2); often present in combination with other motifs, like RING zinc finger, NHL motif, coiled-coil or RFP domain in functionally unrelated proteins, most likely mediating protein-protein interaction.
Changed!
cd FN3 47aa 0.001
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
smart BBC 126aa 8e-29
in ref transcript
B-Box C-terminal domain. Coiled coil region C-terminal to (some) B-Box domains.
smart SPRY 109aa 4e-24
in ref transcript
Domain in SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
pfam zf-B_box 42aa 2e-05
in ref transcript
B-box zinc finger.
smart RING 50aa 2e-05
in ref transcript
Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
Changed!
smart FN3 120aa 1e-04
in ref transcript
Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
PRK PRK00409 124aa 1e-04
in ref transcript
recombination and DNA strand exchange inhibitor protein; Reviewed.
Changed!
cd FN3 100aa 1e-06
in modified transcript
Changed!
smart FN3 90aa 4e-08
in modified transcript
MINA
refseq_MINA.F1 refseq_MINA.R1 264 368
NCBIGene 36.2 84864
Alternative 3-prime, size difference: 3
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_032778
Changed!
pfam Cupin_4 312aa 5e-89
in ref transcript
Cupin superfamily protein. This family contains many hypothetical proteins that belong to the cupin superfamily.
COG COG2850 202aa 2e-10
in ref transcript
Uncharacterized conserved protein [Function unknown].
Changed!
pfam Cupin_4 313aa 3e-88
in modified transcript
MINA
refseq_MINA.F1 refseq_MINA.R3 124 225
NCBIGene 36.2 84864
Single exon skipping, size difference: 101
Inclusion in the protein causing a new stop codon
Reference transcript: NM_032778
Changed!
pfam Cupin_4 312aa 5e-89
in ref transcript
Cupin superfamily protein. This family contains many hypothetical proteins that belong to the cupin superfamily.
COG COG2850 202aa 2e-10
in ref transcript
Uncharacterized conserved protein [Function unknown].
Changed!
pfam Cupin_4 227aa 2e-71
in modified transcript
MINK1
refseq_MINK1.F1 refseq_MINK1.R1 131 155
NCBIGene 36.3 50488
Single exon skipping, size difference: 24
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_153827
cd STKc_MAP4K4_6 192aa 1e-114
in ref transcript
Serine/threonine kinases (STKs), mitogen-activated protein kinase (MAPK) kinase kinase kinase 4 (MAPKKKK4 or MAP4K4) and MAPKKKK6 (or MAP4K6) subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The MAP4K4/MAP4K6 subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. Members of this subfamily contain an N-terminal catalytic domain and a C-terminal citron homology (CNH) regulatory domain. MAP4Ks (or MAPKKKKs) are involved in MAPK signaling pathways that are important in mediating cellular responses to extracellular signals by activating a MAPK kinase kinase (MAPKKK or MAP3K or MKKK). Each MAPK cascade is activated either by a small GTP-binding protein or by an adaptor protein, which transmits the signal either directly to a MAP3K to start the triple kinase core cascade or indirectly through a mediator kinase, a MAP4K. MAP4K4 is also called Nck Interacting kinase (NIK). It facilitates the activation of the MAPKs, extracellular signal-regulated kinase (ERK) 1, ERK2, and c-Jun N-terminal kinase (JNK), by phosphorylating and activating MEKK1. MAP4K4 plays a role in tumor necrosis factor (TNF) alpha-induced insulin resistance. MAP4K4 silencing in skeletal muscle cells from type II diabetic patients restores insulin-mediated glucose uptake. MAP4K4, through JNK, also plays a broad role in cell motility, which impacts inflammation, homeostasis, as well as the invasion and spread of cancer. MAP4K4 is found to be highly expressed in most tumor cell lines relative to normal tissue. MAP4K6 (also called MINK for Misshapen/NIKs-related kinase) is activated after Ras induction and mediates activation of p38 MAPK. MAP4K6 plays a role in cell cycle arrest, cytoskeleton organization, cell adhesion, and cell motility.
smart CNH 299aa 9e-83
in ref transcript
Domain found in NIK1-like kinases, mouse citron and yeast ROM1, ROM2. Unpublished observations.
smart S_TKc 188aa 2e-54
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
COG SPS1 205aa 2e-24
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
Serine/threonine kinases (STKs), mitogen-activated protein kinase (MAPK) kinase kinase kinase 4 (MAPKKKK4 or MAP4K4) and MAPKKKK6 (or MAP4K6) subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The MAP4K4/MAP4K6 subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. Members of this subfamily contain an N-terminal catalytic domain and a C-terminal citron homology (CNH) regulatory domain. MAP4Ks (or MAPKKKKs) are involved in MAPK signaling pathways that are important in mediating cellular responses to extracellular signals by activating a MAPK kinase kinase (MAPKKK or MAP3K or MKKK). Each MAPK cascade is activated either by a small GTP-binding protein or by an adaptor protein, which transmits the signal either directly to a MAP3K to start the triple kinase core cascade or indirectly through a mediator kinase, a MAP4K. MAP4K4 is also called Nck Interacting kinase (NIK). It facilitates the activation of the MAPKs, extracellular signal-regulated kinase (ERK) 1, ERK2, and c-Jun N-terminal kinase (JNK), by phosphorylating and activating MEKK1. MAP4K4 plays a role in tumor necrosis factor (TNF) alpha-induced insulin resistance. MAP4K4 silencing in skeletal muscle cells from type II diabetic patients restores insulin-mediated glucose uptake. MAP4K4, through JNK, also plays a broad role in cell motility, which impacts inflammation, homeostasis, as well as the invasion and spread of cancer. MAP4K4 is found to be highly expressed in most tumor cell lines relative to normal tissue. MAP4K6 (also called MINK for Misshapen/NIKs-related kinase) is activated after Ras induction and mediates activation of p38 MAPK. MAP4K6 plays a role in cell cycle arrest, cytoskeleton organization, cell adhesion, and cell motility.
smart CNH 299aa 9e-83
in ref transcript
Domain found in NIK1-like kinases, mouse citron and yeast ROM1, ROM2. Unpublished observations.
smart S_TKc 188aa 2e-54
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
COG SPS1 205aa 2e-24
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
Changed!
PRK PRK07764 140aa 2e-04
in modified transcript
DNA polymerase III subunits gamma and tau; Validated.
MINK1
refseq_MINK1.F5 refseq_MINK1.R5 246 306
NCBIGene 36.3 50488
Alternative 3-prime, size difference: 60
Exclusion in the protein (no frameshift)
Reference transcript: NM_153827
cd STKc_MAP4K4_6 192aa 1e-114
in ref transcript
Serine/threonine kinases (STKs), mitogen-activated protein kinase (MAPK) kinase kinase kinase 4 (MAPKKKK4 or MAP4K4) and MAPKKKK6 (or MAP4K6) subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The MAP4K4/MAP4K6 subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. Members of this subfamily contain an N-terminal catalytic domain and a C-terminal citron homology (CNH) regulatory domain. MAP4Ks (or MAPKKKKs) are involved in MAPK signaling pathways that are important in mediating cellular responses to extracellular signals by activating a MAPK kinase kinase (MAPKKK or MAP3K or MKKK). Each MAPK cascade is activated either by a small GTP-binding protein or by an adaptor protein, which transmits the signal either directly to a MAP3K to start the triple kinase core cascade or indirectly through a mediator kinase, a MAP4K. MAP4K4 is also called Nck Interacting kinase (NIK). It facilitates the activation of the MAPKs, extracellular signal-regulated kinase (ERK) 1, ERK2, and c-Jun N-terminal kinase (JNK), by phosphorylating and activating MEKK1. MAP4K4 plays a role in tumor necrosis factor (TNF) alpha-induced insulin resistance. MAP4K4 silencing in skeletal muscle cells from type II diabetic patients restores insulin-mediated glucose uptake. MAP4K4, through JNK, also plays a broad role in cell motility, which impacts inflammation, homeostasis, as well as the invasion and spread of cancer. MAP4K4 is found to be highly expressed in most tumor cell lines relative to normal tissue. MAP4K6 (also called MINK for Misshapen/NIKs-related kinase) is activated after Ras induction and mediates activation of p38 MAPK. MAP4K6 plays a role in cell cycle arrest, cytoskeleton organization, cell adhesion, and cell motility.
smart CNH 299aa 9e-83
in ref transcript
Domain found in NIK1-like kinases, mouse citron and yeast ROM1, ROM2. Unpublished observations.
smart S_TKc 188aa 2e-54
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
COG SPS1 205aa 2e-24
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
smart S_TKc 316aa 2e-66
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
Changed!
COG SPS1 323aa 1e-33
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
Changed!
COG SPS1 336aa 3e-34
in modified transcript
MKNK1
refseq_MKNK1.F4 refseq_MKNK1.R4 132 255
NCBIGene 36.3 8569
Single exon skipping, size difference: 123
Exclusion in the protein (no frameshift)
Reference transcript: NM_003684
Changed!
cd S_TKc 322aa 3e-62
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
Changed!
smart S_TKc 316aa 2e-66
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
Changed!
COG SPS1 323aa 1e-33
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
Changed!
cd S_TKc 281aa 3e-67
in modified transcript
Changed!
smart S_TKc 275aa 2e-71
in modified transcript
Changed!
COG SPS1 282aa 4e-39
in modified transcript
MLH3
refseq_MLH3.F1 refseq_MLH3.R1 153 225
NCBIGene 36.3 27030
Single exon skipping, size difference: 72
Exclusion in the protein (no frameshift)
Reference transcript: NM_001040108
cd MutL_Trans_MLH3 144aa 4e-53
in ref transcript
MutL_Trans_MLH3: transducer domain, having a ribosomal S5 domain 2-like fold, found in proteins similar to yeast and human MLH3 (MutL homologue 3). MLH3 belongs to the DNA mismatch repair (MutL/MLH1/PMS2) family. This transducer domain is homologous to the second domain of the DNA gyrase B subunit, which is known to be important in nucleotide hydrolysis and the transduction of structural signals from ATP-binding site to the DNA breakage/reunion regions of the enzymes. MLH1 forms heterodimers with MLH3. The MLH1-MLH3 complex plays a role in meiosis. A role for hMLH1-hMLH3 in DNA mismatch repair (MMR) has not been established. It has been suggested that hMLH3 may be a low risk gene for colorectal cancer; however there is little evidence to support it having a role in classical HNPCC.
TIGR mutl 269aa 1e-44
in ref transcript
All proteins in this family for which the functions are known are involved in the process of generalized mismatch repair. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Changed!
smart MutL_C 174aa 4e-21
in ref transcript
MutL C terminal dimerisation domain. MutL and MutS are key components of the DNA repair machinery that corrects replication errors. MutS recognises mispaired or unpaired bases in a DNA duplex and in the presence of ATP, recruits MutL to form a DNA signaling complex for repair. The N terminal region of MutL contains the ATPase domain and the C terminal is involved in dimerisation.
pfam DNA_mis_repair 134aa 1e-05
in ref transcript
DNA mismatch repair protein, C-terminal domain. This family represents the C-terminal domain of the mutL/hexB/PMS1 family. This domain has a ribosomal S5 domain 2-like fold.
COG MutL 437aa 4e-45
in ref transcript
DNA mismatch repair enzyme (predicted ATPase) [DNA replication, recombination, and repair].
Changed!
PRK mutL 221aa 3e-22
in ref transcript
DNA mismatch repair protein; Reviewed.
Changed!
smart MutL_C 150aa 6e-11
in modified transcript
Changed!
COG MutL 152aa 6e-13
in modified transcript
MLL3
refseq_MLL3.F1 refseq_MLL3.R1 229 400
NCBIGene 36.2 58508
Exon skipping and alternative 3-prime or 5-prime, size difference: 171
Inclusion in the protein (no stop codon or frameshift), Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_170606
pfam SET 129aa 5e-28
in ref transcript
SET domain. SET domains are protein lysine methyltransferase enzymes. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains have been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as SET-N and SET-C. SET-C forms an unusual and conserved knot-like structure of probably functional importance. Additionally to SET-N and SET-C, an insert region (SET-I) and flanking regions of high structural variability form part of the overall structure.
smart FYRC 88aa 2e-19
in ref transcript
"FY-rich" domain, C-terminal region. is sometimes closely juxtaposed with the N-terminal region (FYRN), but sometimes is far distant. Unknown function, but occurs frequently in chromatin-associated proteins.
pfam FYRN 55aa 3e-16
in ref transcript
F/Y-rich N-terminus. This region is normally found in the trithorax/ALL1 family proteins. It is similar to SMART:SM00541.
pfam PHD 47aa 9e-09
in ref transcript
PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
pfam PHD 49aa 5e-08
in ref transcript
pfam PAT1 178aa 7e-08
in ref transcript
Topoisomerase II-associated protein PAT1. Members of this family are necessary for accurate chromosome transmission during cell division.
pfam PHD 51aa 1e-07
in ref transcript
pfam PHD 48aa 3e-06
in ref transcript
pfam PHD 52aa 1e-05
in ref transcript
smart HMG 52aa 4e-04
in ref transcript
high mobility group.
pfam PHD 52aa 0.001
in ref transcript
pfam ATG13 135aa 0.009
in ref transcript
Autophagy-related protein 13. Members of this family of phosphoproteins are involved in cytoplasm to vacuole transport (Cvt), and more specifically in Cvt vesicle formation. They are probably involved in the switching machinery regulating the conversion between the Cvt pathway and autophagy. Finally, ATG13 is also required for glycogen storage.
COG COG2940 187aa 2e-18
in ref transcript
Proteins containing SET domain [General function prediction only].
COG COG5141 115aa 0.001
in ref transcript
PHD zinc finger-containing protein [General function prediction only].
COG COG5141 137aa 0.003
in ref transcript
Changed!
PRK PRK07003 331aa 0.010
in modified transcript
DNA polymerase III subunits gamma and tau; Validated.
MLL5
refseq_MLL5.F2 refseq_MLL5.R2 249 397
NCBIGene 36.3 55904
Exon skipping and alternative 3-prime or 5-prime, size difference: 148
Exclusion in 5'UTR, Exclusion in 5'UTR, Exclusion in 5'UTR
Reference transcript: NM_182931
pfam SET 128aa 5e-25
in ref transcript
SET domain. SET domains are protein lysine methyltransferase enzymes. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains have been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as SET-N and SET-C. SET-C forms an unusual and conserved knot-like structure of probably functional importance. Additionally to SET-N and SET-C, an insert region (SET-I) and flanking regions of high structural variability form part of the overall structure.
pfam PHD 46aa 2e-10
in ref transcript
PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
TIGR PABP-1234 119aa 0.003
in ref transcript
There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, and of unknown tissue range.
pfam PAT1 113aa 0.003
in ref transcript
Topoisomerase II-associated protein PAT1. Members of this family are necessary for accurate chromosome transmission during cell division.
COG COG2940 298aa 1e-04
in ref transcript
Proteins containing SET domain [General function prediction only].
PRK PRK13729 87aa 0.007
in ref transcript
conjugal transfer pilus assembly protein TraB; Provisional.
MLLT10
refseq_MLLT10.F1 refseq_MLLT10.R1 157 205
NCBIGene 36.3 8028
Single exon skipping, size difference: 48
Exclusion in the protein (no frameshift)
Reference transcript: NM_004641
pfam PHD 47aa 1e-06
in ref transcript
PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
smart PHD 59aa 0.005
in ref transcript
PHD zinc finger. The plant homeodomain (PHD) finger is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in epigenetics and chromatin-mediated transcriptional regulation. The PHD finger binds two zinc ions using the so-called 'cross-brace' motif and is thus structurally related to the R ING finger and the FY VE finger. It is not yet known if PHD fingers have a common molecular function. Several reports suggest that it can function as a protein-protein interacton domain and it was recently demonstrated that the PHD finger of p300 can cooperate with the adjacent B ROMO domain in nucleosome binding in vitro. Other reports suggesting that the PHD finger is a ubiquitin ligase have been refuted as these domains were R ING fingers misidentified as PHD fingers.
COG COG5141 179aa 1e-37
in ref transcript
PHD zinc finger-containing protein [General function prediction only].
MLLT4
refseq_MLLT4.F1 refseq_MLLT4.R1 158 203
NCBIGene 36.3 4301
Single exon skipping, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_001040001
cd AF6_RA_repeat1 112aa 3e-53
in ref transcript
The AF-6 protein (also known as afadin and canoe) is a multidomain cell junction protein that contains two N-terminal Ras-associating (RA) domains in addition to FHA (forkhead-associated), DIL (class V myosin homology region), and PDZ domains and a proline-rich region. AF6 acts downstream of the Egfr (Epidermal Growth Factor-receptor)/Ras signalling pathway and provides a link from Egfr to cytoskeletal elements.
cd AF6_RA_repeat2 100aa 2e-42
in ref transcript
The AF-6 protein (also known as afadin and canoe) is a multidomain cell junction protein that contains two N-terminal Ras-associating (RA) domains in addition to FHA (forkhead-associated), DIL (class V myosin homology region), and PDZ domains and a proline-rich region. AF6 acts downstream of the Egfr (Epidermal Growth Factor-receptor)/Ras signalling pathway and provides a link from Egfr to cytoskeletal elements.
cd PDZ_signaling 84aa 3e-12
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
Changed!
cd FHA 90aa 2e-05
in ref transcript
Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain. FHA domains may bind phosphothreonine, phosphoserine and sometimes phosphotyrosine. In eukaryotes, many FHA domain-containing proteins localize to the nucleus, where they participate in establishing or maintaining cell cycle checkpoints, DNA repair, or transcriptional regulation. Members of the FHA family include: Dun1, Rad53, Cds1, Mek1, KAPP(kinase-associated protein phosphatase),and Ki-67 (a human nuclear protein related to cell proliferation).
pfam DIL 107aa 5e-26
in ref transcript
DIL domain. The DIL domain has no known function.
smart RA 94aa 7e-17
in ref transcript
Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS); putative RasGTP effectors in other cases. Kalhammer et al. have shown that not all RA domains bind RasGTP. Predicted structure similar to that determined, and that of the RasGTP-binding domain of Raf kinase. Predicted RA domains in PLC210 and nore1 found to bind RasGTP. Included outliers (Grb7, Grb14, adenylyl cyclases etc.).
smart PDZ 88aa 4e-14
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
smart RA 100aa 2e-13
in ref transcript
smart FHA 52aa 4e-08
in ref transcript
Forkhead associated domain. Found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain.
Changed!
cd FHA 101aa 1e-05
in modified transcript
MLX
refseq_MLX.F2 refseq_MLX.R2 115 277
NCBIGene 36.3 6945
Alternative 5-prime, size difference: 162
Exclusion in the protein (no frameshift)
Reference transcript: NM_170607
cd HLH 66aa 2e-10
in ref transcript
Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
pfam HLH 59aa 1e-12
in ref transcript
Helix-loop-helix DNA-binding domain.
MLX
refseq_MLX.F3 refseq_MLX.R3 125 215
NCBIGene 36.3 6945
Single exon skipping, size difference: 90
Exclusion in the protein (no frameshift)
Reference transcript: NM_170607
cd HLH 66aa 2e-10
in ref transcript
Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
pfam HLH 59aa 1e-12
in ref transcript
Helix-loop-helix DNA-binding domain.
MLXIPL
refseq_MLXIPL.F1 refseq_MLXIPL.R1 147 204
NCBIGene 36.3 51085
Alternative 3-prime, size difference: 57
Exclusion in the protein (no frameshift)
Reference transcript: NM_032951
Changed!
cd HLH 61aa 3e-09
in ref transcript
Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
Changed!
pfam HLH 55aa 8e-09
in ref transcript
Helix-loop-helix DNA-binding domain.
MMP19
refseq_MMP19.F1 refseq_MMP19.R1 101 448
NCBIGene 36.2 4327
Multiple exon skipping, size difference: 347
Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
Reference transcript: NM_002429
Changed!
cd ZnMc_MMP 154aa 4e-53
in ref transcript
Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Changed!
cd HX 186aa 7e-48
in ref transcript
Hemopexin-like repeats.; Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metalloproteinases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metalloproteinases (TIMPs). This CD contains 4 instances of the repeat.
Changed!
pfam Peptidase_M10 154aa 3e-57
in ref transcript
Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis.
Changed!
smart HX 39aa 2e-07
in ref transcript
Hemopexin-like repeats. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metalloproteinases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metalloproteinases (TIMPs).
Changed!
smart HX 44aa 2e-06
in ref transcript
Changed!
pfam PG_binding_1 50aa 6e-05
in ref transcript
Putative peptidoglycan binding domain. This domain is composed of three alpha helices. This domain is found at the N or C terminus of a variety of enzymes involved in bacterial cell wall degradation. This domain may have a general peptidoglycan binding function. This family is found N-terminal to the catalytic domain of matrixins.
Changed!
smart HX 43aa 4e-04
in ref transcript
Changed!
smart HX 47aa 0.010
in ref transcript
MMP23B
refseq_MMP23B.F1 refseq_MMP23B.R1 112 252
NCBIGene 36.2 8510
Single exon skipping, size difference: 140
Exclusion in the protein causing a frameshift
Reference transcript: NM_006983
Changed!
cd ZnMc_MMP 168aa 2e-48
in ref transcript
Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
Changed!
cd IGcam 61aa 0.004
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
Changed!
pfam Peptidase_M10 168aa 7e-40
in ref transcript
Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis.
Changed!
smart IGc2 61aa 3e-04
in ref transcript
Immunoglobulin C-2 Type.
Changed!
smart ShKT 35aa 0.004
in ref transcript
ShK toxin domain. ShK toxin domain.
MOG
refseq_MOG.F1 refseq_MOG.R1 120 370
NCBIGene 36.3 4340
Alternative 3-prime, size difference: 250
Inclusion in the protein causing a new stop codon
Reference transcript: NM_002433
smart IGv 81aa 2e-12
in ref transcript
Immunoglobulin V-Type.
MOG
refseq_MOG.F5 refseq_MOG.R5 275 413
NCBIGene 36.2 4340
Single exon skipping, size difference: 138
Inclusion in the protein causing a new stop codon
Reference transcript: NM_002433
smart IGv 81aa 2e-12
in ref transcript
Immunoglobulin V-Type.
MORF4L1
refseq_MORF4L1.F1 refseq_MORF4L1.R1 143 260
NCBIGene 36.3 10933
Single exon skipping, size difference: 117
Exclusion in the protein (no frameshift)
Reference transcript: NM_206839
Changed!
pfam MRG 261aa 1e-81
in ref transcript
MRG. This family consists of three different eukaryotic proteins (mortality factor 4 (MORF4/MRG15), male-specific lethal 3(MSL-3) and ESA1-associated factor 3(EAF3)). It is thought that the MRG family is involved in transcriptional regulation via histone acetylation. It contains 2 chromo domains and a leucine zipper motif.
Changed!
cd CHROMO 49aa 0.003
in modified transcript
Chromatin organization modifier (chromo) domain is a conserved region of around 50 amino acids found in a variety of chromosomal proteins, which appear to play a role in the functional organization of the eukaryotic nucleus. Experimental evidence implicates the chromo domain in the binding activity of these proteins to methylated histone tails and maybe RNA. May occur as single instance, in a tandem arrangement or followd by a related "chromo shadow" domain.
Changed!
pfam MRG 261aa 1e-81
in modified transcript
MOSPD3
refseq_MOSPD3.F1 refseq_MOSPD3.R1 128 158
NCBIGene 36.3 64598
Alternative 3-prime, size difference: 30
Exclusion in the protein (no frameshift)
Reference transcript: NM_001040097
Changed!
pfam Motile_Sperm 64aa 1e-10
in ref transcript
MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins.
Changed!
pfam Motile_Sperm 54aa 9e-08
in modified transcript
MPG
refseq_MPG.F2 refseq_MPG.R2 111 304
NCBIGene 36.3 4350
Single exon skipping, size difference: 193
Exclusion of the protein initiation site
Reference transcript: NM_002434
cd AAG 190aa 3e-57
in ref transcript
Alkyladenine DNA glycosylase (AAG), also known as 3-methyladenine DNA glycosylase, catalyzes the first step in base excision repair (BER) by cleaving damaged DNA bases within double-stranded DNA to produce an abasic site. AAG bends DNA by intercalating between the base pairs, causing the damaged base to flip out of the double helix and into the enzyme active site for cleavage. Although AAG represents one of six DNA glycosylase classes, it lacks the helix-hairpin-helix active site motif associated with the other BER glycosylases and is structurally quite distinct from them.
TIGR 3mg 201aa 1e-77
in ref transcript
This families are based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). All proteins in this family for which the function is known are involved in the base excision repair of alkylation damage to DNA. The exact specificty of the type of alkylation damage repaired by each of these varies somewhat between species. Substrates include 3-methyl adenine, 7-methyl-guanaine, and 3-methyl-guanine.
PRK PRK00802 204aa 4e-53
in ref transcript
3-methyladenine DNA glycosylase; Reviewed.
MPST
refseq_MPST.F1 refseq_MPST.R1 237 413
NCBIGene 36.3 4357
Single exon skipping, size difference: 176
Inclusion in the protein causing a new stop codon
Reference transcript: NM_021126
Changed!
cd TST_Repeat_2 116aa 5e-38
in ref transcript
Thiosulfate sulfurtransferase (TST), C-terminal, catalytic domain. TST contains 2 copies of the Rhodanese Homology Domain; this is the second repeat. Only the second repeat contains the catalytically active Cys residue.
Changed!
cd TST_Repeat_1 130aa 9e-37
in ref transcript
Thiosulfate sulfurtransferase (TST), N-terminal, inactive domain. TST contains 2 copies of the Rhodanese Homology Domain; this is the 1st repeat, which does not contain the catalytically active Cys residue. The role of the 1st repeat is uncertain, but it is believed to be involved in protein interaction.
Changed!
pfam Rhodanese 118aa 3e-19
in ref transcript
Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases.
Changed!
pfam Rhodanese 123aa 2e-16
in ref transcript
Changed!
COG SseA 284aa 4e-70
in ref transcript
Rhodanese-related sulfurtransferase [Inorganic ion transport and metabolism].
MPZL1
refseq_MPZL1.F1 refseq_MPZL1.R1 146 249
NCBIGene 36.3 9019
Single exon skipping, size difference: 103
Exclusion in the protein causing a frameshift
Reference transcript: NM_003953
pfam V-set 114aa 6e-12
in ref transcript
Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
MRE11A
refseq_MRE11A.F1 refseq_MRE11A.R1 186 293
NCBIGene 36.3 4361
Alternative 5-prime, size difference: 107
Inclusion in 5'UTR
Reference transcript: NM_005591
TIGR mre11 393aa 1e-140
in ref transcript
All proteins in this family for which functions are known are subunits of a nuclease complex made up of multiple proteins including MRE11 and RAD50 homologs. The functions of this nuclease complex include recombinational repair and non-homolgous end joining. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). The proteins in this family are distantly related to proteins in the SbcCD complex of bacteria.
COG SbcD 309aa 2e-20
in ref transcript
DNA repair exonuclease [DNA replication, recombination, and repair].
MRPL10
refseq_MRPL10.F2 refseq_MRPL10.R2 167 265
NCBIGene 36.3 124995
Single exon skipping, size difference: 98
Exclusion of the protein initiation site
Reference transcript: NM_148887
cd Ribosomal_L10 132aa 9e-22
in ref transcript
Ribosomal protein L10 family, L10 subfamily; composed of bacterial 50S ribosomal protein and eukaryotic mitochondrial 39S ribosomal protein, L10. L10 occupies the L7/L12 stalk of the ribosome. The N-terminal domain (NTD) of L10 interacts with L11 protein and forms the base of the L7/L12 stalk, while the extended C-terminal helix binds to two or three dimers of the NTD of L7/L12 (L7 and L12 are identical except for an acetylated N-terminus). The L7/L12 stalk is known to contain the binding site for elongation factors G and Tu (EF-G and EF-Tu, respectively); however, there is disagreement as to whether or not L10 is involved in forming the binding site. The stalk is believed to be associated with GTPase activities in protein synthesis. In a neuroblastoma cell line, L10 has been shown to interact with the SH3 domain of Src and to activate the binding of the Nck1 adaptor protein with skeletal proteins such as the Wiskott-Aldrich Syndrome Protein (WASP) and the WASP-interacting protein (WIP). These bacteria and eukaryotic sequences have no additional C-terminal domain, present in other eukaryotic and archaeal orthologs.
COG RplJ 128aa 3e-07
in ref transcript
Ribosomal protein L10 [Translation, ribosomal structure and biogenesis].
MRPL11
refseq_MRPL11.F1 refseq_MRPL11.R1 148 226
NCBIGene 36.3 65003
Alternative 5-prime, size difference: 78
Exclusion in the protein (no frameshift)
Reference transcript: NM_016050
Changed!
cd Ribosomal_L11 136aa 1e-36
in ref transcript
Ribosomal protein L11. Ribosomal protein L11, together with proteins L10 and L7/L12, and 23S rRNA, form the L7/L12 stalk on the surface of the large subunit of the ribosome. The homologous eukaryotic cytoplasmic protein is also called 60S ribosomal protein L12, which is distinct from the L12 involved in the formation of the L7/L12 stalk. The C-terminal domain (CTD) of L11 is essential for binding 23S rRNA, while the N-terminal domain (NTD) contains the binding site for the antibiotics thiostrepton and micrococcin. L11 and 23S rRNA form an essential part of the GTPase-associated region (GAR). Based on differences in the relative positions of the L11 NTD and CTD during the translational cycle, L11 is proposed to play a significant role in the binding of initiation factors, elongation factors, and release factors to the ribosome. Several factors, including the class I release factors RF1 and RF2, are known to interact directly with L11. In eukaryotes, L11 has been implicated in regulating the levels of ubiquinated p53 and MDM2 in the MDM2-p53 feedback loop, which is responsible for apoptosis in response to DNA damage. In bacteria, the "stringent response" to harsh conditions allows bacteria to survive, and ribosomes that lack L11 are deficient in stringent factor stimulation.
Changed!
smart RL11 137aa 8e-36
in ref transcript
Ribosomal protein L11/L12.
Changed!
PRK rplK 143aa 1e-27
in ref transcript
50S ribosomal protein L11; Validated.
Changed!
cd Ribosomal_L11 114aa 8e-36
in modified transcript
Changed!
smart RL11 115aa 5e-35
in modified transcript
Changed!
PRK rplK 116aa 1e-26
in modified transcript
MRPL11
refseq_MRPL11.F3 refseq_MRPL11.R3 193 287
NCBIGene 36.3 65003
Single exon skipping, size difference: 94
Inclusion in the protein causing a new stop codon
Reference transcript: NM_016050
cd Ribosomal_L11 136aa 1e-36
in ref transcript
Ribosomal protein L11. Ribosomal protein L11, together with proteins L10 and L7/L12, and 23S rRNA, form the L7/L12 stalk on the surface of the large subunit of the ribosome. The homologous eukaryotic cytoplasmic protein is also called 60S ribosomal protein L12, which is distinct from the L12 involved in the formation of the L7/L12 stalk. The C-terminal domain (CTD) of L11 is essential for binding 23S rRNA, while the N-terminal domain (NTD) contains the binding site for the antibiotics thiostrepton and micrococcin. L11 and 23S rRNA form an essential part of the GTPase-associated region (GAR). Based on differences in the relative positions of the L11 NTD and CTD during the translational cycle, L11 is proposed to play a significant role in the binding of initiation factors, elongation factors, and release factors to the ribosome. Several factors, including the class I release factors RF1 and RF2, are known to interact directly with L11. In eukaryotes, L11 has been implicated in regulating the levels of ubiquinated p53 and MDM2 in the MDM2-p53 feedback loop, which is responsible for apoptosis in response to DNA damage. In bacteria, the "stringent response" to harsh conditions allows bacteria to survive, and ribosomes that lack L11 are deficient in stringent factor stimulation.
smart RL11 137aa 8e-36
in ref transcript
Ribosomal protein L11/L12.
PRK rplK 143aa 1e-27
in ref transcript
50S ribosomal protein L11; Validated.
MRPL22
refseq_MRPL22.F1 refseq_MRPL22.R1 188 306
NCBIGene 36.3 29093
Single exon skipping, size difference: 118
Exclusion in the protein causing a frameshift
Reference transcript: NM_014180
Changed!
cd Ribosomal_L22 106aa 9e-23
in ref transcript
Ribosomal protein L22/L17e. L22 (L17 in eukaryotes) is a core protein of the large ribosomal subunit. It is the only ribosomal protein that interacts with all six domains of 23S rRNA, and is one of the proteins important for directing the proper folding and stabilizing the conformation of 23S rRNA. L22 is the largest protein contributor to the surface of the polypeptide exit channel, the tunnel through which the polypeptide product passes. L22 is also one of six proteins located at the putative translocon binding site on the exterior surface of the ribosome.
Changed!
pfam Ribosomal_L22 96aa 3e-09
in ref transcript
Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes.
Changed!
PRK rplV 103aa 1e-14
in ref transcript
50S ribosomal protein L22; Reviewed.
MRPL24
refseq_MRPL24.F2 refseq_MRPL24.R2 183 224
NCBIGene 36.3 79590
Alternative 5-prime, size difference: 41
Exclusion in 5'UTR
Reference transcript: NM_145729
TIGR rplX_bact 96aa 7e-15
in ref transcript
This model recognizes bacterial and organellar forms of ribosomal protein L24. It excludes eukaryotic and archaeal forms, designated L26 in eukaryotes.
PRK rplX 96aa 2e-13
in ref transcript
50S ribosomal protein L24; Reviewed.
MRPL33
refseq_MRPL33.F2 refseq_MRPL33.R2 192 299
NCBIGene 36.3 9553
Single exon skipping, size difference: 107
Exclusion in the protein causing a frameshift
Reference transcript: NM_004891
Changed!
TIGR rpmG_bact 49aa 0.002
in ref transcript
This model describes bacterial ribosomal protein L33 and its chloroplast and mitochondrial equivalents.
Changed!
PRK rpmG 51aa 3e-05
in ref transcript
50S ribosomal protein L33; Validated.
MRPL43
refseq_MRPL43.F2 refseq_MRPL43.R2 190 317
NCBIGene 36.3 84545
Single exon skipping, size difference: 127
Exclusion in the protein causing a frameshift
Reference transcript: NM_176794
pfam L51_S25_CI-B8 51aa 1e-10
in ref transcript
Mitochondrial ribosomal protein L51 / S25 / CI-B8 domain. The proteins in this family are located in the mitochondrion. The family includes ribosomal protein L51, and S25. This family also includes mitochondrial NADH-ubiquinone oxidoreductase B8 subunit (CI-B8) EC:1.6.5.3. It is not known whether all members of this family form part of the NADH-ubiquinone oxidoreductase and whether they are also all ribosomal proteins.
MRPL47
refseq_MRPL47.F2 refseq_MRPL47.R2 180 326
NCBIGene 36.3 57129
Single exon skipping, size difference: 146
Exclusion in the protein causing a frameshift
Reference transcript: NM_020409
Changed!
pfam MRP-L47 86aa 3e-29
in ref transcript
Mitochondrial 39-S ribosomal protein L47 (MRP-L47). This family represents the N-terminal region (approximately 8 residues) of the eukaryotic mitochondrial 39-S ribosomal protein L47 (MRP-L47). Mitochondrial ribosomal proteins (MRPs) are the counterparts of the cytoplasmic ribosomal proteins, in that they fulfil similar functions in protein biosynthesis. However, they are distinct in number, features and primary structure.
MRPL48
refseq_MRPL48.F2 refseq_MRPL48.R2 343 468
NCBIGene 36.2 51642
Single exon skipping, size difference: 125
Inclusion in the protein causing a new stop codon
Reference transcript: NM_016055
Changed!
pfam Ribosomal_S10 98aa 3e-16
in ref transcript
Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes.
MRPL52
refseq_MRPL52.F1 refseq_MRPL52.R1 101 175
NCBIGene 36.3 122704
Single exon skipping, size difference: 74
Inclusion in the protein causing a new stop codon
Reference transcript: NM_178336
MRPL55
refseq_MRPL55.F1 refseq_MRPL55.R1 168 276
NCBIGene 36.3 128308
Single exon skipping, size difference: 108
Exclusion in the protein (no frameshift)
Reference transcript: NM_181462
Changed!
pfam Mitoc_L55 118aa 1e-32
in ref transcript
Mitochondrial ribosomal protein L55. Members of this family are involved in mitochondrial biogenesis and G2/M phase cell cycle progression. They form a component of the mitochondrial ribosome large subunit (39S) which comprises a 16S rRNA and about 50 distinct proteins.
Changed!
pfam Mitoc_L55 119aa 3e-33
in modified transcript
MRPL55
refseq_MRPL55.F3 refseq_MRPL55.R3 145 190
NCBIGene 36.3 128308
Exon skipping and alternative 3-prime or 5-prime, size difference: 45
Inclusion in 5'UTR, Exclusion in 5'UTR
Reference transcript: NM_181454
pfam Mitoc_L55 119aa 3e-33
in ref transcript
Mitochondrial ribosomal protein L55. Members of this family are involved in mitochondrial biogenesis and G2/M phase cell cycle progression. They form a component of the mitochondrial ribosome large subunit (39S) which comprises a 16S rRNA and about 50 distinct proteins.
MRRF
refseq_MRRF.F1 refseq_MRRF.R1 109 269
NCBIGene 36.3 92399
Single exon skipping, size difference: 160
Exclusion in the protein causing a frameshift
Reference transcript: NM_138777
Changed!
cd RRF 178aa 3e-27
in ref transcript
Ribosome recycling factor (RRF). Ribosome recycling factor dissociates the posttermination complex, composed of the ribosome, deacylated tRNA, and mRNA, after termination of translation. Thus ribosomes are "recycled" and ready for another round of protein synthesis. RRF is believed to bind the ribosome at the A-site in a manner that mimics tRNA, but the specific mechanisms remain unclear. RRF is essential for bacterial growth. It is not necessary for cell growth in archaea or eukaryotes, but is found in mitochondria or chloroplasts of some eukaryotic species.
Changed!
pfam RRF 164aa 3e-22
in ref transcript
Ribosome recycling factor. The ribosome recycling factor (RRF / ribosome release factor) dissociates the ribosome from the mRNA after termination of translation, and is essential bacterial growth. Thus ribosomes are "recycled" and ready for another round of protein synthesis.
Changed!
PRK frr 183aa 7e-30
in ref transcript
ribosome recycling factor; Reviewed.
Changed!
cd RRF 109aa 2e-14
in modified transcript
Changed!
TIGR frr 103aa 2e-09
in modified transcript
This model finds only eubacterial proteins. Mitochondrial and/or chloroplast forms might be expected but are not currently known. This protein was previously called ribosome releasing factor. By releasing ribosomes from mRNA at the end of protein biosynthesis, it prevents inappropriate translation from 3-prime regions of the mRNA and frees the ribosome for new rounds of translation. EGAD|53116|YHR038W is part of the frr superfamily.
Changed!
PRK frr 114aa 1e-15
in modified transcript
MS4A1
refseq_MS4A1.F2 refseq_MS4A1.R2 137 400
NCBIGene 36.3 931
Exon skipping and alternative 3-prime or 5-prime, size difference: 263
Exclusion in 5'UTR, Exclusion in 5'UTR
Reference transcript: NM_152866
pfam CD20 167aa 2e-32
in ref transcript
CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a single beta subunit, and two disulfide-linked gamma subunits. The alpha subunit of Fc epsilon RI and most Fc receptors are homologous members of the Ig superfamily. By contrast, the beta and gamma subunits from Fc epsilon RI are not homologous to the Ig superfamily. Both molecules have four putative transmembrane segments and a probably topology where both amino- and carboxy termini protrude into the cytoplasm.
MS4A3
refseq_MS4A3.F1 refseq_MS4A3.R1 191 329
NCBIGene 36.3 932
Single exon skipping, size difference: 138
Exclusion in the protein (no frameshift)
Reference transcript: NM_006138
Changed!
pfam CD20 148aa 7e-24
in ref transcript
CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a single beta subunit, and two disulfide-linked gamma subunits. The alpha subunit of Fc epsilon RI and most Fc receptors are homologous members of the Ig superfamily. By contrast, the beta and gamma subunits from Fc epsilon RI are not homologous to the Ig superfamily. Both molecules have four putative transmembrane segments and a probably topology where both amino- and carboxy termini protrude into the cytoplasm.
Changed!
pfam CD20 106aa 1e-20
in modified transcript
MS4A6A
refseq_MS4A6A.F2 refseq_MS4A6A.R2 141 245
NCBIGene 36.3 64231
Alternative 5-prime, size difference: 104
Exclusion in the protein causing a frameshift
Reference transcript: NM_152852
Changed!
pfam CD20 163aa 3e-27
in ref transcript
CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a single beta subunit, and two disulfide-linked gamma subunits. The alpha subunit of Fc epsilon RI and most Fc receptors are homologous members of the Ig superfamily. By contrast, the beta and gamma subunits from Fc epsilon RI are not homologous to the Ig superfamily. Both molecules have four putative transmembrane segments and a probably topology where both amino- and carboxy termini protrude into the cytoplasm.
Changed!
pfam CD20 101aa 1e-14
in modified transcript
MS4A7
refseq_MS4A7.F1 refseq_MS4A7.R1 101 149
NCBIGene 36.3 58475
Alternative 5-prime, size difference: 48
Exclusion in 5'UTR
Reference transcript: NM_021201
pfam CD20 162aa 6e-33
in ref transcript
CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a single beta subunit, and two disulfide-linked gamma subunits. The alpha subunit of Fc epsilon RI and most Fc receptors are homologous members of the Ig superfamily. By contrast, the beta and gamma subunits from Fc epsilon RI are not homologous to the Ig superfamily. Both molecules have four putative transmembrane segments and a probably topology where both amino- and carboxy termini protrude into the cytoplasm.
MS4A7
refseq_MS4A7.F4 refseq_MS4A7.R4 162 297
NCBIGene 36.3 58475
Single exon skipping, size difference: 135
Exclusion in the protein (no frameshift)
Reference transcript: NM_206939
Changed!
pfam CD20 162aa 6e-33
in ref transcript
CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a single beta subunit, and two disulfide-linked gamma subunits. The alpha subunit of Fc epsilon RI and most Fc receptors are homologous members of the Ig superfamily. By contrast, the beta and gamma subunits from Fc epsilon RI are not homologous to the Ig superfamily. Both molecules have four putative transmembrane segments and a probably topology where both amino- and carboxy termini protrude into the cytoplasm.
Changed!
pfam CD20 121aa 1e-21
in modified transcript
MSH5
refseq_MSH5.F1 refseq_MSH5.R1 219 270
NCBIGene 36.3 4439
Alternative 3-prime, size difference: 51
Exclusion in the protein (no frameshift)
Reference transcript: NM_025259
cd ABC_MSH5_euk 207aa 5e-77
in ref transcript
MutS5 homolog in eukaryotes. The MutS protein initiates DNA mismatch repair by recognizing mispaired and unpaired bases embedded in duplex DNA and activating endo- and exonucleases to remove the mismatch. Members of the MutS family possess C-terminal domain with a conserved ATPase activity that belongs to the ATP binding cassette (ABC) superfamily. MutS homologs (MSH) have been identified in most prokaryotic and all eukaryotic organisms examined. Prokaryotes have two homologs (MutS1 and MutS2), whereas seven MSH proteins (MSH1 to MSH7) have been identified in eukaryotes. The homodimer MutS1 and heterodimers MSH2-MSH3 and MSH2-MSH6 are primarily involved in mitotic mismatch repair, whereas MSH4-MSH5 is involved in resolution of Holliday junctions during meiosis. All members of the MutS family contain the highly conserved Walker A/B ATPase domain, and many share a common mechanism of action. MutS1, MSH2-MSH3, MSH2-MSH6, and MSH4-MSH5 dimerize to form sliding clamps, and recognition of specific DNA structures or lesions results in ADP/ATP exchange.
TIGR mutS1 529aa 9e-66
in ref transcript
COG MutS 561aa 1e-84
in ref transcript
Mismatch repair ATPase (MutS family) [DNA replication, recombination, and repair].
MSH5
refseq_MSH5.F4 refseq_MSH5.R4 114 272
NCBIGene 36.3 4439
Alternative 5-prime, size difference: 158
Inclusion in 5'UTR
Reference transcript: NM_025259
cd ABC_MSH5_euk 207aa 5e-77
in ref transcript
MutS5 homolog in eukaryotes. The MutS protein initiates DNA mismatch repair by recognizing mispaired and unpaired bases embedded in duplex DNA and activating endo- and exonucleases to remove the mismatch. Members of the MutS family possess C-terminal domain with a conserved ATPase activity that belongs to the ATP binding cassette (ABC) superfamily. MutS homologs (MSH) have been identified in most prokaryotic and all eukaryotic organisms examined. Prokaryotes have two homologs (MutS1 and MutS2), whereas seven MSH proteins (MSH1 to MSH7) have been identified in eukaryotes. The homodimer MutS1 and heterodimers MSH2-MSH3 and MSH2-MSH6 are primarily involved in mitotic mismatch repair, whereas MSH4-MSH5 is involved in resolution of Holliday junctions during meiosis. All members of the MutS family contain the highly conserved Walker A/B ATPase domain, and many share a common mechanism of action. MutS1, MSH2-MSH3, MSH2-MSH6, and MSH4-MSH5 dimerize to form sliding clamps, and recognition of specific DNA structures or lesions results in ADP/ATP exchange.
TIGR mutS1 529aa 9e-66
in ref transcript
COG MutS 561aa 1e-84
in ref transcript
Mismatch repair ATPase (MutS family) [DNA replication, recombination, and repair].
MSL3L1
refseq_MSL3L1.F2 refseq_MSL3L1.R2 139 222
NCBIGene 36.3 10943
Single exon skipping, size difference: 83
Exclusion in the protein causing a frameshift
Reference transcript: NM_078629
Changed!
cd CHROMO 46aa 0.005
in ref transcript
Chromatin organization modifier (chromo) domain is a conserved region of around 50 amino acids found in a variety of chromosomal proteins, which appear to play a role in the functional organization of the eukaryotic nucleus. Experimental evidence implicates the chromo domain in the binding activity of these proteins to methylated histone tails and maybe RNA. May occur as single instance, in a tandem arrangement or followd by a related "chromo shadow" domain.
Changed!
pfam MRG 110aa 5e-26
in ref transcript
MRG. This family consists of three different eukaryotic proteins (mortality factor 4 (MORF4/MRG15), male-specific lethal 3(MSL-3) and ESA1-associated factor 3(EAF3)). It is thought that the MRG family is involved in transcriptional regulation via histone acetylation. It contains 2 chromo domains and a leucine zipper motif.
Changed!
pfam MRG 57aa 5e-19
in ref transcript
Changed!
smart CHROMO 43aa 2e-04
in ref transcript
Chromatin organization modifier domain.
MSMB
refseq_MSMB.F1 refseq_MSMB.R1 107 213
NCBIGene 36.3 4477
Single exon skipping, size difference: 106
Exclusion in the protein causing a frameshift
Reference transcript: NM_002443
Changed!
pfam PSP94 114aa 2e-50
in ref transcript
Beta-microseminoprotein (PSP-94). This family consists of the mammalian specific protein beta-microseminoprotein. Prostatic secretory protein of 94 amino acids (PSP94), also called beta-microseminoprotein, is a small, nonglycosylated protein, rich in cysteine residues. It was first isolated as a major protein from human seminal plasma. The exact function of this protein is unknown.
Changed!
pfam PSP94 36aa 2e-13
in modified transcript
MSRB3
refseq_MSRB3.F1 refseq_MSRB3.R1 177 304
NCBIGene 36.3 253827
Single exon skipping, size difference: 127
Inclusion in the protein causing a new stop codon
Reference transcript: NM_198080
Changed!
pfam SelR 123aa 3e-52
in ref transcript
SelR domain. Methionine sulfoxide reduction is an important process, by which cells regulate biological processes and cope with oxidative stress. MsrA, a protein involved in the reduction of methionine sulfoxides in proteins, has been known for four decades and has been extensively characterised with respect to structure and function. However, recent studies revealed that MsrA is only specific for methionine-S-sulfoxides. Because oxidised methionines occur in a mixture of R and S isomers in vivo, it was unclear how stereo-specific MsrA could be responsible for the reduction of all protein methionine sulfoxides. It appears that a second methionine sulfoxide reductase, SelR, evolved that is specific for methionine-R-sulfoxides, the activity that is different but complementary to that of MsrA. Thus, these proteins, working together, could reduce both stereoisomers of methionine sulfoxide. This domain is found both in SelR proteins and fused with the peptide methionine sulfoxide reductase enzymatic domain pfam01625. The domain has two conserved cysteine and histidines. The domain binds both selenium and zinc. The final cysteine is found to be replaced by the rare amino acid selenocysteine in some members of the family. This family has methionine-R-sulfoxide reductase activity.
Changed!
PRK PRK00222 129aa 3e-61
in ref transcript
methionine sulfoxide reductase B; Provisional.
MCAT
refseq_MT.F1 refseq_MT.R1 110 328
NCBIGene 36.3 27349
Single exon skipping, size difference: 218
Exclusion in the protein causing a frameshift
Reference transcript: NM_173467
Changed!
TIGR fabD 295aa 4e-53
in ref transcript
The seed alignment for this family of proteins contains a single member each from a number of bacterial species but also an additional pair of closely related, uncharacterized proteins from B. subtilis, one of which has a long C-terminal extension.
Changed!
TIGR fabD 107aa 6e-21
in modified transcript
Changed!
COG FabD 108aa 5e-19
in modified transcript
MTHFD2
refseq_MTHFD2.F1 refseq_MTHFD2.R1 176 274
NCBIGene 36.3 10797
Single exon skipping, size difference: 98
Inclusion in the protein causing a new stop codon
Reference transcript: NM_006636
Changed!
cd NAD_bind_m-THF_DH_Cyclohyd 180aa 5e-67
in ref transcript
NADP binding domain of methylene-tetrahydrofolate dehydrogenase/cyclohydrolase. NADP binding domain of the Methylene-Tetrahydrofolate Dehydrogenase/cyclohydrolase (m-THF DH/cyclohydrolase) bifunctional enzyme. Tetrahydrofolate is a versatile carrier of activated one-carbon units. The major one-carbon folate donors are N-5 methyltetrahydrofolate, N5,N10-m-THF, and N10-formayltetrahydrofolate. The oxidation of metabolic intermediate m-THF to m-THF requires the enzyme m-THF DH. In addition, most DHs also have an associated cyclohydrolase activity which catalyzes its hydrolysis to N10-formyltetrahydrofolate. m-THF DH is typically found as part of a multifunctional protein in eukaryotes. NADP-dependent m-THF DH in mammals, birds and yeast are components of a trifunctional enzyme with DH, cyclohydrolase, and synthetase activities. Certain eukaryotic cells also contain homodimeric bifunctional DH/cyclodrolase form. In bacteria, monofucntional DH, as well as bifunctional m-THF m-THF DHm-THF DHDH/cyclodrolase are found. In addition, yeast (S. cerevisiae) also express an monofunctional DH. This family contains the bifunctional DH/cyclohydrolase. M-THF DH, like other amino acid DH-like NAD(P)-binding domains, is a member of the Rossmann fold superfamily which includes glutamate, leucine, and phenylalanine DHs, m-THF DH, methylene-tetrahydromethanopterin DH, m-THF DH/cyclohydrolase, Shikimate DH-like proteins, malate oxidoreductases, and glutamyl tRNA reductase. Amino acid DHs catalyze the deamination of amino acids to keto acids with NAD(P)+ as a cofactor. The NAD(P)-binding Rossmann fold superfamily includes a wide variety of protein families including NAD(P)- binding domains of alcohol DHs, tyrosine-dependent oxidoreductases, glyceraldehyde-3-phosphate DH, lactate/malate DHs, formate/glycerate DHs, siroheme synthases, 6-phosphogluconate DH, amino acid DHs, repressor rex, NAD-binding potassium channel domain, CoA-binding, and ornithine cyclodeaminase-like domains.
Changed!
pfam THF_DHG_CYH_C 173aa 1e-64
in ref transcript
IF2/eIF5B (initiation factors 2/ eukaryotic initiation factor 5B) subfamily. IF2/eIF5B contribute to ribosomal subunit joining and function as GTPases that are maximally activated by the presence of both ribosomal subunits. As seen in other GTPases, IF2/IF5B undergoes conformational changes between its GTP- and GDP-bound states. Eukaryotic IF2/eIF5Bs possess three characteristic segments, including a divergent N-terminal region followed by conserved central and C-terminal segments. This core region is conserved among all known eukaryotic and archaeal IF2/eIF5Bs and eubacterial IF2s.
cd IF2_mtIF2_II 95aa 1e-32
in ref transcript
This family represents the domain II of bacterial Initiation Factor 2 (IF2) and its eukaryotic mitochondrial homologue mtIF2. IF2, the largest initiation factor is an essential GTP binding protein. In E. coli three natural forms of IF2 exist in the cell, IF2alpha, IF2beta1, and IF2beta2. Bacterial IF-2 is structurally and functionally related to eukaryotic mitochondrial mtIF-2.
cd mtIF2_IVc 88aa 5e-22
in ref transcript
mtIF2_IVc: this family represents the C2 subdomain of domain IV of mitochondrial translation initiation factor 2 (mtIF2) which adopts a beta-barrel fold displaying a high degree of structural similarity with domain II of the translation elongation factor EF-Tu. The C-terminal part of mtIF2 contains the entire fMet-tRNAfmet binding site of IF-2 and is resistant to proteolysis. This C-terminal portion consists of two domains, IF2 C1 and IF2 C2. IF2 C2 been shown to contain all molecular determinants necessary and sufficient for the recognition and binding of fMet-tRNAfMet. Like IF2 from certain prokaryotes such as Thermus thermophilus, mtIF2lacks domain II which is thought to be involved in binding of E.coli IF-2 to 30S subunits.
cd IF2_eIF5B 162aa 8e-04
in ref transcript
TIGR IF-2 545aa 1e-146
in ref transcript
This model discriminates eubacterial (and mitochondrial) translation initiation factor 2 (IF-2), encoded by the infB gene in bacteria, from similar proteins in the Archaea and Eukaryotes. In the bacteria and in organelles, the initiator tRNA is charged with N-formyl-Met instead of Met. This translation factor acts in delivering the initator tRNA to the ribosome. It is one of a number of GTP-binding translation factors recognized by the pfam HMM GTP_EFTU.
PRK infB 563aa 0.0
in ref transcript
translation initiation factor IF-2; Validated.
MTMR2
refseq_MTMR2.F1 refseq_MTMR2.R1 102 225
NCBIGene 36.3 8898
Single exon skipping, size difference: 123
Inclusion in the protein causing a new stop codon
Reference transcript: NM_016156
Changed!
pfam Myotub-related 118aa 5e-54
in ref transcript
Myotubularin-related. This family represents a region within eukaryotic myotubularin-related proteins that is sometimes found with pfam02893. Myotubularin is a dual-specific lipid phosphatase that dephosphorylates phosphatidylinositol 3-phosphate and phosphatidylinositol (3,5)-bi-phosphate. Mutations in gene encoding myotubularin-related proteins have been associated with disease.
Changed!
pfam GRAM 62aa 5e-09
in ref transcript
GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins.
Changed!
smart PTPc_motif 107aa 6e-06
in ref transcript
Protein tyrosine phosphatase, catalytic domain motif.
MTMR2
refseq_MTMR2.F2 refseq_MTMR2.R2 154 225
NCBIGene 36.3 8898
Single exon skipping, size difference: 71
Inclusion in the protein causing a new stop codon
Reference transcript: NM_016156
Changed!
pfam Myotub-related 118aa 5e-54
in ref transcript
Myotubularin-related. This family represents a region within eukaryotic myotubularin-related proteins that is sometimes found with pfam02893. Myotubularin is a dual-specific lipid phosphatase that dephosphorylates phosphatidylinositol 3-phosphate and phosphatidylinositol (3,5)-bi-phosphate. Mutations in gene encoding myotubularin-related proteins have been associated with disease.
Changed!
pfam GRAM 62aa 5e-09
in ref transcript
GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins.
Changed!
smart PTPc_motif 107aa 6e-06
in ref transcript
Protein tyrosine phosphatase, catalytic domain motif.
MTMR3
refseq_MTMR3.F1 refseq_MTMR3.R1 138 165
NCBIGene 36.3 8897
Single exon skipping, size difference: 27
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_021090
Changed!
cd FYVE 55aa 4e-15
in ref transcript
FYVE domain; Zinc-binding domain; targets proteins to membrane lipids via interaction with phosphatidylinositol-3-phosphate, PI3P; present in Fab1, YOTB, Vac1, and EEA1;.
pfam Myotub-related 163aa 5e-38
in ref transcript
Myotubularin-related. This family represents a region within eukaryotic myotubularin-related proteins that is sometimes found with pfam02893. Myotubularin is a dual-specific lipid phosphatase that dephosphorylates phosphatidylinositol 3-phosphate and phosphatidylinositol (3,5)-bi-phosphate. Mutations in gene encoding myotubularin-related proteins have been associated with disease.
Changed!
smart FYVE 66aa 4e-22
in ref transcript
Protein present in Fab1, YOTB, Vac1, and EEA1. The FYVE zinc finger is named after four proteins where it was first found: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn2+ ions. The FYVE finger has eight potential zinc coordinating cysteine positions. The FYVE finger is structurally related to the PHD finger and the RIN G finger. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. The FYVE finger functions in the membrane recruitment of cytosolic proteins by binding to phosphatidylinositol 3-phosphate (PI3P), which is prominent on endosomes. The R+HHC+XCG motif is critical for PI3P binding.
smart PTPc_motif 40aa 7e-05
in ref transcript
Protein tyrosine phosphatase, catalytic domain motif.
Changed!
cd FYVE 64aa 8e-13
in modified transcript
Changed!
smart FYVE 75aa 8e-20
in modified transcript
MTMR3
refseq_MTMR3.F3 refseq_MTMR3.R3 185 296
NCBIGene 36.3 8897
Single exon skipping, size difference: 111
Exclusion in the protein (no frameshift)
Reference transcript: NM_021090
cd FYVE 55aa 4e-15
in ref transcript
FYVE domain; Zinc-binding domain; targets proteins to membrane lipids via interaction with phosphatidylinositol-3-phosphate, PI3P; present in Fab1, YOTB, Vac1, and EEA1;.
pfam Myotub-related 163aa 5e-38
in ref transcript
Myotubularin-related. This family represents a region within eukaryotic myotubularin-related proteins that is sometimes found with pfam02893. Myotubularin is a dual-specific lipid phosphatase that dephosphorylates phosphatidylinositol 3-phosphate and phosphatidylinositol (3,5)-bi-phosphate. Mutations in gene encoding myotubularin-related proteins have been associated with disease.
smart FYVE 66aa 4e-22
in ref transcript
Protein present in Fab1, YOTB, Vac1, and EEA1. The FYVE zinc finger is named after four proteins where it was first found: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn2+ ions. The FYVE finger has eight potential zinc coordinating cysteine positions. The FYVE finger is structurally related to the PHD finger and the RIN G finger. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. The FYVE finger functions in the membrane recruitment of cytosolic proteins by binding to phosphatidylinositol 3-phosphate (PI3P), which is prominent on endosomes. The R+HHC+XCG motif is critical for PI3P binding.
smart PTPc_motif 40aa 7e-05
in ref transcript
Protein tyrosine phosphatase, catalytic domain motif.
MTO1
refseq_MTO1.F2 refseq_MTO1.R2 181 256
NCBIGene 36.3 25821
Single exon skipping, size difference: 75
Exclusion in the protein (no frameshift)
Reference transcript: NM_133645
Changed!
TIGR gidA 652aa 0.0
in ref transcript
GidA, the longer of two forms of GidA-related proteins, appears to be present in all complete eubacterial genomes so far, as well as Saccharomyces cerevisiae. A subset of these organisms have a closely related protein. GidA is absent in the Archaea. It appears to act with MnmE, in an alpha2/beta2 heterotetramer, in the 5-carboxymethylaminomethyl modification of uridine 34 in certain tRNAs. The shorter, related protein, previously called gid or gidA(S), is now called TrmFO (see model TIGR00137).
Changed!
TIGR gidA 627aa 0.0
in modified transcript
Changed!
PRK PRK05192 632aa 0.0
in modified transcript
MTP18
refseq_MTP18.F1 refseq_MTP18.R1 158 391
NCBIGene 36.3 51537
Single exon skipping, size difference: 233
Exclusion in the protein causing a frameshift
Reference transcript: NM_016498
Changed!
pfam MTP18 130aa 1e-45
in ref transcript
Mitochondrial 18 KDa protein (MTP18). This family of proteins are mitochondrial 18KDa proteins that are often misannotated as carbonic anhydrases. It was shown that knockdown of MTP18 protein results in a cytochrome c release from mitochondria and consequently leads to apoptosis. Overexpression studies suggest that MTP18 is required for mitochondrial fission.
Changed!
pfam MTP18 54aa 1e-16
in modified transcript
MTRR
refseq_MTRR.F2 refseq_MTRR.R2 114 140
NCBIGene 36.3 4552
Alternative 5-prime, size difference: 26
Inclusion in the protein causing a frameshift
Reference transcript: NM_024010
Changed!
cd methionine_synthase_red 421aa 1e-164
in ref transcript
Human methionine synthase reductase (MSR) restores methionine sythase which is responsible for the regeneration of methionine from homocysteine, as well as the coversion of methyltetrahydrofolate to tetrahydrofolate. In MSR, electrons are transferred from NADPH to FAD to FMN to cob(II)alamin. MSR resembles proteins of the cytochrome p450 family including nitric oxide synthase, the alpha subunit of sulfite reductase, but contains an extended hinge region. NADPH cytochrome p450 reductase (CYPOR) serves as an electron donor in several oxygenase systems and is a component of nitric oxide synthases and methionine synthase reductases. CYPOR transfers two electrons from NADPH to the heme of cytochrome p450 via FAD and FMN. CYPORs resemble ferredoxin reductase (FNR) but have a connecting subdomain inserted within the flavin binding region, which helps orient the FMN binding doamin with the FNR module. Ferredoxin-NADP+ (oxido)reductase is an FAD-containing enzyme that catalyzes the reversible electron transfer between NADP(H) and electron carrier proteins such as ferredoxin and flavodoxin. Isoforms of these flavoproteins (i.e. having a non-covalently bound FAD as a prosthetic group) are present in chloroplasts, mitochondria, and bacteria in which they participate in a wide variety of redox metabolic pathways. The C-terminal domain contains most of the NADP(H) binding residues and the N-terminal domain interacts non-covalently with the isoalloxazine rings of the flavin molecule which lies largely in a large gap betweed the two domains. Ferredoxin-NADP+ reductase first accepts one electron from reduced ferredoxin to form a flavin semiquinone intermediate. The enzyme then accepts a second electron to form FADH2 which then transfers two electrons and a proton to NADP+ to form NADPH.
Changed!
TIGR cysJ 414aa 4e-49
in ref transcript
This model describes an NADPH-dependent sulfite reductase flavoprotein subunit. Most members of this family are found in Cys biosynthesis gene clusters. The closest homologs below the trusted cutoff are designated as subunits nitrate reductase.
Changed!
pfam Flavodoxin_1 137aa 1e-29
in ref transcript
Flavodoxin.
Changed!
COG CysJ 432aa 1e-70
in ref transcript
Sulfite reductase, alpha subunit (flavoprotein) [Inorganic ion transport and metabolism].
Changed!
COG CysJ 188aa 1e-19
in ref transcript
MTUS1
refseq_MTUS1.F2 refseq_MTUS1.R2 105 166
NCBIGene 36.2 57509
Single exon skipping, size difference: 61
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001001924
Changed!
TIGR SMC_prok_B 297aa 3e-14
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
Changed!
COG Smc 320aa 2e-11
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
MTX1
refseq_MTX1.F1 refseq_MTX1.R1 114 207
NCBIGene 36.3 4580
Single exon skipping, size difference: 93
Exclusion in the protein (no frameshift)
Reference transcript: NM_002455
Changed!
cd GST_C_Metaxin1_3 137aa 3e-51
in ref transcript
GST_C family, Metaxin subfamily, Metaxin 1-like proteins; composed of metaxins 1 and 3, and similar proteins. Mammalian metaxin (or metaxin 1) is a component of the preprotein import complex of the mitochondrial outer membrane. Metaxin extends to the cytosol and is anchored to the mitochondrial membrane through its C-terminal domain. In mice, metaxin is required for embryonic development. Like the murine gene, the human metaxin gene is located downstream to the glucocerebrosidase (GBA) pseudogene and is convergently transcribed. Inherited deficiency of GBA results in Gaucher disease, which presents many diverse clinical phenotypes. Alterations in the metaxin gene, in addition to GBA mutations, may be associated with Gaucher disease. Genome sequencing shows that a third metaxin gene also exists in zebrafish, Xenopus, chicken, and mammals.
Changed!
cd GST_N_Metaxin1_like 74aa 7e-23
in ref transcript
GST_N family, Metaxin subfamily, Metaxin 1-like proteins; composed of metaxins 1 and 3, and similar proteins including Tom37 from fungi. Mammalian metaxin (or metaxin 1) and the fungal protein Tom37 are components of preprotein import complexes of the mitochondrial outer membrane. Metaxin extends to the cytosol and is anchored to the mitochondrial membrane through its C-terminal domain. In mice, metaxin is required for embryonic development. Like the murine gene, the human metaxin gene is located downstream to the glucocerebrosidase (GBA) pseudogene and is convergently transcribed. Inherited deficiency of GBA results in Gaucher disease, which presents many diverse clinical phenotypes. Alterations in the metaxin gene, in addition to GBA mutations, may be associated with Gaucher disease. Genome sequencing shows that a third metaxin gene also exists in zebrafish, Xenopus, chicken and mammals.
Changed!
pfam Tom37 219aa 4e-58
in ref transcript
Outer mitochondrial membrane transport complex protein. The TOM37 protein is one of the outer membrane proteins that make up the TOM complex for guiding cytosolic mitochondrial beta-barrel proteins from the cytosol across the outer mitochondrial membrane into the intramembrane space. In conjunction with TOM70 it guides peptides without an MTS into TOM40, the protein that forms the passage through the outer membrane. It has homology with Metaxin-1, also part of the outer mitochondrial membrane beta-barrel protein transport complex.
Changed!
cd GST_C_Metaxin1_3 134aa 3e-50
in modified transcript
Changed!
cd GST_N_Metaxin1_like 73aa 1e-22
in modified transcript
Changed!
pfam Tom37 188aa 8e-43
in modified transcript
MTX2
refseq_MTX2.F1 refseq_MTX2.R1 153 357
NCBIGene 36.3 10651
Single exon skipping, size difference: 204
Inclusion in the protein causing a new stop codon
Reference transcript: NM_006554
Changed!
cd GST_C_Metaxin2 126aa 2e-62
in ref transcript
GST_C family, Metaxin subfamily, Metaxin 2; a metaxin 1 binding protein identified through a yeast two-hybrid system using metaxin 1 as the bait. Metaxin 2 shares sequence similarity with metaxin 1 but does not contain a C-terminal mitochondrial outer membrane signal-anchor domain. It associates with mitochondrial membranes through its interaction with metaxin 1, which is a component of the mitochondrial preprotein import complex of the outer membrane. The biological function of metaxin 2 is unknown. It is likely that it also plays a role in protein translocation into the mitochondria. However, this has not been experimentally validated. In a recent proteomics study, it has been shown that metaxin 2 is overexpressed in response to lipopolysaccharide-induced liver injury.
Changed!
cd GST_N_Metaxin2 74aa 8e-32
in ref transcript
GST_N family, Metaxin subfamily, Metaxin 2; a metaxin 1 binding protein identified through a yeast two-hybrid system using metaxin 1 as the bait. Metaxin 2 shares sequence similarity with metaxin 1 but does not contain a C-terminal mitochondrial outer membrane signal-anchor domain. It associates with mitochondrial membranes through its interaction with metaxin 1, which is a component of the mitochondrial preprotein import complex of the outer membrane. The biological function of metaxin 2 is unknown. It is likely that it also plays a role in protein translocation into the mitochondria. However, this has not been experimentally validated. In a recent proteomics study, it has been shown that metaxin 2 is overexpressed in response to lipopolysaccharide-induced liver injury.
Changed!
pfam Tom37 197aa 2e-11
in ref transcript
Outer mitochondrial membrane transport complex protein. The TOM37 protein is one of the outer membrane proteins that make up the TOM complex for guiding cytosolic mitochondrial beta-barrel proteins from the cytosol across the outer mitochondrial membrane into the intramembrane space. In conjunction with TOM70 it guides peptides without an MTS into TOM40, the protein that forms the passage through the outer membrane. It has homology with Metaxin-1, also part of the outer mitochondrial membrane beta-barrel protein transport complex.
Changed!
COG Gst 187aa 0.002
in ref transcript
Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones].
MXI1
refseq_MXI1.F1 refseq_MXI1.R1 144 174
NCBIGene 36.3 4601
Single exon skipping, size difference: 30
Exclusion in the protein (no frameshift)
Reference transcript: NM_130439
Changed!
cd HLH 60aa 2e-06
in ref transcript
Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
Changed!
smart HLH 53aa 1e-07
in ref transcript
helix loop helix domain.
Changed!
cd HLH 47aa 0.003
in modified transcript
Changed!
smart HLH 48aa 2e-05
in modified transcript
MXRA7
refseq_MXRA7.F2 refseq_MXRA7.R2 307 388
NCBIGene 36.3 439921
Single exon skipping, size difference: 81
Exclusion of the stop codon
Reference transcript: NM_001008529
MYADM
refseq_MYADM.F1 refseq_MYADM.R1 107 171
NCBIGene 36.3 91663
Alternative 3-prime, size difference: 64
Inclusion in 5'UTR
Reference transcript: NM_138373
pfam MARVEL 108aa 2e-10
in ref transcript
Membrane-associating domain. MARVEL domain-containing proteins are often found in lipid-associating proteins - such as Occludin and MAL family proteins. It may be part of the machinery of membrane apposition events, such as transport vesicle biogenesis.
pfam MARVEL 152aa 2e-07
in ref transcript
MYBPC1
refseq_MYBPC1.F1 refseq_MYBPC1.R1 129 188
NCBIGene 36.3 4604
Single exon skipping, size difference: 59
Exclusion in the protein causing a frameshift
Reference transcript: NM_002465
cd IGcam 89aa 2e-15
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd FN3 84aa 1e-11
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd FN3 81aa 1e-10
in ref transcript
cd FN3 85aa 2e-08
in ref transcript
cd IGcam 80aa 4e-07
in ref transcript
cd IGcam 90aa 1e-06
in ref transcript
cd IGcam 70aa 0.001
in ref transcript
cd IGcam 75aa 0.003
in ref transcript
pfam I-set 90aa 1e-17
in ref transcript
Immunoglobulin I-set domain.
smart FN3 83aa 2e-12
in ref transcript
Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
pfam I-set 78aa 6e-11
in ref transcript
pfam I-set 85aa 4e-10
in ref transcript
pfam I-set 69aa 2e-09
in ref transcript
pfam I-set 84aa 8e-08
in ref transcript
smart FN3 74aa 2e-07
in ref transcript
smart IG_like 97aa 4e-07
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
smart IG_like 82aa 3e-06
in ref transcript
smart FN3 83aa 1e-04
in ref transcript
MYBPC1
refseq_MYBPC1.F4 refseq_MYBPC1.R4 105 180
NCBIGene 36.3 4604
Multiple exon skipping, size difference: 75
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_002465
cd IGcam 89aa 2e-15
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd FN3 84aa 1e-11
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd FN3 81aa 1e-10
in ref transcript
cd FN3 85aa 2e-08
in ref transcript
cd IGcam 80aa 4e-07
in ref transcript
cd IGcam 90aa 1e-06
in ref transcript
cd IGcam 70aa 0.001
in ref transcript
cd IGcam 75aa 0.003
in ref transcript
pfam I-set 90aa 1e-17
in ref transcript
Immunoglobulin I-set domain.
smart FN3 83aa 2e-12
in ref transcript
Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
pfam I-set 78aa 6e-11
in ref transcript
pfam I-set 85aa 4e-10
in ref transcript
pfam I-set 69aa 2e-09
in ref transcript
pfam I-set 84aa 8e-08
in ref transcript
smart FN3 74aa 2e-07
in ref transcript
smart IG_like 97aa 4e-07
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
smart IG_like 82aa 3e-06
in ref transcript
smart FN3 83aa 1e-04
in ref transcript
MYBPC1
refseq_MYBPC1.F5 refseq_MYBPC1.R5 190 244
NCBIGene 36.3 4604
Single exon skipping, size difference: 54
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_002465
cd IGcam 89aa 2e-15
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd FN3 84aa 1e-11
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd FN3 81aa 1e-10
in ref transcript
Changed!
cd FN3 85aa 2e-08
in ref transcript
cd IGcam 80aa 4e-07
in ref transcript
cd IGcam 90aa 1e-06
in ref transcript
cd IGcam 70aa 0.001
in ref transcript
cd IGcam 75aa 0.003
in ref transcript
pfam I-set 90aa 1e-17
in ref transcript
Immunoglobulin I-set domain.
smart FN3 83aa 2e-12
in ref transcript
Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
pfam I-set 78aa 6e-11
in ref transcript
pfam I-set 85aa 4e-10
in ref transcript
pfam I-set 69aa 2e-09
in ref transcript
pfam I-set 84aa 8e-08
in ref transcript
smart FN3 74aa 2e-07
in ref transcript
smart IG_like 97aa 4e-07
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
smart IG_like 82aa 3e-06
in ref transcript
Changed!
smart FN3 83aa 1e-04
in ref transcript
Changed!
cd FN3 103aa 2e-05
in modified transcript
Changed!
smart FN3 101aa 0.001
in modified transcript
MYH11
refseq_MYH11.F1 refseq_MYH11.R1 172 211
NCBIGene 36.3 4629
Single exon skipping, size difference: 39
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001040114
cd MYSc_type_II 709aa 0.0
in ref transcript
Myosin motor domain, type II myosins. Myosin II mediates cortical contraction in cell motility, and is the motor in smooth and skeletal muscle. This catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle.
pfam Myosin_head 692aa 0.0
in ref transcript
Myosin head (motor domain).
Changed!
pfam Myosin_tail_1 820aa 1e-126
in ref transcript
Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
pfam SMC_N 233aa 1e-09
in ref transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
pfam Myosin_N 43aa 1e-09
in ref transcript
Myosin N-terminal SH3-like domain. This domain has an SH3-like fold. It is found at the N-terminus of many but not all myosins. The function of this domain is unknown.
COG COG5022 1265aa 0.0
in ref transcript
Myosin heavy chain [Cytoskeleton].
Changed!
COG Smc 392aa 2e-05
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
COG COG4372 227aa 2e-04
in ref transcript
Uncharacterized protein conserved in bacteria with the myosin-like domain [Function unknown].
COG Smc 356aa 0.001
in ref transcript
Changed!
pfam Myosin_tail_1 819aa 1e-126
in modified transcript
Changed!
COG Smc 386aa 2e-05
in modified transcript
MYH11
refseq_MYH11.F3 refseq_MYH11.R3 104 125
NCBIGene 36.3 4629
Single exon skipping, size difference: 21
Exclusion in the protein (no frameshift)
Reference transcript: NM_001040114
Changed!
cd MYSc_type_II 709aa 0.0
in ref transcript
Myosin motor domain, type II myosins. Myosin II mediates cortical contraction in cell motility, and is the motor in smooth and skeletal muscle. This catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle.
Changed!
pfam Myosin_head 692aa 0.0
in ref transcript
Myosin head (motor domain).
pfam Myosin_tail_1 820aa 1e-126
in ref transcript
Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
pfam SMC_N 233aa 1e-09
in ref transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
pfam Myosin_N 43aa 1e-09
in ref transcript
Myosin N-terminal SH3-like domain. This domain has an SH3-like fold. It is found at the N-terminus of many but not all myosins. The function of this domain is unknown.
Changed!
COG COG5022 1265aa 0.0
in ref transcript
Myosin heavy chain [Cytoskeleton].
COG Smc 392aa 2e-05
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
COG COG4372 227aa 2e-04
in ref transcript
Uncharacterized protein conserved in bacteria with the myosin-like domain [Function unknown].
COG Smc 356aa 0.001
in ref transcript
Changed!
cd MYSc_type_II 702aa 0.0
in modified transcript
Changed!
pfam Myosin_head 685aa 0.0
in modified transcript
Changed!
TIGR SMC_prok_A 236aa 0.001
in modified transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
Changed!
COG COG5022 1258aa 0.0
in modified transcript
MYL6
refseq_MYL6.F2 refseq_MYL6.R2 241 286
NCBIGene 36.3 4637
Single exon skipping, size difference: 45
Exclusion of the stop codon
Reference transcript: NM_021019
Changed!
cd EFh 56aa 4e-05
in ref transcript
EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
smart EFh 29aa 0.003
in ref transcript
EF-hand, calcium binding motif. EF-hands are calcium-binding motifs that occur at least in pairs. Links between disease states and genes encoding EF-hands, particularly the S100 subclass, are emerging. Each motif consists of a 12 residue loop flanked on either side by a 12 residue alpha-helix. EF-hands undergo a conformational change unpon binding calcium ions.
Changed!
PTZ PTZ00184 147aa 4e-30
in ref transcript
calmodulin; Provisional.
Changed!
cd EFh 56aa 1e-05
in modified transcript
Changed!
PTZ PTZ00184 147aa 6e-30
in modified transcript
MYL9
refseq_MYL9.F2 refseq_MYL9.R2 261 423
NCBIGene 36.3 10398
Single exon skipping, size difference: 162
Exclusion in the protein (no frameshift)
Reference transcript: NM_006097
Changed!
cd EFh 62aa 3e-04
in ref transcript
EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
Changed!
cd EFh 58aa 4e-04
in ref transcript
smart EFh 29aa 0.002
in ref transcript
EF-hand, calcium binding motif. EF-hands are calcium-binding motifs that occur at least in pairs. Links between disease states and genes encoding EF-hands, particularly the S100 subclass, are emerging. Each motif consists of a 12 residue loop flanked on either side by a 12 residue alpha-helix. EF-hands undergo a conformational change unpon binding calcium ions.
Changed!
COG FRQ1 150aa 1e-26
in ref transcript
Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].
Changed!
cd EFh 78aa 6e-04
in modified transcript
Changed!
PTZ PTZ00184 85aa 2e-08
in modified transcript
calmodulin; Provisional.
MYLK
refseq_MYLK.F2 refseq_MYLK.R2 155 375
NCBIGene 36.2 4638
Exon skipping and alternative 3-prime or 5-prime, size difference: 220
Exclusion in the protein (no frameshift), Exclusion in the protein causing a frameshift
Reference transcript: NM_053025
Changed!
cd S_TKc 256aa 7e-74
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
cd IGcam 88aa 1e-16
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IGcam 90aa 9e-16
in ref transcript
cd IGcam 90aa 5e-15
in ref transcript
Changed!
cd IGcam 90aa 9e-15
in ref transcript
cd IGcam 90aa 2e-14
in ref transcript
cd IGcam 89aa 3e-13
in ref transcript
cd IGcam 86aa 3e-12
in ref transcript
cd IGcam 88aa 1e-10
in ref transcript
cd FN3 66aa 4e-09
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd IGcam 87aa 8e-05
in ref transcript
Changed!
smart S_TKc 245aa 4e-77
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
Changed!
pfam I-set 91aa 2e-30
in ref transcript
Immunoglobulin I-set domain.
pfam I-set 90aa 1e-23
in ref transcript
pfam I-set 91aa 1e-21
in ref transcript
pfam I-set 90aa 2e-21
in ref transcript
pfam I-set 91aa 5e-19
in ref transcript
pfam I-set 90aa 3e-18
in ref transcript
pfam I-set 87aa 3e-18
in ref transcript
pfam I-set 89aa 7e-15
in ref transcript
pfam I-set 89aa 2e-13
in ref transcript
pfam fn3 59aa 2e-06
in ref transcript
Fibronectin type III domain.
Changed!
COG SPS1 351aa 1e-38
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
MYLK
refseq_MYLK.F3 refseq_MYLK.R3 208 361
NCBIGene 36.3 4638
Single exon skipping, size difference: 153
Exclusion in the protein (no frameshift)
Reference transcript: NM_053025
Changed!
cd S_TKc 256aa 7e-74
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
cd IGcam 88aa 1e-16
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IGcam 90aa 9e-16
in ref transcript
cd IGcam 90aa 5e-15
in ref transcript
cd IGcam 90aa 9e-15
in ref transcript
cd IGcam 90aa 2e-14
in ref transcript
cd IGcam 89aa 3e-13
in ref transcript
cd IGcam 86aa 3e-12
in ref transcript
cd IGcam 88aa 1e-10
in ref transcript
cd FN3 66aa 4e-09
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd IGcam 87aa 8e-05
in ref transcript
Changed!
smart S_TKc 245aa 4e-77
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
pfam I-set 91aa 2e-30
in ref transcript
Immunoglobulin I-set domain.
pfam I-set 90aa 1e-23
in ref transcript
pfam I-set 91aa 1e-21
in ref transcript
pfam I-set 90aa 2e-21
in ref transcript
pfam I-set 91aa 5e-19
in ref transcript
pfam I-set 90aa 3e-18
in ref transcript
pfam I-set 87aa 3e-18
in ref transcript
pfam I-set 89aa 7e-15
in ref transcript
pfam I-set 89aa 2e-13
in ref transcript
pfam fn3 59aa 2e-06
in ref transcript
Fibronectin type III domain.
Changed!
COG SPS1 351aa 1e-38
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
Changed!
cd S_TKc 220aa 3e-56
in modified transcript
Changed!
smart S_TKc 209aa 3e-59
in modified transcript
Changed!
COG SPS1 342aa 1e-29
in modified transcript
MYLK
refseq_MYLK.F4 refseq_MYLK.R4 140 347
NCBIGene 36.3 4638
Single exon skipping, size difference: 207
Exclusion in the protein (no frameshift)
Reference transcript: NM_053025
cd S_TKc 256aa 7e-74
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
cd IGcam 88aa 1e-16
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
Changed!
cd IGcam 90aa 9e-16
in ref transcript
cd IGcam 90aa 5e-15
in ref transcript
cd IGcam 90aa 9e-15
in ref transcript
cd IGcam 90aa 2e-14
in ref transcript
cd IGcam 89aa 3e-13
in ref transcript
cd IGcam 86aa 3e-12
in ref transcript
cd IGcam 88aa 1e-10
in ref transcript
cd FN3 66aa 4e-09
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd IGcam 87aa 8e-05
in ref transcript
smart S_TKc 245aa 4e-77
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
pfam I-set 91aa 2e-30
in ref transcript
Immunoglobulin I-set domain.
pfam I-set 90aa 1e-23
in ref transcript
pfam I-set 91aa 1e-21
in ref transcript
pfam I-set 90aa 2e-21
in ref transcript
Changed!
pfam I-set 91aa 5e-19
in ref transcript
pfam I-set 90aa 3e-18
in ref transcript
pfam I-set 87aa 3e-18
in ref transcript
pfam I-set 89aa 7e-15
in ref transcript
pfam I-set 89aa 2e-13
in ref transcript
pfam fn3 59aa 2e-06
in ref transcript
Fibronectin type III domain.
COG SPS1 351aa 1e-38
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
MYO18A
refseq_MYO18A.F2 refseq_MYO18A.R2 255 300
NCBIGene 36.3 399687
Single exon skipping, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_078471
cd MYSc_type_XVIII 779aa 0.0
in ref transcript
Myosin motor domain, type XVIII myosins. This catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle.
cd PDZ_signaling 80aa 6e-10
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
smart MYSc 786aa 1e-105
in ref transcript
Myosin. Large ATPases. ATPase; molecular motor. Muscle contraction consists of a cyclical interaction between myosin and actin. The core of the myosin structure is similar in fold to that of kinesin.
pfam Myosin_tail_1 527aa 1e-36
in ref transcript
Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
Changed!
TIGR SMC_prok_B 288aa 2e-11
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
smart PDZ 81aa 5e-08
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
COG COG5022 1200aa 1e-103
in ref transcript
Myosin heavy chain [Cytoskeleton].
Changed!
COG Smc 738aa 5e-21
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
Changed!
TIGR SMC_prok_B 331aa 2e-08
in modified transcript
Changed!
COG Smc 691aa 1e-18
in modified transcript
MYOHD1
refseq_MYOHD1.F2 refseq_MYOHD1.R2 115 165
NCBIGene 36.2 80179
Single exon skipping, size difference: 50
Inclusion in the protein causing a new stop codon
Reference transcript: NM_025109
cd MYSc 402aa 1e-116
in ref transcript
Myosin motor domain. This catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle.
Changed!
cd MYSc_type_V 116aa 8e-08
in ref transcript
Myosin motor domain, type V myosins. Myosins V transport a variety of intracellular cargo processively along actin filaments, such as membraneous organelles and mRNA. This catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle.
smart MYSc 404aa 1e-124
in ref transcript
Myosin. Large ATPases. ATPase; molecular motor. Muscle contraction consists of a cyclical interaction between myosin and actin. The core of the myosin structure is similar in fold to that of kinesin.
Changed!
smart MYSc 116aa 4e-07
in ref transcript
COG COG5022 438aa 1e-107
in ref transcript
Myosin heavy chain [Cytoskeleton].
Changed!
COG COG5022 185aa 2e-11
in ref transcript
Changed!
cd MYSc_type_V 34aa 1e-07
in modified transcript
Changed!
smart MYSc 34aa 9e-07
in modified transcript
Changed!
COG COG5022 34aa 5e-07
in modified transcript
MZF1
refseq_MZF1.F1 refseq_MZF1.R1 100 418
NCBIGene 36.3 7593
Alternative 5-prime, size difference: 318
Exclusion in 5'UTR
Reference transcript: NM_198055
smart SCAN 113aa 2e-43
in ref transcript
leucine rich region.
COG COG5048 165aa 6e-08
in ref transcript
FOG: Zn-finger [General function prediction only].
NLRP1
refseq_NALP1.F2 refseq_NALP1.R2 184 316
NCBIGene 36.3 22861
Single exon skipping, size difference: 132
Exclusion in the protein (no frameshift)
Reference transcript: NM_033004
cd LRR_RI 179aa 8e-30
in ref transcript
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
cd AAA 95aa 0.009
in ref transcript
The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
pfam NACHT 170aa 5e-45
in ref transcript
NACHT domain. This NTPase domain is found in apoptosis proteins as well as those involved in MHC transcription activation. This family is closely related to pfam00931.
pfam CARD 82aa 2e-16
in ref transcript
Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold.
pfam PAAD_DAPIN 58aa 2e-08
in ref transcript
PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain.
smart LRR_RI 28aa 0.002
in ref transcript
Leucine rich repeat, ribonuclease inhibitor type.
COG RNA1 142aa 0.004
in ref transcript
Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Signal transduction mechanisms / RNA processing and modification].
NLRP1
refseq_NALP1.F3 refseq_NALP1.R3 178 268
NCBIGene 36.3 22861
Single exon skipping, size difference: 90
Exclusion in the protein (no frameshift)
Reference transcript: NM_033004
Changed!
cd LRR_RI 179aa 8e-30
in ref transcript
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
cd AAA 95aa 0.009
in ref transcript
The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
pfam NACHT 170aa 5e-45
in ref transcript
NACHT domain. This NTPase domain is found in apoptosis proteins as well as those involved in MHC transcription activation. This family is closely related to pfam00931.
pfam CARD 82aa 2e-16
in ref transcript
Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold.
pfam PAAD_DAPIN 58aa 2e-08
in ref transcript
PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain.
smart LRR_RI 28aa 0.002
in ref transcript
Leucine rich repeat, ribonuclease inhibitor type.
COG RNA1 142aa 0.004
in ref transcript
Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Signal transduction mechanisms / RNA processing and modification].
Changed!
cd LRR_RI 161aa 4e-27
in modified transcript
NLRP12
refseq_NALP12.F1 refseq_NALP12.R1 206 377
NCBIGene 36.3 91662
Single exon skipping, size difference: 171
Exclusion in the protein (no frameshift)
Reference transcript: NM_144687
Changed!
cd LRR_RI 319aa 6e-34
in ref transcript
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
Changed!
cd LRR_RI 306aa 2e-23
in ref transcript
pfam NACHT 170aa 1e-43
in ref transcript
NACHT domain. This NTPase domain is found in apoptosis proteins as well as those involved in MHC transcription activation. This family is closely related to pfam00931.
pfam PAAD_DAPIN 81aa 5e-24
in ref transcript
PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain.
Changed!
cd LRR_RI 286aa 3e-31
in modified transcript
NLRP12
refseq_NALP12.F4 refseq_NALP12.R4 168 358
NCBIGene 36.3 91662
Single exon skipping, size difference: 190
Inclusion in the protein causing a new stop codon
Reference transcript: NM_144687
Changed!
cd LRR_RI 319aa 6e-34
in ref transcript
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
Changed!
cd LRR_RI 306aa 2e-23
in ref transcript
pfam NACHT 170aa 1e-43
in ref transcript
NACHT domain. This NTPase domain is found in apoptosis proteins as well as those involved in MHC transcription activation. This family is closely related to pfam00931.
pfam PAAD_DAPIN 81aa 5e-24
in ref transcript
PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain.
Changed!
cd LRR_RI 188aa 4e-05
in modified transcript
NLRP7
refseq_NALP7.F2 refseq_NALP7.R2 200 284
NCBIGene 36.3 199713
Alternative 3-prime, size difference: 84
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_139176
cd LRR_RI 266aa 7e-27
in ref transcript
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
pfam NACHT 168aa 1e-35
in ref transcript
NACHT domain. This NTPase domain is found in apoptosis proteins as well as those involved in MHC transcription activation. This family is closely related to pfam00931.
pfam PAAD_DAPIN 67aa 3e-11
in ref transcript
PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain.
COG RNA1 165aa 5e-04
in ref transcript
Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Signal transduction mechanisms / RNA processing and modification].
NLRP7
refseq_NALP7.F3 refseq_NALP7.R3 203 374
NCBIGene 36.3 199713
Single exon skipping, size difference: 171
Exclusion in the protein (no frameshift)
Reference transcript: NM_139176
Changed!
cd LRR_RI 266aa 7e-27
in ref transcript
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
pfam NACHT 168aa 1e-35
in ref transcript
NACHT domain. This NTPase domain is found in apoptosis proteins as well as those involved in MHC transcription activation. This family is closely related to pfam00931.
pfam PAAD_DAPIN 67aa 3e-11
in ref transcript
PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain.
Changed!
COG RNA1 165aa 5e-04
in ref transcript
Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Signal transduction mechanisms / RNA processing and modification].
Changed!
cd LRR_RI 246aa 1e-23
in modified transcript
NASP
refseq_NASP.F1 refseq_NASP.R1 140 213
NCBIGene 36.3 4678
Single exon skipping, size difference: 73
Exclusion in the protein causing a frameshift
Reference transcript: NM_172164
Changed!
cd TPR 97aa 5e-04
in ref transcript
Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
Changed!
pfam SHNi-TPR 38aa 2e-07
in ref transcript
SHNi-TPR. SHNi-TPR family members contain a reiterated sequence motif that is an interrupted form of TPR repeat.
Changed!
TIGR 2A1904 189aa 0.001
in ref transcript
Changed!
COG MDN1 198aa 0.004
in ref transcript
AAA ATPase containing von Willebrand factor type A (vWA) domain [General function prediction only].
NAT5
refseq_NAT5.F1 refseq_NAT5.R1 178 324
NCBIGene 36.3 51126
Single exon skipping, size difference: 146
Exclusion in the protein causing a frameshift
Reference transcript: NM_016100
Changed!
cd GNAT 86aa 1e-07
in ref transcript
GCN5-related N-acetyltransferases (GNAT) represent a large superfamily of functionally diverse enzymes that catalyze the transfer of an acetyl group from acetyl-Coenzyme A to the primary amine of a wide range of acceptor substrates. Members of this superfamily include aminoglycoside N-acetyltransferases, serotonin N-acetyltransferase, glucosamine-6-phosphate N-acetyltransferase, the histone acetyltransferases, mycothiol synthase, and the Fem family of amino acyl transferases.
Changed!
pfam Acetyltransf_1 79aa 2e-12
in ref transcript
Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions such as Elp3-related proteins.
Changed!
COG RimI 156aa 1e-19
in ref transcript
Acetyltransferases [General function prediction only].
Changed!
pfam Acetyltransf_1 44aa 3e-04
in modified transcript
Changed!
COG RimI 93aa 1e-09
in modified transcript
NAV2
refseq_NAV2.F2 refseq_NAV2.R2 165 264
NCBIGene 36.3 89797
Single exon skipping, size difference: 99
Exclusion in the protein (no frameshift)
Reference transcript: NM_182964
cd CH 102aa 4e-12
in ref transcript
Calponin homology domain; actin-binding domain which may be present as a single copy or in tandem repeats (which increases binding affinity). The CH domain is found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).
pfam CH 102aa 5e-14
in ref transcript
Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is not involved in its actin binding activity. Most proteins have two copies of the CH domain, however some proteins such as calponin have only a single copy.
smart AAA 109aa 0.001
in ref transcript
ATPases associated with a variety of cellular activities. AAA - ATPases associated with a variety of cellular activities. This profile/alignment only detects a fraction of this vast family. The poorly conserved N-terminal helix is missing from the alignment.
COG SAC6 98aa 3e-08
in ref transcript
Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton].
PRK PRK06835 104aa 0.006
in ref transcript
DNA replication protein DnaC; Validated.
NAV2
refseq_NAV2.F4 refseq_NAV2.R4 341 410
NCBIGene 36.3 89797
Single exon skipping, size difference: 69
Exclusion in the protein (no frameshift)
Reference transcript: NM_182964
cd CH 102aa 4e-12
in ref transcript
Calponin homology domain; actin-binding domain which may be present as a single copy or in tandem repeats (which increases binding affinity). The CH domain is found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).
pfam CH 102aa 5e-14
in ref transcript
Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is not involved in its actin binding activity. Most proteins have two copies of the CH domain, however some proteins such as calponin have only a single copy.
smart AAA 109aa 0.001
in ref transcript
ATPases associated with a variety of cellular activities. AAA - ATPases associated with a variety of cellular activities. This profile/alignment only detects a fraction of this vast family. The poorly conserved N-terminal helix is missing from the alignment.
COG SAC6 98aa 3e-08
in ref transcript
Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton].
PRK PRK06835 104aa 0.006
in ref transcript
DNA replication protein DnaC; Validated.
NBR1
refseq_NBR1.F1 refseq_NBR1.R1 103 435
NCBIGene 36.3 4077
Alternative 5-prime, size difference: 332
Exclusion in 5'UTR
Reference transcript: NM_031858
cd PB1_NBR1 81aa 1e-35
in ref transcript
The PB1 domain is an essential part of NBR1 protein, next to BRCA1, a scaffold protein mediating specific protein-protein interaction with both titin protein kinase and with another scaffold protein p62. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domain, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. The NBR1 protein contains a type I PB1 domain.
cd ZZ_NBR1_like 45aa 6e-12
in ref transcript
Zinc finger, ZZ type. Zinc finger present in Drosophila ref(2)P, NBR1, Human sequestosome 1 and related proteins. The ZZ motif coordinates two zinc ions and most likely participates in ligand binding or molecular scaffolding. Drosophila ref(2)P appears to control the multiplication of sigma rhabdovirus. NBR1 (Next to BRCA1 gene 1 protein) interacts with fasciculation and elongation protein zeta-1 (FEZ1) and calcium and integrin binding protein (CIB), and may function in cell signalling pathways. Sequestosome 1 is a phosphotyrosine independent ligand for the Lck SH2 domain and binds noncovalently to ubiquitin via its UBA domain.
smart PB1 80aa 1e-12
in ref transcript
PB1 domain. Phox and Bem1p domain, present in many eukaryotic cytoplasmic signalling proteins. The domain adopts a beta-grasp fold, similar to that found in ubiquitin and Ras-binding domains. A motif, variously termed OPR, PC and AID, represents the most conserved region of the majority of PB1 domains, and is necessary for PB1 domain function. This function is the formation of PB1 domain heterodimers, although not all PB1 domain pairs associate.
smart ZnF_ZZ 45aa 9e-11
in ref transcript
Zinc-binding domain, present in Dystrophin, CREB-binding protein. Putative zinc-binding domain present in dystrophin-like proteins, and CREB-binding protein/p300 homologues. The ZZ in dystrophin appears to bind calmodulin. A missense mutation of one of the conserved cysteines in dystrophin results in a patient with Duchenne muscular dystrophy [3].
NCAM1
refseq_NCAM1.F1 refseq_NCAM1.R1 154 184
NCBIGene 36.3 4684
Single exon skipping, size difference: 30
Exclusion in the protein (no frameshift)
Reference transcript: NM_181351
cd IGcam 91aa 5e-12
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IGcam 87aa 8e-10
in ref transcript
Changed!
cd IGcam 104aa 1e-09
in ref transcript
cd IGcam 73aa 2e-09
in ref transcript
cd FN3 98aa 5e-07
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd FN3 88aa 2e-04
in ref transcript
pfam I-set 83aa 1e-15
in ref transcript
Immunoglobulin I-set domain.
Changed!
smart IG_like 91aa 2e-11
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
smart IGc2 69aa 8e-10
in ref transcript
Immunoglobulin C-2 Type.
smart IG_like 85aa 2e-08
in ref transcript
pfam fn3 89aa 7e-08
in ref transcript
Fibronectin type III domain.
pfam fn3 83aa 1e-06
in ref transcript
Changed!
cd IGcam 94aa 9e-11
in modified transcript
Changed!
smart IG_like 81aa 2e-12
in modified transcript
NCKAP1
refseq_NCKAP1.F1 refseq_NCKAP1.R1 102 120
NCBIGene 36.3 10787
Single exon skipping, size difference: 18
Exclusion in the protein (no frameshift)
Reference transcript: NM_205842
Changed!
pfam Nckap1 1123aa 0.0
in ref transcript
Membrane-associated apoptosis protein. Expression of this protein was found to be markedly reduced in patients with Alzheimer's disease. It is involved in the regulation of actin polymerisation in the brain as part of a WAVE2 signalling complex.
Changed!
pfam Nckap1 1117aa 0.0
in modified transcript
NCKIPSD
refseq_NCKIPSD.F1 refseq_NCKIPSD.R1 107 128
NCBIGene 36.3 51517
Alternative 3-prime, size difference: 21
Exclusion in the protein (no frameshift)
Reference transcript: NM_016453
cd SH3 55aa 4e-09
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
pfam DUF2013 117aa 2e-33
in ref transcript
Protein of unknown function (DUF2013). This region is found at the C terminal of a group of cytoskeletal proteins.
smart SH3 56aa 7e-10
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
NCOA1
refseq_NCOA1.F1 refseq_NCOA1.R1 114 174
NCBIGene 36.3 8648
Exon skipping and alternative 3-prime or 5-prime, size difference: 60
Inclusion in the protein causing a new stop codon, Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_147233
cd PAS 94aa 2e-08
in ref transcript
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.
cd HLH 58aa 4e-05
in ref transcript
Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
pfam Nuc_rec_co-act 47aa 8e-13
in ref transcript
Nuclear receptor coactivator. This region is found on eukaryotic nuclear receptor coactivators and forms an alpha helical structure.
pfam SRC-1 79aa 7e-10
in ref transcript
Steroid receptor coactivator. This domain is found in steroid/nuclear receptor coactivators and contains two LXXLL motifs that are involved in receptor binding. The family includes SRC-1/NcoA-1, NcoA-2/TIF2, pCIP/ACTR/GRIP-1/AIB1.
smart PAS 58aa 8e-09
in ref transcript
PAS domain. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels ([1]; Ponting & Aravind, in press).
smart HLH 55aa 3e-06
in ref transcript
helix loop helix domain.
pfam DUF1518 38aa 0.001
in ref transcript
Domain of unknown function (DUF1518). This domain, which is usually found tandemly repeated, is found various receptor co-activating proteins.
NDEL1
refseq_NDEL1.F2 refseq_NDEL1.R2 130 165
NCBIGene 36.3 81565
Single exon skipping, size difference: 35
Inclusion in the protein causing a frameshift
Reference transcript: NM_030808
pfam NUDE_C 181aa 5e-32
in ref transcript
NUDE protein, C-terminal conserved region. This family represents the C-terminal conserved region of the NUDE proteins. NUDE proteins are involved in nuclear migration.
NDRG2
refseq_NDRG2.F1 refseq_NDRG2.R1 140 182
NCBIGene 36.3 57447
Single exon skipping, size difference: 42
Exclusion in the protein (no frameshift)
Reference transcript: NM_201537
pfam Ndr 279aa 1e-124
in ref transcript
Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known to be involved in cellular differentiation events. The Ndr1 group was the first to be discovered. Their expression is repressed by the proto-oncogenes N-myc and c-myc, and in line with this observation, Ndr1 protein expression is down-regulated in neoplastic cells, and is reactivated when differentiation is induced by chemicals such as retinoic acid. Ndr2 and Ndr3 expression is not under the control of N-myc or c-myc. Ndr1 expression is also activated by several chemicals: tunicamycin and homocysteine induce Ndr1 in human umbilical endothelial cells; nickel induces Ndr1 in several cell types. Members of this family are found in wide variety of multicellular eukaryotes, including an Ndr1 type protein in Helianthus annuus (sunflower), known as Sf21. Interestingly, the highest scoring matches in the noise are all alpha/beta hydrolases pfam00561, suggesting that this family may have an enzymatic function (Bateman A pers. obs.).
COG MhpC 259aa 3e-05
in ref transcript
Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only].
NDRG2
refseq_NDRG2.F3 refseq_NDRG2.R3 106 173
NCBIGene 36.3 57447
Single exon skipping, size difference: 67
Exclusion in 5'UTR
Reference transcript: NM_201540
pfam Ndr 279aa 1e-124
in ref transcript
Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known to be involved in cellular differentiation events. The Ndr1 group was the first to be discovered. Their expression is repressed by the proto-oncogenes N-myc and c-myc, and in line with this observation, Ndr1 protein expression is down-regulated in neoplastic cells, and is reactivated when differentiation is induced by chemicals such as retinoic acid. Ndr2 and Ndr3 expression is not under the control of N-myc or c-myc. Ndr1 expression is also activated by several chemicals: tunicamycin and homocysteine induce Ndr1 in human umbilical endothelial cells; nickel induces Ndr1 in several cell types. Members of this family are found in wide variety of multicellular eukaryotes, including an Ndr1 type protein in Helianthus annuus (sunflower), known as Sf21. Interestingly, the highest scoring matches in the noise are all alpha/beta hydrolases pfam00561, suggesting that this family may have an enzymatic function (Bateman A pers. obs.).
COG MhpC 259aa 3e-05
in ref transcript
Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only].
NDRG2
refseq_NDRG2.F5 refseq_NDRG2.R3 130 197
NCBIGene 36.3 57447
Single exon skipping, size difference: 67
Exclusion in 5'UTR
Reference transcript: NM_201535
pfam Ndr 279aa 1e-124
in ref transcript
Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known to be involved in cellular differentiation events. The Ndr1 group was the first to be discovered. Their expression is repressed by the proto-oncogenes N-myc and c-myc, and in line with this observation, Ndr1 protein expression is down-regulated in neoplastic cells, and is reactivated when differentiation is induced by chemicals such as retinoic acid. Ndr2 and Ndr3 expression is not under the control of N-myc or c-myc. Ndr1 expression is also activated by several chemicals: tunicamycin and homocysteine induce Ndr1 in human umbilical endothelial cells; nickel induces Ndr1 in several cell types. Members of this family are found in wide variety of multicellular eukaryotes, including an Ndr1 type protein in Helianthus annuus (sunflower), known as Sf21. Interestingly, the highest scoring matches in the noise are all alpha/beta hydrolases pfam00561, suggesting that this family may have an enzymatic function (Bateman A pers. obs.).
COG MhpC 259aa 3e-05
in ref transcript
Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only].
NDRG3
refseq_NDRG3.F1 refseq_NDRG3.R1 119 155
NCBIGene 36.3 57446
Single exon skipping, size difference: 36
Exclusion in the protein (no frameshift)
Reference transcript: NM_032013
pfam Ndr 286aa 1e-123
in ref transcript
Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known to be involved in cellular differentiation events. The Ndr1 group was the first to be discovered. Their expression is repressed by the proto-oncogenes N-myc and c-myc, and in line with this observation, Ndr1 protein expression is down-regulated in neoplastic cells, and is reactivated when differentiation is induced by chemicals such as retinoic acid. Ndr2 and Ndr3 expression is not under the control of N-myc or c-myc. Ndr1 expression is also activated by several chemicals: tunicamycin and homocysteine induce Ndr1 in human umbilical endothelial cells; nickel induces Ndr1 in several cell types. Members of this family are found in wide variety of multicellular eukaryotes, including an Ndr1 type protein in Helianthus annuus (sunflower), known as Sf21. Interestingly, the highest scoring matches in the noise are all alpha/beta hydrolases pfam00561, suggesting that this family may have an enzymatic function (Bateman A pers. obs.).
COG MhpC 275aa 4e-08
in ref transcript
Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only].
NDUFB6
refseq_NDUFB6.F1 refseq_NDUFB6.R1 225 270
NCBIGene 36.3 4712
Single exon skipping, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_002493
Changed!
pfam NDUF_B6 128aa 2e-45
in ref transcript
NADH:ubiquinone oxidoreductase, NDUFB6/B17 subunit. Members of this family mediate the transfer of electrons from NADH to the respiratory chain. The immediate electron acceptor for the enzyme is believed to be ubiquinone, the reaction that occurs being: NADH + ubiquinone = NAD(+) + ubiquinol.
Changed!
pfam NDUF_B6 113aa 2e-36
in modified transcript
NEBL
refseq_NEBL.F1 refseq_NEBL.R1 131 374
NCBIGene 36.3 10529
Multiple exon skipping, size difference: 243
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_006393
cd SH3 54aa 1e-11
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
smart SH3 57aa 2e-13
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
pfam Nebulin 29aa 4e-04
in ref transcript
Nebulin repeat.
smart NEBU 30aa 7e-04
in ref transcript
The Nebulin repeat is present also in Las1. Tandem arrays of these repeats are known to bind actin.
pfam Nebulin 28aa 0.002
in ref transcript
smart NEBU 26aa 0.003
in ref transcript
pfam Nebulin 29aa 0.005
in ref transcript
NF2
refseq_NF2.F1 refseq_NF2.R1 217 343
NCBIGene 36.3 4771
Single exon skipping, size difference: 126
Exclusion in the protein (no frameshift)
Reference transcript: NM_000268
cd FERM_C 93aa 2e-22
in ref transcript
The FERM_C domain is the third structural domain within the FERM domain. The FERM domain is found in the cytoskeletal-associated proteins such as ezrin, moesin, radixin, 4.1R, and merlin. These proteins provide a link between the membrane and cytoskeleton and are involved in signal transduction pathways. The FERM_C domain is also found in protein tyrosine phosphatases (PTPs) , the tryosine kinases FAKand JAK, in addition to other proteins involved in signaling. This domain is structuraly similar to the PH and PTB domains and consequently is capable of binding to both peptides and phospholipids at different sites.
Changed!
smart B41 204aa 6e-53
in ref transcript
Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
pfam ERM 234aa 1e-35
in ref transcript
Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins.
pfam FERM_C 87aa 1e-32
in ref transcript
FERM C-terminal PH-like domain.
Changed!
smart B41 143aa 9e-36
in modified transcript
NF2
refseq_NF2.F2 refseq_NF2.R2 100 223
NCBIGene 36.3 4771
Single exon skipping, size difference: 123
Exclusion in the protein (no frameshift)
Reference transcript: NM_000268
cd FERM_C 93aa 2e-22
in ref transcript
The FERM_C domain is the third structural domain within the FERM domain. The FERM domain is found in the cytoskeletal-associated proteins such as ezrin, moesin, radixin, 4.1R, and merlin. These proteins provide a link between the membrane and cytoskeleton and are involved in signal transduction pathways. The FERM_C domain is also found in protein tyrosine phosphatases (PTPs) , the tryosine kinases FAKand JAK, in addition to other proteins involved in signaling. This domain is structuraly similar to the PH and PTB domains and consequently is capable of binding to both peptides and phospholipids at different sites.
Changed!
smart B41 204aa 6e-53
in ref transcript
Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
pfam ERM 234aa 1e-35
in ref transcript
Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins.
pfam FERM_C 87aa 1e-32
in ref transcript
FERM C-terminal PH-like domain.
Changed!
smart B41 163aa 2e-38
in modified transcript
NFAT5
refseq_NFAT5.F1 refseq_NFAT5.R1 262 324
NCBIGene 36.3 10725
Single exon skipping, size difference: 62
Inclusion in the protein causing a new stop codon
Reference transcript: NM_138713
Changed!
cd IPT_NFAT 100aa 5e-36
in ref transcript
IPT domain of the NFAT family of transcription factors. NFAT transcription complexes are a target of calcineurin, a calcium dependent phosphatase, and activate genes mainly involved in cell-cell-interaction.
Changed!
pfam RHD 158aa 9e-20
in ref transcript
Rel homology domain (RHD). Proteins containing the Rel homology domain (RHD) are eukaryotic transcription factors. The RHD is composed of two structural domains. This is the N-terminal domain that is similar to that found in P53. The C-terminal domain has an immunoglobulin-like fold (See pfam01833) that binds to DNA.
Changed!
smart IPT 98aa 4e-08
in ref transcript
ig-like, plexins, transcription factors.
Changed!
pfam PAT1 225aa 2e-04
in ref transcript
Topoisomerase II-associated protein PAT1. Members of this family are necessary for accurate chromosome transmission during cell division.
NFAT5
refseq_NFAT5.F3 refseq_NFAT5.R3 112 166
NCBIGene 36.3 10725
Single exon skipping, size difference: 54
Exclusion in the protein (no frameshift)
Reference transcript: NM_138713
cd IPT_NFAT 100aa 5e-36
in ref transcript
IPT domain of the NFAT family of transcription factors. NFAT transcription complexes are a target of calcineurin, a calcium dependent phosphatase, and activate genes mainly involved in cell-cell-interaction.
pfam RHD 158aa 9e-20
in ref transcript
Rel homology domain (RHD). Proteins containing the Rel homology domain (RHD) are eukaryotic transcription factors. The RHD is composed of two structural domains. This is the N-terminal domain that is similar to that found in P53. The C-terminal domain has an immunoglobulin-like fold (See pfam01833) that binds to DNA.
smart IPT 98aa 4e-08
in ref transcript
ig-like, plexins, transcription factors.
pfam PAT1 225aa 2e-04
in ref transcript
Topoisomerase II-associated protein PAT1. Members of this family are necessary for accurate chromosome transmission during cell division.
NFATC1
refseq_NFATC1.F1 refseq_NFATC1.R1 181 488
NCBIGene 36.3 4772
Alternative 5-prime, size difference: 307
Exclusion in the protein causing a frameshift
Reference transcript: NM_172387
cd IPT_NFAT 101aa 5e-33
in ref transcript
IPT domain of the NFAT family of transcription factors. NFAT transcription complexes are a target of calcineurin, a calcium dependent phosphatase, and activate genes mainly involved in cell-cell-interaction.
pfam RHD 161aa 6e-30
in ref transcript
Rel homology domain (RHD). Proteins containing the Rel homology domain (RHD) are eukaryotic transcription factors. The RHD is composed of two structural domains. This is the N-terminal domain that is similar to that found in P53. The C-terminal domain has an immunoglobulin-like fold (See pfam01833) that binds to DNA.
pfam TIG 98aa 8e-09
in ref transcript
IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: This family does not currently recognise a significant number of members.
NFATC2
refseq_NFATC2.F2 refseq_NFATC2.R2 230 318
NCBIGene 36.3 4773
Single exon skipping, size difference: 88
Inclusion in the protein causing a new stop codon
Reference transcript: NM_173091
cd IPT_NFAT 101aa 1e-40
in ref transcript
IPT domain of the NFAT family of transcription factors. NFAT transcription complexes are a target of calcineurin, a calcium dependent phosphatase, and activate genes mainly involved in cell-cell-interaction.
pfam RHD 161aa 9e-26
in ref transcript
Rel homology domain (RHD). Proteins containing the Rel homology domain (RHD) are eukaryotic transcription factors. The RHD is composed of two structural domains. This is the N-terminal domain that is similar to that found in P53. The C-terminal domain has an immunoglobulin-like fold (See pfam01833) that binds to DNA.
smart IPT 100aa 3e-10
in ref transcript
ig-like, plexins, transcription factors.
NFIC
refseq_NFIC.F1 refseq_NFIC.R1 107 347
NCBIGene 36.3 4782
Multiple exon skipping, size difference: 240
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_205843
cd MH1 114aa 5e-17
in ref transcript
MH1 is a small DNA binding domain, binding in an unusal way involving a beta hairpin structure binding to the major groove. MH1 is present in Smad proteins, an important family of proteins involved in TGF-beta signalling and frequent targets of tumorigenic mutations. Also known as Domain A in dwarfin family proteins.
Changed!
pfam CTF_NFI 292aa 1e-107
in ref transcript
CTF/NF-I family transcription modulation region.
pfam NfI_DNAbd_pre-N 38aa 5e-19
in ref transcript
Nuclear factor I protein pre-N-terminus. The Nuclear factor I (NFI) family of site-specific DNA-binding proteins (also known as CTF or CAAT box transcription factor) functions both in viral DNA replication and in the regulation of gene expression in higher organisms. The N-terminal 200 residues contains the DNA-binding and dimerisation domain, but also has an 8-47 residue highly conserved region 5' of this, whose function is not known. Deletion of the N-terminal 200 amino acids removes the DNA-binding activity, dimerisation-ability and the stimulation of adenovirus DNA replication.
smart DWA 102aa 9e-15
in ref transcript
Domain A in dwarfin family proteins.
Changed!
pfam CTF_NFI 207aa 2e-85
in modified transcript
NFKBIB
refseq_NFKBIB.F2 refseq_NFKBIB.R2 213 269
NCBIGene 36.3 4793
Alternative 3-prime, size difference: 56
Exclusion in the protein causing a frameshift
Reference transcript: NM_002503
Changed!
cd ANK 97aa 3e-15
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
Changed!
cd ANK 86aa 7e-12
in ref transcript
Changed!
cd ANK 139aa 2e-07
in ref transcript
Changed!
TIGR trp 68aa 2e-07
in ref transcript
after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
Changed!
COG Arp 85aa 2e-09
in ref transcript
FOG: Ankyrin repeat [General function prediction only].
Changed!
COG Arp 94aa 7e-09
in ref transcript
Changed!
COG Arp 210aa 6e-06
in ref transcript
NFYA
refseq_NFYA.F1 refseq_NFYA.R1 212 299
NCBIGene 36.3 4800
Single exon skipping, size difference: 87
Exclusion in the protein (no frameshift)
Reference transcript: NM_002505
smart CBF 60aa 9e-27
in ref transcript
CCAAT-Binding transcription Factor.
COG HAP2 64aa 1e-10
in ref transcript
CCAAT-binding factor, subunit B [Transcription].
NGFRAP1
refseq_NGFRAP1.F2 refseq_NGFRAP1.R2 241 288
NCBIGene 36.3 27018
Alternative 3-prime, size difference: 47
Exclusion of the protein initiation site
Reference transcript: NM_206915
Changed!
pfam BEX 91aa 2e-20
in ref transcript
Brain expressed X-linked like family.
Changed!
pfam BEX 100aa 3e-22
in modified transcript
ISCU
refseq_NIFUN.F1 refseq_NIFUN.R1 125 221
NCBIGene 36.3 23479
Single exon skipping, size difference: 96
Exclusion of the protein initiation site
Reference transcript: NM_014301
Changed!
TIGR iscU 123aa 1e-60
in ref transcript
This model represents IscU, a homolog of the N-terminal region of NifU, an Fe-S cluster assembly protein found mostly in nitrogen-fixing bacteria. IscU is considered part of the IscSUA-hscAB-fdx system of Fe-S assembly, whereas NifU is found in nitrogenase-containing (nitrogen-fixing) species. A NifU-type protein is also found in Helicobacter and Campylobacter. IscU and NifU are considered scaffold proteins on which Fe-S clusters are assembled before transfer to apoproteins. This model excludes true NifU proteins as in Klebsiella pneumoniae and Anabaena sp. as well as archaeal homologs. It includes largely proteobacterial and eukaryotic forms.
Changed!
PRK PRK11325 123aa 1e-66
in ref transcript
scaffold protein; Provisional.
Changed!
TIGR iscU 86aa 9e-40
in modified transcript
Changed!
PRK PRK11325 86aa 2e-43
in modified transcript
NIPA2
refseq_NIPA2.F1 refseq_NIPA2.R1 125 247
NCBIGene 36.3 81614
Single exon skipping, size difference: 122
Exclusion in 5'UTR
Reference transcript: NM_030922
pfam DUF803 301aa 7e-96
in ref transcript
Protein of unknown function (DUF803). This family consists of several eukaryotic proteins of unknown function.
NIPA2
refseq_NIPA2.F4 refseq_NIPA2.R4 189 246
NCBIGene 36.3 81614
Single exon skipping, size difference: 57
Exclusion in the protein (no frameshift)
Reference transcript: NM_001008860
Changed!
pfam DUF803 301aa 7e-96
in ref transcript
Protein of unknown function (DUF803). This family consists of several eukaryotic proteins of unknown function.
Changed!
pfam DUF803 224aa 2e-81
in modified transcript
NKTR
refseq_NKTR.F2 refseq_NKTR.R2 139 167
NCBIGene 36.2 4820
Single exon skipping, size difference: 28
Inclusion in the protein causing a frameshift
Reference transcript: NM_005385
Changed!
cd cyclophilin_ABH_like 168aa 8e-58
in ref transcript
cyclophilin_ABH_like: Cyclophilin A, B and H-like cyclophilin-type peptidylprolyl cis- trans isomerase (PPIase) domain. This family represents the archetypal cystolic cyclophilin similar to human cyclophilins A, B and H. PPIase is an enzyme which accelerates protein folding by catalyzing the cis-trans isomerization of the peptide bonds preceding proline residues. These enzymes have been implicated in protein folding processes which depend on catalytic /chaperone-like activities. As cyclophilins, Human hCyP-A, human cyclophilin-B (hCyP-19), S. cerevisiae Cpr1 and C. elegans Cyp-3, are inhibited by the immunosuppressive drug cyclopsporin A (CsA). CsA binds to the PPIase active site. Cyp-3. S. cerevisiae Cpr1 interacts with the Rpd3 - Sin3 complex and in addition is a component of the Set3 complex. S. cerevisiae Cpr1 has also been shown to have a role in Zpr1p nuclear transport. Human cyclophilin H associates with the [U4/U6.U5] tri-snRNP particles of the splicesome.
Changed!
pfam Pro_isomerase 169aa 8e-72
in ref transcript
Cyclophilin type peptidyl-prolyl cis-trans isomerase/CLD. The peptidyl-prolyl cis-trans isomerases, also known as cyclophilins, share this domain of about 109 amino acids. Cyclophilins have been found in all organisms studied so far and catalyse peptidyl-prolyl isomerisation during which the peptide bond preceding proline (the peptidyl-prolyl bond) is stabilised in the cis conformation. Mammalian cyclophilin A (CypA) is a major cellular target for the immunosuppressive drug cyclosporin A (CsA). Other roles for cyclophilins may include chaperone and cell signalling function.
Changed!
PTZ PTZ00060 171aa 1e-51
in ref transcript
cyclophilin; Provisional.
Changed!
cd cyclophilin_ABH_like 88aa 1e-22
in modified transcript
Changed!
pfam Pro_isomerase 89aa 4e-27
in modified transcript
Changed!
PTZ PTZ00060 91aa 7e-22
in modified transcript
NKTR
refseq_NKTR.F3 refseq_NKTR.R3 148 176
NCBIGene 36.2 4820
Single exon skipping, size difference: 28
Inclusion in the protein causing a frameshift
Reference transcript: NM_005385
Changed!
cd cyclophilin_ABH_like 168aa 8e-58
in ref transcript
cyclophilin_ABH_like: Cyclophilin A, B and H-like cyclophilin-type peptidylprolyl cis- trans isomerase (PPIase) domain. This family represents the archetypal cystolic cyclophilin similar to human cyclophilins A, B and H. PPIase is an enzyme which accelerates protein folding by catalyzing the cis-trans isomerization of the peptide bonds preceding proline residues. These enzymes have been implicated in protein folding processes which depend on catalytic /chaperone-like activities. As cyclophilins, Human hCyP-A, human cyclophilin-B (hCyP-19), S. cerevisiae Cpr1 and C. elegans Cyp-3, are inhibited by the immunosuppressive drug cyclopsporin A (CsA). CsA binds to the PPIase active site. Cyp-3. S. cerevisiae Cpr1 interacts with the Rpd3 - Sin3 complex and in addition is a component of the Set3 complex. S. cerevisiae Cpr1 has also been shown to have a role in Zpr1p nuclear transport. Human cyclophilin H associates with the [U4/U6.U5] tri-snRNP particles of the splicesome.
Changed!
pfam Pro_isomerase 169aa 8e-72
in ref transcript
Cyclophilin type peptidyl-prolyl cis-trans isomerase/CLD. The peptidyl-prolyl cis-trans isomerases, also known as cyclophilins, share this domain of about 109 amino acids. Cyclophilins have been found in all organisms studied so far and catalyse peptidyl-prolyl isomerisation during which the peptide bond preceding proline (the peptidyl-prolyl bond) is stabilised in the cis conformation. Mammalian cyclophilin A (CypA) is a major cellular target for the immunosuppressive drug cyclosporin A (CsA). Other roles for cyclophilins may include chaperone and cell signalling function.
Changed!
PTZ PTZ00060 171aa 1e-51
in ref transcript
cyclophilin; Provisional.
Changed!
cd cyclophilin_ABH_like 126aa 2e-39
in modified transcript
Changed!
pfam Pro_isomerase 121aa 1e-45
in modified transcript
Changed!
PTZ PTZ00060 126aa 2e-38
in modified transcript
NME1
refseq_NME1.F2 refseq_NME1.R2 167 387
NCBIGene 36.3 4830
Single exon skipping, size difference: 220
Exclusion of the protein initiation site
Reference transcript: NM_198175
cd NDPk_I 130aa 1e-65
in ref transcript
Nucleoside diphosphate kinase Group I (NDPk_I)-like: NDP kinase domains are present in a large family of structurally and functionally conserved proteins from bacteria to humans that generally catalyze the transfer of gamma-phosphates of a nucleoside triphosphate (NTP) donor onto a nucleoside diphosphate (NDP) acceptor through a phosphohistidine intermediate. The mammalian nm23/NDP kinase gene family can be divided into two distinct groups. The group I genes encode proteins that generally have highly homologous counterparts in other organisms and possess the classic enzymatic activity of a kinase. This group includes vertebrate NDP kinases A-D (Nm23- H1 to -H4), and its counterparts in bacteria, archea and other eukaryotes. NDP kinases exist in two different quaternary structures; all known eukaryotic enzymes are hexamers, while some bacterial enzymes are tetramers, as in Myxococcus. They possess the NDP kinase active site motif (NXXH[G/A]SD) and the nine residues that are most essential for catalysis.
pfam NDK 135aa 2e-72
in ref transcript
Nucleoside diphosphate kinase.
PTZ PTZ00093 147aa 2e-72
in ref transcript
nucleoside diphosphate kinase; Provisional.
NME7
refseq_NME7.F1 refseq_NME7.R1 141 172
NCBIGene 36.3 29922
Alternative 3-prime, size difference: 31
Exclusion in the protein causing a frameshift
Reference transcript: NM_013330
Changed!
cd NDPk7A 131aa 3e-63
in ref transcript
Nucleoside diphosphate kinase 7 domain A (NDPk7A): The nm23-H7 class of nucleoside diphosphate kinase (NDPk7) consists of an N-terminal DM10 domain and two functional catalytic NDPk modules, NDPk7A and NDPk7B. The function of the DM10 domain, which also occurs in multiple copies in other proteins, is unknown. NDPk7 is predominantly expressed in testes, although appreciable amount are also found in liver, heart, brain, ovary, small intestine and spleen. The nm23-H7 gene is located in or near the hereditary prostrate cancer susceptibility locus. Nm23-H7 may be involved in the development of colon and gastric carcinoma, the latter possibly in a type-specific manner.
Changed!
cd NDPk7B 134aa 3e-62
in ref transcript
Nucleoside diphosphate kinase 7 domain B (NDPk7B): The nm23-H7 class of nucleoside diphosphate kinase (NDPk7) consists of an N-terminal DM10 domain and two functional catalytic NDPk modules, NDPk7A and NDPk7B. The function of the DM10 domain, which also occurs in multiple copies in other proteins, is unknown. NDPk7 is predominantly expressed in testes, although appreciable amount are also found in liver, heart, brain, ovary, small intestine and spleen. The nm23-H7 gene is located in or near the hereditary prostrate cancer susceptibility locus. Nm23-H7 may be involved in the development of colon and gastric carcinoma, the latter possibly in a type-specific manner.
Changed!
smart NDK 134aa 4e-47
in ref transcript
These are enzymes that catalyze nonsubstrate specific conversions of nucleoside diphosphates to nucleoside triphosphates. These enzymes play important roles in bacterial growth, signal transduction and pathogenicity.
Changed!
smart NDK 138aa 1e-44
in ref transcript
Changed!
smart DM10 89aa 6e-25
in ref transcript
Domains in hypothetical proteins in Drosophila, C. elegans and mammals. Occurs singly in some nucleoside diphosphate kinases.
Changed!
COG Ndk 134aa 4e-31
in ref transcript
Nucleoside diphosphate kinase [Nucleotide transport and metabolism].
Changed!
COG Ndk 133aa 3e-22
in ref transcript
NNAT
refseq_NNAT.F1 refseq_NNAT.R1 316 397
NCBIGene 36.3 4826
Single exon skipping, size difference: 81
Exclusion in the protein (no frameshift)
Reference transcript: NM_005386
NOL1
refseq_NOL1.F1 refseq_NOL1.R1 116 184
NCBIGene 36.3 4839
Alternative 5-prime, size difference: 68
Exclusion in 5'UTR
Reference transcript: NM_001033714
cd AdoMet_MTases 129aa 1e-05
in ref transcript
S-adenosylmethionine-dependent methyltransferases (SAM or AdoMet-MTase), class I; AdoMet-MTases are enzymes that use S-adenosyl-L-methionine (SAM or AdoMet) as a substrate for methyltransfer, creating the product S-adenosyl-L-homocysteine (AdoHcy). There are at least five structurally distinct families of AdoMet-MTases, class I being the largest and most diverse. Within this class enzymes can be classified by different substrate specificities (small molecules, lipids, nucleic acids, etc.) and different target atoms for methylation (nitrogen, oxygen, carbon, sulfur, etc.).
pfam Nol1_Nop2_Fmu 286aa 8e-92
in ref transcript
NOL1/NOP2/sun family.
COG Sun 330aa 2e-79
in ref transcript
tRNA and rRNA cytosine-C5-methylases [Translation, ribosomal structure and biogenesis].
NOLA1
refseq_NOLA1.F1 refseq_NOLA1.R1 117 376
NCBIGene 36.3 54433
Alternative 5-prime, size difference: 259
Exclusion in 5'UTR
Reference transcript: NM_018983
pfam Gar1 104aa 3e-36
in ref transcript
Gar1 protein RNA binding region. Gar1 is a small nucleolar RNP that is required for pre-mRNA processing and pseudouridylation. It is co-immunoprecipitated with the H/ACA families of snoRNAs. This family represents the conserved central region of Gar1. This region is necessary and sufficient for normal cell growth, and specifically binds two snoRNAs snR10 and snR30. This region is also necessary for nucleolar targeting, and it is thought that the protein is co-transported to the nucleolus as part of a nucleoprotein complex. In humans, Gar1 is also component of telomerase in vivo.
COG GAR1 78aa 4e-12
in ref transcript
RNA-binding protein involved in rRNA processing [Translation, ribosomal structure and biogenesis].
NOLA2
refseq_NOLA2.F2 refseq_NOLA2.R2 269 375
NCBIGene 36.3 55651
Single exon skipping, size difference: 106
Exclusion in the protein causing a frameshift
Reference transcript: NM_017838
Changed!
pfam Ribosomal_L7Ae 94aa 1e-18
in ref transcript
Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from eukaryotes and archaebacteria. Gadd45 and MyD118.
Changed!
COG RPL8A 114aa 2e-16
in ref transcript
Ribosomal protein HS6-type (S12/L30/L7a) [Translation, ribosomal structure and biogenesis].
Changed!
pfam Ribosomal_L7Ae 32aa 0.003
in modified transcript
NOSTRIN
refseq_NOSTRIN.F1 refseq_NOSTRIN.R1 122 208
NCBIGene 36.3 115677
Single exon skipping, size difference: 86
Exclusion in the protein causing a frameshift
Reference transcript: NM_001039724
Changed!
cd SH3 52aa 1e-14
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
Changed!
smart SH3 57aa 9e-17
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
NOX1
refseq_NOX1.F1 refseq_NOX1.R1 165 312
NCBIGene 36.3 27035
Single exon skipping, size difference: 147
Exclusion in the protein (no frameshift)
Reference transcript: NM_007052
Changed!
cd NOX_Duox_like_FAD_NADP 268aa 3e-42
in ref transcript
NADPH oxidase (NOX) catalyzes the generation of reactive oxygen species (ROS) such as superoxide and hydrogen peroxide. ROS were originally identified as bactericidal agents in phagocytes, but are now also implicated in cell signaling and metabolism. NOX has a 6-alpha helix heme-binding transmembrane domain fused to a flavoprotein with the nucleotide binding domain located in the cytoplasm. Duox enzymes link a peroxidase domain to the NOX domain via a single transmembrane and EF-hand Ca2+ binding sites. The flavoprotein module has a ferredoxin like FAD/NADPH binding domain. In classical phagocytic NOX2, electron transfer occurs from NADPH to FAD to the heme of cytb to oxygen leading to superoxide formation.
Changed!
pfam NAD_binding_6 150aa 5e-27
in ref transcript
Ferric reductase NAD binding domain.
pfam FAD_binding_8 95aa 6e-21
in ref transcript
FAD-binding domain.
pfam Ferric_reduct 156aa 2e-18
in ref transcript
Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mouse-ear cress. This may be a family of flavocytochromes capable of moving electrons across the plasma membrane. The Frp1 protein from S. pombe is a ferric reductase component and is required for cell surface ferric reductase activity, mutants in frp1 are deficient in ferric iron uptake. Cytochrome B-245 heavy chain is a FAD-dependent dehydrogenase it is also has electron transferase activity which reduces molecular oxygen to superoxide anion, a precursor in the production of microbicidal oxidants. Mutations in the sequence of cytochrome B-245 heavy chain (gp91-phox) lead to the X-linked chronic granulomatous disease. The bacteriocidal ability of phagocytic cells is reduced and is characterised by the absence of a functional plasma membrane associated NADPH oxidase. The chronic granulomatous disease gene codes for the beta chain of cytochrome B-245 and cytochrome B-245 is missing from patients with the disease.
Changed!
COG COG4097 263aa 9e-09
in ref transcript
Predicted ferric reductase [Inorganic ion transport and metabolism].
Changed!
cd NOX_Duox_like_FAD_NADP 219aa 6e-34
in modified transcript
Changed!
pfam NAD_binding_6 101aa 4e-12
in modified transcript
Changed!
COG COG4097 214aa 6e-10
in modified transcript
NPAS3
refseq_NPAS3.F1 refseq_NPAS3.R1 147 204
NCBIGene 36.3 64067
Exon skipping and alternative 3-prime or 5-prime, size difference: 57
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_173159
cd PAS 81aa 1e-09
in ref transcript
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.
cd PAS 53aa 3e-07
in ref transcript
cd HLH 58aa 3e-06
in ref transcript
Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
pfam PAS_3 69aa 1e-10
in ref transcript
PAS fold. The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs. The PAS fold appears in archaea, eubacteria and eukarya.
smart PAS 59aa 6e-09
in ref transcript
PAS domain. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels ([1]; Ponting & Aravind, in press).
smart HLH 46aa 1e-05
in ref transcript
helix loop helix domain.
NPTN
refseq_NPTN.F1 refseq_NPTN.R1 103 451
NCBIGene 36.3 27020
Single exon skipping, size difference: 348
Exclusion in the protein (no frameshift)
Reference transcript: NM_012428
cd IGcam 77aa 6e-07
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
Changed!
cd IGcam 89aa 1e-05
in ref transcript
pfam I-set 96aa 5e-10
in ref transcript
Immunoglobulin I-set domain.
Changed!
pfam I-set 88aa 1e-07
in ref transcript
NR1I2
refseq_NR1I2.F1 refseq_NR1I2.R1 265 376
NCBIGene 36.3 8856
Alternative 3-prime, size difference: 111
Exclusion in the protein (no frameshift)
Reference transcript: NM_022002
Changed!
cd NR_LBD_PXR_like 193aa 1e-102
in ref transcript
The ligand binding domain of xenobiotic receptors:pregnane X receptor and constitutive androstane receptor. The ligand binding domain of xenobiotic receptors: This xenobiotic receptor family includes pregnane X receptor (PXR), constitutive androstane receptor (CAR) and other related nuclear receptors. They function as sensors of toxic byproducts of cell metabolism and of exogenous chemicals, to facilitate their elimination. The nuclear receptor pregnane X receptor (PXR) is a ligand-regulated transcription factor that responds to a diverse array of chemically distinct ligands, including many endogenous compounds and clinical drugs. The ligand binding domain of PXR shows remarkable flexibility to accommodate both large and small molecules. PXR functions as a heterodimer with retinoic X receptor-alpha (RXRa) and binds to a variety of response elements in the promoter regions of a diverse set of target genes involved in the metabolism, transport, and elimination of these molecules from the cell. Constitutive androstane receptor (CAR) is a closest mammalian relative of PXR, which has also been proposed to function as a xenosensor. CAR is activated by some of the same ligands as PXR and regulates a subset of common genes. The sequence homology and functional similarity suggests that the CAR gene arose from a duplication of an ancestral PXR gene. Like other nuclear receptors, xenobiotic receptors have a central well conserved DNA binding domain (DBD), a variable N-terminal domain, a flexible hinge and a C-terminal ligand binding domain (LBD).
Changed!
cd NR_LBD_PXR_like 33aa 1e-09
in ref transcript
pfam Hormone_recep 181aa 2e-32
in ref transcript
Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors.
pfam zf-C4 74aa 1e-30
in ref transcript
Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other.
Changed!
cd NR_LBD_PXR_like 249aa 1e-113
in modified transcript
NR4A1
refseq_NR4A1.F1 refseq_NR4A1.R1 140 290
NCBIGene 36.3 3164
Single exon skipping, size difference: 150
Exclusion in 5'UTR
Reference transcript: NM_002135
cd NR_LBD_NGFI-B 238aa 1e-138
in ref transcript
The ligand binding domain of Nurr1, a member of conserved family of nuclear receptors. The ligand binding domain of Nerve growth factor-induced-B (NGFI-B): NGFI-B is a member of the nuclear#steroid receptor superfamily. NGFI-B is classified as an orphan receptor because no ligand has yet been identified. NGFI-B is an early immediate gene product of the embryo development that is rapidly produced in response to a variety of cellular signals including nerve growth factor. It is involved in T-cell-mediated apoptosis, as well as neuronal differentiation and function. NGFI-B regulates transcription by binding to a specific DNA target upstream of its target genes and regulating the rate of transcriptional initiation. Like other members of the nuclear receptor (NR) superfamily of ligand-activated transcription factors, NGFI-B has a central well conserved DNA binding domain (DBD), a variable N-terminal domain, a flexible hinge and a C-terminal ligand binding domain (LBD).
pfam Hormone_recep 179aa 4e-36
in ref transcript
Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors.
pfam zf-C4 74aa 1e-28
in ref transcript
Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other.
NR4A2
refseq_NR4A2.F1 refseq_NR4A2.R1 162 216
NCBIGene 36.3 4929
Alternative 3-prime, size difference: 54
Exclusion in the protein (no frameshift)
Reference transcript: NM_006186
Changed!
cd NR_LBD_Nurr1 235aa 1e-137
in ref transcript
The ligand binding domain of Nurr1, a member of conserved family of nuclear receptors. The ligand binding domain of nuclear receptor Nurr1: Nurr1 belongs to the conserved family of nuclear receptors. It is a transcription factor that is expressed in the embryonic ventral midbrain and is critical for the development of dopamine (DA) neurons. Structural studies have shown that the ligand binding pocket of Nurr1 is filled by bulky hydrophobic residues, making it unable to bind to ligands. Therefore, it belongs to the class of orphan receptors. However, Nurr1 forms heterodimers with RXR and can promote signaling via its partner, RXR. Like other members of the nuclear receptor (NR) superfamily of ligand-activated transcription factors, Nurr1 has a central well conserved DNA binding domain (DBD), a variable N-terminal domain, a flexible hinge and a C-terminal ligand binding domain (LBD).
pfam zf-C4 75aa 1e-39
in ref transcript
Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other.
Changed!
pfam Hormone_recep 175aa 8e-34
in ref transcript
Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors.
Changed!
cd NR_LBD_Nurr1 217aa 1e-122
in modified transcript
Changed!
pfam Hormone_recep 157aa 2e-28
in modified transcript
NR4A2
refseq_NR4A2.F2 refseq_NR4A2.R2 115 293
NCBIGene 36.3 4929
Alternative 3-prime, size difference: 178
Exclusion of the protein initiation site
Reference transcript: NM_006186
cd NR_LBD_Nurr1 235aa 1e-137
in ref transcript
The ligand binding domain of Nurr1, a member of conserved family of nuclear receptors. The ligand binding domain of nuclear receptor Nurr1: Nurr1 belongs to the conserved family of nuclear receptors. It is a transcription factor that is expressed in the embryonic ventral midbrain and is critical for the development of dopamine (DA) neurons. Structural studies have shown that the ligand binding pocket of Nurr1 is filled by bulky hydrophobic residues, making it unable to bind to ligands. Therefore, it belongs to the class of orphan receptors. However, Nurr1 forms heterodimers with RXR and can promote signaling via its partner, RXR. Like other members of the nuclear receptor (NR) superfamily of ligand-activated transcription factors, Nurr1 has a central well conserved DNA binding domain (DBD), a variable N-terminal domain, a flexible hinge and a C-terminal ligand binding domain (LBD).
pfam zf-C4 75aa 1e-39
in ref transcript
Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other.
pfam Hormone_recep 175aa 8e-34
in ref transcript
Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors.
NR5A2
refseq_NR5A2.F1 refseq_NR5A2.R1 124 262
NCBIGene 36.3 2494
Single exon skipping, size difference: 138
Exclusion in the protein (no frameshift)
Reference transcript: NM_205860
cd NR_LBD_Lrh-1 241aa 1e-140
in ref transcript
The ligand binding domain of the liver receptor homolog-1, a member of nuclear receptor superfamily,. The ligand binding domain (LBD) of the liver receptor homolog-1 (LRH-1): LRH-1 belongs to nuclear hormone receptor superfamily, and is expressed mainly in the liver, intestine, exocrine pancreas, and ovary. Most nuclear receptors function as homodimer or heterodimers. However, LRH-1 binds DNA as a monomer, and is a regulator of bile-acid homeostasis, steroidogenesis, reverse cholesterol transport and the initial stages of embryonic development. Recently, phospholipids have been identified as potential ligand for LRH-1 and steroidogenic factor-1 (SF-1). Like other members of the nuclear receptor (NR) superfamily of ligand-activated transcription factors, LRH-1 has a central well conserved DNA binding domain (DBD), a variable N-terminal domain, a flexible hinge and a C-terminal ligand binding domain (LBD).
pfam Hormone_recep 186aa 1e-41
in ref transcript
Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors.
pfam zf-C4 76aa 4e-38
in ref transcript
Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other.
NRAP
refseq_NRAP.F2 refseq_NRAP.R2 124 229
NCBIGene 36.3 4892
Single exon skipping, size difference: 105
Exclusion in the protein (no frameshift)
Reference transcript: NM_198060
pfam LIM 52aa 2e-08
in ref transcript
LIM domain. This family represents two copies of the LIM structural domain.
smart NEBU 31aa 7e-05
in ref transcript
The Nebulin repeat is present also in Las1. Tandem arrays of these repeats are known to bind actin.
smart NEBU 31aa 7e-04
in ref transcript
smart NEBU 31aa 0.001
in ref transcript
smart NEBU 31aa 0.002
in ref transcript
pfam Nebulin 29aa 0.002
in ref transcript
Nebulin repeat.
smart NEBU 31aa 0.003
in ref transcript
smart NEBU 31aa 0.005
in ref transcript
NRCAM
refseq_NRCAM.F2 refseq_NRCAM.R2 150 180
NCBIGene 36.3 4897
Single exon skipping, size difference: 30
Exclusion in the protein (no frameshift)
Reference transcript: NM_001037132
cd IGcam 94aa 7e-14
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IGcam 88aa 5e-13
in ref transcript
Changed!
cd FN3 90aa 8e-12
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd IGcam 88aa 4e-10
in ref transcript
cd IGcam 89aa 4e-10
in ref transcript
cd FN3 96aa 6e-10
in ref transcript
cd FN3 100aa 5e-09
in ref transcript
cd IGcam 74aa 4e-07
in ref transcript
cd FN3 88aa 4e-04
in ref transcript
cd FN3 77aa 0.004
in ref transcript
pfam I-set 89aa 2e-12
in ref transcript
Immunoglobulin I-set domain.
pfam I-set 89aa 1e-11
in ref transcript
smart IGc2 63aa 1e-11
in ref transcript
Immunoglobulin C-2 Type.
Changed!
pfam fn3 85aa 5e-11
in ref transcript
Fibronectin type III domain.
pfam fn3 86aa 6e-11
in ref transcript
pfam fn3 93aa 4e-10
in ref transcript
pfam I-set 87aa 1e-09
in ref transcript
pfam I-set 71aa 2e-09
in ref transcript
pfam fn3 83aa 4e-05
in ref transcript
pfam fn3 78aa 2e-04
in ref transcript
Changed!
cd FN3 91aa 1e-12
in modified transcript
Changed!
pfam fn3 87aa 1e-11
in modified transcript
NRCAM
refseq_NRCAM.F4 refseq_NRCAM.R4 102 120
NCBIGene 36.3 4897
Single exon skipping, size difference: 18
Exclusion in the protein (no frameshift)
Reference transcript: NM_001037132
cd IGcam 94aa 7e-14
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IGcam 88aa 5e-13
in ref transcript
cd FN3 90aa 8e-12
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd IGcam 88aa 4e-10
in ref transcript
cd IGcam 89aa 4e-10
in ref transcript
cd FN3 96aa 6e-10
in ref transcript
cd FN3 100aa 5e-09
in ref transcript
cd IGcam 74aa 4e-07
in ref transcript
cd FN3 88aa 4e-04
in ref transcript
cd FN3 77aa 0.004
in ref transcript
pfam I-set 89aa 2e-12
in ref transcript
Immunoglobulin I-set domain.
pfam I-set 89aa 1e-11
in ref transcript
smart IGc2 63aa 1e-11
in ref transcript
Immunoglobulin C-2 Type.
pfam fn3 85aa 5e-11
in ref transcript
Fibronectin type III domain.
pfam fn3 86aa 6e-11
in ref transcript
pfam fn3 93aa 4e-10
in ref transcript
pfam I-set 87aa 1e-09
in ref transcript
pfam I-set 71aa 2e-09
in ref transcript
pfam fn3 83aa 4e-05
in ref transcript
pfam fn3 78aa 2e-04
in ref transcript
NRCAM
refseq_NRCAM.F5 refseq_NRCAM.R5 153 468
NCBIGene 36.3 4897
Multiple exon skipping, size difference: 315
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_001037132
cd IGcam 94aa 7e-14
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IGcam 88aa 5e-13
in ref transcript
cd FN3 90aa 8e-12
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd IGcam 88aa 4e-10
in ref transcript
cd IGcam 89aa 4e-10
in ref transcript
cd FN3 96aa 6e-10
in ref transcript
cd FN3 100aa 5e-09
in ref transcript
cd IGcam 74aa 4e-07
in ref transcript
cd FN3 88aa 4e-04
in ref transcript
Changed!
cd FN3 77aa 0.004
in ref transcript
pfam I-set 89aa 2e-12
in ref transcript
Immunoglobulin I-set domain.
pfam I-set 89aa 1e-11
in ref transcript
smart IGc2 63aa 1e-11
in ref transcript
Immunoglobulin C-2 Type.
pfam fn3 85aa 5e-11
in ref transcript
Fibronectin type III domain.
pfam fn3 86aa 6e-11
in ref transcript
pfam fn3 93aa 4e-10
in ref transcript
pfam I-set 87aa 1e-09
in ref transcript
pfam I-set 71aa 2e-09
in ref transcript
pfam fn3 83aa 4e-05
in ref transcript
Changed!
pfam fn3 78aa 2e-04
in ref transcript
NRXN2
refseq_NRXN2.F2 refseq_NRXN2.R2 153 180
NCBIGene 36.3 9379
Single exon skipping, size difference: 27
Exclusion in the protein (no frameshift)
Reference transcript: NM_015080
cd LamG 154aa 3e-24
in ref transcript
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
cd LamG 171aa 3e-20
in ref transcript
cd LamG 168aa 1e-19
in ref transcript
cd LamG 174aa 2e-12
in ref transcript
cd LamG 157aa 4e-11
in ref transcript
Changed!
cd LamG 157aa 6e-11
in ref transcript
cd EGF_CA 34aa 0.001
in ref transcript
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
smart LamG 136aa 2e-28
in ref transcript
Laminin G domain.
smart LamG 151aa 2e-24
in ref transcript
smart LamG 147aa 8e-24
in ref transcript
Changed!
pfam Laminin_G_2 128aa 6e-16
in ref transcript
Laminin G domain. This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.
smart LamG 154aa 3e-13
in ref transcript
smart LamG 139aa 2e-11
in ref transcript
pfam FAP 88aa 0.009
in ref transcript
Fibronectin-attachment protein (FAP). This family contains bacterial fibronectin-attachment proteins (FAP). Family members are rich in alanine and proline, are approximately 300 long, and seem to be restricted to mycobacteria. These proteins contain a fibronectin-binding motif that allows mycobacteria to bind to fibronectin in the extracellular matrix.
Changed!
cd LamG 148aa 7e-14
in modified transcript
Changed!
smart LamG 128aa 2e-18
in modified transcript
NRXN2
refseq_NRXN2.F3 refseq_NRXN2.R3 229 301
NCBIGene 36.3 9379
Single exon skipping, size difference: 72
Exclusion in the protein (no frameshift)
Reference transcript: NM_015080
cd LamG 154aa 3e-24
in ref transcript
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
cd LamG 171aa 3e-20
in ref transcript
cd LamG 168aa 1e-19
in ref transcript
cd LamG 174aa 2e-12
in ref transcript
cd LamG 157aa 4e-11
in ref transcript
cd LamG 157aa 6e-11
in ref transcript
cd EGF_CA 34aa 0.001
in ref transcript
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
smart LamG 136aa 2e-28
in ref transcript
Laminin G domain.
smart LamG 151aa 2e-24
in ref transcript
smart LamG 147aa 8e-24
in ref transcript
pfam Laminin_G_2 128aa 6e-16
in ref transcript
Laminin G domain. This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.
smart LamG 154aa 3e-13
in ref transcript
smart LamG 139aa 2e-11
in ref transcript
pfam FAP 88aa 0.009
in ref transcript
Fibronectin-attachment protein (FAP). This family contains bacterial fibronectin-attachment proteins (FAP). Family members are rich in alanine and proline, are approximately 300 long, and seem to be restricted to mycobacteria. These proteins contain a fibronectin-binding motif that allows mycobacteria to bind to fibronectin in the extracellular matrix.
NRXN2
refseq_NRXN2.F5 refseq_NRXN2.R5 220 310
NCBIGene 36.3 9379
Single exon skipping, size difference: 90
Exclusion in the protein (no frameshift)
Reference transcript: NM_015080
cd LamG 154aa 3e-24
in ref transcript
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
cd LamG 171aa 3e-20
in ref transcript
cd LamG 168aa 1e-19
in ref transcript
Changed!
cd LamG 174aa 2e-12
in ref transcript
cd LamG 157aa 4e-11
in ref transcript
cd LamG 157aa 6e-11
in ref transcript
cd EGF_CA 34aa 0.001
in ref transcript
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
smart LamG 136aa 2e-28
in ref transcript
Laminin G domain.
smart LamG 151aa 2e-24
in ref transcript
smart LamG 147aa 8e-24
in ref transcript
pfam Laminin_G_2 128aa 6e-16
in ref transcript
Laminin G domain. This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.
Changed!
smart LamG 154aa 3e-13
in ref transcript
smart LamG 139aa 2e-11
in ref transcript
Changed!
pfam FAP 88aa 0.009
in ref transcript
Fibronectin-attachment protein (FAP). This family contains bacterial fibronectin-attachment proteins (FAP). Family members are rich in alanine and proline, are approximately 300 long, and seem to be restricted to mycobacteria. These proteins contain a fibronectin-binding motif that allows mycobacteria to bind to fibronectin in the extracellular matrix.
Changed!
cd LamG 144aa 2e-14
in modified transcript
Changed!
smart LamG 124aa 8e-17
in modified transcript
NRXN2
refseq_NRXN2.F7 refseq_NRXN2.R7 102 123
NCBIGene 36.3 9379
Alternative 5-prime, size difference: 21
Exclusion in the protein (no frameshift)
Reference transcript: NM_015080
cd LamG 154aa 3e-24
in ref transcript
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
cd LamG 171aa 3e-20
in ref transcript
Changed!
cd LamG 168aa 1e-19
in ref transcript
cd LamG 174aa 2e-12
in ref transcript
cd LamG 157aa 4e-11
in ref transcript
cd LamG 157aa 6e-11
in ref transcript
cd EGF_CA 34aa 0.001
in ref transcript
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
smart LamG 136aa 2e-28
in ref transcript
Laminin G domain.
smart LamG 151aa 2e-24
in ref transcript
Changed!
smart LamG 147aa 8e-24
in ref transcript
pfam Laminin_G_2 128aa 6e-16
in ref transcript
Laminin G domain. This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.
smart LamG 154aa 3e-13
in ref transcript
smart LamG 139aa 2e-11
in ref transcript
pfam FAP 88aa 0.009
in ref transcript
Fibronectin-attachment protein (FAP). This family contains bacterial fibronectin-attachment proteins (FAP). Family members are rich in alanine and proline, are approximately 300 long, and seem to be restricted to mycobacteria. These proteins contain a fibronectin-binding motif that allows mycobacteria to bind to fibronectin in the extracellular matrix.
Changed!
cd LamG 161aa 1e-20
in modified transcript
Changed!
smart LamG 140aa 1e-24
in modified transcript
NSFL1C
refseq_NSFL1C.F1 refseq_NSFL1C.R1 142 335
NCBIGene 36.3 55968
Alternative 3-prime, size difference: 95
Exclusion in the protein causing a frameshift
Reference transcript: NM_016143
Changed!
cd p47_UBX 79aa 3e-32
in ref transcript
p47_UBX p47 is an adaptor molecule of the cytosolic AAA ATPase p97. The principal role of the p97-p47 complex is to regulate membrane fusion events. Mono-ubiquitin recognition by p47 is crucial for p97-p47-mediated Golgi membrane fusion events. p47 has carboxy-terminal SEP and UBX domains. The UBX domain has a beta-grasp fold similar to that of ubiquitin however, UBX lacks the c-terminal double glycine motif and is thus unlikely to be conjugated to other proteins.
Changed!
smart SEP 93aa 2e-38
in ref transcript
Domain present in Saccharomyces cerevisiae Shp1, Drosophila melanogaster eyes closed gene (eyc), and vertebrate p47.
Changed!
pfam UBX 80aa 2e-15
in ref transcript
UBX domain. This domain is present in ubiquitin-regulatory proteins and is a general Cdc48-interacting module.
NSFL1C
refseq_NSFL1C.F4 refseq_NSFL1C.R4 102 195
NCBIGene 36.3 55968
Single exon skipping, size difference: 93
Exclusion in the protein (no frameshift)
Reference transcript: NM_016143
cd p47_UBX 79aa 3e-32
in ref transcript
p47_UBX p47 is an adaptor molecule of the cytosolic AAA ATPase p97. The principal role of the p97-p47 complex is to regulate membrane fusion events. Mono-ubiquitin recognition by p47 is crucial for p97-p47-mediated Golgi membrane fusion events. p47 has carboxy-terminal SEP and UBX domains. The UBX domain has a beta-grasp fold similar to that of ubiquitin however, UBX lacks the c-terminal double glycine motif and is thus unlikely to be conjugated to other proteins.
Changed!
smart SEP 93aa 2e-38
in ref transcript
Domain present in Saccharomyces cerevisiae Shp1, Drosophila melanogaster eyes closed gene (eyc), and vertebrate p47.
pfam UBX 80aa 2e-15
in ref transcript
UBX domain. This domain is present in ubiquitin-regulatory proteins and is a general Cdc48-interacting module.
Changed!
smart SEP 92aa 9e-38
in modified transcript
NSUN5B
refseq_NSUN5B.F1 refseq_NSUN5B.R1 139 247
NCBIGene 36.3 155400
Alternative 5-prime, size difference: 108
Exclusion in 5'UTR
Reference transcript: NM_001039575
TIGR nop2p 71aa 2e-04
in ref transcript
COG Sun 101aa 1e-09
in ref transcript
tRNA and rRNA cytosine-C5-methylases [Translation, ribosomal structure and biogenesis].
NT5C1B
refseq_NT5C1B.F1 refseq_NT5C1B.R1 151 331
NCBIGene 36.3 93034
Single exon skipping, size difference: 180
Exclusion in the protein (no frameshift)
Reference transcript: NM_001002006
pfam 5-nucleotidase 261aa 1e-116
in ref transcript
5'-nucleotidase. This family consists of both eukaryotic and prokaryotic 5'-nucleotidase sequences (EC:3.1.3.5).
NT5C3
refseq_NT5C3.F1 refseq_NT5C3.R1 243 298
NCBIGene 36.3 51251
Single exon skipping, size difference: 55
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001002010
Changed!
TIGR HAD-SF-IE 275aa 1e-134
in ref transcript
This group of sequences was found during searches for members of the haloacid dehalogenase (HAD) superfamily. All of the conserved catalytic motifs are found. The placement of the variable domain between motifs 1 and 2 indicates membership in subfamily I of the superfamily, but these sequences are sufficiently different from any of the branches (IA, TIGR01493, TIGR01509, TIGR01549; IB, TIGR01488; IC, TIGR01494; ID, TIGR01658; IF TIGR01545) of that subfamily as to constitute a separate branch to now be called IE. Considering that the closest identifiable hit outside of the noise range is to a phosphoserine phosphatase, this group may be considered to be most closely allied to subfamily IB.
Changed!
COG SerB 122aa 0.009
in ref transcript
Phosphoserine phosphatase [Amino acid transport and metabolism].
NTRK2
refseq_NTRK2.F2 refseq_NTRK2.R2 147 195
NCBIGene 36.3 4915
Single exon skipping, size difference: 48
Exclusion in the protein (no frameshift)
Reference transcript: NM_006180
cd PTKc_TrkB 288aa 0.0
in ref transcript
Catalytic Domain of the Protein Tyrosine Kinase, Tropomyosin Related Kinase B. Protein Tyrosine Kinase (PTK) family; Tropomyosin related kinase B (TrkB); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. TrkB is a member of the Trk subfamily of proteins, which are receptor tyr kinases (RTKs) containing an extracellular region with arrays of leucine-rich motifs flanked by two cysteine-rich clusters followed by two immunoglobulin-like domains, a transmembrane segment, and an intracellular catalytic domain. Binding of TrkB to its ligands, brain-derived neurotrophic factor (BDNF) or neurotrophin 4 (NT4), results in receptor oligomerization and activation of the catalytic domain. TrkB is broadly expressed in the nervous system and in some non-neural tissues. It plays important roles in cell proliferation, differentiation, and survival. BDNF/Trk signaling plays a key role in regulating activity-dependent synaptic plasticity. TrkB also contributes to protection against gp120-induced neuronal cell death. TrkB overexpression is associated with poor prognosis in neuroblastoma (NB) and other human cancers. It acts as a suppressor of anoikis (detachment-induced apoptosis) and contributes to tumor metastasis.
cd IGcam 79aa 4e-08
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IGcam 69aa 3e-04
in ref transcript
pfam Pkinase_Tyr 270aa 1e-115
in ref transcript
Protein tyrosine kinase.
pfam I-set 79aa 3e-12
in ref transcript
Immunoglobulin I-set domain.
pfam I-set 67aa 4e-06
in ref transcript
smart LRRCT 46aa 8e-05
in ref transcript
Leucine rich repeat C-terminal domain.
pfam LRRNT 30aa 0.006
in ref transcript
Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is often found at the N-terminus of tandem leucine rich repeats.
COG SPS1 262aa 4e-24
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
NTRK3
refseq_NTRK3.F1 refseq_NTRK3.R1 194 236
NCBIGene 36.3 4916
Single exon skipping, size difference: 42
Exclusion in the protein (no frameshift)
Reference transcript: NM_001012338
Changed!
cd PTKc_TrkC 305aa 1e-179
in ref transcript
Catalytic Domain of the Protein Tyrosine Kinase, Tropomyosin Related Kinase C. Protein Tyrosine Kinase (PTK) family; Tropomyosin related kinase C (TrkC); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. TrkC is a member of the Trk subfamily of proteins, which are receptor tyr kinases (RTKs) containing an extracellular region with arrays of leucine-rich motifs flanked by two cysteine-rich clusters followed by two immunoglobulin-like domains, a transmembrane segment, and an intracellular catalytic domain. Binding of TrkC to its ligand, neurotrophin 3 (NT3), results in receptor oligomerization and activation of the catalytic domain. TrkC is broadly expressed in the nervous system and in some non-neural tissues including the developing heart. NT3/TrkC signaling plays an important role in the innervation of the cardiac conducting system and the development of smooth muscle cells. Mice deficient with NT3 and TrkC have multiple heart defects. NT3/TrkC signaling is also critical for the development and maintenance of enteric neurons that are important for the control of gut peristalsis.
cd IGcam 89aa 7e-07
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IGcam 89aa 2e-05
in ref transcript
Changed!
pfam Pkinase_Tyr 287aa 1e-108
in ref transcript
Protein tyrosine kinase.
smart IG_like 85aa 2e-10
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
smart LRRCT 46aa 1e-07
in ref transcript
Leucine rich repeat C-terminal domain.
pfam I-set 68aa 5e-07
in ref transcript
Immunoglobulin I-set domain.
Changed!
COG SPS1 283aa 3e-23
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
Changed!
cd PTKc_TrkC 291aa 0.0
in modified transcript
Changed!
pfam Pkinase_Tyr 273aa 1e-111
in modified transcript
Changed!
COG SPS1 269aa 1e-23
in modified transcript
NUDT1
refseq_NUDT1.F1 refseq_NUDT1.R1 113 186
NCBIGene 36.3 4521
Alternative 5-prime, size difference: 73
Exclusion of the protein initiation site
Reference transcript: NM_198949
cd MTH1 136aa 5e-30
in ref transcript
MutT homolog-1 (MTH1) is a member of the Nudix hydrolase superfamily. MTH1, the mammalian counterpart of MutT, hydrolyzes oxidized purine nucleoside triphosphates, such as 8-oxo-dGTP and 2-hydroxy-ATP, to monophosphates, thereby preventing the incorporation of such oxygen radicals during replication. This is an important step in the repair mechanism in genomic and mitochondrial DNA. Like other members of the Nudix family, it requires a divalent cation, such as Mg2+ or Mn2+, for activity, and contain the Nudix motif, a highly conserved 23-residue block (GX5EX7REUXEEXGU, where U = I, L or V), that functions as a metal binding and catalytic site. MTH1 is predominantly localized in the cytoplasm and mitochondria. Structurally, this enzyme adopts a similar fold to MutT despite low sequence similarity outside the conserved nudix motif. The most distinctive structural difference between MutT and MTH1 is the presence of a beta-hairpin, which is absent in MutT. This results in a much deeper and narrower substrate binding pocket. Mechanistically, MTH1 contains dual specificity for nucleotides that contain 2-OH-adenine bases and those that contain 8-oxo-guanine bases.
pfam NUDIX 93aa 1e-14
in ref transcript
NUDIX domain.
COG COG1051 76aa 4e-05
in ref transcript
ADP-ribose pyrophosphatase [Nucleotide transport and metabolism].
NUDT2
refseq_NUDT2.F2 refseq_NUDT2.R2 116 187
NCBIGene 36.3 318
Alternative 3-prime, size difference: 71
Inclusion in 5'UTR
Reference transcript: NM_147172
cd Ap4A_hydrolase_human_like 139aa 4e-43
in ref transcript
Diadenosine tetraphosphate (Ap4A) hydrolase is a member of the Nudix hydrolase superfamily. Ap4A hydrolases are well represented in a variety of prokaryotic and eukaryotic organisms. Phylogenetic analysis reveals two distinct subgroups where plant enzymes fall into one subfamily and fungi/animals/archaea enzymes, represented by this subfamily, fall into another. Bacterial enzymes are found in both subfamilies. Ap4A is a potential by-product of aminoacyl tRNA synthesis, and accumulation of Ap4A has been implicated in a range of biological events, such as DNA replication, cellular differentiation, heat shock, metabolic stress, and apoptosis. Ap4A hydrolase cleaves Ap4A asymmetrically into ATP and AMP. It is important in the invasive properties of bacteria and thus presents a potential target for inhibition of such invasive bacteria. Besides the signature nudix motif (G[X5]E[X7]REUXEEXGU, where U is Ile, Leu, or Val) that functions as a metal binding and catalytic site, and a required divalent cation, Ap4A hydrolase is structurally similar to the other members of the nudix superfamily with some degree of variation. Several regions in the sequences are poorly defined and substrate and metal binding sites are only predicted based on kinetic studies.
pfam NUDIX 112aa 5e-16
in ref transcript
NUDIX domain.
COG MutT 121aa 3e-07
in ref transcript
NTP pyrophosphohydrolases including oxidative damage repair enzymes [DNA replication, recombination, and repair / General function prediction only].
NUDT9
refseq_NUDT9.F1 refseq_NUDT9.R1 109 446
NCBIGene 36.3 53343
Alternative 5-prime, size difference: 337
Exclusion of the protein initiation site
Reference transcript: NM_024047
cd ADPRase_NUDT9 187aa 9e-77
in ref transcript
ADP-ribose pyrophosphatase (ADPRase) catalyzes the hydrolysis of ADP-ribose to AMP and ribose-5-P. Like other members of the Nudix hydrolase superfamily of enzymes, it is thought to require a divalent cation, such as Mg2+, for its activity. It also contains a 23-residue Nudix motif (GX5EX7REUXEEXGU, where U = I, L or V) which functions as a metal binding site/catalytic site. In addition to the Nudix motif, there are additional conserved amino acid residues, distal from the signature sequence, that correlate with substrate specificity. In humans, there are four distinct ADPRase activities, three putative cytosolic (ADPRase-I, -II, and -Mn) and a single mitochondrial enzyme (ADPRase-m). ADPRase-m is also known as NUDT9. It can be distinugished from the cytosolic ADPRase by a N-terminal target sequence unique to mitochondrial ADPRase. NUDT9 functions as a monomer.
pfam NUDIX 38aa 2e-05
in ref transcript
NUDIX domain.
COG COG1051 135aa 7e-06
in ref transcript
ADP-ribose pyrophosphatase [Nucleotide transport and metabolism].
NUMB
refseq_NUMB.F1 refseq_NUMB.R1 127 271
NCBIGene 36.3 8650
Single exon skipping, size difference: 144
Exclusion in the protein (no frameshift)
Reference transcript: NM_001005743
cd Numb 149aa 2e-72
in ref transcript
Numb Phosphotyrosine-binding (PTB) domain. Numb is a membrane associated adaptor protein, which is a determinant of asymmetric cell division. Numb has an N-terminal PTB domain. PTB domains have a PH-like fold and are found in various eukaryotic signaling molecules. They were initially identified based upon their ability to recognize phosphorylated tyrosine residues. In contrast to SH2 domains, which recognize phosphotyrosine and adjacent carboxy-terminal residues, PTB-domain binding specificity is conferred by residues amino-terminal to the phosphotyrosine. More recent studies have found that some types of PTB domains can bind to peptides which are not tyrosine phosphorylated or lack tyrosine residues altogether.
pfam PID 134aa 7e-34
in ref transcript
Phosphotyrosine interaction domain (PTB/PID).
pfam NumbF 82aa 8e-32
in ref transcript
NUMB domain. This presumed domain is found in the Numb family of proteins adjacent to the PTB domain.
NUP35
refseq_NUP35.F1 refseq_NUP35.R1 217 359
NCBIGene 36.2 129401
Single exon skipping, size difference: 142
Exclusion in the protein causing a frameshift
Reference transcript: NM_138285
Changed!
pfam MPPN 84aa 9e-33
in ref transcript
MPPN (rrm-like) domain. The MPPN (Mitotic PhosphoProtein N' end) family is uncharacterised however it probably plays a role in the cell cycle because the family includes mitotic phosphoproteins. This family also includes a suppressor of thermosensitive mutations in the DNA polymerase delta gene (Pol III). The conserved central region appears to be distantly related to the pfam00076 domain, suggesting an RNA binding function for this protein (Bateman A. pers obs).
NUP62
refseq_NUP62.F1 refseq_NUP62.R1 140 173
NCBIGene 36.3 23636
Alternative 3-prime, size difference: 33
Inclusion in 5'UTR
Reference transcript: NM_016553
pfam Nsp1_C 102aa 8e-39
in ref transcript
Nsp1-like C-terminal region. This family probably forms a coiled-coil. This important region of Nsp1 is involved in binding Nup82.
NUPL1
refseq_NUPL1.F2 refseq_NUPL1.R2 156 192
NCBIGene 36.3 9818
Single exon skipping, size difference: 36
Exclusion in the protein (no frameshift)
Reference transcript: NM_014089
NVL
refseq_NVL.F1 refseq_NVL.R1 189 263
NCBIGene 36.3 4931
Single exon skipping, size difference: 74
Exclusion in the protein causing a frameshift
Reference transcript: NM_002533
Changed!
cd AAA 141aa 2e-24
in ref transcript
The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
Changed!
cd AAA 137aa 1e-23
in ref transcript
Changed!
TIGR CDC48 614aa 1e-161
in ref transcript
This subfamily of the AAA family ATPases includes two members each from three archaeal species. It also includes yeast CDC48 (cell division control protein 48) and the human ortholog, transitional endoplasmic reticulum ATPase (valosin-containing protein). These proteins in eukaryotes are involved in the budding and transfer of membrane from the transitional endoplasmic reticulum to the Golgi apparatus.
Changed!
COG SpoVK 557aa 1e-117
in ref transcript
ATPases of the AAA+ class [Posttranslational modification, protein turnover, chaperones].
NXF5
refseq_NXF5.F1 refseq_NXF5.R1 191 380
NCBIGene 36.2 55998
Multiple exon skipping, size difference: 189
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_032946
Changed!
pfam Tap-RNA_bind 88aa 5e-39
in ref transcript
Tap, RNA-binding. Members of this family adopt a structure consisting of an alpha+beta sandwich with an antiparallel beta-sheet, arranged in a 2(beta-alpha-beta) motif. They are mainly found in mRNA export factors, and mediate the sequence nonspecific nuclear export of cellular mRNAs as well as the sequence-specific export of retroviral mRNAs bearing the constitutive transport element.
NXF5
refseq_NXF5.F5 refseq_NXF5.R5 224 288
NCBIGene 36.2 55998
Alternative 3-prime, size difference: 64
Exclusion in the protein causing a frameshift
Reference transcript: NM_032946
pfam Tap-RNA_bind 88aa 5e-39
in ref transcript
Tap, RNA-binding. Members of this family adopt a structure consisting of an alpha+beta sandwich with an antiparallel beta-sheet, arranged in a 2(beta-alpha-beta) motif. They are mainly found in mRNA export factors, and mediate the sequence nonspecific nuclear export of cellular mRNAs as well as the sequence-specific export of retroviral mRNAs bearing the constitutive transport element.
OAS1
refseq_OAS1.F1 refseq_OAS1.R1 201 299
NCBIGene 36.3 4938
Alternative 3-prime, size difference: 98
Inclusion in the protein causing a frameshift
Reference transcript: NM_001032409
Changed!
pfam OAS1_C 182aa 1e-100
in ref transcript
2'-5'-oligoadenylate synthetase 1, domain 2, C-terminus. This is the largely alpha-helical, C-terminal half of 2'-5'-oligoadenylate synthetase 1, being described as domain 2 of the enzyme and homologous to a tandem ubiquitin repeat. It carries the region of enzymic activity between 320 and 344 at the extreme C-terminal end. Oligoadenylate synthetases are antiviral enzymes that counteract vial attack by degrading viral RNA. The enzyme uses ATP in 2'-specific nucleotidyl transfer reactions to synthesise 2'.5'-oligoadenylates, which activate latent ribonuclease, resulting in degradation of viral RNA and inhibition of virus replication. This domain is often associated with NTP_transf_2 pfam01909.
Changed!
pfam OAS1_C 187aa 1e-100
in modified transcript
OASL
refseq_OASL.F1 refseq_OASL.R1 106 348
NCBIGene 36.3 8638
Single exon skipping, size difference: 242
Exclusion in the protein causing a frameshift
Reference transcript: NM_003733
Changed!
cd OASL_repeat1 80aa 5e-34
in ref transcript
OASL_repeat1 (2'-5' oligoadenylate synthetase-like protein) belongs to a family of interferon-induced 2'-5' oligoadenylate synthetases which are important for the antiviral activity of interferons. While each member of this famliy has a conserved N-terminal OAS catalytic domain, only OASL has two tandem ubiquitin-like repeats located at the C-terminus and this CD represents one of those repeats.
Changed!
pfam OAS1_C 187aa 2e-76
in ref transcript
2'-5'-oligoadenylate synthetase 1, domain 2, C-terminus. This is the largely alpha-helical, C-terminal half of 2'-5'-oligoadenylate synthetase 1, being described as domain 2 of the enzyme and homologous to a tandem ubiquitin repeat. It carries the region of enzymic activity between 320 and 344 at the extreme C-terminal end. Oligoadenylate synthetases are antiviral enzymes that counteract vial attack by degrading viral RNA. The enzyme uses ATP in 2'-specific nucleotidyl transfer reactions to synthesise 2'.5'-oligoadenylates, which activate latent ribonuclease, resulting in degradation of viral RNA and inhibition of virus replication. This domain is often associated with NTP_transf_2 pfam01909.
Changed!
pfam ubiquitin 67aa 6e-07
in ref transcript
Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1, and Parkin. A number of them are thought to carry a distinctive five-residue motif termed the proteasome-interacting motif (PIM), which may have a biologically significant role in protein delivery to proteasomes and recruitment of proteasomes to transcription sites.
Changed!
pfam OAS1_C 66aa 8e-22
in modified transcript
OATL1
refseq_OATL1.F1 refseq_OATL1.R1 230 385
NCBIGene 36.2 4943
Single exon skipping, size difference: 155
Exclusion in the protein causing a frameshift
Reference transcript: NM_002536
Changed!
smart TBC 212aa 2e-46
in ref transcript
Domain in Tre-2, BUB2p, and Cdc16p. Probable Rab-GAPs. Widespread domain present in Gyp6 and Gyp7, thereby giving rise to the notion that it performs a GTP-activator activity on Rab-like GTPases.
Changed!
COG COG5210 215aa 1e-26
in ref transcript
GTPase-activating protein [General function prediction only].
OCIAD2
refseq_OCIAD2.F1 refseq_OCIAD2.R1 101 219
NCBIGene 36.3 132299
Single exon skipping, size difference: 118
Exclusion in the protein causing a frameshift
Reference transcript: NM_001014446
Changed!
pfam OCIA 118aa 1e-34
in ref transcript
Ovarian carcinoma immunoreactive antigen (OCIA). This family consists of several ovarian carcinoma immunoreactive antigen (OCIA) and related eukaryotic sequences. The function of this family is unknown.
Changed!
pfam OCIA 86aa 7e-27
in modified transcript
OCRL
refseq_OCRL.F1 refseq_OCRL.R1 123 147
NCBIGene 36.3 4952
Single exon skipping, size difference: 24
Exclusion in the protein (no frameshift)
Reference transcript: NM_000276
Changed!
cd RhoGAP_OCRL1 229aa 6e-65
in ref transcript
RhoGAP_OCRL1: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain present in OCRL1-like proteins. OCRL1 (oculocerebrorenal syndrome of Lowe 1)-like proteins contain two conserved domains: a central inositol polyphosphate 5-phosphatase domain and a C-terminal Rho GAP domain, this GAP domain lacks the catalytic residue and therefore maybe inactive. OCRL-like proteins are type II inositol polyphosphate 5-phosphatases that can hydrolyze lipid PI(4,5)P2 and PI(3,4,5)P3 and soluble Ins(1,4,5)P3 and Ins(1,3,4,5)P4, but their individual specificities vary. The functionality of the RhoGAP domain is still unclear. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude.
Changed!
cd RhoGAP_OCRL1 221aa 7e-66
in modified transcript
ODF2
refseq_ODF2.F2 refseq_ODF2.R2 115 374
NCBIGene 36.3 4957
Multiple exon skipping, size difference: 259
Exclusion in 5'UTR, Exclusion of the protein initiation site
Reference transcript: NM_002540
TIGR SMC_prok_B 601aa 2e-14
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
COG Smc 329aa 5e-11
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
COG Smc 353aa 5e-09
in ref transcript
ODF2L
refseq_ODF2L.F1 refseq_ODF2L.R1 188 321
NCBIGene 36.3 57489
Single exon skipping, size difference: 133
Exclusion in the protein causing a frameshift
Reference transcript: NM_020729
Changed!
pfam SMC_N 255aa 7e-05
in ref transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Changed!
TIGR SMC_prok_B 198aa 0.006
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
ODZ3
refseq_ODZ3.F1 refseq_ODZ3.R1 102 123
NCBIGene 36.2 55714
Single exon skipping, size difference: 21
Exclusion in the protein (no frameshift)
Reference transcript: XM_931600
pfam Ten_N 279aa 4e-63
in ref transcript
Teneurin Intracellular Region. This family is found in the intracellular N-terminal region of the Teneurin family of proteins. These proteins are 'pair-rule' genes and are involved in tissue patterning, specifically probably neural patterning. The intracellular domain is cleaved in response to homophilic interaction of the extracellular domain, and translocates to the nucleus. Here it probably carries out to some transcriptional regulatory activity. The length of this region and the conservation suggests that there may be two structural domains here (personal obs:C Yeats).
pfam Keratin_B2 153aa 0.005
in ref transcript
Keratin, high sulfur B2 protein. High sulfur proteins are cysteine-rich proteins synthesised during the differentiation of hair matrix cells, and form hair fibres in association with hair keratin intermediate filaments. This family has been divided up into four regions, with the second region containing 8 copies of a short repeat. This family is also known as B2 or KAP1.
COG RhsA 630aa 1e-07
in ref transcript
Rhs family protein [Cell envelope biogenesis, outer membrane].
ODZ3
refseq_ODZ3.F3 refseq_ODZ3.R3 101 128
NCBIGene 36.2 55714
Single exon skipping, size difference: 27
Exclusion in the protein (no frameshift)
Reference transcript: XM_931600
pfam Ten_N 279aa 4e-63
in ref transcript
Teneurin Intracellular Region. This family is found in the intracellular N-terminal region of the Teneurin family of proteins. These proteins are 'pair-rule' genes and are involved in tissue patterning, specifically probably neural patterning. The intracellular domain is cleaved in response to homophilic interaction of the extracellular domain, and translocates to the nucleus. Here it probably carries out to some transcriptional regulatory activity. The length of this region and the conservation suggests that there may be two structural domains here (personal obs:C Yeats).
pfam Keratin_B2 153aa 0.005
in ref transcript
Keratin, high sulfur B2 protein. High sulfur proteins are cysteine-rich proteins synthesised during the differentiation of hair matrix cells, and form hair fibres in association with hair keratin intermediate filaments. This family has been divided up into four regions, with the second region containing 8 copies of a short repeat. This family is also known as B2 or KAP1.
COG RhsA 630aa 1e-07
in ref transcript
Rhs family protein [Cell envelope biogenesis, outer membrane].
OGT
refseq_OGT.F2 refseq_OGT.R2 106 136
NCBIGene 36.3 8473
Alternative 3-prime, size difference: 30
Exclusion in the protein (no frameshift)
Reference transcript: NM_181672
cd TPR 100aa 4e-18
in ref transcript
Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
cd TPR 100aa 4e-18
in ref transcript
cd TPR 100aa 9e-18
in ref transcript
cd TPR 96aa 1e-16
in ref transcript
cd TPR 95aa 1e-14
in ref transcript
cd TPR 98aa 2e-08
in ref transcript
Changed!
TIGR PEP_TPR_lipo 444aa 5e-39
in ref transcript
This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
COG Spy 199aa 8e-46
in ref transcript
Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones].
COG Spy 396aa 1e-38
in ref transcript
COG NrfG 211aa 1e-15
in ref transcript
FOG: TPR repeat [General function prediction only].
COG TadD 199aa 2e-11
in ref transcript
Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking and secretion].
Changed!
TIGR PEP_TPR_lipo 433aa 5e-39
in modified transcript
OSGIN1
refseq_OKL38.F1 refseq_OKL38.R1 110 270
NCBIGene 36.3 29948
Single exon skipping, size difference: 160
Exclusion in the protein causing a frameshift
Reference transcript: NM_013370
Changed!
pfam Pyr_redox_2 208aa 3e-06
in ref transcript
Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain.
Changed!
PRK PRK12770 80aa 4e-07
in ref transcript
NADPH-dependent glutamate synthase beta chain and related oxidoreductases [Amino acid transport and metabolism / General function prediction only].
Changed!
COG TrkA 202aa 0.003
in ref transcript
Predicted flavoprotein involved in K+ transport [Inorganic ion transport and metabolism].
OLAH
refseq_OLAH.F1 refseq_OLAH.R1 227 386
NCBIGene 36.3 55301
Single exon skipping, size difference: 159
Exclusion in the protein (no frameshift)
Reference transcript: NM_018324
Changed!
pfam Thioesterase 282aa 9e-26
in ref transcript
Thioesterase domain. Peptide synthetases are involved in the non-ribosomal synthesis of peptide antibiotics. Next to the operons encoding these enzymes, in almost all cases, are genes that encode proteins that have similarity to the type II fatty acid thioesterases of vertebrates. There are also modules within the peptide synthetases that also share this similarity. With respect to antibiotic production, thioesterases are required for the addition of the last amino acid to the peptide antibiotic, thereby forming a cyclic antibiotic. Thioesterases (non-integrated) have molecular masses of 25-29 kDa.
Changed!
COG GrsT 278aa 8e-34
in ref transcript
Predicted thioesterase involved in non-ribosomal peptide biosynthesis [Secondary metabolites biosynthesis, transport, and catabolism].
Changed!
pfam Thioesterase 229aa 4e-33
in modified transcript
Changed!
COG GrsT 225aa 4e-42
in modified transcript
OPA1
refseq_OPA1.F1 OPA1.u.r.6 129 183
NCBIGene 36.3 4976
Single exon skipping, size difference: 54
Exclusion in the protein (no frameshift)
Reference transcript: NM_130837
pfam Dynamin_N 179aa 4e-38
in ref transcript
Dynamin family.
OPN3
refseq_OPN3.F2 refseq_OPN3.R2 206 366
NCBIGene 36.2 23596
Alternative 5-prime, size difference: 160
Exclusion in the protein causing a frameshift
Reference transcript: NM_014322
Changed!
pfam 7tm_1 243aa 6e-23
in ref transcript
7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
Changed!
pfam 7tm_1 115aa 2e-08
in modified transcript
OPN4
refseq_OPN4.F1 refseq_OPN4.R1 142 175
NCBIGene 36.3 94233
Single exon skipping, size difference: 33
Exclusion in the protein (no frameshift)
Reference transcript: NM_001030015
Changed!
pfam 7tm_1 257aa 6e-41
in ref transcript
7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
Changed!
pfam 7tm_1 258aa 8e-43
in modified transcript
OPN5
refseq_OPN5.F2 refseq_OPN5.R2 242 322
NCBIGene 36.3 221391
Single exon skipping, size difference: 80
Inclusion in the protein causing a frameshift
Reference transcript: NM_181744
Changed!
pfam 7tm_1 248aa 1e-25
in ref transcript
7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
Changed!
pfam 7tm_1 79aa 5e-10
in modified transcript
OPRL1
refseq_OPRL1.F1 refseq_OPRL1.R1 134 285
NCBIGene 36.3 4987
Single exon skipping, size difference: 151
Exclusion in 5'UTR
Reference transcript: NM_182647
pfam 7tm_1 246aa 4e-38
in ref transcript
7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
OPRS1
refseq_OPRS1.F2 refseq_OPRS1.R2 140 248
NCBIGene 36.2 10280
Alternative 5-prime, size difference: 66
Exclusion in the protein (no frameshift)
Reference transcript: NM_005866
Changed!
pfam ERG2_Sigma1R 189aa 3e-73
in ref transcript
ERG2 and Sigma1 receptor like protein. This family consists of the fungal C-8 sterol isomerase and mammalian sigma1 receptor. C-8 sterol isomerase (delta-8-delta-7 sterol isomerase), catalyses a reaction in ergosterol biosynthesis, which results in unsaturation at C-7 in the B ring of sterols. Sigma 1 receptor is a low molecular mass mammalian protein located in the endoplasmic reticulum, which interacts with endogenous steroid hormones, such as progesterone and testosterone. It also binds the sigma ligands, which are are a set of chemically unrelated drugs including haloperidol, pentazocine, and ditolylguanidine. Sigma1 effectors are not well understood, but sigma1 agonists have been observed to affect NMDA receptor function, the alpha-adrenergic system and opioid analgesia.
Changed!
pfam ERG2_Sigma1R 167aa 7e-67
in modified transcript
OPRS1
refseq_OPRS1.F3 refseq_OPRS1.R3 129 222
NCBIGene 36.3 10280
Single exon skipping, size difference: 93
Exclusion in the protein (no frameshift)
Reference transcript: NM_005866
Changed!
pfam ERG2_Sigma1R 189aa 3e-73
in ref transcript
ERG2 and Sigma1 receptor like protein. This family consists of the fungal C-8 sterol isomerase and mammalian sigma1 receptor. C-8 sterol isomerase (delta-8-delta-7 sterol isomerase), catalyses a reaction in ergosterol biosynthesis, which results in unsaturation at C-7 in the B ring of sterols. Sigma 1 receptor is a low molecular mass mammalian protein located in the endoplasmic reticulum, which interacts with endogenous steroid hormones, such as progesterone and testosterone. It also binds the sigma ligands, which are are a set of chemically unrelated drugs including haloperidol, pentazocine, and ditolylguanidine. Sigma1 effectors are not well understood, but sigma1 agonists have been observed to affect NMDA receptor function, the alpha-adrenergic system and opioid analgesia.
Changed!
pfam ERG2_Sigma1R 158aa 3e-54
in modified transcript
ORC4L
refseq_ORC4L.F2 refseq_ORC4L.R2 133 158
NCBIGene 36.3 5000
Alternative 5-prime, size difference: 25
Exclusion in 5'UTR
Reference transcript: NM_002552
cd AAA 168aa 5e-08
in ref transcript
The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
TIGR TIGR02928 182aa 6e-14
in ref transcript
Members of this protein family are found exclusively in the archaea. This set of DNA binding proteins shows homology to the origin recognition complex subunit 1/cell division control protein 6 family in eukaryotes. Several members may be found in genome and interact with each other.
COG CDC6 176aa 2e-16
in ref transcript
Cdc6-related protein, AAA superfamily ATPase [DNA replication, recombination, and repair / Posttranslational modification, protein turnover, chaperones].
OS9
refseq_OS9.F1 refseq_OS9.R1 300 465
NCBIGene 36.3 10956
Single exon skipping, size difference: 165
Exclusion in the protein (no frameshift)
Reference transcript: NM_006812
pfam PRKCSH 69aa 5e-13
in ref transcript
Glucosidase II beta subunit-like protein. The sequences found in this family are similar to a region found in the beta-subunit of glucosidase II, which is also known as protein kinase C substrate 80K-H (PRKCSH). The enzyme catalyses the sequential removal of two alpha-1,3-linked glucose residues in the second step of N-linked oligosaccharide processing. The beta subunit is required for the solubility and stability of the heterodimeric enzyme, and is involved in retaining the enzyme within the endoplasmic reticulum. Mutations in the gene coding for PRKCSH have been found to be involved in the development of autosomal dominant polycystic liver disease (ADPLD), but the precise role the protein has in the pathogenesis of this disease is unknown. This family also includes an ER sensor for misfolded glycoproteins and is therefore likely to be a generic sugar binding domain.
OS9
refseq_OS9.F3 refseq_OS9.R3 214 259
NCBIGene 36.3 10956
Alternative 5-prime, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_006812
pfam PRKCSH 69aa 5e-13
in ref transcript
Glucosidase II beta subunit-like protein. The sequences found in this family are similar to a region found in the beta-subunit of glucosidase II, which is also known as protein kinase C substrate 80K-H (PRKCSH). The enzyme catalyses the sequential removal of two alpha-1,3-linked glucose residues in the second step of N-linked oligosaccharide processing. The beta subunit is required for the solubility and stability of the heterodimeric enzyme, and is involved in retaining the enzyme within the endoplasmic reticulum. Mutations in the gene coding for PRKCSH have been found to be involved in the development of autosomal dominant polycystic liver disease (ADPLD), but the precise role the protein has in the pathogenesis of this disease is unknown. This family also includes an ER sensor for misfolded glycoproteins and is therefore likely to be a generic sugar binding domain.
OSBPL1A
refseq_OSBPL1A.F2 refseq_OSBPL1A.R2 105 174
NCBIGene 36.2 114876
Alternative 5-prime and 3-prime, size difference: 69
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_080597
cd ANK 154aa 2e-19
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
cd ANK 74aa 3e-07
in ref transcript
cd PH_oxysterol_bp 95aa 3e-06
in ref transcript
Oxysterol binding protein (OSBP) Pleckstrin homology (PH) domain. Oxysterol binding proteins are a multigene family that is conserved in yeast, flies, worms, mammals and plants. They all contain a C-terminal oxysterol binding domain, and most contain an N-terminal PH domain. OSBP PH domains bind to membrane phosphoinositides and thus likely play an important role in intracellular targeting. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
Changed!
pfam Oxysterol_BP 402aa 2e-72
in ref transcript
Oxysterol-binding protein.
pfam Ank 32aa 1e-06
in ref transcript
Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
pfam Ank 32aa 4e-05
in ref transcript
pfam Ank 31aa 3e-04
in ref transcript
smart PH 97aa 0.001
in ref transcript
Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
FOG: Ankyrin repeat [General function prediction only].
Changed!
pfam Oxysterol_BP 379aa 5e-58
in modified transcript
OSBPL3
refseq_OSBPL3.F2 refseq_OSBPL3.R2 102 195
NCBIGene 36.3 26031
Single exon skipping, size difference: 93
Exclusion in the protein (no frameshift)
Reference transcript: NM_015550
cd PH_oxysterol_bp 91aa 2e-18
in ref transcript
Oxysterol binding protein (OSBP) Pleckstrin homology (PH) domain. Oxysterol binding proteins are a multigene family that is conserved in yeast, flies, worms, mammals and plants. They all contain a C-terminal oxysterol binding domain, and most contain an N-terminal PH domain. OSBP PH domains bind to membrane phosphoinositides and thus likely play an important role in intracellular targeting. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
pfam Oxysterol_BP 300aa 2e-56
in ref transcript
Oxysterol-binding protein.
smart PH 91aa 3e-09
in ref transcript
Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
OSBPL5
refseq_OSBPL5.F1 refseq_OSBPL5.R1 112 316
NCBIGene 36.3 114879
Single exon skipping, size difference: 204
Exclusion in the protein (no frameshift)
Reference transcript: NM_020896
Changed!
cd PH_oxysterol_bp 114aa 4e-20
in ref transcript
Oxysterol binding protein (OSBP) Pleckstrin homology (PH) domain. Oxysterol binding proteins are a multigene family that is conserved in yeast, flies, worms, mammals and plants. They all contain a C-terminal oxysterol binding domain, and most contain an N-terminal PH domain. OSBP PH domains bind to membrane phosphoinositides and thus likely play an important role in intracellular targeting. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
pfam Oxysterol_BP 332aa 1e-52
in ref transcript
Oxysterol-binding protein.
Changed!
smart PH 117aa 4e-08
in ref transcript
Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
OSBPL6
refseq_OSBPL6.F2 refseq_OSBPL6.R2 241 316
NCBIGene 36.3 114880
Single exon skipping, size difference: 75
Exclusion in the protein (no frameshift)
Reference transcript: NM_145739
cd PH_oxysterol_bp 91aa 7e-15
in ref transcript
Oxysterol binding protein (OSBP) Pleckstrin homology (PH) domain. Oxysterol binding proteins are a multigene family that is conserved in yeast, flies, worms, mammals and plants. They all contain a C-terminal oxysterol binding domain, and most contain an N-terminal PH domain. OSBP PH domains bind to membrane phosphoinositides and thus likely play an important role in intracellular targeting. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
pfam Oxysterol_BP 342aa 3e-58
in ref transcript
Oxysterol-binding protein.
smart PH 89aa 8e-08
in ref transcript
Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
OSBPL8
refseq_OSBPL8.F1 refseq_OSBPL8.R1 188 234
NCBIGene 36.3 114882
Exon skipping and alternative 3-prime or 5-prime, size difference: 46
Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
Reference transcript: NM_020841
Changed!
cd PH_oxysterol_bp 114aa 2e-19
in ref transcript
Oxysterol binding protein (OSBP) Pleckstrin homology (PH) domain. Oxysterol binding proteins are a multigene family that is conserved in yeast, flies, worms, mammals and plants. They all contain a C-terminal oxysterol binding domain, and most contain an N-terminal PH domain. OSBP PH domains bind to membrane phosphoinositides and thus likely play an important role in intracellular targeting. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
Changed!
pfam Oxysterol_BP 333aa 3e-51
in ref transcript
Oxysterol-binding protein.
Changed!
smart PH 117aa 2e-08
in ref transcript
Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
OSBPL9
refseq_OSBPL9.F1 refseq_OSBPL9.R1 104 143
NCBIGene 36.3 114883
Single exon skipping, size difference: 39
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_148909
cd PH_oxysterol_bp 93aa 1e-28
in ref transcript
Oxysterol binding protein (OSBP) Pleckstrin homology (PH) domain. Oxysterol binding proteins are a multigene family that is conserved in yeast, flies, worms, mammals and plants. They all contain a C-terminal oxysterol binding domain, and most contain an N-terminal PH domain. OSBP PH domains bind to membrane phosphoinositides and thus likely play an important role in intracellular targeting. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
pfam Oxysterol_BP 342aa 2e-49
in ref transcript
Oxysterol-binding protein.
smart PH 95aa 6e-13
in ref transcript
Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
OSBPL9
refseq_OSBPL9.F2 refseq_OSBPL9.R2 303 399
NCBIGene 36.3 114883
Single exon skipping, size difference: 96
Exclusion in 5'UTR
Reference transcript: NM_148904
pfam Oxysterol_BP 342aa 1e-49
in ref transcript
Oxysterol-binding protein.
OSBPL9
refseq_OSBPL9.F4 refseq_OSBPL9.R4 173 252
NCBIGene 36.3 114883
Single exon skipping, size difference: 79
Exclusion in the protein causing a frameshift
Reference transcript: NM_148909
Changed!
cd PH_oxysterol_bp 93aa 1e-28
in ref transcript
Oxysterol binding protein (OSBP) Pleckstrin homology (PH) domain. Oxysterol binding proteins are a multigene family that is conserved in yeast, flies, worms, mammals and plants. They all contain a C-terminal oxysterol binding domain, and most contain an N-terminal PH domain. OSBP PH domains bind to membrane phosphoinositides and thus likely play an important role in intracellular targeting. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
Changed!
pfam Oxysterol_BP 342aa 2e-49
in ref transcript
Oxysterol-binding protein.
Changed!
smart PH 95aa 6e-13
in ref transcript
Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
Changed!
cd PH_oxysterol_bp 49aa 2e-13
in modified transcript
Changed!
smart PH 51aa 0.001
in modified transcript
OSBPL9
refseq_OSBPL9.F6 refseq_OSBPL9.R6 130 253
NCBIGene 36.3 114883
Alternative 5-prime, size difference: 123
Inclusion in the protein causing a new stop codon
Reference transcript: NM_148907
Changed!
pfam Oxysterol_BP 342aa 1e-49
in ref transcript
Oxysterol-binding protein.
OSCAR
refseq_OSCAR.F1 refseq_OSCAR.R1 134 158
NCBIGene 36.2 126014
Alternative 5-prime and 3-prime, size difference: 24
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_206818
OTOF
refseq_OTOF.F1 refseq_OTOF.R1 117 315
NCBIGene 36.3 9381
Single exon skipping, size difference: 198
Exclusion of the stop codon
Reference transcript: NM_194248
cd C2 90aa 5e-14
in ref transcript
Protein kinase C conserved region 2 (CalB); Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands.
cd C2 91aa 5e-09
in ref transcript
cd C2 110aa 7e-09
in ref transcript
cd C2 100aa 4e-05
in ref transcript
cd C2 99aa 0.006
in ref transcript
pfam FerB 78aa 2e-32
in ref transcript
FerB (NUC096) domain. This is central domain B in proteins of the Ferlin family.
pfam FerI 72aa 3e-31
in ref transcript
FerI (NUC094) domain. This domain is present in proteins of the Ferlin family. It is often located between two C2 domains.
pfam C2 84aa 2e-15
in ref transcript
C2 domain.
pfam C2 90aa 6e-11
in ref transcript
smart C2 103aa 5e-10
in ref transcript
Protein kinase C conserved region 2 (CalB). Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotamins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates, and intracellular proteins. Unusual occurrence in perforin. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands. SMART detects C2 domains using one or both of two profiles.
smart C2 97aa 4e-07
in ref transcript
smart C2 105aa 0.001
in ref transcript
COG COG5038 88aa 9e-11
in ref transcript
Ca2+-dependent lipid-binding protein, contains C2 domain [General function prediction only].
COG COG5038 107aa 8e-04
in ref transcript
OTOF
refseq_OTOF.F3 refseq_OTOF.R3 136 196
NCBIGene 36.3 9381
Alternative 3-prime, size difference: 60
Exclusion in the protein (no frameshift)
Reference transcript: NM_194248
cd C2 90aa 5e-14
in ref transcript
Protein kinase C conserved region 2 (CalB); Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands.
cd C2 91aa 5e-09
in ref transcript
cd C2 110aa 7e-09
in ref transcript
cd C2 100aa 4e-05
in ref transcript
cd C2 99aa 0.006
in ref transcript
pfam FerB 78aa 2e-32
in ref transcript
FerB (NUC096) domain. This is central domain B in proteins of the Ferlin family.
pfam FerI 72aa 3e-31
in ref transcript
FerI (NUC094) domain. This domain is present in proteins of the Ferlin family. It is often located between two C2 domains.
pfam C2 84aa 2e-15
in ref transcript
C2 domain.
pfam C2 90aa 6e-11
in ref transcript
smart C2 103aa 5e-10
in ref transcript
Protein kinase C conserved region 2 (CalB). Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotamins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates, and intracellular proteins. Unusual occurrence in perforin. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands. SMART detects C2 domains using one or both of two profiles.
smart C2 97aa 4e-07
in ref transcript
smart C2 105aa 0.001
in ref transcript
COG COG5038 88aa 9e-11
in ref transcript
Ca2+-dependent lipid-binding protein, contains C2 domain [General function prediction only].
COG COG5038 107aa 8e-04
in ref transcript
OTX2
refseq_OTX2.F1 refseq_OTX2.R1 125 149
NCBIGene 36.3 5015
Alternative 3-prime, size difference: 24
Exclusion in the protein (no frameshift)
Reference transcript: NM_021728
cd homeodomain 56aa 3e-16
in ref transcript
Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
ACC channels are probably hetero- or homomultimers and transport small monovalent cations (Me+). Some also transport Ca2+; a few also transport small metabolites.
Changed!
TIGR P2X 32aa 6e-06
in modified transcript
P2RX2
refseq_P2RX2.F2 refseq_P2RX2.R2 136 208
NCBIGene 36.3 22953
Single exon skipping, size difference: 72
Exclusion in the protein (no frameshift)
Reference transcript: NM_170683
Changed!
TIGR P2X 392aa 1e-178
in ref transcript
ACC channels are probably hetero- or homomultimers and transport small monovalent cations (Me+). Some also transport Ca2+; a few also transport small metabolites.
Changed!
TIGR P2X 368aa 1e-163
in modified transcript
P2RX5
refseq_P2RX5.F1 refseq_P2RX5.R1 294 369
NCBIGene 36.3 5026
Single exon skipping, size difference: 72
Exclusion in the protein (no frameshift)
Reference transcript: NM_002561
Changed!
pfam P2X_receptor 344aa 0.0
in ref transcript
ATP P2X receptor.
Changed!
pfam P2X_receptor 320aa 1e-169
in modified transcript
P2RX5
refseq_P2RX5.F2 refseq_P2RX5.R3 126 198
NCBIGene 36.3 5026
Single exon skipping, size difference: 72
Exclusion in the protein (no frameshift)
Reference transcript: NM_002561
Changed!
pfam P2X_receptor 344aa 0.0
in ref transcript
ATP P2X receptor.
Changed!
pfam P2X_receptor 320aa 1e-169
in modified transcript
P2RY10
refseq_P2RY10.F2 refseq_P2RY10.R2 187 379
NCBIGene 36.3 27334
Multiple exon skipping, size difference: 192
Exclusion in 5'UTR, Exclusion in 5'UTR
Reference transcript: NM_014499
pfam 7tm_1 230aa 1e-27
in ref transcript
7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
P2RY2
refseq_P2RY2.F1 refseq_P2RY2.R1 106 240
NCBIGene 36.3 5029
Alternative 3-prime, size difference: 134
Inclusion in 5'UTR
Reference transcript: NM_002564
pfam 7tm_1 251aa 2e-33
in ref transcript
7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
P2RY6
refseq_P2RY6.F1 refseq_P2RY6.R1 114 274
NCBIGene 36.3 5031
Single exon skipping, size difference: 160
Exclusion in 5'UTR
Reference transcript: NM_176796
pfam 7tm_1 250aa 2e-22
in ref transcript
7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
P2RY6
refseq_P2RY6.F3 refseq_P2RY6.R3 130 216
NCBIGene 36.3 5031
Single exon skipping, size difference: 86
Exclusion in 5'UTR
Reference transcript: NM_176796
pfam 7tm_1 250aa 2e-22
in ref transcript
7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
PAEP
refseq_PAEP.F1 refseq_PAEP.R1 146 175
NCBIGene 36.3 5047
Alternative 5-prime, size difference: 29
Exclusion in 3'UTR
Reference transcript: NM_001018049
pfam Lipocalin 142aa 2e-24
in ref transcript
Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. The family also encompasses the enzyme prostaglandin D synthase (EC:5.3.99.2). Alignment subsumes both the lipocalin and fatty acid binding protein signatures from PROSITE. This is supported on structural and functional grounds. The structure is an eight-stranded beta barrel.
PAGE5
refseq_PAGE5.F2 refseq_PAGE5.R2 107 306
NCBIGene 36.3 90737
Alternative 5-prime, size difference: 199
Exclusion of the protein initiation site
Reference transcript: NM_130467
pfam GAGE 103aa 8e-06
in ref transcript
GAGE protein. This family consists of several GAGE and XAGE proteins which are found exclusively in humans. The function of this family is unknown although they have been implicated in human cancers.
PAIP1
refseq_PAIP1.F1 refseq_PAIP1.R1 160 397
NCBIGene 36.3 10605
Alternative 5-prime, size difference: 237
Exclusion in the protein (no frameshift)
Reference transcript: NM_006451
smart MIF4G 217aa 2e-30
in ref transcript
Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA. Ponting (TiBS) "Novel eIF4G domain homologues (in press).
PAK4
refseq_PAK4.F1 refseq_PAK4.R1 110 336
NCBIGene 36.3 10298
Exon skipping and alternative 3-prime or 5-prime, size difference: 226
Exclusion in 5'UTR, Exclusion in 5'UTR
Reference transcript: NM_001014831
cd STKc_PAK_II 285aa 1e-173
in ref transcript
Serine/threonine kinases (STKs), p21-activated kinase (PAK) subfamily, Group II, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The PAK subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. PAKs are Rho family GTPase-regulated kinases that serve as important mediators in the function of Cdc42 (cell division cycle 42) and Rac. PAKs from higher eukaryotes are classified into two groups (I and II), according to their biochemical and structural features. Group II PAKs, also called non-conventional PAKs, include PAK4, PAK5, and PAK6. Group II PAKs contain PBD (p21-binding domain) and catalytic domains, but lack other motifs found in group I PAKs, such as an AID (autoinhibitory domain) and SH3 binding sites. Since group II PAKs do not contain an obvious AID, they may be regulated differently from group I PAKs. While group I PAKs interact with the SH3 containing proteins Nck, Grb2 and PIX, no such binding has been demonstrated for group II PAKs. Some known substrates of group II PAKs are also substrates of group I PAKs such as Raf, BAD, LIMK and GEFH1. Unique group II substrates include MARK/Par-1 and PDZ-RhoGEF. Group II PAKs play important roles in filopodia formation, neuron extension, cytoskeletal organization, and cell survival.
cd CRIB_PAK_like 39aa 4e-11
in ref transcript
PAK (p21 activated kinase) Binding Domain (PBD), binds Cdc42p- and/or Rho-like small GTPases; also known as the Cdc42/Rac interactive binding (CRIB) motif; has been shown to inhibit transcriptional activation and cell transformation mediated by the Ras-Rac pathway. This subgroup of CRIB/PBD-domains is found N-terminal of Serine/Threonine kinase domains in PAK and PAK-like proteins.
smart S_TKc 242aa 5e-76
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
pfam PBD 55aa 1e-11
in ref transcript
P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB).
COG SPS1 248aa 2e-35
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
PAK7
refseq_PAK7.F1 refseq_PAK7.R1 159 308
NCBIGene 36.3 57144
Single exon skipping, size difference: 149
Exclusion in 5'UTR
Reference transcript: NM_020341
cd STKc_PAK_II 285aa 1e-175
in ref transcript
Serine/threonine kinases (STKs), p21-activated kinase (PAK) subfamily, Group II, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The PAK subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. PAKs are Rho family GTPase-regulated kinases that serve as important mediators in the function of Cdc42 (cell division cycle 42) and Rac. PAKs from higher eukaryotes are classified into two groups (I and II), according to their biochemical and structural features. Group II PAKs, also called non-conventional PAKs, include PAK4, PAK5, and PAK6. Group II PAKs contain PBD (p21-binding domain) and catalytic domains, but lack other motifs found in group I PAKs, such as an AID (autoinhibitory domain) and SH3 binding sites. Since group II PAKs do not contain an obvious AID, they may be regulated differently from group I PAKs. While group I PAKs interact with the SH3 containing proteins Nck, Grb2 and PIX, no such binding has been demonstrated for group II PAKs. Some known substrates of group II PAKs are also substrates of group I PAKs such as Raf, BAD, LIMK and GEFH1. Unique group II substrates include MARK/Par-1 and PDZ-RhoGEF. Group II PAKs play important roles in filopodia formation, neuron extension, cytoskeletal organization, and cell survival.
cd CRIB_PAK_like 47aa 2e-11
in ref transcript
PAK (p21 activated kinase) Binding Domain (PBD), binds Cdc42p- and/or Rho-like small GTPases; also known as the Cdc42/Rac interactive binding (CRIB) motif; has been shown to inhibit transcriptional activation and cell transformation mediated by the Ras-Rac pathway. This subgroup of CRIB/PBD-domains is found N-terminal of Serine/Threonine kinase domains in PAK and PAK-like proteins.
smart S_TKc 242aa 3e-75
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
pfam PBD 58aa 1e-12
in ref transcript
P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB).
PTZ PTZ00263 245aa 3e-34
in ref transcript
protein kinase A catalytic subunit; Provisional.
PALM
refseq_PALM.F1 refseq_PALM.R1 146 278
NCBIGene 36.3 5064
Single exon skipping, size difference: 132
Exclusion in the protein (no frameshift)
Reference transcript: NM_002579
Changed!
pfam Paralemmin 318aa 1e-102
in ref transcript
Paralemmin.
Changed!
pfam Paralemmin 274aa 5e-74
in modified transcript
Changed!
pfam Paralemmin 336aa 3e-34
in ref transcript
Paralemmin.
Changed!
pfam Paralemmin 302aa 6e-33
in modified transcript
PAM
refseq_PAM.F1 refseq_PAM.R1 100 421
NCBIGene 36.3 5066
Single exon skipping, size difference: 321
Exclusion in the protein (no frameshift)
Reference transcript: NM_000919
pfam Cu2_monoox_C 154aa 1e-69
in ref transcript
Copper type II ascorbate-dependent monooxygenase, C-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold.
pfam Cu2_monooxygen 138aa 2e-59
in ref transcript
Copper type II ascorbate-dependent monooxygenase, N-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold.
pfam NHL 28aa 8e-04
in ref transcript
NHL repeat. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies. It is about 40 residues long and resembles the WD repeat pfam00400. The repeats have a catalytic activity in the peptidyl-glycine alpha-amidating monooxygenase (PAM), proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localised to the repeats. The human tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is me diated by the NHL repeats.
pfam NHL 28aa 0.006
in ref transcript
pfam NHL 30aa 0.007
in ref transcript
COG COG3391 247aa 6e-07
in ref transcript
Uncharacterized conserved protein [Function unknown].
Changed!
pfam NHL 29aa 0.010
in modified transcript
PANK1
refseq_PANK1.F1 refseq_PANK1.R1 152 329
NCBIGene 36.3 53354
Single exon skipping, size difference: 177
Exclusion in the protein (no frameshift)
Reference transcript: NM_148977
Changed!
pfam Fumble 356aa 1e-131
in ref transcript
Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organisation, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isoforms, all of which contain a domain with high similarity to the pantothenate kinases of A. nidulans and mouse. A role of fumble in membrane synthesis has been proposed.
Changed!
pfam Fumble 297aa 8e-94
in modified transcript
Changed!
COG PanK 291aa 5e-22
in modified transcript
PAOX
refseq_PAOX.F2 refseq_PAOX.R2 159 317
NCBIGene 36.3 196743
Single exon skipping, size difference: 158
Exclusion in the protein causing a frameshift
Reference transcript: NM_152911
Changed!
pfam Amino_oxidase 455aa 5e-52
in ref transcript
Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. The family also contains phytoene dehydrogenases and related enzymes. In vertebrates MAO plays an important role regulating the intracellular levels of amines via there oxidation; these include various neurotransmitters, neurotoxins and trace amines. In lower eukaryotes such as aspergillus and in bacteria the main role of amine oxidases is to provide a source of ammonium. PAOs in plants, bacteria and protozoa oxidase spermidine and spermine to an aminobutyral, diaminopropane and hydrogen peroxide and are involved in the catabolism of polyamines. Other members of this family include tryptophan 2-monooxygenase, putrescine oxidase, corticosteroid binding proteins and antibacterial glycoproteins.
Changed!
COG COG1231 232aa 8e-12
in ref transcript
Monoamine oxidase [Amino acid transport and metabolism].
Changed!
pfam Amino_oxidase 391aa 9e-35
in modified transcript
Changed!
COG COG1231 88aa 6e-08
in modified transcript
PAOX
refseq_PAOX.F4 refseq_PAOX.R4 105 358
NCBIGene 36.3 196743
Single exon skipping, size difference: 253
Exclusion in the protein causing a frameshift
Reference transcript: NM_152911
Changed!
pfam Amino_oxidase 455aa 5e-52
in ref transcript
Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. The family also contains phytoene dehydrogenases and related enzymes. In vertebrates MAO plays an important role regulating the intracellular levels of amines via there oxidation; these include various neurotransmitters, neurotoxins and trace amines. In lower eukaryotes such as aspergillus and in bacteria the main role of amine oxidases is to provide a source of ammonium. PAOs in plants, bacteria and protozoa oxidase spermidine and spermine to an aminobutyral, diaminopropane and hydrogen peroxide and are involved in the catabolism of polyamines. Other members of this family include tryptophan 2-monooxygenase, putrescine oxidase, corticosteroid binding proteins and antibacterial glycoproteins.
Changed!
COG COG1231 232aa 8e-12
in ref transcript
Monoamine oxidase [Amino acid transport and metabolism].
Changed!
pfam Amino_oxidase 267aa 5e-18
in modified transcript
PAPD5
refseq_PAPD5.F2 refseq_PAPD5.R2 252 393
NCBIGene 36.3 64282
Alternative 3-prime, size difference: 141
Exclusion in the protein (no frameshift)
Reference transcript: NM_001040284
pfam PAP_assoc 61aa 3e-14
in ref transcript
Poly(A) polymerase. This domain is found in poly(A) polymerases and has been shown to have polynucleotide adenylyltransferase activity. Proteins in this family have been located to both the nucleus and cytoplasm.
pfam NTP_transf_2 77aa 2e-09
in ref transcript
Nucleotidyltransferase domain. Members of this family belong to a large family of nucleotidyltransferases. This family includes kanamycin nucleotidyltransferase (KNTase) which is a plasmid-coded enzyme responsible for some types of bacterial resistance to aminoglycosides. KNTase in-activates antibiotics by catalysing the addition of a nucleotidyl group onto the drug.
COG TRF4 262aa 3e-44
in ref transcript
DNA polymerase sigma [DNA replication, recombination, and repair].
PAQR6
refseq_PAQR6.F1 refseq_PAQR6.R1 104 146
NCBIGene 36.3 79957
Alternative 5-prime, size difference: 42
Exclusion in 5'UTR
Reference transcript: NM_024897
pfam HlyIII 149aa 1e-04
in ref transcript
Haemolysin-III related. Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It has been proposed that YOL002c encodes a Saccharomyces cerevisiae protein that plays a key role in metabolic pathways that regulate lipid and phosphate metabolism.
PAQR6
refseq_PAQR6.F3 refseq_PAQR6.R3 103 174
NCBIGene 36.3 79957
Alternative 5-prime, size difference: 71
Exclusion in the protein causing a frameshift
Reference transcript: NM_024897
Changed!
pfam HlyIII 149aa 1e-04
in ref transcript
Haemolysin-III related. Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It has been proposed that YOL002c encodes a Saccharomyces cerevisiae protein that plays a key role in metabolic pathways that regulate lipid and phosphate metabolism.
Changed!
pfam HlyIII 165aa 3e-10
in modified transcript
PARL
refseq_PARL.F2 refseq_PARL.R2 252 402
NCBIGene 36.3 55486
Single exon skipping, size difference: 150
Exclusion in the protein (no frameshift)
Reference transcript: NM_018622
Changed!
pfam Rhomboid 142aa 1e-21
in ref transcript
Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth factor Spitz, allowing it to activate the Drosophila EGF receptor. Analysis has shown that Rhomboid-1 is an intramembrane serine protease (EC:3.4.21.105). Parasite-encoded rhomboid enzymes are also important for invasion of host cells by Toxoplasma and the malaria parasite.
Changed!
COG GlpG 198aa 2e-13
in ref transcript
Uncharacterized membrane protein (homolog of Drosophila rhomboid) [General function prediction only].
Changed!
pfam Rhomboid 100aa 4e-09
in modified transcript
Changed!
COG GlpG 119aa 5e-04
in modified transcript
PARP3
refseq_PARP3.F2 refseq_PARP3.R2 263 385
NCBIGene 36.3 10039
Exon skipping and alternative 3-prime or 5-prime, size difference: 122
Inclusion in the protein causing a new stop codon, Inclusion in the protein causing a frameshift
Reference transcript: NM_001003931
Changed!
cd parp_like 351aa 1e-99
in ref transcript
Poly(ADP-ribose) polymerase (parp) catalytic domain catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ribose) polymerase is a regulatory component induced by DNA damage. The carboxyl-terminal region is the most highly conserved region of the protein. Experiments have shown that a carboxyl 40 kDa fragment is still catalytically active. Poly(ADP-ribose)-like polymerases (PARPS 1-3, VPARP, tankyrase) catalyze the addition of up to 100 ADP_ribose units from NAD+. PARPs 1 and 2 are localized in the nucleaus, bind DNA, and are activated by DNA damage. VPARP is part of the vault ribonucleoprotein complex. Tankyrases regulates telomere length through interactions with telomere repeat binding factor 1.
Changed!
pfam PARP 211aa 5e-53
in ref transcript
Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ribose) polymerase is a regulatory component induced by DNA damage. The carboxyl-terminal region is the most highly conserved region of the protein. Experiments have shown that a carboxyl 40 kDa fragment is still catalytically active.
Changed!
pfam WGR 64aa 2e-22
in ref transcript
WGR domain. This domain is found in a variety of polyA polymerases as well as the Escherichia coli molybdate metabolism regulator and other proteins of unknown function. I have called this domain WGR after the most conserved central motif of the domain. The domain is found in isolation in some proteins and is between 70 and 80 residues in length. I propose that this may be a nucleic acid binding domain.
Changed!
pfam PARP_reg 136aa 2e-20
in ref transcript
Poly(ADP-ribose) polymerase, regulatory domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ribose) polymerase is a regulatory component induced by DNA damage. The carboxyl-terminal region is the most highly conserved region of the protein. Experiments have shown that a carboxyl 40 kDa fragment is still catalytically active.
Changed!
COG COG3831 66aa 1e-04
in ref transcript
Uncharacterized conserved protein [Function unknown].
PAX2
refseq_PAX2.F1 refseq_PAX2.R1 149 232
NCBIGene 36.3 5076
Single exon skipping, size difference: 83
Inclusion in the protein causing a frameshift
Reference transcript: NM_003990
cd PAX 127aa 2e-62
in ref transcript
Paired Box domain.
smart PAX 125aa 1e-69
in ref transcript
Paired Box domain.
COG COG3415 92aa 0.003
in ref transcript
Transposase and inactivated derivatives [DNA replication, recombination, and repair].
PAX2
refseq_PAX2.F4 refseq_PAX2.R4 105 124
NCBIGene 36.3 5076
Alternative 3-prime, size difference: 19
Inclusion in the protein causing a frameshift
Reference transcript: NM_003990
cd PAX 127aa 2e-62
in ref transcript
Paired Box domain.
smart PAX 125aa 1e-69
in ref transcript
Paired Box domain.
COG COG3415 92aa 0.003
in ref transcript
Transposase and inactivated derivatives [DNA replication, recombination, and repair].
PAX2
refseq_PAX2.F5 refseq_PAX2.R5 162 231
NCBIGene 36.3 5076
Single exon skipping, size difference: 69
Exclusion in the protein (no frameshift)
Reference transcript: NM_003990
cd PAX 127aa 2e-62
in ref transcript
Paired Box domain.
smart PAX 125aa 1e-69
in ref transcript
Paired Box domain.
COG COG3415 92aa 0.003
in ref transcript
Transposase and inactivated derivatives [DNA replication, recombination, and repair].
PAX6
refseq_PAX6.F1 refseq_PAX6.R1 237 335
NCBIGene 36.3 5080
Alternative 5-prime, size difference: 98
Inclusion in 5'UTR
Reference transcript: NM_001604
cd PAX 141aa 2e-58
in ref transcript
Paired Box domain.
cd homeodomain 57aa 1e-15
in ref transcript
Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
Changed!
cd PAX 127aa 4e-61
in modified transcript
Changed!
smart PAX 125aa 7e-66
in modified transcript
PAX8
refseq_PAX8.F2 refseq_PAX8.R2 153 232
NCBIGene 36.3 7849
Alternative 3-prime, size difference: 79
Inclusion in 3'UTR
Reference transcript: NM_013952
cd PAX 127aa 4e-59
in ref transcript
Paired Box domain.
smart PAX 125aa 5e-65
in ref transcript
Paired Box domain.
COG COG3415 92aa 0.004
in ref transcript
Transposase and inactivated derivatives [DNA replication, recombination, and repair].
PBRM1
refseq_PB1.F1 refseq_PB1.R1 220 295
NCBIGene 36.3 55193
Alternative 3-prime, size difference: 75
Exclusion in the protein (no frameshift)
Reference transcript: NM_018165
cd Bromo_polybromo_I 109aa 3e-53
in ref transcript
Bromodomain, polybromo repeat I. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
cd Bromo_polybromo_VI 108aa 1e-51
in ref transcript
Bromodomain, polybromo repeat VI. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
cd Bromo_polybromo_V 105aa 5e-49
in ref transcript
Bromodomain, polybromo repeat V. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
cd Bromo_polybromo_III 102aa 8e-49
in ref transcript
Bromodomain, polybromo repeat III. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
Changed!
cd BAH_polybromo 118aa 3e-48
in ref transcript
BAH, or Bromo Adjacent Homology domain, as present in polybromo and yeast RSC1/2. The human polybromo protein (BAF180) is a component of the SWI/SNF chromatin-remodeling complex PBAF. It is thought that polybromo participates in transcriptional regulation. Saccharomyces cerevisiae RSC1 and RSC2 are part of the 15-subunit nucleosome remodeling RSC complex. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
cd Bromo_polybromo_II 103aa 3e-46
in ref transcript
Bromodomain, polybromo repeat II. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
cd BAH_polybromo 119aa 2e-44
in ref transcript
cd Bromo_polybromo_IV 95aa 2e-35
in ref transcript
Bromodomain, polybromo repeat IV. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
cd HMG-box 44aa 2e-06
in ref transcript
High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
Changed!
pfam BAH 119aa 2e-33
in ref transcript
BAH domain. This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction.
smart BROMO 113aa 9e-26
in ref transcript
bromo domain.
smart BAH 116aa 6e-25
in ref transcript
Bromo adjacent homology domain.
smart BROMO 107aa 3e-23
in ref transcript
smart BROMO 104aa 5e-22
in ref transcript
smart BROMO 103aa 1e-21
in ref transcript
pfam Bromodomain 87aa 3e-14
in ref transcript
Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
smart BROMO 101aa 2e-13
in ref transcript
smart HMG 44aa 5e-07
in ref transcript
high mobility group.
COG COG5076 281aa 4e-20
in ref transcript
Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription].
COG COG5076 205aa 2e-16
in ref transcript
COG COG5076 136aa 4e-16
in ref transcript
Changed!
COG COG5076 392aa 4e-09
in ref transcript
Changed!
cd BAH_polybromo 93aa 3e-29
in modified transcript
Changed!
pfam BAH 94aa 8e-20
in modified transcript
PBRM1
refseq_PB1.F3 refseq_PB1.R3 158 314
NCBIGene 36.3 55193
Single exon skipping, size difference: 156
Exclusion in the protein (no frameshift)
Reference transcript: NM_018165
cd Bromo_polybromo_I 109aa 3e-53
in ref transcript
Bromodomain, polybromo repeat I. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
cd Bromo_polybromo_VI 108aa 1e-51
in ref transcript
Bromodomain, polybromo repeat VI. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
cd Bromo_polybromo_V 105aa 5e-49
in ref transcript
Bromodomain, polybromo repeat V. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
cd Bromo_polybromo_III 102aa 8e-49
in ref transcript
Bromodomain, polybromo repeat III. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
cd BAH_polybromo 118aa 3e-48
in ref transcript
BAH, or Bromo Adjacent Homology domain, as present in polybromo and yeast RSC1/2. The human polybromo protein (BAF180) is a component of the SWI/SNF chromatin-remodeling complex PBAF. It is thought that polybromo participates in transcriptional regulation. Saccharomyces cerevisiae RSC1 and RSC2 are part of the 15-subunit nucleosome remodeling RSC complex. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
cd Bromo_polybromo_II 103aa 3e-46
in ref transcript
Bromodomain, polybromo repeat II. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
cd BAH_polybromo 119aa 2e-44
in ref transcript
cd Bromo_polybromo_IV 95aa 2e-35
in ref transcript
Bromodomain, polybromo repeat IV. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
Changed!
cd HMG-box 44aa 2e-06
in ref transcript
High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
pfam BAH 119aa 2e-33
in ref transcript
BAH domain. This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction.
smart BROMO 113aa 9e-26
in ref transcript
bromo domain.
smart BAH 116aa 6e-25
in ref transcript
Bromo adjacent homology domain.
smart BROMO 107aa 3e-23
in ref transcript
smart BROMO 104aa 5e-22
in ref transcript
smart BROMO 103aa 1e-21
in ref transcript
pfam Bromodomain 87aa 3e-14
in ref transcript
Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
smart BROMO 101aa 2e-13
in ref transcript
Changed!
smart HMG 44aa 5e-07
in ref transcript
high mobility group.
COG COG5076 281aa 4e-20
in ref transcript
Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription].
COG COG5076 205aa 2e-16
in ref transcript
COG COG5076 136aa 4e-16
in ref transcript
COG COG5076 392aa 4e-09
in ref transcript
Changed!
cd HMG-box 47aa 2e-07
in modified transcript
Changed!
smart HMG 47aa 4e-08
in modified transcript
PBRM1
refseq_PB1.F5 refseq_PB1.R5 292 373
NCBIGene 36.3 55193
Alternative 5-prime, size difference: 81
Exclusion in the protein (no frameshift)
Reference transcript: NM_018165
cd Bromo_polybromo_I 109aa 3e-53
in ref transcript
Bromodomain, polybromo repeat I. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
cd Bromo_polybromo_VI 108aa 1e-51
in ref transcript
Bromodomain, polybromo repeat VI. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
cd Bromo_polybromo_V 105aa 5e-49
in ref transcript
Bromodomain, polybromo repeat V. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
cd Bromo_polybromo_III 102aa 8e-49
in ref transcript
Bromodomain, polybromo repeat III. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
cd BAH_polybromo 118aa 3e-48
in ref transcript
BAH, or Bromo Adjacent Homology domain, as present in polybromo and yeast RSC1/2. The human polybromo protein (BAF180) is a component of the SWI/SNF chromatin-remodeling complex PBAF. It is thought that polybromo participates in transcriptional regulation. Saccharomyces cerevisiae RSC1 and RSC2 are part of the 15-subunit nucleosome remodeling RSC complex. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
cd Bromo_polybromo_II 103aa 3e-46
in ref transcript
Bromodomain, polybromo repeat II. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
cd BAH_polybromo 119aa 2e-44
in ref transcript
cd Bromo_polybromo_IV 95aa 2e-35
in ref transcript
Bromodomain, polybromo repeat IV. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
cd HMG-box 44aa 2e-06
in ref transcript
High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
pfam BAH 119aa 2e-33
in ref transcript
BAH domain. This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction.
smart BROMO 113aa 9e-26
in ref transcript
bromo domain.
smart BAH 116aa 6e-25
in ref transcript
Bromo adjacent homology domain.
smart BROMO 107aa 3e-23
in ref transcript
smart BROMO 104aa 5e-22
in ref transcript
smart BROMO 103aa 1e-21
in ref transcript
pfam Bromodomain 87aa 3e-14
in ref transcript
Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
smart BROMO 101aa 2e-13
in ref transcript
smart HMG 44aa 5e-07
in ref transcript
high mobility group.
COG COG5076 281aa 4e-20
in ref transcript
Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription].
COG COG5076 205aa 2e-16
in ref transcript
COG COG5076 136aa 4e-16
in ref transcript
COG COG5076 392aa 4e-09
in ref transcript
PBRM1
refseq_PB1.F7 refseq_PB1.R7 150 246
NCBIGene 36.3 55193
Single exon skipping, size difference: 96
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_018165
cd Bromo_polybromo_I 109aa 3e-53
in ref transcript
Bromodomain, polybromo repeat I. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
cd Bromo_polybromo_VI 108aa 1e-51
in ref transcript
Bromodomain, polybromo repeat VI. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
cd Bromo_polybromo_V 105aa 5e-49
in ref transcript
Bromodomain, polybromo repeat V. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
cd Bromo_polybromo_III 102aa 8e-49
in ref transcript
Bromodomain, polybromo repeat III. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
cd BAH_polybromo 118aa 3e-48
in ref transcript
BAH, or Bromo Adjacent Homology domain, as present in polybromo and yeast RSC1/2. The human polybromo protein (BAF180) is a component of the SWI/SNF chromatin-remodeling complex PBAF. It is thought that polybromo participates in transcriptional regulation. Saccharomyces cerevisiae RSC1 and RSC2 are part of the 15-subunit nucleosome remodeling RSC complex. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
cd Bromo_polybromo_II 103aa 3e-46
in ref transcript
Bromodomain, polybromo repeat II. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
cd BAH_polybromo 119aa 2e-44
in ref transcript
cd Bromo_polybromo_IV 95aa 2e-35
in ref transcript
Bromodomain, polybromo repeat IV. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
cd HMG-box 44aa 2e-06
in ref transcript
High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
pfam BAH 119aa 2e-33
in ref transcript
BAH domain. This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction.
smart BROMO 113aa 9e-26
in ref transcript
bromo domain.
smart BAH 116aa 6e-25
in ref transcript
Bromo adjacent homology domain.
smart BROMO 107aa 3e-23
in ref transcript
smart BROMO 104aa 5e-22
in ref transcript
smart BROMO 103aa 1e-21
in ref transcript
pfam Bromodomain 87aa 3e-14
in ref transcript
Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
smart BROMO 101aa 2e-13
in ref transcript
smart HMG 44aa 5e-07
in ref transcript
high mobility group.
COG COG5076 281aa 4e-20
in ref transcript
Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription].
COG COG5076 205aa 2e-16
in ref transcript
COG COG5076 136aa 4e-16
in ref transcript
COG COG5076 392aa 4e-09
in ref transcript
PCDH11X
refseq_PCDH11X.F1 refseq_PCDH11X.R1 105 135
NCBIGene 36.3 27328
Single exon skipping, size difference: 30
Exclusion in the protein (no frameshift)
Reference transcript: NM_032968
cd CA 194aa 1e-45
in ref transcript
Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here.
cd CA 201aa 6e-40
in ref transcript
cd CA 196aa 1e-37
in ref transcript
cd CA 197aa 2e-35
in ref transcript
pfam Protocadherin 199aa 6e-67
in ref transcript
Protocadherin. The structure of protocadherins is similar to that of classic cadherins (pfam00028), but particularly on the cytoplasmic domains they also have some unique features. They are expressed in a variety of organisms and are found in high concentrations in the brain where they seem to be localised mainly at cell-cell contact sites. Their expression seems to be developmentally regulated.
smart CA 77aa 5e-19
in ref transcript
Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
smart CA 77aa 7e-19
in ref transcript
pfam Cadherin 80aa 7e-16
in ref transcript
Cadherin domain.
smart CA 84aa 7e-13
in ref transcript
pfam Cadherin 82aa 1e-10
in ref transcript
smart CA 79aa 2e-09
in ref transcript
pfam Cadherin_2 85aa 6e-06
in ref transcript
Cadherin-like. This cadherin domain is usually the most N-terminal copy of the domain.
PCDH8
refseq_PCDH8.F2 refseq_PCDH8.R2 126 417
NCBIGene 36.3 5100
Alternative 5-prime, size difference: 291
Exclusion in the protein (no frameshift)
Reference transcript: NM_002590
cd CA 201aa 4e-43
in ref transcript
Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here.
cd CA 206aa 2e-33
in ref transcript
cd CA 238aa 2e-28
in ref transcript
cd CA 200aa 5e-20
in ref transcript
smart CA 86aa 6e-19
in ref transcript
Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
smart CA 78aa 5e-17
in ref transcript
smart CA 79aa 1e-13
in ref transcript
smart CA 84aa 5e-13
in ref transcript
pfam Cadherin_2 79aa 6e-09
in ref transcript
Cadherin-like. This cadherin domain is usually the most N-terminal copy of the domain.
pfam Cadherin 81aa 2e-07
in ref transcript
Cadherin domain.
PCDH9
refseq_PCDH9.F1 refseq_PCDH9.R1 293 395
NCBIGene 36.3 5101
Single exon skipping, size difference: 102
Exclusion in the protein (no frameshift)
Reference transcript: NM_203487
cd CA 195aa 1e-45
in ref transcript
Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here.
cd CA 201aa 4e-39
in ref transcript
cd CA 195aa 1e-38
in ref transcript
cd CA 204aa 8e-38
in ref transcript
cd CA 204aa 5e-35
in ref transcript
cd CA 212aa 6e-22
in ref transcript
pfam Protocadherin 225aa 9e-65
in ref transcript
Protocadherin. The structure of protocadherins is similar to that of classic cadherins (pfam00028), but particularly on the cytoplasmic domains they also have some unique features. They are expressed in a variety of organisms and are found in high concentrations in the brain where they seem to be localised mainly at cell-cell contact sites. Their expression seems to be developmentally regulated.
smart CA 77aa 2e-20
in ref transcript
Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
smart CA 77aa 2e-19
in ref transcript
pfam Cadherin 92aa 2e-17
in ref transcript
Cadherin domain.
smart CA 84aa 7e-14
in ref transcript
smart CA 79aa 1e-11
in ref transcript
smart CA 79aa 1e-07
in ref transcript
pfam Cadherin_2 90aa 4e-07
in ref transcript
Cadherin-like. This cadherin domain is usually the most N-terminal copy of the domain.
PCGF6
refseq_PCGF6.F2 refseq_PCGF6.R2 129 354
NCBIGene 36.3 84108
Multiple exon skipping, size difference: 225
Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift), Exclusion in the protein causing a frameshift
Reference transcript: NM_001011663
cd RING 43aa 4e-06
in ref transcript
RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
pfam zf-C3HC4 39aa 1e-06
in ref transcript
Zinc finger, C3HC4 type (RING finger). The C3HC4 type zinc-finger (RING finger) is a cysteine-rich domain of 40 to 60 residues that coordinates two zinc ions, and has the consensus sequence: C-X2-C-X(9-39)-C-X(1-3)-H-X(2-3)-C-X2-C-X(4-48)-C-X2-C where X is any amino acid. Many proteins containing a RING finger play a key role in the ubiquitination pathway.
Pecanex protein (C-terminus). This family consists of C terminal region of the pecanex protein homologues. The pecanex protein is a maternal-effect neurogenic gene found in Drosophila.
MED15
refseq_PCQAP.F1 refseq_PCQAP.R1 146 266
NCBIGene 36.3 51586
Single exon skipping, size difference: 120
Exclusion in the protein (no frameshift)
Reference transcript: NM_001003891
pfam ARC105_Med_act 65aa 7e-29
in ref transcript
ARC105 domain. The approx. 70 residue ARC105 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, ARC105 is a critical transducer of gene activation signals that control early metazoan development.
PCTK3
refseq_PCTK3.F1 refseq_PCTK3.R1 148 238
NCBIGene 36.3 5129
Alternative 5-prime, size difference: 90
Exclusion in the protein (no frameshift)
Reference transcript: NM_212503
cd S_TKc 282aa 3e-64
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
smart S_TKc 272aa 1e-68
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
PTZ PTZ00024 284aa 1e-44
in ref transcript
cyclin-dependent protein kinase; Provisional.
PDCD4
refseq_PDCD4.F1 refseq_PDCD4.R1 290 374
NCBIGene 36.3 27250
Single exon skipping, size difference: 84
Inclusion in the protein causing a new stop codon
Reference transcript: NM_014456
Changed!
smart MA3 114aa 1e-29
in ref transcript
Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains Ponting (TIBS) "Novel eIF4G domain homologues" in press.
Changed!
smart MA3 112aa 8e-28
in ref transcript
PDE8A
refseq_PDE8A.F1 refseq_PDE8A.R1 169 367
NCBIGene 36.2 5151
Single exon skipping, size difference: 198
Inclusion in the protein causing a new stop codon
Reference transcript: NM_002605
Changed!
cd PAS 99aa 3e-06
in ref transcript
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.
Changed!
cd HDc 195aa 4e-05
in ref transcript
Metal dependent phosphohydrolases with conserved 'HD' motif.
Changed!
pfam PDEase_I 255aa 2e-55
in ref transcript
3'5'-cyclic nucleotide phosphodiesterase.
Changed!
pfam PAS 106aa 2e-07
in ref transcript
PAS fold. The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs. The PAS fold appears in archaea, eubacteria and eukarya.
pfam Response_reg 117aa 5e-07
in ref transcript
Response regulator receiver domain. This domain receives the signal from the sensor partner in bacterial two-component systems. It is usually found N-terminal to a DNA binding effector domain.
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.
cd HDc 191aa 0.001
in ref transcript
Metal dependent phosphohydrolases with conserved 'HD' motif.
pfam PDEase_I 251aa 3e-52
in ref transcript
3'5'-cyclic nucleotide phosphodiesterase.
pfam PDE8 52aa 2e-23
in ref transcript
PDE8 phosphodiesterase. This region is found in members of the PDE8 phosphodiesterase family. It is found with pfam00233.
pfam Response_reg 118aa 1e-12
in ref transcript
Response regulator receiver domain. This domain receives the signal from the sensor partner in bacterial two-component systems. It is usually found N-terminal to a DNA binding effector domain.
Changed!
TIGR nifL_nitrog 106aa 5e-08
in ref transcript
NifL is a modulator of the nitrogen fixation positive regulator protein NifA, and is therefore a negative regulator. It binds NifA. NifA and NifL are encoded by adjacent genes.
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.
cd HDc 191aa 0.001
in ref transcript
Metal dependent phosphohydrolases with conserved 'HD' motif.
pfam PDEase_I 251aa 3e-52
in ref transcript
3'5'-cyclic nucleotide phosphodiesterase.
pfam PDE8 52aa 2e-23
in ref transcript
PDE8 phosphodiesterase. This region is found in members of the PDE8 phosphodiesterase family. It is found with pfam00233.
pfam Response_reg 118aa 1e-12
in ref transcript
Response regulator receiver domain. This domain receives the signal from the sensor partner in bacterial two-component systems. It is usually found N-terminal to a DNA binding effector domain.
TIGR nifL_nitrog 106aa 5e-08
in ref transcript
NifL is a modulator of the nitrogen fixation positive regulator protein NifA, and is therefore a negative regulator. It binds NifA. NifA and NifL are encoded by adjacent genes.
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.
cd HDc 191aa 0.001
in ref transcript
Metal dependent phosphohydrolases with conserved 'HD' motif.
pfam PDEase_I 251aa 3e-52
in ref transcript
3'5'-cyclic nucleotide phosphodiesterase.
pfam PDE8 52aa 2e-23
in ref transcript
PDE8 phosphodiesterase. This region is found in members of the PDE8 phosphodiesterase family. It is found with pfam00233.
pfam Response_reg 118aa 1e-12
in ref transcript
Response regulator receiver domain. This domain receives the signal from the sensor partner in bacterial two-component systems. It is usually found N-terminal to a DNA binding effector domain.
TIGR nifL_nitrog 106aa 5e-08
in ref transcript
NifL is a modulator of the nitrogen fixation positive regulator protein NifA, and is therefore a negative regulator. It binds NifA. NifA and NifL are encoded by adjacent genes.
Metal dependent phosphohydrolases with conserved 'HD' motif.
Changed!
pfam PDEase_I 223aa 8e-56
in ref transcript
3'5'-cyclic nucleotide phosphodiesterase.
PDE9A
refseq_PDE9A.F2 refseq_PDE9A.R2 113 293
NCBIGene 36.3 5152
Single exon skipping, size difference: 180
Exclusion in the protein (no frameshift)
Reference transcript: NM_002606
cd HDc 174aa 6e-10
in ref transcript
Metal dependent phosphohydrolases with conserved 'HD' motif.
pfam PDEase_I 223aa 8e-56
in ref transcript
3'5'-cyclic nucleotide phosphodiesterase.
PDE9A
refseq_PDE9A.F3 refseq_PDE9A.R3 140 211
NCBIGene 36.3 5152
Single exon skipping, size difference: 71
Exclusion in the protein causing a frameshift
Reference transcript: NM_002606
Changed!
cd HDc 174aa 6e-10
in ref transcript
Metal dependent phosphohydrolases with conserved 'HD' motif.
Changed!
pfam PDEase_I 223aa 8e-56
in ref transcript
3'5'-cyclic nucleotide phosphodiesterase.
PDE9A
refseq_PDE9A.F5 refseq_PDE9A.R5 116 201
NCBIGene 36.3 5152
Single exon skipping, size difference: 85
Exclusion in the protein causing a frameshift
Reference transcript: NM_002606
Changed!
cd HDc 174aa 6e-10
in ref transcript
Metal dependent phosphohydrolases with conserved 'HD' motif.
Changed!
pfam PDEase_I 223aa 8e-56
in ref transcript
3'5'-cyclic nucleotide phosphodiesterase.
PDE9A
refseq_PDE9A.F8 refseq_PDE9A.R8 113 191
NCBIGene 36.3 5152
Single exon skipping, size difference: 78
Exclusion in the protein (no frameshift)
Reference transcript: NM_002606
cd HDc 174aa 6e-10
in ref transcript
Metal dependent phosphohydrolases with conserved 'HD' motif.
pfam PDEase_I 223aa 8e-56
in ref transcript
3'5'-cyclic nucleotide phosphodiesterase.
PDGFA
refseq_PDGFA.F1 refseq_PDGFA.R1 143 212
NCBIGene 36.3 5154
Single exon skipping, size difference: 69
Exclusion of the stop codon
Reference transcript: NM_002607
cd PDGF 89aa 3e-18
in ref transcript
Platelet-derived and vascular endothelial growth factors (PDGF, VEGF) family domain; PDGF is a potent activator for cells of mesenchymal origin; PDGF-A and PDGF-B form AA and BB homodimers and an AB heterodimer; VEGF is a potent mitogen in embryonic and somatic angiogenesis with a unique specificity for vascular endothelial cells; VEGF forms homodimers and exists in 4 different isoforms; overall, the VEGF monomer resembles that of PDGF, but its N-terminal segment is helical rather than extended; the cysteine knot motif is a common feature of this domain.
pfam PDGF 84aa 3e-33
in ref transcript
Platelet-derived growth factor (PDGF).
pfam PDGF_N 75aa 3e-25
in ref transcript
Platelet-derived growth factor, N terminal region. This family consists of the amino terminal regions of platelet-derived growth factor (PDGF, pfam00341) A and B chains.
PDGFD
refseq_PDGFD.F1 refseq_PDGFD.R1 100 118
NCBIGene 36.3 80310
Alternative 3-prime, size difference: 18
Exclusion in the protein (no frameshift)
Reference transcript: NM_025208
cd CUB 114aa 2e-19
in ref transcript
CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
cd PDGF 96aa 2e-17
in ref transcript
Platelet-derived and vascular endothelial growth factors (PDGF, VEGF) family domain; PDGF is a potent activator for cells of mesenchymal origin; PDGF-A and PDGF-B form AA and BB homodimers and an AB heterodimer; VEGF is a potent mitogen in embryonic and somatic angiogenesis with a unique specificity for vascular endothelial cells; VEGF forms homodimers and exists in 4 different isoforms; overall, the VEGF monomer resembles that of PDGF, but its N-terminal segment is helical rather than extended; the cysteine knot motif is a common feature of this domain.
smart CUB 106aa 5e-19
in ref transcript
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein. This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.
smart PDGF 92aa 0.002
in ref transcript
Platelet-derived and vascular endothelial growth factors (PDGF, VEGF) family. Platelet-derived growth factor is a potent activator for cells of mesenchymal origin. PDGF-A and PDGF-B form AA and BB homodimers and an AB heterodimer. Members of the VEGF family are homologues of PDGF.
PDLIM2
refseq_PDLIM2.F1 refseq_PDLIM2.R1 106 224
NCBIGene 36.3 64236
Single exon skipping, size difference: 118
Exclusion in the protein causing a frameshift
Reference transcript: NM_176871
cd PDZ_signaling 77aa 6e-15
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
smart PDZ 74aa 1e-14
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
Changed!
pfam LIM 45aa 0.001
in ref transcript
LIM domain. This family represents two copies of the LIM structural domain.
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
smart PDZ 74aa 2e-17
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
pfam LIM 56aa 7e-12
in ref transcript
LIM domain. This family represents two copies of the LIM structural domain.
Inclusion in the protein causing a frameshift, Inclusion in the protein (no stop codon or frameshift), Exclusion in the protein causing a frameshift
Reference transcript: NM_001011515
cd PDZ_signaling 73aa 2e-15
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
smart PDZ 74aa 1e-16
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_006457
cd PDZ_signaling 73aa 6e-16
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
smart PDZ 74aa 2e-17
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
pfam LIM 56aa 7e-12
in ref transcript
LIM domain. This family represents two copies of the LIM structural domain.
Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_005451
cd PDZ_signaling 75aa 9e-10
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
pfam LIM 56aa 1e-11
in ref transcript
LIM domain. This family represents two copies of the LIM structural domain.
pfam LIM 52aa 5e-11
in ref transcript
smart PDZ 80aa 3e-10
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
pfam LIM 53aa 4e-09
in ref transcript
Changed!
PRK PRK12323 140aa 0.001
in ref transcript
DNA polymerase III subunits gamma and tau; Provisional.
PDPK1
refseq_PDPK1.F1 refseq_PDPK1.R1 100 481
NCBIGene 36.3 5170
Multiple exon skipping, size difference: 381
Exclusion in the protein (no frameshift), Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_002613
Changed!
cd STKc_PDK1 263aa 1e-120
in ref transcript
STKc_PDK1: Serine/Threonine Kinases (STKs), Phosphoinositide-dependent kinase 1 (PDK1) subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The PDK1 subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase (PI3K). PDK1 carries an N-terminal catalytic domain and a C-terminal pleckstrin homology (PH) domain that binds phosphoinositides. It phosphorylates the activation loop of AGC kinases that are regulated by PI3K such as PKB, SGK, and PKC, among others, and is crucial for their activation. Thus, it contributes in regulating many processes including metabolism, growth, proliferation, and survival. PDK1 also has the ability to autophosphorylate and is constitutively active in mammalian cells. PDK1 is essential for normal embryo development and is important in regulating cell volume.
cd PH_PDK1 88aa 6e-36
in ref transcript
3-Phosphoinositide dependent protein kinase 1 (PDK1) pleckstrin homology (PH) domain. PDK1 contains an N-terminal serine/threonine kinase domain followed by a PH domain. Following binding of the PH domain to PtdIns(3,4,5)P3 and PtdIns(3,4)P2, PDK1 activates kinases such as Akt (PKB). PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
Changed!
smart S_TKc 251aa 1e-78
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
Changed!
PTZ PTZ00263 276aa 1e-64
in ref transcript
protein kinase A catalytic subunit; Provisional.
Changed!
cd STKc_PDK1 135aa 8e-46
in modified transcript
Changed!
smart S_TKc 118aa 6e-26
in modified transcript
Changed!
smart STYKc 28aa 8e-04
in modified transcript
Protein kinase; unclassified specificity. Phosphotransferases. The specificity of this class of kinases can not be predicted. Possible dual-specificity Ser/Thr/Tyr kinase.
Changed!
PTZ PTZ00263 134aa 5e-20
in modified transcript
PECI
refseq_PECI.F2 refseq_PECI.R2 144 215
NCBIGene 36.3 10455
Single exon skipping, size difference: 71
Inclusion in the protein causing a new stop codon
Reference transcript: NM_006117
Changed!
cd crotonase-like 195aa 5e-47
in ref transcript
Crotonase/Enoyl-Coenzyme A (CoA) hydratase superfamily. This superfamily contains a diverse set of enzymes including enoyl-CoA hydratase, napthoate synthase, methylmalonyl-CoA decarboxylase, 3-hydoxybutyryl-CoA dehydratase, and dienoyl-CoA isomerase. Many of these play important roles in fatty acid metabolism. In addition to a conserved structural core and the formation of trimers (or dimers of trimers), a common feature in this superfamily is the stabilization of an enolate anion intermediate derived from an acyl-CoA substrate. This is accomplished by two conserved backbone NH groups in active sites that form an oxyanion hole.
cd ACBP 76aa 5e-23
in ref transcript
Acyl CoA binding protein (ACBP) binds thiol esters of long fatty acids and coenzyme A in a one-to-one binding mode with high specificity and affinity. Acyl-CoAs are important intermediates in fatty lipid synthesis and fatty acid degradation and play a role in regulation of intermediary metabolism and gene regulation. The suggested role of ACBP is to act as a intracellular acyl-CoA transporter and pool former. ACBPs are present in a large group of eukaryotic species and several tissue-specific isoforms have been detected.
Changed!
TIGR PaaB1 216aa 1e-22
in ref transcript
This family of proteins are found within apparent operons for the degradation of phenylacetic acid. These proteins contain the enoyl-CoA hydratase domain as detected by pfam00378. This activity is consistent with current hypotheses for the degradation pathway which involve the ligation of phenylacetate with coenzyme A (paaF), hydroxylation (paaGHIJK), ring-opening (paaN) and degradation of the resulting fatty acid-like compound to a Krebs cycle intermediate (paaABCDE).
pfam ACBP 76aa 2e-21
in ref transcript
Acyl CoA binding protein.
Changed!
PRK PRK06688 232aa 3e-46
in ref transcript
enoyl-CoA hydratase; Provisional.
COG ACB 75aa 2e-16
in ref transcript
Acyl-CoA-binding protein [Lipid metabolism].
Changed!
cd crotonase-like 26aa 0.002
in modified transcript
Changed!
PRK PRK06688 31aa 0.002
in modified transcript
PER2
refseq_PER2.F1 refseq_PER2.R1 110 217
NCBIGene 36.2 8864
Single exon skipping, size difference: 107
Exclusion in the protein causing a frameshift
Reference transcript: NM_022817
Changed!
cd PAS 102aa 2e-08
in ref transcript
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.
Changed!
pfam PAS_3 85aa 2e-08
in ref transcript
PAS fold. The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs. The PAS fold appears in archaea, eubacteria and eukarya.
PEX10
refseq_PEX10.F1 refseq_PEX10.R1 222 282
NCBIGene 36.3 5192
Alternative 3-prime, size difference: 60
Exclusion in the protein (no frameshift)
Reference transcript: NM_153818
cd RING 42aa 1e-07
in ref transcript
RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
Changed!
pfam Pex2_Pex12 169aa 1e-16
in ref transcript
Pex2 / Pex12 amino terminal region. This region is found at the N terminal of a number of known and predicted peroxins including Pex2, Pex10 and Pex12. This conserved region is usually associated with a C terminal ring finger (pfam00097) domain.
smart RING 38aa 2e-08
in ref transcript
Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
COG PEX10 55aa 7e-12
in ref transcript
RING-finger-containing E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones].
Changed!
pfam Pex2_Pex12 175aa 3e-19
in modified transcript
PFDN5
refseq_PFDN5.F1 refseq_PFDN5.R1 142 277
NCBIGene 36.3 5204
Multiple exon skipping, size difference: 135
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_002624
Changed!
cd Prefoldin_alpha 128aa 2e-25
in ref transcript
Prefoldin alpha subunit; Prefoldin is a hexameric molecular chaperone complex, found in both eukaryotes and archaea, that binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The complex contains two alpha and four beta subunits, the two subunits being evolutionarily related. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the structure of the hexamer. The structure of the complex consists of a double beta barrel assembly with six protruding coiled-coils.
Changed!
TIGR TIGR00293 127aa 2e-31
in ref transcript
This model finds a set of small proteins from the Archaea and from Aquifex aeolicus that may represent two orthologous groups. The proteins are predicted to be mostly coiled coil, and builds of HMMs for the seed alignment with less selective parameters lead to significant hits to large numbers of proteins that contain coiled coil regions. This model is built with a more selective usage of Dirichlet priors.
Changed!
COG GIM5 130aa 3e-16
in ref transcript
Predicted prefoldin, molecular chaperone implicated in de novo protein folding [Posttranslational modification, protein turnover, chaperones].
Changed!
cd Prefoldin_alpha 79aa 5e-13
in modified transcript
Changed!
TIGR TIGR00293 70aa 2e-16
in modified transcript
Changed!
COG GIM5 67aa 2e-06
in modified transcript
PFN2
refseq_PFN2.F2 refseq_PFN2.R2 100 422
NCBIGene 36.3 5217
Alternative 3-prime, size difference: 322
Inclusion in the protein causing a new stop codon
Reference transcript: NM_002628
Changed!
cd PROF 137aa 2e-26
in ref transcript
Profilin binds actin monomers, membrane polyphosphoinositides such as PI(4,5)P2, and poly-L-proline. Profilin can inhibit actin polymerization into F-actin by binding to monomeric actin (G-actin) and terminal F-actin subunits, but - as a regulator of the cytoskeleton - it may also promote actin polymerization. It plays a role in the assembly of branched actin filament networks, by activating WASP via binding to WASP's proline rich domain. Profilin may link the cytoskeleton with major signalling pathways by interacting with components of the phosphatidylinositol cycle and Ras pathway.
Changed!
smart PROF 138aa 2e-33
in ref transcript
Profilin. Binds actin monomers, membrane polyphosphoinositides and poly-L-proline.
Changed!
cd PROF 138aa 2e-25
in modified transcript
Changed!
smart PROF 139aa 2e-32
in modified transcript
CBY1
refseq_PGEA1.F1 refseq_PGEA1.R1 151 291
NCBIGene 36.3 25776
Single exon skipping, size difference: 140
Exclusion of the protein initiation site
Reference transcript: NM_001002880
PHACTR3
refseq_PHACTR3.F2 refseq_PHACTR3.R2 115 325
NCBIGene 36.3 116154
Single exon skipping, size difference: 210
Exclusion in the protein (no frameshift)
Reference transcript: NM_080672
PHF1
refseq_PHF1.F1 refseq_PHF1.R1 299 394
NCBIGene 36.3 5252
Single exon skipping, size difference: 95
Exclusion in the protein causing a frameshift
Reference transcript: NM_024165
smart TUDOR 52aa 1e-06
in ref transcript
Tudor domain. Domain of unknown function present in several RNA-binding proteins. 10 copies in the Drosophila Tudor protein. Initial proposal that the survival motor neuron gene product contain a Tudor domain are corroborated by more recent database search techniques such as PSI-BLAST (unpublished).
pfam PHD 53aa 4e-06
in ref transcript
PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
Changed!
COG COG5141 84aa 0.009
in ref transcript
PHD zinc finger-containing protein [General function prediction only].
PHF1
refseq_PHF1.F3 refseq_PHF1.R3 285 391
NCBIGene 36.3 5252
Single exon skipping, size difference: 95
Exclusion in the protein causing a frameshift
Reference transcript: NM_024165
smart TUDOR 52aa 1e-06
in ref transcript
Tudor domain. Domain of unknown function present in several RNA-binding proteins. 10 copies in the Drosophila Tudor protein. Initial proposal that the survival motor neuron gene product contain a Tudor domain are corroborated by more recent database search techniques such as PSI-BLAST (unpublished).
pfam PHD 53aa 4e-06
in ref transcript
PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
Changed!
COG COG5141 84aa 0.009
in ref transcript
PHD zinc finger-containing protein [General function prediction only].
PHF2
refseq_PHF2.F1 refseq_PHF2.R1 100 136
NCBIGene 36.2 5253
Exon skipping and alternative 3-prime or 5-prime, size difference: 36
Inclusion in the protein causing a frameshift, Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_005392
pfam JmjC 101aa 2e-17
in ref transcript
JmjC domain. The JmjC domain belongs to the Cupin superfamily. JmjC-domain proteins may be protein hydroxylases that catalyse a novel histone modification.
Changed!
smart PHD 47aa 1e-08
in ref transcript
PHD zinc finger. The plant homeodomain (PHD) finger is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in epigenetics and chromatin-mediated transcriptional regulation. The PHD finger binds two zinc ions using the so-called 'cross-brace' motif and is thus structurally related to the R ING finger and the FY VE finger. It is not yet known if PHD fingers have a common molecular function. Several reports suggest that it can function as a protein-protein interacton domain and it was recently demonstrated that the PHD finger of p300 can cooperate with the adjacent B ROMO domain in nucleosome binding in vitro. Other reports suggesting that the PHD finger is a ubiquitin ligase have been refuted as these domains were R ING fingers misidentified as PHD fingers.
smart JmjC 64aa 5e-06
in ref transcript
A domain family that is part of the cupin metalloenzyme superfamily. Probable enzymes, but of unknown functions, that regulate chromatin reorganisation processes (Clissold and Ponting, in press).
Tudor domains are found in many eukaryotic organisms and have been implicated in protein-protein interactions in which methylated protein substrates bind to these domains. For example, the Tudor domain of Survival of Motor Neuron (SMN) binds to symmetrically dimethylated arginines of arginine-glycine (RG) rich sequences found in the C-terminal tails of Sm proteins. The SMN protein is linked to spinal muscular atrophy. Another example is the tandem tudor domains of 53BP1, which bind to histone H4 specifically dimethylated at Lys20 (H4-K20me2). 53BP1 is a key transducer of the DNA damage checkpoint signal.
smart TUDOR 54aa 3e-06
in ref transcript
Tudor domain. Domain of unknown function present in several RNA-binding proteins. 10 copies in the Drosophila Tudor protein. Initial proposal that the survival motor neuron gene product contain a Tudor domain are corroborated by more recent database search techniques such as PSI-BLAST (unpublished).
smart MBT 66aa 3e-05
in ref transcript
Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. These proteins are involved in transcriptional regulation.
pfam PHD 44aa 1e-04
in ref transcript
PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
PHF7
refseq_PHF7.F2 refseq_PHF7.R2 276 393
NCBIGene 36.3 51533
Single exon skipping, size difference: 117
Exclusion in the protein (no frameshift)
Reference transcript: NM_016483
PHKB
refseq_PHKB.F1 refseq_PHKB.R1 113 225
NCBIGene 36.3 5257
Single exon skipping, size difference: 112
Inclusion in the protein causing a new stop codon
Reference transcript: NM_000293
Changed!
pfam PHK_AB 1040aa 0.0
in ref transcript
Phosphorylase kinase alpha/beta. This family consists of several eukaryotic phosphorylase kinase alpha and beta subunits. Phosphorylase kinase (PHK) is a regulatory enzyme in glycogen metabolism. Mutations in the gene encoding the alpha subunit of PHK (PHKA2) have been shown to be responsible for X-linked liver glycogenosis (XLG). XLG, a frequent type of glycogen storage disease, is characterised by hepatomegaly and growth retardation.
Calcium binding and coiled-coil domain (CALCOCO1) like. Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.
Changed!
COG COG5411 328aa 5e-40
in ref transcript
Changed!
smart IPPc 305aa 2e-73
in modified transcript
Changed!
COG COG5411 283aa 2e-39
in modified transcript
PICALM
refseq_PICALM.F2 refseq_PICALM.R2 103 127
NCBIGene 36.3 8301
Single exon skipping, size difference: 24
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_007166
cd ANTH_AP180_CALM 117aa 1e-37
in ref transcript
ANTH domain family; composed of adaptor protein 180 (AP180), clathrin assembly lymphoid myeloid leukemia protein (CALM) and similar proteins. A set of proteins previously designated as harboring an ENTH domain in fact contains a highly similar, yet unique module referred to as an AP180 N-terminal homology (ANTH) domain. AP180 and CALM play important roles in clathrin-mediated endocytosis. AP180 is a brain-specific clathrin-binding protein which stimulates clathrin assembly during the recycling of synaptic vesicles. The ANTH domain is structurally similar to the VHS domain and is composed of a superhelix of eight alpha helices. ANTH domains bind both inositol phospholipids and proteins, and contribute to the nucleation and formation of clathrin coats on membranes. ANTH-bearing proteins have recently been shown to function with adaptor protein-1 and GGA adaptors at the trans-Golgi network, which suggests that the ANTH domain is a universal component of the machinery for clathrin-mediated membrane budding.
pfam ANTH 265aa 7e-80
in ref transcript
ANTH domain. AP180 is an endocytotic accessory proteins that has been implicated in the formation of clathrin-coated pits. The domain is involved in phosphatidylinositol 4,5-bisphosphate binding and is a universal adaptor for nucleation of clathrin coats.
PICK1
refseq_PICK1.F1 refseq_PICK1.R1 173 226
NCBIGene 36.3 9463
Alternative 5-prime, size difference: 53
Exclusion in 5'UTR
Reference transcript: NM_001039583
cd Arfaptin 215aa 2e-68
in ref transcript
Arfaptin domain; arfaptin is a ubiquitously expressed protein implicated in mediating cross-talk between Rac, a member of the Rho family, and Arf small GTPases; Arfaptin binds to GTP-bound Arf1 and Arf6, but binds Rac.GTP and Rac.GDP with similar affinities. Structures of Arfaptin with Rac bound to either GDP or the slowly hydrolysable analogue GMPPNP show that the switch regions adopt similar conformations in both complexes. Arf1 and Arf6 are thought to bind to the same surface as Rac.
cd PDZ_signaling 77aa 5e-11
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
pfam Arfaptin 216aa 2e-74
in ref transcript
Arfaptin-like domain. Arfaptin interacts with ARF1, a small GTPase involved in vesicle budding at the Golgi complex and immature secretory granules. The structure of arfaptin shows that upon binding to a small GTPase, arfaptin forms a an elongated, crescent-shaped dimer of three-helix coiled-coils. The N-terminal region of ICA69 is similar to arfaptin.
smart PDZ 81aa 2e-12
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
Phosphatidylinositol N-acetylglucosaminyltransferase. Glycosylphosphatidylinositol (GPI) represents an important anchoring molecule for cell surface proteins. The first step in its synthesis is the transfer of N-acetylglucosamine (GlcNAc) from UDP-N-acetylglucosamine to phosphatidylinositol (PI). This step involves products of three or four genes in both yeast (GPI1, GPI2 and GPI3) and mammals (GPI1, PIG A, PIG H and PIG C), respectively.
PIGF
refseq_PIGF.F1 refseq_PIGF.R1 206 311
NCBIGene 36.3 5281
Single exon skipping, size difference: 105
Inclusion in the protein causing a new stop codon
Reference transcript: NM_002643
Changed!
pfam PIG-F 206aa 2e-59
in ref transcript
GPI biosynthesis protein family Pig-F. PIG-F is involved in glycosylphosphatidylinositol (GPI) anchor biosynthesis.
Changed!
pfam PIG-F 182aa 5e-51
in modified transcript
PIGN
refseq_PIGN.F1 refseq_PIGN.R1 272 349
NCBIGene 36.3 23556
Single exon skipping, size difference: 77
Exclusion in 5'UTR
Reference transcript: NM_176787
pfam PigN 188aa 5e-43
in ref transcript
Phosphatidylinositolglycan class N (PIG-N). Phosphatidylinositolglycan class N (PIG-N) is a mammalian homologue of the yeast protein MCD4P and is expressed in the endoplasmic reticulum. PIG-N is essential for glycosylphosphatidylinositol anchor synthesis. Glycosylphosphatidylinositol (GPI)-anchored proteins are cell surface-localised proteins that serve many important cellular functions.
pfam PigN 202aa 5e-38
in ref transcript
PIGQ
refseq_PIGQ.F1 refseq_PIGQ.R1 298 360
NCBIGene 36.3 9091
Single exon skipping, size difference: 62
Inclusion in the protein causing a frameshift
Reference transcript: NM_148920
pfam Gpi1 172aa 1e-58
in ref transcript
N-acetylglucosaminyl transferase component (Gpi1). Glycosylphosphatidylinositol (GPI) represents an important anchoring molecule for cell surface proteins.The first step in its synthesis is the transfer of N-acetylglucosamine (GlcNAc) from UDP-N-acetylglucosamine to phosphatidylinositol (PI). This chemically simple step is genetically complex because three or four genes are required in both yeast (GPI1, GPI2 and GPI3) and mammals (GPI1, PIG A, PIG H and PIG C), respectively.
PILRA
refseq_PILRA.F1 refseq_PILRA.R1 118 337
NCBIGene 36.3 29992
Single exon skipping, size difference: 219
Exclusion in the protein (no frameshift)
Reference transcript: NM_013439
pfam V-set 115aa 9e-05
in ref transcript
Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
PILRB
refseq_PILRB.F1 refseq_PILRB.R1 150 218
NCBIGene 36.3 29990
Alternative 3-prime, size difference: 68
Inclusion in 5'UTR
Reference transcript: NM_013440
pfam V-set 92aa 3e-05
in ref transcript
Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
PILRB
refseq_PILRB.F3 refseq_PILRB.R3 114 213
NCBIGene 36.3 29990
Alternative 3-prime, size difference: 99
Exclusion in 5'UTR
Reference transcript: NM_013440
pfam V-set 92aa 3e-05
in ref transcript
Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
PIP5K1B
refseq_PIP5K1B.F1 refseq_PIP5K1B.R1 258 379
NCBIGene 36.2 8395
Single exon skipping, size difference: 121
Inclusion in the protein causing a frameshift
Reference transcript: NM_003558
Changed!
cd PIPKc 252aa 4e-81
in ref transcript
Phosphatidylinositol phosphate kinases (PIPK) catalyze the phosphorylation of phosphatidylinositol phosphate on the fourth or fifth hydroxyl of the inositol ring, to form phosphatidylinositol bisphosphate. CD alignment includes type II phosphatidylinositol phosphate kinases (PIPKII-beta), type I andII PIPK (-alpha, -beta, and -gamma) kinases and related yeast Fab1p and Mss4p kinases. Signaling by phosphorylated species of phosphatidylinositol regulates secretion, vesicular trafficking, membrane translocation, cell adhesion, chemotaxis, DNA synthesis, and cell cycling. The catalytic core domains of PIPKs are structurally similar to PI3K, PI4K, and cAMP-dependent protein kinases (PKA), the dimerization region is a unique feature of the PIPKs.
Changed!
cd PIPKc 57aa 5e-15
in ref transcript
Changed!
pfam PIP5K 289aa 1e-119
in ref transcript
Phosphatidylinositol-4-phosphate 5-Kinase. This family contains a region from the common kinase core found in the type I phosphatidylinositol-4-phosphate 5-kinase (PIP5K) family. The family consists of various type I, II and III PIP5K enzymes. PIP5K catalyses the formation of phosphoinositol-4,5-bisphosphate via the phosphorylation of phosphatidylinositol-4-phosphate a precursor in the phosphinositide signaling pathway.
Changed!
cd Fab1_TCP 245aa 1e-104
in ref transcript
TCP-1 like domain of the eukaryotic phosphatidylinositol 3-phosphate (PtdIns3P) 5-kinase Fab1. Fab1p is important for vacuole size regulation, presumably by modulating PtdIns(3,5)P2 effector activity. In the human homolog p235/PIKfyve deletion of this domain leads to loss of catalytic activity. However no exact function this domain has been defined. In general, chaperonins are involved in productive folding of proteins.
Changed!
cd PIPKc 314aa 5e-98
in ref transcript
Phosphatidylinositol phosphate kinases (PIPK) catalyze the phosphorylation of phosphatidylinositol phosphate on the fourth or fifth hydroxyl of the inositol ring, to form phosphatidylinositol bisphosphate. CD alignment includes type II phosphatidylinositol phosphate kinases (PIPKII-beta), type I andII PIPK (-alpha, -beta, and -gamma) kinases and related yeast Fab1p and Mss4p kinases. Signaling by phosphorylated species of phosphatidylinositol regulates secretion, vesicular trafficking, membrane translocation, cell adhesion, chemotaxis, DNA synthesis, and cell cycling. The catalytic core domains of PIPKs are structurally similar to PI3K, PI4K, and cAMP-dependent protein kinases (PKA), the dimerization region is a unique feature of the PIPKs.
Changed!
cd DEP_PIKfyve 82aa 1e-38
in ref transcript
DEP (Dishevelled, Egl-10, and Pleckstrin) domain found in fungal RhoGEF (GDP/GTP exchange factor) PIKfyve-like proteins. PIKfyve contains N-terminal Fyve finger and DEP domains, a central chaperonin-like domain and a C-terminal PIPK (phosphatidylinositol phosphate kinase) domain. PIKfyve-like proteins are important phosphatidylinositol (3)-monophosphate (PtdIns(3)P)-5-kinases, producing PtdIns(3,5)P2, which plays a major role in multivesicular body (MVB) sorting and control of retrograde traffic from the vacuole back to the endosome and/or Golgi. PIKfyve itself has been shown to be play a role in regulating early-endosome-to-trans-Golgi network (TGN) retrograde trafficking.
Changed!
cd FYVE 53aa 2e-15
in ref transcript
FYVE domain; Zinc-binding domain; targets proteins to membrane lipids via interaction with phosphatidylinositol-3-phosphate, PI3P; present in Fab1, YOTB, Vac1, and EEA1;.
Changed!
smart PIPKc 238aa 5e-72
in ref transcript
Phosphatidylinositol phosphate kinases.
Changed!
TIGR chap_CCT_gamma 224aa 8e-32
in ref transcript
Members of this family, all eukaryotic, are part of the group II chaperonin complex called CCT (chaperonin containing TCP-1) or TRiC. The archaeal equivalent group II chaperonin is often called the thermosome. Both are somewhat related to the group I chaperonin of bacterial, GroEL/GroES. This family consists exclusively of the CCT gamma chain (part of a paralogous family) from animals, plants, fungi, and other eukaryotes.
Changed!
pfam FYVE 60aa 9e-20
in ref transcript
FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positions. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. We have included members which do not conserve these histidine residues but are clearly related.
Changed!
smart DEP 73aa 3e-17
in ref transcript
Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signalling proteins that contain PH, rasGEF, rhoGEF, rhoGAP, RGS, PDZ domains. DEP domain in Drosophila dishevelled is essential to rescue planar polarity defects and induce JNK signalling (Cell 94, 109-118).
Chaperonin GroEL (HSP60 family) [Posttranslational modification, protein turnover, chaperones].
PIR
refseq_PIR.F1 FIGF.R4 192 226
NCBIGene 36.3 8544
Alternative 5-prime, size difference: 34
Exclusion in 5'UTR
Reference transcript: NM_003662
pfam Pirin 98aa 3e-33
in ref transcript
Pirin. This family consists of Pirin proteins from both eukaryotes and prokaryotes. The function of Pirin is unknown but the gene coding for this protein is known to be expressed in all tissues in the human body although it is expressed most strongly in the liver and heart. Pirin is known to be a nuclear protein, exclusively localised within the nucleoplasma and predominantly concentrated within dot-like subnuclear structures. A tomato homologue of human Pirin has been found to be induced during programmed cell death. Human Pirin interacts with Bcl-3 and NFI and hence is probably involved in the regulation of DNA transcription and replication. It appears to be an Fe(II)-containing member of the Cupin superfamily.
pfam Pirin_C 107aa 5e-33
in ref transcript
Pirin C-terminal cupin domain. This region is found the C-terminal half of the Pirin protein.
COG COG1741 275aa 2e-53
in ref transcript
Pirin-related protein [General function prediction only].
PITPNC1
refseq_PITPNC1.F2 refseq_PITPNC1.R2 263 382
NCBIGene 36.3 26207
Single exon skipping, size difference: 119
Inclusion in the protein causing a frameshift
Reference transcript: NM_012417
Changed!
pfam IP_trans 244aa 1e-73
in ref transcript
Phosphatidylinositol transfer protein. Along with the structurally unrelated Sec14p family (found in pfam00650), this family can bind/exchange one molecule of phosphatidylinositol (PI) or phosphatidylcholine (PC) and thus aids their transfer between different membrane compartments. There are three sub-families - all share an N-terminal PITP-like domain, whose sequence is highly conserved. It is described as consisting of three regions. The N-terminal region is thought to bind the lipid and contains two helices and an eight-stranded, mostly antiparallel beta-sheet. An intervening loop region, which is thought to play a role in protein-protein interactions, separates this from the C-terminal region, which exhibits the greatest sequence variation and may be involved in membrane binding. PITP alpha has a 16-fold greater affinity for PI than PC. Together with PITP beta, it is expressed ubiquitously in all tissues.
Changed!
pfam IP_trans 239aa 4e-73
in modified transcript
PITX2
refseq_PITX2.F2 refseq_PITX2.R2 122 260
NCBIGene 36.3 5308
Single exon skipping, size difference: 138
Exclusion in the protein (no frameshift)
Reference transcript: NM_153426
cd homeodomain 59aa 9e-15
in ref transcript
Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
pfam Homeobox 57aa 5e-21
in ref transcript
Homeobox domain.
pfam OAR 21aa 0.001
in ref transcript
OAR domain.
Changed!
COG COG5576 114aa 2e-08
in ref transcript
Changed!
COG COG5576 89aa 6e-08
in modified transcript
PJA1
refseq_PJA1.F2 refseq_PJA1.R2 100 426
NCBIGene 36.3 64219
Alternative 3-prime, size difference: 326
Exclusion of the protein initiation site
Reference transcript: NM_145119
cd RING 43aa 5e-08
in ref transcript
RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
smart RING 41aa 3e-07
in ref transcript
Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
COG COG5540 48aa 1e-08
in ref transcript
RING-finger-containing ubiquitin ligase [Posttranslational modification, protein turnover, chaperones].
PKD1L2
refseq_PKD1L2.F1 refseq_PKD1L2.R1 192 397
NCBIGene 36.2 114780
Single exon skipping, size difference: 205
Exclusion in 3'UTR
Reference transcript: NM_052892
cd PLAT_polycystin 120aa 2e-51
in ref transcript
PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane proteins composed of multiple domains, present in fish, invertebrates, mammals, and humans that are widely expressed in various cell types and whose biological functions remain poorly defined. In human, mutations in polycystin-1 (PKD1) and polycystin-2 (PKD2) have been shown to be the cause for autosomal dominant polycystic kidney disease (ADPKD). The generally proposed function of PLAT/LH2 domains is to mediate interaction with lipids or membrane bound proteins.
cd CLECT 116aa 2e-17
in ref transcript
CLECT: C-type lectin (CTL)/C-type lectin-like (CTLD) domain; protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. This group is chiefly comprised of eukaryotic CTLDs, but contains some, as yet functionally uncharacterized, bacterial CTLDs. Many CTLDs are calcium-dependent carbohydrate binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces, including CaCO3 and ice. Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions. For example: mannose-binding lectin and lung surfactant proteins A and D bind carbohydrates on surfaces (e.g. pathogens, allergens, necrotic, and apoptotic cells) and mediate functions associated with killing and phagocytosis; P (platlet)-, E (endothelial)-, and L (leukocyte)- selectins (sels) mediate the initial attachment, tethering, and rolling of lymphocytes on inflamed vascular walls enabling subsequent lymphocyte adhesion and transmigration. CTLDs may bind a variety of carbohydrate ligands including mannose, N-acetylglucosamine, galactose, N-acetylgalactosamine, and fucose. Several CTLDs bind to protein ligands, and only some of these binding interactions are Ca2+-dependent; including the CTLDs of Coagulation Factors IX/X (IX/X) and Von Willebrand Factor (VWF) binding proteins, and natural killer cell receptors. C-type lectins, such as lithostathine, and some type II antifreeze glycoproteins function in a Ca2+-independent manner to bind inorganic surfaces. Many proteins in this group contain a single CTLD; these CTLDs associate with each other through several different surfaces to form dimers, trimers, or tetramers, from which ligand-binding sites project in different orientations. Various vertebrate type 1 transmembrane proteins including macrophage mannose receptor, endo180, phospholipase A2 receptor, and dendritic and epithelial cell receptor (DEC205) have extracellular domains containing 8 or more CTLDs; these CTLDs remain in the parent model. In some members (IX/X and VWF binding proteins), a loop extends to the adjoining domain to form a loop-swapped dimer. A similar conformation is seen in the macrophage mannose receptor CRD4's putative non-sugar bound form of the domain in the acid environment of the endosome. Lineage specific expansions of CTLDs have occurred in several animal lineages including Drosophila melanogaster and Caenorhabditis elegans; these CTLDs also remain in the parent model.
smart CLECT 126aa 6e-21
in ref transcript
C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.
pfam PLAT 105aa 7e-15
in ref transcript
PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this domain binds to procolipase pfam01114, which mediates membrane association. So it appears possible that this domain mediates membrane attachment via other protein binding partners. The structure of this domain is known for many members of the family and is composed of a beta sandwich.
pfam GPS 40aa 2e-10
in ref transcript
Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin.
pfam REJ 203aa 2e-06
in ref transcript
REJ domain. The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor for egg jelly. The function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six completely conserved cysteine residues that may form disulphide bridges.
PKIA
refseq_PKIA.F1 refseq_PKIA.R1 228 357
NCBIGene 36.3 5569
Single exon skipping, size difference: 129
Exclusion in 5'UTR
Reference transcript: NM_006823
pfam PKI 74aa 9e-19
in ref transcript
cAMP-dependent protein kinase inhibitor. Members of this family are extremely potent competitive inhibitors of camp-dependent protein kinase activity. These proteins interact with the catalytic subunit of the enzyme after the cAMP-induced dissociation of its regulatory chains.
PKIB
refseq_PKIB.F2 refseq_PKIB.R2 249 316
NCBIGene 36.3 5570
Single exon skipping, size difference: 67
Exclusion in 5'UTR
Reference transcript: NM_181795
pfam PKI 46aa 2e-08
in ref transcript
cAMP-dependent protein kinase inhibitor. Members of this family are extremely potent competitive inhibitors of camp-dependent protein kinase activity. These proteins interact with the catalytic subunit of the enzyme after the cAMP-induced dissociation of its regulatory chains.
PKIG
refseq_PKIG.F1 refseq_PKIG.R1 275 380
NCBIGene 36.3 11142
Single exon skipping, size difference: 105
Exclusion in 5'UTR
Reference transcript: NM_181805
pfam PKI 74aa 1e-19
in ref transcript
cAMP-dependent protein kinase inhibitor. Members of this family are extremely potent competitive inhibitors of camp-dependent protein kinase activity. These proteins interact with the catalytic subunit of the enzyme after the cAMP-induced dissociation of its regulatory chains.
PKM2
refseq_PKM2.F1 refseq_PKM2.R1 145 361
NCBIGene 36.3 5315
Alternative 5-prime, size difference: 216
Exclusion in 5'UTR
Reference transcript: NM_182470
cd Pyruvate_Kinase 489aa 0.0
in ref transcript
Pyruvate kinase (PK): Large allosteric enzyme that regulates glycolysis through binding of the substrate, phosphoenolpyruvate, and one or more allosteric effectors. Like other allosteric enzymes, PK has a high substrate affinity R state and a low affinity T state. PK exists as several different isozymes, depending on organism and tissue type. In mammals, there are four PK isozymes: R, found in red blood cells, L, found in liver, M1, found in skeletal muscle, and M2, found in kidney, adipose tissue, and lung. PK forms a homotetramer, with each subunit containing three domains. The T state to R state transition of PK is more complex than in most allosteric enzymes, involving a concerted rotation of all 3 domains of each monomer in the homotetramer.
TIGR pyruv_kin 485aa 0.0
in ref transcript
This enzyme is a homotetramer. Some forms are active only in the presence of fructose-1,6-bisphosphate or similar phosphorylated sugars.
PRK PRK05826 472aa 1e-158
in ref transcript
pyruvate kinase; Provisional.
PKMYT1
refseq_PKMYT1.F1 refseq_PKMYT1.R1 106 144
NCBIGene 36.3 9088
Alternative 3-prime, size difference: 38
Inclusion in the protein causing a frameshift
Reference transcript: NM_004203
cd S_TKc 248aa 4e-50
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
pfam Pkinase 250aa 2e-48
in ref transcript
Protein kinase domain.
COG SPS1 261aa 8e-30
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
PKP1
refseq_PKP1.F1 refseq_PKP1.R1 120 183
NCBIGene 36.3 5317
Single exon skipping, size difference: 63
Exclusion in the protein (no frameshift)
Reference transcript: NM_000299
cd ARM 113aa 6e-20
in ref transcript
Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
cd ARM 129aa 4e-07
in ref transcript
smart ARM 41aa 3e-07
in ref transcript
Armadillo/beta-catenin-like repeats. Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
PKP2
refseq_PKP2.F2 refseq_PKP2.R2 184 316
NCBIGene 36.3 5318
Single exon skipping, size difference: 132
Exclusion in the protein (no frameshift)
Reference transcript: NM_004572
Changed!
cd ARM 118aa 3e-11
in ref transcript
Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
cd ARM 130aa 6e-06
in ref transcript
Changed!
cd ARM 118aa 3e-13
in modified transcript
Changed!
pfam Arm 42aa 0.001
in modified transcript
Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
PKP4
refseq_PKP4.F2 refseq_PKP4.R2 271 400
NCBIGene 36.3 8502
Single exon skipping, size difference: 129
Exclusion in the protein (no frameshift)
Reference transcript: NM_003628
cd ARM 123aa 3e-25
in ref transcript
Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
cd ARM 129aa 1e-11
in ref transcript
cd ARM 142aa 1e-04
in ref transcript
smart ARM 41aa 4e-07
in ref transcript
Armadillo/beta-catenin-like repeats. Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
smart ARM 36aa 2e-05
in ref transcript
smart ARM 42aa 3e-05
in ref transcript
PLA2G6
refseq_PLA2G6.F1 refseq_PLA2G6.R1 109 271
NCBIGene 36.3 8398
Single exon skipping, size difference: 162
Exclusion in the protein (no frameshift)
Reference transcript: NM_003560
cd Pat_PNPLA9 314aa 1e-164
in ref transcript
Patatin-like phospholipase domain containing protein 9. PNPLA9 is a Ca-independent phospholipase that catalyzes the hydrolysis of glycerophospholipids at the sn-2 position. PNPLA9 is also known as PLA2G6 (phospholipase A2 group VI) or iPLA2beta. PLA2G6 is stimulated by ATP and inhibited by bromoenol lactone (BEL). In humans, PNPLA9 in expressed ubiquitously and is involved in signal transduction, cell proliferation, and apoptotic cell death. Mutations in human PLA2G6 leads to infantile neuroaxonal dystrophy (INAD) and idiopathic neurodegeneration with brain iron accumulation (NBIA). This family includes PLA2G6 from Homo sapiens and Rattus norvegicus.
Changed!
cd ANK 119aa 1e-24
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
cd ANK 126aa 9e-20
in ref transcript
pfam Patatin 185aa 1e-33
in ref transcript
Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity of lipid acyl hydrolase, catalysing the cleavage of fatty acids from membrane lipids. Members of this family have been found also in vertebrates.
Changed!
TIGR trp 146aa 1e-04
in ref transcript
after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
COG COG3621 227aa 6e-19
in ref transcript
Patatin [General function prediction only].
COG Arp 204aa 2e-11
in ref transcript
FOG: Ankyrin repeat [General function prediction only].
Changed!
PTZ PTZ00322 110aa 3e-06
in ref transcript
Changed!
cd ANK 111aa 4e-24
in modified transcript
Changed!
TIGR trp 101aa 1e-04
in modified transcript
Changed!
pfam Ank 30aa 0.002
in modified transcript
Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
Changed!
TIGR trp 234aa 0.008
in modified transcript
PLCB1
refseq_PLCB1.F1 refseq_PLCB1.R1 234 352
NCBIGene 36.3 23236
Single exon skipping, size difference: 118
Inclusion in the protein causing a new stop codon
Reference transcript: NM_015192
cd PLCc 149aa 6e-42
in ref transcript
Phospholipase C, catalytic domain; Phosphoinositide-specific phospholipases C catalyze hydrolysis of phosphatidylinositol-4,5-bisphosphate (PIP2) to D-myo-inositol-1,4,5-trisphosphate (1,4,5-IP3) and sn-1,2-diacylglycerol (DAG). Both products function as second messengers in eukaryotic signal transduction cascades. 1,4,5-IP3 triggers inflow of calcium from intracellular stores; the membrane resident product DAG controls cellular protein phosphorylation states by activating various protein kinase C isozymes. The enzyme comprises 2 regions (X and Y) connected via a linker which may contain inserted domains, X and Y together form a TIM barrel-like structure containing the active site residues.
cd PH_PLC 121aa 2e-25
in ref transcript
Phospholipase C (PLC) pleckstrin homology (PH) domain. There are several isozymes of PLC (beta, gamma, delta, epsilon. zeta). While, PLC beta, gamma and delta all have N-terminal PH domains, lipid binding specificity is not conserved between them. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
cd PLCc 102aa 2e-19
in ref transcript
cd C2_2 118aa 5e-09
in ref transcript
Protein kinase C conserved region 2, subgroup 2; C2 Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (amongst others); some PKCs lack calcium dependence. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Two distinct C2 topologies generated by permutation of the sequence with respect to the N- and C-terminal beta strands are seen. In this subgroup, containing phospholipases C and D( PLC-1, PLD) and specific protein kinases C (PKC) subtypes, the C-terminal beta strand occupies the position of what is the N-terminal strand in subgroup 1.
pfam PI-PLC-Y 118aa 1e-59
in ref transcript
Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit.
pfam PI-PLC-X 147aa 2e-55
in ref transcript
Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit.
Changed!
pfam PLC-beta_C 165aa 1e-41
in ref transcript
PLC-beta C terminal. This domain corresponds to the alpha helical C terminal domain of phospholipase C beta.
pfam efhand_like 92aa 9e-22
in ref transcript
Phosphoinositide-specific phospholipase C, efhand-like. Members of this family are predominantly found in phosphoinositide-specific phospholipase C. They adopt a structure consisting of a core of four alpha helices, in an EF like fold, and are required for functioning of the enzyme.
smart C2 100aa 7e-10
in ref transcript
Protein kinase C conserved region 2 (CalB). Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotamins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates, and intracellular proteins. Unusual occurrence in perforin. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands. SMART detects C2 domains using one or both of two profiles.
Changed!
pfam PLC-beta_C 118aa 7e-31
in modified transcript
PLOD2
refseq_PLOD2.F1 refseq_PLOD2.R1 113 176
NCBIGene 36.3 5352
Single exon skipping, size difference: 63
Exclusion in the protein (no frameshift)
Reference transcript: NM_182943
smart P4Hc 173aa 1e-18
in ref transcript
Prolyl 4-hydroxylase alpha subunit homologues. Mammalian enzymes catalyse hydroxylation of collagen, for example. Prokaryotic enzymes might catalyse hydroxylation of antibiotic peptides. These are 2-oxoglutarate-dependent dioxygenases, requiring 2-oxoglutarate and dioxygen as cosubstrates and ferrous iron as a cofactor.
PLP1
refseq_PLP1.F1 refseq_PLP1.R1 218 323
NCBIGene 36.3 5354
Alternative 5-prime, size difference: 105
Exclusion in the protein (no frameshift)
Reference transcript: NM_000533
Changed!
pfam Myelin_PLP 276aa 1e-108
in ref transcript
Myelin proteolipid protein (PLP or lipophilin).
Changed!
pfam Myelin_PLP 241aa 1e-109
in modified transcript
PLTP
refseq_PLTP.F1 refseq_PLTP.R1 149 305
NCBIGene 36.3 5360
Single exon skipping, size difference: 156
Exclusion in the protein (no frameshift)
Reference transcript: NM_006227
Changed!
cd BPI1 219aa 2e-56
in ref transcript
BPI/LBP/CETP N-terminal domain; Bactericidal permeability-increasing protein (BPI) / Lipopolysaccharide-binding protein (LBP) / Cholesteryl ester transfer protein (CETP) N-terminal domain; binds to and neutralizes lipopolysaccharides from the outer membrane of Gram-negative bacteria.; Apolar pockets on the concave surface bind a molecule of phosphatidylcholine, primarily by interacting with their acyl chains; this suggests that the pockets may also bind the acyl chains of lipopolysaccharide.
cd BPI2 201aa 1e-47
in ref transcript
BPI/LBP/CETP C-terminal domain; Bactericidal permeability-increasing protein (BPI) / Lipopolysaccharide-binding protein (LBP) / Cholesteryl ester transfer protein (CETP) C-terminal domain; binds to and neutralizes lipopolysaccharides from the outer membrane of Gram-negative bacteria.; Apolar pockets on the concave surface bind a molecule of phosphatidylcholine, primarily by interacting with their acyl chains; this suggests that the pockets may also bind the acyl chains of lipopolysaccharide.
pfam LBP_BPI_CETP_C 237aa 7e-78
in ref transcript
LBP / BPI / CETP family, C-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar.
Changed!
smart BPI1 219aa 5e-53
in ref transcript
BPI/LBP/CETP N-terminal domain. Bactericidal permeability-increasing protein (BPI) / Lipopolysaccharide-binding protein (LBP) / Cholesteryl ester transfer protein (CETP) N-terminal domain.
Changed!
cd BPI1 167aa 9e-33
in modified transcript
Changed!
smart BPI1 167aa 9e-35
in modified transcript
PLUNC
refseq_PLUNC.F1 refseq_PLUNC.R1 184 239
NCBIGene 36.3 51297
Alternative 3-prime, size difference: 55
Inclusion in 3'UTR
Reference transcript: NM_016583
cd BPI1 138aa 0.001
in ref transcript
BPI/LBP/CETP N-terminal domain; Bactericidal permeability-increasing protein (BPI) / Lipopolysaccharide-binding protein (LBP) / Cholesteryl ester transfer protein (CETP) N-terminal domain; binds to and neutralizes lipopolysaccharides from the outer membrane of Gram-negative bacteria.; Apolar pockets on the concave surface bind a molecule of phosphatidylcholine, primarily by interacting with their acyl chains; this suggests that the pockets may also bind the acyl chains of lipopolysaccharide.
pfam LBP_BPI_CETP 141aa 3e-24
in ref transcript
LBP / BPI / CETP family, N-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar.
PML
refseq_PML.F2 refseq_PML.R2 139 283
NCBIGene 36.3 5371
Single exon skipping, size difference: 144
Exclusion in the protein (no frameshift)
Reference transcript: NM_033238
smart RING 35aa 3e-04
in ref transcript
Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
pfam zf-B_box 43aa 0.003
in ref transcript
B-box zinc finger.
TIGR rad18 73aa 0.003
in ref transcript
This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Changed!
pfam zf-B_box 40aa 0.009
in ref transcript
POGZ
refseq_POGZ.F1 refseq_POGZ.R1 200 359
NCBIGene 36.3 23126
Single exon skipping, size difference: 159
Exclusion in the protein (no frameshift)
Reference transcript: NM_015100
pfam DDE 165aa 2e-15
in ref transcript
DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which contain three carboxylate residues that are believed to be responsible for coordinating metal ions needed for catalysis. The catalytic activity of this enzyme involves DNA cleavage at a specific site followed by a strand transfer reaction. Interestingly this family also includes the CENP-B protein. This domain in that protein appears to have lost the metal binding residues and is unlikely to have endonuclease activity. Centromere Protein B (CENP-B) is a DNA-binding protein localised to the centromere.
smart CENPB 62aa 6e-12
in ref transcript
Putative DNA-binding domain in centromere protein B, mouse jerky and transposases.
POLDIP3
refseq_POLDIP3.F1 refseq_POLDIP3.R1 301 388
NCBIGene 36.3 84271
Single exon skipping, size difference: 87
Exclusion in the protein (no frameshift)
Reference transcript: NM_032311
cd RRM 65aa 1e-09
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
smart RRM 64aa 8e-09
in ref transcript
RNA recognition motif.
COG COG0724 133aa 2e-04
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
POLR3H
refseq_POLR3H.F1 refseq_POLR3H.R1 126 181
NCBIGene 36.2 171568
Alternative 5-prime, size difference: 55
Exclusion in 5'UTR
Reference transcript: NM_001018051
POMC
refseq_POMC.F1 refseq_POMC.R1 145 195
NCBIGene 36.3 5443
Single exon skipping, size difference: 50
Exclusion in 5'UTR
Reference transcript: NM_001035256
pfam NPP 45aa 8e-16
in ref transcript
Pro-opiomelanocortin, N-terminal region. This family features the N-terminal peptide of pro-opiomelanocortin (NPP). It is thought to represent an important pituitary peptide, given its high yield from pituitary glands, and exhibits a potent in vitro aldosterone-stimulating activity.
pfam Op_neuropeptide 29aa 2e-07
in ref transcript
Opioids neuropeptide. This family corresponds to the conserved YGG motif that is found in a wide variety of opioid neuropeptides such as enkephalin.
pfam ACTH_domain 39aa 1e-04
in ref transcript
Corticotropin ACTH domain.
POMZP3
refseq_POMZP3.F1 refseq_POMZP3.R1 186 416
NCBIGene 36.3 22932
Multiple exon skipping, size difference: 230
Exclusion in the protein causing a frameshift, Exclusion of the stop codon
Reference transcript: NM_012230
Changed!
smart ZP 65aa 9e-09
in ref transcript
Zona pellucida (ZP) domain. ZP proteins are responsible for sperm-adhesion fo the zona pellucida. ZP domains are also present in multidomain transmembrane proteins such as glycoprotein GP2, uromodulin and TGF-beta receptor type III (betaglycan).
POP5
refseq_POP5.F1 refseq_POP5.R1 182 332
NCBIGene 36.3 51367
Single exon skipping, size difference: 150
Exclusion in the protein (no frameshift)
Reference transcript: NM_015918
Changed!
pfam RNase_P_Rpp14 111aa 4e-21
in ref transcript
Rpp14/Pop5 family. tRNA processing enzyme ribonuclease P (RNase P) consists of an RNA molecule associated with at least eight protein subunits, hPop1, Rpp14, Rpp20, Rpp25, Rpp29, Rpp30, Rpp38, and Rpp40. This protein is known as Pop5 in eukaryotes.
Changed!
COG POP5 114aa 8e-06
in ref transcript
RNase P/RNase MRP subunit POP5 [Translation, ribosomal structure and biogenesis].
PORCN
refseq_PORCN.F1 refseq_PORCN.R1 155 226
NCBIGene 36.3 64840
Alternative 3-prime, size difference: 71
Exclusion in the protein causing a frameshift
Reference transcript: NM_203475
Changed!
pfam MBOAT 288aa 8e-29
in ref transcript
MBOAT family. The MBOAT (membrane bound O-acyl transferase) family of membrane proteins contains a variety of acyltransferase enzymes. A conserved histidine has been suggested to be the active site residue.
Changed!
COG COG5202 336aa 5e-09
in ref transcript
Predicted membrane protein [Function unknown].
PPA2
refseq_PPA2.F2 refseq_PPA2.R2 286 340
NCBIGene 36.3 27068
Single exon skipping, size difference: 54
Exclusion in the protein (no frameshift)
Reference transcript: NM_176869
Changed!
cd pyrophosphatase 184aa 1e-48
in ref transcript
Inorganic pyrophosphatase. These enzymes hydrolyze inorganic pyrophosphate (PPi) to two molecules of orthophosphates (Pi). The reaction requires bivalent cations. The enzymes in general exist as homooligomers.
Changed!
pfam Pyrophosphatase 185aa 3e-55
in ref transcript
Inorganic pyrophosphatase.
Changed!
COG Ppa 216aa 7e-33
in ref transcript
Inorganic pyrophosphatase [Energy production and conversion].
Changed!
cd pyrophosphatase 178aa 1e-40
in modified transcript
Changed!
pfam Pyrophosphatase 171aa 3e-50
in modified transcript
Changed!
COG Ppa 198aa 7e-29
in modified transcript
PPA2
refseq_PPA2.F3 refseq_PPA2.R3 188 275
NCBIGene 36.3 27068
Single exon skipping, size difference: 87
Exclusion in the protein (no frameshift)
Reference transcript: NM_176869
Changed!
cd pyrophosphatase 184aa 1e-48
in ref transcript
Inorganic pyrophosphatase. These enzymes hydrolyze inorganic pyrophosphate (PPi) to two molecules of orthophosphates (Pi). The reaction requires bivalent cations. The enzymes in general exist as homooligomers.
Changed!
pfam Pyrophosphatase 185aa 3e-55
in ref transcript
Inorganic pyrophosphatase.
Changed!
COG Ppa 216aa 7e-33
in ref transcript
Inorganic pyrophosphatase [Energy production and conversion].
Changed!
cd pyrophosphatase 155aa 5e-39
in modified transcript
Changed!
pfam Pyrophosphatase 156aa 4e-45
in modified transcript
Changed!
COG Ppa 187aa 2e-25
in modified transcript
PPAP2C
refseq_PPAP2C.F1 refseq_PPAP2C.R1 127 168
NCBIGene 36.3 8612
Alternative 5-prime, size difference: 41
Exclusion in the protein causing a frameshift
Reference transcript: NM_003712
Changed!
cd PAP2_wunen 131aa 2e-48
in ref transcript
PAP2, wunen subfamily. Most likely a family of membrane associated phosphatidic acid phosphatases. Wunen is a drosophila protein expressed in the central nervous system, which provides repellent activity towards primordial germ cells (PGCs), controls the survival of PGCs and is essential in the migration process of these cells towards the somatic gonadal precursors.
Changed!
pfam PAP2 123aa 4e-19
in ref transcript
PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2. The family also includes a variety of haloperoxidases that function by oxidising halides in the presence of hydrogen peroxide to form the corresponding hypohalous acids.
Protein phosphatase 2A homologues, catalytic domain. Large family of serine/threonine phosphatases, including PP1, PP2A and PP2B (calcineurin) family members.
cd EFh 66aa 2e-07
in ref transcript
EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
Changed!
smart PP2Ac 322aa 2e-91
in ref transcript
Protein phosphatase 2A homologues, catalytic domain. Large family of serine/threonine phosphatases, that includes PP1, PP2A and PP2B (calcineurin) family members.
pfam PPP5 62aa 3e-07
in ref transcript
PPP5. This domain is specific to the PPP5 subfamily of serine/threonine phosphatases.
smart EH 67aa 0.001
in ref transcript
Eps15 homology domain. Pair of EF hand motifs that recognise proteins containing Asn-Pro-Phe (NPF) sequences.
Changed!
PTZ PTZ00244 314aa 5e-30
in ref transcript
Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].
Changed!
cd PP2Ac 256aa 2e-42
in modified transcript
Changed!
smart PP2Ac 260aa 1e-64
in modified transcript
Changed!
PTZ PTZ00244 176aa 5e-22
in modified transcript
PPEF1
refseq_PPEF1.F1 refseq_PPEF1.R2 219 303
NCBIGene 36.3 5475
Alternative 5-prime, size difference: 84
Exclusion in the protein (no frameshift)
Reference transcript: NM_006240
Changed!
cd PP2Ac 318aa 5e-63
in ref transcript
Protein phosphatase 2A homologues, catalytic domain. Large family of serine/threonine phosphatases, including PP1, PP2A and PP2B (calcineurin) family members.
cd EFh 66aa 2e-07
in ref transcript
EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
Changed!
smart PP2Ac 322aa 2e-91
in ref transcript
Protein phosphatase 2A homologues, catalytic domain. Large family of serine/threonine phosphatases, that includes PP1, PP2A and PP2B (calcineurin) family members.
pfam PPP5 62aa 3e-07
in ref transcript
PPP5. This domain is specific to the PPP5 subfamily of serine/threonine phosphatases.
smart EH 67aa 0.001
in ref transcript
Eps15 homology domain. Pair of EF hand motifs that recognise proteins containing Asn-Pro-Phe (NPF) sequences.
Changed!
PTZ PTZ00244 314aa 5e-30
in ref transcript
Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].
Changed!
cd PP2Ac 290aa 2e-65
in modified transcript
Changed!
smart PP2Ac 294aa 8e-93
in modified transcript
Changed!
PTZ PTZ00244 286aa 7e-33
in modified transcript
Changed!
PTZ PTZ00184 148aa 0.010
in modified transcript
calmodulin; Provisional.
PPHLN1
refseq_PPHLN1.F1 refseq_PPHLN1.R1 118 283
NCBIGene 36.3 51535
Single exon skipping, size difference: 165
Exclusion in the protein (no frameshift)
Reference transcript: NM_016488
PPHLN1
refseq_PPHLN1.F4 refseq_PPHLN1.R4 112 155
NCBIGene 36.3 51535
Single exon skipping, size difference: 43
Inclusion in 5'UTR
Reference transcript: NM_016488
PPIL2
refseq_PPIL2.F1 refseq_PPIL2.R1 110 248
NCBIGene 36.3 23759
Alternative 3-prime, size difference: 138
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_148176
cd cyclophilin_RING 160aa 9e-76
in ref transcript
cyclophilin_RING: cyclophilin-type peptidylprolyl cis- trans isomerases (cyclophilins) having a modified RING finger domain. This group includes the nuclear proteins, Human hCyP-60 and Caenorhabditis elegans MOG-6 which, compared to the archetypal cyclophilin Human cyclophilin A exhibit reduced peptidylprolyl cis- trans isomerase activity and lack a residue important for cyclophilin binding. Human hCyP-60 has been shown to physically interact with the proteinase inhibitor peptide eglin c and; C. elegans MOG-6 to physically interact with MEP-1, a nuclear zinc finger protein. MOG-6 has been shown to function in germline sex determination.
pfam Pro_isomerase 140aa 4e-35
in ref transcript
Cyclophilin type peptidyl-prolyl cis-trans isomerase/CLD. The peptidyl-prolyl cis-trans isomerases, also known as cyclophilins, share this domain of about 109 amino acids. Cyclophilins have been found in all organisms studied so far and catalyse peptidyl-prolyl isomerisation during which the peptide bond preceding proline (the peptidyl-prolyl bond) is stabilised in the cis conformation. Mammalian cyclophilin A (CypA) is a major cellular target for the immunosuppressive drug cyclosporin A (CsA). Other roles for cyclophilins may include chaperone and cell signalling function.
smart Ubox 60aa 1e-12
in ref transcript
Modified RING finger domain. Modified RING finger domain, without the full complement of Zn2+-binding ligands. Probable involvement in E2-dependent ubiquitination.
COG PpiB 153aa 8e-40
in ref transcript
Peptidyl-prolyl cis-trans isomerase (rotamase) - cyclophilin family [Posttranslational modification, protein turnover, chaperones].
COG COG1723 117aa 0.010
in ref transcript
Uncharacterized conserved protein [Function unknown].
PPIL2
refseq_PPIL2.F2 refseq_PPIL2.R2 237 379
NCBIGene 36.3 23759
Alternative 3-prime, size difference: 4
Inclusion in the protein causing a frameshift
Reference transcript: NM_148176
cd cyclophilin_RING 160aa 9e-76
in ref transcript
cyclophilin_RING: cyclophilin-type peptidylprolyl cis- trans isomerases (cyclophilins) having a modified RING finger domain. This group includes the nuclear proteins, Human hCyP-60 and Caenorhabditis elegans MOG-6 which, compared to the archetypal cyclophilin Human cyclophilin A exhibit reduced peptidylprolyl cis- trans isomerase activity and lack a residue important for cyclophilin binding. Human hCyP-60 has been shown to physically interact with the proteinase inhibitor peptide eglin c and; C. elegans MOG-6 to physically interact with MEP-1, a nuclear zinc finger protein. MOG-6 has been shown to function in germline sex determination.
pfam Pro_isomerase 140aa 4e-35
in ref transcript
Cyclophilin type peptidyl-prolyl cis-trans isomerase/CLD. The peptidyl-prolyl cis-trans isomerases, also known as cyclophilins, share this domain of about 109 amino acids. Cyclophilins have been found in all organisms studied so far and catalyse peptidyl-prolyl isomerisation during which the peptide bond preceding proline (the peptidyl-prolyl bond) is stabilised in the cis conformation. Mammalian cyclophilin A (CypA) is a major cellular target for the immunosuppressive drug cyclosporin A (CsA). Other roles for cyclophilins may include chaperone and cell signalling function.
smart Ubox 60aa 1e-12
in ref transcript
Modified RING finger domain. Modified RING finger domain, without the full complement of Zn2+-binding ligands. Probable involvement in E2-dependent ubiquitination.
COG PpiB 153aa 8e-40
in ref transcript
Peptidyl-prolyl cis-trans isomerase (rotamase) - cyclophilin family [Posttranslational modification, protein turnover, chaperones].
Changed!
COG COG1723 117aa 0.010
in ref transcript
Uncharacterized conserved protein [Function unknown].
PPIL2
refseq_PPIL2.F3 refseq_PPIL2.R3 256 398
NCBIGene 36.3 23759
Alternative 3-prime, size difference: 4
Inclusion in the protein causing a frameshift
Reference transcript: NM_148176
cd cyclophilin_RING 160aa 9e-76
in ref transcript
cyclophilin_RING: cyclophilin-type peptidylprolyl cis- trans isomerases (cyclophilins) having a modified RING finger domain. This group includes the nuclear proteins, Human hCyP-60 and Caenorhabditis elegans MOG-6 which, compared to the archetypal cyclophilin Human cyclophilin A exhibit reduced peptidylprolyl cis- trans isomerase activity and lack a residue important for cyclophilin binding. Human hCyP-60 has been shown to physically interact with the proteinase inhibitor peptide eglin c and; C. elegans MOG-6 to physically interact with MEP-1, a nuclear zinc finger protein. MOG-6 has been shown to function in germline sex determination.
pfam Pro_isomerase 140aa 4e-35
in ref transcript
Cyclophilin type peptidyl-prolyl cis-trans isomerase/CLD. The peptidyl-prolyl cis-trans isomerases, also known as cyclophilins, share this domain of about 109 amino acids. Cyclophilins have been found in all organisms studied so far and catalyse peptidyl-prolyl isomerisation during which the peptide bond preceding proline (the peptidyl-prolyl bond) is stabilised in the cis conformation. Mammalian cyclophilin A (CypA) is a major cellular target for the immunosuppressive drug cyclosporin A (CsA). Other roles for cyclophilins may include chaperone and cell signalling function.
smart Ubox 60aa 1e-12
in ref transcript
Modified RING finger domain. Modified RING finger domain, without the full complement of Zn2+-binding ligands. Probable involvement in E2-dependent ubiquitination.
COG PpiB 153aa 8e-40
in ref transcript
Peptidyl-prolyl cis-trans isomerase (rotamase) - cyclophilin family [Posttranslational modification, protein turnover, chaperones].
Changed!
COG COG1723 117aa 0.010
in ref transcript
Uncharacterized conserved protein [Function unknown].
PPIL3
refseq_PPIL3.F2 refseq_PPIL3.R2 105 396
NCBIGene 36.3 53938
Alternative 5-prime and 3-prime, size difference: 291
Inclusion in 5'UTR, Inclusion in 5'UTR
Reference transcript: NM_032472
cd Cyclophilin_PPIL3_like 158aa 2e-55
in ref transcript
Cyclophilin_PPIL3_like. Proteins similar to Human cyclophilin-like peptidylprolyl cis- trans isomerase (PPIL3). Members of this family lack a key residue important for cyclosporin binding: the tryptophan residue corresponding to W121 in human hCyP-18a; most members have a histidine at this position. The exact function of the protein is not known.
pfam Pro_isomerase 152aa 5e-26
in ref transcript
Cyclophilin type peptidyl-prolyl cis-trans isomerase/CLD. The peptidyl-prolyl cis-trans isomerases, also known as cyclophilins, share this domain of about 109 amino acids. Cyclophilins have been found in all organisms studied so far and catalyse peptidyl-prolyl isomerisation during which the peptide bond preceding proline (the peptidyl-prolyl bond) is stabilised in the cis conformation. Mammalian cyclophilin A (CypA) is a major cellular target for the immunosuppressive drug cyclosporin A (CsA). Other roles for cyclophilins may include chaperone and cell signalling function.
COG PpiB 157aa 2e-24
in ref transcript
Peptidyl-prolyl cis-trans isomerase (rotamase) - cyclophilin family [Posttranslational modification, protein turnover, chaperones].
PPM1A
refseq_PPM1A.F2 refseq_PPM1A.R2 279 381
NCBIGene 36.3 5494
Single exon skipping, size difference: 102
Inclusion in 5'UTR
Reference transcript: NM_021003
cd PP2Cc 269aa 5e-71
in ref transcript
Serine/threonine phosphatases, family 2C, catalytic domain; The protein architecture and deduced catalytic mechanism of PP2C phosphatases are similar to the PP1, PP2A, PP2B family of protein Ser/Thr phosphatases, with which PP2C shares no sequence similarity.
smart PP2Cc 273aa 1e-82
in ref transcript
Serine/threonine phosphatases, family 2C, catalytic domain. The protein architecture and deduced catalytic mechanism of PP2C phosphatases are similar to the PP1, PP2A, PP2B family of protein Ser/Thr phosphatases, with which PP2C shares no sequence similarity.
pfam PP2C_C 80aa 1e-32
in ref transcript
Protein serine/threonine phosphatase 2C, C-terminal domain. Protein phosphatase 2C (PP2C) is involved in regulating cellular responses to stress in various eukaryotes. It consists of two domains: an N-terminal catalytic domain and a C-terminal domain characteristic of mammalian PP2Cs. This domain consists of three antiparallel alpha helices, one of which packs against two corresponding alpha-helices of the N-terminal domain. The C-terminal domain does not seem to play a role in catalysis, but it may provide protein substrate specificity due to the cleft that is created between it and the catalytic domain.
PTZ PTZ00224 293aa 2e-45
in ref transcript
protein phosphatase 2C; Provisional.
PPP1CA
refseq_PPP1CA.F2 refseq_PPP1CA.R2 129 162
NCBIGene 36.3 5499
Alternative 3-prime, size difference: 33
Exclusion in the protein (no frameshift)
Reference transcript: NM_001008709
cd PP2Ac 267aa 1e-116
in ref transcript
Protein phosphatase 2A homologues, catalytic domain. Large family of serine/threonine phosphatases, including PP1, PP2A and PP2B (calcineurin) family members.
smart PP2Ac 270aa 1e-121
in ref transcript
Protein phosphatase 2A homologues, catalytic domain. Large family of serine/threonine phosphatases, that includes PP1, PP2A and PP2B (calcineurin) family members.
Changed!
PTZ PTZ00244 281aa 1e-118
in ref transcript
Changed!
PTZ PTZ00244 289aa 1e-120
in modified transcript
PPP1CB
refseq_PPP1CB.F2 refseq_PPP1CB.R2 118 405
NCBIGene 36.2 5500
Multiple exon skipping, size difference: 287
Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
Reference transcript: NM_206876
Changed!
cd PP2Ac 267aa 1e-114
in ref transcript
Protein phosphatase 2A homologues, catalytic domain. Large family of serine/threonine phosphatases, including PP1, PP2A and PP2B (calcineurin) family members.
Changed!
smart PP2Ac 271aa 1e-120
in ref transcript
Protein phosphatase 2A homologues, catalytic domain. Large family of serine/threonine phosphatases, that includes PP1, PP2A and PP2B (calcineurin) family members.
Changed!
PTZ PTZ00244 288aa 1e-117
in ref transcript
Changed!
cd PP2Ac 168aa 2e-69
in modified transcript
Changed!
smart PP2Ac 168aa 4e-73
in modified transcript
Changed!
PTZ PTZ00244 192aa 7e-72
in modified transcript
PPP1R12B
refseq_PPP1R12B.F1 refseq_PPP1R12B.R1 133 314
NCBIGene 36.3 4660
Single exon skipping, size difference: 181
Exclusion of the stop codon
Reference transcript: NM_032105
cd ANK 103aa 2e-25
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
cd ANK 152aa 7e-20
in ref transcript
cd ANK 58aa 1e-06
in ref transcript
pfam Ank 33aa 4e-06
in ref transcript
Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
pfam Ank 32aa 2e-05
in ref transcript
pfam Ank 31aa 1e-04
in ref transcript
TIGR trp 170aa 8e-04
in ref transcript
after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
pfam Ank 33aa 0.005
in ref transcript
COG Arp 125aa 1e-12
in ref transcript
FOG: Ankyrin repeat [General function prediction only].
Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain. FHA domains may bind phosphothreonine, phosphoserine and sometimes phosphotyrosine. In eukaryotes, many FHA domain-containing proteins localize to the nucleus, where they participate in establishing or maintaining cell cycle checkpoints, DNA repair, or transcriptional regulation. Members of the FHA family include: Dun1, Rad53, Cds1, Mek1, KAPP(kinase-associated protein phosphatase),and Ki-67 (a human nuclear protein related to cell proliferation).
Changed!
pfam FHA 69aa 8e-13
in ref transcript
FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif.
Changed!
COG COG1716 89aa 0.001
in ref transcript
FOG: FHA domain [Signal transduction mechanisms].
Changed!
cd FHA 65aa 2e-06
in modified transcript
Changed!
pfam FHA 42aa 2e-07
in modified transcript
PPP2R2B
refseq_PPP2R2B.F1 refseq_PPP2R2B.R1 100 294
NCBIGene 36.3 5521
Single exon skipping, size difference: 194
Exclusion in the protein causing a frameshift
Reference transcript: NM_181674
Changed!
cd WD40 347aa 4e-07
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Changed!
pfam PP2A_B_N 71aa 1e-28
in ref transcript
Serine-threonine phosphatase 2A subunit B alpha and beta N-terminal domain. Protein phosphatase 2A (PP2A) is a serine/threonine phosphatase involved in many aspects of cellular function including the regulation of metabolic enzymes and proteins involved in signal transduction. PP2A is a trimeric enzyme consisting of a central catalytic core, and two subunits, A and B. This is the N-terminal domain of subunit B that is thought to perform substrate recognition function or be responsible for targeting the enzyme complex to the appropriate subcellular compartment. Changes in intracellular PP2A subunit composition have been found modulate micro-tubule dynamics. It would appear that reduced amounts of neuronal B-alpha-containing PP2A heterotrimers contribute to micro-tubule destabilisation in Alzheimer's disease.
Changed!
pfam PP2A_B_subs_rcg 61aa 5e-27
in ref transcript
Serine-threonine phosphatase 2A subunit B alpha and beta central domain. Protein phosphatase 2A (PP2A) is a serine/threonine phosphatase involved in many aspects of cellular function including the regulation of metabolic enzymes and proteins involved in signal transduction. PP2A is a trimeric enzyme consisting of a central catalytic core, and two subunits, A and B. This is a central highly conserved region of subunit B that is thought to perform a substrate recognition function or to be be responsible for targeting the enzyme complex to the appropriate subcellular compartment. Changes in intracellular PP2A subunit composition have been found to modulate micro-tubule dynamics. It would appear that reduced amounts of neuronal B-alpha-containing PP2A heterotrimers contribute to micro-tubule destabilisation in Alzheimer's disease.
Changed!
COG CDC55 417aa 1e-122
in ref transcript
Serine/threonine protein phosphatase 2A, regulatory subunit [Signal transduction mechanisms].
PPP2R2D
refseq_PPP2R2D.F1 refseq_PPP2R2D.R1 177 304
NCBIGene 36.3 55844
Alternative 3-prime, size difference: 127
Exclusion of the protein initiation site
Reference transcript: NM_018461
Changed!
pfam PP2A_B_subs_rcg 61aa 3e-26
in ref transcript
Serine-threonine phosphatase 2A subunit B alpha and beta central domain. Protein phosphatase 2A (PP2A) is a serine/threonine phosphatase involved in many aspects of cellular function including the regulation of metabolic enzymes and proteins involved in signal transduction. PP2A is a trimeric enzyme consisting of a central catalytic core, and two subunits, A and B. This is a central highly conserved region of subunit B that is thought to perform a substrate recognition function or to be be responsible for targeting the enzyme complex to the appropriate subcellular compartment. Changes in intracellular PP2A subunit composition have been found to modulate micro-tubule dynamics. It would appear that reduced amounts of neuronal B-alpha-containing PP2A heterotrimers contribute to micro-tubule destabilisation in Alzheimer's disease.
Changed!
COG CDC55 282aa 9e-80
in ref transcript
Serine/threonine protein phosphatase 2A, regulatory subunit [Signal transduction mechanisms].
Changed!
COG CDC55 221aa 2e-52
in modified transcript
PPP2R2D
refseq_PPP2R2D.F2 refseq_PPP2R2D.R2 242 355
NCBIGene 36.3 55844
Single exon skipping, size difference: 113
Inclusion in 5'UTR
Reference transcript: NM_018461
pfam PP2A_B_subs_rcg 61aa 3e-26
in ref transcript
Serine-threonine phosphatase 2A subunit B alpha and beta central domain. Protein phosphatase 2A (PP2A) is a serine/threonine phosphatase involved in many aspects of cellular function including the regulation of metabolic enzymes and proteins involved in signal transduction. PP2A is a trimeric enzyme consisting of a central catalytic core, and two subunits, A and B. This is a central highly conserved region of subunit B that is thought to perform a substrate recognition function or to be be responsible for targeting the enzyme complex to the appropriate subcellular compartment. Changes in intracellular PP2A subunit composition have been found to modulate micro-tubule dynamics. It would appear that reduced amounts of neuronal B-alpha-containing PP2A heterotrimers contribute to micro-tubule destabilisation in Alzheimer's disease.
COG CDC55 282aa 9e-80
in ref transcript
Serine/threonine protein phosphatase 2A, regulatory subunit [Signal transduction mechanisms].
PPP2R4
refseq_PPP2R4.F1 refseq_PPP2R4.R1 121 226
NCBIGene 36.3 5524
Single exon skipping, size difference: 105
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_021131
Changed!
cd PTPA 266aa 1e-116
in ref transcript
Phosphotyrosyl phosphatase activator (PTPA) is also known as protein phosphatase 2A (PP2A) phosphatase activator. PTPA is an essential, well conserved protein that stimulates the tyrosyl phosphatase activity of PP2A. It also reactivates the serine/threonine phosphatase activity of an inactive form of PP2A. Together, PTPA and PP2A constitute an ATPase. It has been suggested that PTPA alters the relative specificity of PP2A from phosphoserine/phosphothreonine substrates to phosphotyrosine substrates in an ATP-hydrolysis-dependent manner. Basal expression of PTPA is controlled by the transcription factor Yin Yang1 (YY1). PTPA has been suggested to play a role in the insertion of metals to the PP2A catalytic subunit (PP2Ac) active site, to act as a chaperone, and more recently, to have peptidyl prolyl cis/trans isomerase activity that specifically targets human PP2Ac.
Changed!
pfam PTPA 299aa 1e-142
in ref transcript
Phosphotyrosyl phosphate activator (PTPA) protein. Phosphotyrosyl phosphatase activator (PTPA) proteins stimulate the phosphotyrosyl phosphatase (PTPase) activity of the dimeric form of protein phosphatase 2A (PP2A). PTPase activity in PP2A (in vitro) is relatively low when compared to the better recognised phosphoserine/ threonine protein phosphorylase activity. The specific biological role of PTPA is unknown, Basal expression of PTPA depends on the activity of a ubiquitous transcription factor, Yin Yang 1 (YY1). The tumour suppressor protein p53 can inhibit PTPA expression through an unknown mechanism that negatively controls YY1.
Changed!
COG LAG1 301aa 3e-85
in ref transcript
Phosphotyrosyl phosphatase activator [Cell division and chromosome partitioning / Signal transduction mechanisms].
Changed!
cd PTPA 301aa 1e-112
in modified transcript
Changed!
pfam PTPA 334aa 1e-138
in modified transcript
Changed!
COG LAG1 336aa 2e-82
in modified transcript
PPP2R5C
refseq_PPP2R5C.F2 refseq_PPP2R5C.R2 287 404
NCBIGene 36.3 5527
Single exon skipping, size difference: 117
Exclusion in the protein (no frameshift)
Reference transcript: NM_002719
pfam B56 409aa 1e-170
in ref transcript
Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzyme to act on a diverse array of substrates is largely controlled by the nature of its regulatory B subunit. There are multiple families of B subunits (See also pfam01240), this family is called the B56 family.
PQBP1
refseq_PQBP1.F1 refseq_PQBP1.R1 183 316
NCBIGene 36.2 10084
Alternative 5-prime, size difference: 133
Exclusion in the protein causing a frameshift
Reference transcript: NM_001032381
pfam WW 31aa 0.004
in ref transcript
WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro.
PQLC2
refseq_PQLC2.F2 refseq_PQLC2.R2 159 401
NCBIGene 36.3 54896
Single exon skipping, size difference: 242
Exclusion of the protein initiation site
Reference transcript: NM_001040125
pfam PQ-loop 57aa 4e-06
in ref transcript
PQ loop repeat. Members of this family are all membrane bound proteins possessing a pair of repeats each spanning two transmembrane helices connected by a loop. The PQ motif found on loop 2 is critical for the localisation of cystinosin to lysosomes. However, the PQ motif appears not to be a general lysosome-targeting motif. It is thought likely to possess a more general function. Most probably this involves a glutamine residue.
PRAME
refseq_PRAME.F1 refseq_PRAME.R1 112 133
NCBIGene 36.3 23532
Alternative 3-prime, size difference: 21
Inclusion in 5'UTR
Reference transcript: NM_206956
PRC1
refseq_PRC1.F2 refseq_PRC1.R2 154 196
NCBIGene 36.3 9055
Single exon skipping, size difference: 42
Exclusion in the protein (no frameshift)
Reference transcript: NM_003981
Changed!
pfam MAP65_ASE1 579aa 1e-159
in ref transcript
Microtubule associated protein (MAP65/ASE1 family).
Changed!
pfam MAP65_ASE1 565aa 1e-153
in modified transcript
PRCC
refseq_PRCC.F1 refseq_PRCC.R1 126 222
NCBIGene 36.3 5546
Single exon skipping, size difference: 96
Exclusion in the protein (no frameshift)
Reference transcript: NM_005973
pfam PRCC_Cterm 41aa 2e-13
in ref transcript
Mitotic checkpoint protein PRCC_Cterm. This is the highly conserved C-terminal domain of the renal papillary carcinoma protein PRCC. The function of this domain is not known.
PRCP
refseq_PRCP.F2 refseq_PRCP.R2 137 200
NCBIGene 36.3 5547
Single exon skipping, size difference: 63
Exclusion in the protein (no frameshift)
Reference transcript: NM_199418
Changed!
pfam Peptidase_S28 421aa 1e-84
in ref transcript
Serine carboxypeptidase S28. These serine proteases include several eukaryotic enzymes such as lysosomal Pro-X carboxypeptidase, dipeptidyl-peptidase II, and thymus-specific serine peptidase.
Changed!
pfam Peptidase_S28 423aa 9e-86
in modified transcript
PRDM10
refseq_PRDM10.F1 refseq_PRDM10.R1 120 159
NCBIGene 36.3 56980
Alternative 5-prime, size difference: 39
Exclusion in the protein (no frameshift)
Reference transcript: NM_020228
PRDM15
refseq_PRDM15.F2 refseq_PRDM15.R2 179 377
NCBIGene 36.3 63977
Single exon skipping, size difference: 198
Exclusion in the protein (no frameshift)
Reference transcript: NM_022115
smart SET 109aa 0.003
in ref transcript
SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain. Putative methyl transferase, based on outlier plant homologues.
COG COG5048 170aa 0.002
in ref transcript
FOG: Zn-finger [General function prediction only].
PRDM16
refseq_PRDM16.F1 refseq_PRDM16.R1 226 283
NCBIGene 36.3 63976
Alternative 3-prime, size difference: 57
Exclusion in the protein (no frameshift)
Reference transcript: NM_022114
smart SET 124aa 9e-09
in ref transcript
SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain. Putative methyl transferase, based on outlier plant homologues.
pfam zf-C2H2 23aa 0.003
in ref transcript
Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.
COG COG5048 70aa 0.007
in ref transcript
FOG: Zn-finger [General function prediction only].
PRDX1
refseq_PRDX1.F1 refseq_PRDX1.R1 118 144
NCBIGene 36.3 5052
Alternative 5-prime, size difference: 26
Exclusion in 5'UTR
Reference transcript: NM_002574
cd PRX_Typ2cys 173aa 2e-87
in ref transcript
Peroxiredoxin (PRX) family, Typical 2-Cys PRX subfamily; PRXs are thiol-specific antioxidant (TSA) proteins, which confer a protective role in cells through its peroxidase activity by reducing hydrogen peroxide, peroxynitrite, and organic hydroperoxides. The functional unit of typical 2-cys PRX is a homodimer. A unique intermolecular redox-active disulfide center is utilized for its activity. Upon reaction with peroxides, its peroxidatic cysteine is oxidized into a sulfenic acid intermediate which is resolved by bonding with the resolving cysteine from the other subunit of the homodimer. This intermolecular disulfide bond is then reduced by thioredoxin, tryparedoxin or AhpF. Typical 2-cys PRXs, like 1-cys PRXs, form decamers which are stabilized by reduction of the active site cysteine. Typical 2-cys PRX interacts through beta strands at one edge of the monomer (B-type interface) to form the functional homodimer, and uses an A-type interface (similar to the dimeric interface in atypical 2-cys PRX and PRX5) at the opposite end of the monomer to form the stable decameric (pentamer of dimers) structure.
TIGR AhpC 179aa 1e-46
in ref transcript
This gene contains two invariant cysteine residues, one near the N-terminus and one near the C-terminus, each followed immediately by a proline residue.
PTZ PTZ00253 196aa 5e-85
in ref transcript
tryparedoxin peroxidase; Provisional.
PRDX2
refseq_PRDX2.F1 refseq_PRDX2.R1 188 342
NCBIGene 36.2 7001
Single exon skipping, size difference: 154
Exclusion in the protein causing a frameshift
Reference transcript: NM_005809
Changed!
cd PRX_Typ2cys 172aa 1e-85
in ref transcript
Peroxiredoxin (PRX) family, Typical 2-Cys PRX subfamily; PRXs are thiol-specific antioxidant (TSA) proteins, which confer a protective role in cells through its peroxidase activity by reducing hydrogen peroxide, peroxynitrite, and organic hydroperoxides. The functional unit of typical 2-cys PRX is a homodimer. A unique intermolecular redox-active disulfide center is utilized for its activity. Upon reaction with peroxides, its peroxidatic cysteine is oxidized into a sulfenic acid intermediate which is resolved by bonding with the resolving cysteine from the other subunit of the homodimer. This intermolecular disulfide bond is then reduced by thioredoxin, tryparedoxin or AhpF. Typical 2-cys PRXs, like 1-cys PRXs, form decamers which are stabilized by reduction of the active site cysteine. Typical 2-cys PRX interacts through beta strands at one edge of the monomer (B-type interface) to form the functional homodimer, and uses an A-type interface (similar to the dimeric interface in atypical 2-cys PRX and PRX5) at the opposite end of the monomer to form the stable decameric (pentamer of dimers) structure.
Changed!
TIGR AhpC 181aa 5e-51
in ref transcript
This gene contains two invariant cysteine residues, one near the N-terminus and one near the C-terminus, each followed immediately by a proline residue.
Changed!
PTZ PTZ00253 195aa 2e-80
in ref transcript
tryparedoxin peroxidase; Provisional.
Changed!
cd PRX_Typ2cys 28aa 7e-07
in modified transcript
Changed!
pfam AhpC-TSA 29aa 7e-04
in modified transcript
AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA).
Changed!
PTZ PTZ00253 35aa 1e-04
in modified transcript
PRDX3
refseq_PRDX3.F2 refseq_PRDX3.R2 107 161
NCBIGene 36.3 10935
Alternative 5-prime and 3-prime, size difference: 54
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_006793
Changed!
cd PRX_Typ2cys 172aa 8e-84
in ref transcript
Peroxiredoxin (PRX) family, Typical 2-Cys PRX subfamily; PRXs are thiol-specific antioxidant (TSA) proteins, which confer a protective role in cells through its peroxidase activity by reducing hydrogen peroxide, peroxynitrite, and organic hydroperoxides. The functional unit of typical 2-cys PRX is a homodimer. A unique intermolecular redox-active disulfide center is utilized for its activity. Upon reaction with peroxides, its peroxidatic cysteine is oxidized into a sulfenic acid intermediate which is resolved by bonding with the resolving cysteine from the other subunit of the homodimer. This intermolecular disulfide bond is then reduced by thioredoxin, tryparedoxin or AhpF. Typical 2-cys PRXs, like 1-cys PRXs, form decamers which are stabilized by reduction of the active site cysteine. Typical 2-cys PRX interacts through beta strands at one edge of the monomer (B-type interface) to form the functional homodimer, and uses an A-type interface (similar to the dimeric interface in atypical 2-cys PRX and PRX5) at the opposite end of the monomer to form the stable decameric (pentamer of dimers) structure.
Changed!
TIGR AhpC 181aa 7e-55
in ref transcript
This gene contains two invariant cysteine residues, one near the N-terminus and one near the C-terminus, each followed immediately by a proline residue.
Changed!
PTZ PTZ00253 197aa 4e-77
in ref transcript
tryparedoxin peroxidase; Provisional.
Changed!
cd PRX_Typ2cys 169aa 6e-83
in modified transcript
Changed!
TIGR AhpC 177aa 2e-55
in modified transcript
Changed!
PTZ PTZ00253 190aa 2e-76
in modified transcript
PRDX5
refseq_PRDX5.F2 refseq_PRDX5.R2 196 328
NCBIGene 36.3 25824
Single exon skipping, size difference: 132
Exclusion in the protein (no frameshift)
Reference transcript: NM_012094
Changed!
cd PRX5_like 155aa 8e-51
in ref transcript
Peroxiredoxin (PRX) family, PRX5-like subfamily; members are similar to the human protein, PRX5, a homodimeric TRX peroxidase, widely expressed in tissues and found cellularly in mitochondria, peroxisomes and the cytosol. The cellular location of PRX5 suggests that it may have an important antioxidant role in organelles that are major sources of reactive oxygen species (ROS), as well as a role in the control of signal transduction. PRX5 has been shown to reduce hydrogen peroxide, alkyl hydroperoxides and peroxynitrite. As with all other PRXs, the N-terminal peroxidatic cysteine of PRX5 is oxidized into a sulfenic acid intermediate upon reaction with peroxides. Human PRX5 is able to resolve this intermediate by forming an intramolecular disulfide bond with its C-terminal cysteine (the resolving cysteine), which can then be reduced by TRX, just like an atypical 2-cys PRX. This resolving cysteine, however, is not conserved in other members of the subfamily. In such cases, it is assumed that the oxidized cysteine is directly resolved by an external small-molecule or protein reductant, typical of a 1-cys PRX. In the case of the H. influenza PRX5 hybrid, the resolving glutaredoxin domain is on the same protein chain as PRX. PRX5 homodimers show an A-type interface, similar to atypical 2-cys PRXs.
Changed!
pfam Redoxin 145aa 1e-30
in ref transcript
Redoxin. This family of redoxins includes peroxiredoxin, thioredoxin and glutaredoxin proteins.
Changed!
COG AHP1 159aa 3e-39
in ref transcript
Peroxiredoxin [Posttranslational modification, protein turnover, chaperones].
Changed!
cd PRX5_like 111aa 4e-26
in modified transcript
Changed!
pfam Redoxin 101aa 2e-12
in modified transcript
Changed!
COG AHP1 115aa 7e-17
in modified transcript
PRKAA1
refseq_PRKAA1.F1 refseq_PRKAA1.R1 182 227
NCBIGene 36.3 5562
Single exon skipping, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_206907
Changed!
cd S_TKc 269aa 6e-70
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
Changed!
smart S_TKc 249aa 1e-76
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
Changed!
PTZ PTZ00263 272aa 6e-37
in ref transcript
protein kinase A catalytic subunit; Provisional.
Changed!
cd S_TKc 254aa 1e-72
in modified transcript
Changed!
smart S_TKc 234aa 2e-79
in modified transcript
Changed!
PTZ PTZ00263 257aa 2e-39
in modified transcript
PRKAG1
refseq_PRKAG1.F1 refseq_PRKAG1.R1 151 200
NCBIGene 36.3 5571
Alternative 3-prime, size difference: 49
Inclusion in the protein causing a frameshift
Reference transcript: NM_002733
Changed!
cd CBS_pair_28 120aa 4e-41
in ref transcript
The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic generalized epilepsy, hypercalciuric nephrolithiasis, and classic Bartter syndrome (CLC chloride channel family members), Wolff-Parkinson-White syndrome (gamma 2 subunit of AMP-activated protein kinase), retinitis pigmentosa (IMP dehydrogenase-1), and homocystinuria (cystathionine beta-synthase).
Changed!
cd CBS_pair_5 130aa 2e-29
in ref transcript
The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic generalized epilepsy, hypercalciuric nephrolithiasis, and classic Bartter syndrome (CLC chloride channel family members), Wolff-Parkinson-White syndrome (gamma 2 subunit of AMP-activated protein kinase), retinitis pigmentosa (IMP dehydrogenase-1), and homocystinuria (cystathionine beta-synthase).
Changed!
cd CBS_pair_14 123aa 1e-11
in ref transcript
The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic generalized epilepsy, hypercalciuric nephrolithiasis, and classic Bartter syndrome (CLC chloride channel family members), Wolff-Parkinson-White syndrome (gamma 2 subunit of AMP-activated protein kinase), retinitis pigmentosa (IMP dehydrogenase-1), and homocystinuria (cystathionine beta-synthase).
Changed!
pfam CBS 121aa 1e-16
in ref transcript
CBS domain pair. CBS domains are small intracellular modules that pair together to form a stable globular domain. This family represents a pair of CBS domains, that has been termed a Bateman domain. CBS domains have been shown to bind ligands with an adenosyl group such as AMP, ATP and S-AdoMet. CBS domains are found attached to a wide range of other protein domains suggesting that CBS domains may play a regulatory role making proteins sensitive to adenosyl carrying ligands. The region containing the CBS domains in Cystathionine-beta synthase is involved in regulation by S-AdoMet. CBS domain pairs from AMPK bind AMP or ATP. The CBS domains from IMPDH and the chloride channel CLC2 bind ATP.
Changed!
pfam CBS 132aa 7e-12
in ref transcript
Changed!
PRK PRK05567 109aa 2e-08
in ref transcript
Changed!
COG COG3448 120aa 3e-06
in ref transcript
CBS-domain-containing membrane protein [Signal transduction mechanisms].
Changed!
COG COG2905 128aa 5e-05
in ref transcript
Predicted signal-transduction protein containing cAMP-binding and CBS domains [Signal transduction mechanisms].
PRKAR1A
refseq_PRKAR1A.F1 refseq_PRKAR1A.R1 169 200
NCBIGene 36.3 5573
Alternative 5-prime, size difference: 31
Exclusion in 5'UTR
Reference transcript: NM_212472
cd CAP_ED 116aa 1e-21
in ref transcript
effector domain of the CAP family of transcription factors; members include CAP (or cAMP receptor protein (CRP)), which binds cAMP, FNR (fumarate and nitrate reduction), which uses an iron-sulfur cluster to sense oxygen) and CooA, a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. Cyclic nucleotide-binding domain similar to CAP are also present in cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) and vertebrate cyclic nucleotide-gated ion-channels. Cyclic nucleotide-monophosphate binding domain; proteins that bind cyclic nucleotides (cAMP or cGMP) share a structural domain of about 120 residues; the best studied is the prokaryotic catabolite gene activator, CAP, where such a domain is known to be composed of three alpha-helices and a distinctive eight-stranded, antiparallel beta-barrel structure; three conserved glycine residues are thought to be essential for maintenance of the structural integrity of the beta-barrel; CooA is a homodimeric transcription factor that belongs to CAP family; cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) contain two tandem copies of the cyclic nucleotide-binding domain; cAPK's are composed of two different subunits, a catalytic chain and a regulatory chain, which contains both copies of the domain; cGPK's are single chain enzymes that include the two copies of the domain in their N-terminal section; also found in vertebrate cyclic nucleotide-gated ion-channels.
cd CAP_ED 110aa 4e-20
in ref transcript
smart cNMP 119aa 2e-22
in ref transcript
Cyclic nucleotide-monophosphate binding domain. Catabolite gene activator protein (CAP) is a prokaryotic homologue of eukaryotic cNMP-binding domains, present in ion channels, and cNMP-dependent kinases.
smart cNMP 115aa 4e-21
in ref transcript
smart RIIa 38aa 1e-08
in ref transcript
RIIalpha, Regulatory subunit portion of type II PKA R-subunit. RIIalpha, Regulatory subunit portion of type II PKA R-subunit. Contains dimerisation interface and binding site for A-kinase-anchoring proteins (AKAPs).
COG Crp 111aa 4e-12
in ref transcript
cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms].
COG Crp 127aa 5e-11
in ref transcript
ZMYND8
refseq_PRKCBP1.F1 refseq_PRKCBP1.R1 197 272
NCBIGene 36.3 23613
Alternative 3-prime, size difference: 75
Exclusion in the protein (no frameshift)
Reference transcript: NM_183047
cd Bromo_RACK7 99aa 1e-49
in ref transcript
Bromodomain, RACK7_like subfamily. RACK7 (also called human protein kinase C-binding protein) was identified as a potential tumor suppressor genes, it shares domain architecture with BS69/ZMYND11; both have been implicated in the regulation of cellular proliferation. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
cd BS69_related 81aa 4e-35
in ref transcript
The PWWP domain is part of BS69 protein, a nuclear protein that specifically binds adenoviral E1A and Epstein-Barr viral EBNA2 proteins, suppressing their transactivation functions. BS69 is a multi-domain protein, containing bromo, PHD, PWWP, and MYND domains. The specific role of the PWWP domain within BS69 is not clearly identified, but BS69 functions in chromatin remodeling, consistent with other PWWP-containing proteins. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
smart BROMO 101aa 3e-15
in ref transcript
bromo domain.
pfam PWWP 62aa 5e-08
in ref transcript
PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown.
pfam PHD 41aa 2e-07
in ref transcript
PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
pfam zf-MYND 35aa 4e-05
in ref transcript
MYND finger.
COG COG5076 127aa 8e-07
in ref transcript
Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription].
ZMYND8
refseq_PRKCBP1.F3 refseq_PRKCBP1.R3 180 264
NCBIGene 36.3 23613
Single exon skipping, size difference: 84
Exclusion in the protein (no frameshift)
Reference transcript: NM_183047
cd Bromo_RACK7 99aa 1e-49
in ref transcript
Bromodomain, RACK7_like subfamily. RACK7 (also called human protein kinase C-binding protein) was identified as a potential tumor suppressor genes, it shares domain architecture with BS69/ZMYND11; both have been implicated in the regulation of cellular proliferation. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
cd BS69_related 81aa 4e-35
in ref transcript
The PWWP domain is part of BS69 protein, a nuclear protein that specifically binds adenoviral E1A and Epstein-Barr viral EBNA2 proteins, suppressing their transactivation functions. BS69 is a multi-domain protein, containing bromo, PHD, PWWP, and MYND domains. The specific role of the PWWP domain within BS69 is not clearly identified, but BS69 functions in chromatin remodeling, consistent with other PWWP-containing proteins. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
smart BROMO 101aa 3e-15
in ref transcript
bromo domain.
pfam PWWP 62aa 5e-08
in ref transcript
PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown.
pfam PHD 41aa 2e-07
in ref transcript
PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
pfam zf-MYND 35aa 4e-05
in ref transcript
MYND finger.
COG COG5076 127aa 8e-07
in ref transcript
Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription].
PRMT1
refseq_PRMT1.F1 refseq_PRMT1.R1 131 250
NCBIGene 36.3 3276
Exon skipping and alternative 3-prime or 5-prime, size difference: 119
Inclusion in the protein causing a frameshift, Inclusion in the protein causing a new stop codon
Reference transcript: NM_001536
Changed!
cd AdoMet_MTases 101aa 6e-06
in ref transcript
S-adenosylmethionine-dependent methyltransferases (SAM or AdoMet-MTase), class I; AdoMet-MTases are enzymes that use S-adenosyl-L-methionine (SAM or AdoMet) as a substrate for methyltransfer, creating the product S-adenosyl-L-homocysteine (AdoHcy). There are at least five structurally distinct families of AdoMet-MTases, class I being the largest and most diverse. Within this class enzymes can be classified by different substrate specificities (small molecules, lipids, nucleic acids, etc.) and different target atoms for methylation (nitrogen, oxygen, carbon, sulfur, etc.).
Changed!
pfam PrmA 59aa 2e-05
in ref transcript
Ribosomal protein L11 methyltransferase (PrmA). This family consists of several Ribosomal protein L11 methyltransferase (EC:2.1.1.-) sequences.
Changed!
PRK prmA 77aa 5e-06
in ref transcript
ribosomal protein L11 methyltransferase; Reviewed.
PRMT2
refseq_PRMT2.F2 refseq_PRMT2.R2 123 232
NCBIGene 36.3 3275
Single exon skipping, size difference: 109
Exclusion in 5'UTR
Reference transcript: NM_206962
cd SH3 50aa 2e-08
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
cd AdoMet_MTases 101aa 3e-07
in ref transcript
S-adenosylmethionine-dependent methyltransferases (SAM or AdoMet-MTase), class I; AdoMet-MTases are enzymes that use S-adenosyl-L-methionine (SAM or AdoMet) as a substrate for methyltransfer, creating the product S-adenosyl-L-homocysteine (AdoHcy). There are at least five structurally distinct families of AdoMet-MTases, class I being the largest and most diverse. Within this class enzymes can be classified by different substrate specificities (small molecules, lipids, nucleic acids, etc.) and different target atoms for methylation (nitrogen, oxygen, carbon, sulfur, etc.).
smart SH3 54aa 2e-10
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
pfam MTS 83aa 3e-06
in ref transcript
Methyltransferase small domain. This domain is found in ribosomal RNA small subunit methyltransferase C as well as other methyltransferases.
COG COG4076 131aa 4e-07
in ref transcript
Predicted RNA methylase [General function prediction only].
PRMT5
refseq_PRMT5.F2 refseq_PRMT5.R2 260 345
NCBIGene 36.3 10419
Alternative 5-prime, size difference: 85
Inclusion in the protein causing a new stop codon
Reference transcript: NM_006109
Changed!
cd AdoMet_MTases 71aa 4e-04
in ref transcript
S-adenosylmethionine-dependent methyltransferases (SAM or AdoMet-MTase), class I; AdoMet-MTases are enzymes that use S-adenosyl-L-methionine (SAM or AdoMet) as a substrate for methyltransfer, creating the product S-adenosyl-L-homocysteine (AdoHcy). There are at least five structurally distinct families of AdoMet-MTases, class I being the largest and most diverse. Within this class enzymes can be classified by different substrate specificities (small molecules, lipids, nucleic acids, etc.) and different target atoms for methylation (nitrogen, oxygen, carbon, sulfur, etc.).
Changed!
pfam PRMT5 439aa 0.0
in ref transcript
PRMT5 arginine-N-methyltransferase. The human homologue of yeast Skb1 (Shk1 kinase-binding protein 1) is PRMT5, an arginine-N-methyltransferase. These proteins appear to be key mitotic regulators. They play a role in Jak signalling in higher eukaryotes.
Changed!
COG COG4076 168aa 4e-06
in ref transcript
Predicted RNA methylase [General function prediction only].
PRO1853
refseq_PRO1853.F2 refseq_PRO1853.R2 229 383
NCBIGene 36.2 55471
Alternative 5-prime, size difference: 154
Exclusion in the protein causing a frameshift
Reference transcript: NM_144736
Changed!
pfam DUF185 247aa 5e-33
in ref transcript
Uncharacterized ACR, COG1565. This family contains several uncharacterised proteins. One member has been described as an ATP synthase beta subunit transcription termination factor rho protein.
Changed!
COG COG1565 357aa 4e-61
in ref transcript
Uncharacterized conserved protein [Function unknown].
Changed!
pfam DUF185 57aa 1e-07
in modified transcript
Changed!
COG COG1565 106aa 3e-27
in modified transcript
PRPF40B
refseq_PRPF40B.F2 refseq_PRPF40B.R2 102 123
NCBIGene 36.3 25766
Alternative 3-prime, size difference: 21
Exclusion in the protein (no frameshift)
Reference transcript: NM_001031698
cd WW 30aa 3e-05
in ref transcript
Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other peptide recognition proteins such as antibodies and receptors; WW domains have a single groove formed by a conserved Trp and Tyr which recognizes a pair of residues of the sequence X-Pro; variable loops and neighboring domains confer specificity in this domain; there are five distinct groups based on binding: 1) PPXY motifs 2) the PPLP motif; 3) PGM motifs; 4) PSP or PTP motifs; 5) PR motifs.
cd WW 26aa 2e-04
in ref transcript
pfam WW 30aa 4e-06
in ref transcript
WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro.
smart WW 26aa 5e-05
in ref transcript
Domain with 2 conserved Trp (W) residues. Also known as the WWP or rsp5 domain. Binds proline-rich polypeptides.
smart FF 49aa 5e-05
in ref transcript
Contains two conserved F residues. A novel motif that often accompanies WW domains. Often contains two conserved Phe (F) residues.
smart FF 49aa 6e-04
in ref transcript
Changed!
COG PRP40 280aa 2e-18
in ref transcript
Splicing factor [RNA processing and modification].
COG PRP40 91aa 7e-17
in ref transcript
Changed!
COG PRP40 299aa 1e-19
in modified transcript
PRR13
refseq_PRR13.F1 refseq_PRR13.R1 249 399
NCBIGene 36.3 54458
Alternative 3-prime, size difference: 150
Exclusion in the protein (no frameshift)
Reference transcript: NM_018457
PRR5
refseq_PRR5.F2 refseq_PRR5.R2 231 348
NCBIGene 36.3 55615
Single exon skipping, size difference: 117
Exclusion of the protein initiation site
Reference transcript: NM_015366
Changed!
pfam HbrB 132aa 6e-40
in ref transcript
HbrB-like. HbrB is involved hyphal growth and polarity.
Changed!
pfam HbrB 71aa 4e-22
in modified transcript
PRRX1
refseq_PRRX1.F1 refseq_PRRX1.R1 161 233
NCBIGene 36.3 5396
Single exon skipping, size difference: 72
Inclusion in the protein causing a new stop codon
Reference transcript: NM_022716
cd homeodomain 55aa 2e-14
in ref transcript
Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
Changed!
cd Tryp_SPc 242aa 3e-66
in ref transcript
Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.
Changed!
smart Tryp_SPc 241aa 7e-71
in ref transcript
Trypsin-like serine protease. Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.
Changed!
COG COG5640 255aa 7e-23
in ref transcript
Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones].
Changed!
cd Tryp_SPc 228aa 4e-57
in modified transcript
Changed!
smart Tryp_SPc 227aa 5e-61
in modified transcript
Changed!
COG COG5640 241aa 3e-20
in modified transcript
PSMA3
refseq_PSMA3.F1 refseq_PSMA3.R1 111 132
NCBIGene 36.3 5684
Alternative 3-prime, size difference: 21
Exclusion in the protein (no frameshift)
Reference transcript: NM_002788
Changed!
cd proteasome_alpha_type_3 213aa 1e-109
in ref transcript
proteasome_alpha_type_3. The 20S proteasome, multisubunit proteolytic complex, is the central enzyme of nonlysosomal protein degradation in both the cytosol and nucleus. It is composed of 28 subunits arranged as four homoheptameric rings that stack on top of one another forming an elongated alpha-beta-beta-alpha cylinder with a central cavity. The proteasome alpha and beta subunits are members of the N-terminal nucleophile (Ntn)-hydrolase superfamily. Their N-terminal threonine residues are exposed as a nucleophile in peptide bond hydrolysis. Mammals have 7 alpha and 7 beta proteasome subunits while archaea have one of each.
Changed!
pfam Proteasome 187aa 2e-47
in ref transcript
Proteasome A-type and B-type.
pfam Proteasome_A_N 23aa 8e-09
in ref transcript
Proteasome subunit A N-terminal signature. This domain is conserved in the A subunits of the proteasome complex proteins.
Changed!
PRK PRK03996 234aa 1e-44
in ref transcript
proteasome subunit alpha; Provisional.
Changed!
cd proteasome_alpha_type_3 206aa 1e-106
in modified transcript
Changed!
pfam Proteasome 180aa 3e-44
in modified transcript
Changed!
PRK PRK03996 227aa 4e-44
in modified transcript
PSMA8
refseq_PSMA8.F1 refseq_PSMA8.R1 100 118
NCBIGene 36.3 143471
Alternative 3-prime, size difference: 18
Exclusion in the protein (no frameshift)
Reference transcript: NM_144662
Changed!
cd proteasome_alpha_type_7 215aa 1e-102
in ref transcript
proteasome_alpha_type_7. The 20S proteasome, multisubunit proteolytic complex, is the central enzyme of nonlysosomal protein degradation in both the cytosol and nucleus. It is composed of 28 subunits arranged as four homoheptameric rings that stack on top of one another forming an elongated alpha-beta-beta-alpha cylinder with a central cavity. The proteasome alpha and beta subunits are members of the N-terminal nucleophile (Ntn)-hydrolase superfamily. Their N-terminal threonine residues are exposed as a nucleophile in peptide bond hydrolysis. Mammals have 7 alpha and 7 beta proteasome subunits while archaea have one of each.
Changed!
TIGR arc_protsome_A 233aa 6e-67
in ref transcript
This protein family describes the archaeal proteasome alpha subunit, homologous to both the beta subunit and to the alpha and beta subunits of eukaryotic proteasome subunits. This family is universal in the first 29 complete archaeal genomes but occasionally is duplicated.
Changed!
PRK PRK03996 233aa 5e-71
in ref transcript
proteasome subunit alpha; Provisional.
Changed!
cd proteasome_alpha_type_7 209aa 1e-104
in modified transcript
Changed!
TIGR arc_protsome_A 227aa 6e-69
in modified transcript
Changed!
PRK PRK03996 227aa 5e-73
in modified transcript
PSMC3IP
refseq_PSMC3IP.F1 refseq_PSMC3IP.R1 145 181
NCBIGene 36.3 29893
Alternative 3-prime, size difference: 36
Exclusion in the protein (no frameshift)
Reference transcript: NM_016556
cd Fur_like 58aa 0.003
in ref transcript
Ferric uptake regulator(Fur) and related metalloregulatory proteins; typically iron-dependent, DNA-binding repressors and activators. Ferric uptake regulator (Fur) and related metalloregulatory proteins are iron-dependent, DNA-binding repressors and activators mainly involved in iron metabolism. A general model for Fur repression under iron-rich conditions is that activated Fur (a dimer having one Fe2+ coordinated per monomer) binds to specific DNA sequences (Fur boxes) in the promoter region of iron-responsive genes, hindering access of RNA polymerase, and repressing transcription. Positive regulation by Fur can be direct or indirect, as in the Fur repression of an anti-sense regulatory small RNA. Some members sense metal ions other than Fe2+. For example, the zinc uptake regulator (Zur) responds to Zn2+, the manganese uptake regulator (Mur) responds to Mn2+, and the nickel uptake regulator (Nur) responds to Ni2+. Other members sense signals other than metal ions. For example, PerR, a metal-dependent sensor of hydrogen peroxide. PerR regulates DNA-binding activity through metal-based protein oxidation, and co-ordinates Mn2+ or Fe2+ at its regulatory site. Fur family proteins contain an N-terminal winged-helix DNA-binding domain followed by a dimerization domain; this CD spans both those domains.
Changed!
pfam TBPIP 167aa 9e-56
in ref transcript
Tat binding protein 1(TBP-1)-interacting protein (TBPIP). This family consists of several eukaryotic TBP-1 interacting protein (TBPIP) sequences. TBP-1 has been demonstrated to interact with the human immunodeficiency virus type 1 (HIV-1) viral protein Tat, then modulate the essential replication process of HIV. In addition, TBP-1 has been shown to be a component of the 26S proteasome, a basic multiprotein complex that degrades ubiquitinated proteins in an ATP-dependent fashion. Human TBPIP interacts with human TBP-1 then modulates the inhibitory action of human TBP-1 on HIV-Tat-mediated transactivation.
Changed!
pfam TBPIP 155aa 3e-50
in modified transcript
PSMC4
refseq_PSMC4.F1 refseq_PSMC4.R1 111 204
NCBIGene 36.3 5704
Alternative 3-prime, size difference: 93
Exclusion in the protein (no frameshift)
Reference transcript: NM_006503
cd AAA 168aa 2e-16
in ref transcript
The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
Changed!
TIGR 26Sp45 347aa 1e-103
in ref transcript
Many proteins may score above the trusted cutoff because an internal.
Changed!
COG RPT1 383aa 1e-137
in ref transcript
ATP-dependent 26S proteasome regulatory subunit [Posttranslational modification, protein turnover, chaperones].
Changed!
TIGR 26Sp45 348aa 1e-104
in modified transcript
Changed!
COG RPT1 350aa 1e-135
in modified transcript
PSMD10
refseq_PSMD10.F1 refseq_PSMD10.R1 194 273
NCBIGene 36.3 5716
Alternative 5-prime, size difference: 79
Exclusion in the protein causing a frameshift
Reference transcript: NM_002814
Changed!
cd ANK 124aa 2e-30
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
pfam Ank 33aa 6e-06
in ref transcript
Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
pfam Ank 31aa 2e-05
in ref transcript
pfam Ank 31aa 3e-05
in ref transcript
Changed!
TIGR trp 153aa 2e-04
in ref transcript
after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
Changed!
COG Arp 156aa 1e-17
in ref transcript
FOG: Ankyrin repeat [General function prediction only].
Changed!
PTZ PTZ00322 66aa 6e-08
in ref transcript
Changed!
cd ANK 115aa 1e-27
in modified transcript
Changed!
TIGR trp 103aa 1e-04
in modified transcript
Changed!
COG Arp 137aa 8e-16
in modified transcript
PSME3
refseq_PSME3.F2 refseq_PSME3.R2 156 195
NCBIGene 36.3 10197
Alternative 3-prime, size difference: 39
Exclusion in the protein (no frameshift)
Reference transcript: NM_176863
Changed!
pfam PA28_beta 163aa 1e-68
in ref transcript
Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex binds to the 20S proteasome ana simulates peptidase activity in and ATP-independent manner.
pfam PA28_alpha 58aa 6e-20
in ref transcript
Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex binds to the 20S proteasome ana simulates peptidase activity in and ATP-independent manner.
Changed!
pfam PA28_beta 150aa 1e-70
in modified transcript
PSMF1
refseq_PSMF1.F2 refseq_PSMF1.R2 119 202
NCBIGene 36.2 9491
Single exon skipping, size difference: 83
Exclusion in the protein causing a frameshift
Reference transcript: NM_178578
Changed!
pfam PI31_Prot_Reg 86aa 4e-04
in ref transcript
PI31 proteasome regulator. PI31 is a cellular regulator of proteasome formation and of proteasome-mediated antigen processing.
OLA1
refseq_PTD004.F1 refseq_PTD004.R1 199 300
NCBIGene 36.3 29789
Single exon skipping, size difference: 101
Exclusion of the protein initiation site
Reference transcript: NM_013341
Changed!
cd YchF 279aa 1e-124
in ref transcript
YchF subfamily. YchF is a member of the Obg family, which includes four other subfamilies of GTPases: Obg, DRG, Ygr210, and NOG1. Obg is an essential gene that is involved in DNA replication in C. crescentus and Streptomyces griseus and is associated with the ribosome. Several members of the family, including YchF, possess the TGS domain related to the RNA-binding proteins. Experimental data and genomic analysis suggest that YchF may be part of a nucleoprotein complex and may function as a GTP-dependent translational factor.
cd TGS_YchF_C 83aa 2e-46
in ref transcript
TGS_YchF_C: This subfamily represents TGS domain-containing YchF GTP-binding protein, a universally conserved GTPase whose function is unknown. The N-terminal domain of the YchF protein belongs to the Obg-like family of GTPases, and some members of the family contain a C-terminal TGS domain. TGS is a small domain of about 50 amino acid residues with a predominantly beta-sheet structure. There is no direct information on the function of the TGS domain, but its presence in two types of regulatory proteins (the GTPases and guanosine polyphosphate phosphohydrolases/synthetases) suggests a ligand (most likely nucleotide)-binding, regulatory role.
Changed!
TIGR TIGR00092 366aa 8e-92
in ref transcript
This predicted GTP-binding protein is found in a single copy in every complete bacterial genome, and is found in Eukaryotes. A more distantly related protein, separated from this model, is found in the archaea. It is known to bind GTP and double-stranded nucleic acid. It is suggested to belong to a nucleoprotein complex and act as a translation factor.
Changed!
PTZ PTZ00258 373aa 1e-169
in ref transcript
GTP-binding protein; Provisional.
Changed!
cd YchF 141aa 1e-40
in modified transcript
Changed!
TIGR TIGR00092 226aa 8e-51
in modified transcript
Changed!
PTZ PTZ00258 228aa 9e-90
in modified transcript
Phosphotriesterase (PTE) catalyzes the hydrolysis of organophosphate nerve agents, including the chemical warfare agents VX, soman, and sarin as well as the insecticide paraoxon. PTE exists as a homodimer with one active site per monomer. The active site is located next to a binuclear metal center, at the C-terminal end of a TIM alpha- beta barrel motif. The native enzyme contains two zinc ions at the active site however these can be replaced with other metals such as cobalt, cadmium, nickel or manganese and the enzyme remains active.
pfam PTE 333aa 1e-119
in ref transcript
Phosphotriesterase family.
COG Php 345aa 3e-53
in ref transcript
Predicted metal-dependent hydrolase with the TIM-barrel fold [General function prediction only].
PTGER3
refseq_PTGER3.F1 refseq_PTGER3.R1 129 156
NCBIGene 36.3 5733
Single exon skipping, size difference: 27
Exclusion in the protein (no frameshift)
Reference transcript: NM_198712
pfam 7tm_1 252aa 3e-10
in ref transcript
7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
PTGES2
refseq_PTGES2.F2 refseq_PTGES2.R2 309 402
NCBIGene 36.3 80142
Single exon skipping, size difference: 93
Inclusion in the protein causing a new stop codon
Reference transcript: NM_025072
Changed!
cd GST_C_mPGES2 149aa 8e-60
in ref transcript
GST_C family; microsomal Prostaglandin E synthase Type 2 (mPGES2) subfamily; mPGES2 is a membrane-anchored dimeric protein containing a CXXC motif which catalyzes the isomerization of PGH2 to PGE2. Unlike cytosolic PGE synthase (cPGES) and microsomal PGES Type 1 (mPGES1), mPGES2 does not require glutathione (GSH) for its activity, although its catalytic rate is increased two- to four-fold in the presence of DTT, GSH, or other thiol compounds. PGE2 is widely distributed in various tissues and is implicated in the sleep/wake cycle, relaxation/contraction of smooth muscle, excretion of sodium ions, maintenance of body temperature, and mediation of inflammation. mPGES2 contains an N-terminal hydrophobic domain which is membrane associated and a C-terminal soluble domain with a GST-like structure. The C-terminus contains two structural domains a N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain. The GST active site is located in a cleft between the two domains.
Changed!
cd GST_N_mPGES2 75aa 8e-33
in ref transcript
GST_N family; microsomal Prostaglandin E synthase Type 2 (mPGES2) subfamily; mPGES2 is a membrane-anchored dimeric protein containing a CXXC motif which catalyzes the isomerization of PGH2 to PGE2. Unlike cytosolic PGE synthase (cPGES) and microsomal PGES Type 1 (mPGES1), mPGES2 does not require glutathione (GSH) for its activity, although its catalytic rate is increased two- to four-fold in the presence of DTT, GSH or other thiol compounds. PGE2 is widely distributed in various tissues and is implicated in the sleep/wake cycle, relaxation/contraction of smooth muscle, excretion of sodium ions, maintenance of body temperature and mediation of inflammation. mPGES2 contains an N-terminal hydrophobic domain which is membrane associated, and a C-terminal soluble domain with a GST-like structure.
pfam Glutaredoxin 50aa 5e-08
in ref transcript
Glutaredoxin.
Changed!
COG Gst 140aa 5e-04
in ref transcript
Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones].
Changed!
cd GST_N_mPGES2 57aa 4e-24
in modified transcript
Changed!
COG GrxC 58aa 6e-04
in modified transcript
Glutaredoxin and related proteins [Posttranslational modification, protein turnover, chaperones].
PTGFR
refseq_PTGFR.F2 refseq_PTGFR.R2 335 406
NCBIGene 36.3 5737
Single exon skipping, size difference: 71
Inclusion in the protein causing a frameshift
Reference transcript: NM_000959
Changed!
pfam 7tm_1 201aa 1e-12
in ref transcript
7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
Changed!
pfam 7tm_1 193aa 5e-10
in modified transcript
PTK2B
refseq_PTK2B.F1 refseq_PTK2B.R1 159 285
NCBIGene 36.3 2185
Single exon skipping, size difference: 126
Exclusion in the protein (no frameshift)
Reference transcript: NM_173174
cd PTKc_FAK 270aa 1e-146
in ref transcript
Catalytic Domain of the Protein Tyrosine Kinase, Focal Adhesion Kinase. Protein Tyrosine Kinase (PTK) family; Focal Adhesion kinase (FAK); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. FAK is a cytoplasmic (or nonreceptor) tyr kinase that contains an autophosphorylation site and a FERM domain at the N-terminus, a central tyr kinase domain, proline-rich regions, and a C-terminal FAT (focal adhesion targeting) domain. FAK activity is dependent on integrin-mediated cell adhesion, which facilitates N-terminal autophosphorylation. Full activation is achieved by the phosphorylation of its two adjacent A-loop tyrosines. FAK is important in mediating signaling initiated at sites of cell adhesions and at growth factor receptors. Through diverse molecular interactions, FAK functions as a biosensor or integrator to control cell motility. It is a key regulator of cell survival, proliferation, migration and invasion, and thus plays an important role in the development and progression of cancer. Src binds to autophosphorylated FAK forming the FAK-Src dual kinase complex, which is activated in a wide variety of tumor cells and generates signals promoting growth and metastasis. FAK is being developed as a target for cancer therapy.
pfam Pkinase_Tyr 233aa 5e-98
in ref transcript
Protein tyrosine kinase.
pfam Focal_AT 139aa 6e-59
in ref transcript
Focal adhesion targeting region. Focal adhesion kinase (FAK) is a tyrosine kinase found in focal adhesions, intracellular signaling complexes that are formed following engagement of the extracellular matrix by integrins. The C-terminal 'focal adhesion targeting' (FAT) region is necessary and sufficient for localising FAK to focal adhesions. The crystal structure of FAT shows it forms a four-helix bundle that resembles those found in two other proteins involved in cell adhesion, alpha-catenin and vinculin. The binding of FAT to the focal adhesion protein, paxillin, requires the integrity of the helical bundle, whereas binding to another focal adhesion protein, talin, does not.
smart B41 227aa 2e-31
in ref transcript
Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
COG SPS1 229aa 2e-16
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
PTK2B
refseq_PTK2B.F3 refseq_PTK2B.R3 102 262
NCBIGene 36.3 2185
Single exon skipping, size difference: 160
Exclusion in 5'UTR
Reference transcript: NM_173174
cd PTKc_FAK 270aa 1e-146
in ref transcript
Catalytic Domain of the Protein Tyrosine Kinase, Focal Adhesion Kinase. Protein Tyrosine Kinase (PTK) family; Focal Adhesion kinase (FAK); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. FAK is a cytoplasmic (or nonreceptor) tyr kinase that contains an autophosphorylation site and a FERM domain at the N-terminus, a central tyr kinase domain, proline-rich regions, and a C-terminal FAT (focal adhesion targeting) domain. FAK activity is dependent on integrin-mediated cell adhesion, which facilitates N-terminal autophosphorylation. Full activation is achieved by the phosphorylation of its two adjacent A-loop tyrosines. FAK is important in mediating signaling initiated at sites of cell adhesions and at growth factor receptors. Through diverse molecular interactions, FAK functions as a biosensor or integrator to control cell motility. It is a key regulator of cell survival, proliferation, migration and invasion, and thus plays an important role in the development and progression of cancer. Src binds to autophosphorylated FAK forming the FAK-Src dual kinase complex, which is activated in a wide variety of tumor cells and generates signals promoting growth and metastasis. FAK is being developed as a target for cancer therapy.
pfam Pkinase_Tyr 233aa 5e-98
in ref transcript
Protein tyrosine kinase.
pfam Focal_AT 139aa 6e-59
in ref transcript
Focal adhesion targeting region. Focal adhesion kinase (FAK) is a tyrosine kinase found in focal adhesions, intracellular signaling complexes that are formed following engagement of the extracellular matrix by integrins. The C-terminal 'focal adhesion targeting' (FAT) region is necessary and sufficient for localising FAK to focal adhesions. The crystal structure of FAT shows it forms a four-helix bundle that resembles those found in two other proteins involved in cell adhesion, alpha-catenin and vinculin. The binding of FAT to the focal adhesion protein, paxillin, requires the integrity of the helical bundle, whereas binding to another focal adhesion protein, talin, does not.
smart B41 227aa 2e-31
in ref transcript
Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
COG SPS1 229aa 2e-16
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
PTK7
refseq_PTK7.F1 refseq_PTK7.R1 181 301
NCBIGene 36.3 5754
Single exon skipping, size difference: 120
Exclusion in the protein (no frameshift)
Reference transcript: NM_002821
cd PTK_CCK4 274aa 1e-148
in ref transcript
Catalytic Domain of the Protein Tyrosine Kinase, Colon Carcinoma Kinase 4. Protein Tyrosine Kinase (PTK) family; Colon carcinoma kinase 4 (CCK4); pseudokinase domain. The PTKc (catalytic domain) family, to which this subfamily belongs, includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. CCK4, also called protein tyrosine kinase 7 (PTK7), is an orphan receptor tyr kinase (RTK) containing an extracellular region with seven immunoglobulin domains, a transmembrane segment, and an intracellular inactive pseudokinase domain. Studies in mice reveal that CCK4 is essential for neural development. Mouse embryos containing a truncated CCK4 die perinatally and display craniorachischisis, a severe form of neural tube defect. The mechanism of action of the CCK4 pseudokinase is still unknown. Other pseudokinases such as HER3 rely on the activity of partner RTKs.
cd IGcam 85aa 5e-16
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
smart IGc2 61aa 3e-06
in ref transcript
COG SPS1 265aa 6e-19
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
PTK7
refseq_PTK7.F4 refseq_PTK7.R4 197 365
NCBIGene 36.3 5754
Exon skipping and alternative 3-prime or 5-prime, size difference: 168
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_002821
cd PTK_CCK4 274aa 1e-148
in ref transcript
Catalytic Domain of the Protein Tyrosine Kinase, Colon Carcinoma Kinase 4. Protein Tyrosine Kinase (PTK) family; Colon carcinoma kinase 4 (CCK4); pseudokinase domain. The PTKc (catalytic domain) family, to which this subfamily belongs, includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. CCK4, also called protein tyrosine kinase 7 (PTK7), is an orphan receptor tyr kinase (RTK) containing an extracellular region with seven immunoglobulin domains, a transmembrane segment, and an intracellular inactive pseudokinase domain. Studies in mice reveal that CCK4 is essential for neural development. Mouse embryos containing a truncated CCK4 die perinatally and display craniorachischisis, a severe form of neural tube defect. The mechanism of action of the CCK4 pseudokinase is still unknown. Other pseudokinases such as HER3 rely on the activity of partner RTKs.
cd IGcam 85aa 5e-16
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
smart IGc2 61aa 3e-06
in ref transcript
COG SPS1 265aa 6e-19
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
PTK9
refseq_PTK9.F1 refseq_PTK9.R1 109 499
NCBIGene 36.2 5756
Alternative 5-prime, size difference: 390
Inclusion in the protein causing a new stop codon
Reference transcript: NM_002822
Changed!
cd ADF 134aa 2e-23
in ref transcript
Actin depolymerisation factor/cofilin -like domains; present in a family of essential eukaryotic actin regulatory proteins; these proteins enhance the turnover rate of actin and interact with actin monomers as well as actin filaments.
Changed!
cd ADF 131aa 8e-23
in ref transcript
Changed!
smart ADF 133aa 3e-28
in ref transcript
Actin depolymerisation factor/cofilin -like domains. Severs actin filaments and binds to actin monomers.
Changed!
smart ADF 128aa 3e-26
in ref transcript
PTP4A2
refseq_PTP4A2.F1 refseq_PTP4A2.R1 156 362
NCBIGene 36.3 8073
Multiple exon skipping, size difference: 206
Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
Reference transcript: NM_080391
Changed!
cd DSPc 116aa 4e-08
in ref transcript
Dual specificity phosphatases (DSP); Ser/Thr and Tyr protein phosphatases. Structurally similar to tyrosine-specific phosphatases but with a shallower active site cleft and a distinctive active site signature motif, HCxxGxxR. Characterized as VHR- or Cdc25-like.
Changed!
smart PTPc_motif 86aa 7e-10
in ref transcript
Protein tyrosine phosphatase, catalytic domain motif.
Changed!
PTZ PTZ00242 149aa 1e-53
in ref transcript
protein tyrosine phosphatase; Provisional.
Changed!
PTZ PTZ00242 59aa 3e-13
in modified transcript
PTP4A3
refseq_PTP4A3.F2 refseq_PTP4A3.R2 221 296
NCBIGene 36.3 11156
Single exon skipping, size difference: 75
Exclusion in the protein (no frameshift)
Reference transcript: NM_032611
Changed!
cd DSPc 104aa 2e-07
in ref transcript
Dual specificity phosphatases (DSP); Ser/Thr and Tyr protein phosphatases. Structurally similar to tyrosine-specific phosphatases but with a shallower active site cleft and a distinctive active site signature motif, HCxxGxxR. Characterized as VHR- or Cdc25-like.
Changed!
pfam Y_phosphatase 134aa 8e-11
in ref transcript
Protein-tyrosine phosphatase.
Changed!
PTZ PTZ00242 151aa 9e-53
in ref transcript
protein tyrosine phosphatase; Provisional.
Changed!
cd PTPc 69aa 0.001
in modified transcript
Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
Changed!
pfam Y_phosphatase 95aa 4e-04
in modified transcript
Changed!
PTZ PTZ00242 126aa 1e-37
in modified transcript
PTPN2
refseq_PTPN2.F1 refseq_PTPN2.R1 260 362
NCBIGene 36.3 5771
Single exon skipping, size difference: 102
Exclusion in the protein (no frameshift)
Reference transcript: NM_080422
cd PTPc 230aa 1e-78
in ref transcript
Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
smart PTPc 270aa 2e-88
in ref transcript
Protein tyrosine phosphatase, catalytic domain.
COG PTP2 235aa 1e-41
in ref transcript
Protein tyrosine phosphatase [Signal transduction mechanisms].
PTPN5
refseq_PTPN5.F1 refseq_PTPN5.R1 238 283
NCBIGene 36.3 84867
Alternative 3-prime, size difference: 45
Inclusion in 5'UTR
Reference transcript: NM_032781
cd PTPc 229aa 3e-76
in ref transcript
Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
smart PTPc 254aa 2e-84
in ref transcript
Protein tyrosine phosphatase, catalytic domain.
COG PTP2 237aa 8e-35
in ref transcript
Protein tyrosine phosphatase [Signal transduction mechanisms].
PTPN5
refseq_PTPN5.F4 refseq_PTPN5.R4 280 376
NCBIGene 36.3 84867
Alternative 3-prime, size difference: 96
Exclusion in the protein (no frameshift)
Reference transcript: NM_032781
cd PTPc 229aa 3e-76
in ref transcript
Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
smart PTPc 254aa 2e-84
in ref transcript
Protein tyrosine phosphatase, catalytic domain.
COG PTP2 237aa 8e-35
in ref transcript
Protein tyrosine phosphatase [Signal transduction mechanisms].
PTPRA
refseq_PTPRA.F1 refseq_PTPRA.R1 146 173
NCBIGene 36.3 5786
Single exon skipping, size difference: 27
Exclusion in the protein (no frameshift)
Reference transcript: NM_002836
cd PTPc 231aa 3e-89
in ref transcript
Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
cd PTPc 231aa 5e-85
in ref transcript
smart PTPc 259aa 1e-97
in ref transcript
Protein tyrosine phosphatase, catalytic domain.
smart PTPc 261aa 1e-93
in ref transcript
COG PTP2 231aa 2e-43
in ref transcript
Protein tyrosine phosphatase [Signal transduction mechanisms].
COG PTP2 241aa 3e-43
in ref transcript
PTPRC
refseq_PTPRC.F1 refseq_PTPRC.R1 200 344
NCBIGene 36.3 5788
Single exon skipping, size difference: 144
Exclusion in the protein (no frameshift)
Reference transcript: NM_002838
cd PTPc 232aa 2e-86
in ref transcript
Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
cd PTPc 256aa 2e-71
in ref transcript
cd FN3 81aa 2e-06
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd FN3 71aa 0.009
in ref transcript
smart PTPc 260aa 1e-93
in ref transcript
Protein tyrosine phosphatase, catalytic domain.
smart PTPc 283aa 9e-81
in ref transcript
pfam fn3 71aa 7e-04
in ref transcript
Fibronectin type III domain.
pfam fn3 81aa 0.003
in ref transcript
Changed!
pfam Herpes_BLLF1 136aa 0.003
in ref transcript
Herpes virus major outer envelope glycoprotein (BLLF1). This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
COG PTP2 250aa 3e-42
in ref transcript
Protein tyrosine phosphatase [Signal transduction mechanisms].
COG PTP2 265aa 4e-30
in ref transcript
Changed!
smart FN3 76aa 0.002
in modified transcript
Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Changed!
pfam Herpes_BLLF1 124aa 0.006
in modified transcript
PTPRD
refseq_PTPRD.F2 refseq_PTPRD.R2 136 163
NCBIGene 36.3 5789
Multiple exon skipping, size difference: 27
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_002839
cd PTPc 230aa 5e-92
in ref transcript
Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
cd PTPc 231aa 8e-90
in ref transcript
cd FN3 92aa 3e-13
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd FN3 90aa 7e-13
in ref transcript
Changed!
cd IGcam 88aa 2e-12
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IGcam 91aa 4e-12
in ref transcript
cd FN3 95aa 8e-12
in ref transcript
cd FN3 93aa 2e-11
in ref transcript
cd FN3 89aa 3e-11
in ref transcript
cd FN3 107aa 4e-09
in ref transcript
cd FN3 82aa 2e-06
in ref transcript
cd IGcam 78aa 1e-05
in ref transcript
smart PTPc 256aa 1e-101
in ref transcript
Protein tyrosine phosphatase, catalytic domain.
smart PTPc 260aa 1e-100
in ref transcript
pfam I-set 92aa 5e-16
in ref transcript
Immunoglobulin I-set domain.
pfam fn3 85aa 8e-15
in ref transcript
Fibronectin type III domain.
pfam fn3 88aa 2e-13
in ref transcript
pfam fn3 83aa 2e-13
in ref transcript
smart FN3 79aa 3e-12
in ref transcript
Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Changed!
smart IGc2 74aa 7e-11
in ref transcript
Immunoglobulin C-2 Type.
smart FN3 83aa 2e-10
in ref transcript
pfam fn3 100aa 5e-10
in ref transcript
smart IG_like 79aa 7e-08
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
smart FN3 81aa 4e-05
in ref transcript
COG PTP2 268aa 1e-48
in ref transcript
Protein tyrosine phosphatase [Signal transduction mechanisms].
COG PTP2 281aa 9e-45
in ref transcript
Changed!
cd IGcam 79aa 4e-13
in modified transcript
Changed!
smart IGc2 65aa 4e-11
in modified transcript
PTPRD
refseq_PTPRD.F3 refseq_PTPRD.R4 211 250
NCBIGene 36.3 5789
Single exon skipping, size difference: 12
Exclusion in the protein (no frameshift)
Reference transcript: NM_002839
cd PTPc 230aa 5e-92
in ref transcript
Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
cd PTPc 231aa 8e-90
in ref transcript
cd FN3 92aa 3e-13
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd FN3 90aa 7e-13
in ref transcript
cd IGcam 88aa 2e-12
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IGcam 91aa 4e-12
in ref transcript
cd FN3 95aa 8e-12
in ref transcript
cd FN3 93aa 2e-11
in ref transcript
cd FN3 89aa 3e-11
in ref transcript
cd FN3 107aa 4e-09
in ref transcript
cd FN3 82aa 2e-06
in ref transcript
Changed!
cd IGcam 78aa 1e-05
in ref transcript
smart PTPc 256aa 1e-101
in ref transcript
Protein tyrosine phosphatase, catalytic domain.
smart PTPc 260aa 1e-100
in ref transcript
pfam I-set 92aa 5e-16
in ref transcript
Immunoglobulin I-set domain.
pfam fn3 85aa 8e-15
in ref transcript
Fibronectin type III domain.
pfam fn3 88aa 2e-13
in ref transcript
pfam fn3 83aa 2e-13
in ref transcript
smart FN3 79aa 3e-12
in ref transcript
Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
smart IGc2 74aa 7e-11
in ref transcript
Immunoglobulin C-2 Type.
smart FN3 83aa 2e-10
in ref transcript
pfam fn3 100aa 5e-10
in ref transcript
Changed!
smart IG_like 79aa 7e-08
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
smart FN3 81aa 4e-05
in ref transcript
COG PTP2 268aa 1e-48
in ref transcript
Protein tyrosine phosphatase [Signal transduction mechanisms].
COG PTP2 281aa 9e-45
in ref transcript
Changed!
cd IGcam 83aa 6e-06
in modified transcript
Changed!
pfam I-set 84aa 6e-08
in modified transcript
PTPRD
refseq_PTPRD.F4 refseq_PTPRD.R5 138 165
NCBIGene 36.3 5789
Single exon skipping, size difference: 27
Exclusion in the protein (no frameshift)
Reference transcript: NM_002839
cd PTPc 230aa 5e-92
in ref transcript
Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
cd PTPc 231aa 8e-90
in ref transcript
cd FN3 92aa 3e-13
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd FN3 90aa 7e-13
in ref transcript
cd IGcam 88aa 2e-12
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IGcam 91aa 4e-12
in ref transcript
cd FN3 95aa 8e-12
in ref transcript
cd FN3 93aa 2e-11
in ref transcript
cd FN3 89aa 3e-11
in ref transcript
Changed!
cd FN3 107aa 4e-09
in ref transcript
cd FN3 82aa 2e-06
in ref transcript
cd IGcam 78aa 1e-05
in ref transcript
smart PTPc 256aa 1e-101
in ref transcript
Protein tyrosine phosphatase, catalytic domain.
smart PTPc 260aa 1e-100
in ref transcript
pfam I-set 92aa 5e-16
in ref transcript
Immunoglobulin I-set domain.
pfam fn3 85aa 8e-15
in ref transcript
Fibronectin type III domain.
pfam fn3 88aa 2e-13
in ref transcript
pfam fn3 83aa 2e-13
in ref transcript
smart FN3 79aa 3e-12
in ref transcript
Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
smart IGc2 74aa 7e-11
in ref transcript
Immunoglobulin C-2 Type.
smart FN3 83aa 2e-10
in ref transcript
Changed!
pfam fn3 100aa 5e-10
in ref transcript
smart IG_like 79aa 7e-08
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
smart FN3 81aa 4e-05
in ref transcript
COG PTP2 268aa 1e-48
in ref transcript
Protein tyrosine phosphatase [Signal transduction mechanisms].
COG PTP2 281aa 9e-45
in ref transcript
Changed!
cd FN3 98aa 6e-09
in modified transcript
Changed!
pfam fn3 91aa 5e-09
in modified transcript
PTPRF
refseq_PTPRF.F1 refseq_PTPRF.R1 123 150
NCBIGene 36.3 5792
Single exon skipping, size difference: 27
Exclusion in the protein (no frameshift)
Reference transcript: NM_002840
cd PTPc 230aa 2e-94
in ref transcript
Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
cd PTPc 231aa 4e-89
in ref transcript
cd IGcam 79aa 7e-13
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd FN3 93aa 2e-12
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd FN3 89aa 1e-11
in ref transcript
cd IGcam 90aa 2e-11
in ref transcript
cd FN3 95aa 5e-10
in ref transcript
cd FN3 82aa 3e-09
in ref transcript
cd FN3 90aa 3e-09
in ref transcript
cd FN3 90aa 1e-08
in ref transcript
cd IGcam 83aa 2e-08
in ref transcript
Changed!
cd FN3 107aa 0.001
in ref transcript
smart PTPc 256aa 1e-105
in ref transcript
Protein tyrosine phosphatase, catalytic domain.
smart PTPc 256aa 4e-97
in ref transcript
pfam I-set 92aa 9e-18
in ref transcript
Immunoglobulin I-set domain.
smart FN3 79aa 3e-13
in ref transcript
Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
pfam fn3 88aa 3e-11
in ref transcript
Fibronectin type III domain.
smart FN3 83aa 7e-11
in ref transcript
pfam I-set 84aa 1e-10
in ref transcript
smart IGc2 65aa 1e-10
in ref transcript
Immunoglobulin C-2 Type.
pfam fn3 83aa 3e-10
in ref transcript
pfam fn3 70aa 3e-07
in ref transcript
pfam fn3 83aa 7e-07
in ref transcript
COG PTP2 272aa 1e-49
in ref transcript
Protein tyrosine phosphatase [Signal transduction mechanisms].
COG PTP2 264aa 5e-47
in ref transcript
Changed!
cd FN3 98aa 4e-04
in modified transcript
PTPRN2
refseq_PTPRN2.F1 refseq_PTPRN2.R1 258 309
NCBIGene 36.3 5799
Single exon skipping, size difference: 51
Exclusion in the protein (no frameshift)
Reference transcript: NM_002847
cd PTPc 233aa 1e-79
in ref transcript
Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
smart PTPc 260aa 6e-93
in ref transcript
Protein tyrosine phosphatase, catalytic domain.
COG PTP2 275aa 7e-40
in ref transcript
Protein tyrosine phosphatase [Signal transduction mechanisms].
PTPRN2
refseq_PTPRN2.F4 refseq_PTPRN2.R4 324 411
NCBIGene 36.3 5799
Single exon skipping, size difference: 87
Exclusion in the protein (no frameshift)
Reference transcript: NM_002847
cd PTPc 233aa 1e-79
in ref transcript
Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
smart PTPc 260aa 6e-93
in ref transcript
Protein tyrosine phosphatase, catalytic domain.
COG PTP2 275aa 7e-40
in ref transcript
Protein tyrosine phosphatase [Signal transduction mechanisms].
PTPRO
refseq_PTPRO.F2 refseq_PTPRO.R2 312 396
NCBIGene 36.3 5800
Single exon skipping, size difference: 84
Exclusion in the protein (no frameshift)
Reference transcript: NM_030667
cd PTPc 229aa 3e-87
in ref transcript
Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
cd FN3 93aa 1e-04
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd FN3 83aa 0.005
in ref transcript
smart PTPc 256aa 1e-97
in ref transcript
Protein tyrosine phosphatase, catalytic domain.
pfam fn3 83aa 8e-05
in ref transcript
Fibronectin type III domain.
pfam fn3 87aa 0.002
in ref transcript
smart FN3 83aa 0.005
in ref transcript
Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
COG PTP2 234aa 1e-45
in ref transcript
Protein tyrosine phosphatase [Signal transduction mechanisms].
PTPRS
refseq_PTPRS.F2 refseq_PTPRS.R2 135 183
NCBIGene 36.3 5802
Single exon skipping, size difference: 48
Exclusion in the protein (no frameshift)
Reference transcript: NM_002850
cd PTPc 230aa 5e-95
in ref transcript
Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
cd PTPc 231aa 2e-90
in ref transcript
cd FN3 90aa 6e-14
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd FN3 89aa 2e-12
in ref transcript
cd IGcam 87aa 4e-12
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd FN3 93aa 4e-11
in ref transcript
cd IGcam 91aa 5e-11
in ref transcript
cd FN3 90aa 2e-07
in ref transcript
cd FN3 80aa 2e-07
in ref transcript
cd FN3 106aa 2e-07
in ref transcript
cd IGcam 83aa 2e-06
in ref transcript
cd FN3 83aa 6e-05
in ref transcript
cd FN3 64aa 0.001
in ref transcript
smart PTPc 256aa 1e-106
in ref transcript
Protein tyrosine phosphatase, catalytic domain.
smart PTPc 260aa 1e-100
in ref transcript
pfam I-set 92aa 2e-17
in ref transcript
Immunoglobulin I-set domain.
smart FN3 79aa 1e-12
in ref transcript
Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
smart FN3 80aa 2e-12
in ref transcript
smart FN3 83aa 5e-11
in ref transcript
smart IGc2 74aa 1e-10
in ref transcript
Immunoglobulin C-2 Type.
pfam fn3 80aa 7e-09
in ref transcript
Fibronectin type III domain.
smart IG_like 79aa 7e-09
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
pfam fn3 91aa 4e-08
in ref transcript
pfam fn3 99aa 2e-07
in ref transcript
smart FN3 79aa 6e-05
in ref transcript
smart FN3 62aa 3e-04
in ref transcript
COG PTP2 268aa 6e-51
in ref transcript
Protein tyrosine phosphatase [Signal transduction mechanisms].
COG PTP2 266aa 2e-48
in ref transcript
PTPRS
refseq_PTPRS.F3 refseq_PTPRS.R3 136 163
NCBIGene 36.3 5802
Multiple exon skipping, size difference: 27
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_002850
cd PTPc 230aa 5e-95
in ref transcript
Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
cd PTPc 231aa 2e-90
in ref transcript
cd FN3 90aa 6e-14
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd FN3 89aa 2e-12
in ref transcript
Changed!
cd IGcam 87aa 4e-12
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd FN3 93aa 4e-11
in ref transcript
cd IGcam 91aa 5e-11
in ref transcript
cd FN3 90aa 2e-07
in ref transcript
cd FN3 80aa 2e-07
in ref transcript
cd FN3 106aa 2e-07
in ref transcript
cd IGcam 83aa 2e-06
in ref transcript
cd FN3 83aa 6e-05
in ref transcript
cd FN3 64aa 0.001
in ref transcript
smart PTPc 256aa 1e-106
in ref transcript
Protein tyrosine phosphatase, catalytic domain.
smart PTPc 260aa 1e-100
in ref transcript
pfam I-set 92aa 2e-17
in ref transcript
Immunoglobulin I-set domain.
smart FN3 79aa 1e-12
in ref transcript
Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
smart FN3 80aa 2e-12
in ref transcript
smart FN3 83aa 5e-11
in ref transcript
Changed!
smart IGc2 74aa 1e-10
in ref transcript
Immunoglobulin C-2 Type.
pfam fn3 80aa 7e-09
in ref transcript
Fibronectin type III domain.
smart IG_like 79aa 7e-09
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
pfam fn3 91aa 4e-08
in ref transcript
pfam fn3 99aa 2e-07
in ref transcript
smart FN3 79aa 6e-05
in ref transcript
smart FN3 62aa 3e-04
in ref transcript
COG PTP2 268aa 6e-51
in ref transcript
Protein tyrosine phosphatase [Signal transduction mechanisms].
COG PTP2 266aa 2e-48
in ref transcript
Changed!
cd IGcam 78aa 1e-12
in modified transcript
Changed!
smart IGc2 65aa 6e-11
in modified transcript
PTPRS
refseq_PTPRS.F6 refseq_PTPRS.R6 150 177
NCBIGene 36.3 5802
Single exon skipping, size difference: 27
Exclusion in the protein (no frameshift)
Reference transcript: NM_002850
cd PTPc 230aa 5e-95
in ref transcript
Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
cd PTPc 231aa 2e-90
in ref transcript
cd FN3 90aa 6e-14
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd FN3 89aa 2e-12
in ref transcript
cd IGcam 87aa 4e-12
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd FN3 93aa 4e-11
in ref transcript
cd IGcam 91aa 5e-11
in ref transcript
cd FN3 90aa 2e-07
in ref transcript
cd FN3 80aa 2e-07
in ref transcript
Changed!
cd FN3 106aa 2e-07
in ref transcript
cd IGcam 83aa 2e-06
in ref transcript
cd FN3 83aa 6e-05
in ref transcript
cd FN3 64aa 0.001
in ref transcript
smart PTPc 256aa 1e-106
in ref transcript
Protein tyrosine phosphatase, catalytic domain.
smart PTPc 260aa 1e-100
in ref transcript
pfam I-set 92aa 2e-17
in ref transcript
Immunoglobulin I-set domain.
smart FN3 79aa 1e-12
in ref transcript
Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
smart FN3 80aa 2e-12
in ref transcript
smart FN3 83aa 5e-11
in ref transcript
smart IGc2 74aa 1e-10
in ref transcript
Immunoglobulin C-2 Type.
pfam fn3 80aa 7e-09
in ref transcript
Fibronectin type III domain.
smart IG_like 79aa 7e-09
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
pfam fn3 91aa 4e-08
in ref transcript
Changed!
pfam fn3 99aa 2e-07
in ref transcript
smart FN3 79aa 6e-05
in ref transcript
smart FN3 62aa 3e-04
in ref transcript
COG PTP2 268aa 6e-51
in ref transcript
Protein tyrosine phosphatase [Signal transduction mechanisms].
COG PTP2 266aa 2e-48
in ref transcript
Changed!
cd FN3 97aa 1e-07
in modified transcript
Changed!
smart FN3 87aa 5e-07
in modified transcript
PTPRT
refseq_PTPRT.F1 refseq_PTPRT.R1 228 285
NCBIGene 36.3 11122
Single exon skipping, size difference: 57
Exclusion in the protein (no frameshift)
Reference transcript: NM_133170
cd PTPc 228aa 6e-92
in ref transcript
Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
cd PTPc 233aa 1e-69
in ref transcript
cd MAM 158aa 5e-35
in ref transcript
Meprin, A5 protein, and protein tyrosine phosphatase Mu (MAM) domain. MAM is an extracellular domain which mediates protein-protein interactions and is found in a diverse set of proteins, many of which are known to function in cell adhesion. Members include: type IIB receptor protein tyrosine phosphatases (such as RPTPmu), meprins (plasma membrane metalloproteases), neuropilins (receptors of secreted semaphorins), and zonadhesins (sperm-specific membrane proteins which bind to the extracellular matrix of the egg). In meprin A and neuropilin-1 and -2, MAM is involved in homo-oligomerization. In RPTPmu, it has been associated with both homophilic adhesive (trans) interactions and lateral (cis) receptor oligomerization. In a GPI-anchored protein that is expressed in cells in the embryonic chicken spinal chord, MDGA1, the MAM domain has been linked to heterophilic interactions with axon-rich region.
cd FN3 83aa 3e-07
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd FN3 91aa 0.005
in ref transcript
smart PTPc 255aa 1e-102
in ref transcript
Protein tyrosine phosphatase, catalytic domain.
smart PTPc 263aa 4e-76
in ref transcript
smart MAM 161aa 1e-47
in ref transcript
Domain in meprin, A5, receptor protein tyrosine phosphatase mu (and others). Likely to have an adhesive function. Mutations in the meprin MAM domain affect noncovalent associations within meprin oligomers. In receptor tyrosine phosphatase mu-like molecules the MAM domain is important for homophilic cell-cell interactions.
pfam fn3 76aa 2e-07
in ref transcript
Fibronectin type III domain.
pfam fn3 72aa 8e-04
in ref transcript
COG PTP2 240aa 1e-46
in ref transcript
Protein tyrosine phosphatase [Signal transduction mechanisms].
COG PTP2 284aa 3e-27
in ref transcript
PTPRU
refseq_PTPRU.F1 refseq_PTPRU.R1 128 158
NCBIGene 36.3 10076
Single exon skipping, size difference: 30
Exclusion in the protein (no frameshift)
Reference transcript: NM_005704
cd PTPc 221aa 6e-83
in ref transcript
Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
cd PTPc 234aa 6e-67
in ref transcript
cd MAM 160aa 9e-41
in ref transcript
Meprin, A5 protein, and protein tyrosine phosphatase Mu (MAM) domain. MAM is an extracellular domain which mediates protein-protein interactions and is found in a diverse set of proteins, many of which are known to function in cell adhesion. Members include: type IIB receptor protein tyrosine phosphatases (such as RPTPmu), meprins (plasma membrane metalloproteases), neuropilins (receptors of secreted semaphorins), and zonadhesins (sperm-specific membrane proteins which bind to the extracellular matrix of the egg). In meprin A and neuropilin-1 and -2, MAM is involved in homo-oligomerization. In RPTPmu, it has been associated with both homophilic adhesive (trans) interactions and lateral (cis) receptor oligomerization. In a GPI-anchored protein that is expressed in cells in the embryonic chicken spinal chord, MDGA1, the MAM domain has been linked to heterophilic interactions with axon-rich region.
cd FN3 102aa 6e-09
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd FN3 96aa 2e-04
in ref transcript
pfam Y_phosphatase 222aa 1e-91
in ref transcript
Protein-tyrosine phosphatase.
smart PTPc 264aa 2e-72
in ref transcript
Protein tyrosine phosphatase, catalytic domain.
smart MAM 165aa 3e-48
in ref transcript
Domain in meprin, A5, receptor protein tyrosine phosphatase mu (and others). Likely to have an adhesive function. Mutations in the meprin MAM domain affect noncovalent associations within meprin oligomers. In receptor tyrosine phosphatase mu-like molecules the MAM domain is important for homophilic cell-cell interactions.
pfam fn3 94aa 2e-09
in ref transcript
Fibronectin type III domain.
pfam fn3 72aa 3e-04
in ref transcript
smart FN3 86aa 0.002
in ref transcript
Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
COG PTP2 219aa 3e-38
in ref transcript
Protein tyrosine phosphatase [Signal transduction mechanisms].
COG PTP2 289aa 2e-22
in ref transcript
PTPRU
refseq_PTPRU.F2 refseq_PTPRU.R2 101 119
NCBIGene 36.3 10076
Single exon skipping, size difference: 18
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_005704
Changed!
cd PTPc 221aa 6e-83
in ref transcript
Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
cd PTPc 234aa 6e-67
in ref transcript
cd MAM 160aa 9e-41
in ref transcript
Meprin, A5 protein, and protein tyrosine phosphatase Mu (MAM) domain. MAM is an extracellular domain which mediates protein-protein interactions and is found in a diverse set of proteins, many of which are known to function in cell adhesion. Members include: type IIB receptor protein tyrosine phosphatases (such as RPTPmu), meprins (plasma membrane metalloproteases), neuropilins (receptors of secreted semaphorins), and zonadhesins (sperm-specific membrane proteins which bind to the extracellular matrix of the egg). In meprin A and neuropilin-1 and -2, MAM is involved in homo-oligomerization. In RPTPmu, it has been associated with both homophilic adhesive (trans) interactions and lateral (cis) receptor oligomerization. In a GPI-anchored protein that is expressed in cells in the embryonic chicken spinal chord, MDGA1, the MAM domain has been linked to heterophilic interactions with axon-rich region.
cd FN3 102aa 6e-09
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd FN3 96aa 2e-04
in ref transcript
Changed!
pfam Y_phosphatase 222aa 1e-91
in ref transcript
Protein-tyrosine phosphatase.
smart PTPc 264aa 2e-72
in ref transcript
Protein tyrosine phosphatase, catalytic domain.
smart MAM 165aa 3e-48
in ref transcript
Domain in meprin, A5, receptor protein tyrosine phosphatase mu (and others). Likely to have an adhesive function. Mutations in the meprin MAM domain affect noncovalent associations within meprin oligomers. In receptor tyrosine phosphatase mu-like molecules the MAM domain is important for homophilic cell-cell interactions.
pfam fn3 94aa 2e-09
in ref transcript
Fibronectin type III domain.
pfam fn3 72aa 3e-04
in ref transcript
smart FN3 86aa 0.002
in ref transcript
Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Changed!
COG PTP2 219aa 3e-38
in ref transcript
Protein tyrosine phosphatase [Signal transduction mechanisms].
COG PTP2 289aa 2e-22
in ref transcript
Changed!
cd PTPc 227aa 7e-81
in modified transcript
Changed!
pfam Y_phosphatase 228aa 1e-89
in modified transcript
Changed!
COG PTP2 225aa 4e-38
in modified transcript
PXMP4
refseq_PXMP4.F2 refseq_PXMP4.R2 195 394
NCBIGene 36.3 11264
Single exon skipping, size difference: 199
Exclusion in the protein causing a frameshift
Reference transcript: NM_007238
Changed!
pfam Pmp24 171aa 4e-68
in ref transcript
Peroxisomal membrane protein 24. Peroxisomes are single membrane bound organelles, present in practically all eukaryotic cells, and involved in a variety of metabolic pathways; the deduced protein is extremely basic, a characteristic of many other peroxisomal intrinsic membrane proteins. They carry two short stretches of hydrophobic residues shown to be necessary for the correct targeting of these proteins. A sequence with these characteristics is found in human PMP24 (amino acid residues 16-30). However, in the absence of experimental data, the involvement of this domain in the targeting of PMP24 remains to be proved. PMP24 was known as Pmp27.
Changed!
pfam Pmp24 32aa 9e-15
in modified transcript
PYCARD
refseq_PYCARD.F1 refseq_PYCARD.R1 272 329
NCBIGene 36.3 29108
Single exon skipping, size difference: 57
Exclusion in the protein (no frameshift)
Reference transcript: NM_013258
pfam CARD 84aa 6e-18
in ref transcript
Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold.
pfam PAAD_DAPIN 84aa 2e-12
in ref transcript
PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain.
PYHIN1
refseq_PYHIN1.F1 refseq_PYHIN1.R1 227 352
NCBIGene 36.3 149628
Single exon skipping, size difference: 125
Exclusion of the stop codon
Reference transcript: NM_152501
pfam HIN 159aa 3e-51
in ref transcript
HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758.
pfam PAAD_DAPIN 79aa 6e-17
in ref transcript
PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain.
QRICH1
refseq_QRICH1.F2 refseq_QRICH1.R2 337 397
NCBIGene 36.3 54870
Single exon skipping, size difference: 60
Exclusion in 5'UTR
Reference transcript: NM_017730
RAB27A
refseq_RAB27A.F2 refseq_RAB27A.R2 177 266
NCBIGene 36.3 5873
Single exon skipping, size difference: 89
Exclusion in 5'UTR
Reference transcript: NM_183234
cd Rab27A 180aa 1e-105
in ref transcript
Rab27a subfamily. The Rab27a subfamily consists of Rab27a and its highly homologous isoform, Rab27b. Unlike most Rab proteins whose functions remain poorly defined, Rab27a has many known functions. Rab27a has multiple effector proteins, and depending on which effector it binds, Rab27a has different functions as well as tissue distribution and/or cellular localization. Putative functions have been assigned to Rab27a when associated with the effector proteins Slp1, Slp2, Slp3, Slp4, Slp5, DmSlp, rabphilin, Dm/Ce-rabphilin, Slac2-a, Slac2-b, Slac2-c, Noc2, JFC1, and Munc13-4. Rab27a has been associated with several human diseases, including hemophagocytic syndrome (Griscelli syndrome or GS), Hermansky-Pudlak syndrome, and choroidermia. In the case of GS, a rare, autosomal recessive disease, a Rab27a mutation is directly responsible for the disorder. When Rab27a is localized to the secretory granules of pancreatic beta cells, it is believed to mediate glucose-stimulated insulin secretion, making it a potential target for diabetes therapy. When bound to JFC1 in prostate cells, Rab27a is believed to regulate the exocytosis of prostate- specific markers. GTPase activating proteins (GAPs) interact with GTP-bound Rab and accelerate the hydrolysis of GTP to GDP. Guanine nucleotide exchange factors (GEFs) interact with GDP-bound Rabs to promote the formation of the GTP-bound state. Rabs are further regulated by guanine nucleotide dissociation inhibitors (GDIs), which facilitate Rab recycling by masking C-terminal lipid binding and promoting cytosolic localization. Most Rab GTPases contain a lipid modification site at the C-terminus, with sequence motifs CC, CXC, or CCX. Lipid binding is essential for membrane attachment, a key feature of most Rab proteins. Due to the presence of truncated sequences in this CD, the lipid modification site is not available for annotation.
smart RAB 175aa 1e-66
in ref transcript
Rab subfamily of small GTPases. Rab GTPases are implicated in vesicle trafficking.
COG COG1100 187aa 2e-24
in ref transcript
GTPase SAR1 and related small G proteins [General function prediction only].
RAB28
refseq_RAB28.F2 refseq_RAB28.R2 291 386
NCBIGene 36.3 9364
Single exon skipping, size difference: 95
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001017979
Changed!
cd Rab28 209aa 1e-106
in ref transcript
Rab28 subfamily. First identified in maize, Rab28 has been shown to be a late embryogenesis-abundant (Lea) protein that is regulated by the plant hormone abcisic acid (ABA). In Arabidopsis, Rab28 is expressed during embryo development and is generally restricted to provascular tissues in mature embryos. Unlike maize Rab28, it is not ABA-inducible. Characterization of the human Rab28 homolog revealed two isoforms, which differ by a 95-base pair insertion, producing an alternative sequence for the 30 amino acids at the C-terminus. The two human isoforms are presumbly the result of alternative splicing. Since they differ at the C-terminus but not in the GTP-binding region, they are predicted to be targeted to different cellular locations. GTPase activating proteins (GAPs) interact with GTP-bound Rab and accelerate the hydrolysis of GTP to GDP. Guanine nucleotide exchange factors (GEFs) interact with GDP-bound Rabs to promote the formation of the GTP-bound state. Rabs are further regulated by guanine nucleotide dissociation inhibitors (GDIs), which facilitate Rab recycling by masking C-terminal lipid binding and promoting cytosolic localization. Most Rab GTPases contain a lipid modification site at the C-terminus, with sequence motifs CC, CXC, or CCX. Lipid binding is essential for membrane attachment, a key feature of most Rab proteins.
smart RAB 166aa 6e-38
in ref transcript
Rab subfamily of small GTPases. Rab GTPases are implicated in vesicle trafficking.
Changed!
COG COG1100 132aa 6e-20
in ref transcript
GTPase SAR1 and related small G proteins [General function prediction only].
Changed!
cd Rab28 205aa 1e-98
in modified transcript
Changed!
COG COG1100 195aa 5e-20
in modified transcript
RAB37
refseq_RAB37.F1 refseq_RAB37.R1 110 467
NCBIGene 36.2 326624
Alternative 5-prime and 3-prime, size difference: 357
Exclusion in 3'UTR, Exclusion in 3'UTR
Reference transcript: NM_001006637
cd Rab26 190aa 1e-104
in ref transcript
Rab26 subfamily. First identified in rat pancreatic acinar cells, Rab26 is believed to play a role in recruiting mature granules to the plasma membrane upon beta-adrenergic stimulation. Rab26 belongs to the Rab functional group III, which are considered key regulators of intracellular vesicle transport during exocytosis. GTPase activating proteins (GAPs) interact with GTP-bound Rab and accelerate the hydrolysis of GTP to GDP. Guanine nucleotide exchange factors (GEFs) interact with GDP-bound Rabs to promote the formation of the GTP-bound state. Rabs are further regulated by guanine nucleotide dissociation inhibitors (GDIs), which facilitate Rab recycling by masking C-terminal lipid binding and promoting cytosolic localization. Most Rab GTPases contain a lipid modification site at the C-terminus, with sequence motifs CC, CXC, or CCX. Lipid binding is essential for membrane attachment, a key feature of most Rab proteins.
smart RAB 162aa 8e-74
in ref transcript
Rab subfamily of small GTPases. Rab GTPases are implicated in vesicle trafficking.
PTZ PTZ00099 133aa 7e-31
in ref transcript
rab6; Provisional.
RAB3IP
refseq_RAB3IP.F1 refseq_RAB3IP.R1 108 208
NCBIGene 36.3 117177
Single exon skipping, size difference: 100
Exclusion in the protein causing a frameshift
Reference transcript: NM_175623
pfam Sec2p 125aa 7e-37
in ref transcript
GDP/GTP exchange factor Sec2p. In Saccharomyces cerevisiae, Sec2p is a GDP/GTP exchange factor for Sec4p, which is required for vesicular transport at the post-Golgi stage of yeast secretion.
COG Smc 87aa 4e-04
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
RAB5C
refseq_RAB5C.F1 refseq_RAB5C.R1 257 374
NCBIGene 36.3 5878
Single exon skipping, size difference: 117
Exclusion in 5'UTR
Reference transcript: NM_201434
cd Rab5_related 163aa 4e-95
in ref transcript
Rab5-related subfamily. This subfamily includes Rab5 and Rab22 of mammals, Ypt51/Ypt52/Ypt53 of yeast, and RabF of plants. The members of this subfamily are involved in endocytosis and endocytic-sorting pathways. In mammals, Rab5 GTPases localize to early endosomes and regulate fusion of clathrin-coated vesicles to early endosomes and fusion between early endosomes. In yeast, Ypt51p family members similarly regulate membrane trafficking through prevacuolar compartments. GTPase activating proteins (GAPs) interact with GTP-bound Rab and accelerate the hydrolysis of GTP to GDP. Guanine nucleotide exchange factors (GEFs) interact with GDP-bound Rabs to promote the formation of the GTP-bound state. Rabs are further regulated by guanine nucleotide dissociation inhibitors (GDIs), which facilitate Rab recycling by masking C-terminal lipid binding and promoting cytosolic localization. Most Rab GTPases contain a lipid modification site at the C-terminus, with sequence motifs CC, CXC, or CCX. Lipid binding is essential for membrane attachment, a key feature of most Rab proteins. Due to the presence of truncated sequences in this CD, the lipid modification site is not available for annotation.
pfam Ras 162aa 2e-71
in ref transcript
Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. As regards Rab GTPases, these are important regulators of vesicle formation, motility and fusion. They share a fold in common with all Ras GTPases: this is a six-stranded beta-sheet surrounded by five alpha-helices.
COG COG1100 196aa 4e-32
in ref transcript
GTPase SAR1 and related small G proteins [General function prediction only].
ERC1
refseq_RAB6IP2.F1 refseq_RAB6IP2.R1 189 321
NCBIGene 36.3 23085
Single exon skipping, size difference: 132
Exclusion in the protein (no frameshift)
Reference transcript: NM_178040
Changed!
pfam Cast 829aa 1e-148
in ref transcript
RIM-binding protein of the cytomatrix active zone. This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
pfam RBD-FIP 41aa 1e-07
in ref transcript
FIP domain. The FIP domain is the Rab11-binding domain (RBD) at the C-terminus of a family of Rab11-interacting proteins (FIPs). The Rab proteins constitute the largest family of small GTPases (>60 members in mammals). Among them Rab11 is a well characterised regulator of endocytic and recycling pathways. Rab11 associates with a broad range of post-Golgi organelles, including recycling endosomes.
COG Smc 340aa 3e-09
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
Changed!
COG Smc 318aa 3e-08
in ref transcript
COG Smc 298aa 5e-06
in ref transcript
Changed!
pfam Cast 785aa 1e-151
in modified transcript
Changed!
COG Smc 328aa 7e-05
in modified transcript
ERC1
refseq_RAB6IP2.F3 refseq_RAB6IP2.R3 125 290
NCBIGene 36.3 23085
Alternative 5-prime, size difference: 165
Inclusion in 5'UTR
Reference transcript: NM_178040
pfam Cast 829aa 1e-148
in ref transcript
RIM-binding protein of the cytomatrix active zone. This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
pfam RBD-FIP 41aa 1e-07
in ref transcript
FIP domain. The FIP domain is the Rab11-binding domain (RBD) at the C-terminus of a family of Rab11-interacting proteins (FIPs). The Rab proteins constitute the largest family of small GTPases (>60 members in mammals). Among them Rab11 is a well characterised regulator of endocytic and recycling pathways. Rab11 associates with a broad range of post-Golgi organelles, including recycling endosomes.
COG Smc 340aa 3e-09
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
COG Smc 318aa 3e-08
in ref transcript
COG Smc 298aa 5e-06
in ref transcript
ERC1
refseq_RAB6IP2.F5 refseq_RAB6IP2.R5 242 326
NCBIGene 36.3 23085
Single exon skipping, size difference: 84
Exclusion in the protein (no frameshift)
Reference transcript: NM_178040
Changed!
pfam Cast 829aa 1e-148
in ref transcript
RIM-binding protein of the cytomatrix active zone. This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
pfam RBD-FIP 41aa 1e-07
in ref transcript
FIP domain. The FIP domain is the Rab11-binding domain (RBD) at the C-terminus of a family of Rab11-interacting proteins (FIPs). The Rab proteins constitute the largest family of small GTPases (>60 members in mammals). Among them Rab11 is a well characterised regulator of endocytic and recycling pathways. Rab11 associates with a broad range of post-Golgi organelles, including recycling endosomes.
Changed!
COG Smc 340aa 3e-09
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
COG Smc 318aa 3e-08
in ref transcript
Changed!
COG Smc 298aa 5e-06
in ref transcript
Changed!
pfam Cast 801aa 1e-154
in modified transcript
Changed!
COG Smc 365aa 7e-07
in modified transcript
Changed!
COG COG2433 126aa 0.004
in modified transcript
Uncharacterized conserved protein [Function unknown].
ERC1
refseq_RAB6IP2.F6 refseq_RAB6IP2.R6 225 290
NCBIGene 36.3 23085
Single exon skipping, size difference: 65
Inclusion in the protein causing a new stop codon
Reference transcript: NM_178040
pfam Cast 829aa 1e-148
in ref transcript
RIM-binding protein of the cytomatrix active zone. This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
Changed!
pfam RBD-FIP 41aa 1e-07
in ref transcript
FIP domain. The FIP domain is the Rab11-binding domain (RBD) at the C-terminus of a family of Rab11-interacting proteins (FIPs). The Rab proteins constitute the largest family of small GTPases (>60 members in mammals). Among them Rab11 is a well characterised regulator of endocytic and recycling pathways. Rab11 associates with a broad range of post-Golgi organelles, including recycling endosomes.
COG Smc 340aa 3e-09
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
COG Smc 318aa 3e-08
in ref transcript
COG Smc 298aa 5e-06
in ref transcript
RAD50
refseq_RAD50.F1 refseq_RAD50.R1 104 169
NCBIGene 36.3 10111
Alternative 5-prime, size difference: 65
Exclusion in the protein causing a frameshift
Reference transcript: NM_005732
Changed!
cd ABC_Rad50 150aa 4e-33
in ref transcript
The catalytic domains of Rad50 are similar to the ATP-binding cassette of ABC transporters, but are not associated with membrane-spanning domains. The conserved ATP-binding motifs common to Rad50 and the ABC transporter family include the Walker A and Walker B motifs, the Q loop, a histidine residue in the switch region, a D-loop, and a conserved LSGG sequence. This conserved sequence, LSGG, is the most specific and characteristic motif of this family and is thus known as the ABC signature sequence.
Changed!
cd ABC_Rad50 165aa 1e-27
in ref transcript
Changed!
TIGR rad50 1311aa 0.0
in ref transcript
This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Changed!
COG SbcC 676aa 8e-18
in ref transcript
ATPase involved in DNA repair [DNA replication, recombination, and repair].
Changed!
COG SbcC 870aa 5e-14
in ref transcript
Changed!
cd ABC_Rad50 98aa 2e-27
in modified transcript
Changed!
TIGR rad50 99aa 8e-53
in modified transcript
Changed!
COG SbcC 98aa 5e-12
in modified transcript
RALY
refseq_RALY.F2 refseq_RALY.R2 170 218
NCBIGene 36.3 22913
Single exon skipping, size difference: 48
Exclusion in the protein (no frameshift)
Reference transcript: NM_016732
cd RRM 67aa 3e-11
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
smart RRM_2 67aa 7e-11
in ref transcript
RNA recognition motif.
COG COG0724 95aa 1e-04
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
RANBP3
refseq_RANBP3.F2 refseq_RANBP3.R2 179 383
NCBIGene 36.3 8498
Single exon skipping, size difference: 204
Exclusion in the protein (no frameshift)
Reference transcript: NM_007322
cd RanBD 125aa 3e-26
in ref transcript
Ran-binding domain; This domain of approximately 150 residues shares structural similarity to the PH domain, but lacks detectable sequence similarity. Ran is a Ras-like nuclear small GTPase, which regulates receptor-mediated transport between the nucleus and the cytoplasm. RanGTP hydrolysis is stimulated by RanGAP together with the Ran-binding domain containing acessory proteins RanBP1 and RanBP2. These accessory proteins stabilize the active GTP-bound form of Ran . The Ran-binding domain is found in multiple copies in Nuclear pore complex proteins.
smart RanBD 111aa 3e-15
in ref transcript
Ran-binding domain. Domain of apporximately 150 residues that stabilises the GTP-bound form of Ran (the Ras-like nuclear small GTPase).
COG YRB1 71aa 1e-04
in ref transcript
Ran GTPase-activating protein (Ran-binding protein) [Intracellular trafficking and secretion].
RAPH1
refseq_RAPH1.F1 refseq_RAPH1.R1 200 356
NCBIGene 36.3 65059
Multiple exon skipping, size difference: 156
Inclusion in the protein (no stop codon or frameshift), Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_213589
cd PH_Apbb1ip 113aa 1e-54
in ref transcript
Apbb1ip (Amyloid beta (A4) Precursor protein-Binding, family B, member 1 Interacting Protein) pleckstrin homology (PH) domain. Apbb1ip consists of a Ras-associated domain and a PH domain. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
cd GRB7_RA 82aa 6e-22
in ref transcript
Grb7_RA The RA (RAS-associated like) domain of Grb7. Grb7 is an adaptor molecule that mediates signal transduction from multiple cell surface receptors to various downstream signaling pathways. Grb7 and its related family members Grb10 and Grb14 share a conserved domain architecture that includes an amino-terminal proline-rich region, a central segment termed the GM region (for Grb and Mig) which includes the RA, PIR, and PH domains, and a carboxyl-terminal SH2 domain. Grb7/10/14 family proteins are phosphorylated on serine/threonine as well as tyrosine residues and are mainly localized to the cytoplasm.
smart PH 109aa 5e-10
in ref transcript
Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
smart RA 85aa 5e-10
in ref transcript
Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS); putative RasGTP effectors in other cases. Kalhammer et al. have shown that not all RA domains bind RasGTP. Predicted structure similar to that determined, and that of the RasGTP-binding domain of Raf kinase. Predicted RA domains in PLC210 and nore1 found to bind RasGTP. Included outliers (Grb7, Grb14, adenylyl cyclases etc.).
RAPSN
refseq_RAPSN.F1 refseq_RAPSN.R1 151 328
NCBIGene 36.3 5913
Multiple exon skipping, size difference: 177
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_005055
cd TPR 112aa 9e-07
in ref transcript
Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
pfam Rapsyn_N 80aa 2e-31
in ref transcript
Rapsyn N-terminal myristoylation and linker region. Neuromuscular junction formation relies upon the clustering of acetylcholine receptors and other proteins in the muscle membrane. Rapsyn is a peripheral membrane protein that is selectively concentrated at the neuromuscular junction and is essential for the formation of synaptic acetylcholine receptor aggregates. Acetylcholine receptors fail to aggregate beneath nerve terminals in mice where rapsyn has been knocked out. The N-terminal six amino acids of rapsyn are its myristoylation site, and myristoylation is necessary for the targeting of the protein to the membrane.
COG COG5540 41aa 1e-04
in ref transcript
RING-finger-containing ubiquitin ligase [Posttranslational modification, protein turnover, chaperones].
RASA1
refseq_RASA1.F1 refseq_RASA1.R1 118 183
NCBIGene 36.3 5921
Alternative 5-prime, size difference: 65
Inclusion in the protein causing a new stop codon
Reference transcript: NM_002890
Changed!
cd RasGAP_p120GAP 315aa 1e-166
in ref transcript
p120GAP is a negative regulator of Ras that stimulates hydrolysis of bound GTP to GDP. Once the Ras regulator p120GAP, a member of the GAP protein family, is recruited to the membrane, it is transiently immobilized to interact with Ras-GTP. The down regulation of Ras by p120GAP is a critical step in the regulation of many cellular processes, which is disrupted in approximately 30% of human cancers. p120GAP contains SH2, SH3, PH, calcium- and lipid-binding domains, suggesting its involvement in a complex network of cellular interactions in vivo.
Changed!
cd PH_RasGAP_CG5898 80aa 2e-20
in ref transcript
RAS GTPase-activating protein (GAP) CG5898 Pleckstrin homology (PH) domain. This protein has a domain architecture of SH2-SH3-SH2-PH-C2-Ras_GAP. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinsases, regulators of G-proteins, endocytotic GTPAses, adaptors, a well as cytoskeletal associated molecules and in lipid associated enzymes.
Changed!
cd SH2 91aa 5e-16
in ref transcript
Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
Changed!
cd SH2 80aa 4e-15
in ref transcript
Changed!
cd C2 98aa 1e-09
in ref transcript
Protein kinase C conserved region 2 (CalB); Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands.
Changed!
cd SH3 44aa 6e-04
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
Changed!
smart RasGAP 347aa 1e-107
in ref transcript
GTPase-activator protein for Ras-like GTPases. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position. Improved domain limits from structure.
Changed!
pfam SH2 76aa 8e-23
in ref transcript
SH2 domain.
Changed!
pfam SH2 76aa 2e-20
in ref transcript
Changed!
smart C2 95aa 2e-09
in ref transcript
Protein kinase C conserved region 2 (CalB). Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotamins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates, and intracellular proteins. Unusual occurrence in perforin. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands. SMART detects C2 domains using one or both of two profiles.
Changed!
smart PH 84aa 4e-09
in ref transcript
Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
Changed!
pfam SH3_1 53aa 2e-06
in ref transcript
SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organisation. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel.
Changed!
COG IQG1 291aa 7e-04
in ref transcript
Protein involved in regulation of cellular morphogenesis/cytokinesis [Cell division and chromosome partitioning / Signal transduction mechanisms].
Changed!
COG COG5038 99aa 0.003
in ref transcript
Ca2+-dependent lipid-binding protein, contains C2 domain [General function prediction only].
RASGEF1C
refseq_RASGEF1C.F1 refseq_RASGEF1C.R1 171 209
NCBIGene 36.2 255426
Single exon skipping, size difference: 38
Inclusion in the protein causing a frameshift
Reference transcript: NM_175062
Changed!
cd RasGEF 243aa 4e-48
in ref transcript
Guanine nucleotide exchange factor for Ras-like small GTPases. Small GTP-binding proteins of the Ras superfamily function as molecular switches in fundamental events such as signal transduction, cytoskeleton dynamics and intracellular trafficking. Guanine-nucleotide-exchange factors (GEFs) positively regulate these GTP-binding proteins in response to a variety of signals. GEFs catalyze the dissociation of GDP from the inactive GTP-binding proteins. GTP can then bind and induce structural changes that allow interaction with effectors.
cd REM 109aa 4e-11
in ref transcript
Guanine nucleotide exchange factor for Ras-like GTPases; N-terminal domain (RasGef_N), also called REM domain (Ras exchanger motif). This domain is common in nucleotide exchange factors for Ras-like small GTPases and is typically found immediately N-terminal to the RasGef (Cdc25-like) domain. REM contacts the GTPase and is assumed to participate in the catalytic activity of the exchange factor. Proteins with the REM domain include Sos1 and Sos2, which relay signals from tyrosine-kinase mediated signalling to Ras, RasGRP1-4, RasGRF1,2, CNrasGEF, and RAP-specific nucleotide exchange factors, to name a few.
Changed!
smart RasGEF 248aa 2e-53
in ref transcript
Guanine nucleotide exchange factor for Ras-like small GTPases.
pfam RasGEF_N 89aa 4e-07
in ref transcript
Guanine nucleotide exchange factor for Ras-like GTPases; N-terminal motif. A subset of guanine nucleotide exchange factor for Ras-like small GTPases appear to possess this motif/domain N-terminal to the RasGef (Cdc25-like) domain.
Changed!
cd RasGEF 147aa 1e-36
in modified transcript
Changed!
smart RasGEF 147aa 3e-39
in modified transcript
RASGRP2
refseq_RASGRP2.F2 refseq_RASGRP2.R2 257 363
NCBIGene 36.2 10235
Single exon skipping, size difference: 106
Exclusion in the protein causing a frameshift
Reference transcript: NM_005825
Changed!
cd RasGEF 234aa 1e-62
in ref transcript
Guanine nucleotide exchange factor for Ras-like small GTPases. Small GTP-binding proteins of the Ras superfamily function as molecular switches in fundamental events such as signal transduction, cytoskeleton dynamics and intracellular trafficking. Guanine-nucleotide-exchange factors (GEFs) positively regulate these GTP-binding proteins in response to a variety of signals. GEFs catalyze the dissociation of GDP from the inactive GTP-binding proteins. GTP can then bind and induce structural changes that allow interaction with effectors.
Changed!
cd REM 112aa 2e-15
in ref transcript
Guanine nucleotide exchange factor for Ras-like GTPases; N-terminal domain (RasGef_N), also called REM domain (Ras exchanger motif). This domain is common in nucleotide exchange factors for Ras-like small GTPases and is typically found immediately N-terminal to the RasGef (Cdc25-like) domain. REM contacts the GTPase and is assumed to participate in the catalytic activity of the exchange factor. Proteins with the REM domain include Sos1 and Sos2, which relay signals from tyrosine-kinase mediated signalling to Ras, RasGRP1-4, RasGRF1,2, CNrasGEF, and RAP-specific nucleotide exchange factors, to name a few.
Changed!
cd C1 50aa 4e-11
in ref transcript
Protein kinase C conserved region 1 (C1) . Cysteine-rich zinc binding domain. Some members of this domain family bind phorbol esters and diacylglycerol, some are reported to bind RasGTP. May occur in tandem arrangement. Diacylglycerol (DAG) is a second messenger, released by activation of Phospholipase D. Phorbol Esters (PE) can act as analogues of DAG and mimic its downstream effects in, for example, tumor promotion. Protein Kinases C are activated by DAG/PE, this activation is mediated by their N-terminal conserved region (C1). DAG/PE binding may be phospholipid dependent. C1 domains may also mediate DAG/PE signals in chimaerins (a family of Rac GTPase activating proteins), RasGRPs (exchange factors for Ras/Rap1), and Munc13 isoforms (scaffolding proteins involved in exocytosis).
Changed!
cd EFh 52aa 1e-05
in ref transcript
EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
Changed!
smart RasGEF 234aa 8e-69
in ref transcript
Guanine nucleotide exchange factor for Ras-like small GTPases.
Changed!
smart RasGEFN 115aa 2e-20
in ref transcript
Guanine nucleotide exchange factor for Ras-like GTPases; N-terminal motif. A subset of guanine nucleotide exchange factor for Ras-like small GTPases appear to possess this domain N-terminal to the RasGef (Cdc25-like) domain. The recent crystal structureof Sos shows that this domain is alpha-helical and plays a "purely structural role" (Nature 394, 337-343).
Changed!
pfam C1_1 51aa 4e-10
in ref transcript
Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain.
Changed!
PRK PRK12309 60aa 1e-04
in ref transcript
Changed!
cd RASSF1_RA 94aa 5e-35
in ref transcript
RASSF1 (also known as RASSF3 and NORE1) is a tumour suppressor protein with a C-terminal Ras-associating (RA) domain that binds Ras. RASSF1 also binds the proapoptotic protein kinase MST1 and is thus thought to regulate the proapoptotic signalling pathway. RASSF1 also associates with microtubule-associated proteins like MAP1B and regulates tubulin polymerization. RASSF1 also binds CDC20 and regulates mitosis by inhibiting the anaphase-promoting complex and preventing degradation of cyclin A and cyclin B until the spindle checkpoint becomes fully operational.
cd C1 38aa 9e-05
in ref transcript
Protein kinase C conserved region 1 (C1) . Cysteine-rich zinc binding domain. Some members of this domain family bind phorbol esters and diacylglycerol, some are reported to bind RasGTP. May occur in tandem arrangement. Diacylglycerol (DAG) is a second messenger, released by activation of Phospholipase D. Phorbol Esters (PE) can act as analogues of DAG and mimic its downstream effects in, for example, tumor promotion. Protein Kinases C are activated by DAG/PE, this activation is mediated by their N-terminal conserved region (C1). DAG/PE binding may be phospholipid dependent. C1 domains may also mediate DAG/PE signals in chimaerins (a family of Rac GTPase activating proteins), RasGRPs (exchange factors for Ras/Rap1), and Munc13 isoforms (scaffolding proteins involved in exocytosis).
Changed!
smart RA 92aa 3e-14
in ref transcript
Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS); putative RasGTP effectors in other cases. Kalhammer et al. have shown that not all RA domains bind RasGTP. Predicted structure similar to that determined, and that of the RasGTP-binding domain of Raf kinase. Predicted RA domains in PLC210 and nore1 found to bind RasGTP. Included outliers (Grb7, Grb14, adenylyl cyclases etc.).
pfam C1_1 38aa 4e-05
in ref transcript
Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain.
Changed!
cd RASSF1_RA 59aa 7e-20
in modified transcript
Changed!
smart RA 56aa 4e-07
in modified transcript
RBBP6
refseq_RBBP6.F2 refseq_RBBP6.R2 207 309
NCBIGene 36.3 5930
Single exon skipping, size difference: 102
Exclusion in the protein (no frameshift)
Reference transcript: NM_006910
cd RING 44aa 6e-08
in ref transcript
RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
pfam DWNN 73aa 7e-28
in ref transcript
DWNN domain. DWNN is a ubiquitin like domain found at the N terminus of the RBBP6 family of splicing-associated proteins. The DWNN domain is independently expressed in higher vertebrates so it may function as a novel ubiquitin-like modifier of other proteins.
smart RING 41aa 2e-08
in ref transcript
Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
smart ZnF_C2HC 16aa 0.005
in ref transcript
zinc finger.
COG COG5222 321aa 9e-40
in ref transcript
Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only].
RBBP8
refseq_RBBP8.F1 refseq_RBBP8.R1 145 226
NCBIGene 36.3 5932
Alternative 5-prime, size difference: 81
Inclusion in 5'UTR
Reference transcript: NM_203291
pfam CtIP_N 120aa 6e-48
in ref transcript
Tumour-suppressor protein CtIP N-terminal domain. CtIP is predominantly a nuclear protein that complexes with both BRCA1 and the BRCA1-associated RING domain protein (BARD1). At the protein level, CtIP expression varies with cell cycle progression in a pattern identical to that of BRCA1. Thus, the steady-state levels of CtIP polypeptides, which remain low in resting cells and G1 cycling cells, increase dramatically as Dividing cells traverse the G1/S boundary. CtIP can potentially modulate the functions ascribed to BRCA1 in transcriptional regulation, DNA repair, and/or cell cycle checkpoint control. This N-terminal domain carries a coiled-coil region and is essential for homodimerisation of the protein. The C-terminal domain is family CtIP_C and carries functionally important CxxC and RHR motifs, absence of which lead cells to grow slowly and show hypersensitivity to genotoxins.
pfam SAE2 88aa 5e-14
in ref transcript
DNA repair protein SAE2/CtIP. SAE2 is a protein involved in repairing meiotic and mitotic double-strand breaks in DNA. It has been shown to negatively regulate DNA damage checkpoint signalling. SAE2 is homologous to the CtIP proteins in mammals.
PRK PRK12704 130aa 0.003
in ref transcript
phosphodiesterase; Provisional.
RBBP8
refseq_RBBP8.F3 refseq_RBBP8.R3 100 197
NCBIGene 36.3 5932
Single exon skipping, size difference: 97
Exclusion in the protein causing a frameshift
Reference transcript: NM_002894
pfam CtIP_N 120aa 6e-48
in ref transcript
Tumour-suppressor protein CtIP N-terminal domain. CtIP is predominantly a nuclear protein that complexes with both BRCA1 and the BRCA1-associated RING domain protein (BARD1). At the protein level, CtIP expression varies with cell cycle progression in a pattern identical to that of BRCA1. Thus, the steady-state levels of CtIP polypeptides, which remain low in resting cells and G1 cycling cells, increase dramatically as Dividing cells traverse the G1/S boundary. CtIP can potentially modulate the functions ascribed to BRCA1 in transcriptional regulation, DNA repair, and/or cell cycle checkpoint control. This N-terminal domain carries a coiled-coil region and is essential for homodimerisation of the protein. The C-terminal domain is family CtIP_C and carries functionally important CxxC and RHR motifs, absence of which lead cells to grow slowly and show hypersensitivity to genotoxins.
Changed!
pfam SAE2 88aa 5e-14
in ref transcript
DNA repair protein SAE2/CtIP. SAE2 is a protein involved in repairing meiotic and mitotic double-strand breaks in DNA. It has been shown to negatively regulate DNA damage checkpoint signalling. SAE2 is homologous to the CtIP proteins in mammals.
PRK PRK12704 130aa 0.003
in ref transcript
phosphodiesterase; Provisional.
RBCK1
refseq_RBCK1.F1 refseq_RBCK1.R1 127 272
NCBIGene 36.3 10616
Single exon skipping, size difference: 145
Exclusion in the protein causing a frameshift
Reference transcript: NM_031229
Changed!
cd Hoil1_N 75aa 4e-35
in ref transcript
HOIL1_N HOIL-1 (heme-oxidized IRP2 ubiquitin ligase-1) is an E3 ubiquitin-protein ligase that recognizes heme-oxidized IRP2 (iron regulatory protein2) and is thought to affect the turnover of oxidatively damaged proteins. Hoil-1 has an amino-terminal ubiquitin-like domain as well as an RBR signature consisting of two RING finger domains separated by an IBR/DRIL domain.
RBM10
refseq_RBM10.F1 refseq_RBM10.R1 152 383
NCBIGene 36.3 8241
Single exon skipping, size difference: 231
Exclusion in the protein (no frameshift)
Reference transcript: NM_005676
Changed!
cd RRM 76aa 9e-09
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
cd RRM 81aa 1e-04
in ref transcript
pfam G-patch 44aa 3e-13
in ref transcript
G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines.
Changed!
smart RRM 67aa 5e-09
in ref transcript
RNA recognition motif.
smart ZnF_RBZ 25aa 2e-07
in ref transcript
Zinc finger domain. Zinc finger domain in Ran-binding proteins (RanBPs), and other proteins. In RanBPs, this domain binds RanGDP.
Changed!
TIGR ELAV_HUD_SF 199aa 1e-04
in ref transcript
These proteins contain 3 RNA-recognition motifs (rrm: pfam00076).
smart RRM 77aa 3e-04
in ref transcript
Changed!
COG COG0724 104aa 0.002
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
Changed!
cd RRM 72aa 2e-05
in modified transcript
Changed!
smart RRM 46aa 2e-06
in modified transcript
RBM3
refseq_RBM3.F2 refseq_RBM3.R2 120 389
NCBIGene 36.2 5935
Single exon skipping, size difference: 269
Inclusion in the protein causing a new stop codon
Reference transcript: NM_006743
Changed!
cd RRM 75aa 5e-21
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
Changed!
smart RRM 71aa 2e-20
in ref transcript
RNA recognition motif.
Changed!
COG COG0724 79aa 1e-11
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
Changed!
cd RRM 68aa 1e-18
in modified transcript
Changed!
smart RRM 65aa 4e-18
in modified transcript
Changed!
COG COG0724 74aa 5e-13
in modified transcript
RBM38
refseq_RBM38.F1 refseq_RBM38.R1 132 187
NCBIGene 36.3 55544
Single exon skipping, size difference: 55
Exclusion in the protein causing a frameshift
Reference transcript: NM_017495
cd RRM 74aa 6e-17
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
smart RRM 70aa 3e-18
in ref transcript
RNA recognition motif.
COG COG0724 91aa 1e-14
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
RBM39
refseq_RBM39.F2 refseq_RBM39.R2 149 222
NCBIGene 36.2 9584
Single exon skipping, size difference: 73
Inclusion in the protein causing a new stop codon
Reference transcript: NM_184234
Changed!
cd RRM 73aa 1e-20
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
Changed!
cd RRM 73aa 3e-09
in ref transcript
Changed!
TIGR SF-CC1 427aa 1e-152
in ref transcript
A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.
Changed!
COG COG0724 99aa 9e-15
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
RBM39
refseq_RBM39.F3 refseq_RBM39.R3 119 191
NCBIGene 36.2 9584
Single exon skipping, size difference: 72
Inclusion in the protein causing a new stop codon
Reference transcript: NM_184234
Changed!
cd RRM 73aa 1e-20
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
Changed!
cd RRM 73aa 3e-09
in ref transcript
Changed!
TIGR SF-CC1 427aa 1e-152
in ref transcript
A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.
Changed!
COG COG0724 99aa 9e-15
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
RBM39
refseq_RBM39.F5 refseq_RBM39.R5 101 119
NCBIGene 36.3 9584
Alternative 3-prime, size difference: 18
Exclusion in the protein (no frameshift)
Reference transcript: NM_184234
cd RRM 73aa 1e-20
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
cd RRM 73aa 3e-09
in ref transcript
Changed!
TIGR SF-CC1 427aa 1e-152
in ref transcript
A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.
COG COG0724 99aa 9e-15
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
Changed!
TIGR SF-CC1 421aa 1e-150
in modified transcript
RBM9
refseq_RBM9.F1 refseq_RBM9.R1 206 258
NCBIGene 36.3 23543
Single exon skipping, size difference: 40
Exclusion in the protein causing a frameshift
Reference transcript: NM_001082578
cd RRM 72aa 2e-20
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
smart RRM_2 72aa 2e-20
in ref transcript
RNA recognition motif.
COG COG0724 86aa 3e-12
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
RBM9
refseq_RBM9.F3 refseq_RBM9.R3 105 145
NCBIGene 36.3 23543
Single exon skipping, size difference: 40
Exclusion in the protein causing a frameshift
Reference transcript: NM_001082578
cd RRM 72aa 2e-20
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
smart RRM_2 72aa 2e-20
in ref transcript
RNA recognition motif.
COG COG0724 86aa 3e-12
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
RBMS1
refseq_RBMS1.F1 refseq_RBMS1.R1 189 299
NCBIGene 36.3 5937
Single exon skipping, size difference: 110
Inclusion in the protein causing a new stop codon
Reference transcript: NM_016836
Changed!
cd RRM 66aa 8e-14
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
Changed!
cd RRM 51aa 1e-11
in ref transcript
Changed!
TIGR ELAV_HUD_SF 182aa 5e-22
in ref transcript
These proteins contain 3 RNA-recognition motifs (rrm: pfam00076).
Changed!
COG COG0724 175aa 3e-08
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
Changed!
COG COG0724 113aa 1e-07
in ref transcript
RBMS1
refseq_RBMS1.F3 refseq_RBMS1.R3 100 148
NCBIGene 36.3 5937
Single exon skipping, size difference: 48
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_016836
cd RRM 66aa 8e-14
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
cd RRM 51aa 1e-11
in ref transcript
TIGR ELAV_HUD_SF 182aa 5e-22
in ref transcript
These proteins contain 3 RNA-recognition motifs (rrm: pfam00076).
Changed!
COG COG0724 175aa 3e-08
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
COG COG0724 113aa 1e-07
in ref transcript
Changed!
COG COG0724 187aa 2e-08
in modified transcript
RBMS3
refseq_RBMS3.F2 refseq_RBMS3.R2 112 160
NCBIGene 36.3 27303
Single exon skipping, size difference: 48
Exclusion in the protein (no frameshift)
Reference transcript: NM_001003793
cd RRM 66aa 5e-14
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
cd RRM 51aa 7e-12
in ref transcript
TIGR ELAV_HUD_SF 166aa 9e-23
in ref transcript
These proteins contain 3 RNA-recognition motifs (rrm: pfam00076).
COG COG0724 143aa 2e-08
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
Changed!
COG COG0724 119aa 7e-07
in modified transcript
RBMS3
refseq_RBMS3.F4 refseq_RBMS3.R4 111 162
NCBIGene 36.3 27303
Single exon skipping, size difference: 51
Exclusion in the protein (no frameshift)
Reference transcript: NM_001003793
cd RRM 66aa 5e-14
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
cd RRM 51aa 7e-12
in ref transcript
TIGR ELAV_HUD_SF 166aa 9e-23
in ref transcript
These proteins contain 3 RNA-recognition motifs (rrm: pfam00076).
COG COG0724 143aa 2e-08
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
Changed!
TIGR PABP-1234 333aa 1e-18
in modified transcript
There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, and of unknown tissue range.
Changed!
COG COG0724 119aa 6e-07
in modified transcript
RBPMS
refseq_RBPMS.F1 refseq_RBPMS.R1 144 230
NCBIGene 36.3 11030
Single exon skipping, size difference: 86
Exclusion of the stop codon
Reference transcript: NM_001008711
cd RRM 74aa 1e-08
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
pfam RRM_1 62aa 9e-09
in ref transcript
RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternative splicing, and protein components of snRNPs. The motif also appears in a few single stranded DNA binding proteins. The RRM structure consists of four strands and two helices arranged in an alpha/beta sandwich, with a third helix present during RNA binding in some cases The C-terminal beta strand (4th strand) and final helix are hard to align and have been omitted in the SEED alignment The LA proteins have a N terminus rrm which is included in the seed. There is a second region towards the C terminus that has some features of a rrm but does not appear to have the important structural core of a rrm. The LA proteins are one of the main autoantigens in Systemic lupus erythematosus (SLE), an autoimmune disease.
RBPMS
refseq_RBPMS.F2 refseq_RBPMS.R2 149 355
NCBIGene 36.3 11030
Single exon skipping, size difference: 206
Inclusion in 3'UTR
Reference transcript: NM_001008711
cd RRM 74aa 1e-08
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
pfam RRM_1 62aa 9e-09
in ref transcript
RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternative splicing, and protein components of snRNPs. The motif also appears in a few single stranded DNA binding proteins. The RRM structure consists of four strands and two helices arranged in an alpha/beta sandwich, with a third helix present during RNA binding in some cases The C-terminal beta strand (4th strand) and final helix are hard to align and have been omitted in the SEED alignment The LA proteins have a N terminus rrm which is included in the seed. There is a second region towards the C terminus that has some features of a rrm but does not appear to have the important structural core of a rrm. The LA proteins are one of the main autoantigens in Systemic lupus erythematosus (SLE), an autoimmune disease.
RBPJ
refseq_RBPSUH.F1 refseq_RBPSUH.R1 124 240
NCBIGene 36.3 3516
Single exon skipping, size difference: 116
Exclusion of the protein initiation site
Reference transcript: NM_005349
cd IPT_RBP-Jkappa 89aa 2e-45
in ref transcript
IPT domain of the recombination signal Jkappa binding protein (RBP-Jkappa). RBP-J kappa, was initially considered to be involved in V(D)J recombination because of its DNA binding specificity and structural similarity to site-specific recombinases known as the integrase family. Further studies indicated that RBP-J kappa functions as a repressor of transcription, via destabilization of the general transcription factor IID and recruitment of histone deacetylase complexes.
pfam Beta-trefoil 150aa 4e-68
in ref transcript
Beta-trefoil. Members of this family of DNA binding domains adopt a beta-trefoil fold, that is, a capped beta-barrel with internal pseudo threefold symmetry. In the DNA-binding protein LAG-1, it also is the site of mutually exclusive interactions with NotchIC (and the viral protein EBNA2) and corepressors (SMRT/N-Cor and CIR).
pfam LAG1-DNAbind 132aa 3e-55
in ref transcript
LAG1, DNA binding. Members of this family are found in various eukaryotic hypothetical proteins and in the DNA-binding protein LAG-1. They adopt a beta sandwich structure, with nine strands in two beta-sheets, in a Greek-key topology, and allow for DNA binding.
pfam TIG 81aa 1e-07
in ref transcript
IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: This family does not currently recognise a significant number of members.
RBPJ
refseq_RBPSUH.F3 refseq_RBPSUH.R1 144 327
NCBIGene 36.3 3516
Single exon skipping, size difference: 183
Inclusion in the protein causing a new stop codon
Reference transcript: NM_015874
Changed!
cd IPT_RBP-Jkappa 89aa 2e-45
in ref transcript
IPT domain of the recombination signal Jkappa binding protein (RBP-Jkappa). RBP-J kappa, was initially considered to be involved in V(D)J recombination because of its DNA binding specificity and structural similarity to site-specific recombinases known as the integrase family. Further studies indicated that RBP-J kappa functions as a repressor of transcription, via destabilization of the general transcription factor IID and recruitment of histone deacetylase complexes.
Changed!
pfam Beta-trefoil 150aa 3e-68
in ref transcript
Beta-trefoil. Members of this family of DNA binding domains adopt a beta-trefoil fold, that is, a capped beta-barrel with internal pseudo threefold symmetry. In the DNA-binding protein LAG-1, it also is the site of mutually exclusive interactions with NotchIC (and the viral protein EBNA2) and corepressors (SMRT/N-Cor and CIR).
Changed!
pfam LAG1-DNAbind 132aa 3e-55
in ref transcript
LAG1, DNA binding. Members of this family are found in various eukaryotic hypothetical proteins and in the DNA-binding protein LAG-1. They adopt a beta sandwich structure, with nine strands in two beta-sheets, in a Greek-key topology, and allow for DNA binding.
Changed!
pfam TIG 81aa 2e-07
in ref transcript
IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: This family does not currently recognise a significant number of members.
RCCD1
refseq_RCCD1.F1 refseq_RCCD1.R1 324 408
NCBIGene 36.3 91433
Single exon skipping, size difference: 84
Exclusion in 5'UTR
Reference transcript: NM_033544
COG ATS1 181aa 1e-12
in ref transcript
Alpha-tubulin suppressor and related RCC1 domain-containing proteins [Cell division and chromosome partitioning / Cytoskeleton].
RECQL5
refseq_RECQL5.F2 refseq_RECQL5.R2 197 278
NCBIGene 36.2 9400
Alternative 3-prime, size difference: 81
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_004259
cd HELICc 117aa 4e-20
in ref transcript
Helicase superfamily c-terminal domain; associated with DEXDc-, DEAD-, and DEAH-box proteins, yeast initiation factor 4A, Ski2p, and Hepatitis C virus NS3 helicases; this domain is found in a wide variety of helicases and helicase related proteins; may not be an autonomously folding unit, but an integral part of the helicase; 4 helicase superfamilies at present according to the organization of their signature motifs; all helicases share the ability to unwind nucleic acid duplexes with a distinct directional polarity; they utilize the free energy from nucleoside triphosphate hydrolysis to fuel their translocation along DNA, unwinding the duplex in the process.
Changed!
cd DEXDc 125aa 3e-09
in ref transcript
DEAD-like helicases superfamily. A diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.
Changed!
TIGR recQ 397aa 2e-92
in ref transcript
The ATP-dependent DNA helicase RecQ of E. coli is about 600 residues long. This model represents bacterial proteins with a high degree of similarity in domain architecture and in primary sequence to E. coli RecQ. The model excludes eukaryotic and archaeal proteins with RecQ-like regions, as well as more distantly related bacterial helicases related to RecQ.
pfam RecQ5 205aa 7e-87
in ref transcript
RecQ helicase protein-like 5 (RecQ5). This family represents a conserved region approximately 200 residues long within eukaryotic RecQ helicase protein-like 5 (RecQ5). The RecQ helicases have been implicated in DNA repair and recombination, and RecQ5 may have an important role in DNA metabolism.
Changed!
COG RecQ 396aa 4e-96
in ref transcript
Superfamily II DNA helicase [DNA replication, recombination, and repair].
Changed!
cd DEXDc 149aa 1e-16
in modified transcript
Changed!
TIGR recQ 424aa 1e-109
in modified transcript
Changed!
COG RecQ 423aa 1e-112
in modified transcript
Changed!
PRK PRK07003 150aa 0.010
in modified transcript
DNA polymerase III subunits gamma and tau; Validated.
RFC2
refseq_RFC2.F1 refseq_RFC2.R1 181 283
NCBIGene 36.3 5982
Single exon skipping, size difference: 102
Exclusion in the protein (no frameshift)
Reference transcript: NM_181471
Changed!
cd AAA 135aa 1e-18
in ref transcript
The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
pfam Rep_fac_C 89aa 2e-24
in ref transcript
Replication factor C. This family includes several subunits of DNA replication factor C.
Changed!
TIGR dnaX_nterm 278aa 6e-21
in ref transcript
This model represents the well-conserved first ~ 365 amino acids of the translation of the dnaX gene. The full-length product of the dnaX gene in the model bacterium E. coli is the DNA polymerase III tau subunit. A translational frameshift leads to early termination and a truncated protein subunit gamma, about 1/3 shorter than tau and present in roughly equal amounts. This frameshift mechanism is not necessarily universal for species with DNA polymerase III but appears conserved in the exterme thermophile Thermus thermophilis.
Changed!
PRK rfc 310aa 3e-99
in ref transcript
replication factor C small subunit; Reviewed.
Changed!
cd AAA 101aa 3e-08
in modified transcript
Changed!
TIGR dnaX_nterm 244aa 2e-12
in modified transcript
Changed!
PRK rfc 276aa 1e-81
in modified transcript
RFC3
refseq_RFC3.F2 refseq_RFC3.R2 139 172
NCBIGene 36.2 5983
Alternative 3-prime, size difference: 33
Exclusion in the protein (no frameshift)
Reference transcript: NM_002915
cd AAA 159aa 3e-08
in ref transcript
The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
pfam RFC-E_C 95aa 2e-42
in ref transcript
Clamp-loader complex subunit E C-terminus. This is the C-terminal domain of RFC-E, one of the five RFC proteins of the clamp loader complex (replication factor-C, RFC) which binds to the DNA sliding clamp (proliferating cell nuclear antigen, PCNA). The five modules of RFC assemble into a right-handed spiral, which results in only three of the five RFC subunits (RFC-A, RFC-B and RFC-C) making contact with PCNA, leaving a wedge-shaped gap between RFC-E and the PCNA clamp-loader complex. The C-terminal is vital for the correct orientation of RFC-E with respect to RFC-A.
Changed!
TIGR holB 169aa 1e-08
in ref transcript
At position 126-127 of the seed alignment, this family lacks the HM motif of gamma/tau; at 132 it has a near-invariant A vs. an invariant F in gamma/tau.
Changed!
TIGR dnaX_nterm 173aa 0.003
in ref transcript
This model represents the well-conserved first ~ 365 amino acids of the translation of the dnaX gene. The full-length product of the dnaX gene in the model bacterium E. coli is the DNA polymerase III tau subunit. A translational frameshift leads to early termination and a truncated protein subunit gamma, about 1/3 shorter than tau and present in roughly equal amounts. This frameshift mechanism is not necessarily universal for species with DNA polymerase III but appears conserved in the exterme thermophile Thermus thermophilis.
Changed!
PRK PRK12402 337aa 1e-40
in ref transcript
replication factor C small subunit 2; Reviewed.
Changed!
TIGR holB 168aa 3e-08
in modified transcript
Changed!
PRK PRK12402 326aa 4e-38
in modified transcript
RFC5
refseq_RFC5.F1 refseq_RFC5.R1 106 470
NCBIGene 36.3 5985
Single exon skipping, size difference: 364
Inclusion in the protein causing a frameshift
Reference transcript: NM_007370
Changed!
cd AAA 142aa 5e-16
in ref transcript
The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
Changed!
TIGR dnaX_nterm 267aa 2e-26
in ref transcript
This model represents the well-conserved first ~ 365 amino acids of the translation of the dnaX gene. The full-length product of the dnaX gene in the model bacterium E. coli is the DNA polymerase III tau subunit. A translational frameshift leads to early termination and a truncated protein subunit gamma, about 1/3 shorter than tau and present in roughly equal amounts. This frameshift mechanism is not necessarily universal for species with DNA polymerase III but appears conserved in the exterme thermophile Thermus thermophilis.
Changed!
PRK rfc 313aa 4e-92
in ref transcript
replication factor C small subunit; Reviewed.
TRIM13
refseq_RFP2.F1 refseq_RFP2.R1 106 220
NCBIGene 36.3 10206
Single exon skipping, size difference: 114
Exclusion of the protein initiation site
Reference transcript: NM_001007278
cd RING 53aa 1e-06
in ref transcript
RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
smart RING 48aa 2e-07
in ref transcript
Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
pfam zf-B_box 42aa 2e-06
in ref transcript
B-box zinc finger.
COG COG5222 84aa 0.002
in ref transcript
Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only].
RFWD2
refseq_RFWD2.F1 refseq_RFWD2.R1 266 338
NCBIGene 36.3 64326
Single exon skipping, size difference: 60
Exclusion in the protein (no frameshift)
Reference transcript: NM_022457
cd WD40 307aa 6e-33
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
cd RING 42aa 6e-06
in ref transcript
RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
smart RING 38aa 2e-07
in ref transcript
Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
pfam WD40 40aa 0.003
in ref transcript
WD domain, G-beta repeat.
COG COG2319 284aa 8e-17
in ref transcript
FOG: WD40 repeat [General function prediction only].
COG PEX10 41aa 4e-04
in ref transcript
RING-finger-containing E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones].
RFWD2
refseq_RFWD2.F2 refseq_RFWD2.R2 321 381
NCBIGene 36.3 64326
Single exon skipping, size difference: 60
Exclusion in the protein (no frameshift)
Reference transcript: NM_022457
cd WD40 307aa 6e-33
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
cd RING 42aa 6e-06
in ref transcript
RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
smart RING 38aa 2e-07
in ref transcript
Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
pfam WD40 40aa 0.003
in ref transcript
WD domain, G-beta repeat.
COG COG2319 284aa 8e-17
in ref transcript
FOG: WD40 repeat [General function prediction only].
COG PEX10 41aa 4e-04
in ref transcript
RING-finger-containing E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones].
RFX2
refseq_RFX2.F1 refseq_RFX2.R1 282 357
NCBIGene 36.3 5990
Single exon skipping, size difference: 75
Exclusion in the protein (no frameshift)
Reference transcript: NM_000635
pfam RFX1_trans_act 156aa 1e-28
in ref transcript
RFX1 transcription activation region. The RFX family is a family of winged-helix DNA binding proteins. RFX1 is a regulatory factor essential for expression of MHC class II genes. This region is to found N terminal to the RFX DNA binding region (pfam02257) in some mammalian RFX proteins, and is thought to activate transcription when associated with DNA. Deletion analysis has identified the region 233-351 in human RFX1 as being required for maximal activation.
Changed!
pfam RFX_DNA_binding 65aa 1e-20
in ref transcript
RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer.
Changed!
pfam RFX_DNA_binding 83aa 2e-23
in modified transcript
RFXANK
refseq_RFXANK.F1 refseq_RFXANK.R1 126 192
NCBIGene 36.3 8625
Single exon skipping, size difference: 66
Exclusion in the protein (no frameshift)
Reference transcript: NM_003721
cd ANK 125aa 3e-24
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
pfam Ank 28aa 6e-04
in ref transcript
Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
pfam Ank 29aa 8e-04
in ref transcript
Changed!
COG Arp 192aa 7e-13
in ref transcript
FOG: Ankyrin repeat [General function prediction only].
Changed!
COG Arp 145aa 2e-12
in modified transcript
RFXANK
refseq_RFXANK.F4 refseq_RFXANK.R3 175 244
NCBIGene 36.3 8625
Single exon skipping, size difference: 66
Exclusion in the protein (no frameshift)
Reference transcript: NM_003721
cd ANK 125aa 3e-24
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
pfam Ank 28aa 6e-04
in ref transcript
Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
pfam Ank 29aa 8e-04
in ref transcript
Changed!
COG Arp 192aa 7e-13
in ref transcript
FOG: Ankyrin repeat [General function prediction only].
Changed!
COG Arp 145aa 2e-12
in modified transcript
RGMB
refseq_RGMB.F1 refseq_RGMB.R1 132 377
NCBIGene 36.3 285704
Alternative 3-prime, size difference: 245
Inclusion in 5'UTR
Reference transcript: NM_001012761
pfam RGM_C 214aa 5e-97
in ref transcript
Repulsive guidance molecule (RGM) C-terminus. This family consists of several mammalian and one bird sequence from Gallus gallus (Chicken). This family represents the C-terminal region of several sequences but in others it represents the full protein. All of the mammalian proteins are hypothetical and have no known function but the member from chicken is annotated as being a repulsive guidance molecule (RGM). RGM is a GPI-linked axon guidance molecule of the retinotectal system. RGM is repulsive for a subset of axons, those from the temporal half of the retina. Temporal retinal axons invade the anterior optic tectum in a superficial layer, and encounter RGM expressed in a gradient with increasing concentration along the anterior-posterior axis. Temporal axons are able to receive posterior-dependent information by sensing gradients or concentrations of guidance cues. Thus, RGM is likely to provide positional information for temporal axons invading the optic tectum in the stratum opticum.
pfam RGM_N 176aa 5e-50
in ref transcript
Repulsive guidance molecule (RGM) N-terminus. This family consists of the N-terminal region of several mammalian and one bird sequence from Gallus gallus (Chicken). All of the mammalian proteins are hypothetical and have no known function but the member from chicken is annotated as being a repulsive guidance molecule (RGM). RGM is a GPI-linked axon guidance molecule of the retinotectal system. RGM is repulsive for a subset of axons, those from the temporal half of the retina. Temporal retinal axons invade the anterior optic tectum in a superficial layer, and encounter RGM expressed in a gradient with increasing concentration along the anterior-posterior axis. Temporal axons are able to receive posterior-dependent information by sensing gradients or concentrations of guidance cues. Thus, RGM is likely to provide positional information for temporal axons invading the optic tectum in the stratum opticum.
RGNEF
refseq_RGNEF.F1 refseq_RGNEF.R1 228 360
NCBIGene 36.2 64283
Single exon skipping, size difference: 132
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: XM_932949
cd RhoGEF 193aa 4e-34
in ref transcript
Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases; Also called Dbl-homologous (DH) domain. It appears that PH domains invariably occur C-terminal to RhoGEF/DH domains.
cd C1 46aa 4e-06
in ref transcript
Protein kinase C conserved region 1 (C1) . Cysteine-rich zinc binding domain. Some members of this domain family bind phorbol esters and diacylglycerol, some are reported to bind RasGTP. May occur in tandem arrangement. Diacylglycerol (DAG) is a second messenger, released by activation of Phospholipase D. Phorbol Esters (PE) can act as analogues of DAG and mimic its downstream effects in, for example, tumor promotion. Protein Kinases C are activated by DAG/PE, this activation is mediated by their N-terminal conserved region (C1). DAG/PE binding may be phospholipid dependent. C1 domains may also mediate DAG/PE signals in chimaerins (a family of Rac GTPase activating proteins), RasGRPs (exchange factors for Ras/Rap1), and Munc13 isoforms (scaffolding proteins involved in exocytosis).
smart RhoGEF 190aa 1e-39
in ref transcript
Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that PH domains invariably occur C-terminal to RhoGEF/DH domains. Improved coverage.
smart C1 46aa 3e-06
in ref transcript
Protein kinase C conserved region 1 (C1) domains (Cysteine-rich domains). Some bind phorbol esters and diacylglycerol. Some bind RasGTP. Zinc-binding domains.
7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
Changed!
pfam 7tm_1 170aa 6e-10
in modified transcript
RGS13
refseq_RGS13.F1 refseq_RGS13.R1 117 157
NCBIGene 36.3 6003
Single exon skipping, size difference: 40
Exclusion in 5'UTR
Reference transcript: NM_002927
pfam RGS 116aa 2e-35
in ref transcript
Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits.
RHBDF2
refseq_RHBDF2.F1 refseq_RHBDF2.R1 192 279
NCBIGene 36.3 79651
Alternative 3-prime, size difference: 87
Exclusion in the protein (no frameshift)
Reference transcript: NM_024599
pfam Rhomboid 124aa 4e-24
in ref transcript
Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth factor Spitz, allowing it to activate the Drosophila EGF receptor. Analysis has shown that Rhomboid-1 is an intramembrane serine protease (EC:3.4.21.105). Parasite-encoded rhomboid enzymes are also important for invasion of host cells by Toxoplasma and the malaria parasite.
COG GlpG 133aa 8e-14
in ref transcript
Uncharacterized membrane protein (homolog of Drosophila rhomboid) [General function prediction only].
RHCE
refseq_RHCE.F1 refseq_RHCE.R1 179 259
NCBIGene 36.3 6006
Single exon skipping, size difference: 80
Exclusion in the protein causing a frameshift
Reference transcript: NM_020485
Changed!
pfam Ammonium_transp 360aa 2e-48
in ref transcript
Ammonium Transporter Family.
COG AmtB 104aa 1e-04
in ref transcript
Ammonia permease [Inorganic ion transport and metabolism].
Changed!
pfam Ammonium_transp 329aa 9e-49
in modified transcript
RHCE
refseq_RHCE.F3 refseq_RHCE.R3 147 281
NCBIGene 36.3 6006
Single exon skipping, size difference: 134
Exclusion in the protein causing a frameshift
Reference transcript: NM_020485
Changed!
pfam Ammonium_transp 360aa 2e-48
in ref transcript
Ammonium Transporter Family.
Changed!
COG AmtB 104aa 1e-04
in ref transcript
Ammonia permease [Inorganic ion transport and metabolism].
Changed!
pfam Ammonium_transp 293aa 1e-48
in modified transcript
Changed!
COG AmtB 108aa 2e-05
in modified transcript
RHCE
refseq_RHCE.F5 refseq_RHCE.R5 105 420
NCBIGene 36.3 6006
Multiple exon skipping, size difference: 315
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_020485
Changed!
pfam Ammonium_transp 360aa 2e-48
in ref transcript
Ammonium Transporter Family.
Changed!
COG AmtB 104aa 1e-04
in ref transcript
Ammonia permease [Inorganic ion transport and metabolism].
Changed!
pfam Ammonium_transp 150aa 4e-13
in modified transcript
Changed!
pfam Ammonium_transp 110aa 4e-07
in modified transcript
RHOT1
refseq_RHOT1.F1 refseq_RHOT1.R1 111 207
NCBIGene 36.3 55288
Single exon skipping, size difference: 96
Exclusion in the protein (no frameshift)
Reference transcript: NM_001033568
cd Miro1 165aa 8e-74
in ref transcript
Miro1 subfamily. Miro (mitochondrial Rho) proteins have tandem GTP-binding domains separated by a linker region containing putative calcium-binding EF hand motifs. Genes encoding Miro-like proteins were found in several eukaryotic organisms. This CD represents the N-terminal GTPase domain of Miro proteins. These atypical Rho GTPases have roles in mitochondrial homeostasis and apoptosis. Most Rho proteins contain a lipid modification site at the C-terminus; however, Miro is one of few Rho subfamilies that lack this feature.
Changed!
cd Miro2 166aa 8e-71
in ref transcript
Miro2 subfamily. Miro (mitochondrial Rho) proteins have tandem GTP-binding domains separated by a linker region containing putative calcium-binding EF hand motifs. Genes encoding Miro-like proteins were found in several eukaryotic organisms. This CD represents the putative GTPase domain in the C terminus of Miro proteins. These atypical Rho GTPases have roles in mitochondrial homeostasis and apoptosis. Most Rho proteins contain a lipid modification site at the C-terminus; however, Miro is one of few Rho subfamilies that lack this feature.
pfam EF_assoc_2 89aa 2e-38
in ref transcript
EF hand associated. This region predominantly appears near EF-hands (pfam00036) in GTP-binding proteins. It is found in all three eukaryotic kingdoms.
pfam EF_assoc_1 75aa 1e-23
in ref transcript
EF hand associated. This region typically appears on the C-terminus of EF hands in GTP-binding proteins such as Arht/Rhot (may be involved in mitochondrial homeostasis and apoptosis). The EF hand associated region is found in yeast, vertebrates and plants.
smart RHO 163aa 5e-15
in ref transcript
Rho (Ras homology) subfamily of Ras-like small GTPases. Members of this subfamily of Ras-like small GTPases include Cdc42 and Rac, as well as Rho isoforms.
pfam Miro 111aa 2e-11
in ref transcript
Miro-like protein. Mitochondrial Rho proteins (Miro-1 and Miro-2), are atypical Rho GTPases. They have a unique domain organisation, with tandem GTP-binding domains and two EF hand domains (pfam00036), that may bind calcium. They are also larger than classical small GTPases. It has been proposed that they are involved in mitochondrial homeostasis and apoptosis.
COG COG1100 173aa 5e-15
in ref transcript
GTPase SAR1 and related small G proteins [General function prediction only].
Changed!
cd Miro2 167aa 1e-71
in modified transcript
RHOT1
refseq_RHOT1.F2 refseq_RHOT1.R2 276 400
NCBIGene 36.2 55288
Single exon skipping, size difference: 124
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001033568
Changed!
cd Miro1 165aa 8e-74
in ref transcript
Miro1 subfamily. Miro (mitochondrial Rho) proteins have tandem GTP-binding domains separated by a linker region containing putative calcium-binding EF hand motifs. Genes encoding Miro-like proteins were found in several eukaryotic organisms. This CD represents the N-terminal GTPase domain of Miro proteins. These atypical Rho GTPases have roles in mitochondrial homeostasis and apoptosis. Most Rho proteins contain a lipid modification site at the C-terminus; however, Miro is one of few Rho subfamilies that lack this feature.
Changed!
cd Miro2 166aa 8e-71
in ref transcript
Miro2 subfamily. Miro (mitochondrial Rho) proteins have tandem GTP-binding domains separated by a linker region containing putative calcium-binding EF hand motifs. Genes encoding Miro-like proteins were found in several eukaryotic organisms. This CD represents the putative GTPase domain in the C terminus of Miro proteins. These atypical Rho GTPases have roles in mitochondrial homeostasis and apoptosis. Most Rho proteins contain a lipid modification site at the C-terminus; however, Miro is one of few Rho subfamilies that lack this feature.
Changed!
pfam EF_assoc_2 89aa 2e-38
in ref transcript
EF hand associated. This region predominantly appears near EF-hands (pfam00036) in GTP-binding proteins. It is found in all three eukaryotic kingdoms.
Changed!
pfam EF_assoc_1 75aa 1e-23
in ref transcript
EF hand associated. This region typically appears on the C-terminus of EF hands in GTP-binding proteins such as Arht/Rhot (may be involved in mitochondrial homeostasis and apoptosis). The EF hand associated region is found in yeast, vertebrates and plants.
Changed!
smart RHO 163aa 5e-15
in ref transcript
Rho (Ras homology) subfamily of Ras-like small GTPases. Members of this subfamily of Ras-like small GTPases include Cdc42 and Rac, as well as Rho isoforms.
Changed!
pfam Miro 111aa 2e-11
in ref transcript
Miro-like protein. Mitochondrial Rho proteins (Miro-1 and Miro-2), are atypical Rho GTPases. They have a unique domain organisation, with tandem GTP-binding domains and two EF hand domains (pfam00036), that may bind calcium. They are also larger than classical small GTPases. It has been proposed that they are involved in mitochondrial homeostasis and apoptosis.
Changed!
COG COG1100 173aa 5e-15
in ref transcript
GTPase SAR1 and related small G proteins [General function prediction only].
Changed!
cd Miro1 29aa 6e-08
in modified transcript
Changed!
pfam Miro 27aa 0.007
in modified transcript
RIPK5
refseq_RIPK5.F1 refseq_RIPK5.R1 114 249
NCBIGene 36.3 25778
Single exon skipping, size difference: 135
Exclusion in the protein (no frameshift)
Reference transcript: NM_015375
Changed!
cd S_TKc 245aa 2e-37
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
Changed!
pfam Pkinase 245aa 2e-38
in ref transcript
Protein kinase domain.
Changed!
COG SPS1 245aa 1e-20
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
Changed!
cd S_TKc 200aa 2e-27
in modified transcript
Changed!
pfam Pkinase 168aa 2e-27
in modified transcript
Changed!
COG SPS1 176aa 2e-15
in modified transcript
RNASE4
refseq_RNASE4.F2 refseq_RNASE4.R2 278 382
NCBIGene 36.2 6038
Single exon skipping, size difference: 104
Exclusion of the stop codon
Reference transcript: NM_194430
RNF12
refseq_RNF12.F2 refseq_RNF12.R2 231 291
NCBIGene 36.3 51132
Single exon skipping, size difference: 60
Exclusion in 5'UTR
Reference transcript: NM_183353
cd RING 46aa 1e-10
in ref transcript
RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
smart RING 41aa 6e-09
in ref transcript
Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
COG COG5540 75aa 2e-09
in ref transcript
RING-finger-containing ubiquitin ligase [Posttranslational modification, protein turnover, chaperones].
RNF121
refseq_RNF121.F1 refseq_RNF121.R1 105 368
NCBIGene 36.3 55298
Multiple exon skipping, size difference: 263
Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
Reference transcript: NM_018320
Changed!
cd RING 54aa 7e-04
in ref transcript
RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
Changed!
smart RING 50aa 0.001
in ref transcript
Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
Changed!
COG HRD1 58aa 7e-06
in ref transcript
HRD ubiquitin ligase complex, ER membrane component [Posttranslational modification, protein turnover, chaperones].
RNF121
refseq_RNF121.F3 refseq_RNF121.R3 161 199
NCBIGene 36.3 55298
Single exon skipping, size difference: 38
Exclusion in the protein causing a frameshift
Reference transcript: NM_018320
Changed!
cd RING 54aa 7e-04
in ref transcript
RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
Changed!
smart RING 50aa 0.001
in ref transcript
Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
Changed!
COG HRD1 58aa 7e-06
in ref transcript
HRD ubiquitin ligase complex, ER membrane component [Posttranslational modification, protein turnover, chaperones].
RNF13
refseq_RNF13.F2 refseq_RNF13.R2 184 286
NCBIGene 36.3 11342
Single exon skipping, size difference: 102
Exclusion in 5'UTR
Reference transcript: NM_007282
cd PA_C_RZF_like 158aa 8e-50
in ref transcript
PA_C-RZF_ like: Protease-associated (PA) domain C_RZF-like. This group includes various PA domain-containing proteins similar to C-RZF (chicken embryo RING zinc finger) protein. These proteins contain a C3H2C3 RING finger. C-RZF is expressed in embryo cells and is restricted mainly to brain and heart, it is localized to both the nucleus and endosomes. Additional C3H2C3 RING finger proteins belonging to this group, include Arabidopsis ReMembR-H2 protein and mouse sperizin. ReMembR-H2 is likely to be an integral membrane protein, and to traffic through the endosomal pathway. Sperizin is expressed in haploid germ cells and localized in the cytoplasm, it may participate in spermatogenesis. The significance of the PA domain to these proteins has not been ascertained. It may be a protein-protein interaction domain. At peptidase active sites, the PA domain may participate in substrate binding and/or promoting conformational changes, which influence the stability and accessibility of the site to substrate.
cd RING 46aa 1e-10
in ref transcript
RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
pfam PA 92aa 2e-11
in ref transcript
PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family. It has been suggested that this domain forms a lid-like structure that covers the active site in active proteases, and is involved in protein recognition in vacuolar sorting receptors.
smart RING 42aa 7e-09
in ref transcript
Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
COG COG5540 253aa 1e-11
in ref transcript
RING-finger-containing ubiquitin ligase [Posttranslational modification, protein turnover, chaperones].
RNF135
refseq_RNF135.F1 refseq_RNF135.R1 120 283
NCBIGene 36.3 84282
Single exon skipping, size difference: 163
Exclusion in the protein causing a frameshift
Reference transcript: NM_032322
cd RING 44aa 5e-06
in ref transcript
RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
Changed!
smart SPRY 109aa 6e-15
in ref transcript
Domain in SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
Changed!
smart RING 42aa 1e-05
in ref transcript
Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
Changed!
smart PRY 51aa 0.006
in ref transcript
associated with SPRY domains.
Changed!
COG PEX10 49aa 4e-04
in ref transcript
RING-finger-containing E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones].
Changed!
TIGR rad18 73aa 8e-06
in modified transcript
This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Changed!
COG RAD18 69aa 1e-04
in modified transcript
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_016271
Changed!
cd RING 44aa 2e-06
in ref transcript
RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
Changed!
smart RING 40aa 9e-05
in ref transcript
Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
Changed!
COG COG5222 40aa 2e-05
in ref transcript
Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only].
RNF14
refseq_RNF14.F1 refseq_RNF14.R1 176 350
NCBIGene 36.3 9604
Single exon skipping, size difference: 174
Exclusion in 5'UTR
Reference transcript: NM_004290
pfam RWD 130aa 3e-14
in ref transcript
RWD domain. This domain was identified in WD40 repeat proteins and Ring finger domain proteins. The function of this domain is unknown. GCN2 is the alpha-subunit of the only translation initiation factor (eIF2 alpha) kinase that appears in all eukaryotes. Its function requires an interaction with GCN1 via the domain at its N-terminus, which is termed the RWD domain after three major RWD-containing proteins: RING finger-containing proteins, WD-repeat-containing proteins, and yeast DEAD (DEXD)-like helicases. The structure forms an alpha + beta sandwich fold consisting of two layers: a four-stranded antiparallel beta-sheet, and three side-by-side alpha-helices.
pfam IBR 62aa 4e-12
in ref transcript
IBR domain. The IBR (In Between Ring fingers) domain is often found to occur between pairs of ring fingers (pfam00097). This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain. Proteins that contain two Ring fingers and an IBR domain (these proteins are also termed RBR family proteins) are thought to exist in all eukaryotic organisms. RBR family members play roles in protein quality control and can indirectly regulate transcription. Evidence suggests that RBR proteins are often parts of cullin-containing ubiquitin ligase complexes. The ubiquitin ligase Parkin is an RBR family protein whose mutations are involved in forms of familial Parkinson's disease.
RNF14
refseq_RNF14.F4 refseq_RNF14.R4 217 369
NCBIGene 36.3 9604
Single exon skipping, size difference: 152
Exclusion in the protein causing a frameshift
Reference transcript: NM_183399
Changed!
pfam RWD 130aa 3e-14
in ref transcript
RWD domain. This domain was identified in WD40 repeat proteins and Ring finger domain proteins. The function of this domain is unknown. GCN2 is the alpha-subunit of the only translation initiation factor (eIF2 alpha) kinase that appears in all eukaryotes. Its function requires an interaction with GCN1 via the domain at its N-terminus, which is termed the RWD domain after three major RWD-containing proteins: RING finger-containing proteins, WD-repeat-containing proteins, and yeast DEAD (DEXD)-like helicases. The structure forms an alpha + beta sandwich fold consisting of two layers: a four-stranded antiparallel beta-sheet, and three side-by-side alpha-helices.
Changed!
pfam IBR 62aa 4e-12
in ref transcript
IBR domain. The IBR (In Between Ring fingers) domain is often found to occur between pairs of ring fingers (pfam00097). This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain. Proteins that contain two Ring fingers and an IBR domain (these proteins are also termed RBR family proteins) are thought to exist in all eukaryotic organisms. RBR family members play roles in protein quality control and can indirectly regulate transcription. Evidence suggests that RBR proteins are often parts of cullin-containing ubiquitin ligase complexes. The ubiquitin ligase Parkin is an RBR family protein whose mutations are involved in forms of familial Parkinson's disease.
Changed!
pfam RWD 45aa 0.002
in modified transcript
RNF34
refseq_RNF34.F2 refseq_RNF34.R2 135 201
NCBIGene 36.3 80196
Single exon skipping, size difference: 66
Exclusion of the protein initiation site
Reference transcript: NM_194271
RNF38
refseq_RNF38.F2 refseq_RNF38.R2 227 377
NCBIGene 36.3 152006
Single exon skipping, size difference: 150
Exclusion in the protein (no frameshift)
Reference transcript: NM_022781
cd RING 44aa 6e-07
in ref transcript
RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
smart RING 41aa 3e-06
in ref transcript
Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
COG COG5540 42aa 7e-10
in ref transcript
RING-finger-containing ubiquitin ligase [Posttranslational modification, protein turnover, chaperones].
RNF38
refseq_RNF38.F4 refseq_RNF38.R3 252 402
NCBIGene 36.3 152006
Single exon skipping, size difference: 150
Exclusion in 5'UTR
Reference transcript: NM_194328
cd RING 44aa 8e-07
in ref transcript
RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
smart RING 41aa 4e-06
in ref transcript
Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
COG COG5540 42aa 5e-10
in ref transcript
RING-finger-containing ubiquitin ligase [Posttranslational modification, protein turnover, chaperones].
RNF41
refseq_RNF41.F2 refseq_RNF41.R2 221 364
NCBIGene 36.3 10193
Single exon skipping, size difference: 143
Inclusion in the protein causing a new stop codon
Reference transcript: NM_194359
Changed!
cd RING 43aa 7e-06
in ref transcript
RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
Changed!
pfam USP8_interact 179aa 8e-86
in ref transcript
USP8 interacting. This domain interacts with the UBP deubiquitinating enzyme USP8.
Changed!
smart RING 38aa 4e-05
in ref transcript
Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
Changed!
COG COG5219 44aa 0.002
in ref transcript
Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only].
RNF41
refseq_RNF41.F3 refseq_RNF41.R3 189 332
NCBIGene 36.3 10193
Alternative 5-prime, size difference: 143
Exclusion in 5'UTR
Reference transcript: NM_005785
cd RING 43aa 7e-06
in ref transcript
RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
pfam USP8_interact 179aa 8e-86
in ref transcript
USP8 interacting. This domain interacts with the UBP deubiquitinating enzyme USP8.
smart RING 38aa 4e-05
in ref transcript
Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
COG COG5219 44aa 0.002
in ref transcript
Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only].
RNF8
refseq_RNF8.F2 refseq_RNF8.R2 190 395
NCBIGene 36.3 9025
Single exon skipping, size difference: 205
Exclusion in the protein causing a frameshift
Reference transcript: NM_003958
cd FHA 95aa 2e-11
in ref transcript
Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain. FHA domains may bind phosphothreonine, phosphoserine and sometimes phosphotyrosine. In eukaryotes, many FHA domain-containing proteins localize to the nucleus, where they participate in establishing or maintaining cell cycle checkpoints, DNA repair, or transcriptional regulation. Members of the FHA family include: Dun1, Rad53, Cds1, Mek1, KAPP(kinase-associated protein phosphatase),and Ki-67 (a human nuclear protein related to cell proliferation).
Changed!
cd RING 43aa 5e-08
in ref transcript
RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
pfam FHA 72aa 3e-12
in ref transcript
FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif.
Changed!
smart RING 38aa 7e-08
in ref transcript
Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
smart LRR_RI 28aa 0.009
in ref transcript
Leucine rich repeat, ribonuclease inhibitor type.
RNH1
refseq_RNH1.F3 refseq_RNH1.R1 163 231
NCBIGene 36.3 6050
Alternative 3-prime, size difference: 68
Inclusion in 5'UTR
Reference transcript: NM_203384
cd LRR_RI 320aa 6e-76
in ref transcript
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
smart LRR_RI 28aa 0.009
in ref transcript
Leucine rich repeat, ribonuclease inhibitor type.
RNH1
refseq_RNH1.F6 refseq_RNH1.R3 124 297
NCBIGene 36.3 6050
Single exon skipping, size difference: 173
Exclusion in 5'UTR
Reference transcript: NM_203387
cd LRR_RI 320aa 6e-76
in ref transcript
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
Changed!
pfam Ceramidase_alk 134aa 5e-45
in ref transcript
Neutral/alkaline non-lysosomal ceramidase. This family represents a group of neutral/alkaline ceramidases found in both bacteria and eukaryotes.
RPAIN
refseq_RPAIN.F1 refseq_RPAIN.R1 153 217
NCBIGene 36.2 84268
Single exon skipping, size difference: 64
Exclusion in the protein causing a frameshift
Reference transcript: NM_001033002
RPE
refseq_RPE.F1 refseq_RPE.R1 171 251
NCBIGene 36.3 6120
Single exon skipping, size difference: 80
Exclusion in the protein causing a frameshift
Reference transcript: NM_199229
Changed!
cd RPE 209aa 3e-84
in ref transcript
Ribulose-5-phosphate 3-epimerase (RPE). This enzyme catalyses the interconversion of D-ribulose 5-phosphate (Ru5P) into D-xylulose 5-phosphate, as part of the Calvin cycle (reductive pentose phosphate pathway) in chloroplasts and in the oxidative pentose phosphate pathway. In the Calvin cycle Ru5P is phosphorylated by phosphoribulose kinase to ribulose-1,5-bisphosphate, which in turn is used by RubisCO (ribulose-1,5-bisphosphate carboxylase/oxygenase) to incorporate CO2 as the central step in carbohydrate synthesis.
Changed!
pfam Ribul_P_3_epim 199aa 5e-62
in ref transcript
Ribulose-phosphate 3 epimerase family. This enzyme catalyses the conversion of D-ribulose 5-phosphate into D-xylulose 5-phosphate.
Changed!
PTZ PTZ00170 210aa 1e-82
in ref transcript
D-ribulose-5-phosphate 3-epimerase; Provisional.
Changed!
cd RPE 37aa 2e-12
in modified transcript
Changed!
pfam Ribul_P_3_epim 37aa 3e-09
in modified transcript
Changed!
PRK PRK05581 37aa 1e-11
in modified transcript
ribulose-phosphate 3-epimerase; Validated.
RPE
refseq_RPE.F3 refseq_RPE.R3 192 246
NCBIGene 36.3 6120
Single exon skipping, size difference: 54
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_199229
Changed!
cd RPE 209aa 3e-84
in ref transcript
Ribulose-5-phosphate 3-epimerase (RPE). This enzyme catalyses the interconversion of D-ribulose 5-phosphate (Ru5P) into D-xylulose 5-phosphate, as part of the Calvin cycle (reductive pentose phosphate pathway) in chloroplasts and in the oxidative pentose phosphate pathway. In the Calvin cycle Ru5P is phosphorylated by phosphoribulose kinase to ribulose-1,5-bisphosphate, which in turn is used by RubisCO (ribulose-1,5-bisphosphate carboxylase/oxygenase) to incorporate CO2 as the central step in carbohydrate synthesis.
Changed!
pfam Ribul_P_3_epim 199aa 5e-62
in ref transcript
Ribulose-phosphate 3 epimerase family. This enzyme catalyses the conversion of D-ribulose 5-phosphate into D-xylulose 5-phosphate.
Changed!
PTZ PTZ00170 210aa 1e-82
in ref transcript
D-ribulose-5-phosphate 3-epimerase; Provisional.
Changed!
cd RPE 227aa 7e-82
in modified transcript
Changed!
TIGR rpe 226aa 1e-59
in modified transcript
This family consists of Ribulose-phosphate 3-epimerase, also known as pentose-5-phosphate 3-epimerase (PPE). PPE converts D-ribulose 5-phosphate into D-xylulose 5-phosphate in Calvin's reductive pentose phosphate cycle. It has been found in a wide range of bacteria, archebacteria, fungi and plants.
Changed!
PTZ PTZ00170 228aa 8e-80
in modified transcript
RPL17
refseq_RPL17.F1 refseq_RPL17.R1 111 360
NCBIGene 36.3 6139
Alternative 3-prime, size difference: 249
Inclusion in 5'UTR
Reference transcript: NM_001035006
cd Ribosomal_L22 136aa 1e-26
in ref transcript
Ribosomal protein L22/L17e. L22 (L17 in eukaryotes) is a core protein of the large ribosomal subunit. It is the only ribosomal protein that interacts with all six domains of 23S rRNA, and is one of the proteins important for directing the proper folding and stabilizing the conformation of 23S rRNA. L22 is the largest protein contributor to the surface of the polypeptide exit channel, the tunnel through which the polypeptide product passes. L22 is also one of six proteins located at the putative translocon binding site on the exterior surface of the ribosome.
TIGR L22_arch 150aa 6e-67
in ref transcript
This model describes the ribosomal protein of the eukaryotic cytosol and of the Archaea, variously designated as L17, L22, and L23. The corresponding bacterial homolog, described by a separate model, is designated L22.
PTZ PTZ00178 163aa 2e-73
in ref transcript
60S ribosomal protein L22/L17e; Provisional.
RPL3
refseq_RPL3.F1 refseq_RPL3.R1 186 333
NCBIGene 36.3 6122
Alternative 5-prime and 3-prime, size difference: 147
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_000967
Changed!
TIGR L3_arch 367aa 3e-73
in ref transcript
This model describes exclusively the archaeal class of ribosomal protein L3. A separate model (TIGR03625) describes the bacterial/organelle form, and both belong to pfam00297. Eukaryotic proteins are excluded from this model.
Changed!
PTZ PTZ00103 401aa 0.0
in ref transcript
60S ribosomal protein L3; Provisional.
Changed!
TIGR L3_arch 318aa 1e-66
in modified transcript
Changed!
PTZ PTZ00103 352aa 1e-165
in modified transcript
RPL32
refseq_RPL32.F1 refseq_RPL32.R1 156 255
NCBIGene 36.3 6161
Alternative 3-prime, size difference: 99
Inclusion in 5'UTR
Reference transcript: NM_000994
cd Ribosomal_L32_L32e 105aa 8e-43
in ref transcript
Ribosomal_L32_L32e: L32 is a protein from the large subunit that contains a surface-exposed globular domain and a finger-like projection that extends into the RNA core to stabilize the tertiary structure. L32 does not appear to play a role in forming the A (aminacyl), P (peptidyl) or E (exit) sites of the ribosome, but does interact with 23S rRNA, which has a "kink-turn" secondary structure motif. L32 is overexpressed in human prostate cancer and has been identified as a stably expressed housekeeping gene in macrophages of human chronic obstructive pulmonary disease (COPD) patients. In Schizosaccharomyces pombe, L32 has also been suggested to play a role as a transcriptional regulator in the nucleus. Found in archaea and eukaryotes, this protein is known as L32 in eukaryotes and L32e in archaea.
pfam Ribosomal_L32e 106aa 3e-42
in ref transcript
Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria.
PTZ PTZ00159 133aa 7e-47
in ref transcript
60S ribosomal protein L32; Provisional.
RPL38
refseq_RPL38.F1 refseq_RPL38.R1 133 168
NCBIGene 36.3 6169
Alternative 5-prime, size difference: 35
Exclusion in 5'UTR
Reference transcript: NM_000999
pfam Ribosomal_L38e 68aa 3e-14
in ref transcript
Ribosomal L38e protein family.
PTZ PTZ00181 62aa 1e-05
in ref transcript
60S ribosomal protein L38; Provisional.
RPL6
refseq_RPL6.F1 refseq_RPL6.R1 219 395
NCBIGene 36.3 6128
Alternative 5-prime, size difference: 176
Exclusion in 5'UTR
Reference transcript: NM_001024662
pfam Ribosomal_L6e 108aa 6e-39
in ref transcript
Ribosomal protein L6e.
pfam Ribosomal_L6e_N 32aa 3e-07
in ref transcript
Ribosomal protein L6, N-terminal domain.
COG RPL14A 109aa 1e-06
in ref transcript
Ribosomal protein L14E/L6E/L27E [Translation, ribosomal structure and biogenesis].
RPLP0
refseq_RPLP0.F2 refseq_RPLP0.R2 150 210
NCBIGene 36.3 6175
Alternative 3-prime, size difference: 60
Inclusion in 5'UTR
Reference transcript: NM_001002
cd Ribosomal_P0_L10e 175aa 1e-69
in ref transcript
Ribosomal protein L10 family, P0 and L10e subfamily; composed of eukaryotic 60S ribosomal protein P0 and the archaeal P0 homolog, L10e. P0 or L10e forms a tight complex with multiple copies of the small acidic protein L12(e). This complex forms a stalk structure on the large subunit of the ribosome. The stalk is known to contain the binding site for elongation factors G and Tu (EF-G and EF-Tu, respectively); however, there is disagreement as to whether or not L10 is involved in forming the binding site. The stalk is believed to be associated with GTPase activities in protein synthesis. In a neuroblastoma cell line, L10 has been shown to interact with the SH3 domain of Src and to activate the binding of the Nck1 adaptor protein with skeletal proteins such as the Wiskott-Aldrich Syndrome Protein (WASP) and the WASP-interacting protein (WIP). These eukaryotic and archaeal P0 sequences have an additional C-terminal domain homologous with acidic proteins P1 and P2.
pfam Ribosomal_L10 103aa 2e-32
in ref transcript
Ribosomal protein L10.
PTZ PTZ00135 269aa 4e-82
in ref transcript
ribosomal phosphoprotein P0; Provisional.
RPLP1
refseq_RPLP1.F1 refseq_RPLP1.R1 189 264
NCBIGene 36.3 6176
Single exon skipping, size difference: 75
Exclusion in the protein (no frameshift)
Reference transcript: NM_001003
Changed!
cd Ribosomal_P1 108aa 7e-24
in ref transcript
Ribosomal protein P1. This subfamily represents the eukaryotic large ribosomal protein P1. Eukaryotic P1 and P2 are functionally equivalent to the bacterial protein L7/L12, but are not homologous to L7/L12. P1 is located in the L12 stalk, with proteins P2, P0, L11, and 28S rRNA. P1 and P2 are the only proteins in the ribosome to occur as multimers, always appearing as sets of heterodimers. Recent data indicate that eukaryotes have four copies (two heterodimers), while most archaeal species contain six copies of L12p (three homodimers) and bacteria may have four or six copies (two or three homodimers), depending on the species. Experiments using S. cerevisiae P1 and P2 indicate that P1 proteins are positioned more internally with limited reactivity in the C-terminal domains, while P2 proteins seem to be more externally located and are more likely to interact with other cellular components. In lower eukaryotes, P1 and P2 are further subdivided into P1A, P1B, P2A, and P2B, which form P1A/P2B and P1B/P2A heterodimers. Some plant species have a third P-protein, called P3, which is not homologous to P1 and P2. In humans, P1 and P2 are strongly autoimmunogenic. They play a significant role in the etiology and pathogenesis of systemic lupus erythema (SLE). In addition, the ribosome-inactivating protein trichosanthin (TCS) interacts with human P0, P1, and P2, with its primary binding site located in the C-terminal region of P2. TCS inactivates the ribosome by depurinating a specific adenine in the sarcin-ricin loop of 28S rRNA.
Changed!
pfam Ribosomal_60s 92aa 8e-07
in ref transcript
60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2.
Changed!
COG RPP1A 53aa 2e-05
in ref transcript
Ribosomal protein L12E/L44/L45/RPP1/RPP2 [Translation, ribosomal structure and biogenesis].
Changed!
cd Ribosomal_P1 83aa 4e-07
in modified transcript
RPS6KA4
refseq_RPS6KA4.F1 refseq_RPS6KA4.R1 100 118
NCBIGene 36.3 8986
Alternative 3-prime, size difference: 18
Exclusion in the protein (no frameshift)
Reference transcript: NM_003942
cd STKc_MSK2_N 333aa 1e-178
in ref transcript
STKc_MSK2_N: Serine/Threonine Kinases (STKs), Mitogen and stress-activated kinase (MSK) subfamily, MSK2, N-terminal catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The MSK subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. MSKs contain an N-terminal kinase domain (NTD) from the AGC family and a C-terminal kinase domain (CTD) from the CAMK family, similar to 90 kDa ribosomal protein S6 kinases (RSKs). MSKs are activated by two major signaling cascades, the Ras-MAPK and p38 stress kinase pathways, which trigger phosphorylation in the activation loop (A-loop) of the CTD of MSK. The active CTD phosphorylates the hydrophobic motif (HM) of NTD, which facilitates the phosphorylation of the A-loop and activates the NTD, which in turn phosphorylates downstream targets. MSK2 and MSK1 play nonredundant roles in activating histone H3 kinases, which play pivotal roles in compaction of the chromatin fiber. MSK2 is the required H3 kinase in response to stress stimuli and activation of the p38 MAPK pathway. MSK2 also plays a role in the pathogenesis of psoriasis.
cd S_TKc 258aa 2e-57
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
smart S_TKc 259aa 5e-77
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
smart S_TKc 254aa 3e-62
in ref transcript
smart S_TK_X 61aa 2e-08
in ref transcript
Extension to Ser/Thr-type protein kinases.
PTZ PTZ00263 300aa 2e-74
in ref transcript
protein kinase A catalytic subunit; Provisional.
PTZ PTZ00263 255aa 1e-30
in ref transcript
RPS6KB2
refseq_RPS6KB2.F2 refseq_RPS6KB2.R2 122 144
NCBIGene 36.2 6199
Alternative 5-prime, size difference: 22
Exclusion in the protein causing a frameshift
Reference transcript: NM_003952
Changed!
cd STKc_p70S6K 323aa 0.0
in ref transcript
STKc_p70S6K: Serine/Threonine Kinases (STKs), 70 kDa ribosomal protein S6 kinase (p70S6K) subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The p70S6K subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. p70S6K (or S6K) contains only one catalytic kinase domain, unlike p90 ribosomal S6 kinases (RSKs). It acts as a downstream effector of the STK mTOR (mammalian Target of Rapamycin) and plays a role in the regulation of the translation machinery during protein synthesis. p70S6K also plays a pivotal role in regulating cell size and glucose homeostasis. Its targets include S6, the translation initiation factor eIF3, and the insulin receptor substrate IRS-1, among others. Mammals contain two isoforms of p70S6K, named S6K1 and S6K2 (or S6K-beta).
Changed!
smart S_TKc 252aa 2e-84
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
Changed!
smart S_TK_X 63aa 2e-15
in ref transcript
Extension to Ser/Thr-type protein kinases.
Changed!
PTZ PTZ00263 321aa 3e-81
in ref transcript
protein kinase A catalytic subunit; Provisional.
Changed!
cd STKc_p70S6K 190aa 1e-109
in modified transcript
Changed!
smart S_TKc 183aa 1e-65
in modified transcript
Changed!
PTZ PTZ00263 193aa 8e-54
in modified transcript
RRAGB
refseq_RRAGB.F1 refseq_RRAGB.R1 110 194
NCBIGene 36.3 10325
Single exon skipping, size difference: 84
Exclusion in the protein (no frameshift)
Reference transcript: NM_016656
Changed!
cd Ras_like_GTPase 180aa 8e-09
in ref transcript
Ras-like GTPase superfamily. The Ras-like superfamily of small GTPases consists of several families with an extremely high degree of structural and functional similarity. The Ras superfamily is divided into at least four families in eukaryotes: the Ras, Rho, Rab, and Sar1/Arf families. This superfamily also includes proteins like the GTP translation factors, Era-like GTPases, and G-alpha chain of the heterotrimeric G proteins. Members of the Ras superfamily regulate a wide variety of cellular functions: the Ras family regulates gene expression, the Rho family regulates cytoskeletal reorganization and gene expression, the Rab and Sar1/Arf families regulate vesicle trafficking, and the Ran family regulates nucleocytoplasmic transport and microtubule organization. The GTP translation factor family regulate initiation, elongation, termination, and release in translation, and the Era-like GTPase family regulates cell division, sporulation, and DNA replication. Members of the Ras superfamily are identified by the GTP binding site, which is made up of five characteristic sequence motifs, and the switch I and switch II regions.
pfam Gtr1_RagA 199aa 6e-74
in ref transcript
Gtr1/RagA G protein conserved region. GTR1 was first identified in Saccharomyces cerevisiae as a suppressor of a mutation in RCC1. Biochemical analysis revealed that Gtr1 is in fact a G protein of the Ras family. The RagA/B proteins are the human homologues of Gtr1. Included in this family is the human Rag C, a novel protein that has been shown to interact with RagA/B.
Changed!
pfam Ras 178aa 2e-05
in ref transcript
Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. As regards Rab GTPases, these are important regulators of vesicle formation, motility and fusion. They share a fold in common with all Ras GTPases: this is a six-stranded beta-sheet surrounded by five alpha-helices.
Changed!
cd Ras_like_GTPase 152aa 6e-11
in modified transcript
Changed!
pfam Ras 150aa 7e-07
in modified transcript
Changed!
COG COG1100 145aa 2e-06
in modified transcript
GTPase SAR1 and related small G proteins [General function prediction only].
RREB1
refseq_RREB1.F1 refseq_RREB1.R1 161 326
NCBIGene 36.3 6239
Single exon skipping, size difference: 165
Exclusion in the protein (no frameshift)
Reference transcript: NM_001003699
RSPO4
refseq_RSPO4.F1 refseq_RSPO4.R1 314 500
NCBIGene 36.3 343637
Single exon skipping, size difference: 186
Exclusion in the protein (no frameshift)
Reference transcript: NM_001029871
Changed!
smart FU 41aa 0.006
in modified transcript
Furin-like repeats.
RSU1
refseq_RSU1.F2 refseq_RSU1.R2 133 245
NCBIGene 36.3 6251
Single exon skipping, size difference: 112
Exclusion of the protein initiation site
Reference transcript: NM_012425
Changed!
cd LRR_RI 154aa 0.002
in ref transcript
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
Changed!
COG COG4886 181aa 2e-09
in ref transcript
Leucine-rich repeat (LRR) protein [Function unknown].
Changed!
cd LRR_RI 114aa 3e-04
in modified transcript
Changed!
COG COG4886 147aa 2e-10
in modified transcript
RTEL1
refseq_RTEL1.F1 refseq_RTEL1.R1 231 401
NCBIGene 36.3 51750
Alternative 5-prime, size difference: 170
Exclusion in the protein causing a frameshift
Reference transcript: NM_032957
pfam DEAD_2 162aa 1e-61
in ref transcript
DEAD_2. This represents a conserved region within a number of RAD3-like DNA-binding helicases that are seemingly ubiquitous - members include proteins of eukaryotic, bacterial and archaeal origin. RAD3 is involved in nucleotide excision repair, and forms part of the transcription factor TFIIH in yeast.
TIGR rad3 292aa 4e-60
in ref transcript
All proteins in this family for which funcitons are known are DNA-DNA helicases that funciton in the initiation of transcription and nucleotide excision repair as part of the TFIIH complex. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
COG DinG 737aa 1e-53
in ref transcript
Rad3-related DNA helicases [Transcription / DNA replication, recombination, and repair].
RTN3
refseq_RTN3.F2 refseq_RTN3.R2 308 367
NCBIGene 36.3 10313
Single exon skipping, size difference: 59
Exclusion in the protein causing a frameshift
Reference transcript: NM_201428
Changed!
pfam Reticulon 171aa 8e-41
in ref transcript
Reticulon. Reticulon, also know as neuroendocrine-specific protein (NSP), is a protein of unknown function which associates with the endoplasmic reticulum. This family represents the C-terminal domain of the three reticulon isoforms and their homologues.
Changed!
pfam Reticulon 157aa 4e-36
in modified transcript
SGSM1
refseq_RUTBC2.F2 refseq_RUTBC2.R2 100 283
NCBIGene 36.3 129049
Single exon skipping, size difference: 183
Exclusion in the protein (no frameshift)
Reference transcript: NM_001039948
smart TBC 145aa 2e-30
in ref transcript
Domain in Tre-2, BUB2p, and Cdc16p. Probable Rab-GAPs. Widespread domain present in Gyp6 and Gyp7, thereby giving rise to the notion that it performs a GTP-activator activity on Rab-like GTPases.
pfam RUN 146aa 5e-21
in ref transcript
RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signalling pathways. The domain is comprises six conserved regions, which in some proteins have considerable insertions between them. The domain core is thought to take up a predominantly alpha fold, with basic amino acids in regions A and D possibly playing a functional role in interactions with Ras GTPases.
COG COG5210 212aa 2e-18
in ref transcript
GTPase-activating protein [General function prediction only].
RWDD1
refseq_RWDD1.F1 refseq_RWDD1.R1 102 230
NCBIGene 36.3 51389
Single exon skipping, size difference: 128
Exclusion in 5'UTR
Reference transcript: NM_016104
SAA1
refseq_SAA1.F1 refseq_SAA1.R1 162 309
NCBIGene 36.3 6288
Alternative 5-prime, size difference: 147
Exclusion in 5'UTR
Reference transcript: NM_000331
pfam SAA 103aa 2e-47
in ref transcript
Serum amyloid A protein.
SAR1B
refseq_SAR1B.F1 refseq_SAR1B.R1 113 232
NCBIGene 36.3 51128
Single exon skipping, size difference: 119
Exclusion in 5'UTR
Reference transcript: NM_001033503
cd Sar1 195aa 3e-98
in ref transcript
Sar1 subfamily. Sar1 is an essential component of COPII vesicle coats involved in export of cargo from the ER. The GTPase activity of Sar1 functions as a molecular switch to control protein-protein and protein-lipid interactions that direct vesicle budding from the ER. Activation of the GDP to the GTP-bound form of Sar1 involves the membrane-associated guanine nucleotide exchange factor (GEF) Sec12. Sar1 is unlike all Ras superfamily GTPases that use either myristoyl or prenyl groups to direct membrane association and function, in that Sar1 lacks such modification. Instead, Sar1 contains a unique nine-amino-acid N-terminal extension. This extension contains an evolutionarily conserved cluster of bulky hydrophobic amino acids, referred to as the Sar1-N-terminal activation recruitment (STAR) motif. The STAR motif mediates the recruitment of Sar1 to ER membranes and facilitates its interaction with mammalian Sec12 GEF leading to activation.
smart SAR 193aa 5e-75
in ref transcript
Sar1p-like members of the Ras-family of small GTPases. Yeast SAR1 is an essential gene required for transport of secretory proteins from the endoplasmic reticulum to the Golgi apparatus.
PTZ PTZ00133 191aa 3e-23
in ref transcript
ADP-ribosylation factor; Provisional.
SC4MOL
refseq_SC4MOL.F1 refseq_SC4MOL.R1 123 409
NCBIGene 36.3 6307
Single exon skipping, size difference: 286
Exclusion of the protein initiation site
Reference transcript: NM_006745
pfam FA_hydroxylase 113aa 3e-18
in ref transcript
Fatty acid hydroxylase superfamily. This superfamily includes fatty acid and carotene hydroxylases and sterol desaturases. Beta-carotene hydroxylase is involved in zeaxanthin synthesis by hydroxylating beta-carotene, but the enzyme may be involved in other pathways. This family includes C-5 sterol desaturase and C-4 sterol methyl oxidase. Members of this family are involved in cholesterol biosynthesis and biosynthesis a plant cuticular wax. These enzymes contain two copies of a HXHH motif. Members of this family are integral membrane proteins.
Changed!
COG ERG3 234aa 6e-24
in ref transcript
Sterol desaturase [Lipid metabolism].
Changed!
COG ERG3 143aa 1e-20
in modified transcript
SCAMP1
refseq_SCAMP1.F1 refseq_SCAMP1.R1 200 360
NCBIGene 36.2 9522
Single exon skipping, size difference: 160
Exclusion in the protein causing a frameshift
Reference transcript: NM_004866
Changed!
pfam SCAMP 128aa 4e-47
in ref transcript
SCAMP family. In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a cytoplasmic tail. SCAMPs probably function in endocytosis by recruiting EH-domain proteins to the N-terminal NPF repeats but may have additional functions mediated by their other sequences.
Changed!
pfam SCAMP 42aa 6e-20
in modified transcript
SCAMP3
refseq_SCAMP3.F2 refseq_SCAMP3.R2 139 217
NCBIGene 36.3 10067
Single exon skipping, size difference: 78
Exclusion in the protein (no frameshift)
Reference transcript: NM_005698
pfam SCAMP 178aa 1e-50
in ref transcript
SCAMP family. In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a cytoplasmic tail. SCAMPs probably function in endocytosis by recruiting EH-domain proteins to the N-terminal NPF repeats but may have additional functions mediated by their other sequences.
SCARF1
refseq_SCARF1.F1 refseq_SCARF1.R1 171 208
NCBIGene 36.3 8578
Alternative 3-prime, size difference: 37
Exclusion in the protein causing a frameshift
Reference transcript: NM_003693
SCEL
refseq_SCEL.F1 refseq_SCEL.R1 337 397
NCBIGene 36.3 8796
Single exon skipping, size difference: 60
Exclusion in the protein (no frameshift)
Reference transcript: NM_144777
smart LIM 39aa 7e-04
in ref transcript
Zinc-binding domain present in Lin-11, Isl-1, Mec-3. Zinc-binding domain family. Some LIM domains bind protein partners via tyrosine-containing motifs. LIM domains are found in many key regulators of developmental pathways.
SCFD1
refseq_SCFD1.F1 refseq_SCFD1.R1 109 180
NCBIGene 36.3 23256
Single exon skipping, size difference: 71
Exclusion in the protein causing a frameshift
Reference transcript: NM_016106
Changed!
pfam Sec1 594aa 1e-167
in ref transcript
Sec1 family.
Changed!
COG SEC1 628aa 2e-79
in ref transcript
Proteins involved in synaptic transmission and general secretion, Sec1 family [Intracellular trafficking and secretion].
SCMH1
refseq_SCMH1.F1 refseq_SCMH1.R1 221 287
NCBIGene 36.3 22955
Single exon skipping, size difference: 66
Exclusion in the protein (no frameshift)
Reference transcript: NM_001031694
cd SAM 65aa 2e-08
in ref transcript
Sterile alpha motif.; Widespread domain in signalling and nuclear proteins. In EPH-related tyrosine kinases, appears to mediate cell-cell initiated signal transduction via the binding of SH2-containing proteins to a conserved tyrosine that is phosphorylated. In many cases mediates homodimerization.
smart MBT 96aa 7e-35
in ref transcript
Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. These proteins are involved in transcriptional regulation.
smart MBT 99aa 1e-31
in ref transcript
pfam SAM_1 64aa 7e-12
in ref transcript
SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as a protein interaction module through its ability to homo- and heterooligomerise with other SAM domains.
SCMH1
refseq_SCMH1.F3 refseq_SCMH1.R3 148 362
NCBIGene 36.3 22955
Multiple exon skipping, size difference: 214
Exclusion of the protein initiation site, Exclusion in the protein causing a frameshift
Reference transcript: NM_001031694
cd SAM 65aa 2e-08
in ref transcript
Sterile alpha motif.; Widespread domain in signalling and nuclear proteins. In EPH-related tyrosine kinases, appears to mediate cell-cell initiated signal transduction via the binding of SH2-containing proteins to a conserved tyrosine that is phosphorylated. In many cases mediates homodimerization.
Changed!
smart MBT 96aa 7e-35
in ref transcript
Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. These proteins are involved in transcriptional regulation.
smart MBT 99aa 1e-31
in ref transcript
pfam SAM_1 64aa 7e-12
in ref transcript
SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as a protein interaction module through its ability to homo- and heterooligomerise with other SAM domains.
Changed!
smart MBT 79aa 8e-27
in modified transcript
SCML1
refseq_SCML1.F2 refseq_SCML1.R2 151 303
NCBIGene 36.3 6322
Exon skipping and alternative 3-prime or 5-prime, size difference: 152
Exclusion of the protein initiation site, Exclusion in the protein (no frameshift)
Reference transcript: NM_006746
cd SAM 66aa 8e-08
in ref transcript
Sterile alpha motif.; Widespread domain in signalling and nuclear proteins. In EPH-related tyrosine kinases, appears to mediate cell-cell initiated signal transduction via the binding of SH2-containing proteins to a conserved tyrosine that is phosphorylated. In many cases mediates homodimerization.
pfam SAM_1 65aa 1e-10
in ref transcript
SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as a protein interaction module through its ability to homo- and heterooligomerise with other SAM domains.
SCML1
refseq_SCML1.F3 refseq_SCML1.R1 137 218
NCBIGene 36.3 6322
Exon skipping and alternative 3-prime or 5-prime, size difference: 81
Inclusion in the protein (no stop codon or frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_001037540
cd SAM 66aa 9e-08
in ref transcript
Sterile alpha motif.; Widespread domain in signalling and nuclear proteins. In EPH-related tyrosine kinases, appears to mediate cell-cell initiated signal transduction via the binding of SH2-containing proteins to a conserved tyrosine that is phosphorylated. In many cases mediates homodimerization.
pfam SAM_1 65aa 1e-10
in ref transcript
SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as a protein interaction module through its ability to homo- and heterooligomerise with other SAM domains.
SCNM1
refseq_SCNM1.F2 refseq_SCNM1.R2 106 155
NCBIGene 36.2 79005
Alternative 5-prime, size difference: 49
Inclusion in the protein causing a frameshift
Reference transcript: NM_024041
SCP2
refseq_SCP2.F1 refseq_SCP2.R1 101 233
NCBIGene 36.3 6342
Single exon skipping, size difference: 132
Exclusion in the protein (no frameshift)
Reference transcript: NM_002979
Changed!
cd nondecarbox_cond_enzymes 387aa 1e-153
in ref transcript
nondecarboxylating condensing enzymes; In general, thiolases catalyze the reversible thiolytic cleavage of 3-ketoacyl-CoA into acyl-CoA and acetyl-CoA, a 2-step reaction involving a covalent intermediate formed with a catalytic cysteine. There are 2 functional different classes: thiolase-I (3-ketoacyl-CoA thiolase) and thiolase-II (acetoacetyl-CoA thiolase). Thiolase-I can cleave longer fatty acid molecules and plays an important role in the beta-oxidative degradation of fatty acids. Thiolase-II has a high substrate specificity. Although it can cleave acetoacyl-CoA, its main function is the synthesis of acetoacyl-CoA from two molecules of acetyl-CoA, which gives it importance in several biosynthetic pathways.
Changed!
TIGR AcCoA-C-Actrans 364aa 4e-26
in ref transcript
This model represents a large family of enzymes which catalyze the thiolysis of a linear fatty acid CoA (or acetoacetyl-CoA) using a second CoA molecule to produce acetyl-CoA and a CoA-ester product two carbons shorter (or, alternatively, the condensation of two molecules of acetyl-CoA to produce acetoacetyl-CoA and CoA). This enzyme is also known as "thiolase", "3-ketoacyl-CoA thiolase", "beta-ketothiolase" and "Fatty oxidation complex beta subunit". When catalyzing the degradative reaction on fatty acids the corresponding EC number is 2.3.1.16. The condensation reaction corresponds to 2.3.1.9. Note that the enzymes which catalyze the condensation are generally not involved in fatty acid biosynthesis, which is carried out by a decarboxylating condensation of acetyl and malonyl esters of acyl carrier proteins. Rather, this activity may produce acetoacetyl-CoA for pathways such as IPP biosynthesis in the absence of sufficient fatty acid oxidation.
pfam SCP2 88aa 7e-20
in ref transcript
SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145.
Changed!
PRK PRK08256 393aa 0.0
in ref transcript
lipid-transfer protein; Provisional.
COG COG3154 66aa 2e-07
in ref transcript
Putative lipid carrier protein [Lipid metabolism].
Changed!
cd nondecarbox_cond_enzymes 343aa 1e-129
in modified transcript
Changed!
TIGR AcCoA-C-Actrans 177aa 5e-17
in modified transcript
Changed!
PRK PRK08256 349aa 1e-168
in modified transcript
SCP2
refseq_SCP2.F3 refseq_SCP2.R3 118 248
NCBIGene 36.3 6342
Single exon skipping, size difference: 130
Exclusion in the protein causing a frameshift
Reference transcript: NM_002979
cd nondecarbox_cond_enzymes 387aa 1e-153
in ref transcript
nondecarboxylating condensing enzymes; In general, thiolases catalyze the reversible thiolytic cleavage of 3-ketoacyl-CoA into acyl-CoA and acetyl-CoA, a 2-step reaction involving a covalent intermediate formed with a catalytic cysteine. There are 2 functional different classes: thiolase-I (3-ketoacyl-CoA thiolase) and thiolase-II (acetoacetyl-CoA thiolase). Thiolase-I can cleave longer fatty acid molecules and plays an important role in the beta-oxidative degradation of fatty acids. Thiolase-II has a high substrate specificity. Although it can cleave acetoacyl-CoA, its main function is the synthesis of acetoacyl-CoA from two molecules of acetyl-CoA, which gives it importance in several biosynthetic pathways.
TIGR AcCoA-C-Actrans 364aa 4e-26
in ref transcript
This model represents a large family of enzymes which catalyze the thiolysis of a linear fatty acid CoA (or acetoacetyl-CoA) using a second CoA molecule to produce acetyl-CoA and a CoA-ester product two carbons shorter (or, alternatively, the condensation of two molecules of acetyl-CoA to produce acetoacetyl-CoA and CoA). This enzyme is also known as "thiolase", "3-ketoacyl-CoA thiolase", "beta-ketothiolase" and "Fatty oxidation complex beta subunit". When catalyzing the degradative reaction on fatty acids the corresponding EC number is 2.3.1.16. The condensation reaction corresponds to 2.3.1.9. Note that the enzymes which catalyze the condensation are generally not involved in fatty acid biosynthesis, which is carried out by a decarboxylating condensation of acetyl and malonyl esters of acyl carrier proteins. Rather, this activity may produce acetoacetyl-CoA for pathways such as IPP biosynthesis in the absence of sufficient fatty acid oxidation.
Changed!
pfam SCP2 88aa 7e-20
in ref transcript
SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145.
PRK PRK08256 393aa 0.0
in ref transcript
lipid-transfer protein; Provisional.
Changed!
COG COG3154 66aa 2e-07
in ref transcript
Putative lipid carrier protein [Lipid metabolism].
SCRIB
refseq_SCRIB.F1 refseq_SCRIB.R1 199 274
NCBIGene 36.3 23513
Single exon skipping, size difference: 75
Exclusion in the protein (no frameshift)
Reference transcript: NM_182706
cd PDZ_signaling 89aa 1e-16
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
cd PDZ_signaling 88aa 4e-14
in ref transcript
cd PDZ_signaling 60aa 6e-12
in ref transcript
cd PDZ_signaling 92aa 3e-07
in ref transcript
cd LRR_RI 223aa 1e-05
in ref transcript
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
smart PDZ 90aa 6e-19
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
smart PDZ 91aa 3e-16
in ref transcript
smart PDZ 66aa 3e-14
in ref transcript
TIGR degP_htrA_DO 172aa 2e-08
in ref transcript
This family consists of a set proteins various designated DegP, heat shock protein HtrA, and protease DO. The ortholog in Pseudomonas aeruginosa is designated MucD and is found in an operon that controls mucoid phenotype. This family also includes the DegQ (HhoA) paralog in E. coli which can rescue a DegP mutant, but not the smaller DegS paralog, which cannot. Members of this family are located in the periplasm and have separable functions as both protease and chaperone. Members have a trypsin domain and two copies of a PDZ domain. This protein protects bacteria from thermal and other stresses and may be important for the survival of bacterial pathogens.// The chaperone function is dominant at low temperatures, whereas the proteolytic activity is turned on at elevated temperatures.
smart PDZ 94aa 6e-06
in ref transcript
COG COG4886 200aa 4e-27
in ref transcript
Leucine-rich repeat (LRR) protein [Function unknown].
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
pfam Pkinase 220aa 2e-07
in ref transcript
Protein kinase domain.
SDCCAG3
refseq_SDCCAG3.F1 refseq_SDCCAG3.R1 209 278
NCBIGene 36.3 10807
Single exon skipping, size difference: 69
Exclusion in the protein (no frameshift)
Reference transcript: NM_001039707
pfam SMC_N 162aa 4e-06
in ref transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
SDF4
refseq_SDF4.F2 refseq_SDF4.R2 198 314
NCBIGene 36.3 51150
Alternative 3-prime, size difference: 116
Inclusion in the protein causing a frameshift
Reference transcript: NM_016176
cd EFh 61aa 0.002
in ref transcript
EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
Changed!
cd EFh 54aa 0.003
in ref transcript
Changed!
COG FRQ1 89aa 0.009
in modified transcript
Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].
SDHC
refseq_SDHC.F1 refseq_SDHC.R1 116 218
NCBIGene 36.3 6391
Single exon skipping, size difference: 102
Exclusion in the protein (no frameshift)
Reference transcript: NM_003001
Changed!
cd SQR_TypeC_SdhC 119aa 5e-24
in ref transcript
Succinate:quinone oxidoreductase (SQR) Type C subfamily, Succinate dehydrogenase C (SdhC) subunit; composed of bacterial SdhC and eukaryotic large cytochrome b binding (CybL) proteins. SQR catalyzes the oxidation of succinate to fumarate coupled to the reduction of quinone to quinol. Members of this family reduce high potential quinones such as ubiquinone. SQR is also called succinate dehydrogenase or Complex II, and is part of the citric acid cycle and the aerobic respiratory chain. SQR is composed of a flavoprotein catalytic subunit, an iron-sulfur protein and one or two hydrophobic transmembrane subunits. Proteins in this subfamily are classified as Type C SQRs because they contain two transmembrane subunits and one heme group. The heme and quinone binding sites reside in the transmembrane subunits. The SdhC or CybL protein is one of the two transmembrane subunits of bacterial and eukaryotic SQRs. The two-electron oxidation of succinate in the flavoprotein active site is coupled to the two-electron reduction of quinone in the membrane anchor subunits via electron transport through FAD and three iron-sulfur centers. The reversible reduction of quinone is an essential feature of respiration, allowing transfer of electrons between respiratory complexes.
Changed!
pfam Sdh_cyt 105aa 2e-22
in ref transcript
Succinate dehydrogenase cytochrome b subunit.
Changed!
COG SdhC 103aa 7e-12
in ref transcript
Succinate dehydrogenase/fumarate reductase, cytochrome b subunit [Energy production and conversion].
Changed!
cd SQR_TypeC_SdhC 116aa 3e-18
in modified transcript
Changed!
pfam Sdh_cyt 100aa 4e-16
in modified transcript
Changed!
COG SdhC 101aa 2e-07
in modified transcript
SDHC
refseq_SDHC.F4 refseq_SDHC.R4 214 378
NCBIGene 36.3 6391
Single exon skipping, size difference: 164
Exclusion in the protein causing a frameshift
Reference transcript: NM_003001
Changed!
cd SQR_TypeC_SdhC 119aa 5e-24
in ref transcript
Succinate:quinone oxidoreductase (SQR) Type C subfamily, Succinate dehydrogenase C (SdhC) subunit; composed of bacterial SdhC and eukaryotic large cytochrome b binding (CybL) proteins. SQR catalyzes the oxidation of succinate to fumarate coupled to the reduction of quinone to quinol. Members of this family reduce high potential quinones such as ubiquinone. SQR is also called succinate dehydrogenase or Complex II, and is part of the citric acid cycle and the aerobic respiratory chain. SQR is composed of a flavoprotein catalytic subunit, an iron-sulfur protein and one or two hydrophobic transmembrane subunits. Proteins in this subfamily are classified as Type C SQRs because they contain two transmembrane subunits and one heme group. The heme and quinone binding sites reside in the transmembrane subunits. The SdhC or CybL protein is one of the two transmembrane subunits of bacterial and eukaryotic SQRs. The two-electron oxidation of succinate in the flavoprotein active site is coupled to the two-electron reduction of quinone in the membrane anchor subunits via electron transport through FAD and three iron-sulfur centers. The reversible reduction of quinone is an essential feature of respiration, allowing transfer of electrons between respiratory complexes.
Changed!
pfam Sdh_cyt 105aa 2e-22
in ref transcript
Succinate dehydrogenase cytochrome b subunit.
Changed!
COG SdhC 103aa 7e-12
in ref transcript
Succinate dehydrogenase/fumarate reductase, cytochrome b subunit [Energy production and conversion].
Changed!
cd SQR_TypeC_SdhC 30aa 4e-08
in modified transcript
Changed!
pfam Sdh_cyt 31aa 3e-08
in modified transcript
Changed!
COG SdhC 29aa 5e-04
in modified transcript
SEC13L1
refseq_SEC13L1.F2 refseq_SEC13L1.R2 154 188
NCBIGene 36.2 6396
Single exon skipping, size difference: 34
Exclusion of the protein initiation site
Reference transcript: NM_030673
cd WD40 280aa 3e-23
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
smart WD40 38aa 0.001
in ref transcript
WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
pfam WD40 42aa 0.002
in ref transcript
WD domain, G-beta repeat.
COG COG2319 296aa 1e-16
in ref transcript
FOG: WD40 repeat [General function prediction only].
SEC23B
refseq_SEC23B.F1 refseq_SEC23B.R1 102 120
NCBIGene 36.3 10483
Alternative 5-prime, size difference: 18
Exclusion in 5'UTR
Reference transcript: NM_032985
cd Sec23-like 265aa 1e-125
in ref transcript
Sec23-like: Protein and membrane traffic in eukaryotes is mediated by at least in part by the budding and fusion of intracellular transport vesicles that selectively carry cargo proteins and lipids from donor to acceptor organelles. The two main classes of vesicular carriers within the endocytic and the biosynthetic pathways are COP- and clathrin-coated vesicles. Formation of COPII vesicles requires the ordered assembly of the coat built from several cytosolic components GTPase Sar1, complexes of Sec23-Sec24 and Sec13-Sec31. The process is initiated by the conversion of GDP to GTP by the GTPase Sar1 which then recruits the heterodimeric complex of Sec23 and Sec24. This heterodimeric complex generates the pre-budding complex. The final step leading to membrane deformation and budding of COPII-coated vesicles is carried by the heterodimeric complex Sec13-Sec31. The members of this CD belong to the Sec23-like family. Sec 23 is very similar to Sec24. The Sec23 and Sec24 polypeptides fold into five distinct domains: a beta-barrel, a zinc finger, a vWA or trunk, an all helical region and a carboxy Gelsolin domain. The members of this subgroup lack the consensus MIDAS motif but have the overall Para-Rossmann type fold that is characteristic of this superfamily.
pfam Sec23_trunk 267aa 5e-81
in ref transcript
Sec23/Sec24 trunk domain. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is known as the trunk domain and has an alpha/beta vWA fold and forms the dimer interface.
pfam Sec23_helical 101aa 3e-36
in ref transcript
Sec23/Sec24 helical domain. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is composed of five alpha helices.
pfam Sec23_BS 104aa 1e-23
in ref transcript
Sec23/Sec24 beta-sandwich domain.
pfam zf-Sec23_Sec24 42aa 2e-13
in ref transcript
Sec23/Sec24 zinc finger. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is found to be zinc binding domain.
pfam Gelsolin 87aa 1e-11
in ref transcript
Gelsolin repeat.
COG SEC23 758aa 0.0
in ref transcript
Vesicle coat complex COPII, subunit SEC23 [Intracellular trafficking and secretion].
SEC24C
refseq_SEC24C.F1 refseq_SEC24C.R1 236 323
NCBIGene 36.3 9632
Single exon skipping, size difference: 87
Exclusion in 5'UTR
Reference transcript: NM_004922
cd Sec24-like 245aa 3e-93
in ref transcript
Sec24-like: Protein and membrane traffic in eukaryotes is mediated by at least in part by the budding and fusion of intracellular transport vesicles that selectively carry cargo proteins and lipids from donor to acceptor organelles. The two main classes of vesicular carriers within the endocytic and the biosynthetic pathways are COP- and clathrin-coated vesicles. Formation of COPII vesicles requires the ordered assembly of the coat built from several cytosolic components GTPase Sar1, complexes of Sec23-Sec24 and Sec13-Sec31. The process is initiated by the conversion of GDP to GTP by the GTPase Sar1 which then recruits the heterodimeric complex of Sec23 and Sec24. This heterodimeric complex generates the pre-budding complex. The final step leading to membrane deformation and budding of COPII-coated vesicles is carried by the heterodimeric complex Sec13-Sec31. The members of this CD belong to the Sec23-like family. Sec 24 is very similar to Sec23. The Sec23 and Sec24 polypeptides fold into five distinct domains: a beta-barrel, a zinc finger, a vWA or trunk, an all helical region and a carboxy Gelsolin domain. The members of this subgroup carry a partial MIDAS motif and have the overall Para-Rossmann type fold that is characteristic of this superfamily.
pfam Sec23_trunk 245aa 4e-96
in ref transcript
Sec23/Sec24 trunk domain. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is known as the trunk domain and has an alpha/beta vWA fold and forms the dimer interface.
pfam Sec23_helical 104aa 3e-29
in ref transcript
Sec23/Sec24 helical domain. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is composed of five alpha helices.
pfam Sec23_BS 84aa 2e-25
in ref transcript
Sec23/Sec24 beta-sandwich domain.
pfam zf-Sec23_Sec24 40aa 7e-15
in ref transcript
Sec23/Sec24 zinc finger. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is found to be zinc binding domain.
pfam Gelsolin 31aa 7e-05
in ref transcript
Gelsolin repeat.
pfam PAT1 173aa 1e-04
in ref transcript
Topoisomerase II-associated protein PAT1. Members of this family are necessary for accurate chromosome transmission during cell division.
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_001077207
cd WD40 244aa 4e-22
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
COG COG2319 306aa 6e-15
in ref transcript
FOG: WD40 repeat [General function prediction only].
SEMA6D
refseq_SEMA6D.F1 refseq_SEMA6D.R1 106 145
NCBIGene 36.3 80031
Alternative 3-prime, size difference: 39
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_153618
pfam Sema 418aa 1e-121
in ref transcript
Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor and the human Plexin-A3 precursor.
Changed!
pfam PSI 45aa 7e-07
in ref transcript
Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found Plexin. Two copies of the repeat are found in mahogany protein. A related Caenorhabditis elegans protein contains four copies of the repeat. The Met receptor contains a single copy of the repeat. The Pfam alignment shows 6 conserved cysteine residues that may form three conserved disulphide bridges, whereas shows 8 conserved cysteines. The pattern of conservation suggests that cysteines 5 and 7 (that are not absolutely conserved) form a disulphide bridge (Personal observation. A Bateman).
pfam DUF1043 111aa 0.007
in ref transcript
Protein of unknown function (DUF1043). This family consists of several hypothetical bacterial proteins of unknown function.
Changed!
pfam PSI 35aa 1e-05
in modified transcript
SEMA6D
refseq_SEMA6D.F3 refseq_SEMA6D.R3 220 388
NCBIGene 36.3 80031
Single exon skipping, size difference: 168
Exclusion in the protein (no frameshift)
Reference transcript: NM_153618
pfam Sema 418aa 1e-121
in ref transcript
Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor and the human Plexin-A3 precursor.
pfam PSI 45aa 7e-07
in ref transcript
Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found Plexin. Two copies of the repeat are found in mahogany protein. A related Caenorhabditis elegans protein contains four copies of the repeat. The Met receptor contains a single copy of the repeat. The Pfam alignment shows 6 conserved cysteine residues that may form three conserved disulphide bridges, whereas shows 8 conserved cysteines. The pattern of conservation suggests that cysteines 5 and 7 (that are not absolutely conserved) form a disulphide bridge (Personal observation. A Bateman).
pfam DUF1043 111aa 0.007
in ref transcript
Protein of unknown function (DUF1043). This family consists of several hypothetical bacterial proteins of unknown function.
SEP15
refseq_SEP15.F1 refseq_SEP15.R1 160 210
NCBIGene 36.3 9403
Single exon skipping, size difference: 50
Exclusion in 3'UTR
Reference transcript: NM_004261
SEPN1
refseq_SEPN1.F1 refseq_SEPN1.R1 115 217
NCBIGene 36.3 57190
Single exon skipping, size difference: 102
Inclusion in the protein causing a new stop codon
Reference transcript: NM_206926
SEPT10
refseq_SEPT10.F1 refseq_SEPT10.R1 320 389
NCBIGene 36.3 151011
Single exon skipping, size difference: 69
Exclusion in the protein (no frameshift)
Reference transcript: NM_144710
cd CDC_Septin 269aa 1e-112
in ref transcript
CDC/Septin. Septins are a conserved family of GTP-binding proteins associated with diverse processes in dividing and non-dividing cells. They were first discovered in the budding yeast S. cerevisiae as a set of genes (CDC3, CDC10, CDC11 and CDC12) required for normal bud morphology. Septins are also present in metazoan cells, where they are required for cytokinesis in some systems, and implicated in a variety of other processes involving organization of the cell cortex and exocytosis. In humans, 12 septin genes generate dozens of polypeptides, many of which comprise heterooligomeric complexes. Since septin mutants are commonly defective in cytokinesis and formation of the neck formation of the neck filaments/septin rings, septins have been considered to be the primary constituents of the neck filaments. Septins belong to the GTPase superfamily for their conserved GTPase motifs and enzymatic activities.
pfam Septin 268aa 8e-90
in ref transcript
Septin. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP. As regards the septins, these are polypeptides of 30-65kDa with three characteristic GTPase motifs (G-1, G-3 and G-4) that are similar to those of the Ras family. The G-4 motif is strictly conserved with a unique septin consensus of AKAD. Most septins are thought to have at least one coiled-coil region, which in some cases is necessary for intermolecular interactions that allow septins to polymerise to form rod-shaped complexes. In turn, these are arranged into tandem arrays to form filaments. They are multifunctional proteins, with roles in cytokinesis, sporulation, germ cell development, exocytosis and apoptosis.
COG CDC3 354aa 1e-81
in ref transcript
Septin family protein [Cell division and chromosome partitioning / Cytoskeleton].
SEPT2
refseq_SEPT2.F2 refseq_SEPT2.R2 219 320
NCBIGene 36.3 4735
Single exon skipping, size difference: 101
Exclusion in 5'UTR
Reference transcript: NM_001008491
cd CDC_Septin 275aa 1e-127
in ref transcript
CDC/Septin. Septins are a conserved family of GTP-binding proteins associated with diverse processes in dividing and non-dividing cells. They were first discovered in the budding yeast S. cerevisiae as a set of genes (CDC3, CDC10, CDC11 and CDC12) required for normal bud morphology. Septins are also present in metazoan cells, where they are required for cytokinesis in some systems, and implicated in a variety of other processes involving organization of the cell cortex and exocytosis. In humans, 12 septin genes generate dozens of polypeptides, many of which comprise heterooligomeric complexes. Since septin mutants are commonly defective in cytokinesis and formation of the neck formation of the neck filaments/septin rings, septins have been considered to be the primary constituents of the neck filaments. Septins belong to the GTPase superfamily for their conserved GTPase motifs and enzymatic activities.
pfam Septin 280aa 1e-139
in ref transcript
Septin. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP. As regards the septins, these are polypeptides of 30-65kDa with three characteristic GTPase motifs (G-1, G-3 and G-4) that are similar to those of the Ras family. The G-4 motif is strictly conserved with a unique septin consensus of AKAD. Most septins are thought to have at least one coiled-coil region, which in some cases is necessary for intermolecular interactions that allow septins to polymerise to form rod-shaped complexes. In turn, these are arranged into tandem arrays to form filaments. They are multifunctional proteins, with roles in cytokinesis, sporulation, germ cell development, exocytosis and apoptosis.
COG CDC3 312aa 1e-92
in ref transcript
Septin family protein [Cell division and chromosome partitioning / Cytoskeleton].
SEPT2
refseq_SEPT2.F3 refseq_SEPT2.R1 128 213
NCBIGene 36.3 4735
Single exon skipping, size difference: 85
Exclusion in 5'UTR
Reference transcript: NM_001008492
cd CDC_Septin 275aa 1e-127
in ref transcript
CDC/Septin. Septins are a conserved family of GTP-binding proteins associated with diverse processes in dividing and non-dividing cells. They were first discovered in the budding yeast S. cerevisiae as a set of genes (CDC3, CDC10, CDC11 and CDC12) required for normal bud morphology. Septins are also present in metazoan cells, where they are required for cytokinesis in some systems, and implicated in a variety of other processes involving organization of the cell cortex and exocytosis. In humans, 12 septin genes generate dozens of polypeptides, many of which comprise heterooligomeric complexes. Since septin mutants are commonly defective in cytokinesis and formation of the neck formation of the neck filaments/septin rings, septins have been considered to be the primary constituents of the neck filaments. Septins belong to the GTPase superfamily for their conserved GTPase motifs and enzymatic activities.
pfam Septin 280aa 1e-139
in ref transcript
Septin. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP. As regards the septins, these are polypeptides of 30-65kDa with three characteristic GTPase motifs (G-1, G-3 and G-4) that are similar to those of the Ras family. The G-4 motif is strictly conserved with a unique septin consensus of AKAD. Most septins are thought to have at least one coiled-coil region, which in some cases is necessary for intermolecular interactions that allow septins to polymerise to form rod-shaped complexes. In turn, these are arranged into tandem arrays to form filaments. They are multifunctional proteins, with roles in cytokinesis, sporulation, germ cell development, exocytosis and apoptosis.
COG CDC3 312aa 1e-92
in ref transcript
Septin family protein [Cell division and chromosome partitioning / Cytoskeleton].
SEPT6
refseq_SEPT6.F1 refseq_SEPT6.R1 253 298
NCBIGene 36.3 23157
Single exon skipping, size difference: 45
Inclusion in the protein causing a new stop codon
Reference transcript: NM_145802
cd CDC_Septin 269aa 1e-117
in ref transcript
CDC/Septin. Septins are a conserved family of GTP-binding proteins associated with diverse processes in dividing and non-dividing cells. They were first discovered in the budding yeast S. cerevisiae as a set of genes (CDC3, CDC10, CDC11 and CDC12) required for normal bud morphology. Septins are also present in metazoan cells, where they are required for cytokinesis in some systems, and implicated in a variety of other processes involving organization of the cell cortex and exocytosis. In humans, 12 septin genes generate dozens of polypeptides, many of which comprise heterooligomeric complexes. Since septin mutants are commonly defective in cytokinesis and formation of the neck formation of the neck filaments/septin rings, septins have been considered to be the primary constituents of the neck filaments. Septins belong to the GTPase superfamily for their conserved GTPase motifs and enzymatic activities.
pfam Septin 270aa 2e-88
in ref transcript
Septin. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP. As regards the septins, these are polypeptides of 30-65kDa with three characteristic GTPase motifs (G-1, G-3 and G-4) that are similar to those of the Ras family. The G-4 motif is strictly conserved with a unique septin consensus of AKAD. Most septins are thought to have at least one coiled-coil region, which in some cases is necessary for intermolecular interactions that allow septins to polymerise to form rod-shaped complexes. In turn, these are arranged into tandem arrays to form filaments. They are multifunctional proteins, with roles in cytokinesis, sporulation, germ cell development, exocytosis and apoptosis.
COG CDC3 322aa 5e-77
in ref transcript
Septin family protein [Cell division and chromosome partitioning / Cytoskeleton].
SERBP1
refseq_SERBP1.F1 refseq_SERBP1.R1 100 118
NCBIGene 36.3 26135
Alternative 3-prime, size difference: 18
Exclusion in the protein (no frameshift)
Reference transcript: NM_001018067
Changed!
pfam HABP4_PAI-RBP1 125aa 2e-16
in ref transcript
Hyaluronan / mRNA binding family. This family includes the HABP4 family of hyaluronan-binding proteins, and the PAI-1 mRNA-binding protein, PAI-RBP1. HABP4 has been observed to bind hyaluronan (a glucosaminoglycan), but it is not known whether this is its primary role in vivo. It has also been observed to bind RNA, but with a lower affinity than that for hyaluronan. PAI-1 mRNA-binding protein specifically binds the mRNA of type-1 plasminogen activator inhibitor (PAI-1), and is thought to be involved in regulation of mRNA stability. However, in both cases, the sequence motifs predicted to be important for ligand binding are not conserved throughout the family, so it is not known whether members of this family share a common function.
Changed!
pfam HABP4_PAI-RBP1 119aa 5e-18
in modified transcript
SERPINA1
refseq_SERPINA1.F1 refseq_SERPINA1.R1 100 414
NCBIGene 36.3 5265
Multiple exon skipping, size difference: 314
Exclusion in 5'UTR, Exclusion in 5'UTR
Reference transcript: NM_001002236
cd alpha-1-antitrypsin_like 358aa 1e-154
in ref transcript
alpha-1-antitrypsin_like. This family contains a variety of different members of clade A of the serpin superfamily. They include the classical serine proteinase inhibitors, alpha-1-antitrypsin and alpha-1-antichymotrypsin, protein C inhibitor, kallistatin, and noninhibitory serpins, like corticosteroid and thyroxin binding globulins. In general, SERine Proteinase INhibitors (serpins) exhibit conformational polymorphism shifting from native to cleaved, latent, delta, or polymorphic forms. Many serpins, such as antitrypsin and antichymotrypsin, function as serine protease inhibitors which regulate blood coagulation cascades. Non-inhibitory serpins perform many diverse functions such as chaperoning proteins or transporting hormones. Serpins are of medical interest because mutants have been associated with blood clotting disorders, emphysema, cirrhosis, and dementia.
pfam Serpin 373aa 1e-158
in ref transcript
Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich.
COG COG4826 359aa 2e-52
in ref transcript
Serine protease inhibitor [Posttranslational modification, protein turnover, chaperones].
SETD4
refseq_SETD4.F1 refseq_SETD4.R1 113 207
NCBIGene 36.2 54093
Single exon skipping, size difference: 94
Inclusion in the protein causing a frameshift
Reference transcript: NM_017438
Changed!
pfam Rubis-subs-bind 119aa 2e-21
in ref transcript
Rubisco LSMT substrate-binding. Members of this family adopt a multihelical structure, with an irregular array of long and short alpha-helices. They allow binding of the protein to substrate, such as the N-terminal tails of histones H3 and H4 and the large subunit of the Rubisco holoenzyme complex.
SETD4
refseq_SETD4.F3 refseq_SETD4.R3 150 199
NCBIGene 36.2 54093
Alternative 3-prime, size difference: 49
Inclusion in 5'UTR
Reference transcript: NM_017438
pfam Rubis-subs-bind 119aa 2e-21
in ref transcript
Rubisco LSMT substrate-binding. Members of this family adopt a multihelical structure, with an irregular array of long and short alpha-helices. They allow binding of the protein to substrate, such as the N-terminal tails of histones H3 and H4 and the large subunit of the Rubisco holoenzyme complex.
SETD5
refseq_SETD5.F2 refseq_SETD5.R2 181 220
NCBIGene 36.2 55209
Single exon skipping, size difference: 39
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: XM_931455
pfam SET 102aa 4e-21
in ref transcript
SET domain. SET domains are protein lysine methyltransferase enzymes. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains have been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as SET-N and SET-C. SET-C forms an unusual and conserved knot-like structure of probably functional importance. Additionally to SET-N and SET-C, an insert region (SET-I) and flanking regions of high structural variability form part of the overall structure.
COG COG2940 100aa 7e-05
in ref transcript
Proteins containing SET domain [General function prediction only].
SETD5
refseq_SETD5.F3 refseq_SETD5.R3 263 320
NCBIGene 36.2 55209
Single exon skipping, size difference: 57
Exclusion in the protein (no frameshift)
Reference transcript: XM_931455
pfam SET 102aa 4e-21
in ref transcript
SET domain. SET domains are protein lysine methyltransferase enzymes. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains have been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as SET-N and SET-C. SET-C forms an unusual and conserved knot-like structure of probably functional importance. Additionally to SET-N and SET-C, an insert region (SET-I) and flanking regions of high structural variability form part of the overall structure.
COG COG2940 100aa 7e-05
in ref transcript
Proteins containing SET domain [General function prediction only].
SETD5
refseq_SETD5.F5 refseq_SETD5.R5 148 373
NCBIGene 36.2 55209
Single exon skipping, size difference: 225
Inclusion in the protein causing a new stop codon
Reference transcript: XM_931455
Changed!
pfam SET 102aa 4e-21
in ref transcript
SET domain. SET domains are protein lysine methyltransferase enzymes. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains have been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as SET-N and SET-C. SET-C forms an unusual and conserved knot-like structure of probably functional importance. Additionally to SET-N and SET-C, an insert region (SET-I) and flanking regions of high structural variability form part of the overall structure.
Changed!
COG COG2940 100aa 7e-05
in ref transcript
Proteins containing SET domain [General function prediction only].
SEZ6L2
refseq_SEZ6L2.F1 refseq_SEZ6L2.R1 221 260
NCBIGene 36.3 26470
Single exon skipping, size difference: 39
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_201575
cd CUB 91aa 6e-11
in ref transcript
CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
cd CUB 69aa 7e-11
in ref transcript
cd CCP 57aa 4e-09
in ref transcript
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
cd CUB 113aa 3e-08
in ref transcript
cd CCP 59aa 3e-08
in ref transcript
cd CCP 60aa 2e-07
in ref transcript
cd CCP 56aa 2e-06
in ref transcript
pfam CUB 91aa 2e-08
in ref transcript
CUB domain.
smart CCP 56aa 3e-08
in ref transcript
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.
smart CCP 56aa 5e-08
in ref transcript
pfam Sushi 58aa 1e-07
in ref transcript
Sushi domain (SCR repeat).
smart CCP 60aa 3e-07
in ref transcript
smart CUB 59aa 3e-06
in ref transcript
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein. This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.
smart CUB 102aa 5e-04
in ref transcript
pfam Sushi 57aa 0.003
in ref transcript
SF3A1
refseq_SF3A1.F1 refseq_SF3A1.R1 121 316
NCBIGene 36.3 10291
Alternative 5-prime and 3-prime, size difference: 195
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_005877
cd SF3a120_C 76aa 4e-32
in ref transcript
SF3a120_C Mammalian splicing factor SF3a consists of three subunits of 60, 66, and 120 kDa and functions early during pre-mRNA splicing by converting the U2 snRNP to its active form. The 120kDa subunit (SF3a120) has a carboxy-terminal ubiquitin-like domain and two SWAP (suppressor-of-white-apricot) domains, referred to collectively as the SURP module, at its amino-terminus.
smart SWAP 54aa 4e-21
in ref transcript
Suppressor-of-White-APricot splicing regulator. domain present in regulators which are responsible for pre-mRNA splicing processes.
Changed!
smart SWAP 54aa 3e-17
in ref transcript
pfam ubiquitin 67aa 1e-11
in ref transcript
Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1, and Parkin. A number of them are thought to carry a distinctive five-residue motif termed the proteasome-interacting motif (PIM), which may have a biologically significant role in protein delivery to proteasomes and recruitment of proteasomes to transcription sites.
PTZ PTZ00044 69aa 0.002
in ref transcript
ubiquitin; Provisional.
Changed!
smart SWAP 47aa 4e-14
in modified transcript
SFI1
refseq_SFI1.F1 refseq_SFI1.R1 253 346
NCBIGene 36.3 9814
Single exon skipping, size difference: 93
Exclusion in the protein (no frameshift)
Reference transcript: NM_001007467
Changed!
pfam Sfi1 551aa 3e-05
in ref transcript
Sfi1 spindle body protein. This is a family of fungal spindle pole body proteins that play a role in spindle body duplication. They contain binding sites for calmodulin-like proteins called centrins which are present in microtubule-organising centres.
Changed!
pfam Sfi1 568aa 3e-07
in modified transcript
SFMBT1
refseq_SFMBT1.F1 refseq_SFMBT1.R1 205 324
NCBIGene 36.3 51460
Single exon skipping, size difference: 119
Exclusion in 5'UTR
Reference transcript: NM_001005158
cd SAM 63aa 3e-07
in ref transcript
Sterile alpha motif.; Widespread domain in signalling and nuclear proteins. In EPH-related tyrosine kinases, appears to mediate cell-cell initiated signal transduction via the binding of SH2-containing proteins to a conserved tyrosine that is phosphorylated. In many cases mediates homodimerization.
smart MBT 98aa 1e-30
in ref transcript
Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. These proteins are involved in transcriptional regulation.
smart MBT 94aa 4e-28
in ref transcript
smart MBT 91aa 2e-23
in ref transcript
smart MBT 102aa 1e-21
in ref transcript
smart SAM 65aa 7e-06
in ref transcript
Sterile alpha motif. Widespread domain in signalling and nuclear proteins. In EPH-related tyrosine kinases, appears to mediate cell-cell initiated signal transduction via the binding of SH2-containing proteins to a conserved tyrosine that is phosphorylated. In many cases mediates homodimerisation.
SFRS5
refseq_SFRS5.F1 refseq_SFRS5.R1 228 357
NCBIGene 36.3 6430
Alternative 5-prime, size difference: 129
Exclusion in 5'UTR
Reference transcript: NM_001039465
cd RRM 67aa 6e-12
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
cd RRM 68aa 3e-08
in ref transcript
pfam RRM_1 63aa 4e-12
in ref transcript
RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternative splicing, and protein components of snRNPs. The motif also appears in a few single stranded DNA binding proteins. The RRM structure consists of four strands and two helices arranged in an alpha/beta sandwich, with a third helix present during RNA binding in some cases The C-terminal beta strand (4th strand) and final helix are hard to align and have been omitted in the SEED alignment The LA proteins have a N terminus rrm which is included in the seed. There is a second region towards the C terminus that has some features of a rrm but does not appear to have the important structural core of a rrm. The LA proteins are one of the main autoantigens in Systemic lupus erythematosus (SLE), an autoimmune disease.
smart RRM_2 68aa 8e-09
in ref transcript
RNA recognition motif.
TIGR PABP-1234 168aa 3e-07
in ref transcript
There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, and of unknown tissue range.
COG COG0724 72aa 2e-06
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
COG COG0724 69aa 0.001
in ref transcript
SFXN4
refseq_SFXN4.F2 refseq_SFXN4.R2 271 392
NCBIGene 36.2 119559
Single exon skipping, size difference: 121
Inclusion in the protein causing a new stop codon
Reference transcript: NM_213649
Changed!
TIGR mtc 298aa 2e-27
in ref transcript
The MTC family consists of a limited number of homologues, all from eukaryotes. A single member of the family has been functionally characterized, the tricarboxylate carrier from rat liver mitochondria. The rat liver mitochondrial tricarboxylate carrier has been reported to transport citrate, cis-aconitate, threo-D-isocitrate, D- and L-tartrate, malate, succinate and phosphoenolpyruvate. It presumably functions by a proton symport mechanism.
Changed!
TIGR mtc 271aa 2e-25
in modified transcript
SFXN4
refseq_SFXN4.F3 refseq_SFXN4.R3 103 130
NCBIGene 36.2 119559
Single exon skipping, size difference: 27
Exclusion in the protein (no frameshift)
Reference transcript: NM_213649
Changed!
TIGR mtc 298aa 2e-27
in ref transcript
The MTC family consists of a limited number of homologues, all from eukaryotes. A single member of the family has been functionally characterized, the tricarboxylate carrier from rat liver mitochondria. The rat liver mitochondrial tricarboxylate carrier has been reported to transport citrate, cis-aconitate, threo-D-isocitrate, D- and L-tartrate, malate, succinate and phosphoenolpyruvate. It presumably functions by a proton symport mechanism.
Changed!
TIGR mtc 289aa 1e-26
in modified transcript
SGK3
refseq_SGK3.F2 refseq_SGK3.R2 162 258
NCBIGene 36.3 23678
Single exon skipping, size difference: 96
Exclusion in the protein (no frameshift)
Reference transcript: NM_001033578
Changed!
cd STKc_SGK3 325aa 0.0
in ref transcript
STKc_SGK3: Serine/Threonine Kinases (STKs), Serum- and Glucocorticoid-induced Kinase (SGK) subfamily, SGK3 isoform, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The SGK subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. There are three isoforms of SGK, named SGK1, SGK2, and SGK3 (also called cytokine-independent survival kinase CISK). SGK3 is expressed in most tissues and is most abundant in the embryo and adult heart and spleen. It was originally discovered in a screen for antiapoptotic genes. It phosphorylates and inhibits the proapoptotic proteins, Bad and FKHRL1. SGK3 also regulates many transporters, ion channels, and receptors. It plays a critical role in hair follicle morphogenesis and hair cycling.
cd PX_CISK 109aa 9e-53
in ref transcript
The phosphoinositide binding Phox Homology Domain of Cytokine-Independent Survival Kinase. The PX domain is a phosphoinositide (PI) binding module present in many proteins with diverse functions. Cytokine-independent survival kinase (CISK), also called Serum- and Glucocorticoid-induced Kinase 3 (SGK3), plays a role in cell growth and survival. It is expressed in most tissues and is most abundant in the embryo and adult heart and spleen. It was originally discovered in a screen for antiapoptotic genes. It phosphorylates and inhibits the proapoptotic proteins, Bad and FKHRL1. CISK/SGK3 also regulates many transporters, ion channels, and receptors. It plays a critical role in hair follicle morphogenesis and hair cycling. N-terminal to a catalytic kinase domain, CISK contains a PX domain which binds highly phosphorylated PIs, directs membrane localization, and regulates the enzyme's activity.
Changed!
smart S_TKc 248aa 7e-79
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
pfam PX 104aa 7e-19
in ref transcript
PX domain. PX domains bind to phosphoinositides.
smart S_TK_X 67aa 9e-10
in ref transcript
Extension to Ser/Thr-type protein kinases.
Changed!
PTZ PTZ00263 293aa 9e-76
in ref transcript
protein kinase A catalytic subunit; Provisional.
COG COG5391 113aa 1e-08
in ref transcript
Phox homology (PX) domain protein [Intracellular trafficking and secretion / General function prediction only].
Changed!
cd STKc_SGK3 293aa 1e-158
in modified transcript
Changed!
smart S_TKc 216aa 3e-58
in modified transcript
Changed!
PTZ PTZ00263 261aa 2e-57
in modified transcript
SH3BP5
refseq_SH3BP5.F1 refseq_SH3BP5.R1 134 202
NCBIGene 36.3 9467
Alternative 3-prime, size difference: 68
Inclusion in the protein causing a new stop codon
Reference transcript: NM_004844
Changed!
pfam SH3BP5 233aa 3e-82
in ref transcript
SH3 domain-binding protein 5 (SH3BP5). This family consists of several eukaryotic SH3 domain-binding protein 5 or c-Jun N-terminal kinase (JNK)-interacting proteins (SH3BP5 or Sab). Sab binds to and serves as a substrate for JNK in vitro, and has been found to interact with the Src homology 3 (SH3) domain of Bruton's tyrosine kinase (Btk). Inspection of the sequence of Sab reveals the presence of two putative mitogen-activated protein kinase interaction motifs (KIMs) similar to that found in the JNK docking domain of the c-Jun transcription factor, and four potential serine-proline JNK phosphorylation sites in the C-terminal half of the molecule.
Changed!
PRK PRK10929 193aa 0.008
in ref transcript
hypothetical protein; Provisional.
Changed!
pfam SH3BP5 27aa 9e-05
in modified transcript
SHANK2
refseq_SHANK2.F1 refseq_SHANK2.R1 100 121
NCBIGene 36.3 22941
Single exon skipping, size difference: 21
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_012309
cd SAM 62aa 2e-16
in ref transcript
Sterile alpha motif.; Widespread domain in signalling and nuclear proteins. In EPH-related tyrosine kinases, appears to mediate cell-cell initiated signal transduction via the binding of SH2-containing proteins to a conserved tyrosine that is phosphorylated. In many cases mediates homodimerization.
cd PDZ_signaling 94aa 3e-15
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
pfam SAM_1 61aa 1e-18
in ref transcript
SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as a protein interaction module through its ability to homo- and heterooligomerise with other SAM domains.
smart PDZ 95aa 8e-13
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
SHOX2
refseq_SHOX2.F2 refseq_SHOX2.R2 186 222
NCBIGene 36.3 6474
Alternative 3-prime, size difference: 36
Exclusion in the protein (no frameshift)
Reference transcript: NM_006884
cd homeodomain 59aa 2e-15
in ref transcript
Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
cd RRM 75aa 1e-15
in ref transcript
cd RRM 74aa 3e-05
in ref transcript
Changed!
TIGR half-pint 295aa 1e-132
in ref transcript
In the case of PUF60 (GP|6176532), in complex with p54, and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.
TIGR half-pint 127aa 2e-59
in ref transcript
TIGR half-pint 56aa 7e-13
in ref transcript
Changed!
COG COG0724 258aa 1e-16
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
Changed!
TIGR half-pint 278aa 1e-130
in modified transcript
Changed!
COG COG0724 160aa 3e-15
in modified transcript
SIGLEC6
refseq_SIGLEC6.F1 refseq_SIGLEC6.R1 130 306
NCBIGene 36.3 946
Multiple exon skipping, size difference: 176
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_001245
cd IGcam 80aa 8e-05
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
pfam I-set 83aa 8e-08
in ref transcript
Immunoglobulin I-set domain.
pfam V-set 105aa 3e-07
in ref transcript
Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
pfam C2-set_2 74aa 0.003
in ref transcript
CD80-like C2-set immunoglobulin domain. These domains belong to the immunoglobulin superfamily.
SIGLEC6
refseq_SIGLEC6.F4 refseq_SIGLEC6.R4 228 276
NCBIGene 36.3 946
Single exon skipping, size difference: 48
Exclusion in the protein (no frameshift)
Reference transcript: NM_001245
cd IGcam 80aa 8e-05
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
pfam I-set 83aa 8e-08
in ref transcript
Immunoglobulin I-set domain.
pfam V-set 105aa 3e-07
in ref transcript
Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
pfam C2-set_2 74aa 0.003
in ref transcript
CD80-like C2-set immunoglobulin domain. These domains belong to the immunoglobulin superfamily.
SIGLEC7
refseq_SIGLEC7.F1 refseq_SIGLEC7.R1 206 485
NCBIGene 36.3 27036
Single exon skipping, size difference: 279
Exclusion in the protein (no frameshift)
Reference transcript: NM_014385
cd IGcam 74aa 1e-04
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
smart IG_like 78aa 2e-08
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
pfam V-set 99aa 1e-07
in ref transcript
Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
Changed!
pfam C2-set_2 74aa 7e-05
in ref transcript
CD80-like C2-set immunoglobulin domain. These domains belong to the immunoglobulin superfamily.
SIL1
refseq_SIL1.F1 refseq_SIL1.R1 151 239
NCBIGene 36.3 64374
Single exon skipping, size difference: 88
Exclusion in 5'UTR
Reference transcript: NM_001037633
SIP1
refseq_SIP1.F1 refseq_SIP1.R1 107 166
NCBIGene 36.3 8487
Single exon skipping, size difference: 59
Exclusion in the protein causing a frameshift
Reference transcript: NM_003616
Changed!
pfam SIP1 247aa 5e-97
in ref transcript
Survival motor neuron (SMN) interacting protein 1 (SIP1). Survival motor neuron (SMN) interacting protein 1 (SIP1) interacts with SMN protein and plays a crucial role in the biogenesis of spliceosomes. There is evidence that the protein is linked to spinal muscular atrophy (SMA) and amyotrophic lateral sclerosis(ALS) in humans.
Changed!
pfam SIP1 221aa 2e-85
in modified transcript
SIP1
refseq_SIP1.F3 refseq_SIP1.R3 216 261
NCBIGene 36.3 8487
Single exon skipping, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_003616
Changed!
pfam SIP1 247aa 5e-97
in ref transcript
Survival motor neuron (SMN) interacting protein 1 (SIP1). Survival motor neuron (SMN) interacting protein 1 (SIP1) interacts with SMN protein and plays a crucial role in the biogenesis of spliceosomes. There is evidence that the protein is linked to spinal muscular atrophy (SMA) and amyotrophic lateral sclerosis(ALS) in humans.
Changed!
pfam SIP1 232aa 1e-91
in modified transcript
SIRPB2
refseq_SIRPB2.F1 refseq_SIRPB2.R1 106 400
NCBIGene 36.2 284759
Alternative 5-prime, size difference: 294
Exclusion in the protein (no frameshift)
Reference transcript: XM_209363
cd IGcam 84aa 5e-05
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
pfam V-set 107aa 3e-10
in ref transcript
Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
Changed!
pfam V-set 113aa 7e-10
in ref transcript
SIRPG
refseq_SIRPG.F2 refseq_SIRPG.R2 103 436
NCBIGene 36.3 55423
Single exon skipping, size difference: 333
Exclusion in the protein (no frameshift)
Reference transcript: NM_018556
Changed!
cd IGc 87aa 2e-06
in ref transcript
Immunoglobulin domain constant region subfamily; members of the IGc subfamily are components of immunoglobulins, T-cell receptors, CD1 cell surface glycoproteins, secretory glycoproteins A/C, and Major Histocompatibility Complex (MHC) class I/II molecules. In immunoglobulins, each chain is composed of one variable domain (IGv) and one or more constant domains (IGc); these names reflect the fact that the variability in sequences is higher in the variable domain than in the constant domain. T-cell receptors form heterodimers, pairing two chains (alpha/beta or gamma/delta), each with a IGv and IGc domain. MHCs form heterodimers pairing two chains (alpha/beta or delta/epsilon), each with a MHC and IGc domain. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IGc 89aa 5e-05
in ref transcript
cd IGv 98aa 7e-05
in ref transcript
Immunoglobulin domain variable region (v) subfamily; members of the IGv subfamily are components of immunoglobulins and T-cell receptors. In immunoglobulins, each chain is composed of one variable domain (IGv) and one or more constant domains (IGc); these names reflect the fact that the variability in sequences is higher in the variable domain than in the constant domain. Within the variable domain, there are regions of even more variability called the hypervariable or complementarity-determining regions (CDRs) which are responsible for antigen binding. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
Changed!
pfam V-set 90aa 3e-11
in ref transcript
Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
smart IGc1 73aa 5e-07
in ref transcript
Immunoglobulin C-Type.
Changed!
smart IGc1 70aa 9e-06
in ref transcript
Changed!
pfam V-set 103aa 8e-12
in modified transcript
SIRT2
refseq_SIRT2.F1 refseq_SIRT2.R1 183 230
NCBIGene 36.3 22933
Single exon skipping, size difference: 47
Exclusion in the protein causing a frameshift
Reference transcript: NM_012237
Changed!
cd SIRT1 255aa 1e-108
in ref transcript
SIRT1: Eukaryotic group (class1) which includes human sirtuins SIRT1-3 and yeast Hst1-4; and are members of the SIR2 family of proteins, silent information regulator 2 (Sir2) enzymes which catalyze NAD+-dependent protein/histone deacetylation. Sir2 proteins have been shown to regulate gene silencing, DNA repair, and life span. The most-studied function, gene silencing, involves the inactivation of chromosome domains containing key regulatory genes by packaging them into a specialized chromatin structure that is inaccessible to DNA-binding proteins. The nuclear SIRT1 has been shown to target the p53 tumor suppressor protein for deacetylation to suppress DNA damage, and the cytoplasmic SIRT2 homolog has been shown to target alpha-tubulin for deacetylation for the maintenance of cell integrity.
Changed!
pfam SIR2 182aa 3e-69
in ref transcript
Sir2 family. This region is characteristic of Silent information regulator 2 (Sir2) proteins, or sirtuins. These are protein deacetylases that depend on nicotine adenine dinucleotide (NAD). They are found in many subcellular locations, including the nucleus, cytoplasm and mitochondria. Eukaryotic forms play in important role in the regulation of transcriptional repression. Moreover, they are involved in microtubule organisation and DNA damage repair processes.
Changed!
COG SIR2 206aa 3e-48
in ref transcript
NAD-dependent protein deacetylases, SIR2 family [Transcription].
SIRT3
refseq_SIRT3.F1 refseq_SIRT3.R1 138 284
NCBIGene 36.3 23410
Alternative 5-prime, size difference: 146
Exclusion in the protein causing a frameshift
Reference transcript: NM_012239
Changed!
cd SIRT1 236aa 2e-94
in ref transcript
SIRT1: Eukaryotic group (class1) which includes human sirtuins SIRT1-3 and yeast Hst1-4; and are members of the SIR2 family of proteins, silent information regulator 2 (Sir2) enzymes which catalyze NAD+-dependent protein/histone deacetylation. Sir2 proteins have been shown to regulate gene silencing, DNA repair, and life span. The most-studied function, gene silencing, involves the inactivation of chromosome domains containing key regulatory genes by packaging them into a specialized chromatin structure that is inaccessible to DNA-binding proteins. The nuclear SIRT1 has been shown to target the p53 tumor suppressor protein for deacetylation to suppress DNA damage, and the cytoplasmic SIRT2 homolog has been shown to target alpha-tubulin for deacetylation for the maintenance of cell integrity.
Changed!
pfam SIR2 179aa 3e-56
in ref transcript
Sir2 family. This region is characteristic of Silent information regulator 2 (Sir2) proteins, or sirtuins. These are protein deacetylases that depend on nicotine adenine dinucleotide (NAD). They are found in many subcellular locations, including the nucleus, cytoplasm and mitochondria. Eukaryotic forms play in important role in the regulation of transcriptional repression. Moreover, they are involved in microtubule organisation and DNA damage repair processes.
Changed!
COG SIR2 253aa 4e-47
in ref transcript
NAD-dependent protein deacetylases, SIR2 family [Transcription].
SIVA1
refseq_SIVA.F1 refseq_SIVA.R1 118 313
NCBIGene 36.3 10572
Single exon skipping, size difference: 195
Exclusion in the protein (no frameshift)
Reference transcript: NM_006427
Changed!
pfam Siva 175aa 4e-84
in ref transcript
Cd27 binding protein (Siva). Siva binds to the CD27 cytoplasmic tail. It has a DD homology region, a box-B-like ring finger, and a zinc finger-like domain. Overexpression of Siva in various cell lines induces apoptosis, suggesting an important role for Siva in the CD27-transduced apoptotic pathway. Siva-1 binds to and inhibits BCL-X(L)-mediated protection against UV radiation-induced apoptosis. Indeed, the unique amphipathic helical region (SAH) present in Siva-1 is required for its binding to BCL-X(L) and sensitising cells to UV radiation. Natural complexes of Siva-1/BCL-X(L) are detected in HUT78 and murine thymocyte, suggesting a potential role for Siva-1 in regulating T cell homeostasis. This family contains both Siva-1 and the shorter Siva-2 lacking the sequence coded by exon 2. It has been suggested that Siva-2 could regulate the function of Siva-1.
Changed!
pfam Siva 71aa 2e-29
in modified transcript
Changed!
pfam CALCOCO1 70aa 8e-04
in ref transcript
Calcium binding and coiled-coil domain (CALCOCO1) like. Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.
Changed!
COG COG5411 277aa 2e-31
in ref transcript
Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
cd SH3 54aa 0.004
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
pfam SH2 83aa 1e-17
in ref transcript
SH2 domain.
pfam SH3_1 55aa 3e-05
in ref transcript
SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organisation. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel.
Changed!
cd SH2 85aa 6e-16
in modified transcript
SEPSECS
refseq_SLA_LP.F2 refseq_SLA_LP.R2 123 278
NCBIGene 36.3 51091
Single exon skipping, size difference: 155
Exclusion of the protein initiation site
Reference transcript: NM_153825
cd AAT_like 182aa 5e-04
in ref transcript
Aspartate aminotransferase family. This family belongs to pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I). Pyridoxal phosphate combines with an alpha-amino acid to form a compound called a Schiff base or aldimine intermediate, which depending on the reaction, is the substrate in four kinds of reactions (1) transamination (movement of amino groups), (2) racemization (redistribution of enantiomers), (3) decarboxylation (removing COOH groups), and (4) various side-chain reactions depending on the enzyme involved. Pyridoxal phosphate (PLP) dependent enzymes were previously classified into alpha, beta and gamma classes, based on the chemical characteristics (carbon atom involved) of the reaction they catalyzed. The availability of several structures allowed a comprehensive analysis of the evolutionary classification of PLP dependent enzymes, and it was found that the functional classification did not always agree with the evolutionary history of these enzymes. The major groups in this CD corresponds to Aspartate aminotransferase a, b and c, Tyrosine, Alanine, Aromatic-amino-acid, Glutamine phenylpyruvate, 1-Aminocyclopropane-1-carboxylate synthase, Histidinol-phosphate, gene products of malY and cobC, Valine-pyruvate aminotransferase and Rhizopine catabolism regulatory protein.
Changed!
TIGR selenium_SpcS 398aa 0.0
in ref transcript
In the archaea and eukaryotes, the conversion of the mischarged serine to selenocysteine (Sec) on its tRNA is accomplished in two steps. This enzyme, O-phosphoseryl-tRNA(Sec) selenium transferase, acts second, after a phosphophorylation step catalyzed by a homolog of the bacterial SelA protein.
COG GadB 342aa 3e-16
in ref transcript
Glutamate decarboxylase and related PLP-dependent proteins [Amino acid transport and metabolism].
Changed!
TIGR selenium_SpcS 369aa 0.0
in modified transcript
SLC22A1
refseq_SLC22A1.F1 refseq_SLC22A1.R1 149 262
NCBIGene 36.3 6580
Single exon skipping, size difference: 113
Exclusion in the protein causing a frameshift
Reference transcript: NM_003057
cd MFS 166aa 2e-16
in ref transcript
The Major Facilitator Superfamily (MFS) is a large and diverse group of secondary transporters that includes uniporters, symporters, and antiporters. MFS proteins facilitate the transport across cytoplasmic or internal membranes of a variety of substrates including ions, sugar phosphates, drugs, neurotransmitters, nucleosides, amino acids, and peptides. They do so using the electrochemical potential of the transported substrates. Uniporters transport a single substrate, while symporters and antiporters transport two substrates in the same or in opposite directions, respectively, across membranes. MFS proteins are typically 400 to 600 amino acids in length, and the majority contain 12 transmembrane alpha helices (TMs) connected by hydrophilic loops. The N- and C-terminal halves of these proteins display weak similarity and may be the result of a gene duplication/fusion event. Based on kinetic studies and the structures of a few bacterial superfamily members, GlpT (glycerol-3-phosphate transporter), LacY (lactose permease), and EmrD (multidrug transporter), MFS proteins are thought to function through a single substrate binding site, alternating-access mechanism involving a rocker-switch type of movement. Bacterial members function primarily for nutrient uptake, and as drug-efflux pumps to confer antibiotic resistance. Some MFS proteins have medical significance in humans such as the glucose transporter Glut4, which is impaired in type II diabetes, and glucose-6-phosphate transporter (G6PT), which causes glycogen storage disease when mutated.
Changed!
cd MFS 143aa 4e-08
in ref transcript
Changed!
TIGR 2A0119 513aa 1e-180
in ref transcript
COG AraJ 124aa 5e-09
in ref transcript
Arabinose efflux permease [Carbohydrate transport and metabolism].
Changed!
PRK xylE 301aa 3e-07
in ref transcript
D-xylose transporter XylE; Provisional.
Changed!
cd MFS 124aa 1e-05
in modified transcript
Changed!
TIGR 2A0119 454aa 1e-166
in modified transcript
Changed!
PRK xylE 241aa 1e-07
in modified transcript
SLC22A12
refseq_SLC22A12.F2 refseq_SLC22A12.R2 197 352
NCBIGene 36.3 116085
Single exon skipping, size difference: 155
Exclusion in the protein causing a frameshift
Reference transcript: NM_144585
Changed!
cd MFS 158aa 1e-13
in ref transcript
The Major Facilitator Superfamily (MFS) is a large and diverse group of secondary transporters that includes uniporters, symporters, and antiporters. MFS proteins facilitate the transport across cytoplasmic or internal membranes of a variety of substrates including ions, sugar phosphates, drugs, neurotransmitters, nucleosides, amino acids, and peptides. They do so using the electrochemical potential of the transported substrates. Uniporters transport a single substrate, while symporters and antiporters transport two substrates in the same or in opposite directions, respectively, across membranes. MFS proteins are typically 400 to 600 amino acids in length, and the majority contain 12 transmembrane alpha helices (TMs) connected by hydrophilic loops. The N- and C-terminal halves of these proteins display weak similarity and may be the result of a gene duplication/fusion event. Based on kinetic studies and the structures of a few bacterial superfamily members, GlpT (glycerol-3-phosphate transporter), LacY (lactose permease), and EmrD (multidrug transporter), MFS proteins are thought to function through a single substrate binding site, alternating-access mechanism involving a rocker-switch type of movement. Bacterial members function primarily for nutrient uptake, and as drug-efflux pumps to confer antibiotic resistance. Some MFS proteins have medical significance in humans such as the glucose transporter Glut4, which is impaired in type II diabetes, and glucose-6-phosphate transporter (G6PT), which causes glycogen storage disease when mutated.
Changed!
cd MFS 141aa 0.004
in ref transcript
Changed!
TIGR 2A0119 516aa 1e-103
in ref transcript
Changed!
COG AraJ 93aa 6e-05
in ref transcript
Arabinose efflux permease [Carbohydrate transport and metabolism].
Changed!
TIGR 2A0119 159aa 2e-22
in modified transcript
SLC22A17
refseq_SLC22A17.F1 refseq_SLC22A17.R1 306 360
NCBIGene 36.3 51310
Alternative 5-prime, size difference: 54
Exclusion in the protein (no frameshift)
Reference transcript: NM_020372
Changed!
cd MFS 75aa 0.004
in ref transcript
The Major Facilitator Superfamily (MFS) is a large and diverse group of secondary transporters that includes uniporters, symporters, and antiporters. MFS proteins facilitate the transport across cytoplasmic or internal membranes of a variety of substrates including ions, sugar phosphates, drugs, neurotransmitters, nucleosides, amino acids, and peptides. They do so using the electrochemical potential of the transported substrates. Uniporters transport a single substrate, while symporters and antiporters transport two substrates in the same or in opposite directions, respectively, across membranes. MFS proteins are typically 400 to 600 amino acids in length, and the majority contain 12 transmembrane alpha helices (TMs) connected by hydrophilic loops. The N- and C-terminal halves of these proteins display weak similarity and may be the result of a gene duplication/fusion event. Based on kinetic studies and the structures of a few bacterial superfamily members, GlpT (glycerol-3-phosphate transporter), LacY (lactose permease), and EmrD (multidrug transporter), MFS proteins are thought to function through a single substrate binding site, alternating-access mechanism involving a rocker-switch type of movement. Bacterial members function primarily for nutrient uptake, and as drug-efflux pumps to confer antibiotic resistance. Some MFS proteins have medical significance in humans such as the glucose transporter Glut4, which is impaired in type II diabetes, and glucose-6-phosphate transporter (G6PT), which causes glycogen storage disease when mutated.
Changed!
TIGR 2A0119 374aa 2e-21
in ref transcript
Changed!
PRK xylE 158aa 4e-06
in ref transcript
D-xylose transporter XylE; Provisional.
Changed!
cd MFS 118aa 2e-04
in modified transcript
Changed!
TIGR 2A0119 356aa 1e-23
in modified transcript
Changed!
PRK xylE 213aa 7e-08
in modified transcript
SLC22A6
refseq_SLC22A6.F1 refseq_SLC22A6.R1 121 253
NCBIGene 36.3 9356
Alternative 3-prime, size difference: 132
Exclusion in the protein (no frameshift)
Reference transcript: NM_004790
cd MFS 123aa 6e-14
in ref transcript
The Major Facilitator Superfamily (MFS) is a large and diverse group of secondary transporters that includes uniporters, symporters, and antiporters. MFS proteins facilitate the transport across cytoplasmic or internal membranes of a variety of substrates including ions, sugar phosphates, drugs, neurotransmitters, nucleosides, amino acids, and peptides. They do so using the electrochemical potential of the transported substrates. Uniporters transport a single substrate, while symporters and antiporters transport two substrates in the same or in opposite directions, respectively, across membranes. MFS proteins are typically 400 to 600 amino acids in length, and the majority contain 12 transmembrane alpha helices (TMs) connected by hydrophilic loops. The N- and C-terminal halves of these proteins display weak similarity and may be the result of a gene duplication/fusion event. Based on kinetic studies and the structures of a few bacterial superfamily members, GlpT (glycerol-3-phosphate transporter), LacY (lactose permease), and EmrD (multidrug transporter), MFS proteins are thought to function through a single substrate binding site, alternating-access mechanism involving a rocker-switch type of movement. Bacterial members function primarily for nutrient uptake, and as drug-efflux pumps to confer antibiotic resistance. Some MFS proteins have medical significance in humans such as the glucose transporter Glut4, which is impaired in type II diabetes, and glucose-6-phosphate transporter (G6PT), which causes glycogen storage disease when mutated.
Changed!
cd MFS 171aa 2e-09
in ref transcript
Changed!
TIGR 2A0119 505aa 0.0
in ref transcript
Changed!
PRK xylE 429aa 3e-08
in ref transcript
D-xylose transporter XylE; Provisional.
Changed!
cd MFS 122aa 0.003
in modified transcript
Changed!
TIGR 2A0119 461aa 1e-172
in modified transcript
Changed!
COG AraJ 67aa 3e-07
in modified transcript
Arabinose efflux permease [Carbohydrate transport and metabolism].
Changed!
PRK PRK03893 128aa 1e-05
in modified transcript
putative sialic acid transporter; Provisional.
Changed!
PRK xylE 353aa 5e-05
in modified transcript
SLC22A6
refseq_SLC22A6.F2 refseq_SLC22A6.R2 149 188
NCBIGene 36.3 9356
Alternative 5-prime, size difference: 39
Exclusion in the protein (no frameshift)
Reference transcript: NM_004790
cd MFS 123aa 6e-14
in ref transcript
The Major Facilitator Superfamily (MFS) is a large and diverse group of secondary transporters that includes uniporters, symporters, and antiporters. MFS proteins facilitate the transport across cytoplasmic or internal membranes of a variety of substrates including ions, sugar phosphates, drugs, neurotransmitters, nucleosides, amino acids, and peptides. They do so using the electrochemical potential of the transported substrates. Uniporters transport a single substrate, while symporters and antiporters transport two substrates in the same or in opposite directions, respectively, across membranes. MFS proteins are typically 400 to 600 amino acids in length, and the majority contain 12 transmembrane alpha helices (TMs) connected by hydrophilic loops. The N- and C-terminal halves of these proteins display weak similarity and may be the result of a gene duplication/fusion event. Based on kinetic studies and the structures of a few bacterial superfamily members, GlpT (glycerol-3-phosphate transporter), LacY (lactose permease), and EmrD (multidrug transporter), MFS proteins are thought to function through a single substrate binding site, alternating-access mechanism involving a rocker-switch type of movement. Bacterial members function primarily for nutrient uptake, and as drug-efflux pumps to confer antibiotic resistance. Some MFS proteins have medical significance in humans such as the glucose transporter Glut4, which is impaired in type II diabetes, and glucose-6-phosphate transporter (G6PT), which causes glycogen storage disease when mutated.
cd MFS 171aa 2e-09
in ref transcript
TIGR 2A0119 505aa 0.0
in ref transcript
Changed!
PRK xylE 429aa 3e-08
in ref transcript
D-xylose transporter XylE; Provisional.
Changed!
PRK xylE 430aa 1e-07
in modified transcript
SLC24A4
refseq_SLC24A4.F1 refseq_SLC24A4.R1 100 157
NCBIGene 36.3 123041
Alternative 5-prime, size difference: 57
Exclusion in the protein (no frameshift)
Reference transcript: NM_153646
TIGR 2A1904 173aa 3e-59
in ref transcript
TIGR 2A1904 193aa 2e-45
in ref transcript
COG ECM27 132aa 1e-15
in ref transcript
Ca2+/Na+ antiporter [Inorganic ion transport and metabolism].
COG ECM27 151aa 2e-15
in ref transcript
SLC25A25
refseq_SLC25A25.F2 refseq_SLC25A25.R2 183 226
NCBIGene 36.3 114789
Alternative 5-prime, size difference: 43
Inclusion in the protein causing a frameshift
Reference transcript: NM_052901
Changed!
cd EFh 60aa 4e-09
in ref transcript
EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
Changed!
pfam Mito_carr 93aa 7e-24
in ref transcript
Mitochondrial carrier protein.
Changed!
pfam Mito_carr 89aa 2e-23
in ref transcript
Changed!
pfam Mito_carr 92aa 1e-16
in ref transcript
Changed!
PTZ PTZ00169 269aa 1e-29
in ref transcript
ADP/ATP transporter on adenylate translocase; Provisional.
Changed!
COG FRQ1 113aa 9e-11
in ref transcript
Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].
SLC25A26
refseq_SLC25A26.F1 refseq_SLC25A26.R1 349 434
NCBIGene 36.2 115286
Single exon skipping, size difference: 85
Inclusion in the protein causing a new stop codon
Reference transcript: NM_173471
Changed!
pfam Mito_carr 83aa 4e-11
in ref transcript
Mitochondrial carrier protein.
Changed!
pfam Mito_carr 89aa 2e-10
in ref transcript
Changed!
PTZ PTZ00168 168aa 1e-17
in ref transcript
mitochondrial carrier protein; Provisional.
Changed!
pfam Mito_carr 61aa 8e-08
in modified transcript
Changed!
PTZ PTZ00168 61aa 4e-06
in modified transcript
SLC25A29
refseq_SLC25A29.F2 refseq_SLC25A29.R2 246 290
NCBIGene 36.2 123096
Single exon skipping, size difference: 44
Inclusion in the protein causing a new stop codon
Reference transcript: NM_152333
SLC25A3
refseq_SLC25A3.F1 refseq_SLC25A3.R1 100 391
NCBIGene 36.2 5250
Alternative 5-prime, size difference: 291
Inclusion in the protein causing a new stop codon
Reference transcript: NM_005888
Changed!
pfam Mito_carr 75aa 2e-16
in ref transcript
Mitochondrial carrier protein.
Changed!
pfam Mito_carr 83aa 6e-16
in ref transcript
Changed!
pfam Mito_carr 78aa 2e-06
in ref transcript
Changed!
PTZ PTZ00169 259aa 4e-08
in ref transcript
ADP/ATP transporter on adenylate translocase; Provisional.
SLC26A1
refseq_SLC26A1.F2 refseq_SLC26A1.R2 104 319
NCBIGene 36.3 10861
Single exon skipping, size difference: 215
Exclusion in 5'UTR
Reference transcript: NM_213613
cd STAS_SulP_like_sulfate_transporter 142aa 1e-14
in ref transcript
Sulphate Transporter and Anti-Sigma factor antagonist domain of SulP-like sulfate transporters, plays a role in the function and regulation of the transport activity, proposed general NTP binding function. The SulP family is a large and diverse family of anion transporters, with members from eubacteria, plants, fungi, and mammals. They contain 10 to 14 transmembrane helices which form the catalytic core of the protein and a C-terminal extension, the STAS (Sulphate Transporter and AntiSigma factor antagonist) domain which plays a role in the function and regulation of the transport activity. The STAS domain is found in the C-terminal region of sulphate transporters and bacterial anti-sigma factor antagonists. It has been suggested that this domain may have a general NTP binding function.
Sulfate permease and related transporters (MFS superfamily) [Inorganic ion transport and metabolism].
SLC26A10
refseq_SLC26A10.F1 refseq_SLC26A10.R1 305 400
NCBIGene 36.2 65012
Single exon skipping, size difference: 107
Inclusion in the protein causing a frameshift
Reference transcript: NM_133489
Changed!
cd STAS_SulP_like_sulfate_transporter 123aa 4e-08
in ref transcript
Sulphate Transporter and Anti-Sigma factor antagonist domain of SulP-like sulfate transporters, plays a role in the function and regulation of the transport activity, proposed general NTP binding function. The SulP family is a large and diverse family of anion transporters, with members from eubacteria, plants, fungi, and mammals. They contain 10 to 14 transmembrane helices which form the catalytic core of the protein and a C-terminal extension, the STAS (Sulphate Transporter and AntiSigma factor antagonist) domain which plays a role in the function and regulation of the transport activity. The STAS domain is found in the C-terminal region of sulphate transporters and bacterial anti-sigma factor antagonists. It has been suggested that this domain may have a general NTP binding function.
Sulfate permease and related transporters (MFS superfamily) [Inorganic ion transport and metabolism].
Changed!
TIGR sulP 395aa 1e-46
in modified transcript
Changed!
COG SUL1 397aa 1e-29
in modified transcript
SLC26A10
refseq_SLC26A10.F3 refseq_SLC26A10.R4 155 262
NCBIGene 36.2 65012
Single exon skipping, size difference: 107
Inclusion in the protein causing a frameshift
Reference transcript: NM_133489
Changed!
cd STAS_SulP_like_sulfate_transporter 123aa 4e-08
in ref transcript
Sulphate Transporter and Anti-Sigma factor antagonist domain of SulP-like sulfate transporters, plays a role in the function and regulation of the transport activity, proposed general NTP binding function. The SulP family is a large and diverse family of anion transporters, with members from eubacteria, plants, fungi, and mammals. They contain 10 to 14 transmembrane helices which form the catalytic core of the protein and a C-terminal extension, the STAS (Sulphate Transporter and AntiSigma factor antagonist) domain which plays a role in the function and regulation of the transport activity. The STAS domain is found in the C-terminal region of sulphate transporters and bacterial anti-sigma factor antagonists. It has been suggested that this domain may have a general NTP binding function.
Sulfate permease and related transporters (MFS superfamily) [Inorganic ion transport and metabolism].
Changed!
TIGR sulP 395aa 1e-46
in modified transcript
Changed!
COG SUL1 397aa 1e-29
in modified transcript
SLC26A8
refseq_SLC26A8.F1 refseq_SLC26A8.R1 101 416
NCBIGene 36.3 116369
Multiple exon skipping, size difference: 315
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_052961
cd STAS_SulP_like_sulfate_transporter 58aa 1e-08
in ref transcript
Sulphate Transporter and Anti-Sigma factor antagonist domain of SulP-like sulfate transporters, plays a role in the function and regulation of the transport activity, proposed general NTP binding function. The SulP family is a large and diverse family of anion transporters, with members from eubacteria, plants, fungi, and mammals. They contain 10 to 14 transmembrane helices which form the catalytic core of the protein and a C-terminal extension, the STAS (Sulphate Transporter and AntiSigma factor antagonist) domain which plays a role in the function and regulation of the transport activity. The STAS domain is found in the C-terminal region of sulphate transporters and bacterial anti-sigma factor antagonists. It has been suggested that this domain may have a general NTP binding function.
STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP binding function.
Changed!
COG SUL1 504aa 3e-53
in ref transcript
Sulfate permease and related transporters (MFS superfamily) [Inorganic ion transport and metabolism].
COG SpoIIAA 61aa 0.009
in ref transcript
Anti-anti-sigma regulatory factor (antagonist of anti-sigma factor) [Signal transduction mechanisms].
Changed!
TIGR sulP 288aa 2e-41
in modified transcript
Changed!
TIGR sulP 134aa 4e-14
in modified transcript
Changed!
COG SUL1 399aa 8e-32
in modified transcript
SLC2A11
refseq_SLC2A11.F1 refseq_SLC2A11.R1 119 230
NCBIGene 36.3 66035
Single exon skipping, size difference: 111
Exclusion of the protein initiation site
Reference transcript: NM_030807
cd MFS 119aa 1e-04
in ref transcript
The Major Facilitator Superfamily (MFS) is a large and diverse group of secondary transporters that includes uniporters, symporters, and antiporters. MFS proteins facilitate the transport across cytoplasmic or internal membranes of a variety of substrates including ions, sugar phosphates, drugs, neurotransmitters, nucleosides, amino acids, and peptides. They do so using the electrochemical potential of the transported substrates. Uniporters transport a single substrate, while symporters and antiporters transport two substrates in the same or in opposite directions, respectively, across membranes. MFS proteins are typically 400 to 600 amino acids in length, and the majority contain 12 transmembrane alpha helices (TMs) connected by hydrophilic loops. The N- and C-terminal halves of these proteins display weak similarity and may be the result of a gene duplication/fusion event. Based on kinetic studies and the structures of a few bacterial superfamily members, GlpT (glycerol-3-phosphate transporter), LacY (lactose permease), and EmrD (multidrug transporter), MFS proteins are thought to function through a single substrate binding site, alternating-access mechanism involving a rocker-switch type of movement. Bacterial members function primarily for nutrient uptake, and as drug-efflux pumps to confer antibiotic resistance. Some MFS proteins have medical significance in humans such as the glucose transporter Glut4, which is impaired in type II diabetes, and glucose-6-phosphate transporter (G6PT), which causes glycogen storage disease when mutated.
cd MFS 181aa 0.002
in ref transcript
Changed!
pfam Sugar_tr 454aa 2e-63
in ref transcript
Sugar (and other) transporter.
PRK xylE 378aa 6e-19
in ref transcript
D-xylose transporter XylE; Provisional.
Changed!
pfam Sugar_tr 445aa 3e-62
in modified transcript
SLC30A10
refseq_SLC30A10.F1 refseq_SLC30A10.R1 104 142
NCBIGene 36.2 55532
Alternative 3-prime, size difference: 38
Inclusion in the protein causing a new stop codon
Reference transcript: NM_018713
Changed!
TIGR CDF 153aa 2e-28
in ref transcript
This model describes a broadly distributed family of transporters, a number of which have been shown to transport divalent cations of cobalt, cadmium and/or zinc. The family has six predicted transmembrane domains. Members of the family are variable in length because of variably sized inserts, often containing low-complexity sequence.
TIGR CDF 94aa 1e-18
in ref transcript
Changed!
COG CzcD 152aa 9e-30
in ref transcript
Co/Zn/Cd efflux system component [Inorganic ion transport and metabolism].
COG CzcD 102aa 5e-23
in ref transcript
SLC30A2
refseq_SLC30A2.F1 refseq_SLC30A2.R1 143 290
NCBIGene 36.3 7780
Single exon skipping, size difference: 147
Exclusion in the protein (no frameshift)
Reference transcript: NM_001004434
Changed!
TIGR CDF 275aa 2e-69
in ref transcript
This model describes a broadly distributed family of transporters, a number of which have been shown to transport divalent cations of cobalt, cadmium and/or zinc. The family has six predicted transmembrane domains. Members of the family are variable in length because of variably sized inserts, often containing low-complexity sequence.
Changed!
COG CzcD 304aa 4e-53
in ref transcript
Co/Zn/Cd efflux system component [Inorganic ion transport and metabolism].
Changed!
TIGR CDF 218aa 2e-46
in modified transcript
Changed!
COG CzcD 216aa 2e-32
in modified transcript
SLC37A3
refseq_SLC37A3.F1 refseq_SLC37A3.R1 101 403
NCBIGene 36.3 84255
Multiple exon skipping, size difference: 302
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift), Exclusion in the protein causing a frameshift
Reference transcript: NM_207113
cd MFS 144aa 6e-13
in ref transcript
The Major Facilitator Superfamily (MFS) is a large and diverse group of secondary transporters that includes uniporters, symporters, and antiporters. MFS proteins facilitate the transport across cytoplasmic or internal membranes of a variety of substrates including ions, sugar phosphates, drugs, neurotransmitters, nucleosides, amino acids, and peptides. They do so using the electrochemical potential of the transported substrates. Uniporters transport a single substrate, while symporters and antiporters transport two substrates in the same or in opposite directions, respectively, across membranes. MFS proteins are typically 400 to 600 amino acids in length, and the majority contain 12 transmembrane alpha helices (TMs) connected by hydrophilic loops. The N- and C-terminal halves of these proteins display weak similarity and may be the result of a gene duplication/fusion event. Based on kinetic studies and the structures of a few bacterial superfamily members, GlpT (glycerol-3-phosphate transporter), LacY (lactose permease), and EmrD (multidrug transporter), MFS proteins are thought to function through a single substrate binding site, alternating-access mechanism involving a rocker-switch type of movement. Bacterial members function primarily for nutrient uptake, and as drug-efflux pumps to confer antibiotic resistance. Some MFS proteins have medical significance in humans such as the glucose transporter Glut4, which is impaired in type II diabetes, and glucose-6-phosphate transporter (G6PT), which causes glycogen storage disease when mutated.
Changed!
cd MFS 185aa 3e-09
in ref transcript
Changed!
TIGR 2A0104 371aa 3e-24
in ref transcript
Changed!
COG UhpC 466aa 3e-42
in ref transcript
Sugar phosphate permease [Carbohydrate transport and metabolism].
Changed!
TIGR 2A0104 258aa 2e-17
in modified transcript
Changed!
COG UhpC 328aa 4e-29
in modified transcript
SLC3A2
refseq_SLC3A2.F1 refseq_SLC3A2.R1 268 361
NCBIGene 36.3 6520
Single exon skipping, size difference: 93
Exclusion in the protein (no frameshift)
Reference transcript: NM_001012661
TIGR trehalose_treC 136aa 4e-16
in ref transcript
Trehalose is a glucose disaccharide that serves in many biological systems as a compatible solute for protection against hyperosmotic and thermal stress. This family describes trehalose-6-phosphate hydrolase, product of the treC (or treA) gene, which is often found together with a trehalose uptake transporter and a trehalose operon repressor.
pfam Alpha-amylase 244aa 8e-14
in ref transcript
Alpha amylase, catalytic domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding between beta strand 3 and alpha helix 3, and a carboxyl-terminal Greek key beta-barrel domain.
TIGR treS_nterm 120aa 3e-08
in ref transcript
Trehalose synthase interconverts maltose and alpha, alpha-trehalose by transglucosylation. This is one of at least three mechanisms for biosynthesis of trehalose, an important and widespread compatible solute. However, it is not driven by phosphate activation of sugars and its physiological role may tend toward trehalose degradation. This view is accentuated by numerous examples of fusion to a probable maltokinase domain. The sequence region described by this model is found both as the whole of a trehalose synthase and as the N-terminal region of a larger fusion protein that includes trehalose synthase activity. Several of these fused trehalose synthases have a domain homologous to proteins with maltokinase activity from Actinoplanes missouriensis and Streptomyces coelicolor (PMID:15378530).
COG AmyA 340aa 3e-18
in ref transcript
Glycosidases [Carbohydrate transport and metabolism].
SLC41A3
refseq_SLC41A3.F1 refseq_SLC41A3.R1 287 395
NCBIGene 36.3 54946
Single exon skipping, size difference: 108
Exclusion in the protein (no frameshift)
Reference transcript: NM_001008485
Changed!
pfam MgtE 136aa 2e-14
in ref transcript
Divalent cation transporter. This region is the integral membrane part of the eubacterial MgtE family of magnesium transporters. Related regions are found also in archaebacterial and eukaryotic proteins. All the archaebacterial and eukaryotic examples have two copies of the region. This suggests that the eubacterial examples may act as dimers. Members of this family probably transport Mg2+ or other divalent cations into the cell. The alignment contains two highly conserved aspartates that may be involved in cation binding (Bateman A unpubl.).
pfam MgtE 132aa 0.007
in ref transcript
COG COG1824 170aa 1e-05
in ref transcript
Permease, similar to cation transporters [Inorganic ion transport and metabolism].
Changed!
COG COG1824 171aa 1e-05
in ref transcript
Changed!
pfam MgtE 65aa 3e-07
in modified transcript
Changed!
COG COG1824 55aa 0.010
in modified transcript
SLC4A3
refseq_SLC4A3.F1 refseq_SLC4A3.R1 189 270
NCBIGene 36.3 6508
Alternative 3-prime, size difference: 81
Exclusion in the protein (no frameshift)
Reference transcript: NM_201574
TIGR ae 910aa 0.0
in ref transcript
They preferentially catalyze anion exchange (antiport) reactions, typically acting as HCO3-:Cl- antiporters, but also transporting a range of other inorganic and organic anions. Additionally, renal Na+:HCO3- cotransporters have been found to be members of the AE family. They catalyze the reabsorption of HCO3- in the renal proximal tubule.
SLC4A5
refseq_SLC4A5.F2 refseq_SLC4A5.R2 229 277
NCBIGene 36.3 57835
Single exon skipping, size difference: 48
Exclusion in the protein (no frameshift)
Reference transcript: NM_021196
Changed!
TIGR ae 957aa 0.0
in ref transcript
They preferentially catalyze anion exchange (antiport) reactions, typically acting as HCO3-:Cl- antiporters, but also transporting a range of other inorganic and organic anions. Additionally, renal Na+:HCO3- cotransporters have been found to be members of the AE family. They catalyze the reabsorption of HCO3- in the renal proximal tubule.
Changed!
TIGR ae 941aa 0.0
in modified transcript
SLC6A20
refseq_SLC6A20.F1 refseq_SLC6A20.R1 243 354
NCBIGene 36.3 54716
Single exon skipping, size difference: 111
Exclusion in the protein (no frameshift)
Reference transcript: NM_020208
Changed!
pfam SNF 576aa 1e-123
in ref transcript
Sodium:neurotransmitter symporter family.
Changed!
COG COG0733 300aa 3e-35
in ref transcript
Na+-dependent transporters of the SNF family [General function prediction only].
COG COG0733 164aa 6e-08
in ref transcript
Changed!
pfam SNF 539aa 1e-102
in modified transcript
Changed!
COG COG0733 117aa 2e-20
in modified transcript
Changed!
COG COG0733 68aa 2e-07
in modified transcript
SLC6A9
refseq_SLC6A9.F2 refseq_SLC6A9.R2 241 403
NCBIGene 36.3 6536
Single exon skipping, size difference: 162
Exclusion in the protein (no frameshift)
Reference transcript: NM_201649
pfam SNF 551aa 0.0
in ref transcript
Sodium:neurotransmitter symporter family.
COG COG0733 495aa 1e-55
in ref transcript
Na+-dependent transporters of the SNF family [General function prediction only].
SMARCA1
refseq_SMARCA1.F1 refseq_SMARCA1.R1 157 220
NCBIGene 36.2 6594
Alternative 5-prime and 3-prime, size difference: 63
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_003069
cd HELICc 144aa 1e-21
in ref transcript
Helicase superfamily c-terminal domain; associated with DEXDc-, DEAD-, and DEAH-box proteins, yeast initiation factor 4A, Ski2p, and Hepatitis C virus NS3 helicases; this domain is found in a wide variety of helicases and helicase related proteins; may not be an autonomously folding unit, but an integral part of the helicase; 4 helicase superfamilies at present according to the organization of their signature motifs; all helicases share the ability to unwind nucleic acid duplexes with a distinct directional polarity; they utilize the free energy from nucleoside triphosphate hydrolysis to fuel their translocation along DNA, unwinding the duplex in the process.
Changed!
cd DEXDc 140aa 4e-19
in ref transcript
DEAD-like helicases superfamily. A diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.
cd SANT 40aa 6e-04
in ref transcript
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.
Changed!
pfam SNF2_N 271aa 1e-108
in ref transcript
SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin unwinding (e.g., ISWI) as well as a variety of other proteins with little functional information (e.g., lodestar, ETL1).
pfam SLIDE 110aa 4e-42
in ref transcript
SLIDE. The SLIDE domain adopts a secondary structure comprising a main core of three alpha-helices. It has a role in DNA binding, contacting DNA target sites similar to c-Myb (pfam00249) repeats or homeodomains.
pfam HAND 81aa 3e-26
in ref transcript
HAND. The HAND domain adopts a secondary structure consisting of four alpha helices, three of which (H2, H3, H4) form an L-like configuration. Helix H2 runs antiparallel to helices H3 and H4, packing closely against helix H4, whilst helix H1 reposes in the concave surface formed by these three helices and runs perpendicular to them. The domain confers DNA and nucleosome binding properties to the protein.
smart HELICc 97aa 1e-18
in ref transcript
helicase superfamily c-terminal domain.
smart SANT 42aa 9e-05
in ref transcript
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains.
Changed!
COG HepA 493aa 3e-86
in ref transcript
Superfamily II DNA/RNA helicases, SNF2 family [Transcription / DNA replication, recombination, and repair].
Changed!
cd DEXDc 116aa 1e-12
in modified transcript
Changed!
pfam SNF2_N 250aa 2e-93
in modified transcript
Changed!
COG HepA 472aa 3e-76
in modified transcript
SMARCA2
refseq_SMARCA2.F1 refseq_SMARCA2.R1 238 292
NCBIGene 36.3 6595
Single exon skipping, size difference: 54
Exclusion in the protein (no frameshift)
Reference transcript: NM_003070
Changed!
cd Bromo_SNF2L2 125aa 6e-47
in ref transcript
Bromodomain, SNF2L2-like subfamily, specific to animals. SNF2L2 (SNF2-alpha) or SWI/SNF-related matrix-associated actin-dependent regulator of chromatin subfamily A member 2 is a global transcriptional activator, which cooperates with nuclear hormone receptors to boost transcriptional activation. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
cd HELICc 133aa 2e-20
in ref transcript
Helicase superfamily c-terminal domain; associated with DEXDc-, DEAD-, and DEAH-box proteins, yeast initiation factor 4A, Ski2p, and Hepatitis C virus NS3 helicases; this domain is found in a wide variety of helicases and helicase related proteins; may not be an autonomously folding unit, but an integral part of the helicase; 4 helicase superfamilies at present according to the organization of their signature motifs; all helicases share the ability to unwind nucleic acid duplexes with a distinct directional polarity; they utilize the free energy from nucleoside triphosphate hydrolysis to fuel their translocation along DNA, unwinding the duplex in the process.
cd DEXDc 140aa 3e-18
in ref transcript
DEAD-like helicases superfamily. A diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.
pfam SNF2_N 286aa 3e-95
in ref transcript
SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin unwinding (e.g., ISWI) as well as a variety of other proteins with little functional information (e.g., lodestar, ETL1).
Changed!
smart BROMO 126aa 8e-24
in ref transcript
bromo domain.
pfam Helicase_C 81aa 5e-18
in ref transcript
Helicase conserved C-terminal domain. The Prosite family is restricted to DEAD/H helicases, whereas this domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase.
smart HSA 58aa 5e-13
in ref transcript
domain in helicases and associated with SANT domains.
pfam BRK 45aa 4e-12
in ref transcript
BRK domain. The function of this domain is unknown. It is often found associated with helicases and transcription factors.
pfam QLQ 37aa 4e-10
in ref transcript
QLQ. The QLQ domain is named after the conserved Gln, Leu, Gln motif. The QLQ domain is found at the N-terminus of SWI2/SNF2 protein, which has been shown to be involved in protein-protein interactions. This domain has thus been postulated to be involved in mediating protein interactions.
COG HepA 502aa 2e-82
in ref transcript
Superfamily II DNA/RNA helicases, SNF2 family [Transcription / DNA replication, recombination, and repair].
Changed!
COG COG5076 91aa 4e-12
in ref transcript
Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription].
Changed!
cd Bromo_SNF2L2 107aa 3e-50
in modified transcript
Changed!
smart BROMO 108aa 4e-26
in modified transcript
Changed!
COG COG5076 122aa 6e-14
in modified transcript
SMARCB1
refseq_SMARCB1.F2 refseq_SMARCB1.R2 133 160
NCBIGene 36.3 6598
Alternative 5-prime, size difference: 27
Exclusion in the protein (no frameshift)
Reference transcript: NM_003073
pfam SNF5 219aa 5e-93
in ref transcript
SNF5 / SMARCB1 / INI1. SNF5 is a component of the yeast SWI/SNF complex, which is an ATP-dependent nucleosome-remodelling complex that regulates the transcription of a subset of yeast genes. SNF5 is a key component of all SWI/SNF-class complexes characterised so far. This family consists of the conserved region of SNF5, including a direct repeat motif. SNF5 is essential for the assembly promoter targeting and chromatin remodelling activity of the SWI-SNF complex. SNF5 is also known as SMARCB1, for SWI/SNF-related, matrix-associated, actin-dependent regulator of chromatin, subfamily b, member 1, and also INI1 for integrase interactor 1. Loss-of function mutations in SNF5 are thought to contribute to oncogenesis in malignant rhabdoid tumours (MRTs).
SMARCC2
refseq_SMARCC2.F1 refseq_SMARCC2.R1 137 230
NCBIGene 36.3 6601
Single exon skipping, size difference: 93
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_003075
pfam SWIRM 85aa 3e-29
in ref transcript
SWIRM domain. This SWIRM domain is a small alpha-helical domain of about 85 amino acid residues found in chromosomal proteins. It contains a helix-turn helix motif and binds to DNA.
AAA ATPase containing von Willebrand factor type A (vWA) domain [General function prediction only].
Changed!
COG RSC8 381aa 2e-67
in modified transcript
SMARCC2
refseq_SMARCC2.F3 refseq_SMARCC2.R3 112 457
NCBIGene 36.3 6601
Alternative 5-prime, size difference: 345
Exclusion in the protein (no frameshift)
Reference transcript: NM_003075
pfam SWIRM 85aa 3e-29
in ref transcript
SWIRM domain. This SWIRM domain is a small alpha-helical domain of about 85 amino acid residues found in chromosomal proteins. It contains a helix-turn helix motif and binds to DNA.
AAA ATPase containing von Willebrand factor type A (vWA) domain [General function prediction only].
SMARCD1
refseq_SMARCD1.F1 refseq_SMARCD1.R1 199 322
NCBIGene 36.3 6602
Single exon skipping, size difference: 123
Exclusion in the protein (no frameshift)
Reference transcript: NM_003076
smart SWIB 77aa 3e-24
in ref transcript
SWI complex, BAF60b domains.
Changed!
pfam Bindin 60aa 0.009
in ref transcript
Bindin.
COG COG5531 204aa 4e-15
in ref transcript
SWIB-domain-containing proteins implicated in chromatin remodeling [Chromatin structure and dynamics].
SMC4
refseq_SMC4.F2 refseq_SMC4.R2 220 295
NCBIGene 36.2 10051
Alternative 3-prime, size difference: 75
Exclusion in the protein (no frameshift)
Reference transcript: NM_001002800
cd ABC_SMC4_euk 160aa 1e-51
in ref transcript
Eukaryotic SMC4 proteins; SMC proteins are large (approximately 110 to 170 kDa), and each is arranged into five recognizable domains. Amino-acid sequence homology of SMC proteins between species is largely confined to the amino- and carboxy-terminal globular domains. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the single-letter amino-acid code), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif, and a motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. In eukaryotic cells, the proteins are found as heterodimers of SMC1 paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6 (formerly known as Rad18).
cd ABC_SMC4_euk 95aa 9e-51
in ref transcript
pfam SMC_N 1192aa 0.0
in ref transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
COG Smc 1201aa 1e-123
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
SMG7
refseq_SMG7.F1 refseq_SMG7.R1 256 394
NCBIGene 36.3 9887
Alternative 5-prime, size difference: 138
Exclusion in the protein (no frameshift)
Reference transcript: NM_173156
cd TPR 79aa 0.003
in ref transcript
Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
pfam EST1 119aa 6e-25
in ref transcript
Telomerase activating protein Est1. Est1 is a protein which recruits or activates telomerase at the site of polymerisation.
SMG7
refseq_SMG7.F3 refseq_SMG7.R3 101 133
NCBIGene 36.3 9887
Single exon skipping, size difference: 32
Exclusion in the protein causing a frameshift
Reference transcript: NM_173156
Changed!
cd TPR 79aa 0.003
in ref transcript
Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
Changed!
pfam EST1 119aa 6e-25
in ref transcript
Telomerase activating protein Est1. Est1 is a protein which recruits or activates telomerase at the site of polymerisation.
SMN2
refseq_SMN2.F1 refseq_SMN2.R1 132 186
NCBIGene 36.3 6607
Single exon skipping, size difference: 54
Exclusion of the stop codon
Reference transcript: NM_017411
cd TUDOR 48aa 2e-08
in ref transcript
Tudor domains are found in many eukaryotic organisms and have been implicated in protein-protein interactions in which methylated protein substrates bind to these domains. For example, the Tudor domain of Survival of Motor Neuron (SMN) binds to symmetrically dimethylated arginines of arginine-glycine (RG) rich sequences found in the C-terminal tails of Sm proteins. The SMN protein is linked to spinal muscular atrophy. Another example is the tandem tudor domains of 53BP1, which bind to histone H4 specifically dimethylated at Lys20 (H4-K20me2). 53BP1 is a key transducer of the DNA damage checkpoint signal.
pfam SMN 166aa 3e-40
in ref transcript
Survival motor neuron protein (SMN). This family consists of several eukaryotic survival motor neuron (SMN) proteins. The Survival of Motor Neurons (SMN) protein, the product of the spinal muscular atrophy-determining gene, is part of a large macromolecular complex (SMN complex) that functions in the assembly of spliceosomal small nuclear ribonucleoproteins (snRNPs). The SMN complex functions as a specificity factor essential for the efficient assembly of Sm proteins on U snRNAs and likely protects cells from illicit, and potentially deleterious, non-specific binding of Sm proteins to RNAs.
Changed!
pfam SMN 35aa 2e-11
in ref transcript
Changed!
pfam SMN 27aa 2e-08
in modified transcript
SMOX
refseq_SMOX.F1 refseq_SMOX.R1 263 353
NCBIGene 36.3 54498
Single exon skipping, size difference: 90
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_175839
Changed!
pfam Amino_oxidase 500aa 1e-53
in ref transcript
Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. The family also contains phytoene dehydrogenases and related enzymes. In vertebrates MAO plays an important role regulating the intracellular levels of amines via there oxidation; these include various neurotransmitters, neurotoxins and trace amines. In lower eukaryotes such as aspergillus and in bacteria the main role of amine oxidases is to provide a source of ammonium. PAOs in plants, bacteria and protozoa oxidase spermidine and spermine to an aminobutyral, diaminopropane and hydrogen peroxide and are involved in the catabolism of polyamines. Other members of this family include tryptophan 2-monooxygenase, putrescine oxidase, corticosteroid binding proteins and antibacterial glycoproteins.
Changed!
COG COG1231 241aa 1e-08
in ref transcript
Monoamine oxidase [Amino acid transport and metabolism].
COG COG1233 72aa 3e-04
in ref transcript
Phytoene dehydrogenase and related proteins [Secondary metabolites biosynthesis, transport, and catabolism].
Changed!
pfam Amino_oxidase 530aa 9e-51
in modified transcript
Changed!
COG COG1231 181aa 5e-06
in modified transcript
SMPD1
refseq_SMPD1.F2 refseq_SMPD1.R2 217 349
NCBIGene 36.3 6609
Exon skipping and alternative 3-prime or 5-prime, size difference: 132
Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_000543
smart SapB 74aa 1e-06
in ref transcript
Saposin (B) Domains. Present in multiple copies in prosaposin and in pulmonary surfactant-associated protein B. In plant aspartic proteinases, a saposin domain is circularly permuted. This causes the prediction algorithm to predict two such domains, where only one is truly present.
Changed!
pfam Metallophos 97aa 0.006
in ref transcript
Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacterial SbcD or yeast MRE11. The most conserved regions in this superfamily centre around the metal chelating residues.
COG COG0535 101aa 0.003
in ref transcript
Predicted Fe-S oxidoreductases [General function prediction only].
Changed!
pfam Metallophos 151aa 1e-06
in modified transcript
SMPD4
refseq_SMPD4.F1 refseq_SMPD4.R1 118 256
NCBIGene 36.2 55627
Alternative 3-prime, size difference: 138
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_017751
SMPD4
refseq_SMPD4.F4 refseq_SMPD4.R4 300 387
NCBIGene 36.3 55627
Single exon skipping, size difference: 87
Exclusion in the protein (no frameshift)
Reference transcript: NM_017951
SMTN
refseq_SMTN.F2 refseq_SMTN.R2 186 282
NCBIGene 36.3 6525
Alternative 5-prime, size difference: 96
Inclusion in the protein causing a new stop codon
Reference transcript: NM_134270
Changed!
cd CH 128aa 1e-19
in ref transcript
Calponin homology domain; actin-binding domain which may be present as a single copy or in tandem repeats (which increases binding affinity). The CH domain is found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).
Changed!
pfam CH 128aa 6e-23
in ref transcript
Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is not involved in its actin binding activity. Most proteins have two copies of the CH domain, however some proteins such as calponin have only a single copy.
Changed!
COG SAC6 92aa 1e-12
in ref transcript
Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton].
Changed!
cd CH 101aa 9e-21
in modified transcript
Changed!
pfam CH 101aa 9e-24
in modified transcript
Changed!
COG SAC6 102aa 2e-14
in modified transcript
SMURF1
refseq_SMURF1.F2 refseq_SMURF1.R2 270 348
NCBIGene 36.3 57154
Single exon skipping, size difference: 78
Exclusion in the protein (no frameshift)
Reference transcript: NM_020429
cd HECTc 356aa 1e-129
in ref transcript
HECT domain; C-terminal catalytic domain of a subclass of Ubiquitin-protein ligase (E3). It binds specific ubiquitin-conjugating enzymes (E2), accepts ubiquitin from E2, transfers ubiquitin to substrate lysine side chains, and transfers additional ubiquitin molecules to the end of growing ubiquitin chains.
cd C2 105aa 8e-12
in ref transcript
Protein kinase C conserved region 2 (CalB); Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands.
cd WW 30aa 2e-06
in ref transcript
Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other peptide recognition proteins such as antibodies and receptors; WW domains have a single groove formed by a conserved Trp and Tyr which recognizes a pair of residues of the sequence X-Pro; variable loops and neighboring domains confer specificity in this domain; there are five distinct groups based on binding: 1) PPXY motifs 2) the PPLP motif; 3) PGM motifs; 4) PSP or PTP motifs; 5) PR motifs.
cd WW 31aa 1e-04
in ref transcript
pfam HECT 306aa 1e-125
in ref transcript
HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus.
pfam C2 85aa 7e-15
in ref transcript
C2 domain.
smart WW 32aa 7e-08
in ref transcript
Domain with 2 conserved Trp (W) residues. Also known as the WWP or rsp5 domain. Binds proline-rich polypeptides.
smart WW 32aa 1e-05
in ref transcript
Changed!
COG HUL4 530aa 1e-133
in ref transcript
Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
Changed!
COG COG5038 89aa 0.001
in ref transcript
Ca2+-dependent lipid-binding protein, contains C2 domain [General function prediction only].
Changed!
COG HUL4 504aa 1e-133
in modified transcript
Changed!
COG HUL4 301aa 3e-04
in modified transcript
SNAP23
refseq_SNAP23.F1 refseq_SNAP23.R1 188 347
NCBIGene 36.3 8773
Single exon skipping, size difference: 159
Exclusion in the protein (no frameshift)
Reference transcript: NM_003825
cd t_SNARE 60aa 3e-07
in ref transcript
Soluble NSF (N-ethylmaleimide-sensitive fusion protein)-Attachment protein (SNAP) REceptor domain; these alpha-helical motifs form twisted and parallel heterotetrameric helix bundles; the core complex contains one helix from a protein that is anchored in the vesicle membrane (synaptobrevin), one helix from a protein of the target membrane (syntaxin), and two helices from another protein anchored in the target membrane (SNAP-25); their interaction forms a core which is composed of a polar zero layer, a flanking leucine-zipper layer acts as a water tight shield to isolate ionic interactions in the zero layer from the surrounding solvent.
Changed!
cd t_SNARE 59aa 0.006
in ref transcript
pfam SNARE 58aa 5e-09
in ref transcript
SNARE domain. Most if not all vesicular membrane fusion events in eukaryotic cells are believed to be mediated by a conserved fusion machinery, the SNARE [soluble N-ethylmaleimide-sensitive factor (NSF) attachment protein (SNAP) receptors] machinery. The SNARE domain is thought to act as a protein-protein interaction module in the assembly of a SNARE protein complex.
Changed!
pfam SNAP-25 62aa 2e-08
in ref transcript
SNAP-25 family. SNAP-25 (synaptosome-associated protein 25 kDa) proteins are components of SNARE complexes. Members of this family contain a cluster of cysteine residues that can be palmitoylated for membrane attachment.
smart t_SNARE 66aa 2e-05
in ref transcript
Helical region found in SNAREs. All alpha-helical motifs that form twisted and parallel four-helix bundles in target soluble N-ethylmaleimide-sensitive factor (NSF) attachment protein (SNAP) receptor proteins. This motif found in "Q-SNAREs".
SNCA
refseq_SNCA.F2 refseq_SNCA.R2 283 367
NCBIGene 36.3 6622
Single exon skipping, size difference: 84
Exclusion in the protein (no frameshift)
Reference transcript: NM_000345
Changed!
pfam Synuclein 132aa 8e-26
in ref transcript
Synuclein. There are three types of synucleins in humans, these are called alpha, beta and gamma. Alpha synuclein has been found mutated in families with autosomal dominant Parkinson's disease. A peptide of alpha synuclein has also been found in amyloid plaques in Alzheimer's patients.
Changed!
pfam Synuclein 107aa 1e-17
in modified transcript
SNCB
refseq_SNCB.F1 refseq_SNCB.R1 204 361
NCBIGene 36.3 6620
Single exon skipping, size difference: 157
Exclusion in 5'UTR
Reference transcript: NM_001001502
pfam Synuclein 92aa 8e-26
in ref transcript
Synuclein. There are three types of synucleins in humans, these are called alpha, beta and gamma. Alpha synuclein has been found mutated in families with autosomal dominant Parkinson's disease. A peptide of alpha synuclein has also been found in amyloid plaques in Alzheimer's patients.
SNRP70
refseq_SNRP70.F2 refseq_SNRP70.R2 109 181
NCBIGene 36.2 6625
Single exon skipping, size difference: 72
Inclusion in the protein causing a new stop codon
Reference transcript: NM_003089
Changed!
cd RRM 74aa 5e-17
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
Changed!
smart RRM 71aa 2e-16
in ref transcript
RNA recognition motif.
Changed!
COG COG0724 91aa 1e-10
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
Changed!
cd RRM 53aa 3e-12
in modified transcript
Changed!
smart RRM 51aa 1e-11
in modified transcript
Changed!
COG COG0724 68aa 8e-08
in modified transcript
SNRPB
refseq_SNRPB.F1 refseq_SNRPB.R1 160 306
NCBIGene 36.3 6628
Alternative 3-prime, size difference: 146
Inclusion in the protein causing a new stop codon
Reference transcript: NM_198216
cd Sm_B 79aa 2e-40
in ref transcript
The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. Sm subunit B heterodimerizes with subunit D3 and three such heterodimers form a hexameric ring structure with alternating B and D3 subunits. The D3 - B heterodimer also assembles into a heptameric ring containing D1, D2, E, F, and G subunits. Sm-like proteins exist in archaea as well as prokaryotes which form heptameric and hexameric ring structures similar to those found in eukaryotes.
smart Sm 76aa 2e-16
in ref transcript
snRNP Sm proteins. small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing.
COG LSM1 82aa 3e-08
in ref transcript
Small nuclear ribonucleoprotein (snRNP) homolog [Transcription].
SNRPB2
refseq_SNRPB2.F1 refseq_SNRPB2.R1 169 269
NCBIGene 36.3 6629
Alternative 5-prime, size difference: 100
Exclusion in 5'UTR
Reference transcript: NM_003092
cd RRM 76aa 2e-11
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
cd RRM 60aa 5e-08
in ref transcript
smart RRM_2 75aa 3e-11
in ref transcript
RNA recognition motif.
pfam RRM_1 60aa 3e-10
in ref transcript
RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternative splicing, and protein components of snRNPs. The motif also appears in a few single stranded DNA binding proteins. The RRM structure consists of four strands and two helices arranged in an alpha/beta sandwich, with a third helix present during RNA binding in some cases The C-terminal beta strand (4th strand) and final helix are hard to align and have been omitted in the SEED alignment The LA proteins have a N terminus rrm which is included in the seed. There is a second region towards the C terminus that has some features of a rrm but does not appear to have the important structural core of a rrm. The LA proteins are one of the main autoantigens in Systemic lupus erythematosus (SLE), an autoimmune disease.
COG COG0724 95aa 5e-07
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
COG COG0724 89aa 0.002
in ref transcript
SNRPD2
refseq_SNRPD2.F1 refseq_SNRPD2.R1 157 272
NCBIGene 36.3 6633
Single exon skipping, size difference: 115
Inclusion in the protein causing a new stop codon
Reference transcript: NM_004597
Changed!
cd Sm_D2 88aa 2e-35
in ref transcript
The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. Sm subunit D2 heterodimerizes with subunit D1 and three such heterodimers form a hexameric ring structure with alternating D1 and D2 subunits. The D1 - D2 heterodimer also assembles into a heptameric ring containing D2, D3, E, F, and G subunits. Sm-like proteins exist in archaea as well as prokaryotes which form heptameric and hexameric ring structures similar to those found in eukaryotes.
Changed!
smart Sm 72aa 1e-10
in ref transcript
snRNP Sm proteins. small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing.
Changed!
COG LSM1 82aa 2e-07
in ref transcript
Small nuclear ribonucleoprotein (snRNP) homolog [Transcription].
SNX1
refseq_SNX1.F1 refseq_SNX1.R1 301 496
NCBIGene 36.3 6642
Multiple exon skipping, size difference: 195
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_003099
Changed!
cd PX_SNX1 123aa 3e-64
in ref transcript
The phosphoinositide binding Phox Homology domain of Sorting Nexin 1. The PX domain is a phosphoinositide (PI) binding module present in many proteins with diverse functions. Sorting nexins (SNXs) make up the largest group among PX domain containing proteins. They are involved in regulating membrane traffic and protein sorting in the endosomal system. The PX domain of SNXs binds PIs and targets the protein to PI-enriched membranes. SNXs differ from each other in PI-binding specificity and affinity, and the presence of other protein-protein interaction domains, which help determine subcellular localization and specific function in the endocytic pathway. SNX1 is both membrane associated and a cytosolic protein that exists as a tetramer in protein complexes. It can associate reversibly with membranes of the endosomal compartment, thereby coating these vesicles. SNX1 is a component of the retromer complex, a membrane coat multimeric complex required for endosomal retrieval of lysosomal hydrolase receptors to the Golgi. The retromer consists of a cargo-recognition subcomplex and a subcomplex formed by a dimer of sorting nexins (SNX1 and/or SNX2), which ensures efficient cargo sorting by facilitating proper membrane localization of the cargo-recognition subcomplex. SNX1 contains a Bin/Amphiphysin/Rvs (BAR) domain C-terminal to the PX domain. The PX domain of SNX1 specifically binds phosphatidylinositol-3-phosphate (PI3P) and PI(3,5)P2, while the BAR domain detects membrane curvature. Both domains help determine the specific membrane-targeting of SNX1, which is localized to a microdomain in early endosomes where it regulates cation-independent mannose-6-phosphate receptor retrieval to the trans Golgi network.
pfam Vps5 234aa 6e-81
in ref transcript
Vps5 C terminal like. Vps5 is a sorting nexin that functions in membrane trafficking. This is the C terminal dimerisation domain.
Changed!
smart PX 118aa 8e-24
in ref transcript
PhoX homologous domain, present in p47phox and p40phox. Eukaryotic domain of unknown function present in phox proteins, PLD isoforms, a PI3K isoform.
Changed!
pfam Sorting_nexin 122aa 1e-22
in ref transcript
Sorting nexin, N-terminal domain. These proteins bins to the cytoplasmic domain of plasma membrane receptors. and are involved in endocytic protein trafficking. The N-terminal domain appears to be specific to sorting nexins 1 and 2.
Changed!
COG COG5391 355aa 7e-08
in ref transcript
Phox homology (PX) domain protein [Intracellular trafficking and secretion / General function prediction only].
Changed!
cd PX_SNX1 113aa 2e-59
in modified transcript
Changed!
smart PX 111aa 5e-21
in modified transcript
Changed!
pfam Sorting_nexin 81aa 3e-14
in modified transcript
Changed!
COG COG5391 392aa 9e-09
in modified transcript
SNX1
refseq_SNX1.F2 refseq_SNX1.R2 189 333
NCBIGene 36.3 6642
Single exon skipping, size difference: 144
Exclusion in the protein (no frameshift)
Reference transcript: NM_003099
cd PX_SNX1 123aa 3e-64
in ref transcript
The phosphoinositide binding Phox Homology domain of Sorting Nexin 1. The PX domain is a phosphoinositide (PI) binding module present in many proteins with diverse functions. Sorting nexins (SNXs) make up the largest group among PX domain containing proteins. They are involved in regulating membrane traffic and protein sorting in the endosomal system. The PX domain of SNXs binds PIs and targets the protein to PI-enriched membranes. SNXs differ from each other in PI-binding specificity and affinity, and the presence of other protein-protein interaction domains, which help determine subcellular localization and specific function in the endocytic pathway. SNX1 is both membrane associated and a cytosolic protein that exists as a tetramer in protein complexes. It can associate reversibly with membranes of the endosomal compartment, thereby coating these vesicles. SNX1 is a component of the retromer complex, a membrane coat multimeric complex required for endosomal retrieval of lysosomal hydrolase receptors to the Golgi. The retromer consists of a cargo-recognition subcomplex and a subcomplex formed by a dimer of sorting nexins (SNX1 and/or SNX2), which ensures efficient cargo sorting by facilitating proper membrane localization of the cargo-recognition subcomplex. SNX1 contains a Bin/Amphiphysin/Rvs (BAR) domain C-terminal to the PX domain. The PX domain of SNX1 specifically binds phosphatidylinositol-3-phosphate (PI3P) and PI(3,5)P2, while the BAR domain detects membrane curvature. Both domains help determine the specific membrane-targeting of SNX1, which is localized to a microdomain in early endosomes where it regulates cation-independent mannose-6-phosphate receptor retrieval to the trans Golgi network.
Changed!
pfam Vps5 234aa 6e-81
in ref transcript
Vps5 C terminal like. Vps5 is a sorting nexin that functions in membrane trafficking. This is the C terminal dimerisation domain.
smart PX 118aa 8e-24
in ref transcript
PhoX homologous domain, present in p47phox and p40phox. Eukaryotic domain of unknown function present in phox proteins, PLD isoforms, a PI3K isoform.
pfam Sorting_nexin 122aa 1e-22
in ref transcript
Sorting nexin, N-terminal domain. These proteins bins to the cytoplasmic domain of plasma membrane receptors. and are involved in endocytic protein trafficking. The N-terminal domain appears to be specific to sorting nexins 1 and 2.
Changed!
COG COG5391 355aa 7e-08
in ref transcript
Phox homology (PX) domain protein [Intracellular trafficking and secretion / General function prediction only].
Changed!
pfam Vps5 186aa 2e-56
in modified transcript
Changed!
COG COG5391 109aa 8e-07
in modified transcript
SNX11
refseq_SNX11.F1 refseq_SNX11.R1 114 174
NCBIGene 36.3 29916
Single exon skipping, size difference: 60
Exclusion in 5'UTR
Reference transcript: NM_152244
cd PX_SNX10 92aa 7e-37
in ref transcript
The phosphoinositide binding Phox Homology domain of Sorting Nexin 10. The PX domain is a phosphoinositide (PI) binding module present in many proteins with diverse functions. Sorting nexins (SNXs) make up the largest group among PX domain containing proteins. They are involved in regulating membrane traffic and protein sorting in the endosomal system. The PX domain of SNXs binds PIs and targets the protein to PI-enriched membranes. SNXs differ from each other in PI-binding specificity and affinity, and the presence of other protein-protein interaction domains, which help determine subcellular localization and specific function in the endocytic pathway. Some SNXs are localized in early endosome structures such as clathrin-coated pits, while others are located in late structures of the endocytic pathway. SNX10 may be involved in the regulation of endosome homeostasis. Its expression induces the formation of giant vacuoles in mammalian cells.
pfam PX 91aa 2e-20
in ref transcript
PX domain. PX domains bind to phosphoinositides.
COG COG5391 93aa 3e-09
in ref transcript
Phox homology (PX) domain protein [Intracellular trafficking and secretion / General function prediction only].
SNX14
refseq_SNX14.F1 refseq_SNX14.R1 136 163
NCBIGene 36.3 57231
Single exon skipping, size difference: 27
Exclusion in the protein (no frameshift)
Reference transcript: NM_153816
cd PX_SNX14 123aa 3e-50
in ref transcript
The phosphoinositide binding Phox Homology domain of Sorting Nexin 14. The PX domain is a phosphoinositide (PI) binding module present in many proteins with diverse functions. Sorting nexins (SNXs) make up the largest group among PX domain containing proteins. They are involved in regulating membrane traffic and protein sorting in the endosomal system. The PX domain of SNXs binds PIs and targets the protein to PI-enriched membranes. SNXs differ from each other in PI-binding specificity and affinity, and the presence of other protein-protein interaction domains, which help determine subcellular localization and specific function in the endocytic pathway. SNX14 may be involved in recruiting other proteins to the membrane via protein-protein and protein-ligand interaction. It is expressed in the embryonic nervous system of mice, and is co-expressed in the motoneurons and the anterior pituary with Islet-1. SNX14 shows a similar domain architecture as SNX13, containing an N-terminal PXA domain, a regulator of G protein signaling (RGS) domain, a PX domain, and a C-terminal domain that is conserved in some SNXs.
pfam Nexin_C 106aa 5e-25
in ref transcript
Sorting nexin C terminal. This region is found a the C terminal of proteins belonging to the sorting nexin family. It is found on proteins which also contain pfam00787.
smart PXA 156aa 6e-22
in ref transcript
Domain associated with PX domains. unpubl. observations.
pfam PX 105aa 2e-17
in ref transcript
PX domain. PX domains bind to phosphoinositides.
smart RGS 128aa 2e-07
in ref transcript
Regulator of G protein signalling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits.
COG COG5391 140aa 0.002
in ref transcript
Phox homology (PX) domain protein [Intracellular trafficking and secretion / General function prediction only].
SNX15
refseq_SNX15.F1 refseq_SNX15.R1 141 399
NCBIGene 36.3 29907
Single exon skipping, size difference: 258
Exclusion in the protein (no frameshift)
Reference transcript: NM_013306
cd PX_SNX15 118aa 9e-51
in ref transcript
The phosphoinositide binding Phox Homology domain of Sorting Nexin 15. The PX domain is a phosphoinositide (PI) binding module present in many proteins with diverse functions. Sorting nexins (SNXs) make up the largest group among PX domain containing proteins. They are involved in regulating membrane traffic and protein sorting in the endosomal system. The PX domain of SNXs binds PIs and targets the protein to PI-enriched membranes. SNXs differ from each other in PI-binding specificity and affinity, and the presence of other protein-protein interaction domains, which help determine subcellular localization and specific function in the endocytic pathway. SNX15 contains an N-terminal PX domain and a C-terminal Microtubule Interacting and Trafficking (MIT) domain. It plays a role in protein trafficking processes in the endocytic pathway and the trans-Golgi network. The PX domain of SNX15 interacts with the PDGF receptor and is responsible for the membrane association of the protein.
Changed!
cd MIT_SNX15 75aa 1e-19
in ref transcript
MIT: domain contained within Microtubule Interacting and Trafficking molecules. This MIT domain sub-family is found in sorting nexin 15 and related proteins. The molecular function of the MIT domain is unclear.
Changed!
smart MIT 77aa 4e-18
in ref transcript
Microtubule Interacting and Trafficking molecule domain.
pfam PX 112aa 2e-12
in ref transcript
PX domain. PX domains bind to phosphoinositides.
Changed!
cd MIT_SNX15 74aa 2e-10
in modified transcript
Changed!
smart MIT 74aa 9e-09
in modified transcript
SNX16
refseq_SNX16.F2 refseq_SNX16.R2 269 356
NCBIGene 36.3 64089
Single exon skipping, size difference: 87
Exclusion in the protein (no frameshift)
Reference transcript: NM_152836
Changed!
cd PX_SNX16 110aa 2e-57
in ref transcript
The phosphoinositide binding Phox Homology domain of Sorting Nexin 16. The PX domain is a phosphoinositide (PI) binding module present in many proteins with diverse functions. Sorting nexins (SNXs) make up the largest group among PX domain containing proteins. They are involved in regulating membrane traffic and protein sorting in the endosomal system. The PX domain of SNXs binds PIs and targets the protein to PI-enriched membranes. SNXs differ from each other in PI-binding specificity and affinity, and the presence of other protein-protein interaction domains, which help determine subcellular localization and specific function in the endocytic pathway. SNX16 contains a central PX domain followed by a coiled-coil region. SNX16 is localized in early and recycling endosomes through the binding of its PX domain to phosphatidylinositol-3-phosphate (PI3P). It plays a role in epidermal growth factor (EGF) signaling by regulating EGF receptor membrane trafficking.
Changed!
pfam PX 89aa 3e-16
in ref transcript
PX domain. PX domains bind to phosphoinositides.
Changed!
COG COG5391 148aa 7e-05
in ref transcript
Phox homology (PX) domain protein [Intracellular trafficking and secretion / General function prediction only].
Changed!
cd PX_SNX16 81aa 3e-36
in modified transcript
Changed!
pfam PX 57aa 3e-07
in modified transcript
SNX7
refseq_SNX7.F2 refseq_SNX7.R2 222 375
NCBIGene 36.3 51375
Single exon skipping, size difference: 153
Exclusion in the protein (no frameshift)
Reference transcript: NM_015976
cd PX_SNX7 116aa 2e-62
in ref transcript
The phosphoinositide binding Phox Homology domain of Sorting Nexin 7. The PX domain is a phosphoinositide (PI) binding module present in many proteins with diverse functions. Sorting nexins (SNXs) make up the largest group among PX domain containing proteins. They are involved in regulating membrane traffic and protein sorting in the endosomal system. The PX domain of SNXs binds PIs and targets the protein to PI-enriched membranes. SNXs differ from each other in PI-binding specificity and affinity, and the presence of other protein-protein interaction domains, which help determine subcellular localization and specific function in the endocytic pathway. Some SNXs are localized in early endosome structures such as clathrin-coated pits, while others are located in late structures of the endocytic pathway. SNX7 harbors a Bin/Amphiphysin/Rvs (BAR) domain, which detects membrane curvature, C-terminal to the PX domain, similar to the sorting nexins SNX1-2, SNX4-6, SNX8, SNX30, and SNX32. Both domains have been shown to determine the specific membrane-targeting of SNX1. The specific function of SNX7 has yet to be elucidated.
pfam PX 114aa 1e-21
in ref transcript
PX domain. PX domains bind to phosphoinositides.
Changed!
COG COG5391 374aa 2e-12
in ref transcript
Phox homology (PX) domain protein [Intracellular trafficking and secretion / General function prediction only].
Changed!
COG COG5391 236aa 5e-12
in modified transcript
SNX7
refseq_SNX7.F3 refseq_SNX7.R3 283 403
NCBIGene 36.3 51375
Single exon skipping, size difference: 120
Inclusion in the protein causing a new stop codon
Reference transcript: NM_015976
Changed!
cd PX_SNX7 116aa 2e-62
in ref transcript
The phosphoinositide binding Phox Homology domain of Sorting Nexin 7. The PX domain is a phosphoinositide (PI) binding module present in many proteins with diverse functions. Sorting nexins (SNXs) make up the largest group among PX domain containing proteins. They are involved in regulating membrane traffic and protein sorting in the endosomal system. The PX domain of SNXs binds PIs and targets the protein to PI-enriched membranes. SNXs differ from each other in PI-binding specificity and affinity, and the presence of other protein-protein interaction domains, which help determine subcellular localization and specific function in the endocytic pathway. Some SNXs are localized in early endosome structures such as clathrin-coated pits, while others are located in late structures of the endocytic pathway. SNX7 harbors a Bin/Amphiphysin/Rvs (BAR) domain, which detects membrane curvature, C-terminal to the PX domain, similar to the sorting nexins SNX1-2, SNX4-6, SNX8, SNX30, and SNX32. Both domains have been shown to determine the specific membrane-targeting of SNX1. The specific function of SNX7 has yet to be elucidated.
Changed!
pfam PX 114aa 1e-21
in ref transcript
PX domain. PX domains bind to phosphoinositides.
Changed!
COG COG5391 374aa 2e-12
in ref transcript
Phox homology (PX) domain protein [Intracellular trafficking and secretion / General function prediction only].
SOCS4
refseq_SOCS4.F1 refseq_SOCS4.R1 236 365
NCBIGene 36.3 122809
Single exon skipping, size difference: 129
Exclusion in 5'UTR
Reference transcript: NM_199421
cd SOCS_SOCS4 56aa 8e-27
in ref transcript
SOCS (suppressors of cytokine signaling) box of SOCS4-like proteins. Together with CIS1, the CIS/SOCS family of proteins is characterized by the presence of a C-terminal SOCS box and a central SH2 domain. The general function of the SOCS box is the recruitment of the ubiquitin-transferase system. The SOCS box interacts with Elongins B and C, Cullin-5 or Cullin-2, Rbx-1, and E2. Therefore, SOCS-box-containing proteins probably function as E3 ubiquitin ligases and mediate the degradation of proteins associated through their N-terminal regions.
cd SH2 96aa 1e-11
in ref transcript
Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
smart SH2 85aa 2e-14
in ref transcript
Src homology 2 domains. Src homology 2 domains bind phosphotyrosine-containing polypeptides via 2 surface pockets. Specificity is provided via interaction with residues that are distinct from the phosphotyrosine. Only a single occurrence of a SH2 domain has been found in S. cerevisiae.
smart SOCS 40aa 9e-09
in ref transcript
suppressors of cytokine signalling. suppressors of cytokine signalling.
SORBS1
refseq_SORBS1.F1 refseq_SORBS1.R1 182 389
NCBIGene 36.3 10580
Alternative 3-prime, size difference: 207
Exclusion in the protein (no frameshift)
Reference transcript: NM_001034954
cd SH3 54aa 3e-12
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
cd SH3 52aa 8e-11
in ref transcript
cd SH3 52aa 6e-10
in ref transcript
smart SH3 59aa 4e-13
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
smart SH3 53aa 3e-12
in ref transcript
smart SH3 59aa 7e-11
in ref transcript
pfam Sorb 41aa 4e-10
in ref transcript
Sorbin homologous domain.
SORBS1
refseq_SORBS1.F4 refseq_SORBS1.R4 195 363
NCBIGene 36.3 10580
Single exon skipping, size difference: 168
Exclusion in the protein (no frameshift)
Reference transcript: NM_001034954
cd SH3 54aa 3e-12
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
cd SH3 52aa 8e-11
in ref transcript
cd SH3 52aa 6e-10
in ref transcript
smart SH3 59aa 4e-13
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
smart SH3 53aa 3e-12
in ref transcript
smart SH3 59aa 7e-11
in ref transcript
pfam Sorb 41aa 4e-10
in ref transcript
Sorbin homologous domain.
SORBS1
refseq_SORBS1.F6 refseq_SORBS1.R6 189 285
NCBIGene 36.3 10580
Single exon skipping, size difference: 96
Exclusion in the protein (no frameshift)
Reference transcript: NM_001034954
cd SH3 54aa 3e-12
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
cd SH3 52aa 8e-11
in ref transcript
cd SH3 52aa 6e-10
in ref transcript
smart SH3 59aa 4e-13
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
smart SH3 53aa 3e-12
in ref transcript
smart SH3 59aa 7e-11
in ref transcript
pfam Sorb 41aa 4e-10
in ref transcript
Sorbin homologous domain.
SORBS1
refseq_SORBS1.F7 refseq_SORBS1.R7 130 157
NCBIGene 36.3 10580
Single exon skipping, size difference: 27
Exclusion in the protein (no frameshift)
Reference transcript: NM_001034954
cd SH3 54aa 3e-12
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
cd SH3 52aa 8e-11
in ref transcript
cd SH3 52aa 6e-10
in ref transcript
smart SH3 59aa 4e-13
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
smart SH3 53aa 3e-12
in ref transcript
smart SH3 59aa 7e-11
in ref transcript
pfam Sorb 41aa 4e-10
in ref transcript
Sorbin homologous domain.
SORBS2
refseq_SORBS2.F1 refseq_SORBS2.R1 204 273
NCBIGene 36.3 8470
Alternative 3-prime, size difference: 69
Exclusion in 5'UTR
Reference transcript: NM_021069
cd SH3 52aa 8e-14
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
cd SH3 54aa 9e-12
in ref transcript
cd SH3 54aa 3e-11
in ref transcript
pfam Sorb 50aa 9e-21
in ref transcript
Sorbin homologous domain.
smart SH3 53aa 5e-16
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
smart SH3 59aa 2e-12
in ref transcript
smart SH3 59aa 2e-12
in ref transcript
SORBS2
refseq_SORBS2.F2 refseq_SORBS2.R2 217 358
NCBIGene 36.3 8470
Multiple exon skipping, size difference: 141
Inclusion in the protein (no stop codon or frameshift), Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_021069
cd SH3 52aa 8e-14
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
cd SH3 54aa 9e-12
in ref transcript
cd SH3 54aa 3e-11
in ref transcript
pfam Sorb 50aa 9e-21
in ref transcript
Sorbin homologous domain.
smart SH3 53aa 5e-16
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
smart SH3 59aa 2e-12
in ref transcript
smart SH3 59aa 2e-12
in ref transcript
SORCS1
refseq_SORCS1.F1 refseq_SORCS1.R1 268 377
NCBIGene 36.3 114815
Exon skipping and alternative 3-prime or 5-prime, size difference: 109
Inclusion in the protein causing a new stop codon, Exclusion in the protein causing a frameshift
Reference transcript: NM_001013031
smart VPS10 598aa 0.0
in ref transcript
VPS10 domain.
pfam PKD 80aa 8e-06
in ref transcript
PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.
SOX6
refseq_SOX6.F1 refseq_SOX6.R1 227 269
NCBIGene 36.3 55553
Alternative 5-prime, size difference: 42
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_033326
cd SOX-TCF_HMG-box 64aa 1e-24
in ref transcript
SOX-TCF_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include SRY and its homologs in insects and vertebrates, and transcription factor-like proteins, TCF-1, -3, -4, and LEF-1. They appear to bind the minor groove of the A/T C A A A G/C-motif.
smart HMG 64aa 5e-16
in ref transcript
high mobility group.
COG NHP6B 52aa 2e-06
in ref transcript
Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics].
SOX6
refseq_SOX6.F4 refseq_SOX6.R4 137 260
NCBIGene 36.3 55553
Single exon skipping, size difference: 123
Exclusion in the protein (no frameshift)
Reference transcript: NM_033326
cd SOX-TCF_HMG-box 64aa 1e-24
in ref transcript
SOX-TCF_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include SRY and its homologs in insects and vertebrates, and transcription factor-like proteins, TCF-1, -3, -4, and LEF-1. They appear to bind the minor groove of the A/T C A A A G/C-motif.
smart HMG 64aa 5e-16
in ref transcript
high mobility group.
COG NHP6B 52aa 2e-06
in ref transcript
Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics].
SOX6
refseq_SOX6.F5 refseq_SOX6.R5 185 245
NCBIGene 36.3 55553
Alternative 3-prime, size difference: 60
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_033326
cd SOX-TCF_HMG-box 64aa 1e-24
in ref transcript
SOX-TCF_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include SRY and its homologs in insects and vertebrates, and transcription factor-like proteins, TCF-1, -3, -4, and LEF-1. They appear to bind the minor groove of the A/T C A A A G/C-motif.
smart HMG 64aa 5e-16
in ref transcript
high mobility group.
COG NHP6B 52aa 2e-06
in ref transcript
Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics].
SP110
refseq_SP110.F2 refseq_SP110.R2 295 367
NCBIGene 36.3 3431
Single exon skipping, size difference: 72
Exclusion in the protein (no frameshift)
Reference transcript: NM_080424
Changed!
cd Bromo_SP100C_like 101aa 8e-39
in ref transcript
Bromodomain, SP100C_like subfamily. The SP100C protein is a splice variant of SP100, a major component of PML-SP100 nuclear bodies (NBs), which are poorly understood. It is covalently modified by SUMO-1 and may play a role in processes at the chromatin level. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
pfam SAND 82aa 7e-29
in ref transcript
SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation.
pfam Sp100 101aa 3e-26
in ref transcript
Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342.
Changed!
smart BROMO 98aa 5e-09
in ref transcript
bromo domain.
Changed!
cd Bromo_SP100C_like 110aa 2e-38
in modified transcript
Changed!
smart BROMO 74aa 1e-06
in modified transcript
SPAST
refseq_SPAST.F1 refseq_SPAST.R1 231 327
NCBIGene 36.3 6683
Single exon skipping, size difference: 96
Exclusion in the protein (no frameshift)
Reference transcript: NM_014946
cd MIT_spastin 80aa 2e-22
in ref transcript
MIT: domain contained within Microtubule Interacting and Trafficking molecules. This MIT domain sub-family is found in the AAA protein spastin, a probable ATPase involved in the assembly or function of nuclear protein complexes; spastins might also be involved in microtubule dynamics. The molecular function of the MIT domain is unclear.
cd AAA 164aa 5e-21
in ref transcript
The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
pfam AAA 186aa 3e-50
in ref transcript
ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes.
smart MIT 78aa 3e-13
in ref transcript
Microtubule Interacting and Trafficking molecule domain.
pfam Mg_chelatase 54aa 3e-05
in ref transcript
Magnesium chelatase, subunit ChlI. Magnesium-chelatase is a three-component enzyme that catalyses the insertion of Mg2+ into protoporphyrin IX. This is the first unique step in the synthesis of (bacterio)chlorophyll. Due to this, it is thought that Mg-chelatase has an important role in channelling inter- mediates into the (bacterio)chlorophyll branch in response to conditions suitable for photosynthetic growth. ChlI and BchD have molecular weight between 38-42 kDa.
COG SpoVK 268aa 7e-60
in ref transcript
ATPases of the AAA+ class [Posttranslational modification, protein turnover, chaperones].
SPATA7
refseq_SPATA7.F1 refseq_SPATA7.R1 110 206
NCBIGene 36.3 55812
Single exon skipping, size difference: 96
Exclusion in the protein (no frameshift)
Reference transcript: NM_018418
SPHK1
refseq_SPHK1.F1 refseq_SPHK1.R1 107 470
NCBIGene 36.3 8877
Alternative 5-prime, size difference: 363
Exclusion of the protein initiation site
Reference transcript: NM_182965
pfam DAGK_cat 104aa 1e-19
in ref transcript
Diacylglycerol kinase catalytic domain. Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues. YegS is the Escherichia coli protein in this family whose crystal structure reveals an active site in the inter-domain cleft formed by four conserved sequence motifs, revealing a novel metal-binding site. The residues of this site are conserved across the family.
COG LCB5 333aa 4e-18
in ref transcript
Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase [Lipid metabolism / General function prediction only].
SPHK1
refseq_SPHK1.F3 refseq_SPHK1.R3 164 206
NCBIGene 36.3 8877
Alternative 3-prime, size difference: 42
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_182965
pfam DAGK_cat 104aa 1e-19
in ref transcript
Diacylglycerol kinase catalytic domain. Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues. YegS is the Escherichia coli protein in this family whose crystal structure reveals an active site in the inter-domain cleft formed by four conserved sequence motifs, revealing a novel metal-binding site. The residues of this site are conserved across the family.
COG LCB5 333aa 4e-18
in ref transcript
Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase [Lipid metabolism / General function prediction only].
SPINT1
refseq_SPINT1.F1 refseq_SPINT1.R1 192 240
NCBIGene 36.3 6692
Alternative 5-prime, size difference: 48
Exclusion in the protein (no frameshift)
Reference transcript: NM_181642
cd KU 52aa 8e-12
in ref transcript
BPTI/Kunitz family of serine protease inhibitors; Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure.
cd KU 54aa 1e-11
in ref transcript
cd LDLa 35aa 7e-06
in ref transcript
Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure.
pfam MANEC 92aa 3e-31
in ref transcript
MANEC domain. This region of similarity, comprising 8 conserved cysteines, is found in the N-terminal region of several membrane-associated and extracellular proteins. Although formerly called MANSC (for motif at N terminus with seven cysteines) it has now been renamed by MANEC (motif at N terminus with eight cysteines) by Richard Mitter and Stephen Fitzgerald after the discovery of an eighth conserved cysteine. It is postulated that this domain may play a role in the formation of protein complexes involving various protease activators and inhibitors.
pfam Kunitz_BPTI 53aa 3e-14
in ref transcript
Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure. Certain family members are similar to the tick anticoagulant peptide (TAP). This is a highly selective inhibitor of factor Xa in the blood coagulation pathways. TAP molecules are highly dipolar, and are arranged to form a twisted two- stranded antiparallel beta-sheet followed by an alpha helix.
pfam Kunitz_BPTI 52aa 4e-13
in ref transcript
smart LDLa 33aa 3e-06
in ref transcript
Low-density lipoprotein receptor domain class A. Cysteine-rich repeat in the low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. The N-terminal type A repeats in LDL receptor bind the lipoproteins. Other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement. Mutations in the LDL receptor gene cause familial hypercholesterolemia.
SPO11
refseq_SPO11.F2 refseq_SPO11.R2 263 377
NCBIGene 36.3 23626
Single exon skipping, size difference: 114
Exclusion in the protein (no frameshift)
Reference transcript: NM_012444
cd TOPRIM_TopoIIB_SPO 163aa 2e-53
in ref transcript
TOPRIM_TopoIIB_SPO: topoisomerase-primase (TOPRIM) nucleotidyl transferase/hydrolase domain of the type found in the type IIB family of DNA topoisomerases and Spo11. This subgroup contains proteins similar to Sulfolobus shibatae topoisomerase VI (TopoVI) and Saccharomyces cerevisiae meiotic recombination factor: Spo11. Type II DNA topoisomerases catalyze the ATP-dependent transport of one DNA duplex through another, in the process generating transient double strand breaks via covalent attachments to both DNA strands at the 5' positions. TopoVI enzymes are heterotetramers found in archaea and plants. Spo11 plays a role in generating the double strand breaks that initiate homologous recombination during meiosis. S. shibatae TopoVI relaxes both positive and negative supercoils, and in addition has a strong decatenase activity. The TOPRIM domain has two conserved motifs, one of which centers at a conserved glutamate and the other one at two conserved aspartates (DxD. For topoisomerases the conserved glutamate is believed to act as a general base in strand joining and, as a general acid in strand cleavage. The DXD motif may co-ordinate Mg2+, a cofactor required for full catalytic function.
pfam SPO11_like 42aa 3e-19
in ref transcript
SPO11 homologue.
pfam TP6A_N 68aa 7e-19
in ref transcript
Type IIB DNA topoisomerase. Type II DNA topoisomerases are ubiquitous enzymes that catalyse the ATP-dependent transport of one DNA duplex through a second DNA segment via a transient double-strand break. Type II DNA topoisomerases are now subdivided into two sub-families, type IIA and IIB DNA topoisomerases. TP6A_N is present in type IIB topoisomerase and is thought to be involved in DNA binding owing to its sequence similarity to Escherichia coli catabolite activator protein (CAP).
Changed!
PRK PRK04342 354aa 4e-65
in ref transcript
DNA topoisomerase VI subunit A; Provisional.
Changed!
PRK PRK04342 291aa 2e-63
in modified transcript
SPOP
refseq_SPOP.F1 refseq_SPOP.R1 118 240
NCBIGene 36.3 8405
Single exon skipping, size difference: 122
Exclusion in 5'UTR
Reference transcript: NM_001007226
cd MATH_SPOP 139aa 8e-70
in ref transcript
Speckle-type POZ protein (SPOP) family, MATH domain; composed of proteins with similarity to human SPOP. SPOP was isolated as a novel antigen recognized by serum from a scleroderma patient, whose overexpression in COS cells results in a discrete speckled pattern in the nuclei. It contains an N-terminal MATH domain and a C-terminal BTB (also called POZ) domain. Together with Cul3, SPOP constitutes an ubiquitin E3 ligase which is able to ubiquitinate the PcG protein BMI1, the variant histone macroH2A1 and the death domain-associated protein Daxx. Therefore, SPOP may be involved in the regulation of these proteins and may play a role in transcriptional regulation, apoptosis and X-chromosome inactivation. Cul3 binds to the BTB domain of SPOP whereas Daxx and the macroH2A1 nonhistone region have been shown to bind to the MATH domain. Both MATH and BTB domains are necessary for the nuclear speckled accumulation of SPOP. There are many proteins, mostly uncharacterized, containing both MATH and BTB domains from C. elegans and plants which are excluded from this family.
pfam BTB 108aa 3e-21
in ref transcript
BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of pox virus proteins. The BTB/POZ domain mediates homomeric dimerisation and in some instances heteromeric dimerisation. The structure of the dimerised PLZF BTB/POZ domain has been solved and consists of a tightly intertwined homodimer. The central scaffolding of the protein is made up of a cluster of alpha-helices flanked by short beta-sheets at both the top and bottom of the molecule. POZ domains from several zinc finger proteins have been shown to mediate transcriptional repression and to interact with components of histone deacetylase co-repressor complexes including N-CoR and SMRT. The POZ or BTB domain is also known as BR-C/Ttk or ZiN.
smart MATH 102aa 8e-10
in ref transcript
meprin and TRAF homology.
SPOP
refseq_SPOP.F2 refseq_SPOP.R2 121 172
NCBIGene 36.3 8405
Single exon skipping, size difference: 51
Exclusion in 5'UTR
Reference transcript: NM_001007230
cd MATH_SPOP 139aa 8e-70
in ref transcript
Speckle-type POZ protein (SPOP) family, MATH domain; composed of proteins with similarity to human SPOP. SPOP was isolated as a novel antigen recognized by serum from a scleroderma patient, whose overexpression in COS cells results in a discrete speckled pattern in the nuclei. It contains an N-terminal MATH domain and a C-terminal BTB (also called POZ) domain. Together with Cul3, SPOP constitutes an ubiquitin E3 ligase which is able to ubiquitinate the PcG protein BMI1, the variant histone macroH2A1 and the death domain-associated protein Daxx. Therefore, SPOP may be involved in the regulation of these proteins and may play a role in transcriptional regulation, apoptosis and X-chromosome inactivation. Cul3 binds to the BTB domain of SPOP whereas Daxx and the macroH2A1 nonhistone region have been shown to bind to the MATH domain. Both MATH and BTB domains are necessary for the nuclear speckled accumulation of SPOP. There are many proteins, mostly uncharacterized, containing both MATH and BTB domains from C. elegans and plants which are excluded from this family.
pfam BTB 108aa 3e-21
in ref transcript
BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of pox virus proteins. The BTB/POZ domain mediates homomeric dimerisation and in some instances heteromeric dimerisation. The structure of the dimerised PLZF BTB/POZ domain has been solved and consists of a tightly intertwined homodimer. The central scaffolding of the protein is made up of a cluster of alpha-helices flanked by short beta-sheets at both the top and bottom of the molecule. POZ domains from several zinc finger proteins have been shown to mediate transcriptional repression and to interact with components of histone deacetylase co-repressor complexes including N-CoR and SMRT. The POZ or BTB domain is also known as BR-C/Ttk or ZiN.
smart MATH 102aa 8e-10
in ref transcript
meprin and TRAF homology.
SREBF1
refseq_SREBF1.F1 refseq_SREBF1.R1 182 272
NCBIGene 36.3 6720
Single exon skipping, size difference: 90
Exclusion in the protein (no frameshift)
Reference transcript: NM_001005291
cd HLH 58aa 9e-12
in ref transcript
Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
pfam HLH 51aa 2e-12
in ref transcript
Helix-loop-helix DNA-binding domain.
SS18
refseq_SS18.F1 refseq_SS18.R1 115 208
NCBIGene 36.3 6760
Single exon skipping, size difference: 93
Exclusion in the protein (no frameshift)
Reference transcript: NM_001007559
pfam SSXT 66aa 4e-20
in ref transcript
SSXT protein (N-terminal region). The SSXT or SS18 protein is involved in synovial sarcoma in humans. A SYT-SSX fusion gene resulting from the chromosomal translocation t(X;18) (p11;q11) is characteristic of synovial sarcomas. This translocation fuses the SSXT (SYT) gene from chromosome 18 to either of two homologous genes at Xp11, SSX1 or SSX2.
SSBP3
refseq_SSBP3.F1 refseq_SSBP3.R1 126 207
NCBIGene 36.3 23648
Single exon skipping, size difference: 81
Exclusion in the protein (no frameshift)
Reference transcript: NM_145716
pfam SSDP 144aa 2e-06
in ref transcript
Single-stranded DNA binding protein, SSDP. This is a family of eukaryotic single-stranded DNA binding proteins with specificity to a pyrimidine-rich element found in the promoter region of the alpha2(I) collagen gene.
SSBP4
refseq_SSBP4.F1 refseq_SSBP4.R1 100 166
NCBIGene 36.3 170463
Single exon skipping, size difference: 66
Exclusion in the protein (no frameshift)
Reference transcript: NM_032627
pfam SSDP 149aa 7e-15
in ref transcript
Single-stranded DNA binding protein, SSDP. This is a family of eukaryotic single-stranded DNA binding proteins with specificity to a pyrimidine-rich element found in the promoter region of the alpha2(I) collagen gene.
Changed!
pfam PAT1 157aa 3e-07
in ref transcript
Topoisomerase II-associated protein PAT1. Members of this family are necessary for accurate chromosome transmission during cell division.
Changed!
PRK PRK07764 149aa 1e-04
in ref transcript
DNA polymerase III subunits gamma and tau; Validated.
Changed!
PRK PRK07764 137aa 5e-04
in modified transcript
SSH3
refseq_SSH3.F1 refseq_SSH3.R1 246 344
NCBIGene 36.2 54961
Alternative 3-prime, size difference: 98
Exclusion in the protein causing a frameshift
Reference transcript: NM_017857
cd DSPc 136aa 2e-37
in ref transcript
Dual specificity phosphatases (DSP); Ser/Thr and Tyr protein phosphatases. Structurally similar to tyrosine-specific phosphatases but with a shallower active site cleft and a distinctive active site signature motif, HCxxGxxR. Characterized as VHR- or Cdc25-like.
smart DSPc 135aa 3e-36
in ref transcript
Dual specificity phosphatase, catalytic domain.
pfam DEK_C 54aa 4e-12
in ref transcript
DEK C terminal domain. DEK is a chromatin associated protein that is linked with cancers and autoimmune disease. This domain is found at the C terminal of DEK and is of clinical importance since it can reverse the characteristic abnormal DNA-mutagen sensitivity in fibroblasts from ataxia-telangiectasia (A-T) patients. The structure of this domain shows it to be homologous to the E2F/DP transcription factor family. This domain is also found in chitin synthase proteins, and in protein phosphatases.
PRK PRK12361 131aa 7e-12
in ref transcript
hypothetical protein; Provisional.
SSX4
refseq_SSX4B.F1 refseq_SSX4B.R1 121 257
NCBIGene 36.3 6759
Single exon skipping, size difference: 136
Exclusion in the protein causing a frameshift
Reference transcript: NM_005636
Changed!
pfam SSXRD 34aa 2e-08
in ref transcript
SSXRD motif. SSX1 can repress transcription, and this has been attributed to a putative Kruppel associated box (KRAB) repression domain at the N-terminus. However, from the analysis of these deletion constructs further repression activity was found at the C-terminus of SSX1. Which has been called the SSXRD (SSX Repression Domain). The potent repression exerted by full-length SSX1 appears to localise to this region.
smart KRAB 60aa 1e-05
in ref transcript
krueppel associated box.
ST3GAL3
refseq_ST3GAL3.F1 refseq_ST3GAL3.R1 107 401
NCBIGene 36.3 6487
Multiple exon skipping, size difference: 294
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_174963
Changed!
pfam Glyco_transf_29 218aa 4e-38
in ref transcript
Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29.
Changed!
pfam Glyco_transf_29 85aa 4e-19
in modified transcript
ST3GAL3
refseq_ST3GAL3.F3 refseq_ST3GAL3.R3 134 179
NCBIGene 36.3 6487
Alternative 5-prime, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_174963
pfam Glyco_transf_29 218aa 4e-38
in ref transcript
Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29.
ST3GAL3
refseq_ST3GAL3.F5 refseq_ST3GAL3.R5 109 271
NCBIGene 36.3 6487
Single exon skipping, size difference: 162
Exclusion in the protein (no frameshift)
Reference transcript: NM_174963
pfam Glyco_transf_29 218aa 4e-38
in ref transcript
Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29.
ST6GALNAC4
refseq_ST6GALNAC4.F1 refseq_ST6GALNAC4.R1 137 224
NCBIGene 36.3 27090
Single exon skipping, size difference: 87
Exclusion of the protein initiation site
Reference transcript: NM_175039
Changed!
pfam Glyco_transf_29 273aa 2e-69
in ref transcript
Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29.
Changed!
pfam Glyco_transf_29 201aa 3e-61
in modified transcript
ST7
refseq_ST7.F1 refseq_ST7.R1 197 266
NCBIGene 36.3 7982
Single exon skipping, size difference: 69
Exclusion in the protein (no frameshift)
Reference transcript: NM_021908
Changed!
pfam ST7 535aa 0.0
in ref transcript
ST7 protein. The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain.
Changed!
pfam ST7 512aa 0.0
in modified transcript
ST7L
refseq_ST7L.F1 refseq_ST7L.R1 166 259
NCBIGene 36.3 54879
Single exon skipping, size difference: 93
Exclusion in the protein (no frameshift)
Reference transcript: NM_017744
Changed!
pfam ST7 502aa 0.0
in ref transcript
ST7 protein. The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain.
Changed!
pfam ST7 471aa 0.0
in modified transcript
ST7L
refseq_ST7L.F2 refseq_ST7L.R2 101 152
NCBIGene 36.3 54879
Alternative 3-prime, size difference: 51
Exclusion in the protein (no frameshift)
Reference transcript: NM_017744
Changed!
pfam ST7 502aa 0.0
in ref transcript
ST7 protein. The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain.
Changed!
pfam ST7 485aa 0.0
in modified transcript
STAP2
refseq_STAP2.F2 refseq_STAP2.R2 223 361
NCBIGene 36.3 55620
Alternative 3-prime, size difference: 138
Exclusion in the protein (no frameshift)
Reference transcript: NM_017720
cd SH2 96aa 8e-05
in ref transcript
Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
pfam SH2 78aa 0.003
in ref transcript
SH2 domain.
STARD5
refseq_STARD5.F2 refseq_STARD5.R2 108 158
NCBIGene 36.2 80765
Single exon skipping, size difference: 50
Exclusion in the protein causing a frameshift
Reference transcript: NM_181900
Changed!
cd START 211aa 8e-31
in ref transcript
START(STeroidogenic Acute Regulatory (STAR) related lipid Transfer) Domain. These domains are 200-210 amino acid in length and occur in proteins involved in lipid transport (phosphatidylcholine) and metabolism, signal transduction, and transcriptional regulation. The most striking feature of the START domain structure is a predominantly hydrophobic tunnel extending nearly the entire protein and used to binding a single molecule of large lipophilic compounds, like cholesterol.
Changed!
pfam START 206aa 2e-17
in ref transcript
START domain.
STATH
refseq_STATH.F1 refseq_STATH.R1 112 142
NCBIGene 36.3 6779
Single exon skipping, size difference: 30
Exclusion in the protein (no frameshift)
Reference transcript: NM_003154
Changed!
pfam Statherin 19aa 5e-05
in ref transcript
Statherin. Statherin functions biologically to inhibit the nucleation and growth of calcium phosphate minerals. The N-terminus of statherin is highly charge, the glutamic acids of which have been shown to be important in the recognition hydroxyapatite.
STAU1
refseq_STAU1.F1 refseq_STAU1.R1 101 119
NCBIGene 36.3 6780
Alternative 3-prime, size difference: 18
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_017453
cd DSRM 67aa 1e-12
in ref transcript
Double-stranded RNA binding motif. Binding is not sequence specific but is highly specific for double stranded RNA. Found in a variety of proteins including dsRNA dependent protein kinase PKR, RNA helicases, Drosophila staufen protein, E. coli RNase III, RNases H1, and dsRNA dependent adenosine deaminases.
smart DSRM 67aa 3e-14
in ref transcript
Double-stranded RNA binding motif.
Changed!
smart DSRM 32aa 0.001
in ref transcript
PRK rnc 72aa 1e-10
in ref transcript
ribonuclease III; Reviewed.
STAU1
refseq_STAU1.F2 refseq_STAU1.R2 107 396
NCBIGene 36.3 6780
Single exon skipping, size difference: 289
Exclusion of the protein initiation site
Reference transcript: NM_017453
cd DSRM 67aa 1e-12
in ref transcript
Double-stranded RNA binding motif. Binding is not sequence specific but is highly specific for double stranded RNA. Found in a variety of proteins including dsRNA dependent protein kinase PKR, RNA helicases, Drosophila staufen protein, E. coli RNase III, RNases H1, and dsRNA dependent adenosine deaminases.
smart DSRM 67aa 3e-14
in ref transcript
Double-stranded RNA binding motif.
smart DSRM 32aa 0.001
in ref transcript
PRK rnc 72aa 1e-10
in ref transcript
ribonuclease III; Reviewed.
STRN4
refseq_STRN4.F1 refseq_STRN4.R1 118 139
NCBIGene 36.3 29888
Alternative 3-prime, size difference: 21
Exclusion in the protein (no frameshift)
Reference transcript: NM_001039877
cd WD40 306aa 2e-42
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
pfam Striatin 134aa 3e-31
in ref transcript
Striatin family. Striatin is an intracellular protein which has a caveolin-binding motif, a coiled-coil structure, a calmodulin-binding site, and a WD (pfam00400) repeat domain. It acts as a scaffold protein and is involved in signalling pathways.
smart WD40 40aa 1e-08
in ref transcript
WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
smart WD40 40aa 1e-06
in ref transcript
smart WD40 40aa 6e-06
in ref transcript
COG COG2319 303aa 5e-25
in ref transcript
FOG: WD40 repeat [General function prediction only].
STX16
refseq_STX16.F1 refseq_STX16.R1 112 175
NCBIGene 36.3 8675
Exon skipping and alternative 3-prime or 5-prime, size difference: 63
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_001001433
cd t_SNARE 47aa 3e-06
in ref transcript
Soluble NSF (N-ethylmaleimide-sensitive fusion protein)-Attachment protein (SNAP) REceptor domain; these alpha-helical motifs form twisted and parallel heterotetrameric helix bundles; the core complex contains one helix from a protein that is anchored in the vesicle membrane (synaptobrevin), one helix from a protein of the target membrane (syntaxin), and two helices from another protein anchored in the target membrane (SNAP-25); their interaction forms a core which is composed of a polar zero layer, a flanking leucine-zipper layer acts as a water tight shield to isolate ionic interactions in the zero layer from the surrounding solvent.
pfam Syntaxin 107aa 6e-13
in ref transcript
Syntaxin. Syntaxins are the prototype family of SNARE proteins. They usually consist of three main regions - a C-terminal transmembrane region, a central SNARE domain which is characteristic of and conserved in all syntaxins (pfam05739), and an N-terminal domain that is featured in this entry. This domain varies between syntaxin isoforms; in syntaxin 1A it is found as three alpha-helices with a left-handed twist. It may fold back on the SNARE domain to allow the molecule to adopt a 'closed' configuration that prevents formation of the core fusion complex - it thus has an auto-inhibitory role. The function of syntaxins is determined by their localisation. They are involved in neuronal exocytosis, ER-Golgi transport and Golgi-endosome transport, for example. They also interact with other proteins as well as those involved in SNARE complexes. These include vesicle coat proteins, Rab GTPases, and tethering factors.
pfam SNARE 52aa 6e-10
in ref transcript
SNARE domain. Most if not all vesicular membrane fusion events in eukaryotic cells are believed to be mediated by a conserved fusion machinery, the SNARE [soluble N-ethylmaleimide-sensitive factor (NSF) attachment protein (SNAP) receptors] machinery. The SNARE domain is thought to act as a protein-protein interaction module in the assembly of a SNARE protein complex.
COG COG5325 227aa 3e-15
in ref transcript
t-SNARE complex subunit, syntaxin [Intracellular trafficking and secretion].
STX2
refseq_STX2.F1 refseq_STX2.R1 192 318
NCBIGene 36.3 2054
Single exon skipping, size difference: 126
Exclusion of the stop codon
Reference transcript: NM_194356
cd SynN 152aa 1e-29
in ref transcript
Syntaxin N-terminus domain; syntaxins are nervous system-specific proteins implicated in the docking of synaptic vesicles with the presynaptic plasma membrane; they are a family of receptors for intracellular transport vesicles; each target membrane may be identified by a specific member of the syntaxin family; syntaxins contain a moderately well conserved amino-terminal domain, called Habc, whose structure is an antiparallel three-helix bundle; a linker of about 30 amino acids connects this to the carboxy-terminal region, designated H3 (t_SNARE), of the syntaxin cytoplasmic domain; the highly conserved H3 region forms a single, long alpha-helix when it is part of the core SNARE complex and anchors the protein on the cytoplasmic surface of cellular membranes; H3 is not included in defining this domain.
cd t_SNARE 60aa 2e-07
in ref transcript
Soluble NSF (N-ethylmaleimide-sensitive fusion protein)-Attachment protein (SNAP) REceptor domain; these alpha-helical motifs form twisted and parallel heterotetrameric helix bundles; the core complex contains one helix from a protein that is anchored in the vesicle membrane (synaptobrevin), one helix from a protein of the target membrane (syntaxin), and two helices from another protein anchored in the target membrane (SNAP-25); their interaction forms a core which is composed of a polar zero layer, a flanking leucine-zipper layer acts as a water tight shield to isolate ionic interactions in the zero layer from the surrounding solvent.
smart SynN 119aa 4e-23
in ref transcript
Syntaxin N-terminal domain. Three-helix domain that (in Sso1p) slows the rate of its reaction with the SNAP-25 homologue Sec9p.
pfam SNARE 62aa 6e-13
in ref transcript
SNARE domain. Most if not all vesicular membrane fusion events in eukaryotic cells are believed to be mediated by a conserved fusion machinery, the SNARE [soluble N-ethylmaleimide-sensitive factor (NSF) attachment protein (SNAP) receptors] machinery. The SNARE domain is thought to act as a protein-protein interaction module in the assembly of a SNARE protein complex.
Changed!
COG COG5074 234aa 9e-15
in ref transcript
t-SNARE complex subunit, syntaxin [Intracellular trafficking and secretion].
Changed!
COG COG5074 235aa 7e-14
in modified transcript
STXBP1
refseq_STXBP1.F1 refseq_STXBP1.R1 227 353
NCBIGene 36.3 6812
Single exon skipping, size difference: 126
Exclusion of the stop codon
Reference transcript: NM_003165
Changed!
pfam Sec1 549aa 1e-180
in ref transcript
Sec1 family.
Changed!
COG SEC1 543aa 7e-55
in ref transcript
Proteins involved in synaptic transmission and general secretion, Sec1 family [Intracellular trafficking and secretion].
Changed!
pfam Sec1 543aa 1e-178
in modified transcript
Changed!
COG SEC1 535aa 2e-53
in modified transcript
SULF2
refseq_SULF2.F1 refseq_SULF2.R1 159 213
NCBIGene 36.3 55959
Single exon skipping, size difference: 54
Exclusion in the protein (no frameshift)
Reference transcript: NM_018837
pfam Sulfatase 352aa 3e-34
in ref transcript
Sulfatase.
COG AslA 353aa 2e-29
in ref transcript
Arylsulfatase A and related enzymes [Inorganic ion transport and metabolism].
SULT1A1
refseq_SULT1A1.F1 refseq_SULT1A1.R1 129 160
NCBIGene 36.3 6817
Alternative 3-prime, size difference: 31
Inclusion in 5'UTR
Reference transcript: NM_001055
pfam Sulfotransfer_1 250aa 7e-77
in ref transcript
Sulfotransferase domain.
SULT1C2
refseq_SULT1C1.F1 refseq_SULT1C1.R1 160 193
NCBIGene 36.3 6819
Exon skipping and alternative 3-prime or 5-prime, size difference: 33
Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_176825
Changed!
pfam Sulfotransfer_1 261aa 7e-86
in ref transcript
Sulfotransferase domain.
Changed!
pfam Sulfotransfer_1 250aa 2e-95
in modified transcript
SUMO1
refseq_SUMO1.F1 refseq_SUMO1.R1 288 363
NCBIGene 36.3 7341
Single exon skipping, size difference: 75
Exclusion in the protein (no frameshift)
Reference transcript: NM_003352
Changed!
cd Sumo 87aa 2e-31
in ref transcript
Small ubiquitin-related modifier (SUMO) proteins are conjugated to numerous intracellular targets and serve to modulate protein interaction, localization, activity or stability. SUMO (also known as "Smt3" and "sentrin" in other organisms) is linked to several different pathways, including nucleocytoplasmic transport. Attachment of SUMO to targets proteins is stimulated by PIAS (Protein inhibitor of activated STATs) proteins which serve as E3-like ligases.
pfam ubiquitin 66aa 2e-10
in ref transcript
Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1, and Parkin. A number of them are thought to carry a distinctive five-residue motif termed the proteasome-interacting motif (PIM), which may have a biologically significant role in protein delivery to proteasomes and recruitment of proteasomes to transcription sites.
Changed!
COG SMT3 96aa 2e-19
in ref transcript
Ubiquitin-like protein (sentrin) [Posttranslational modification, protein turnover, chaperones].
Changed!
cd Sumo 71aa 1e-27
in modified transcript
Changed!
COG SMT3 72aa 4e-17
in modified transcript
SUOX
refseq_SUOX.F1 refseq_SUOX.R1 121 247
NCBIGene 36.3 6821
Single exon skipping, size difference: 126
Exclusion in 5'UTR
Reference transcript: NM_000456
cd eukary_SO_Moco 359aa 1e-145
in ref transcript
molybdopterin binding domain of sulfite oxidase (SO). SO catalyzes the terminal reaction in the oxidative degradation of the sulfur-containing amino acids cysteine and methionine. Common features of all known members of the sulfite oxidase (SO) family of molybdopterin binding domains are that they contain one single pterin cofactor and part of the coordination of the metal (Mo) is a cysteine ligand of the protein and that they catalyze the transfer of an oxygen to or from a lone pair of electrons on the substrate.
pfam Mo-co_dimer 127aa 5e-46
in ref transcript
Mo-co oxidoreductase dimerisation domain. This domain is found in molybdopterin cofactor (Mo-co) oxidoreductases. It is involved in dimer formation, and has an Ig-fold structure.
pfam Oxidored_molyb 183aa 1e-32
in ref transcript
Oxidoreductase molybdopterin binding domain. This domain is found in a variety of oxidoreductases. This domain binds to a molybdopterin cofactor. Xanthine dehydrogenases, that also bind molybdopterin, have essentially no similarity.
pfam Cyt-b5 77aa 1e-14
in ref transcript
Cytochrome b5-like Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anchored by an N-terminal transmembrane alpha helix. This family also includes a domain in some chitin synthases. There is no known ligand for this domain in the chitin synthases.
COG COG2041 237aa 5e-18
in ref transcript
Sulfite oxidase and related enzymes [General function prediction only].
COG CYB5 90aa 2e-05
in ref transcript
Cytochrome b involved in lipid metabolism [Energy production and conversion / Lipid metabolism].
SUPT3H
refseq_SUPT3H.F1 refseq_SUPT3H.R1 243 428
NCBIGene 36.3 8464
Multiple exon skipping, size difference: 185
Exclusion of the protein initiation site, Exclusion in the protein (no frameshift)
Reference transcript: NM_181356
Changed!
pfam TFIID-18kDa 82aa 5e-18
in ref transcript
Transcription initiation factor IID, 18kD subunit. This family includes the Spt3 yeast transcription factors and the 18kD subunit from human transcription initiation factor IID (TFIID-18). Determination of the crystal structure reveals an atypical histone fold.
Changed!
pfam TFIID-18kDa 91aa 2e-22
in modified transcript
SVIL
refseq_SVIL.F1 refseq_SVIL.R1 160 256
NCBIGene 36.3 6840
Single exon skipping, size difference: 96
Exclusion in the protein (no frameshift)
Reference transcript: NM_021738
smart GEL 95aa 5e-17
in ref transcript
Gelsolin homology domain. Gelsolin/severin/villin homology domain. Calcium-binding and actin-binding. Both intra- and extracellular domains.
smart GEL 97aa 7e-11
in ref transcript
smart VHP 36aa 7e-11
in ref transcript
Villin headpiece domain.
smart GEL 107aa 5e-10
in ref transcript
smart GEL 96aa 4e-05
in ref transcript
smart GEL 99aa 0.002
in ref transcript
SYF2
refseq_SYF2.F1 refseq_SYF2.R1 196 322
NCBIGene 36.3 25949
Single exon skipping, size difference: 126
Exclusion in the protein (no frameshift)
Reference transcript: NM_015484
Changed!
pfam SYF2 145aa 6e-46
in ref transcript
SYF2 splicing factor. Proteins in this family are involved in cell cycle progression and pre-mRNA splicing.
Changed!
pfam SYF2 152aa 4e-49
in modified transcript
SYN3
refseq_SYN3.F1 refseq_SYN3.R1 109 401
NCBIGene 36.3 8224
Single exon skipping, size difference: 292
Exclusion in the protein causing a frameshift
Reference transcript: NM_003490
pfam Synapsin_C 203aa 1e-119
in ref transcript
Synapsin, ATP binding domain. Ca dependent ATP binding in this ATP grasp fold. Function unknown.
pfam Synapsin 105aa 2e-52
in ref transcript
Synapsin, N-terminal domain.
pfam Synapsin_N 29aa 1e-10
in ref transcript
Synapsin N-terminal. This highly conserved domain of synapsin proteins has a serine at position 9 or 10 which is a phosphorylation site. The domain appears to be the part of the molecule that binds to calmodulin.
Changed!
COG RimK 227aa 0.001
in ref transcript
Glutathione synthase/Ribosomal protein S6 modification enzyme (glutaminyl transferase) [Coenzyme metabolism / Translation, ribosomal structure and biogenesis].
Changed!
COG RimK 138aa 0.002
in modified transcript
SYNE1
refseq_SYNE1.F2 refseq_SYNE1.R2 104 125
NCBIGene 36.3 23345
Single exon skipping, size difference: 21
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_182961
cd SPEC 211aa 6e-14
in ref transcript
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here.
Changed!
cd CH 107aa 1e-13
in ref transcript
Calponin homology domain; actin-binding domain which may be present as a single copy or in tandem repeats (which increases binding affinity). The CH domain is found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).
cd SPEC 216aa 6e-11
in ref transcript
cd SPEC 210aa 2e-09
in ref transcript
cd CH 105aa 8e-09
in ref transcript
cd SPEC 219aa 1e-07
in ref transcript
cd SPEC 217aa 4e-07
in ref transcript
cd SPEC 217aa 7e-07
in ref transcript
cd SPEC 215aa 8e-07
in ref transcript
cd SPEC 206aa 2e-05
in ref transcript
cd SPEC 212aa 3e-05
in ref transcript
cd SPEC 210aa 7e-05
in ref transcript
cd SPEC 227aa 8e-05
in ref transcript
cd SPEC 216aa 1e-04
in ref transcript
cd SPEC 217aa 3e-04
in ref transcript
cd SPEC 223aa 7e-04
in ref transcript
cd SPEC 206aa 0.003
in ref transcript
cd SPEC 212aa 0.003
in ref transcript
cd SPEC 208aa 0.004
in ref transcript
Changed!
smart CH 103aa 1e-14
in ref transcript
Calponin homology domain. Actin binding domains present in duplicate at the N-termini of spectrin-like proteins (including dystrophin, alpha-actinin). These domains cross-link actin filaments into bundles and networks. A calponin homology domain is predicted in yeasst Cdc24p.
pfam KASH 60aa 1e-13
in ref transcript
Nuclear envelope localisation domain. The KASH (for Klarsicht/ANC-1/Syne-1 homology) or KLS domain is a highly hydrophobic nuclear envelope localisation domain of approximately 60 amino acids comprising an 20-amino-acid transmembrane region and a 30-35-residue C-terminal region that lies between the inner and the outer nuclear membranes.
pfam CH 104aa 9e-11
in ref transcript
Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is not involved in its actin binding activity. Most proteins have two copies of the CH domain, however some proteins such as calponin have only a single copy.
pfam SMC_N 808aa 2e-09
in ref transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
pfam SMC_N 755aa 2e-06
in ref transcript
smart SPEC 100aa 2e-06
in ref transcript
Spectrin repeats.
pfam Spectrin 98aa 8e-05
in ref transcript
Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectrin repeat forms a three helix bundle. The second helix is interrupted by proline in some sequences. The repeats are defined by a characteristic tryptophan (W) residue at position 17 in helix A and a leucine (L) at 2 residues from the carboxyl end of helix C.
pfam SMC_N 247aa 3e-04
in ref transcript
pfam SMC_N 199aa 5e-04
in ref transcript
TIGR SMC_prok_B 254aa 6e-04
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
smart SPEC 103aa 0.001
in ref transcript
smart SPEC 97aa 0.002
in ref transcript
pfam SMC_N 247aa 0.003
in ref transcript
pfam Spectrin 101aa 0.005
in ref transcript
pfam SMC_N 296aa 0.009
in ref transcript
Changed!
COG SAC6 259aa 3e-20
in ref transcript
Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton].
COG Smc 854aa 2e-11
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
COG Smc 819aa 6e-06
in ref transcript
COG Smc 303aa 0.002
in ref transcript
PRK PRK02224 459aa 0.002
in ref transcript
chromosome segregation protein; Provisional.
Changed!
cd CH 114aa 1e-11
in modified transcript
Changed!
smart CH 110aa 2e-12
in modified transcript
Changed!
TIGR SMC_prok_B 839aa 0.010
in modified transcript
Changed!
COG SAC6 266aa 3e-18
in modified transcript
SYNE1
refseq_SYNE1.F3 refseq_SYNE1.R3 251 317
NCBIGene 36.3 23345
Alternative 5-prime, size difference: 66
Exclusion in the protein (no frameshift)
Reference transcript: NM_182961
cd SPEC 211aa 6e-14
in ref transcript
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here.
cd CH 107aa 1e-13
in ref transcript
Calponin homology domain; actin-binding domain which may be present as a single copy or in tandem repeats (which increases binding affinity). The CH domain is found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).
cd SPEC 216aa 6e-11
in ref transcript
cd SPEC 210aa 2e-09
in ref transcript
cd CH 105aa 8e-09
in ref transcript
cd SPEC 219aa 1e-07
in ref transcript
cd SPEC 217aa 4e-07
in ref transcript
cd SPEC 217aa 7e-07
in ref transcript
cd SPEC 215aa 8e-07
in ref transcript
cd SPEC 206aa 2e-05
in ref transcript
cd SPEC 212aa 3e-05
in ref transcript
cd SPEC 210aa 7e-05
in ref transcript
cd SPEC 227aa 8e-05
in ref transcript
cd SPEC 216aa 1e-04
in ref transcript
cd SPEC 217aa 3e-04
in ref transcript
cd SPEC 223aa 7e-04
in ref transcript
cd SPEC 206aa 0.003
in ref transcript
cd SPEC 212aa 0.003
in ref transcript
cd SPEC 208aa 0.004
in ref transcript
smart CH 103aa 1e-14
in ref transcript
Calponin homology domain. Actin binding domains present in duplicate at the N-termini of spectrin-like proteins (including dystrophin, alpha-actinin). These domains cross-link actin filaments into bundles and networks. A calponin homology domain is predicted in yeasst Cdc24p.
pfam KASH 60aa 1e-13
in ref transcript
Nuclear envelope localisation domain. The KASH (for Klarsicht/ANC-1/Syne-1 homology) or KLS domain is a highly hydrophobic nuclear envelope localisation domain of approximately 60 amino acids comprising an 20-amino-acid transmembrane region and a 30-35-residue C-terminal region that lies between the inner and the outer nuclear membranes.
pfam CH 104aa 9e-11
in ref transcript
Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is not involved in its actin binding activity. Most proteins have two copies of the CH domain, however some proteins such as calponin have only a single copy.
pfam SMC_N 808aa 2e-09
in ref transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
pfam SMC_N 755aa 2e-06
in ref transcript
smart SPEC 100aa 2e-06
in ref transcript
Spectrin repeats.
pfam Spectrin 98aa 8e-05
in ref transcript
Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectrin repeat forms a three helix bundle. The second helix is interrupted by proline in some sequences. The repeats are defined by a characteristic tryptophan (W) residue at position 17 in helix A and a leucine (L) at 2 residues from the carboxyl end of helix C.
pfam SMC_N 247aa 3e-04
in ref transcript
pfam SMC_N 199aa 5e-04
in ref transcript
TIGR SMC_prok_B 254aa 6e-04
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
smart SPEC 103aa 0.001
in ref transcript
smart SPEC 97aa 0.002
in ref transcript
pfam SMC_N 247aa 0.003
in ref transcript
pfam Spectrin 101aa 0.005
in ref transcript
pfam SMC_N 296aa 0.009
in ref transcript
COG SAC6 259aa 3e-20
in ref transcript
Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton].
COG Smc 854aa 2e-11
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
Changed!
COG Smc 819aa 6e-06
in ref transcript
COG Smc 303aa 0.002
in ref transcript
PRK PRK02224 459aa 0.002
in ref transcript
chromosome segregation protein; Provisional.
Changed!
TIGR SMC_prok_A 180aa 0.005
in modified transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
Changed!
TIGR SMC_prok_B 839aa 0.010
in modified transcript
Changed!
COG Smc 760aa 4e-06
in modified transcript
SYNE1
refseq_SYNE1.F5 refseq_SYNE1.R5 157 226
NCBIGene 36.3 23345
Single exon skipping, size difference: 69
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_182961
cd SPEC 211aa 6e-14
in ref transcript
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here.
cd CH 107aa 1e-13
in ref transcript
Calponin homology domain; actin-binding domain which may be present as a single copy or in tandem repeats (which increases binding affinity). The CH domain is found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).
cd SPEC 216aa 6e-11
in ref transcript
cd SPEC 210aa 2e-09
in ref transcript
cd CH 105aa 8e-09
in ref transcript
cd SPEC 219aa 1e-07
in ref transcript
cd SPEC 217aa 4e-07
in ref transcript
cd SPEC 217aa 7e-07
in ref transcript
cd SPEC 215aa 8e-07
in ref transcript
cd SPEC 206aa 2e-05
in ref transcript
cd SPEC 212aa 3e-05
in ref transcript
cd SPEC 210aa 7e-05
in ref transcript
cd SPEC 227aa 8e-05
in ref transcript
cd SPEC 216aa 1e-04
in ref transcript
cd SPEC 217aa 3e-04
in ref transcript
cd SPEC 223aa 7e-04
in ref transcript
cd SPEC 206aa 0.003
in ref transcript
cd SPEC 212aa 0.003
in ref transcript
cd SPEC 208aa 0.004
in ref transcript
smart CH 103aa 1e-14
in ref transcript
Calponin homology domain. Actin binding domains present in duplicate at the N-termini of spectrin-like proteins (including dystrophin, alpha-actinin). These domains cross-link actin filaments into bundles and networks. A calponin homology domain is predicted in yeasst Cdc24p.
pfam KASH 60aa 1e-13
in ref transcript
Nuclear envelope localisation domain. The KASH (for Klarsicht/ANC-1/Syne-1 homology) or KLS domain is a highly hydrophobic nuclear envelope localisation domain of approximately 60 amino acids comprising an 20-amino-acid transmembrane region and a 30-35-residue C-terminal region that lies between the inner and the outer nuclear membranes.
pfam CH 104aa 9e-11
in ref transcript
Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is not involved in its actin binding activity. Most proteins have two copies of the CH domain, however some proteins such as calponin have only a single copy.
pfam SMC_N 808aa 2e-09
in ref transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
pfam SMC_N 755aa 2e-06
in ref transcript
smart SPEC 100aa 2e-06
in ref transcript
Spectrin repeats.
pfam Spectrin 98aa 8e-05
in ref transcript
Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectrin repeat forms a three helix bundle. The second helix is interrupted by proline in some sequences. The repeats are defined by a characteristic tryptophan (W) residue at position 17 in helix A and a leucine (L) at 2 residues from the carboxyl end of helix C.
pfam SMC_N 247aa 3e-04
in ref transcript
pfam SMC_N 199aa 5e-04
in ref transcript
TIGR SMC_prok_B 254aa 6e-04
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
smart SPEC 103aa 0.001
in ref transcript
smart SPEC 97aa 0.002
in ref transcript
pfam SMC_N 247aa 0.003
in ref transcript
pfam Spectrin 101aa 0.005
in ref transcript
pfam SMC_N 296aa 0.009
in ref transcript
COG SAC6 259aa 3e-20
in ref transcript
Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton].
COG Smc 854aa 2e-11
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
COG Smc 819aa 6e-06
in ref transcript
COG Smc 303aa 0.002
in ref transcript
PRK PRK02224 459aa 0.002
in ref transcript
chromosome segregation protein; Provisional.
Changed!
TIGR SMC_prok_B 839aa 0.010
in modified transcript
SYNE1
refseq_SYNE1.F8 refseq_SYNE1.R8 236 404
NCBIGene 36.3 23345
Single exon skipping, size difference: 168
Exclusion in the protein (no frameshift)
Reference transcript: NM_182961
cd SPEC 211aa 6e-14
in ref transcript
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here.
cd CH 107aa 1e-13
in ref transcript
Calponin homology domain; actin-binding domain which may be present as a single copy or in tandem repeats (which increases binding affinity). The CH domain is found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).
cd SPEC 216aa 6e-11
in ref transcript
cd SPEC 210aa 2e-09
in ref transcript
cd CH 105aa 8e-09
in ref transcript
Changed!
cd SPEC 219aa 1e-07
in ref transcript
cd SPEC 217aa 4e-07
in ref transcript
cd SPEC 217aa 7e-07
in ref transcript
cd SPEC 215aa 8e-07
in ref transcript
cd SPEC 206aa 2e-05
in ref transcript
cd SPEC 212aa 3e-05
in ref transcript
cd SPEC 210aa 7e-05
in ref transcript
Changed!
cd SPEC 227aa 8e-05
in ref transcript
cd SPEC 216aa 1e-04
in ref transcript
cd SPEC 217aa 3e-04
in ref transcript
cd SPEC 223aa 7e-04
in ref transcript
cd SPEC 206aa 0.003
in ref transcript
cd SPEC 212aa 0.003
in ref transcript
cd SPEC 208aa 0.004
in ref transcript
smart CH 103aa 1e-14
in ref transcript
Calponin homology domain. Actin binding domains present in duplicate at the N-termini of spectrin-like proteins (including dystrophin, alpha-actinin). These domains cross-link actin filaments into bundles and networks. A calponin homology domain is predicted in yeasst Cdc24p.
pfam KASH 60aa 1e-13
in ref transcript
Nuclear envelope localisation domain. The KASH (for Klarsicht/ANC-1/Syne-1 homology) or KLS domain is a highly hydrophobic nuclear envelope localisation domain of approximately 60 amino acids comprising an 20-amino-acid transmembrane region and a 30-35-residue C-terminal region that lies between the inner and the outer nuclear membranes.
pfam CH 104aa 9e-11
in ref transcript
Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is not involved in its actin binding activity. Most proteins have two copies of the CH domain, however some proteins such as calponin have only a single copy.
pfam SMC_N 808aa 2e-09
in ref transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
pfam SMC_N 755aa 2e-06
in ref transcript
smart SPEC 100aa 2e-06
in ref transcript
Spectrin repeats.
pfam Spectrin 98aa 8e-05
in ref transcript
Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectrin repeat forms a three helix bundle. The second helix is interrupted by proline in some sequences. The repeats are defined by a characteristic tryptophan (W) residue at position 17 in helix A and a leucine (L) at 2 residues from the carboxyl end of helix C.
pfam SMC_N 247aa 3e-04
in ref transcript
pfam SMC_N 199aa 5e-04
in ref transcript
TIGR SMC_prok_B 254aa 6e-04
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
smart SPEC 103aa 0.001
in ref transcript
smart SPEC 97aa 0.002
in ref transcript
pfam SMC_N 247aa 0.003
in ref transcript
pfam Spectrin 101aa 0.005
in ref transcript
pfam SMC_N 296aa 0.009
in ref transcript
COG SAC6 259aa 3e-20
in ref transcript
Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton].
COG Smc 854aa 2e-11
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
COG Smc 819aa 6e-06
in ref transcript
COG Smc 303aa 0.002
in ref transcript
PRK PRK02224 459aa 0.002
in ref transcript
chromosome segregation protein; Provisional.
Changed!
pfam SMC_N 200aa 0.002
in modified transcript
Changed!
pfam SMC_N 336aa 0.003
in modified transcript
Changed!
TIGR SMC_prok_B 839aa 0.010
in modified transcript
Changed!
COG SbcC 570aa 4e-04
in modified transcript
ATPase involved in DNA repair [DNA replication, recombination, and repair].
SYNJ1
refseq_SYNJ1.F2 refseq_SYNJ1.R2 174 222
NCBIGene 36.3 8867
Single exon skipping, size difference: 48
Exclusion in the protein (no frameshift)
Reference transcript: NM_003895
cd RRM 59aa 3e-04
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
pfam Syja_N 289aa 1e-113
in ref transcript
SacI homology domain. This Pfam family represents a protein domain which shows homology to the yeast protein SacI. The SacI homology domain is most notably found at the amino terminal of the inositol 5'-phosphatase synaptojanin.
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
pfam Syja_N 289aa 1e-113
in ref transcript
SacI homology domain. This Pfam family represents a protein domain which shows homology to the yeast protein SacI. The SacI homology domain is most notably found at the amino terminal of the inositol 5'-phosphatase synaptojanin.
Changed!
COG COG5411 369aa 5e-58
in modified transcript
SYTL2
refseq_SYTL2.F1 refseq_SYTL2.R1 203 323
NCBIGene 36.3 54843
Single exon skipping, size difference: 120
Exclusion in the protein (no frameshift)
Reference transcript: NM_206927
cd C2_1 127aa 9e-17
in ref transcript
Protein kinase C conserved region 2, subgroup 1; C2 Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (amongst others); some PKCs lack calcium dependence. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Two distinct C2 topologies generated by permutation of the sequence with respect to the N- and C-terminal beta strands are seen. In this subgroup, containing synaptotagmins, specific protein kinases C (PKC) subtypes and other proteins, the N-terminal beta strand occupies the position of what is the C-terminal strand in subgroup 2.
cd C2_1 128aa 4e-15
in ref transcript
pfam C2 90aa 2e-10
in ref transcript
C2 domain.
pfam C2 88aa 7e-10
in ref transcript
TAC1
refseq_TAC1.F1 refseq_TAC1.R1 106 151
NCBIGene 36.3 6863
Single exon skipping, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_003182
TAC1
refseq_TAC1.F2 refseq_TAC1.R2 114 168
NCBIGene 36.3 6863
Single exon skipping, size difference: 54
Exclusion in the protein (no frameshift)
Reference transcript: NM_003182
TAC3
refseq_TAC3.F1 refseq_TAC3.R1 127 244
NCBIGene 36.3 6866
Single exon skipping, size difference: 117
Exclusion of the stop codon
Reference transcript: NM_001006667
pfam Neurokinin_B 55aa 4e-20
in ref transcript
Neurokinin B.
TACC2
refseq_TACC2.F1 refseq_TACC2.R1 124 214
NCBIGene 36.3 10579
Single exon skipping, size difference: 90
Exclusion in the protein (no frameshift)
Reference transcript: NM_206862
pfam TACC 206aa 2e-92
in ref transcript
Transforming acidic coiled-coil-containing protein (TACC). This family contains the proteins TACC 1, 2 and 3 the genes for which are found concentrated in the centrosomes of eukaryotic and may play a conserved role in organising centrosomal microtubules. The human TACC proteins have been linked to cancer and TACC2 has been identified as a possible tumour suppressor (AZU-1). The functional homologue (Alp7) in Schizosaccharomyces pombe has been shown to be required for organisation of bipolar spindles.
COG Smc 200aa 7e-10
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
PRK PRK07003 236aa 2e-04
in ref transcript
DNA polymerase III subunits gamma and tau; Validated.
TACC2
refseq_TACC2.F4 refseq_TACC2.R4 123 258
NCBIGene 36.3 10579
Single exon skipping, size difference: 135
Exclusion in the protein (no frameshift)
Reference transcript: NM_206862
pfam TACC 206aa 2e-92
in ref transcript
Transforming acidic coiled-coil-containing protein (TACC). This family contains the proteins TACC 1, 2 and 3 the genes for which are found concentrated in the centrosomes of eukaryotic and may play a conserved role in organising centrosomal microtubules. The human TACC proteins have been linked to cancer and TACC2 has been identified as a possible tumour suppressor (AZU-1). The functional homologue (Alp7) in Schizosaccharomyces pombe has been shown to be required for organisation of bipolar spindles.
COG Smc 200aa 7e-10
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
PRK PRK07003 236aa 2e-04
in ref transcript
DNA polymerase III subunits gamma and tau; Validated.
TAF1
refseq_TAF1.F2 refseq_TAF1.R2 229 292
NCBIGene 36.3 6872
Alternative 3-prime, size difference: 63
Exclusion in the protein (no frameshift)
Reference transcript: NM_004606
cd Bromo_TFIID 113aa 7e-51
in ref transcript
Bromodomain, TFIID-like subfamily. Human TAFII250 (or TAF250) is the largest subunit of TFIID, a large multi-domain complex, which initiates the assembly of the transcription machinery. TAFII250 contains two bromodomains that specifically bind to acetylated histone H4. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
cd Bromo_TFIID 89aa 2e-37
in ref transcript
smart BROMO 108aa 2e-27
in ref transcript
bromo domain.
smart BROMO 96aa 1e-20
in ref transcript
pfam TBP-binding 61aa 4e-11
in ref transcript
TATA box-binding protein binding. Members of this family adopt a structure consisting of three alpha helices and a beta-hairpin. They bind to TATA box-binding protein (TBP), inhibiting TBP interaction with the TATA element, thereby resulting in shutting down of gene transcription.
Changed!
pfam TAF 56aa 2e-17
in modified transcript
Changed!
COG TAF6 390aa 6e-66
in modified transcript
TAGAP
refseq_TAGAP.F1 refseq_TAGAP.R1 201 368
NCBIGene 36.3 117289
Single exon skipping, size difference: 167
Exclusion in the protein causing a frameshift
Reference transcript: NM_054114
Changed!
cd RhoGAP_ARHGAP20 197aa 7e-66
in ref transcript
RhoGAP_ARHGAP20: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of ArhGAP20-like proteins. ArhGAP20, also known as KIAA1391 and RA-RhoGAP, contains a RhoGAP, a RA, and a PH domain, and ANXL repeats. ArhGAP20 is activated by Rap1 and induces inactivation of Rho, which in turn leads to neurite outgrowth. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude.
Changed!
pfam RhoGAP 142aa 5e-31
in ref transcript
RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases.
TAGLN
refseq_TAGLN.F1 refseq_TAGLN.R1 100 497
NCBIGene 36.3 6876
Alternative 5-prime, size difference: 397
Exclusion in 5'UTR
Reference transcript: NM_001001522
cd CH 113aa 5e-16
in ref transcript
Calponin homology domain; actin-binding domain which may be present as a single copy or in tandem repeats (which increases binding affinity). The CH domain is found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).
smart CH 107aa 1e-16
in ref transcript
Calponin homology domain. Actin binding domains present in duplicate at the N-termini of spectrin-like proteins (including dystrophin, alpha-actinin). These domains cross-link actin filaments into bundles and networks. A calponin homology domain is predicted in yeasst Cdc24p.
pfam Calponin 26aa 7e-06
in ref transcript
Calponin family repeat.
COG SCP1 177aa 1e-16
in ref transcript
Calponin [Cytoskeleton].
TAGLN3
refseq_TAGLN3.F1 refseq_TAGLN3.R1 167 341
NCBIGene 36.3 29114
Alternative 3-prime, size difference: 174
Inclusion in 5'UTR
Reference transcript: NM_001008272
cd CH 109aa 1e-17
in ref transcript
Calponin homology domain; actin-binding domain which may be present as a single copy or in tandem repeats (which increases binding affinity). The CH domain is found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).
smart CH 106aa 3e-19
in ref transcript
Calponin homology domain. Actin binding domains present in duplicate at the N-termini of spectrin-like proteins (including dystrophin, alpha-actinin). These domains cross-link actin filaments into bundles and networks. A calponin homology domain is predicted in yeasst Cdc24p.
pfam Calponin 25aa 1e-05
in ref transcript
Calponin family repeat.
COG SCP1 176aa 3e-18
in ref transcript
Calponin [Cytoskeleton].
TAS1R1
refseq_TAS1R1.F2 refseq_TAS1R1.R2 103 437
NCBIGene 36.3 80835
Multiple exon skipping, size difference: 334
Exclusion in the protein (no frameshift), Exclusion in the protein causing a frameshift
Reference transcript: NM_138697
Changed!
cd PBP1_Taste_receptor 455aa 1e-175
in ref transcript
Ligand-binding domain of the T1R taste receptor. The T1R is a member of the family C receptors within the G-protein coupled receptor superfamily, which also includes the metabotropic glutamate receptors, GABAb receptors, the calcium-sensing receptor (CaSR), the V2R pheromone receptors, and a small group of uncharacterized orphan receptors.
Changed!
pfam ANF_receptor 384aa 1e-48
in ref transcript
Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure.
Changed!
pfam 7tm_3 235aa 8e-34
in ref transcript
7 transmembrane sweet-taste receptor of 3 GCPR. This is a domain of seven transmembrane regions that forms the C-terminus of some subclass 3 G-coupled-protein receptors. It is often associated with a downstream cysteine-rich linker domain, NCD3G pfam07562, which is the human sweet-taste receptor, and the N-terminal domain, ANF_receptor pfam01094. The seven TM regions assemble in such a way as to produce a docking pocket into which such molecules as cyclamate and lactisole have been found to bind and consequently confer the taste of sweetness.
Changed!
pfam NCD3G 54aa 7e-16
in ref transcript
Nine Cysteines Domain of family 3 GPCR. This conserved sequence contains several highly-conserved Cys residues that are predicted to form disulphide bridges. It is predicted to lie outside the cell membrane, tethered to the pfam00003 in several receptor proteins.
Changed!
COG LivK 280aa 3e-15
in ref transcript
ABC-type branched-chain amino acid transport systems, periplasmic component [Amino acid transport and metabolism].
Changed!
cd PBP1_Taste_receptor 389aa 1e-147
in modified transcript
Changed!
pfam ANF_receptor 283aa 4e-45
in modified transcript
Changed!
COG LivK 212aa 1e-14
in modified transcript
TAZ
refseq_TAZ.F1 refseq_TAZ.R1 101 207
NCBIGene 36.2 6901
Alternative 5-prime, size difference: 106
Inclusion in the protein causing a new stop codon
Reference transcript: NM_000116
Changed!
pfam Acyltransferase 175aa 3e-18
in ref transcript
Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene.
Changed!
PRK PRK08633 232aa 4e-06
in ref transcript
Changed!
pfam Acyltransferase 46aa 0.001
in modified transcript
TAZ
refseq_TAZ.F2 refseq_TAZ.R2 100 190
NCBIGene 36.3 6901
Single exon skipping, size difference: 90
Exclusion in the protein (no frameshift)
Reference transcript: NM_000116
Changed!
pfam Acyltransferase 175aa 3e-18
in ref transcript
Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene.
Changed!
PRK PRK08633 232aa 4e-06
in ref transcript
Changed!
pfam Acyltransferase 145aa 5e-21
in modified transcript
Changed!
PRK PRK08633 202aa 7e-09
in modified transcript
TAZ
refseq_TAZ.F3 refseq_TAZ.R3 128 170
NCBIGene 36.3 6901
Single exon skipping, size difference: 42
Exclusion in the protein (no frameshift)
Reference transcript: NM_000116
Changed!
pfam Acyltransferase 175aa 3e-18
in ref transcript
Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene.
Changed!
PRK PRK08633 232aa 4e-06
in ref transcript
Changed!
pfam Acyltransferase 141aa 3e-12
in modified transcript
Changed!
PRK PRK08633 218aa 4e-04
in modified transcript
TBC1D9B
refseq_TBC1D9B.F1 refseq_TBC1D9B.R1 129 180
NCBIGene 36.3 23061
Single exon skipping, size difference: 51
Exclusion in the protein (no frameshift)
Reference transcript: NM_198868
pfam TBC 210aa 3e-49
in ref transcript
TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases.
pfam GRAM 59aa 4e-16
in ref transcript
GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins.
pfam GRAM 59aa 7e-13
in ref transcript
COG COG5210 343aa 2e-40
in ref transcript
GTPase-activating protein [General function prediction only].
TBCD
refseq_TBCD.F1 refseq_TBCD.R1 289 340
NCBIGene 36.2 6904
Single exon skipping, size difference: 51
Exclusion in the protein (no frameshift)
Reference transcript: NM_005993
COG CIN1 277aa 4e-20
in ref transcript
Beta-tubulin folding cofactor D [Posttranslational modification, protein turnover, chaperones / Cytoskeleton].
COG CIN1 63aa 2e-04
in ref transcript
Changed!
COG CIN1 109aa 0.003
in ref transcript
TBL1Y
refseq_TBL1Y.F1 refseq_TBL1Y.R1 114 209
NCBIGene 36.3 90665
Single exon skipping, size difference: 95
Exclusion in 5'UTR
Reference transcript: NM_033284
cd WD40 307aa 8e-63
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
smart WD40 40aa 1e-07
in ref transcript
WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
smart WD40 38aa 2e-07
in ref transcript
pfam LisH 27aa 2e-06
in ref transcript
LisH. The LisH (lis homology) domain mediates protein dimerisation and tetramerisation. The LisH domain is found in Sif2, a component of the Set3 complex which is responsible for repressing meiotic genes. It has been shown that the LisH domain helps mediate interaction with components of the Set3 complex.
pfam WD40 35aa 1e-04
in ref transcript
WD domain, G-beta repeat.
smart WD40 46aa 2e-04
in ref transcript
smart WD40 40aa 0.001
in ref transcript
pfam eIF2A 209aa 0.002
in ref transcript
Eukaryotic translation initiation factor eIF2A. This is a family of eukaryotic translation initiation factors.
COG COG2319 378aa 5e-30
in ref transcript
FOG: WD40 repeat [General function prediction only].
TBL1Y
refseq_TBL1Y.F2 refseq_TBL1Y.R2 134 165
NCBIGene 36.3 90665
Single exon skipping, size difference: 31
Exclusion in 5'UTR
Reference transcript: NM_033284
cd WD40 307aa 8e-63
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
smart WD40 40aa 1e-07
in ref transcript
WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
smart WD40 38aa 2e-07
in ref transcript
pfam LisH 27aa 2e-06
in ref transcript
LisH. The LisH (lis homology) domain mediates protein dimerisation and tetramerisation. The LisH domain is found in Sif2, a component of the Set3 complex which is responsible for repressing meiotic genes. It has been shown that the LisH domain helps mediate interaction with components of the Set3 complex.
pfam WD40 35aa 1e-04
in ref transcript
WD domain, G-beta repeat.
smart WD40 46aa 2e-04
in ref transcript
smart WD40 40aa 0.001
in ref transcript
pfam eIF2A 209aa 0.002
in ref transcript
Eukaryotic translation initiation factor eIF2A. This is a family of eukaryotic translation initiation factors.
COG COG2319 378aa 5e-30
in ref transcript
FOG: WD40 repeat [General function prediction only].
TBRG4
refseq_TBRG4.F2 refseq_TBRG4.R2 102 432
NCBIGene 36.3 9238
Multiple exon skipping, size difference: 330
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_004749
pfam FAST_1 71aa 3e-17
in ref transcript
FAST kinase-like protein, subdomain 1. This family represents a conserved region of eukaryotic Fas-activated serine/threonine (FAST) kinases (EC:2.7.1.-) that contains several conserved leucine residues. FAST kinase is rapidly activated during Fas-mediated apoptosis, when it phosphorylates TIA-1, a nuclear RNA-binding protein that has been implicated as an effector of apoptosis. Note that many family members are hypothetical proteins. This region is often found immediately N-terminal to the FAST kinase-like protein, subdomain 2.
pfam RAP 58aa 4e-12
in ref transcript
RAP domain. This domain is found in various eukaryotic species, particularly in apicomplexans such as Plasmodium falciparum, where it is found in proteins that are important in various parasite-host cell interactions. It is thought to be an RNA-binding domain.
pfam FAST_2 84aa 3e-11
in ref transcript
FAST kinase-like protein, subdomain 2. This family represents a conserved region of eukaryotic Fas-activated serine/threonine (FAST) kinases (EC:2.7.1.-) that contains several conserved leucine residues. FAST kinase is rapidly activated during Fas-mediated apoptosis, when it phosphorylates TIA-1, a nuclear RNA-binding protein that has been implicated as an effector of apoptosis. Note that many family members are hypothetical proteins. This subdomain is often found associated with the FAST kinase-like protein, subdomain 2.
TBRG4
refseq_TBRG4.F3 refseq_TBRG4.R3 115 136
NCBIGene 36.3 9238
Alternative 5-prime, size difference: 21
Exclusion in 5'UTR
Reference transcript: NM_004749
pfam FAST_1 71aa 3e-17
in ref transcript
FAST kinase-like protein, subdomain 1. This family represents a conserved region of eukaryotic Fas-activated serine/threonine (FAST) kinases (EC:2.7.1.-) that contains several conserved leucine residues. FAST kinase is rapidly activated during Fas-mediated apoptosis, when it phosphorylates TIA-1, a nuclear RNA-binding protein that has been implicated as an effector of apoptosis. Note that many family members are hypothetical proteins. This region is often found immediately N-terminal to the FAST kinase-like protein, subdomain 2.
pfam RAP 58aa 4e-12
in ref transcript
RAP domain. This domain is found in various eukaryotic species, particularly in apicomplexans such as Plasmodium falciparum, where it is found in proteins that are important in various parasite-host cell interactions. It is thought to be an RNA-binding domain.
pfam FAST_2 84aa 3e-11
in ref transcript
FAST kinase-like protein, subdomain 2. This family represents a conserved region of eukaryotic Fas-activated serine/threonine (FAST) kinases (EC:2.7.1.-) that contains several conserved leucine residues. FAST kinase is rapidly activated during Fas-mediated apoptosis, when it phosphorylates TIA-1, a nuclear RNA-binding protein that has been implicated as an effector of apoptosis. Note that many family members are hypothetical proteins. This subdomain is often found associated with the FAST kinase-like protein, subdomain 2.
TBX5
refseq_TBX5.F1 refseq_TBX5.R1 166 351
NCBIGene 36.3 6910
Single exon skipping, size difference: 185
Exclusion of the protein initiation site
Reference transcript: NM_000192
cd TBOX 189aa 5e-91
in ref transcript
T-box DNA binding domain of the T-box family of transcriptional regulators. The T-box family is an ancient group that appears to play a critical role in development in all animal species. These genes were uncovered on the basis of similarity to the DNA binding domain of murine Brachyury (T) gene product, the defining feature of the family. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development and conserved expression patterns, most of the known genes in all species being expressed in mesoderm or mesoderm precursors.
pfam T-box 183aa 9e-90
in ref transcript
T-box. The T-box encodes a 180 amino acid domain that binds to DNA. Genes encoding T-box proteins are found in a wide range of animals, but not in other kingdoms such as plants. Family members are all thought to bind to the DNA consensus sequence TCACACCT. they are found exclusively in the nucleus, and perform DNA-binding and transcriptional activation/repression roles. They are generally required for development of the specific tissues they are expressed in, and mutations in T-box genes are implicated in human conditions such as DiGeorge syndrome and X-linked cleft palate, which feature malformations.
TBXAS1
refseq_TBXAS1.F1 refseq_TBXAS1.R1 157 320
NCBIGene 36.3 6916
Single exon skipping, size difference: 163
Exclusion in the protein causing a frameshift
Reference transcript: NM_001061
Changed!
pfam p450 482aa 2e-71
in ref transcript
Cytochrome P450. Cytochrome P450s are haem-thiolate proteins involved in the oxidative degradation of various compounds. They are particularly well known for their role in the degradation of environmental toxins and mutagens. They can be divided into 4 classes, according to the method by which electrons from NAD(P)H are delivered to the catalytic site. Sequence conservation is relatively low within the family - there are only 3 absolutely conserved residues - but their general topography and structural fold are highly conserved. The conserved core is composed of a coil termed the 'meander', a four-helix bundle, helices J and K, and two sets of beta-sheets. These constitute the haem-binding loop (with an absolutely conserved cysteine that serves as the 5th ligand for the haem iron), the proton-transfer groove and the absolutely conserved EXXR motif in helix K. While prokaryotic P450s are soluble proteins, most eukaryotic P450s are associated with microsomal membranes. their general enzymatic function is to catalyse regiospecific and stereospecific oxidation of non-activated hydrocarbons at physiological temperatures.
Changed!
COG CypX 478aa 3e-38
in ref transcript
Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism].
Changed!
pfam p450 410aa 2e-57
in modified transcript
Changed!
COG CypX 410aa 3e-32
in modified transcript
TCEA1
refseq_TCEA1.F1 refseq_TCEA1.R1 269 332
NCBIGene 36.3 6917
Single exon skipping, size difference: 63
Exclusion in the protein (no frameshift)
Reference transcript: NM_006756
Changed!
cd TFIIS_I 77aa 7e-16
in ref transcript
N-terminal domain (domain I) of transcription elongation factor S-II (TFIIS); similar to a domain found in elongin A and CRSP70; likely to be involved in transcription; domain I from TFIIS interacts with RNA polymerase II holoenzyme.
Changed!
TIGR TFSII 298aa 4e-89
in ref transcript
This model represents eukaryotic transcription elongation factor S-II. This protein allows stalled RNA transcription complexes to perform a cleavage of the nascent RNA and restart at the newly generated 3-prime end.
Changed!
cd TFIIS_I 56aa 1e-05
in modified transcript
Changed!
TIGR TFSII 277aa 9e-77
in modified transcript
TCEAL8
refseq_TCEAL8.F1 refseq_TCEAL8.R1 228 301
NCBIGene 36.3 90843
Single exon skipping, size difference: 73
Exclusion in 5'UTR
Reference transcript: NM_153333
pfam TFA 111aa 3e-08
in ref transcript
Transcription elongation factor A, SII-related family. The function of this family is unclear, but two members from Homo sapiesn are described as transcription elongation factor A, SII-like proteins.
TCERG1
refseq_TCERG1.F1 refseq_TCERG1.R1 130 193
NCBIGene 36.3 10915
Single exon skipping, size difference: 63
Exclusion in the protein (no frameshift)
Reference transcript: NM_006706
cd WW 28aa 2e-04
in ref transcript
Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other peptide recognition proteins such as antibodies and receptors; WW domains have a single groove formed by a conserved Trp and Tyr which recognizes a pair of residues of the sequence X-Pro; variable loops and neighboring domains confer specificity in this domain; there are five distinct groups based on binding: 1) PPXY motifs 2) the PPLP motif; 3) PGM motifs; 4) PSP or PTP motifs; 5) PR motifs.
cd WW 25aa 0.004
in ref transcript
cd WW 30aa 0.008
in ref transcript
pfam FF 52aa 5e-09
in ref transcript
FF domain. This domain has been predicted to be involved in protein-protein interaction. This domain was recently shown to bind the hyperphosphorylated C-terminal repeat domain of RNA polymerase II, confirming its role in protein-protein interactions.
smart FF 54aa 4e-07
in ref transcript
Contains two conserved F residues. A novel motif that often accompanies WW domains. Often contains two conserved Phe (F) residues.
pfam FF 61aa 1e-05
in ref transcript
smart FF 54aa 1e-05
in ref transcript
smart WW 28aa 2e-05
in ref transcript
Domain with 2 conserved Trp (W) residues. Also known as the WWP or rsp5 domain. Binds proline-rich polypeptides.
smart FF 56aa 2e-04
in ref transcript
pfam WW 27aa 5e-04
in ref transcript
WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro.
pfam FF 48aa 6e-04
in ref transcript
pfam WW 25aa 0.001
in ref transcript
COG PRP40 350aa 3e-09
in ref transcript
Splicing factor [RNA processing and modification].
COG PRP40 38aa 4e-04
in ref transcript
COG SbcC 223aa 0.005
in ref transcript
ATPase involved in DNA repair [DNA replication, recombination, and repair].
TCF12
refseq_TCF12.F2 refseq_TCF12.R2 219 291
NCBIGene 36.3 6938
Single exon skipping, size difference: 72
Exclusion in the protein (no frameshift)
Reference transcript: NM_207037
cd HLH 61aa 1e-06
in ref transcript
Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
smart HLH 54aa 7e-10
in ref transcript
helix loop helix domain.
TCF20
refseq_TCF20.F2 refseq_TCF20.R2 101 229
NCBIGene 36.3 6942
Single exon skipping, size difference: 128
Exclusion of the stop codon
Reference transcript: NM_005650
TCF7
refseq_TCF7.F1 refseq_TCF7.R1 100 196
NCBIGene 36.3 6932
Exon skipping and alternative 3-prime or 5-prime, size difference: 96
Inclusion in the protein causing a frameshift, Inclusion in the protein causing a new stop codon
Reference transcript: NM_003202
cd SOX-TCF_HMG-box 71aa 9e-24
in ref transcript
SOX-TCF_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include SRY and its homologs in insects and vertebrates, and transcription factor-like proteins, TCF-1, -3, -4, and LEF-1. They appear to bind the minor groove of the A/T C A A A G/C-motif.
pfam CTNNB1_binding 187aa 6e-33
in ref transcript
N-terminal CTNNB1 binding. This region tends to appear at the N-terminus of proteins also containing DNA-binding HMG (high mobility group) boxes (pfam00505) and appears to bind the armadillo repeat of CTNNB1 (beta-catenin), forming a stable complex. Signaling by Wnt through TCF/LCF is involved in developmental patterning, induction of neural tissues, cell fate decisions and stem cell differentiation. Isoforms of HMG T-cell factors lacking the N-terminal CTNNB1-binding domain cannot fulfill their role as transcriptional activators in T-cell differentiation.
pfam HMG_box 68aa 5e-18
in ref transcript
HMG (high mobility group) box.
Changed!
PTZ PTZ00199 57aa 4e-04
in ref transcript
high mobility group protein; Provisional.
Changed!
COG NHP6B 96aa 2e-04
in modified transcript
Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics].
TCL6
refseq_TCL6.F1 refseq_TCL6.R1 118 254
NCBIGene 36.3 27004
Single exon skipping, size difference: 136
Exclusion in the protein causing a frameshift
Reference transcript: NM_020554
TCOF1
refseq_TCOF1.F2 refseq_TCOF1.R2 217 331
NCBIGene 36.3 6949
Single exon skipping, size difference: 114
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_001008656
pfam Treacle 321aa 9e-36
in ref transcript
Treacher Collins syndrome protein Treacle.
TCP1
refseq_TCP1.F1 refseq_TCP1.R1 137 223
NCBIGene 36.3 6950
Single exon skipping, size difference: 86
Exclusion in the protein causing a frameshift
Reference transcript: NM_030752
Changed!
cd TCP1_alpha 527aa 0.0
in ref transcript
TCP-1 (CTT or eukaryotic type II) chaperonin family, alpha subunit. Chaperonins are involved in productive folding of proteins. They share a common general morphology, a double toroid of 2 stacked rings. In contrast to bacterial group I chaperonins (GroEL), each ring of the eukaryotic cytosolic chaperonin (CTT) consists of eight different, but homologous subunits. Their common function is to sequester nonnative proteins inside their central cavity and promote folding by using energy derived from ATP hydrolysis. The best studied in vivo substrates of CTT are actin and tubulin.
Changed!
TIGR chap_CCT_alpha 536aa 0.0
in ref transcript
Members of this family, all eukaryotic, are part of the group II chaperonin complex called CCT (chaperonin containing TCP-1) or TRiC. The archaeal equivalent group II chaperonin is often called the thermosome. Both are somewhat related to the group I chaperonin of bacterial, GroEL/GroES. This family consists exclusively of the CCT alpha chain (part of a paralogous family) from animals, plants, fungi, and other eukaryotes.
Changed!
COG GroL 533aa 1e-111
in ref transcript
Chaperonin GroEL (HSP60 family) [Posttranslational modification, protein turnover, chaperones].
TDP1
refseq_TDP1.F1 refseq_TDP1.R1 125 348
NCBIGene 36.3 55775
Single exon skipping, size difference: 223
Exclusion in 5'UTR
Reference transcript: NM_018319
pfam Tyr-DNA_phospho 422aa 1e-104
in ref transcript
Tyrosyl-DNA phosphodiesterase. Covalent intermediates between topoisomerase I and DNA can become dead-end complexes that lead to cell death. Tyrosyl-DNA phosphodiesterase can hydrolyse the bond between topoisomerase I and DNA.
TEAD4
refseq_TEAD4.F2 refseq_TEAD4.R2 106 235
NCBIGene 36.3 7004
Single exon skipping, size difference: 129
Exclusion in the protein (no frameshift)
Reference transcript: NM_003213
Changed!
pfam TEA 353aa 1e-130
in ref transcript
TEA/ATTS domain family.
Changed!
pfam TEA 310aa 1e-128
in modified transcript
TEAD4
refseq_TEAD4.F3 refseq_TEAD4.R3 109 364
NCBIGene 36.3 7004
Single exon skipping, size difference: 255
Exclusion of the protein initiation site
Reference transcript: NM_003213
Changed!
pfam TEA 353aa 1e-130
in ref transcript
TEA/ATTS domain family.
Changed!
pfam TEA 270aa 1e-107
in modified transcript
TEPP
refseq_TEPP.F2 refseq_TEPP.R2 106 187
NCBIGene 36.3 374739
Alternative 3-prime, size difference: 81
Exclusion in the protein (no frameshift)
Reference transcript: NM_199046
TEX11
refseq_TEX11.F1 refseq_TEX11.R1 205 327
NCBIGene 36.3 56159
Single exon skipping, size difference: 122
Exclusion of the protein initiation site
Reference transcript: NM_001003811
pfam SPO22 257aa 1e-58
in ref transcript
Meiosis protein SPO22/ZIP4 like. SPO22/ZIP4 in yeast is a meiosis specific protein involved in sporulation. It has been shown to regulate crossover distribution by promoting synaptonemal complex formation.
TEX14
refseq_TEX14.F1 refseq_TEX14.R1 123 243
NCBIGene 36.3 56155
Single exon skipping, size difference: 120
Exclusion in the protein (no frameshift)
Reference transcript: NM_198393
cd S_TKc 234aa 5e-19
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
cd ANK 109aa 5e-17
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
pfam Pkinase_Tyr 214aa 9e-23
in ref transcript
Protein tyrosine kinase.
COG SPS1 328aa 1e-08
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
COG Arp 141aa 5e-08
in ref transcript
FOG: Ankyrin repeat [General function prediction only].
TFEC
refseq_TFEC.F2 refseq_TFEC.R2 255 342
NCBIGene 36.3 22797
Single exon skipping, size difference: 87
Exclusion in the protein (no frameshift)
Reference transcript: NM_012252
cd HLH 61aa 1e-05
in ref transcript
Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
Changed!
pfam HLH 54aa 3e-04
in ref transcript
Helix-loop-helix DNA-binding domain.
Changed!
smart HLH 53aa 2e-04
in modified transcript
helix loop helix domain.
TFG
refseq_TFG.F2 refseq_TFG.R2 171 461
NCBIGene 36.3 10342
Alternative 5-prime, size difference: 290
Exclusion in 5'UTR
Reference transcript: NM_006070
cd PB1_TFG 81aa 4e-41
in ref transcript
The PB1 domain found in TFG protein, an oncogenic gene product and fusion partner to nerve growth factor tyrosine kinase receptor TrkA and to the tyrosine kinase ALK. The PB1 domain is a modular domain mediating specific protein-protein interaction in many critical cell processes, such as osteoclastogenesis, angiogenesis, early cardiovascular development and cell polarity. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domains, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. The PB1 domains of TFG represent a type I/II PB1 domain. The physiological function of TFG remains unknown.
smart PB1 77aa 5e-16
in ref transcript
PB1 domain. Phox and Bem1p domain, present in many eukaryotic cytoplasmic signalling proteins. The domain adopts a beta-grasp fold, similar to that found in ubiquitin and Ras-binding domains. A motif, variously termed OPR, PC and AID, represents the most conserved region of the majority of PB1 domains, and is necessary for PB1 domain function. This function is the formation of PB1 domain heterodimers, although not all PB1 domain pairs associate.
TFIP11
refseq_TFIP11.F2 refseq_TFIP11.R2 267 344
NCBIGene 36.3 24144
Single exon skipping, size difference: 77
Exclusion in 5'UTR
Reference transcript: NM_001008697
pfam TFP11 235aa 1e-106
in ref transcript
Tuftelin interacting protein 11. This family contains tuftelin interacting protein 11 which has been identified as both a nuclear and cytoplasmic protein, and has been implicated in the secretory pathway. Sip1, a septin interacting protein is also a member of this family. Proteins in this family often contain pfam01595.
pfam G-patch 45aa 4e-13
in ref transcript
G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines.
TGIF1
refseq_TGIF.F2 refseq_TGIF.R2 119 303
NCBIGene 36.3 7050
Single exon skipping, size difference: 184
Exclusion of the protein initiation site
Reference transcript: NM_173208
cd homeodomain 53aa 0.003
in ref transcript
Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
smart HOX 53aa 0.001
in ref transcript
Homeodomain. DNA-binding factors that are involved in the transcriptional regulation of key developmental processes.
TGM5
refseq_TGM5.F1 refseq_TGM5.R1 142 388
NCBIGene 36.3 9333
Single exon skipping, size difference: 246
Exclusion in the protein (no frameshift)
Reference transcript: NM_201631
pfam Transglut_C 97aa 9e-23
in ref transcript
Transglutaminase family, C-terminal ig like domain.
Changed!
pfam Transglut_N 117aa 3e-22
in ref transcript
Transglutaminase family.
smart TGc 94aa 4e-20
in ref transcript
Transglutaminase/protease-like homologues. Transglutaminases are enzymes that establish covalent links between proteins. A subset of transglutaminase homologues appear to catalyse the reverse reaction, the hydrolysis of peptide bonds. Proteins with this domain are both extracellular and intracellular, and it is likely that the eukaryotic intracellular proteins are involved in signalling events.
pfam Transglut_C 100aa 4e-16
in ref transcript
COG COG1305 98aa 3e-04
in ref transcript
Transglutaminase-like enzymes, putative cysteine proteases [Amino acid transport and metabolism].
Changed!
pfam Transglut_N 61aa 1e-11
in modified transcript
THADA
refseq_THADA.F2 refseq_THADA.R2 121 303
NCBIGene 36.2 63892
Multiple exon skipping, size difference: 182
Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
Reference transcript: NM_022065
pfam DUF2428 304aa 3e-68
in ref transcript
Putative death-receptor fusion protein (DUF2428). This is a family of proteins conserved from plants to humans. The function is not known. Several members have been annotated as being HEAT repeat-containing proteins while others are designated as death-receptor interacting proteins, but neither of these could be confirmed.
Changed!
COG COG5543 475aa 5e-37
in ref transcript
Uncharacterized conserved protein [Function unknown].
Changed!
COG COG5543 239aa 2e-19
in modified transcript
THAP1
refseq_THAP1.F1 refseq_THAP1.R1 109 305
NCBIGene 36.3 55145
Single exon skipping, size difference: 196
Exclusion in the protein causing a frameshift
Reference transcript: NM_018105
Changed!
pfam THAP 79aa 2e-16
in ref transcript
THAP domain. The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologs of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes.
THEG
refseq_THEG.F1 refseq_THEG.R1 131 203
NCBIGene 36.3 51298
Single exon skipping, size difference: 72
Exclusion in the protein (no frameshift)
Reference transcript: NM_016585
THOC5
refseq_THOC5.F2 refseq_THOC5.R2 108 224
NCBIGene 36.3 8563
Single exon skipping, size difference: 116
Exclusion in 5'UTR
Reference transcript: NM_001002877
pfam FimP 358aa 1e-128
in ref transcript
Fms-interacting protein. This entry carries part of the crucial 144 N-terminal residues of the FmiP protein, which is essential for the binding of the protein to the cytoplasmic domain of activated Fms-molecules in M-CSF induced haematopoietic differentiation of macrophages. The C-terminus contains a putative nuclear localisation sequence and a leucine zipper which suggest further, as yet unknown, nuclear functions. The level of FMIP expression might form a threshold that determines whether cells differentiate into macrophages or into granulocytes.
THOC5
refseq_THOC5.F4 refseq_THOC5.R3 197 313
NCBIGene 36.3 8563
Single exon skipping, size difference: 116
Exclusion in 5'UTR
Reference transcript: NM_001002878
pfam FimP 358aa 1e-128
in ref transcript
Fms-interacting protein. This entry carries part of the crucial 144 N-terminal residues of the FmiP protein, which is essential for the binding of the protein to the cytoplasmic domain of activated Fms-molecules in M-CSF induced haematopoietic differentiation of macrophages. The C-terminus contains a putative nuclear localisation sequence and a leucine zipper which suggest further, as yet unknown, nuclear functions. The level of FMIP expression might form a threshold that determines whether cells differentiate into macrophages or into granulocytes.
THSD1
refseq_THSD1.F1 refseq_THSD1.R1 228 387
NCBIGene 36.3 55901
Single exon skipping, size difference: 159
Exclusion in the protein (no frameshift)
Reference transcript: NM_018676
Changed!
smart TSP1 50aa 5e-10
in ref transcript
Thrombospondin type 1 repeats. Type 1 repeats in thrombospondin-1 bind and activate TGF-beta.
THSD3
refseq_THSD3.F2 refseq_THSD3.R2 243 348
NCBIGene 36.3 145501
Single exon skipping, size difference: 105
Inclusion in the protein causing a new stop codon
Reference transcript: NM_199296
Changed!
pfam AMOP 158aa 2e-40
in ref transcript
AMOP domain. This domain may have a role in cell adhesion. It is called the AMOP domain after Adhesion associated domain in MUC4 and Other Proteins. This domain is extracellular and contains a number of cysteines that probably form disulphide bridges.
Changed!
smart TSP1 41aa 3e-06
in ref transcript
Thrombospondin type 1 repeats. Type 1 repeats in thrombospondin-1 bind and activate TGF-beta.
THSD3
refseq_THSD3.F3 refseq_THSD3.R3 100 446
NCBIGene 36.3 145501
Single exon skipping, size difference: 346
Exclusion in the protein causing a frameshift
Reference transcript: NM_199296
Changed!
pfam AMOP 158aa 2e-40
in ref transcript
AMOP domain. This domain may have a role in cell adhesion. It is called the AMOP domain after Adhesion associated domain in MUC4 and Other Proteins. This domain is extracellular and contains a number of cysteines that probably form disulphide bridges.
Changed!
smart TSP1 41aa 3e-06
in ref transcript
Thrombospondin type 1 repeats. Type 1 repeats in thrombospondin-1 bind and activate TGF-beta.
THYN1
refseq_THYN1.F1 refseq_THYN1.R1 170 321
NCBIGene 36.3 29087
Single exon skipping, size difference: 151
Exclusion in the protein causing a frameshift
Reference transcript: NM_014174
Changed!
pfam DUF55 134aa 3e-29
in ref transcript
Protein of unknown function DUF55. This family of proteins have no known function.
Changed!
COG COG2947 169aa 3e-46
in ref transcript
Uncharacterized conserved protein [Function unknown].
Changed!
pfam DUF55 66aa 4e-16
in modified transcript
Changed!
COG COG2947 105aa 2e-34
in modified transcript
TIAL1
refseq_TIAL1.F1 refseq_TIAL1.R1 281 332
NCBIGene 36.3 7073
Alternative 3-prime, size difference: 51
Exclusion in the protein (no frameshift)
Reference transcript: NM_001033925
cd RRM 74aa 2e-19
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
Changed!
cd RRM 90aa 2e-16
in ref transcript
cd RRM 69aa 3e-13
in ref transcript
Changed!
TIGR PABP-1234 275aa 3e-28
in ref transcript
There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, and of unknown tissue range.
Changed!
COG COG0724 277aa 2e-20
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
Changed!
cd RRM 73aa 3e-18
in modified transcript
Changed!
TIGR PABP-1234 258aa 3e-29
in modified transcript
Changed!
COG COG0724 192aa 3e-20
in modified transcript
TJP1
refseq_TJP1.F1 refseq_TJP1.R1 145 385
NCBIGene 36.3 7082
Single exon skipping, size difference: 240
Exclusion in the protein (no frameshift)
Reference transcript: NM_003257
cd PDZ_signaling 86aa 3e-14
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
cd PDZ_signaling 79aa 1e-10
in ref transcript
cd PDZ_signaling 78aa 7e-10
in ref transcript
pfam ZU5 103aa 7e-35
in ref transcript
ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function.
smart GuKc 176aa 5e-34
in ref transcript
Guanylate kinase homologues. Active enzymes catalyze ATP-dependent phosphorylation of GMP to GDP. Structure resembles that of adenylate kinase. So-called membrane-associated guanylate kinase homologues (MAGUKs) do not possess guanylate kinase activities; instead at least some possess protein-binding functions.
smart PDZ 91aa 4e-14
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
smart PDZ 81aa 1e-12
in ref transcript
pfam PDZ 77aa 3e-12
in ref transcript
PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins.
pfam SH3_2 62aa 3e-06
in ref transcript
Variant SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organisation. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel.
TIGR degP_htrA_DO 214aa 7e-06
in ref transcript
This family consists of a set proteins various designated DegP, heat shock protein HtrA, and protease DO. The ortholog in Pseudomonas aeruginosa is designated MucD and is found in an operon that controls mucoid phenotype. This family also includes the DegQ (HhoA) paralog in E. coli which can rescue a DegP mutant, but not the smaller DegS paralog, which cannot. Members of this family are located in the periplasm and have separable functions as both protease and chaperone. Members have a trypsin domain and two copies of a PDZ domain. This protein protects bacteria from thermal and other stresses and may be important for the survival of bacterial pathogens.// The chaperone function is dominant at low temperatures, whereas the proteolytic activity is turned on at elevated temperatures.
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
pfam TIR 139aa 4e-15
in ref transcript
TIR domain. The Toll/interleukin-1 receptor (TIR) homology domain is an intracellular signalling domain found in MyD88, interleukin 1 receptor and the Toll receptor. It contains three highly-conserved regions, and mediates protein-protein interactions between the Toll-like receptors (TLRs) and signal-transduction components. TIR-like motifs are also found in plant proteins thought to be involved in resistance to disease. When activated, TIR domains recruit cytoplasmic adaptor proteins MyD88 and TOLLIP (Toll interacting protein). In turn, these associate with various kinases to set off signalling cascades.
TIGR PCC 84aa 2e-06
in ref transcript
Note: this model is restricted to the amino half because a full-length model is incompatible with the HMM software package.
TM2D2
refseq_TM2D2.F1 refseq_TM2D2.R1 100 489
NCBIGene 36.3 83877
Alternative 3-prime, size difference: 389
Inclusion in the protein causing a new stop codon
Reference transcript: NM_078473
Changed!
pfam TM2 50aa 2e-11
in ref transcript
TM2 domain. This family is composed of a pair of transmembrane alpha helices connected by a short linker. The function of this domain is unknown, however it occurs in a wide range or protein contexts.
Changed!
PHA PHA01886 53aa 9e-04
in ref transcript
TM2 domain-containing protein.
TM2D2
refseq_TM2D2.F3 refseq_TM2D2.R3 172 378
NCBIGene 36.3 83877
Alternative 5-prime, size difference: 206
Exclusion in 5'UTR
Reference transcript: NM_031940
pfam TM2 50aa 1e-10
in ref transcript
TM2 domain. This family is composed of a pair of transmembrane alpha helices connected by a short linker. The function of this domain is unknown, however it occurs in a wide range or protein contexts.
PHA PHA01886 53aa 0.001
in ref transcript
TM2 domain-containing protein.
TM2D3
refseq_TM2D3.F1 refseq_TM2D3.R1 101 179
NCBIGene 36.3 80213
Single exon skipping, size difference: 78
Exclusion in the protein (no frameshift)
Reference transcript: NM_078474
pfam TM2 50aa 2e-06
in ref transcript
TM2 domain. This family is composed of a pair of transmembrane alpha helices connected by a short linker. The function of this domain is unknown, however it occurs in a wide range or protein contexts.
TM6SF2
refseq_TM6SF2.F1 refseq_TM6SF2.R1 162 287
NCBIGene 36.2 53345
Single exon skipping, size difference: 125
Exclusion in the protein causing a frameshift
Reference transcript: NM_001001524
TMEM10
refseq_TMEM10.F1 refseq_TMEM10.R1 105 144
NCBIGene 36.3 93377
Single exon skipping, size difference: 39
Inclusion in the protein causing a new stop codon
Reference transcript: NM_033207
TMEM107
refseq_TMEM107.F1 refseq_TMEM107.R1 101 119
NCBIGene 36.3 84314
Alternative 3-prime, size difference: 18
Exclusion in the protein (no frameshift)
Reference transcript: NM_032354
TMEM150
refseq_TMEM150.F1 refseq_TMEM150.R1 151 219
NCBIGene 36.3 129303
Single exon skipping, size difference: 68
Exclusion in the protein causing a frameshift
Reference transcript: NM_001031738
Changed!
pfam Frag1 229aa 2e-44
in ref transcript
Frag1/DRAM/Sfk1 family. This family includes Frag1, DRAM and Sfk1 proteins. Frag1 (FGF receptor activating protein 1) is a protein that is conserved from fungi to humans. There are four potential iso-prenylation sites throughout the peptide, viz CILW, CIIW and CIGL. Frag1 is a membrane-spanning protein that is ubiquitously expressed in adult tissues suggesting an important cellular function. Dram is a family of proteins conserved from nematodes to humans with six hydrophobic transmembrane regions and an Endoplasmic Reticulum signal peptide. It is a lysosomal protein that induces macro-autophagy as an effector of p53-mediated death, where p53 is the tumour-suppressor gene that is frequently mutated in cancer. Expression of Dram is stress-induced. This region is also part of a family of small plasma membrane proteins, referred to as Sfk1, that may act together with or upstream of Stt4p to generate normal levels of the essential phospholipid PI4P, thus allowing proper localisation of Stt4p to the actin cytoskeleton.
Changed!
pfam Frag1 66aa 2e-07
in modified transcript
TMEM44
refseq_TMEM44.F1 refseq_TMEM44.R1 108 141
NCBIGene 36.3 93109
Single exon skipping, size difference: 33
Exclusion in the protein (no frameshift)
Reference transcript: NM_138399
TMEM93
refseq_TMEM93.F1 refseq_TMEM93.R1 229 342
NCBIGene 36.3 83460
Alternative 5-prime, size difference: 113
Exclusion in 5'UTR
Reference transcript: NM_001014764
pfam DUF786 110aa 5e-25
in ref transcript
Protein of unknown function (DUF786). This family consists of several eukaryotic proteins of unknown function.
TMEM98
refseq_TMEM98.F1 refseq_TMEM98.R1 199 275
NCBIGene 36.3 26022
Single exon skipping, size difference: 76
Exclusion in 5'UTR
Reference transcript: NM_015544
TMPRSS4
refseq_TMPRSS4.F1 refseq_TMPRSS4.R1 150 293
NCBIGene 36.2 56649
Single exon skipping, size difference: 143
Exclusion in the protein causing a frameshift
Reference transcript: NM_019894
Changed!
cd Tryp_SPc 228aa 2e-68
in ref transcript
Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.
cd LDLa 35aa 0.002
in ref transcript
Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure.
Changed!
smart Tryp_SPc 226aa 2e-77
in ref transcript
Trypsin-like serine protease. Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.
smart SR 91aa 9e-07
in ref transcript
Scavenger receptor Cys-rich. The sea ucrhin egg peptide speract contains 4 repeats of SR domains that contain 6 conserved cysteines. May bind bacterial antigens in the protein MARCO.
pfam Ldl_recept_a 22aa 0.005
in ref transcript
Low-density lipoprotein receptor domain class A.
Changed!
COG COG5640 228aa 8e-26
in ref transcript
Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones].
Changed!
cd Tryp_SPc 132aa 1e-32
in modified transcript
Changed!
smart Tryp_SPc 134aa 1e-36
in modified transcript
Changed!
COG COG5640 133aa 2e-08
in modified transcript
TMUB2
refseq_TMUB2.F1 refseq_TMUB2.R1 112 401
NCBIGene 36.2 79089
Alternative 5-prime, size difference: 289
Exclusion in the protein causing a frameshift
Reference transcript: NM_177441
Changed!
cd UBL 63aa 1e-05
in ref transcript
UBLs function by remodeling the surface of their target proteins, changing their target's half-life, enzymatic activity, protein-protein interactions, subcellular localization or other properties. At least 10 different ubiquitin-like modifications exist in mammals, and attachment of different ubls to a target leads to different biological consequences. Ubl-conjugation cascades are initiated by activating enzymes, which also coordinate the ubls with their downstream pathways.
Changed!
pfam ubiquitin 51aa 5e-08
in ref transcript
Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1, and Parkin. A number of them are thought to carry a distinctive five-residue motif termed the proteasome-interacting motif (PIM), which may have a biologically significant role in protein delivery to proteasomes and recruitment of proteasomes to transcription sites.
TNFRSF18
refseq_TNFRSF18.F1 refseq_TNFRSF18.R1 101 122
NCBIGene 36.3 8784
Alternative 5-prime, size difference: 21
Exclusion in the protein (no frameshift)
Reference transcript: NM_004195
TNFRSF25
refseq_TNFRSF25.F1 refseq_TNFRSF25.R1 153 180
NCBIGene 36.3 8718
Alternative 5-prime, size difference: 7
Exclusion in the protein causing a frameshift
Reference transcript: NM_148965
Changed!
cd TNFR 107aa 4e-13
in ref transcript
Tumor necrosis factor receptor (TNFR) domain; superfamily of TNF-like receptor domains. When bound to TNF-like cytokines, TNFRs trigger multiple signal transduction pathways, they are involved in inflammation response, apoptosis, autoimmunity and organogenesis. TNFRs domains are elongated with generally three tandem repeats of cysteine-rich domains (CRDs). They fit in the grooves between protomers within the ligand trimer. Some TNFRs, such as NGFR and HveA, bind ligands with no structural similarity to TNF and do not bind ligand trimers.
Changed!
smart DEATH 85aa 6e-12
in ref transcript
DEATH domain, found in proteins involved in cell death (apoptosis). Alpha-helical domain present in a variety of proteins with apoptotic functions. Some (but not all) of these domains form homotypic and heterotypic dimers.
Changed!
cd TNFR 88aa 3e-09
in modified transcript
TNFRSF25
refseq_TNFRSF25.F1 refseq_TNFRSF25.R5 107 127
NCBIGene 36.3 8718
Alternative 3-prime, size difference: 20
Exclusion in the protein causing a frameshift
Reference transcript: NM_148965
Changed!
cd TNFR 107aa 4e-13
in ref transcript
Tumor necrosis factor receptor (TNFR) domain; superfamily of TNF-like receptor domains. When bound to TNF-like cytokines, TNFRs trigger multiple signal transduction pathways, they are involved in inflammation response, apoptosis, autoimmunity and organogenesis. TNFRs domains are elongated with generally three tandem repeats of cysteine-rich domains (CRDs). They fit in the grooves between protomers within the ligand trimer. Some TNFRs, such as NGFR and HveA, bind ligands with no structural similarity to TNF and do not bind ligand trimers.
Changed!
smart DEATH 85aa 6e-12
in ref transcript
DEATH domain, found in proteins involved in cell death (apoptosis). Alpha-helical domain present in a variety of proteins with apoptotic functions. Some (but not all) of these domains form homotypic and heterotypic dimers.
Changed!
cd TNFR 88aa 2e-08
in modified transcript
TNFRSF25
refseq_TNFRSF25.F4 refseq_TNFRSF25.R4 235 370
NCBIGene 36.3 8718
Single exon skipping, size difference: 135
Exclusion in the protein (no frameshift)
Reference transcript: NM_148965
Changed!
cd TNFR 107aa 4e-13
in ref transcript
Tumor necrosis factor receptor (TNFR) domain; superfamily of TNF-like receptor domains. When bound to TNF-like cytokines, TNFRs trigger multiple signal transduction pathways, they are involved in inflammation response, apoptosis, autoimmunity and organogenesis. TNFRs domains are elongated with generally three tandem repeats of cysteine-rich domains (CRDs). They fit in the grooves between protomers within the ligand trimer. Some TNFRs, such as NGFR and HveA, bind ligands with no structural similarity to TNF and do not bind ligand trimers.
Changed!
smart DEATH 85aa 6e-12
in ref transcript
DEATH domain, found in proteins involved in cell death (apoptosis). Alpha-helical domain present in a variety of proteins with apoptotic functions. Some (but not all) of these domains form homotypic and heterotypic dimers.
Changed!
cd TNFR 100aa 0.001
in modified transcript
Changed!
pfam Death 80aa 6e-12
in modified transcript
Death domain.
TNFSF13
refseq_TNFSF13.F2 refseq_TNFSF13.R2 217 265
NCBIGene 36.3 8741
Single exon skipping, size difference: 48
Exclusion in the protein (no frameshift)
Reference transcript: NM_003808
Changed!
cd TNF 132aa 2e-18
in ref transcript
Tumor Necrosis Factor; TNF superfamily members include the cytokines: TNF (TNF-alpha), LT (lymphotoxin-alpha, TNF-beta), CD40 ligand, Apo2L (TRAIL), Fas ligand, and osteoprotegerin (OPG) ligand. These proteins generally have an intracellular N-terminal domain, a short transmembrane segment, an extracellular stalk, and a globular TNF-like extracellular domain of about 150 residues. They initiate apoptosis by binding to related receptors, some of which have intracellular death domains. They generally form homo- or hetero- trimeric complexes.TNF cytokines bind one elongated receptor molecule along each of three clefts formed by neighboring monomers of the trimer with ligand trimerization a requiste for receptor binding.
pfam TNF 112aa 3e-17
in ref transcript
TNF(Tumour Necrosis Factor) family.
Changed!
cd TNF 118aa 2e-17
in modified transcript
TNFSF14
refseq_TNFSF14.F2 refseq_TNFSF14.R2 123 231
NCBIGene 36.3 8740
Alternative 5-prime, size difference: 108
Exclusion in the protein (no frameshift)
Reference transcript: NM_003807
cd TNF 145aa 2e-23
in ref transcript
Tumor Necrosis Factor; TNF superfamily members include the cytokines: TNF (TNF-alpha), LT (lymphotoxin-alpha, TNF-beta), CD40 ligand, Apo2L (TRAIL), Fas ligand, and osteoprotegerin (OPG) ligand. These proteins generally have an intracellular N-terminal domain, a short transmembrane segment, an extracellular stalk, and a globular TNF-like extracellular domain of about 150 residues. They initiate apoptosis by binding to related receptors, some of which have intracellular death domains. They generally form homo- or hetero- trimeric complexes.TNF cytokines bind one elongated receptor molecule along each of three clefts formed by neighboring monomers of the trimer with ligand trimerization a requiste for receptor binding.
smart TNF 129aa 2e-25
in ref transcript
Tumour necrosis factor family. Family of cytokines that form homotrimeric or heterotrimeric complexes. TNF mediates mature T-cell receptor-induced apoptosis through the p75 TNF receptor.
TNK2
refseq_TNK2.F1 refseq_TNK2.R1 328 418
NCBIGene 36.3 10188
Single exon skipping, size difference: 90
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_001010938
cd PTKc_Ack_like 258aa 1e-147
in ref transcript
Catalytic Domain of the Protein Tyrosine Kinase, Activated Cdc42-associated kinase. Protein Tyrosine Kinase (PTK) family; Activated Cdc42-associated kinase (Ack) subfamily; catalytic (c) domain. Ack subfamily members include Ack1, thirty-eight-negative kinase 1 (Tnk1), and similar proteins. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. Ack subfamily members are cytoplasmic (or nonreceptor) tyr kinases containing an N-terminal catalytic domain, an SH3 domain, a Cdc42-binding CRIB domain, and a proline-rich region. They are mainly expressed in brain and skeletal tissues and are involved in the regulation of cell adhesion and growth, receptor degradation, and axonal guidance. Ack1 is also associated with androgen-independent prostate cancer progression. Tnk1 regulates TNFalpha signaling and may play an important role in cell death.
cd SH3 52aa 6e-06
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
GTPase binding. The GTPase binding domain binds to the G protein Cdc42, inhibiting both its intrinsic and stimulated GTPase activity. The domain is largely unstructured in the absence of Cdc42.
smart SH3 54aa 2e-06
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
COG SPS1 266aa 6e-25
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
Changed!
COG DedD 114aa 0.010
in modified transcript
Uncharacterized protein conserved in bacteria [Function unknown].
TNK2
refseq_TNK2.F3 refseq_TNK2.R3 197 242
NCBIGene 36.3 10188
Single exon skipping, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_001010938
cd PTKc_Ack_like 258aa 1e-147
in ref transcript
Catalytic Domain of the Protein Tyrosine Kinase, Activated Cdc42-associated kinase. Protein Tyrosine Kinase (PTK) family; Activated Cdc42-associated kinase (Ack) subfamily; catalytic (c) domain. Ack subfamily members include Ack1, thirty-eight-negative kinase 1 (Tnk1), and similar proteins. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. Ack subfamily members are cytoplasmic (or nonreceptor) tyr kinases containing an N-terminal catalytic domain, an SH3 domain, a Cdc42-binding CRIB domain, and a proline-rich region. They are mainly expressed in brain and skeletal tissues and are involved in the regulation of cell adhesion and growth, receptor degradation, and axonal guidance. Ack1 is also associated with androgen-independent prostate cancer progression. Tnk1 regulates TNFalpha signaling and may play an important role in cell death.
cd SH3 52aa 6e-06
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
GTPase binding. The GTPase binding domain binds to the G protein Cdc42, inhibiting both its intrinsic and stimulated GTPase activity. The domain is largely unstructured in the absence of Cdc42.
smart SH3 54aa 2e-06
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
COG SPS1 266aa 6e-25
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
TNRC5
refseq_TNRC5.F1 refseq_TNRC5.R1 171 295
NCBIGene 36.2 10695
Single exon skipping, size difference: 124
Exclusion in the protein causing a frameshift
Reference transcript: NM_006586
TNRC6A
refseq_TNRC6A.F1 refseq_TNRC6A.R1 148 273
NCBIGene 36.2 27327
Single exon skipping, size difference: 125
Inclusion in the protein causing a new stop codon
Reference transcript: NM_020847
Changed!
cd RRM 68aa 3e-05
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
Changed!
pfam Ago_hook 69aa 6e-11
in ref transcript
Argonaute hook. This region has been called the argonaute hook. It has been shown to bind to the Piwi domain pfam02171 of Argnonaute proteins.
TPD52L1
refseq_TPD52L1.F1 refseq_TPD52L1.R1 285 346
NCBIGene 36.3 7164
Single exon skipping, size difference: 61
Exclusion in the protein causing a frameshift
Reference transcript: NM_003287
Changed!
pfam TPD52 185aa 4e-39
in ref transcript
Tumour protein D52 family. The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction and cell proliferation. Two human homologues of hD52, hD53 and hD54, have also been identified, demonstrating the existence of a novel gene/protein family. These proteins have an amino terminal coiled-coil that allows members to form homo- and heterodimers with each other.
Changed!
pfam TPD52 128aa 7e-27
in modified transcript
TPD52L2
refseq_TPD52L2.F1 refseq_TPD52L2.R1 108 135
NCBIGene 36.3 7165
Single exon skipping, size difference: 27
Exclusion in the protein (no frameshift)
Reference transcript: NM_199360
Changed!
pfam TPD52 163aa 4e-26
in ref transcript
Tumour protein D52 family. The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction and cell proliferation. Two human homologues of hD52, hD53 and hD54, have also been identified, demonstrating the existence of a novel gene/protein family. These proteins have an amino terminal coiled-coil that allows members to form homo- and heterodimers with each other.
Changed!
pfam TPD52 154aa 7e-27
in modified transcript
TPM1
refseq_TPM1.F1 refseq_TPM1.R1 151 199
NCBIGene 36.3 7168
Alternative 5-prime, size difference: 48
Exclusion of the stop codon
Reference transcript: NM_000366
Changed!
pfam Tropomyosin 236aa 3e-45
in ref transcript
Tropomyosin.
Changed!
pfam Tropomyosin 237aa 1e-45
in modified transcript
TPO
refseq_TPO.F1 refseq_TPO.R1 225 396
NCBIGene 36.3 7173
Single exon skipping, size difference: 171
Exclusion in the protein (no frameshift)
Reference transcript: NM_000547
cd CCP 54aa 3e-06
in ref transcript
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
cd EGF_CA 44aa 3e-06
in ref transcript
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Changed!
pfam An_peroxidase 559aa 0.0
in ref transcript
Animal haem peroxidase.
pfam EGF_CA 43aa 2e-09
in ref transcript
Calcium binding EGF domain.
smart CCP 53aa 3e-06
in ref transcript
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.
Changed!
pfam An_peroxidase 502aa 1e-163
in modified transcript
TPO
refseq_TPO.F4 refseq_TPO.R4 115 247
NCBIGene 36.3 7173
Single exon skipping, size difference: 132
Exclusion in the protein (no frameshift)
Reference transcript: NM_000547
cd CCP 54aa 3e-06
in ref transcript
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
Changed!
cd EGF_CA 44aa 3e-06
in ref transcript
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
pfam An_peroxidase 559aa 0.0
in ref transcript
Animal haem peroxidase.
Changed!
pfam EGF_CA 43aa 2e-09
in ref transcript
Calcium binding EGF domain.
smart CCP 53aa 3e-06
in ref transcript
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.
TPO
refseq_TPO.F6 refseq_TPO.R6 221 351
NCBIGene 36.2 7173
Single exon skipping, size difference: 130
Exclusion in the protein causing a frameshift
Reference transcript: NM_000547
cd CCP 54aa 3e-06
in ref transcript
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
cd EGF_CA 44aa 3e-06
in ref transcript
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
pfam An_peroxidase 559aa 0.0
in ref transcript
Animal haem peroxidase.
pfam EGF_CA 43aa 2e-09
in ref transcript
Calcium binding EGF domain.
smart CCP 53aa 3e-06
in ref transcript
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.
TPTE
refseq_TPTE.F2 refseq_TPTE.R2 213 273
NCBIGene 36.3 7179
Single exon skipping, size difference: 60
Exclusion in the protein (no frameshift)
Reference transcript: NM_199259
cd PTPc 53aa 0.002
in ref transcript
Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
pfam PTEN_C2 131aa 5e-30
in ref transcript
C2 domain of PTEN tumour-suppressor protein. This is the C2 domain-like domain, in greek key form, of the PTEN protein, phosphatidyl-inositol triphosphate phosphatase, and it is the C-terminus. This domain may well include a CBR3 loop which means it plays a central role in membrane binding. This domain associates across an extensive interface with the N-terminal phosphatase domain DSPc (pfam00782) suggesting that the C2 domain productively positions the catalytic part of the protein on the membrane 1].
smart PTPc_motif 59aa 2e-04
in ref transcript
Protein tyrosine phosphatase, catalytic domain motif.
SEFIR domain. This family comprises IL17 receptors (IL17Rs) and SEF proteins. The latter are feedback inhibitors of FGF signalling and are also thought to be receptors. Due to its similarity to the TIR domain (pfam01582), the SEFIR region is thought to be involved in homotypic interactions with other SEFIR/TIR-domain-containing proteins. Thus, SEFs and IL17Rs may be involved in TOLL/IL1R-like signalling pathways.
TRAF6
refseq_TRAF6.F2 refseq_TRAF6.R2 296 389
NCBIGene 36.3 7189
Single exon skipping, size difference: 93
Exclusion in 5'UTR
Reference transcript: NM_145803
cd MATH_TRAF6 150aa 2e-67
in ref transcript
Tumor Necrosis Factor Receptor (TNFR)-Associated Factor (TRAF) family, TRAF6 subfamily, TRAF domain, C-terminal MATH subdomain; composed of proteins with similarity to human TRAF6, including the Drosophila protein DTRAF2. TRAF molecules serve as adapter proteins that link TNFRs and downstream kinase cascades resulting in the activation of transcription factors and the regulation of cell survival, proliferation and stress responses. TRAF6 is the most divergent in its TRAF domain among the mammalian TRAFs. In addition to mediating TNFR family signaling, it is also an essential signaling molecule of the interleukin-1/Toll-like receptor superfamily. Whereas other TRAF molecules display similar and overlapping TNFR-binding specificities, TRAF6 binds completely different sites on receptors such as CD40 and RANK. TRAF6 serves as a molecular bridge between innate and adaptive immunity and plays a central role in osteoimmunology. DTRAF2, as an activator of nuclear factor-kappaB, plays a pivotal role in Drosophila development and innate immunity. TRAF6 contains a RING finger domain, five zinc finger domains, and a TRAF domain. The TRAF domain can be divided into a more divergent N-terminal alpha helical region (TRAF-N), and a highly conserved C-terminal MATH subdomain (TRAF-C) with an eight-stranded beta-sandwich structure. TRAF-N mediates trimerization while TRAF-C interacts with receptors.
cd RING 39aa 3e-04
in ref transcript
RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
pfam MATH 143aa 1e-18
in ref transcript
MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode Caenorhabditis elegans.
pfam zf-TRAF 58aa 9e-14
in ref transcript
TRAF-type zinc finger.
smart RING 38aa 2e-06
in ref transcript
Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
pfam zf-TRAF 57aa 1e-04
in ref transcript
pfam Myosin_tail_1 46aa 0.007
in ref transcript
Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
COG COG5222 65aa 1e-04
in ref transcript
Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only].
TRAF7
refseq_TRAF7.F1 refseq_TRAF7.R1 145 194
NCBIGene 36.2 84231
Alternative 5-prime and 3-prime, size difference: 49
Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_032271
Changed!
cd WD40 281aa 4e-47
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Changed!
pfam WD40 39aa 1e-06
in ref transcript
WD domain, G-beta repeat.
Changed!
TIGR rad18 34aa 4e-05
in ref transcript
This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Changed!
pfam WD40 36aa 0.001
in ref transcript
Changed!
pfam WD40 41aa 0.002
in ref transcript
Changed!
pfam SMC_N 79aa 0.002
in ref transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Changed!
COG COG2319 330aa 2e-18
in ref transcript
FOG: WD40 repeat [General function prediction only].
Changed!
COG COG5152 36aa 4e-04
in ref transcript
Uncharacterized conserved protein, contains RING and CCCH-type Zn-fingers [General function prediction only].
TRAPPC2
refseq_TRAPPC2.F1 refseq_TRAPPC2.R1 109 251
NCBIGene 36.3 6399
Single exon skipping, size difference: 142
Exclusion in 5'UTR
Reference transcript: NM_001011658
pfam Sedlin_N 128aa 9e-49
in ref transcript
Sedlin, N-terminal conserved region. Mutations in this protein are associated with the X-linked spondyloepiphyseal dysplasia tarda syndrome (OMIM:313400). This family represents an N-terminal conserved region.
COG TRS20 133aa 4e-32
in ref transcript
Subunit of TRAPP, an ER-Golgi tethering complex [Cell motility and secretion].
TRERF1
refseq_TRERF1.F1 refseq_TRERF1.R1 150 200
NCBIGene 36.2 55809
Alternative 5-prime, size difference: 50
Inclusion in 5'UTR
Reference transcript: NM_033502
pfam ELM2 57aa 9e-10
in ref transcript
ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. ELM2 is also found associated with an ARID DNA binding domain pfam01388 in a protein from Arabidopsis thaliana. This suggests that ELM2 may also be involved in DNA binding, or perhaps is a protein-protein interaction domain.
smart SANT 46aa 0.002
in ref transcript
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains.
TRERF1
refseq_TRERF1.F3 refseq_TRERF1.R3 167 203
NCBIGene 36.2 55809
Alternative 3-prime, size difference: 36
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_033502
pfam ELM2 57aa 9e-10
in ref transcript
ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. ELM2 is also found associated with an ARID DNA binding domain pfam01388 in a protein from Arabidopsis thaliana. This suggests that ELM2 may also be involved in DNA binding, or perhaps is a protein-protein interaction domain.
smart SANT 46aa 0.002
in ref transcript
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains.
TRERF1
refseq_TRERF1.F6 refseq_TRERF1.R6 165 414
NCBIGene 36.2 55809
Single exon skipping, size difference: 249
Exclusion in the protein (no frameshift)
Reference transcript: NM_033502
pfam ELM2 57aa 9e-10
in ref transcript
ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. ELM2 is also found associated with an ARID DNA binding domain pfam01388 in a protein from Arabidopsis thaliana. This suggests that ELM2 may also be involved in DNA binding, or perhaps is a protein-protein interaction domain.
smart SANT 46aa 0.002
in ref transcript
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains.
TREX1
refseq_TREX1.F1 refseq_TREX1.R1 157 242
NCBIGene 36.2 11277
Single exon skipping, size difference: 85
Inclusion in the protein causing a frameshift
Reference transcript: NM_130384
ATRIP
refseq_TREX1.F4 refseq_TREX1.R4 316 397
NCBIGene 36.3 84126
Single exon skipping, size difference: 81
Exclusion in the protein (no frameshift)
Reference transcript: NM_130384
RNF216
refseq_TRIAD3.F1 refseq_TRIAD3.R1 193 364
NCBIGene 36.3 54476
Alternative 3-prime, size difference: 171
Exclusion in the protein (no frameshift)
Reference transcript: NM_207111
TRIM17
refseq_TRIM17.F1 refseq_TRIM17.R1 116 152
NCBIGene 36.3 51127
Alternative 3-prime, size difference: 36
Inclusion in 5'UTR
Reference transcript: NM_001024940
cd RING 53aa 6e-06
in ref transcript
RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
cd BBOX 38aa 2e-05
in ref transcript
B-Box-type zinc finger; zinc binding domain (CHC3H2); often present in combination with other motifs, like RING zinc finger, NHL motif, coiled-coil or RFP domain in functionally unrelated proteins, most likely mediating protein-protein interaction.
smart SPRY 121aa 5e-25
in ref transcript
Domain in SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
pfam zf-B_box 41aa 2e-07
in ref transcript
B-box zinc finger.
smart PRY 53aa 2e-07
in ref transcript
associated with SPRY domains.
smart RING 50aa 2e-06
in ref transcript
Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
COG PEX10 51aa 1e-04
in ref transcript
RING-finger-containing E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones].
TRIM24
refseq_TRIM24.F1 refseq_TRIM24.R1 159 261
NCBIGene 36.3 8805
Alternative 5-prime, size difference: 102
Exclusion in the protein (no frameshift)
Reference transcript: NM_015905
cd Bromo_tif1_like 108aa 4e-46
in ref transcript
Bromodomain; tif1_like subfamily. Tif1 (transcription intermediary factor 1) is a member of the tripartite motif (TRIM) protein family, which is characterized by a particular domain architecture. It functions by recruiting coactivators and/or corepressors to modulate transcription. Vertebrate Tif1-gamma, also labeled E3 ubiquitin-protein ligase TRIM33, plays a role in the control of hematopoiesis. Its homologue in Xenopus laevis, Ectodermin, has been shown to function in germ-layer specification and control of cell growth during embryogenesis. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
cd BBOX 39aa 1e-07
in ref transcript
B-Box-type zinc finger; zinc binding domain (CHC3H2); often present in combination with other motifs, like RING zinc finger, NHL motif, coiled-coil or RFP domain in functionally unrelated proteins, most likely mediating protein-protein interaction.
cd BAH_plant_2 62aa 0.006
in ref transcript
BAH, or Bromo Adjacent Homology domain, plant-specific sub-family with unknown function. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
smart BBC 127aa 6e-32
in ref transcript
B-Box C-terminal domain. Coiled coil region C-terminal to (some) B-Box domains.
smart BROMO 103aa 2e-29
in ref transcript
bromo domain.
pfam zf-B_box 42aa 6e-10
in ref transcript
B-box zinc finger.
pfam PHD 44aa 4e-08
in ref transcript
PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
pfam zf-B_box 46aa 0.003
in ref transcript
COG COG5076 125aa 3e-14
in ref transcript
Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription].
TRIM3
refseq_TRIM3.F1 refseq_TRIM3.R1 203 368
NCBIGene 36.3 10612
Single exon skipping, size difference: 165
Exclusion in 5'UTR
Reference transcript: NM_006458
cd RING 44aa 2e-08
in ref transcript
RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
cd BBOX 37aa 3e-06
in ref transcript
B-Box-type zinc finger; zinc binding domain (CHC3H2); often present in combination with other motifs, like RING zinc finger, NHL motif, coiled-coil or RFP domain in functionally unrelated proteins, most likely mediating protein-protein interaction.
cd WD40 261aa 9e-06
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
smart BBC 127aa 2e-32
in ref transcript
B-Box C-terminal domain. Coiled coil region C-terminal to (some) B-Box domains.
smart IG_FLMN 98aa 2e-22
in ref transcript
Filamin-type immunoglobulin domains. These form a rod-like structure in the actin-binding cytoskeleton protein, filamin. The C-terminal repeats of filamin bind beta1-integrin (CD29).
smart BBOX 42aa 4e-10
in ref transcript
B-Box-type zinc finger.
smart RING 41aa 7e-10
in ref transcript
Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
TIGR rad18 83aa 2e-07
in ref transcript
This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
pfam NHL 28aa 2e-06
in ref transcript
NHL repeat. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies. It is about 40 residues long and resembles the WD repeat pfam00400. The repeats have a catalytic activity in the peptidyl-glycine alpha-amidating monooxygenase (PAM), proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localised to the repeats. The human tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is me diated by the NHL repeats.
pfam NHL 28aa 1e-04
in ref transcript
pfam NHL 28aa 3e-04
in ref transcript
pfam NHL 28aa 0.002
in ref transcript
pfam NHL 28aa 0.003
in ref transcript
COG COG3391 246aa 1e-11
in ref transcript
Uncharacterized conserved protein [Function unknown].
Exon skipping and alternative 3-prime or 5-prime, size difference: 67
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_007028
cd RING 45aa 1e-06
in ref transcript
RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
cd BBOX 37aa 7e-04
in ref transcript
B-Box-type zinc finger; zinc binding domain (CHC3H2); often present in combination with other motifs, like RING zinc finger, NHL motif, coiled-coil or RFP domain in functionally unrelated proteins, most likely mediating protein-protein interaction.
smart RING 41aa 2e-08
in ref transcript
Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
pfam zf-B_box 42aa 2e-06
in ref transcript
B-box zinc finger.
COG PEX10 50aa 4e-05
in ref transcript
RING-finger-containing E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones].
TRIM33
refseq_TRIM33.F1 refseq_TRIM33.R1 229 280
NCBIGene 36.3 51592
Single exon skipping, size difference: 51
Exclusion in the protein (no frameshift)
Reference transcript: NM_015906
Changed!
cd Bromo_tif1_like 126aa 8e-48
in ref transcript
Bromodomain; tif1_like subfamily. Tif1 (transcription intermediary factor 1) is a member of the tripartite motif (TRIM) protein family, which is characterized by a particular domain architecture. It functions by recruiting coactivators and/or corepressors to modulate transcription. Vertebrate Tif1-gamma, also labeled E3 ubiquitin-protein ligase TRIM33, plays a role in the control of hematopoiesis. Its homologue in Xenopus laevis, Ectodermin, has been shown to function in germ-layer specification and control of cell growth during embryogenesis. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
cd BBOX 39aa 2e-07
in ref transcript
B-Box-type zinc finger; zinc binding domain (CHC3H2); often present in combination with other motifs, like RING zinc finger, NHL motif, coiled-coil or RFP domain in functionally unrelated proteins, most likely mediating protein-protein interaction.
smart BBC 127aa 2e-27
in ref transcript
B-Box C-terminal domain. Coiled coil region C-terminal to (some) B-Box domains.
Changed!
smart BROMO 122aa 3e-24
in ref transcript
bromo domain.
pfam zf-B_box 42aa 2e-09
in ref transcript
B-box zinc finger.
pfam PHD 44aa 3e-07
in ref transcript
PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
smart BBOX 37aa 0.004
in ref transcript
B-Box-type zinc finger.
Changed!
COG COG5076 121aa 3e-09
in ref transcript
Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription].
Changed!
cd Bromo_tif1_like 109aa 2e-50
in modified transcript
Changed!
smart BROMO 105aa 1e-27
in modified transcript
Changed!
COG COG5076 104aa 3e-12
in modified transcript
TRIM39
refseq_TRIM39.F1 refseq_TRIM39.R1 124 214
NCBIGene 36.3 56658
Single exon skipping, size difference: 90
Exclusion in the protein (no frameshift)
Reference transcript: NM_021253
cd RING 45aa 5e-08
in ref transcript
RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
cd BBOX 38aa 4e-05
in ref transcript
B-Box-type zinc finger; zinc binding domain (CHC3H2); often present in combination with other motifs, like RING zinc finger, NHL motif, coiled-coil or RFP domain in functionally unrelated proteins, most likely mediating protein-protein interaction.
smart SPRY 109aa 3e-31
in ref transcript
Domain in SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
smart PRY 52aa 3e-17
in ref transcript
associated with SPRY domains.
smart RING 41aa 1e-09
in ref transcript
Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
Changed!
TIGR alt_F1F0_F0_B 178aa 1e-05
in ref transcript
CC and in principle may run in either direction. This model represents the F0 subunit B of this apparent second ATP synthase.
Changed!
PRK PRK02224 125aa 0.001
in ref transcript
chromosome segregation protein; Provisional.
Changed!
TIGR SMC_prok_A 148aa 8e-05
in modified transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
Changed!
PRK PRK02224 202aa 3e-04
in modified transcript
TRIM4
refseq_TRIM4.F1 refseq_TRIM4.R1 224 302
NCBIGene 36.3 89122
Single exon skipping, size difference: 78
Exclusion in the protein (no frameshift)
Reference transcript: NM_033017
cd RING 45aa 8e-08
in ref transcript
RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
cd BBOX 38aa 1e-05
in ref transcript
B-Box-type zinc finger; zinc binding domain (CHC3H2); often present in combination with other motifs, like RING zinc finger, NHL motif, coiled-coil or RFP domain in functionally unrelated proteins, most likely mediating protein-protein interaction.
smart SPRY 117aa 4e-13
in ref transcript
Domain in SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
smart RING 41aa 5e-08
in ref transcript
Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
pfam zf-C3HC4 25aa 0.010
in ref transcript
Zinc finger, C3HC4 type (RING finger). The C3HC4 type zinc-finger (RING finger) is a cysteine-rich domain of 40 to 60 residues that coordinates two zinc ions, and has the consensus sequence: C-X2-C-X(9-39)-C-X(1-3)-H-X(2-3)-C-X2-C-X(4-48)-C-X2-C where X is any amino acid. Many proteins containing a RING finger play a key role in the ubiquitination pathway.
TRIM55
refseq_TRIM55.F2 refseq_TRIM55.R2 205 293
NCBIGene 36.3 84675
Single exon skipping, size difference: 88
Inclusion in the protein causing a frameshift
Reference transcript: NM_184085
cd RING 60aa 4e-06
in ref transcript
RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
cd BBOX 40aa 0.007
in ref transcript
B-Box-type zinc finger; zinc binding domain (CHC3H2); often present in combination with other motifs, like RING zinc finger, NHL motif, coiled-coil or RFP domain in functionally unrelated proteins, most likely mediating protein-protein interaction.
pfam zf-B_box 40aa 2e-04
in ref transcript
B-box zinc finger.
Changed!
smart RING 56aa 0.001
in ref transcript
Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
smart BBC 127aa 0.005
in ref transcript
B-Box C-terminal domain. Coiled coil region C-terminal to (some) B-Box domains.
Changed!
smart RING 35aa 0.002
in modified transcript
TRIM7
refseq_TRIM7.F1 refseq_TRIM7.R1 121 398
NCBIGene 36.3 81786
Alternative 5-prime, size difference: 277
Exclusion in 5'UTR
Reference transcript: NM_203296
smart SPRY 107aa 5e-22
in ref transcript
Domain in SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
smart PRY 48aa 5e-11
in ref transcript
associated with SPRY domains.
TRIM7
refseq_TRIM7.F2 refseq_TRIM7.R2 226 294
NCBIGene 36.3 81786
Alternative 5-prime, size difference: 68
Inclusion in the protein causing a new stop codon
Reference transcript: NM_203293
cd RING 56aa 1e-05
in ref transcript
RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
cd BBOX 37aa 1e-04
in ref transcript
B-Box-type zinc finger; zinc binding domain (CHC3H2); often present in combination with other motifs, like RING zinc finger, NHL motif, coiled-coil or RFP domain in functionally unrelated proteins, most likely mediating protein-protein interaction.
Changed!
smart SPRY 107aa 2e-22
in ref transcript
Domain in SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
Changed!
smart PRY 48aa 2e-10
in ref transcript
associated with SPRY domains.
Changed!
smart RING 53aa 2e-06
in ref transcript
Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
Changed!
PRK PRK08476 53aa 0.001
in modified transcript
F0F1 ATP synthase subunit B'; Validated.
TRNT1
refseq_TRNT1.F1 refseq_TRNT1.R1 283 343
NCBIGene 36.2 51095
Alternative 5-prime, size difference: 60
Exclusion in the protein (no frameshift)
Reference transcript: NM_016000
pfam PolyA_pol 124aa 3e-32
in ref transcript
Poly A polymerase head domain. This family includes nucleic acid independent RNA polymerases, such as Poly(A) polymerase, which adds the poly (A) tail to mRNA EC:2.7.7.19. This family also includes the tRNA nucleotidyltransferase that adds the CCA to the 3' of the tRNA EC:2.7.7.25.
Changed!
COG PcnB 394aa 2e-51
in ref transcript
tRNA nucleotidyltransferase/poly(A) polymerase [Translation, ribosomal structure and biogenesis].
Changed!
COG PcnB 222aa 3e-49
in modified transcript
TRPC4AP
refseq_TRPC4AP.F1 refseq_TRPC4AP.R1 133 157
NCBIGene 36.3 26133
Alternative 3-prime, size difference: 24
Exclusion in the protein (no frameshift)
Reference transcript: NM_015638
TRPM3
refseq_TRPM3.F2 refseq_TRPM3.R2 197 233
NCBIGene 36.3 80036
Single exon skipping, size difference: 36
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_001007471
pfam Ion_trans 122aa 4e-07
in ref transcript
Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
TIGR trp 320aa 4e-05
in ref transcript
after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
TRPM3
refseq_TRPM3.F3 refseq_TRPM3.R3 149 179
NCBIGene 36.3 80036
Single exon skipping, size difference: 30
Exclusion in the protein (no frameshift)
Reference transcript: NM_001007471
pfam Ion_trans 122aa 4e-07
in ref transcript
Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
TIGR trp 320aa 4e-05
in ref transcript
after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
TRPM3
refseq_TRPM3.F6 refseq_TRPM3.R6 130 205
NCBIGene 36.3 80036
Single exon skipping, size difference: 75
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_001007471
pfam Ion_trans 122aa 4e-07
in ref transcript
Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
TIGR trp 320aa 4e-05
in ref transcript
after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
TRPT1
refseq_TRPT1.F2 refseq_TRPT1.R2 101 185
NCBIGene 36.3 83707
Single exon skipping, size difference: 84
Exclusion of the protein initiation site
Reference transcript: NM_001033678
Changed!
pfam PTS_2-RNA 179aa 2e-33
in ref transcript
RNA 2'-phosphotransferase, Tpt1 / KptA family. Tpt1 catalyses the last step of tRNA splicing in yeast. It transfers the splice junction 2'-phosphate from ligated tRNA to NAD, to produce ADP-ribose 1"-2"-cyclic phosphate. This is presumed to be followed by a transesterification step to release the RNA. The first step of this reaction is similar to that catalysed by some bacterial toxins. Escherichia coli KptA and mouse Tpt1 are likely to use the same reaction mechanism.
Changed!
PTZ PTZ00315 180aa 7e-39
in ref transcript
2'-phosphotransferase; Provisional.
Changed!
pfam PTS_2-RNA 154aa 2e-27
in modified transcript
Changed!
PTZ PTZ00315 150aa 1e-31
in modified transcript
TRPV4
refseq_TRPV4.F2 refseq_TRPV4.R2 165 345
NCBIGene 36.3 59341
Single exon skipping, size difference: 180
Exclusion in the protein (no frameshift)
Reference transcript: NM_021625
Changed!
cd ANK 155aa 6e-11
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
Changed!
TIGR trp 651aa 1e-137
in ref transcript
after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
Changed!
cd ANK 148aa 2e-10
in modified transcript
Changed!
TIGR trp 591aa 1e-119
in modified transcript
TSGA10
refseq_TSGA10.F1 refseq_TSGA10.R1 155 199
NCBIGene 36.3 80705
Alternative 3-prime, size difference: 44
Inclusion in 5'UTR
Reference transcript: NM_182911
TIGR SMC_prok_B 333aa 2e-12
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
TIGR SMC_prok_B 298aa 9e-09
in ref transcript
pfam Macoilin 150aa 0.009
in ref transcript
Transmembrane protein. This entry is a highly conserved protein present in eukaryotes.
COG Smc 363aa 5e-10
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
COG COG4372 204aa 2e-05
in ref transcript
Uncharacterized protein conserved in bacteria with the myosin-like domain [Function unknown].
TSPAN17
refseq_TSPAN17.F1 refseq_TSPAN17.R1 125 157
NCBIGene 36.3 26262
Alternative 5-prime, size difference: 32
Exclusion in the protein causing a frameshift
Reference transcript: NM_012171
cd TM4SF9_like_LEL 124aa 7e-59
in ref transcript
Tetraspanin, extracellular domain or large extracellular loop (LEL), TM4SF9_like subfamily. Tetraspanins are trans-membrane proteins with 4 trans-membrane segments. Both the N- and C-termini lie on the intracellular side of the membrane. This alignment model spans the extracellular domain between the 3rd and 4th trans-membrane segment. Tetraspanins are involved in diverse processes and their various functions may relate to their ability to act as molecular facilitators. Tetraspanins associate laterally with one another and cluster dynamically with numerous parnter domains in membrane microdomains, forming a network of multimolecular complexes, the "tetraspanin web". This subfamily contaions transmembrane 4 superfamily 9 (TM4SF9) or Tetraspanin-5 and related proteins. TM4SF9 is strongly expressed witin the central nervous system, and expression levels appear to correlate with differentiation status of particular neurons, hinting at a role in neuronal maturation.
Changed!
pfam Tetraspannin 252aa 2e-41
in ref transcript
Tetraspanin family.
Changed!
pfam Tetraspannin 245aa 1e-38
in modified transcript
TSPAN3
refseq_TSPAN3.F2 refseq_TSPAN3.R2 187 262
NCBIGene 36.3 10099
Single exon skipping, size difference: 75
Exclusion in the protein (no frameshift)
Reference transcript: NM_005724
Changed!
cd TM4SF8_like_LEL 107aa 6e-48
in ref transcript
Tetraspanin, extracellular domain or large extracellular loop (LEL), TM4SF8_like subfamily. Tetraspanins are trans-membrane proteins with 4 trans-membrane segments. Both the N- and C-termini lie on the intracellular side of the membrane. This alignment model spans the extracellular domain between the 3rd and 4th trans-membrane segment. Tetraspanins are involved in diverse processes and their various functions may relate to their ability to act as molecular facilitators. Tetraspanins associate laterally with one another and cluster dynamically with numerous parnter domains in membrane microdomains, forming a network of multimolecular complexes, the "tetraspanin web". This subfamily contaions transmembrane 4 superfamily 8 (TM4SF8) or Tspan-3 and related proteins. Tspan-3 has been reported to form a complex with integrin beta1 and OSP/claudin-11, which may be involved in oligodendrocyte proliferation and migration.
Changed!
pfam Tetraspannin 228aa 1e-35
in ref transcript
Tetraspanin family.
Changed!
cd TM4SF8_like_LEL 102aa 8e-44
in modified transcript
Changed!
pfam Tetraspannin 203aa 8e-32
in modified transcript
TSPAN4
refseq_TSPAN4.F1 refseq_TSPAN4.R1 117 217
NCBIGene 36.3 7106
Single exon skipping, size difference: 100
Exclusion in 5'UTR
Reference transcript: NM_003271
cd NET-5_like_LEL 98aa 1e-40
in ref transcript
Tetraspanin, extracellular domain or large extracellular loop (LEL), NET-5_like family. Tetraspanins are trans-membrane proteins with 4 trans-membrane segments. Both the N- and C-termini lie on the intracellular side of the membrane. This alignment model spans the extracellular domain between the 3rd and 4th trans-membrane segment. Tetraspanins are involved in diverse processes and their various functions may relate to their ability to act as molecular facilitators. Tetraspanins associate laterally with one another and cluster dynamically with numerous parnter domains in membrane microdomains, forming a network of multimolecular complexes, the "tetraspanin web". This sub-family contains proteins similar to human tetraspan NET-5.
pfam Tetraspannin 222aa 3e-35
in ref transcript
Tetraspanin family.
TSPO
refseq_TSPO.F1 refseq_TSPO.R1 189 400
NCBIGene 36.3 706
Single exon skipping, size difference: 211
Exclusion of the protein initiation site
Reference transcript: NM_000714
Changed!
pfam TspO_MBR 152aa 1e-43
in ref transcript
TspO/MBR family. Tryptophan-rich sensory protein (TspO) is an integral membrane protein that acts as a negative regulator of the expression of specific photosynthesis genes in response to oxygen/light. It is involved in the efflux of porphyrin intermediates from the cell. This reduces the activity of coproporphyrinogen III oxidase, which is thought to lead to the accumulation of a putative repressor molecule that inhibits the expression of specific photosynthesis genes. Several conserved aromatic residues are necessary for TspO function: they are thought to be involved in binding porphyrin intermediates. In, the rat mitochondrial peripheral benzodiazepine receptor (MBR) was shown to not only retain its structure within a bacterial outer membrane, but also to be able to functionally substitute for TspO in TspO- mutants, and to act in a similar manner to TspO in its in situ location: the outer mitochondrial membrane. The biological significance of MBR remains unclear, however. It is thought to be involved in a variety of cellular functions, including cholesterol transport in steroidogenic tissues.
Changed!
COG COG3476 150aa 5e-19
in ref transcript
Tryptophan-rich sensory protein (mitochondrial benzodiazepine receptor homolog) [Signal transduction mechanisms].
TTC8
refseq_TTC8.F2 refseq_TTC8.R2 291 381
NCBIGene 36.3 123016
Single exon skipping, size difference: 90
Exclusion in the protein (no frameshift)
Reference transcript: NM_144596
cd TPR 93aa 1e-08
in ref transcript
Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
cd TPR 90aa 1e-07
in ref transcript
TIGR PEP_TPR_lipo 148aa 5e-06
in ref transcript
This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
TIGR type_IV_pilW 161aa 8e-04
in ref transcript
Members of this family are designated PilF in ref (PMID:8973346) and PilW in ref (PMID:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
COG PilF 202aa 0.001
in ref transcript
Tfp pilus assembly protein PilF [Cell motility and secretion / Intracellular trafficking and secretion].
TTC8
refseq_TTC8.F3 refseq_TTC8.R3 102 132
NCBIGene 36.3 123016
Single exon skipping, size difference: 30
Exclusion in the protein (no frameshift)
Reference transcript: NM_144596
cd TPR 93aa 1e-08
in ref transcript
Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
cd TPR 90aa 1e-07
in ref transcript
TIGR PEP_TPR_lipo 148aa 5e-06
in ref transcript
This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
TIGR type_IV_pilW 161aa 8e-04
in ref transcript
Members of this family are designated PilF in ref (PMID:8973346) and PilW in ref (PMID:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
COG PilF 202aa 0.001
in ref transcript
Tfp pilus assembly protein PilF [Cell motility and secretion / Intracellular trafficking and secretion].
TTLL1
refseq_TTLL1.F1 refseq_TTLL1.R1 199 271
NCBIGene 36.3 25809
Single exon skipping, size difference: 72
Inclusion in the protein causing a new stop codon
Reference transcript: NM_012263
Changed!
pfam TTL 306aa 1e-100
in ref transcript
Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This modification cycle involves a specific carboxypeptidase and the activity of the tubulin-tyrosine ligase (TTL). The true physiological function of TTL has so far not been established. Tubulin-tyrosine ligase (TTL) catalyses the ATP-dependent post-translational addition of a tyrosine to the carboxy terminal end of detyrosinated alpha-tubulin. In normally cycling cells, the tyrosinated form of tubulin predominates. However, in breast cancer cells, the detyrosinated form frequently predominates, with a correlation to tumour aggressiveness. On the other hand, 3-nitrotyrosine has been shown to be incorporated, by TTL, into the carboxy terminal end of detyrosinated alpha-tubulin. This reaction is not reversible by the carboxypeptidase enzyme. Cells cultured in 3-nitrotyrosine rich medium showed evidence of altered microtubule structure and function, including altered cell morphology, epithelial barrier dysfunction, and apoptosis.
TTLL3
refseq_TTLL3.F1 refseq_TTLL3.R1 228 370
NCBIGene 36.3 26140
Single exon skipping, size difference: 142
Exclusion in the protein causing a frameshift
Reference transcript: NM_001025930
pfam TTL 292aa 7e-90
in ref transcript
Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This modification cycle involves a specific carboxypeptidase and the activity of the tubulin-tyrosine ligase (TTL). The true physiological function of TTL has so far not been established. Tubulin-tyrosine ligase (TTL) catalyses the ATP-dependent post-translational addition of a tyrosine to the carboxy terminal end of detyrosinated alpha-tubulin. In normally cycling cells, the tyrosinated form of tubulin predominates. However, in breast cancer cells, the detyrosinated form frequently predominates, with a correlation to tumour aggressiveness. On the other hand, 3-nitrotyrosine has been shown to be incorporated, by TTL, into the carboxy terminal end of detyrosinated alpha-tubulin. This reaction is not reversible by the carboxypeptidase enzyme. Cells cultured in 3-nitrotyrosine rich medium showed evidence of altered microtubule structure and function, including altered cell morphology, epithelial barrier dysfunction, and apoptosis.
TTYH1
refseq_TTYH1.F1 refseq_TTYH1.R1 162 208
NCBIGene 36.3 57348
Single exon skipping, size difference: 46
Inclusion in the protein causing a frameshift
Reference transcript: NM_001005367
Changed!
pfam Tweety 398aa 2e-71
in ref transcript
Tweety. The tweety (tty) gene has not been characterised at the protein level. However, it is thought to form a membrane protein with five potential membrane-spanning regions. A number of potential functions have been suggested.
Changed!
pfam Tweety 397aa 3e-71
in modified transcript
TUSC3
refseq_TUSC3.F1 refseq_TUSC3.R1 159 224
NCBIGene 36.3 7991
Single exon skipping, size difference: 65
Exclusion of the stop codon
Reference transcript: NM_006765
pfam OST3_OST6 308aa 1e-119
in ref transcript
OST3 / OST6 family. The proteins in this family are part of a complex of eight ER proteins that transfers core oligosaccharide from dolichol carrier to Asn-X-Ser/Thr motifs. This family includes both OST3 and OST6, each of which contains four predicted transmembrane helices. Disruption of OST3 and OST6 leads to a defect in the assembly of the complex. Hence, the function of these genes seems to be essential for recruiting a fully active complex necessary for efficient N-glycosylation.
TXNRD1
refseq_TXNRD1.F2 refseq_TXNRD1.R2 279 389
NCBIGene 36.3 7296
Single exon skipping, size difference: 110
Exclusion in the protein causing a frameshift
Reference transcript: NM_003330
Changed!
cd GRX_GRXh_1_2_like 47aa 5e-06
in ref transcript
Glutaredoxin (GRX) family, GRX human class 1 and 2 (h_1_2)-like subfamily; composed of proteins similar to human GRXs, approximately 10 kDa in size, and proteins containing a GRX or GRX-like domain. GRX is a glutathione (GSH) dependent reductase, catalyzing the disulfide reduction of target proteins such as ribonucleotide reductase. It contains a redox active CXXC motif in a TRX fold and uses a similar dithiol mechanism employed by TRXs for intramolecular disulfide bond reduction of protein substrates. Unlike TRX, GRX has preference for mixed GSH disulfide substrates, in which it uses a monothiol mechanism where only the N-terminal cysteine is required. The flow of reducing equivalents in the GRX system goes from NADPH -> GSH reductase -> GSH -> GRX -> protein substrates. By altering the redox state of target proteins, GRX is involved in many cellular functions including DNA synthesis, signal transduction and the defense against oxidative stress. Different classes are known including human GRX1 and GRX2, which are members of this subfamily. Also included in this subfamily are the N-terminal GRX domains of proteins similar to human thioredoxin reductase 1 and 3.
Changed!
TIGR TGR 487aa 0.0
in ref transcript
This homodimeric, FAD-containing member of the pyridine nucleotide disulfide oxidoreductase family contains a C-terminal motif Cys-SeCys-Gly, where SeCys is selenocysteine encoded by TGA (in some sequence reports interpreted as a stop codon). In some members of this subfamily, Cys-SeCys-Gly is replaced by Cys-Cys-Gly. The reach of the selenium atom at the C-term arm of the protein is proposed to allow broad substrate specificity.
Changed!
PTZ PTZ00052 487aa 1e-137
in ref transcript
thioredoxin reductase; Provisional.
TXNRD1
refseq_TXNRD1.F3 refseq_TXNRD1.R3 156 393
NCBIGene 36.3 7296
Alternative 5-prime, size difference: 237
Exclusion of the protein initiation site
Reference transcript: NM_003330
Changed!
cd GRX_GRXh_1_2_like 47aa 5e-06
in ref transcript
Glutaredoxin (GRX) family, GRX human class 1 and 2 (h_1_2)-like subfamily; composed of proteins similar to human GRXs, approximately 10 kDa in size, and proteins containing a GRX or GRX-like domain. GRX is a glutathione (GSH) dependent reductase, catalyzing the disulfide reduction of target proteins such as ribonucleotide reductase. It contains a redox active CXXC motif in a TRX fold and uses a similar dithiol mechanism employed by TRXs for intramolecular disulfide bond reduction of protein substrates. Unlike TRX, GRX has preference for mixed GSH disulfide substrates, in which it uses a monothiol mechanism where only the N-terminal cysteine is required. The flow of reducing equivalents in the GRX system goes from NADPH -> GSH reductase -> GSH -> GRX -> protein substrates. By altering the redox state of target proteins, GRX is involved in many cellular functions including DNA synthesis, signal transduction and the defense against oxidative stress. Different classes are known including human GRX1 and GRX2, which are members of this subfamily. Also included in this subfamily are the N-terminal GRX domains of proteins similar to human thioredoxin reductase 1 and 3.
TIGR TGR 487aa 0.0
in ref transcript
This homodimeric, FAD-containing member of the pyridine nucleotide disulfide oxidoreductase family contains a C-terminal motif Cys-SeCys-Gly, where SeCys is selenocysteine encoded by TGA (in some sequence reports interpreted as a stop codon). In some members of this subfamily, Cys-SeCys-Gly is replaced by Cys-Cys-Gly. The reach of the selenium atom at the C-term arm of the protein is proposed to allow broad substrate specificity.
PTZ PTZ00052 487aa 1e-137
in ref transcript
thioredoxin reductase; Provisional.
TYSND1
refseq_TYSND1.F1 refseq_TYSND1.R1 103 420
NCBIGene 36.3 219743
Multiple exon skipping, size difference: 317
Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
Reference transcript: NM_173555
Changed!
TIGR protease_degS 165aa 4e-07
in ref transcript
This family consists of the periplasmic serine protease DegS (HhoB), a shorter paralog of protease DO (HtrA, DegP) and DegQ (HhoA). It is found in E. coli and several other Proteobacteria of the gamma subdivision. It contains a trypsin domain and a single copy of PDZ domain (in contrast to DegP with two copies). A critical role of this DegS is to sense stress in the periplasm and partially degrade an inhibitor of sigma(E).
Changed!
COG DegQ 173aa 2e-05
in ref transcript
Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain [Posttranslational modification, protein turnover, chaperones].
U2AF1
refseq_U2AF1.F1 refseq_U2AF1.R1 100 167
NCBIGene 36.3 7307
Single exon skipping, size difference: 67
Inclusion in the protein causing a frameshift
Reference transcript: NM_006758
Changed!
cd RRM 44aa 3e-04
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
Changed!
smart RRM_1 44aa 6e-09
in ref transcript
RNA recognition motif.
pfam zf-CCCH 27aa 0.010
in ref transcript
Zinc finger C-x8-C-x5-C-x3-H type (and similar).
U2AF1
refseq_U2AF1.F4 refseq_U2AF1.R4 167 234
NCBIGene 36.3 7307
Single exon skipping, size difference: 67
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001025203
Changed!
cd RRM 44aa 3e-04
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
Changed!
smart RRM_1 44aa 7e-09
in ref transcript
RNA recognition motif.
Changed!
pfam zf-CCCH 27aa 0.006
in modified transcript
Zinc finger C-x8-C-x5-C-x3-H type (and similar).
U2AF1L4
refseq_U2AF1L4.F1 refseq_U2AF1L4.R1 102 160
NCBIGene 36.3 199746
Alternative 5-prime, size difference: 58
Inclusion in the protein causing a frameshift
Reference transcript: NM_144987
cd RRM 63aa 1e-05
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
Changed!
smart RRM_1 64aa 2e-11
in ref transcript
RNA recognition motif.
Changed!
pfam zf-CCCH 27aa 0.002
in ref transcript
Zinc finger C-x8-C-x5-C-x3-H type (and similar).
Changed!
smart RRM_1 66aa 2e-09
in modified transcript
UBA52
refseq_UBA52.F1 refseq_UBA52.R1 112 134
NCBIGene 36.3 7311
Alternative 5-prime, size difference: 22
Exclusion in 5'UTR
Reference transcript: NM_001033930
cd Ubiquitin 76aa 7e-37
in ref transcript
Ubiquitin (includes Ubq/RPL40e and Ubq/RPS27a fusions as well as homopolymeric multiubiquitin protein chains).
pfam ubiquitin 69aa 2e-25
in ref transcript
Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1, and Parkin. A number of them are thought to carry a distinctive five-residue motif termed the proteasome-interacting motif (PIM), which may have a biologically significant role in protein delivery to proteasomes and recruitment of proteasomes to transcription sites.
pfam Ribosomal_L40e 52aa 2e-18
in ref transcript
Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein.
PTZ PTZ00044 76aa 7e-20
in ref transcript
ubiquitin; Provisional.
COG RPL40A 49aa 3e-07
in ref transcript
Ribosomal protein L40E [Translation, ribosomal structure and biogenesis].
UBASH3A
refseq_UBASH3A.F1 refseq_UBASH3A.R1 192 306
NCBIGene 36.3 53347
Single exon skipping, size difference: 114
Exclusion in the protein (no frameshift)
Reference transcript: NM_018961
cd HP_PGM_like 213aa 2e-09
in ref transcript
Histidine phosphatase domain found in phosphoglycerate mutases and related proteins, mostly phosphatases; contains a His residue which is phosphorylated during the reaction. Subgroup of the catalytic domain of a functionally diverse set of proteins, most of which are phosphatases. The conserved catalytic core of this domain contains a His residue which is phosphorylated in the reaction. This subgroup contains cofactor-dependent and cofactor-independent phosphoglycerate mutases (dPGM, and BPGM respectively), fructose-2,6-bisphosphatase (F26BP)ase, Sts-1, SixA, and related proteins. Functions include roles in metabolism, signaling, or regulation, for example, F26BPase affects glycolysis and gluconeogenesis through controlling the concentration of F26BP; BPGM controls the concentration of 2,3-BPG (the main allosteric effector of hemoglobin in human blood cells); human Sts-1 is a T-cell regulator; Escherichia coli Six A participates in the ArcB-dependent His-to-Asp phosphorelay signaling system. Deficiency and mutation in many of the human members result in disease, for example erythrocyte BPGM deficiency is a disease associated with a decrease in the concentration of 2,3-BPG.
cd SH3 57aa 1e-06
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
pfam PGAM 143aa 2e-14
in ref transcript
Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are not included in the Prosite entry. However these sequences are significantly similar and contain identical active site residues.
smart SH3 57aa 8e-07
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
COG GpmB 189aa 4e-12
in ref transcript
Fructose-2,6-bisphosphatase [Carbohydrate transport and metabolism].
COG UBP14 41aa 2e-05
in ref transcript
Isopeptidase T [Posttranslational modification, protein turnover, chaperones].
UBA3
refseq_UBE1C.F1 refseq_UBE1C.R1 134 176
NCBIGene 36.3 9039
Single exon skipping, size difference: 42
Exclusion in the protein (no frameshift)
Reference transcript: NM_003968
cd Uba3_RUB 298aa 1e-157
in ref transcript
Ubiquitin activating enzyme (E1) subunit UBA3. UBA3 is part of the heterodimeric activating enzyme (E1), specific for the Rub family of ubiquitin-like proteins (Ubls). E1 enzymes are part of a conjugation cascade to attach Ub or Ubls, covalently to substrate proteins. consisting of activating (E1), conjugating (E2), and/or ligating (E3) enzymes. E1 activates ubiquitin(-like) by C-terminal adenylation, and subsequently forms a highly reactive thioester bond between its catalytic cysteine and Ubls C-terminus. E1 also associates with E2 and promotes ubiquitin transfer to the E2's catalytic cysteine. Post-translational modification by Rub family of ubiquitin-like proteins (Ublps) activates SCF ubiquitin ligases and is involved in cell cycle control, signaling and embryogenesis. UBA3 contains both the nucleotide-binding motif involved in adenylation and the catalytic cysteine involved in the thioester intermediate and Ublp transfer to E2.
pfam ThiF 144aa 1e-38
in ref transcript
ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family.
pfam E2_bind 89aa 4e-25
in ref transcript
E2 binding domain. E1 and E2 enzymes play a central role in ubiquitin and ubiquitin-like protein transfer cascades. This is an E2 binding domain that is found on NEDD8 activating E1 enzyme. The domain resembles ubiquitin, and recruits the catalytic core of the E2 enzyme Ubc12 in a similar manner to that in which ubiquitin interacts with ubiquitin binding domains.
pfam UBACT 66aa 4e-22
in ref transcript
Repeat in ubiquitin-activating (UBA) protein.
COG ThiF 167aa 2e-34
in ref transcript
Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 [Coenzyme metabolism].
UBA5
refseq_UBE1DC1.F1 refseq_UBE1DC1.R1 97 447
NCBIGene 36.3 79876
Alternative 5-prime, size difference: 350
Inclusion in the protein causing a new stop codon
Reference transcript: NM_024818
Changed!
cd ThiF_MoeB_HesA_family 246aa 2e-53
in ref transcript
ThiF_MoeB_HesA. Family of E1-like enzymes involved in molybdopterin and thiamine biosynthesis family. The common reaction mechanism catalyzed by MoeB and ThiF, like other E1 enzymes, begins with a nucleophilic attack of the C-terminal carboxylate of MoaD and ThiS, respectively, on the alpha-phosphate of an ATP molecule bound at the active site of the activating enzymes, leading to the formation of a high-energy acyladenylate intermediate and subsequently to the formation of a thiocarboxylate at the C termini of MoaD and ThiS. MoeB, as the MPT synthase (MoaE/MoaD complex) sulfurase, is involved in the biosynthesis of the molybdenum cofactor, a derivative of the tricyclic pterin, molybdopterin (MPT). ThiF catalyzes the adenylation of ThiS, as part of the biosynthesis pathway of thiamin pyrophosphate (vitamin B1).
Changed!
pfam ThiF 134aa 4e-23
in ref transcript
ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family.
Changed!
COG ThiF 249aa 7e-20
in ref transcript
Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 [Coenzyme metabolism].
UBE2A
refseq_UBE2A.F1 refseq_UBE2A.R1 102 192
NCBIGene 36.3 7319
Single exon skipping, size difference: 90
Exclusion in the protein (no frameshift)
Reference transcript: NM_003336
Changed!
cd UBCc 135aa 6e-50
in ref transcript
Ubiquitin-conjugating enzyme E2, catalytic (UBCc) domain. This is part of the ubiquitin-mediated protein degradation pathway in which a thiol-ester linkage forms between a conserved cysteine and the C-terminus of ubiquitin and complexes with ubiquitin protein ligase enzymes, E3. This pathway regulates many fundamental cellular processes. There are also other E2s which form thiol-ester linkages without the use of E3s as well as several UBC homologs (TSG101, Mms2, Croc-1 and similar proteins) which lack the active site cysteine essential for ubiquitination and appear to function in DNA repair pathways which were omitted from the scope of this CD.
Changed!
pfam UQ_con 138aa 3e-58
in ref transcript
Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks this active site cysteine.
Changed!
COG COG5078 151aa 4e-52
in ref transcript
Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
Changed!
cd UBCc 105aa 1e-30
in modified transcript
Changed!
pfam UQ_con 108aa 6e-37
in modified transcript
Changed!
COG COG5078 121aa 4e-33
in modified transcript
Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_007019
Changed!
cd UBCc 138aa 6e-45
in ref transcript
Ubiquitin-conjugating enzyme E2, catalytic (UBCc) domain. This is part of the ubiquitin-mediated protein degradation pathway in which a thiol-ester linkage forms between a conserved cysteine and the C-terminus of ubiquitin and complexes with ubiquitin protein ligase enzymes, E3. This pathway regulates many fundamental cellular processes. There are also other E2s which form thiol-ester linkages without the use of E3s as well as several UBC homologs (TSG101, Mms2, Croc-1 and similar proteins) which lack the active site cysteine essential for ubiquitination and appear to function in DNA repair pathways which were omitted from the scope of this CD.
Changed!
pfam UQ_con 137aa 5e-53
in ref transcript
Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks this active site cysteine.
Changed!
COG COG5078 140aa 3e-43
in ref transcript
Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
Changed!
cd UBCc 120aa 8e-13
in modified transcript
Changed!
pfam UQ_con 119aa 1e-12
in modified transcript
Changed!
COG COG5078 122aa 3e-07
in modified transcript
UBE2D2
refseq_UBE2D2.F1 refseq_UBE2D2.R1 251 415
NCBIGene 36.3 7322
Single exon skipping, size difference: 164
Inclusion in the protein causing a new stop codon
Reference transcript: NM_003339
Changed!
cd UBCc 139aa 1e-50
in ref transcript
Ubiquitin-conjugating enzyme E2, catalytic (UBCc) domain. This is part of the ubiquitin-mediated protein degradation pathway in which a thiol-ester linkage forms between a conserved cysteine and the C-terminus of ubiquitin and complexes with ubiquitin protein ligase enzymes, E3. This pathway regulates many fundamental cellular processes. There are also other E2s which form thiol-ester linkages without the use of E3s as well as several UBC homologs (TSG101, Mms2, Croc-1 and similar proteins) which lack the active site cysteine essential for ubiquitination and appear to function in DNA repair pathways which were omitted from the scope of this CD.
Changed!
pfam UQ_con 138aa 3e-57
in ref transcript
Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks this active site cysteine.
Changed!
COG COG5078 147aa 5e-56
in ref transcript
Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
UBE2D3
refseq_UBE2D3.F2 refseq_UBE2D3.R2 102 254
NCBIGene 36.3 7323
Single exon skipping, size difference: 152
Inclusion in the protein causing a new stop codon
Reference transcript: NM_181893
Changed!
cd UBCc 137aa 2e-49
in ref transcript
Ubiquitin-conjugating enzyme E2, catalytic (UBCc) domain. This is part of the ubiquitin-mediated protein degradation pathway in which a thiol-ester linkage forms between a conserved cysteine and the C-terminus of ubiquitin and complexes with ubiquitin protein ligase enzymes, E3. This pathway regulates many fundamental cellular processes. There are also other E2s which form thiol-ester linkages without the use of E3s as well as several UBC homologs (TSG101, Mms2, Croc-1 and similar proteins) which lack the active site cysteine essential for ubiquitination and appear to function in DNA repair pathways which were omitted from the scope of this CD.
Changed!
pfam UQ_con 137aa 4e-55
in ref transcript
Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks this active site cysteine.
Changed!
COG COG5078 147aa 2e-53
in ref transcript
Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
UBE2D3
refseq_UBE2D3.F3 refseq_UBE2D3.R3 184 392
NCBIGene 36.3 7323
Alternative 3-prime, size difference: 208
Inclusion in 5'UTR
Reference transcript: NM_003340
cd UBCc 139aa 4e-50
in ref transcript
Ubiquitin-conjugating enzyme E2, catalytic (UBCc) domain. This is part of the ubiquitin-mediated protein degradation pathway in which a thiol-ester linkage forms between a conserved cysteine and the C-terminus of ubiquitin and complexes with ubiquitin protein ligase enzymes, E3. This pathway regulates many fundamental cellular processes. There are also other E2s which form thiol-ester linkages without the use of E3s as well as several UBC homologs (TSG101, Mms2, Croc-1 and similar proteins) which lack the active site cysteine essential for ubiquitination and appear to function in DNA repair pathways which were omitted from the scope of this CD.
pfam UQ_con 138aa 4e-56
in ref transcript
Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks this active site cysteine.
COG COG5078 147aa 7e-55
in ref transcript
Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
UBE2D3
refseq_UBE2D3.F5 refseq_UBE2D3.R5 101 151
NCBIGene 36.3 7323
Single exon skipping, size difference: 50
Inclusion in the protein causing a new stop codon
Reference transcript: NM_181893
Changed!
cd UBCc 137aa 2e-49
in ref transcript
Ubiquitin-conjugating enzyme E2, catalytic (UBCc) domain. This is part of the ubiquitin-mediated protein degradation pathway in which a thiol-ester linkage forms between a conserved cysteine and the C-terminus of ubiquitin and complexes with ubiquitin protein ligase enzymes, E3. This pathway regulates many fundamental cellular processes. There are also other E2s which form thiol-ester linkages without the use of E3s as well as several UBC homologs (TSG101, Mms2, Croc-1 and similar proteins) which lack the active site cysteine essential for ubiquitination and appear to function in DNA repair pathways which were omitted from the scope of this CD.
Changed!
pfam UQ_con 137aa 4e-55
in ref transcript
Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks this active site cysteine.
Changed!
COG COG5078 147aa 2e-53
in ref transcript
Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
Changed!
cd UBCc 137aa 8e-50
in modified transcript
Changed!
pfam UQ_con 137aa 1e-55
in modified transcript
Changed!
COG COG5078 147aa 3e-54
in modified transcript
UBE2D3
refseq_UBE2D3.F7 refseq_UBE2D3.R3 187 395
NCBIGene 36.3 7323
Alternative 3-prime, size difference: 208
Inclusion in 5'UTR
Reference transcript: NM_181891
cd UBCc 139aa 4e-50
in ref transcript
Ubiquitin-conjugating enzyme E2, catalytic (UBCc) domain. This is part of the ubiquitin-mediated protein degradation pathway in which a thiol-ester linkage forms between a conserved cysteine and the C-terminus of ubiquitin and complexes with ubiquitin protein ligase enzymes, E3. This pathway regulates many fundamental cellular processes. There are also other E2s which form thiol-ester linkages without the use of E3s as well as several UBC homologs (TSG101, Mms2, Croc-1 and similar proteins) which lack the active site cysteine essential for ubiquitination and appear to function in DNA repair pathways which were omitted from the scope of this CD.
pfam UQ_con 138aa 4e-56
in ref transcript
Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks this active site cysteine.
COG COG5078 147aa 7e-55
in ref transcript
Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
UBE2E1
refseq_UBE2E1.F1 refseq_UBE2E1.R1 196 247
NCBIGene 36.3 7324
Single exon skipping, size difference: 51
Exclusion in the protein (no frameshift)
Reference transcript: NM_003341
Changed!
cd UBCc 139aa 2e-47
in ref transcript
Ubiquitin-conjugating enzyme E2, catalytic (UBCc) domain. This is part of the ubiquitin-mediated protein degradation pathway in which a thiol-ester linkage forms between a conserved cysteine and the C-terminus of ubiquitin and complexes with ubiquitin protein ligase enzymes, E3. This pathway regulates many fundamental cellular processes. There are also other E2s which form thiol-ester linkages without the use of E3s as well as several UBC homologs (TSG101, Mms2, Croc-1 and similar proteins) which lack the active site cysteine essential for ubiquitination and appear to function in DNA repair pathways which were omitted from the scope of this CD.
Changed!
pfam UQ_con 138aa 4e-54
in ref transcript
Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks this active site cysteine.
Changed!
COG COG5078 147aa 1e-48
in ref transcript
Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
Changed!
cd UBCc 121aa 3e-41
in modified transcript
Changed!
pfam UQ_con 121aa 4e-48
in modified transcript
Changed!
COG COG5078 125aa 2e-42
in modified transcript
UBE2G1
refseq_UBE2G1.F1 refseq_UBE2G1.R1 118 395
NCBIGene 36.2 7326
Multiple exon skipping, size difference: 277
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_003342
Changed!
cd UBCc 143aa 2e-41
in ref transcript
Ubiquitin-conjugating enzyme E2, catalytic (UBCc) domain. This is part of the ubiquitin-mediated protein degradation pathway in which a thiol-ester linkage forms between a conserved cysteine and the C-terminus of ubiquitin and complexes with ubiquitin protein ligase enzymes, E3. This pathway regulates many fundamental cellular processes. There are also other E2s which form thiol-ester linkages without the use of E3s as well as several UBC homologs (TSG101, Mms2, Croc-1 and similar proteins) which lack the active site cysteine essential for ubiquitination and appear to function in DNA repair pathways which were omitted from the scope of this CD.
Changed!
pfam UQ_con 143aa 4e-48
in ref transcript
Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks this active site cysteine.
Changed!
COG COG5078 146aa 9e-45
in ref transcript
Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
Changed!
cd UBCc 30aa 0.002
in modified transcript
Changed!
smart UBCc 31aa 3e-05
in modified transcript
Ubiquitin-conjugating enzyme E2, catalytic domain homologues. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. This pathway functions in regulating many fundamental processes required for cell viability.TSG101 is one of several UBC homologues that lacks this active site cysteine.
Changed!
COG COG5078 30aa 4e-05
in modified transcript
UBE2G2
refseq_UBE2G2.F1 refseq_UBE2G2.R1 106 221
NCBIGene 36.3 7327
Single exon skipping, size difference: 115
Inclusion in the protein causing a new stop codon
Reference transcript: NM_003343
Changed!
cd UBCc 153aa 3e-43
in ref transcript
Ubiquitin-conjugating enzyme E2, catalytic (UBCc) domain. This is part of the ubiquitin-mediated protein degradation pathway in which a thiol-ester linkage forms between a conserved cysteine and the C-terminus of ubiquitin and complexes with ubiquitin protein ligase enzymes, E3. This pathway regulates many fundamental cellular processes. There are also other E2s which form thiol-ester linkages without the use of E3s as well as several UBC homologs (TSG101, Mms2, Croc-1 and similar proteins) which lack the active site cysteine essential for ubiquitination and appear to function in DNA repair pathways which were omitted from the scope of this CD.
Changed!
pfam UQ_con 152aa 4e-50
in ref transcript
Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks this active site cysteine.
Changed!
COG COG5078 163aa 5e-48
in ref transcript
Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
UBE2H
refseq_UBE2H.F2 refseq_UBE2H.R2 248 341
NCBIGene 36.3 7328
Multiple exon skipping, size difference: 93
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_003344
Changed!
cd UBCc 104aa 4e-30
in ref transcript
Ubiquitin-conjugating enzyme E2, catalytic (UBCc) domain. This is part of the ubiquitin-mediated protein degradation pathway in which a thiol-ester linkage forms between a conserved cysteine and the C-terminus of ubiquitin and complexes with ubiquitin protein ligase enzymes, E3. This pathway regulates many fundamental cellular processes. There are also other E2s which form thiol-ester linkages without the use of E3s as well as several UBC homologs (TSG101, Mms2, Croc-1 and similar proteins) which lack the active site cysteine essential for ubiquitination and appear to function in DNA repair pathways which were omitted from the scope of this CD.
Changed!
pfam UQ_con 104aa 2e-34
in ref transcript
Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks this active site cysteine.
Changed!
COG COG5078 131aa 2e-29
in ref transcript
Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
Changed!
cd UBCc 73aa 6e-14
in modified transcript
Changed!
pfam UQ_con 73aa 8e-17
in modified transcript
Changed!
COG COG5078 100aa 7e-12
in modified transcript
UBE2I
refseq_UBE2I.F1 refseq_UBE2I.R1 104 361
NCBIGene 36.3 7329
Single exon skipping, size difference: 257
Exclusion in 5'UTR
Reference transcript: NM_194259
cd UBCc 146aa 5e-40
in ref transcript
Ubiquitin-conjugating enzyme E2, catalytic (UBCc) domain. This is part of the ubiquitin-mediated protein degradation pathway in which a thiol-ester linkage forms between a conserved cysteine and the C-terminus of ubiquitin and complexes with ubiquitin protein ligase enzymes, E3. This pathway regulates many fundamental cellular processes. There are also other E2s which form thiol-ester linkages without the use of E3s as well as several UBC homologs (TSG101, Mms2, Croc-1 and similar proteins) which lack the active site cysteine essential for ubiquitination and appear to function in DNA repair pathways which were omitted from the scope of this CD.
pfam UQ_con 145aa 1e-53
in ref transcript
Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks this active site cysteine.
COG COG5078 156aa 4e-45
in ref transcript
Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
UBE2L3
refseq_UBE2L3.F2 refseq_UBE2L3.R2 283 392
NCBIGene 36.3 7332
Single exon skipping, size difference: 109
Exclusion in 5'UTR
Reference transcript: NM_198157
UBE2V1
refseq_UBE2V1.F1 refseq_UBE2V1.R1 147 296
NCBIGene 36.3 7335
Single exon skipping, size difference: 149
Exclusion in the protein causing a frameshift
Reference transcript: NM_199144
Changed!
cd UBCc 117aa 1e-11
in ref transcript
Ubiquitin-conjugating enzyme E2, catalytic (UBCc) domain. This is part of the ubiquitin-mediated protein degradation pathway in which a thiol-ester linkage forms between a conserved cysteine and the C-terminus of ubiquitin and complexes with ubiquitin protein ligase enzymes, E3. This pathway regulates many fundamental cellular processes. There are also other E2s which form thiol-ester linkages without the use of E3s as well as several UBC homologs (TSG101, Mms2, Croc-1 and similar proteins) which lack the active site cysteine essential for ubiquitination and appear to function in DNA repair pathways which were omitted from the scope of this CD.
Changed!
smart UBCc 128aa 3e-25
in ref transcript
Ubiquitin-conjugating enzyme E2, catalytic domain homologues. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. This pathway functions in regulating many fundamental processes required for cell viability.TSG101 is one of several UBC homologues that lacks this active site cysteine.
Changed!
COG COG5078 131aa 4e-16
in ref transcript
Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
UBE2W
refseq_UBE2W.F1 refseq_UBE2W.R1 198 301
NCBIGene 36.2 55284
Single exon skipping, size difference: 103
Exclusion in the protein causing a frameshift
Reference transcript: NM_001001481
Changed!
cd UBCc 115aa 3e-25
in ref transcript
Ubiquitin-conjugating enzyme E2, catalytic (UBCc) domain. This is part of the ubiquitin-mediated protein degradation pathway in which a thiol-ester linkage forms between a conserved cysteine and the C-terminus of ubiquitin and complexes with ubiquitin protein ligase enzymes, E3. This pathway regulates many fundamental cellular processes. There are also other E2s which form thiol-ester linkages without the use of E3s as well as several UBC homologs (TSG101, Mms2, Croc-1 and similar proteins) which lack the active site cysteine essential for ubiquitination and appear to function in DNA repair pathways which were omitted from the scope of this CD.
Changed!
pfam UQ_con 142aa 3e-30
in ref transcript
Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks this active site cysteine.
Changed!
COG COG5078 114aa 2e-25
in ref transcript
Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
UBE2W
refseq_UBE2W.F2 refseq_UBE2W.R2 107 140
NCBIGene 36.3 55284
Single exon skipping, size difference: 33
Exclusion in the protein (no frameshift)
Reference transcript: NM_001001481
Changed!
cd UBCc 115aa 3e-25
in ref transcript
Ubiquitin-conjugating enzyme E2, catalytic (UBCc) domain. This is part of the ubiquitin-mediated protein degradation pathway in which a thiol-ester linkage forms between a conserved cysteine and the C-terminus of ubiquitin and complexes with ubiquitin protein ligase enzymes, E3. This pathway regulates many fundamental cellular processes. There are also other E2s which form thiol-ester linkages without the use of E3s as well as several UBC homologs (TSG101, Mms2, Croc-1 and similar proteins) which lack the active site cysteine essential for ubiquitination and appear to function in DNA repair pathways which were omitted from the scope of this CD.
pfam UQ_con 142aa 3e-30
in ref transcript
Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks this active site cysteine.
Changed!
COG COG5078 114aa 2e-25
in ref transcript
Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
Changed!
cd UBCc 114aa 3e-25
in modified transcript
Changed!
COG COG5078 114aa 2e-25
in modified transcript
UBE3B
refseq_UBE3B.F1 refseq_UBE3B.R1 106 451
NCBIGene 36.3 89910
Alternative 5-prime, size difference: 345
Exclusion in 5'UTR
Reference transcript: NM_130466
cd HECTc 385aa 1e-113
in ref transcript
HECT domain; C-terminal catalytic domain of a subclass of Ubiquitin-protein ligase (E3). It binds specific ubiquitin-conjugating enzymes (E2), accepts ubiquitin from E2, transfers ubiquitin to substrate lysine side chains, and transfers additional ubiquitin molecules to the end of growing ubiquitin chains.
pfam HECT 331aa 1e-94
in ref transcript
HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus.
COG HUL4 399aa 2e-79
in ref transcript
Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
UBL7
refseq_UBL7.F1 refseq_UBL7.R1 167 238
NCBIGene 36.3 84993
Alternative 5-prime, size difference: 71
Exclusion in 5'UTR
Reference transcript: NM_032907
cd BMSC_UbP_N 75aa 1e-29
in ref transcript
BMSC_UbP (bone marrow stromal cell-derived ubiquitin-like protein) has an N-terminal ubiquitin-like (UBQ) domain and a C-terminal ubiquitin-associated (UBA) domain, a domain architecture similar to those of the UBIN, Chap1, and ubiquilin proteins. This CD represents the N-terminal ubiquitin-like domain.
pfam ubiquitin 54aa 1e-07
in ref transcript
Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1, and Parkin. A number of them are thought to carry a distinctive five-residue motif termed the proteasome-interacting motif (PIM), which may have a biologically significant role in protein delivery to proteasomes and recruitment of proteasomes to transcription sites.
PTZ PTZ00044 34aa 0.001
in ref transcript
ubiquitin; Provisional.
UBOX5
refseq_UBOX5.F2 refseq_UBOX5.R2 142 304
NCBIGene 36.3 22888
Single exon skipping, size difference: 162
Exclusion in the protein (no frameshift)
Reference transcript: NM_014948
cd RING 50aa 0.005
in ref transcript
RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
pfam U-box 81aa 2e-23
in ref transcript
U-box domain. This domain is related to the Ring finger pfam00097 but lacks the zinc binding residues.
COG UFD2 79aa 1e-05
in ref transcript
Ubiquitin fusion degradation protein 2 [Posttranslational modification, protein turnover, chaperones].
UBQLN1
refseq_UBQLN1.F1 refseq_UBQLN1.R1 100 184
NCBIGene 36.3 29979
Single exon skipping, size difference: 84
Exclusion in the protein (no frameshift)
Reference transcript: NM_013438
cd hPLIC_N 58aa 4e-24
in ref transcript
hPLIC-1 and hPLIC-2 (human homologs of the yeast ubiquitin-like Dsk2 protein) are type2 UBL's (ubiquitin-like) proteins that are thought to serve as adaptors that link the ubiquitination machinery to the proteasome. The hPLIC's have an N-terminal UBL domain that binds the S5a subunit of the proteasome and a C-terminal UBA (ubiquitin-associated) domain that binds a ubiquitylated protein.
cd UBA 38aa 6e-04
in ref transcript
Ubiquitin Associated domain. The UBA domain is a commonly occurring sequence motif in some members of the ubiquitination pathway, UV excision repair proteins, and certain protein kinases. Although its specific role is so far unknown, it has been suggested that UBA domains are involved in conferring protein target specificity. The domain, a compact three helix bundle, has a conserved GFP-loop and the proline is thought to be critical for binding. The UBA domain is distinct from the conserved three helical domain seen in the N-terminus of EF-TS and eukaryotic NAC proteins.
pfam ubiquitin 59aa 3e-12
in ref transcript
Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1, and Parkin. A number of them are thought to carry a distinctive five-residue motif termed the proteasome-interacting motif (PIM), which may have a biologically significant role in protein delivery to proteasomes and recruitment of proteasomes to transcription sites.
smart UBA 37aa 4e-04
in ref transcript
Ubiquitin associated domain. Present in Rad23, SNF1-like kinases. The newly-found UBA in p62 is known to bind ubiquitin.
UBXD5
refseq_UBXD5.F2 refseq_UBXD5.R2 132 231
NCBIGene 36.3 91544
Single exon skipping, size difference: 99
Exclusion in the protein (no frameshift)
Reference transcript: NM_183008
pfam SEP 72aa 6e-19
in ref transcript
SEP domain. The SEP domain is named after Saccharomyces cerevisiae Shp1, Drosophila melanogaster eyes closed gene (eyc), and vertebrate p47. In p47, the SEP domain has been shown to bind to and inhibit the cysteine protease cathepsin L. Most SEP domains are succeeded closely by a UBX domain.
PRK bchH 138aa 0.007
in ref transcript
magnesium chelatase subunit H; Provisional.
UBXD5
refseq_UBXD5.F3 refseq_UBXD5.R3 121 246
NCBIGene 36.3 91544
Single exon skipping, size difference: 125
Exclusion in 5'UTR
Reference transcript: NM_183008
pfam SEP 72aa 6e-19
in ref transcript
SEP domain. The SEP domain is named after Saccharomyces cerevisiae Shp1, Drosophila melanogaster eyes closed gene (eyc), and vertebrate p47. In p47, the SEP domain has been shown to bind to and inhibit the cysteine protease cathepsin L. Most SEP domains are succeeded closely by a UBX domain.
PRK bchH 138aa 0.007
in ref transcript
magnesium chelatase subunit H; Provisional.
UCRC
refseq_UCRC.F1 refseq_UCRC.R1 297 356
NCBIGene 36.3 29796
Alternative 5-prime, size difference: 59
Exclusion in 5'UTR
Reference transcript: NM_001003684
UFD1L
refseq_UFD1L.F1 refseq_UFD1L.R1 175 208
NCBIGene 36.3 7353
Single exon skipping, size difference: 33
Exclusion in the protein (no frameshift)
Reference transcript: NM_005659
Changed!
pfam UFD1 181aa 6e-88
in ref transcript
Ubiquitin fusion degradation protein UFD1. Post-translational ubiquitin-protein conjugates are recognised for degradation by the ubiquitin fusion degradation (UFD) pathway. Several proteins involved in this pathway have been identified. This family includes UFD1, a 40kD protein that is essential for vegetative cell viability. The human UFD1 gene is expressed at high levels during embryogenesis, especially in the eyes and in the inner ear primordia and is thought to be important in the determination of ectoderm-derived structures, including neural crest cells. In addition, this gene is deleted in the CATCH-22 (cardiac defects, abnormal facies, thymic hypoplasia, cleft palate and hypocalcaemia with deletions on chromosome 22) syndrome. This clinical syndrome is associated with a variety of developmental defects, all characterised by microdeletions on 22q11.2. Two such developmental defects are the DiGeorge syndrome OMIM:188400, and the velo-cardio- facial syndrome OMIM:145410. Several of the abnormalities associated with these conditions are thought to be due to defective neural crest cell differentiation.
Changed!
COG UFD1 285aa 1e-55
in ref transcript
Ubiquitin fusion-degradation protein [Posttranslational modification, protein turnover, chaperones].
Changed!
pfam UFD1 170aa 9e-79
in modified transcript
Changed!
COG UFD1 274aa 1e-48
in modified transcript
UGCGL1
refseq_UGCGL1.F1 refseq_UGCGL1.R1 175 356
NCBIGene 36.3 56886
Alternative 5-prime, size difference: 181
Inclusion in the protein causing a new stop codon
Reference transcript: NM_020120
Changed!
cd GT8_HUGT1_C_like 248aa 1e-145
in ref transcript
The C-terminal domain of HUGT1-like is highly homologous to the GT 8 family. C-terminal domain of glycoprotein glucosyltransferase (UGT). UGT is a large glycoprotein whose C-terminus contains the catalytic activity. This catalytic C-terminal domain is highly homologous to Glycosyltransferase Family 8 (GT 8) and contains the DXD motif that coordinates donor sugar binding, characteristic for Family 8 glycosyltransferases. GT 8 proteins are retaining enzymes based on the relative anomeric stereochemistry of the substrate and product in the reaction catalyzed. The non-catalytic N-terminal portion of the human UTG1 (HUGT1) has been shown to monitor the protein folding status and activate its glucosyltransferase activity.
Changed!
pfam UDP-g_GGTase 209aa 3e-79
in ref transcript
UDP-glucose:Glycoprotein Glucosyltransferase. The N-terminal region of this group of proteins is required for correct folding of the ER UDP-Glc: glucosyltransferase.
UCHL5IP
refseq_UIP1.F1 refseq_UIP1.R1 112 289
NCBIGene 36.3 55559
Single exon skipping, size difference: 177
Exclusion in the protein (no frameshift)
Reference transcript: NM_207107
UIP1
refseq_UIP1.F3 refseq_UIP1.R3 135 253
NCBIGene 36.2 55559
Alternative 3-prime, size difference: 118
Exclusion in the protein causing a frameshift
Reference transcript: NM_207107
UNC45A
refseq_UNC45A.F2 refseq_UNC45A.R2 158 203
NCBIGene 36.3 55898
Alternative 5-prime, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_018671
cd TPR 103aa 1e-12
in ref transcript
Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
cd ARM 120aa 3e-05
in ref transcript
Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
TIGR 3a0801s09 93aa 5e-10
in ref transcript
COG NrfG 95aa 0.001
in ref transcript
FOG: TPR repeat [General function prediction only].
UNG2
refseq_UNG2.F1 refseq_UNG2.R1 102 478
NCBIGene 36.2 10309
Alternative 5-prime, size difference: 376
Inclusion in the protein causing a new stop codon
Reference transcript: NM_021147
Changed!
cd CYCLIN 90aa 9e-11
in ref transcript
Cyclin box fold. Protein binding domain functioning in cell-cycle and transcription control. Present in cyclins, TFIIB and Retinoblastoma (RB).The cyclins consist of 8 classes of cell cycle regulators that regulate cyclin dependent kinases (CDKs). TFIIB is a transcription factor that binds the TATA box. Cyclins, TFIIB and RB contain 2 copies of the domain.
Changed!
pfam Cyclin_N 125aa 5e-24
in ref transcript
Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-terminal domain.
Changed!
COG COG5024 175aa 3e-20
in ref transcript
Cyclin [Cell division and chromosome partitioning].
UPF3A
refseq_UPF3A.F1 refseq_UPF3A.R1 341 440
NCBIGene 36.3 65110
Single exon skipping, size difference: 99
Exclusion in the protein (no frameshift)
Reference transcript: NM_023011
Changed!
pfam Smg4_UPF3 163aa 2e-48
in ref transcript
Smg-4/UPF3 family. This family contains proteins that are involved in nonsense mediated mRNA decay. A process that is triggered by premature stop codons in mRNA. The family includes Smg-4 and UPF3.
Changed!
pfam Smg4_UPF3 130aa 2e-35
in modified transcript
UPK3B
refseq_UPK3B.F2 refseq_UPK3B.R2 185 265
NCBIGene 36.3 80761
Single exon skipping, size difference: 80
Inclusion in the protein causing a frameshift
Reference transcript: NM_030570
USF1
refseq_USF1.F2 refseq_USF1.R2 165 196
NCBIGene 36.3 7391
Alternative 5-prime, size difference: 31
Exclusion in the protein causing a frameshift
Reference transcript: NM_007122
Changed!
cd HLH 63aa 2e-10
in ref transcript
Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
Changed!
pfam HLH 56aa 3e-12
in ref transcript
Helix-loop-helix DNA-binding domain.
USF2
refseq_USF2.F1 refseq_USF2.R1 155 356
NCBIGene 36.3 7392
Single exon skipping, size difference: 201
Exclusion in the protein (no frameshift)
Reference transcript: NM_003367
cd HLH 63aa 7e-11
in ref transcript
Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
pfam HLH 56aa 5e-12
in ref transcript
Helix-loop-helix DNA-binding domain.
TIGR DNA_bind_RsfA 63aa 0.006
in ref transcript
In a subset of endospore-forming members of the Firmcutes, members of this protein family are found, several to a genome. Two very strongly conserved sequences regions are separated by a highly variable linker region. Much of the linker region was excised from the seed alignment for this model. A characterized member is the prespore-specific transcription RsfA from Bacillus subtilis, previously called YwfN, which is controlled by sigma factor F and seems to fine-tune expression of some genes in the sigma-F regulon. A paralog in Bacillus subtilis is designated YlbO.
USH1C
refseq_USH1C.F1 refseq_USH1C.R1 114 223
NCBIGene 36.3 10083
Single exon skipping, size difference: 109
Exclusion in the protein causing a frameshift
Reference transcript: NM_153676
cd PDZ_signaling 81aa 7e-17
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
cd PDZ_signaling 81aa 1e-14
in ref transcript
cd PDZ_signaling 88aa 3e-08
in ref transcript
smart PDZ 83aa 9e-16
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
smart PDZ 72aa 8e-14
in ref transcript
TIGR degP_htrA_DO 194aa 2e-10
in ref transcript
This family consists of a set proteins various designated DegP, heat shock protein HtrA, and protease DO. The ortholog in Pseudomonas aeruginosa is designated MucD and is found in an operon that controls mucoid phenotype. This family also includes the DegQ (HhoA) paralog in E. coli which can rescue a DegP mutant, but not the smaller DegS paralog, which cannot. Members of this family are located in the periplasm and have separable functions as both protease and chaperone. Members have a trypsin domain and two copies of a PDZ domain. This protein protects bacteria from thermal and other stresses and may be important for the survival of bacterial pathogens.// The chaperone function is dominant at low temperatures, whereas the proteolytic activity is turned on at elevated temperatures.
smart PDZ 87aa 6e-09
in ref transcript
PRK PRK10942 172aa 2e-05
in ref transcript
serine endoprotease; Provisional.
USP1
refseq_USP1.F1 refseq_USP1.R1 170 265
NCBIGene 36.3 7398
Alternative 5-prime, size difference: 95
Exclusion in 5'UTR
Reference transcript: NM_001017415
cd Peptidase_C19O 176aa 2e-84
in ref transcript
A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
cd Peptidase_C19O 208aa 5e-46
in ref transcript
cd Peptidase_C19O 38aa 0.001
in ref transcript
pfam UCH 196aa 1e-26
in ref transcript
Ubiquitin carboxyl-terminal hydrolase.
pfam UCH 157aa 7e-18
in ref transcript
COG UBP5 177aa 7e-11
in ref transcript
Ubiquitin C-terminal hydrolase [Posttranslational modification, protein turnover, chaperones].
COG UBP5 150aa 1e-07
in ref transcript
USP14
refseq_USP14.F1 refseq_USP14.R1 129 234
NCBIGene 36.3 9097
Single exon skipping, size difference: 105
Exclusion in the protein (no frameshift)
Reference transcript: NM_005151
cd Peptidase_C19A 376aa 1e-113
in ref transcript
A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyse bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
Changed!
cd UBP_N 70aa 8e-05
in ref transcript
The UBP (ubiquitin processing protease) domain (also referred to as USP which stands for "ubiquitin-specific protease") is present at in a large family of cysteine proteases that specifically cleave ubiquitin conjugates. This family includes Rpn11, UBP6 (USP14), USP7 (HAUSP). This domain is closely related to the amino-terminal ubiquitin-like domain of BAG1 (Bcl2-associated anthanogene1) protein and is found only in eukaryotes.
pfam UCH 379aa 4e-59
in ref transcript
Ubiquitin carboxyl-terminal hydrolase.
Changed!
smart UBQ 63aa 3e-06
in ref transcript
Ubiquitin homologues. Ubiquitin-mediated proteolysis is involved in the regulated turnover of proteins required for controlling cell cycle progression.
Changed!
COG UBP12 130aa 4e-07
in ref transcript
Ubiquitin C-terminal hydrolase [Posttranslational modification, protein turnover, chaperones].
COG COG5077 117aa 3e-06
in ref transcript
Ubiquitin carboxyl-terminal hydrolase [Posttranslational modification, protein turnover, chaperones].
Changed!
cd UBP_N 64aa 0.009
in modified transcript
Changed!
smart UBQ 53aa 1e-05
in modified transcript
Changed!
COG UBP12 114aa 4e-07
in modified transcript
USP16
refseq_USP16.F2 refseq_USP16.R2 181 244
NCBIGene 36.3 10600
Single exon skipping, size difference: 63
Exclusion in 5'UTR
Reference transcript: NM_001032410
cd Peptidase_C19K 198aa 7e-64
in ref transcript
A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
cd Peptidase_C19K 206aa 1e-37
in ref transcript
pfam UCH 196aa 7e-32
in ref transcript
Ubiquitin carboxyl-terminal hydrolase.
pfam UCH 200aa 7e-27
in ref transcript
pfam zf-UBP 82aa 7e-08
in ref transcript
Zn-finger in ubiquitin-hydrolases and other protein.
COG UBP12 204aa 3e-17
in ref transcript
Ubiquitin C-terminal hydrolase [Posttranslational modification, protein turnover, chaperones].
COG UBP12 203aa 1e-16
in ref transcript
USP20
refseq_USP20.F1 refseq_USP20.R1 122 220
NCBIGene 36.3 10868
Alternative 5-prime, size difference: 98
Exclusion in 5'UTR
Reference transcript: NM_001008563
cd Peptidase_C19R 241aa 7e-49
in ref transcript
A subfamily of peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
cd Peptidase_C19D 103aa 3e-12
in ref transcript
A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
cd Peptidase_C19M 209aa 6e-09
in ref transcript
A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
pfam UCH 240aa 4e-47
in ref transcript
Ubiquitin carboxyl-terminal hydrolase.
pfam UCH 106aa 2e-25
in ref transcript
smart DUSP 84aa 2e-25
in ref transcript
Domain in ubiquitin-specific proteases.
smart DUSP 82aa 2e-19
in ref transcript
pfam zf-UBP 64aa 8e-07
in ref transcript
Zn-finger in ubiquitin-hydrolases and other protein.
COG UBP12 164aa 4e-29
in ref transcript
Ubiquitin C-terminal hydrolase [Posttranslational modification, protein turnover, chaperones].
COG UBP12 108aa 2e-17
in ref transcript
COG UBP14 238aa 4e-13
in ref transcript
Isopeptidase T [Posttranslational modification, protein turnover, chaperones].
COG UBP12 62aa 8e-05
in ref transcript
USP21
refseq_USP21.F1 refseq_USP21.R1 250 391
NCBIGene 36.3 27005
Single exon skipping, size difference: 141
Exclusion in 5'UTR
Reference transcript: NM_001014443
cd Peptidase_C19R 259aa 1e-60
in ref transcript
A subfamily of peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
cd Peptidase_C19R 22aa 5e-06
in ref transcript
cd Peptidase_C19F 52aa 4e-04
in ref transcript
A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
pfam UCH 344aa 1e-80
in ref transcript
Ubiquitin carboxyl-terminal hydrolase.
COG UBP5 346aa 2e-50
in ref transcript
Ubiquitin C-terminal hydrolase [Posttranslational modification, protein turnover, chaperones].
USP33
refseq_USP33.F1 refseq_USP33.R1 278 457
NCBIGene 36.3 23032
Single exon skipping, size difference: 179
Exclusion of the protein initiation site
Reference transcript: NM_015017
cd Peptidase_C19R 241aa 7e-50
in ref transcript
A subfamily of peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
cd Peptidase_C19E 100aa 7e-13
in ref transcript
A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
cd Peptidase_C19M 215aa 5e-11
in ref transcript
A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
pfam UCH 259aa 4e-49
in ref transcript
Ubiquitin carboxyl-terminal hydrolase.
smart DUSP 83aa 1e-24
in ref transcript
Domain in ubiquitin-specific proteases.
pfam UCH 102aa 2e-23
in ref transcript
smart DUSP 70aa 1e-12
in ref transcript
pfam zf-UBP 73aa 1e-07
in ref transcript
Zn-finger in ubiquitin-hydrolases and other protein.
COG UBP12 155aa 1e-30
in ref transcript
Ubiquitin C-terminal hydrolase [Posttranslational modification, protein turnover, chaperones].
COG UBP12 102aa 1e-14
in ref transcript
COG UBP14 245aa 8e-14
in ref transcript
Isopeptidase T [Posttranslational modification, protein turnover, chaperones].
COG UBP12 63aa 6e-05
in ref transcript
USP33
refseq_USP33.F3 refseq_USP33.R3 136 160
NCBIGene 36.3 23032
Alternative 3-prime, size difference: 24
Exclusion in the protein (no frameshift)
Reference transcript: NM_015017
Changed!
cd Peptidase_C19R 241aa 7e-50
in ref transcript
A subfamily of peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
cd Peptidase_C19E 100aa 7e-13
in ref transcript
A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
cd Peptidase_C19M 215aa 5e-11
in ref transcript
A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
Changed!
pfam UCH 259aa 4e-49
in ref transcript
Ubiquitin carboxyl-terminal hydrolase.
smart DUSP 83aa 1e-24
in ref transcript
Domain in ubiquitin-specific proteases.
pfam UCH 102aa 2e-23
in ref transcript
smart DUSP 70aa 1e-12
in ref transcript
pfam zf-UBP 73aa 1e-07
in ref transcript
Zn-finger in ubiquitin-hydrolases and other protein.
COG UBP12 155aa 1e-30
in ref transcript
Ubiquitin C-terminal hydrolase [Posttranslational modification, protein turnover, chaperones].
COG UBP12 102aa 1e-14
in ref transcript
COG UBP14 245aa 8e-14
in ref transcript
Isopeptidase T [Posttranslational modification, protein turnover, chaperones].
COG UBP12 63aa 6e-05
in ref transcript
Changed!
cd Peptidase_C19R 233aa 7e-51
in modified transcript
Changed!
pfam UCH 251aa 5e-50
in modified transcript
Changed!
COG UBP14 180aa 0.002
in modified transcript
USP4
refseq_USP4.F1 refseq_USP4.R1 226 367
NCBIGene 36.3 7375
Single exon skipping, size difference: 141
Exclusion in the protein (no frameshift)
Reference transcript: NM_003363
cd Peptidase_C19R 146aa 1e-47
in ref transcript
A subfamily of peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
cd Peptidase_C19E 170aa 1e-17
in ref transcript
A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
pfam DUF1055 135aa 2e-57
in ref transcript
Domain of Unknown Function (DUF1055). This region is found in Ubiquitin-specific proteases.
pfam UCH 150aa 4e-49
in ref transcript
Ubiquitin carboxyl-terminal hydrolase.
pfam UCH 181aa 1e-43
in ref transcript
smart DUSP 97aa 1e-25
in ref transcript
Domain in ubiquitin-specific proteases.
Changed!
COG UBP12 274aa 3e-65
in ref transcript
Ubiquitin C-terminal hydrolase [Posttranslational modification, protein turnover, chaperones].
COG UBP12 149aa 1e-47
in ref transcript
Changed!
COG UBP12 150aa 0.009
in ref transcript
Changed!
COG UBP12 502aa 3e-71
in modified transcript
USP45
refseq_USP45.F1 refseq_USP45.R1 209 383
NCBIGene 36.2 85015
Multiple exon skipping, size difference: 174
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: XM_931094
cd Peptidase_C19K 209aa 1e-54
in ref transcript
A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
Changed!
cd Peptidase_C19K 197aa 1e-29
in ref transcript
pfam UCH 195aa 1e-27
in ref transcript
Ubiquitin carboxyl-terminal hydrolase.
Changed!
pfam UCH 197aa 3e-20
in ref transcript
pfam zf-UBP 56aa 6e-07
in ref transcript
Zn-finger in ubiquitin-hydrolases and other protein.
Changed!
COG UBP14 244aa 1e-15
in ref transcript
Isopeptidase T [Posttranslational modification, protein turnover, chaperones].
COG UBP12 241aa 2e-13
in ref transcript
Ubiquitin C-terminal hydrolase [Posttranslational modification, protein turnover, chaperones].
Changed!
cd Peptidase_C19K 139aa 5e-21
in modified transcript
Changed!
pfam UCH 168aa 5e-14
in modified transcript
Changed!
COG UBP14 255aa 4e-16
in modified transcript
USP9X
refseq_USP9X.F1 refseq_USP9X.R1 123 171
NCBIGene 36.3 8239
Alternative 5-prime, size difference: 48
Exclusion in the protein (no frameshift)
Reference transcript: NM_001039590
cd peptidase_C19C 404aa 1e-119
in ref transcript
A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
pfam UCH 399aa 1e-70
in ref transcript
Ubiquitin carboxyl-terminal hydrolase.
COG COG5077 447aa 6e-45
in ref transcript
Ubiquitin carboxyl-terminal hydrolase [Posttranslational modification, protein turnover, chaperones].
VAPA
refseq_VAPA.F1 refseq_VAPA.R1 151 286
NCBIGene 36.3 9218
Single exon skipping, size difference: 135
Exclusion in the protein (no frameshift)
Reference transcript: NM_003574
pfam Motile_Sperm 104aa 4e-29
in ref transcript
MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins.
COG SCS2 110aa 9e-20
in ref transcript
VAMP-associated protein involved in inositol metabolism [Intracellular trafficking and secretion].
VCL
refseq_VCL.F2 refseq_VCL.R2 196 400
NCBIGene 36.3 7414
Single exon skipping, size difference: 204
Exclusion in the protein (no frameshift)
Reference transcript: NM_014000
pfam Vinculin 358aa 1e-122
in ref transcript
Vinculin family.
Changed!
pfam Vinculin 541aa 2e-95
in ref transcript
Changed!
pfam Vinculin 241aa 4e-76
in ref transcript
Changed!
pfam Vinculin 692aa 1e-168
in modified transcript
VKORC1
refseq_VKORC1.F1 refseq_VKORC1.R1 259 369
NCBIGene 36.3 79001
Single exon skipping, size difference: 110
Exclusion in the protein causing a frameshift
Reference transcript: NM_024006
Changed!
smart VKc 149aa 3e-20
in ref transcript
Family of likely enzymes that includes the catalytic subunit of vitamin K epoxide reductase. Bacterial homologues are fused to members of the thioredoxin family of oxidoreductases.
Changed!
smart VKc 54aa 2e-06
in modified transcript
VLDLR
refseq_VLDLR.F1 refseq_VLDLR.R1 152 236
NCBIGene 36.3 7436
Single exon skipping, size difference: 84
Exclusion in the protein (no frameshift)
Reference transcript: NM_003383
cd LDLa 35aa 7e-08
in ref transcript
Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure.
cd LDLa 35aa 9e-08
in ref transcript
cd LDLa 35aa 1e-07
in ref transcript
cd LDLa 32aa 1e-06
in ref transcript
cd LDLa 35aa 3e-06
in ref transcript
cd LDLa 37aa 4e-04
in ref transcript
cd LDLa 31aa 0.001
in ref transcript
cd EGF_CA 31aa 0.008
in ref transcript
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
pfam Ldl_recept_b 41aa 6e-12
in ref transcript
Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure.
pfam Ldl_recept_a 37aa 5e-10
in ref transcript
Low-density lipoprotein receptor domain class A.
pfam Ldl_recept_a 36aa 1e-09
in ref transcript
pfam Ldl_recept_a 36aa 5e-09
in ref transcript
smart LY 43aa 5e-09
in ref transcript
Low-density lipoprotein-receptor YWTD domain. Type "B" repeats in low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.
pfam Ldl_recept_a 37aa 1e-08
in ref transcript
pfam Ldl_recept_a 34aa 6e-08
in ref transcript
smart LY 43aa 5e-07
in ref transcript
pfam Ldl_recept_a 39aa 2e-06
in ref transcript
pfam Ldl_recept_a 31aa 7e-06
in ref transcript
pfam Ldl_recept_b 40aa 6e-05
in ref transcript
pfam Ldl_recept_b 41aa 9e-05
in ref transcript
smart EGF_CA 31aa 2e-04
in ref transcript
Calcium-binding EGF-like domain.
VNN3
refseq_VNN3.F2 refseq_VNN3.R2 195 241
NCBIGene 36.3 55350
Alternative 3-prime, size difference: 46
Inclusion in the protein causing a new stop codon
Reference transcript: NM_018399
Changed!
pfam CN_hydrolase 161aa 7e-17
in ref transcript
Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6. Nitrilase-related proteins generally have a conserved E-K-C catalytic triad, and are multimeric alpha-beta-beta-alpha sandwich proteins.
Changed!
COG COG0388 160aa 3e-12
in ref transcript
Predicted amidohydrolase [General function prediction only].
Changed!
TIGR agmatine_aguB 27aa 0.008
in modified transcript
Members of this family are N-carbamoylputrescine amidase (3.5.1.53). Bacterial genes are designated AguB. The AguAB pathway replaces SpeB for conversion of agmatine to putrescine in two steps rather than one.
VPS13A
refseq_VPS13A.F1 refseq_VPS13A.R1 115 232
NCBIGene 36.3 23230
Single exon skipping, size difference: 117
Exclusion in the protein (no frameshift)
Reference transcript: NM_033305
pfam DUF1162 279aa 2e-82
in ref transcript
Protein of unknown function (DUF1162). This family represents a conserved region within several hypothetical eukaryotic proteins. Family members might be vacuolar protein sorting related-proteins.
Changed!
COG MRS6 2101aa 2e-80
in ref transcript
Vacuolar protein sorting-associated protein [Intracellular trafficking and secretion].
COG MRS6 422aa 7e-33
in ref transcript
Changed!
COG MRS6 2062aa 1e-78
in modified transcript
VPS13C
refseq_VPS13C.F1 refseq_VPS13C.R1 271 400
NCBIGene 36.3 54832
Multiple exon skipping, size difference: 129
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_020821
pfam DUF1162 284aa 1e-88
in ref transcript
Protein of unknown function (DUF1162). This family represents a conserved region within several hypothetical eukaryotic proteins. Family members might be vacuolar protein sorting related-proteins.
Changed!
COG MRS6 1049aa 1e-82
in ref transcript
Vacuolar protein sorting-associated protein [Intracellular trafficking and secretion].
COG MRS6 479aa 1e-32
in ref transcript
COG MRS6 888aa 7e-09
in ref transcript
Changed!
COG MRS6 1006aa 1e-84
in modified transcript
VPS13D
refseq_VPS13D.F1 refseq_VPS13D.R1 121 196
NCBIGene 36.3 55187
Single exon skipping, size difference: 75
Exclusion in the protein (no frameshift)
Reference transcript: NM_015378
cd UBA 36aa 0.001
in ref transcript
Ubiquitin Associated domain. The UBA domain is a commonly occurring sequence motif in some members of the ubiquitination pathway, UV excision repair proteins, and certain protein kinases. Although its specific role is so far unknown, it has been suggested that UBA domains are involved in conferring protein target specificity. The domain, a compact three helix bundle, has a conserved GFP-loop and the proline is thought to be critical for binding. The UBA domain is distinct from the conserved three helical domain seen in the N-terminus of EF-TS and eukaryotic NAC proteins.
pfam DUF1162 284aa 6e-52
in ref transcript
Protein of unknown function (DUF1162). This family represents a conserved region within several hypothetical eukaryotic proteins. Family members might be vacuolar protein sorting related-proteins.
pfam UBA 37aa 2e-04
in ref transcript
UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to ubiquitin and the ubiquitination pathway. The structure of the UBA domain consists of a compact three helix bundle. This domain is found at the N terminus of EF-TS hence the name TS-N. The structure of EF-TS is known and this domain is implicated in its interaction with EF-TU. The domain has been found in non EF-TS proteins such as alpha-NAC and MJ0280.
COG MRS6 1269aa 1e-53
in ref transcript
Vacuolar protein sorting-associated protein [Intracellular trafficking and secretion].
COG MRS6 463aa 3e-35
in ref transcript
COG MRS6 167aa 0.004
in ref transcript
VPS24
refseq_VPS24.F2 refseq_VPS24.R2 307 368
NCBIGene 36.3 51652
Single exon skipping, size difference: 61
Exclusion in the protein causing a frameshift
Reference transcript: NM_016079
Changed!
pfam Snf7 171aa 9e-29
in ref transcript
Snf7. This family of proteins are involved in protein sorting and transport from the endosome to the vacuole/lysosome in eukaryotic cells. Vacuoles/lysosomes play an important role in the degradation of both lipids and cellular proteins. In order to perform this degradative function, vacuoles/lysosomes contain numerous hydrolases which have been transported in the form of inactive precursors via the biosynthetic pathway and are proteolytically activated upon delivery to the vacuole/lysosome. The delivery of transmembrane proteins, such as activated cell surface receptors to the lumen of the vacuole/lysosome, either for degradation/downregulation, or in the case of hydrolases, for proper localisation, requires the formation of multivesicular bodies (MVBs). These late endosomal structures are formed by invaginating and budding of the limiting membrane into the lumen of the compartment. During this process, a subset of the endosomal membrane proteins is sorted into the forming vesicles. Mature MVBs fuse with the vacuole/lysosome, thereby releasing cargo containing vesicles into its hydrolytic lumen for degradation. Endosomal proteins that are not sorted into the intralumenal MVB vesicles are either recycled back to the plasma membrane or Golgi complex, or remain in the limiting membrane of the MVB and are thereby transported to the limiting membrane of the vacuole/lysosome as a consequence of fusion. Therefore, the MVB sorting pathway plays a critical role in the decision between recycling and degradation of membrane proteins. A few archaeal sequences are also present within this family.
Changed!
COG VPS24 124aa 0.001
in ref transcript
Conserved protein implicated in secretion [Cell motility and secretion].
VPS41
refseq_VPS41.F2 refseq_VPS41.R2 231 306
NCBIGene 36.3 27072
Single exon skipping, size difference: 75
Exclusion in the protein (no frameshift)
Reference transcript: NM_014396
smart CLH 138aa 4e-25
in ref transcript
Clathrin heavy chain repeat homology.
VPS54
refseq_VPS54.F1 refseq_VPS54.R1 110 146
NCBIGene 36.3 51542
Alternative 3-prime, size difference: 36
Exclusion in the protein (no frameshift)
Reference transcript: NM_016516
pfam Vps54 135aa 2e-49
in ref transcript
Vps54-like protein. This family contains various proteins that are homologs of the yeast Vps54 protein, such as the rat homolog, the human homolog, and the mouse homolog. In yeast, Vps54 associates with Vps52 and Vps53 proteins to form a trimolecular complex that is involved in protein transport between Golgi, endosomal, and vacuolar compartments. All Vps54 homologs contain a coiled coil region (not found in the region featured in this family) and multiple dileucine motifs.
VRK3
refseq_VRK3.F2 refseq_VRK3.R2 250 400
NCBIGene 36.3 51231
Single exon skipping, size difference: 150
Exclusion in the protein (no frameshift)
Reference transcript: NM_016440
cd S_TKc 205aa 8e-12
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
pfam Pkinase 187aa 2e-09
in ref transcript
Protein kinase domain.
COG SPS1 207aa 9e-07
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
VWA1
refseq_VWA1.F2 refseq_VWA1.R2 100 495
NCBIGene 36.3 64856
Alternative 3-prime, size difference: 395
Exclusion in the protein causing a frameshift
Reference transcript: NM_022834
Changed!
cd vWA_collagen 141aa 3e-29
in ref transcript
von Willebrand factor (vWF) type A domain; equivalent to the I-domain of integrins. This domain has a variety of functions including: intermolecular adhesion, cell migration, signalling, transcription, and DNA repair. In integrins these domains form heterodimers while in vWF it forms homodimers and multimers. There are different interaction surfaces of this domain as seen by its complexes with collagen with either integrin or human vWFA. In integrins collagen binding occurs via the metal ion-dependent adhesion site (MIDAS) and involves three surface loops located on the upper surface of the molecule. In human vWFA, collagen binding is thought to occur on the bottom of the molecule and does not involve the vestigial MIDAS motif.
Changed!
cd FN3 74aa 4e-06
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Changed!
pfam VWA 141aa 6e-28
in ref transcript
von Willebrand factor type A domain.
Changed!
pfam fn3 69aa 2e-07
in ref transcript
Fibronectin type III domain.
Changed!
pfam fn3 67aa 5e-05
in ref transcript
WAC
refseq_WAC.F1 refseq_WAC.R1 103 412
NCBIGene 36.3 51322
Single exon skipping, size difference: 309
Exclusion in the protein (no frameshift)
Reference transcript: NM_016628
cd WW 30aa 3e-05
in ref transcript
Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other peptide recognition proteins such as antibodies and receptors; WW domains have a single groove formed by a conserved Trp and Tyr which recognizes a pair of residues of the sequence X-Pro; variable loops and neighboring domains confer specificity in this domain; there are five distinct groups based on binding: 1) PPXY motifs 2) the PPLP motif; 3) PGM motifs; 4) PSP or PTP motifs; 5) PR motifs.
smart WW 28aa 7e-05
in ref transcript
Domain with 2 conserved Trp (W) residues. Also known as the WWP or rsp5 domain. Binds proline-rich polypeptides.
COG PRP40 40aa 0.007
in ref transcript
Splicing factor [RNA processing and modification].
PRK PRK10856 86aa 0.008
in ref transcript
hypothetical protein; Provisional.
WARS
refseq_WARS.F1 refseq_WARS.R1 145 317
NCBIGene 36.3 7453
Single exon skipping, size difference: 172
Exclusion of the protein initiation site
Reference transcript: NM_173701
cd TrpRS_core 284aa 3e-74
in ref transcript
Tryptophanyl-tRNA synthetase (TrpRS) catalytic core domain. TrpRS is a homodimer, which attaches Tyr to the appropriate tRNA. TrpRS is a class I tRNA synthetases, so it aminoacylates the 2'-OH of the nucleotide at the 3' end of the tRNA. The core domain is based on the Rossman fold and is responsible for the ATP-dependent formation of the enzyme bound aminoacyl-adenylate. It contains class I characteristic HIGH and KMSKS motifs, which are involved in ATP binding.
Changed!
cd WEPRS_RNA 50aa 1e-14
in ref transcript
WEPRS_RNA binding domain. This short RNA-binding domain is found in several higher eukaryote aminoacyl-tRNA synthetases (aaRSs). It is found in multiple copies in eukaryotic bifunctional glutamyl-prolyl-tRNA synthetases (EPRS) in a region that separates the N-terminal glutamyl-tRNA synthetase (GluRS) from the C-terminal prolyl-tRNA synthetase (ProRS). It is also found at the N-terminus of vertebrate tryptophanyl-tRNA synthetases (TrpRS). This domain consists of a helix-turn-helix structure, which is similar to other RNA-binding proteins. It is involved in both protein-RNA interactions by binding tRNA and protein-protein interactions, which are important for the formation of aaRSs into multienzyme complexes.
TIGR trpS 318aa 1e-105
in ref transcript
This model represents tryptophanyl-tRNA synthetase. Some members of the family have a pfam00458 domain amino-terminal to the region described by this HMM.
Changed!
pfam WHEP-TRS 54aa 3e-17
in ref transcript
WHEP-TRS domain.
PRK PRK12285 386aa 1e-110
in ref transcript
tryptophanyl-tRNA synthetase; Reviewed.
WARS
refseq_WARS.F3 refseq_WARS.R3 235 407
NCBIGene 36.3 7453
Single exon skipping, size difference: 172
Exclusion of the protein initiation site
Reference transcript: NM_004184
cd TrpRS_core 284aa 3e-74
in ref transcript
Tryptophanyl-tRNA synthetase (TrpRS) catalytic core domain. TrpRS is a homodimer, which attaches Tyr to the appropriate tRNA. TrpRS is a class I tRNA synthetases, so it aminoacylates the 2'-OH of the nucleotide at the 3' end of the tRNA. The core domain is based on the Rossman fold and is responsible for the ATP-dependent formation of the enzyme bound aminoacyl-adenylate. It contains class I characteristic HIGH and KMSKS motifs, which are involved in ATP binding.
Changed!
cd WEPRS_RNA 50aa 1e-14
in ref transcript
WEPRS_RNA binding domain. This short RNA-binding domain is found in several higher eukaryote aminoacyl-tRNA synthetases (aaRSs). It is found in multiple copies in eukaryotic bifunctional glutamyl-prolyl-tRNA synthetases (EPRS) in a region that separates the N-terminal glutamyl-tRNA synthetase (GluRS) from the C-terminal prolyl-tRNA synthetase (ProRS). It is also found at the N-terminus of vertebrate tryptophanyl-tRNA synthetases (TrpRS). This domain consists of a helix-turn-helix structure, which is similar to other RNA-binding proteins. It is involved in both protein-RNA interactions by binding tRNA and protein-protein interactions, which are important for the formation of aaRSs into multienzyme complexes.
TIGR trpS 318aa 1e-105
in ref transcript
This model represents tryptophanyl-tRNA synthetase. Some members of the family have a pfam00458 domain amino-terminal to the region described by this HMM.
Changed!
pfam WHEP-TRS 54aa 3e-17
in ref transcript
WHEP-TRS domain.
PRK PRK12285 386aa 1e-110
in ref transcript
tryptophanyl-tRNA synthetase; Reviewed.
WARS2
refseq_WARS2.F2 refseq_WARS2.R2 132 161
NCBIGene 36.3 10352
Alternative 3-prime, size difference: 29
Inclusion in the protein causing a new stop codon
Reference transcript: NM_015836
Changed!
cd TrpRS_core 278aa 3e-95
in ref transcript
Tryptophanyl-tRNA synthetase (TrpRS) catalytic core domain. TrpRS is a homodimer, which attaches Tyr to the appropriate tRNA. TrpRS is a class I tRNA synthetases, so it aminoacylates the 2'-OH of the nucleotide at the 3' end of the tRNA. The core domain is based on the Rossman fold and is responsible for the ATP-dependent formation of the enzyme bound aminoacyl-adenylate. It contains class I characteristic HIGH and KMSKS motifs, which are involved in ATP binding.
Changed!
TIGR trpS 327aa 7e-76
in ref transcript
This model represents tryptophanyl-tRNA synthetase. Some members of the family have a pfam00458 domain amino-terminal to the region described by this HMM.
Changed!
PRK PRK00927 325aa 1e-123
in ref transcript
tryptophanyl-tRNA synthetase; Reviewed.
Changed!
cd TrpRS_core 178aa 1e-70
in modified transcript
Changed!
TIGR trpS 180aa 3e-60
in modified transcript
Changed!
PRK PRK00927 178aa 6e-89
in modified transcript
WASF1
refseq_WASF1.F2 refseq_WASF1.R2 128 226
NCBIGene 36.3 8936
Single exon skipping, size difference: 98
Exclusion in 5'UTR
Reference transcript: NM_003931
WASF1
refseq_WASF1.F4 refseq_WASF1.R4 129 274
NCBIGene 36.3 8936
Single exon skipping, size difference: 145
Exclusion in 5'UTR
Reference transcript: NM_003931
WBP5
refseq_WBP5.F1 refseq_WBP5.R1 250 329
NCBIGene 36.3 51186
Single exon skipping, size difference: 79
Exclusion in 5'UTR
Reference transcript: NM_001006612
pfam TFA 63aa 6e-18
in ref transcript
Transcription elongation factor A, SII-related family. The function of this family is unclear, but two members from Homo sapiesn are described as transcription elongation factor A, SII-like proteins.
EIF4H
refseq_WBSCR1.F1 refseq_WBSCR1.R1 144 204
NCBIGene 36.3 7458
Single exon skipping, size difference: 60
Exclusion in the protein (no frameshift)
Reference transcript: NM_022170
cd RRM 70aa 2e-10
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
smart RRM 69aa 6e-11
in ref transcript
RNA recognition motif.
Changed!
COG COG0724 87aa 3e-07
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
Changed!
COG COG0724 81aa 1e-06
in modified transcript
WDHD1
refseq_WDHD1.F1 refseq_WDHD1.R1 143 236
NCBIGene 36.3 11169
Single exon skipping, size difference: 93
Exclusion of the protein initiation site
Reference transcript: NM_007086
Changed!
cd WD40 248aa 5e-27
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
cd HMG-box 56aa 2e-06
in ref transcript
High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
smart HMG 58aa 8e-08
in ref transcript
high mobility group.
pfam WD40 38aa 3e-06
in ref transcript
WD domain, G-beta repeat.
Changed!
smart WD40 31aa 0.001
in ref transcript
WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Changed!
COG COG2319 289aa 5e-17
in ref transcript
FOG: WD40 repeat [General function prediction only].
COG NHP6B 96aa 0.003
in ref transcript
Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics].
Changed!
cd WD40 161aa 3e-18
in modified transcript
Changed!
COG COG2319 175aa 4e-08
in modified transcript
WDR16
refseq_WDR16.F1 refseq_WDR16.R1 167 357
NCBIGene 36.2 146845
Single exon skipping, size difference: 190
Exclusion of the protein initiation site
Reference transcript: NM_145054
cd WD40 285aa 6e-34
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
cd WD40 343aa 2e-16
in ref transcript
smart WD40 40aa 0.004
in ref transcript
WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
smart WD40 42aa 0.007
in ref transcript
COG COG2319 286aa 2e-21
in ref transcript
FOG: WD40 repeat [General function prediction only].
COG COG2319 353aa 6e-15
in ref transcript
WDR17
refseq_WDR17.F1 refseq_WDR17.R1 106 229
NCBIGene 36.3 116966
Single exon skipping, size difference: 123
Exclusion of the protein initiation site
Reference transcript: NM_170710
cd WD40 291aa 4e-48
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
cd WD40 348aa 2e-17
in ref transcript
smart WD40 41aa 4e-06
in ref transcript
WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
smart WD40 38aa 4e-04
in ref transcript
pfam WD40 40aa 0.002
in ref transcript
WD domain, G-beta repeat.
pfam WD40 32aa 0.003
in ref transcript
pfam WD40 40aa 0.004
in ref transcript
COG COG2319 328aa 7e-32
in ref transcript
FOG: WD40 repeat [General function prediction only].
COG COG2319 401aa 2e-14
in ref transcript
WDR21A
refseq_WDR21A.F2 refseq_WDR21A.R2 197 297
NCBIGene 36.3 26094
Single exon skipping, size difference: 100
Exclusion of the protein initiation site
Reference transcript: NM_015604
cd WD40 187aa 2e-11
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
COG COG2319 181aa 4e-07
in ref transcript
FOG: WD40 repeat [General function prediction only].
WDR21A
refseq_WDR21A.F4 refseq_WDR21A.R4 218 401
NCBIGene 36.3 26094
Multiple exon skipping, size difference: 183
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_015604
cd WD40 187aa 2e-11
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
COG COG2319 181aa 4e-07
in ref transcript
FOG: WD40 repeat [General function prediction only].
WDR21A
refseq_WDR21A.F5 refseq_WDR21A.R5 102 429
NCBIGene 36.3 26094
Multiple exon skipping, size difference: 327
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift), Exclusion in the protein causing a frameshift
Reference transcript: NM_015604
Changed!
cd WD40 187aa 2e-11
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Changed!
COG COG2319 181aa 4e-07
in ref transcript
FOG: WD40 repeat [General function prediction only].
Changed!
cd WD40 185aa 4e-11
in modified transcript
Changed!
COG COG2319 202aa 4e-07
in modified transcript
WDR23
refseq_WDR23.F2 refseq_WDR23.R2 258 336
NCBIGene 36.3 80344
Alternative 5-prime and 3-prime, size difference: 78
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_025230
cd WD40 343aa 8e-31
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
pfam WD40 38aa 2e-05
in ref transcript
WD domain, G-beta repeat.
pfam WD40 38aa 0.002
in ref transcript
COG COG2319 351aa 9e-19
in ref transcript
FOG: WD40 repeat [General function prediction only].
WDR35
refseq_WDR35.F1 refseq_WDR35.R1 182 215
NCBIGene 36.3 57539
Single exon skipping, size difference: 33
Exclusion in the protein (no frameshift)
Reference transcript: NM_001006657
cd WD40 301aa 2e-11
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
COG COG2319 186aa 7e-11
in ref transcript
FOG: WD40 repeat [General function prediction only].
WDR7
refseq_WDR7.F2 refseq_WDR7.R2 109 208
NCBIGene 36.3 23335
Single exon skipping, size difference: 99
Exclusion in the protein (no frameshift)
Reference transcript: NM_015285
cd WD40 188aa 2e-14
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
cd WD40 266aa 1e-08
in ref transcript
cd WD40 70aa 5e-08
in ref transcript
smart WD40 39aa 3e-04
in ref transcript
WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
COG COG2319 252aa 5e-10
in ref transcript
FOG: WD40 repeat [General function prediction only].
COG COG2319 178aa 8e-06
in ref transcript
COG COG2319 120aa 2e-04
in ref transcript
WFDC10B
refseq_WFDC10B.F1 refseq_WFDC10B.R1 157 312
NCBIGene 36.3 280664
Single exon skipping, size difference: 155
Exclusion of the stop codon
Reference transcript: NM_172006
WFDC2
refseq_WFDC2.F1 refseq_WFDC2.R1 128 272
NCBIGene 36.2 10406
Single exon skipping, size difference: 144
Exclusion in the protein (no frameshift)
Reference transcript: NM_006103
Changed!
cd WAP 63aa 4e-04
in ref transcript
whey acidic protein-type four-disulfide core domains. Members of the family include whey acidic protein, elafin (elastase-specific inhibitor), caltrin-like protein (a calcium transport inhibitor) and other extracellular proteinase inhibitors. A group of proteins containing 8 characteristically-spaced cysteine residuesforming disulphide bonds, have been termed '4-disulphide core' proteins. Protease inhibition occurs by insertion of the inhibitory loop into the active site pocket and interference with the catalytic residues of the protease.
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
cd WD40 236aa 2e-05
in ref transcript
Changed!
COG COG2319 164aa 7e-05
in ref transcript
FOG: WD40 repeat [General function prediction only].
Changed!
cd WD40 245aa 8e-09
in modified transcript
Changed!
COG COG2319 242aa 3e-05
in modified transcript
WIPI2
refseq_WIPI2.F2 refseq_WIPI2.R2 169 202
NCBIGene 36.3 26100
Alternative 5-prime, size difference: 33
Exclusion in the protein (no frameshift)
Reference transcript: NM_015610
cd WD40 194aa 2e-07
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
cd WD40 236aa 2e-05
in ref transcript
COG COG2319 164aa 7e-05
in ref transcript
FOG: WD40 repeat [General function prediction only].
WNK3
refseq_WNK3.F2 refseq_WNK3.R2 100 241
NCBIGene 36.3 65267
Alternative 5-prime, size difference: 141
Exclusion in the protein (no frameshift)
Reference transcript: NM_020922
cd S_TKc 254aa 5e-61
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
smart S_TKc 249aa 9e-61
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
COG SPS1 335aa 1e-27
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
WNK3
refseq_WNK3.F3 refseq_WNK3.R3 142 172
NCBIGene 36.3 65267
Single exon skipping, size difference: 30
Exclusion in the protein (no frameshift)
Reference transcript: NM_020922
cd S_TKc 254aa 5e-61
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
smart S_TKc 249aa 9e-61
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
COG SPS1 335aa 1e-27
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
WRNIP1
refseq_WRNIP1.F1 refseq_WRNIP1.R1 180 255
NCBIGene 36.3 56897
Alternative 3-prime, size difference: 75
Exclusion in the protein (no frameshift)
Reference transcript: NM_020135
Changed!
cd AAA 131aa 2e-17
in ref transcript
The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
Changed!
pfam AAA 137aa 2e-15
in ref transcript
ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes.
smart ZnF_Rad18 24aa 9e-07
in ref transcript
Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Changed!
PRK PRK13342 437aa 1e-165
in ref transcript
Changed!
pfam AAA 76aa 6e-12
in modified transcript
Changed!
TIGR dnaX_nterm 203aa 1e-06
in modified transcript
This model represents the well-conserved first ~ 365 amino acids of the translation of the dnaX gene. The full-length product of the dnaX gene in the model bacterium E. coli is the DNA polymerase III tau subunit. A translational frameshift leads to early termination and a truncated protein subunit gamma, about 1/3 shorter than tau and present in roughly equal amounts. This frameshift mechanism is not necessarily universal for species with DNA polymerase III but appears conserved in the exterme thermophile Thermus thermophilis.
Changed!
PRK PRK13342 412aa 1e-144
in modified transcript
XRCC4
refseq_XRCC4.F2 refseq_XRCC4.R2 167 214
NCBIGene 36.3 7518
Alternative 5-prime, size difference: 47
Inclusion in 5'UTR
Reference transcript: NM_022406
pfam XRCC4 334aa 1e-167
in ref transcript
DNA double-strand break repair and V(D)J recombination protein XRCC4. This family consists of several mammalian specific DNA double-strand break repair and V(D)J recombination protein XRCC4 sequences. In the non-homologous end joining pathway of DNA double-strand break repair, the ligation step is catalysed by a complex of XRCC4 and DNA ligase IV. It is thought that XRCC4 and ligase IV are essential for alignment-based gap filling, as well as for final ligation of the breaks.
YAF2
refseq_YAF2.F2 refseq_YAF2.R2 125 297
NCBIGene 36.2 10138
Multiple exon skipping, size difference: 172
Inclusion in the protein (no stop codon or frameshift), Inclusion in the protein causing a new stop codon
Reference transcript: NM_005748
smart ZnF_RBZ 21aa 7e-04
in ref transcript
Zinc finger domain. Zinc finger domain in Ran-binding proteins (RanBPs), and other proteins. In RanBPs, this domain binds RanGDP.
YME1L1
refseq_YME1L1.F2 refseq_YME1L1.R2 176 347
NCBIGene 36.3 10730
Single exon skipping, size difference: 171
Exclusion in the protein (no frameshift)
Reference transcript: NM_139312
cd AAA 123aa 8e-16
in ref transcript
The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
TIGR FtsH_fam 434aa 1e-162
in ref transcript
HflB(FtsH) is a pleiotropic protein required for correct cell division in bacteria. It has ATP-dependent zinc metalloprotease activity. It was formerly designated cell division protein FtsH.
COG HflB 476aa 1e-142
in ref transcript
ATP-dependent Zn proteases [Posttranslational modification, protein turnover, chaperones].
YTHDC1
refseq_YTHDC1.F1 refseq_YTHDC1.R1 303 357
NCBIGene 36.3 91746
Single exon skipping, size difference: 54
Exclusion in the protein (no frameshift)
Reference transcript: NM_001031732
pfam YTH 94aa 2e-29
in ref transcript
YT521-B-like family. A protein of the YTH family has been shown to selectively remove transcripts of meiosis-specific genes expressed in mitotic cells. It has been speculated that in higher eukaryotic YTH-family members may be involved in similar mechanaisms to supress gene regulation during gametogenesis or general silencing. The rat protein YT521-B is a tyrosine-phosphorylated nuclear protein, that interacts with the nuclear transcriptosomal component scaffold attachment factor B, and the 68-kDa Src substrate associated during mitosis, Sam68. In vivo splicing assays demonstrated that YT521-B modulates alternative splice site selection in a concentration-dependent manner. The domain is predicted to have four alpha helices and six beta strands.
YWHAB
refseq_YWHAB.F1 refseq_YWHAB.R1 307 402
NCBIGene 36.3 7529
Single exon skipping, size difference: 95
Exclusion in 5'UTR
Reference transcript: NM_003404
pfam 14-3-3 234aa 1e-118
in ref transcript
14-3-3 protein.
COG BMH1 245aa 1e-81
in ref transcript
14-3-3 family protein [Signal transduction mechanisms].
YY1AP1
refseq_YY1AP1.F1 refseq_YY1AP1.R1 340 400
NCBIGene 36.3 55249
Alternative 3-prime, size difference: 60
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_139118
YY1AP1
refseq_YY1AP1.F2 refseq_YY1AP1.R2 181 337
NCBIGene 36.2 55249
Single exon skipping, size difference: 156
Inclusion in the protein causing a new stop codon
Reference transcript: NM_139118
YY1AP1
refseq_YY1AP1.F3 refseq_YY1AP1.R3 104 235
NCBIGene 36.2 55249
Single exon skipping, size difference: 131
Exclusion in the protein causing a frameshift
Reference transcript: NM_139118
YY1AP1
refseq_YY1AP1.F4 refseq_YY1AP1.R1 106 166
NCBIGene 36.3 55249
Alternative 3-prime, size difference: 60
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_139118
ZAN
refseq_ZAN.F1 refseq_ZAN.R1 127 400
NCBIGene 36.3 7455
Exon skipping and alternative 3-prime or 5-prime, size difference: 273
Inclusion in 3'UTR, Exclusion in 3'UTR, Exclusion in 3'UTR
Reference transcript: NM_003386
cd MAM 162aa 8e-32
in ref transcript
Meprin, A5 protein, and protein tyrosine phosphatase Mu (MAM) domain. MAM is an extracellular domain which mediates protein-protein interactions and is found in a diverse set of proteins, many of which are known to function in cell adhesion. Members include: type IIB receptor protein tyrosine phosphatases (such as RPTPmu), meprins (plasma membrane metalloproteases), neuropilins (receptors of secreted semaphorins), and zonadhesins (sperm-specific membrane proteins which bind to the extracellular matrix of the egg). In meprin A and neuropilin-1 and -2, MAM is involved in homo-oligomerization. In RPTPmu, it has been associated with both homophilic adhesive (trans) interactions and lateral (cis) receptor oligomerization. In a GPI-anchored protein that is expressed in cells in the embryonic chicken spinal chord, MDGA1, the MAM domain has been linked to heterophilic interactions with axon-rich region.
cd MAM 162aa 2e-28
in ref transcript
cd MAM 155aa 1e-18
in ref transcript
pfam VWD 153aa 1e-38
in ref transcript
von Willebrand factor type D domain. The Cypridina-type luciferase from Vargula hilgendorfii contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.
pfam MAM 162aa 1e-36
in ref transcript
MAM domain. An extracellular domain found in many receptors.
smart VWD 162aa 6e-35
in ref transcript
von Willebrand factor (vWF) type D domain. Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.
pfam MAM 164aa 7e-31
in ref transcript
pfam MAM 158aa 8e-21
in ref transcript
smart C8 73aa 2e-17
in ref transcript
C8 domain. This domain contains 8 conserved cysteine residues, but this family only contains 7 of them to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.
smart C8 75aa 1e-15
in ref transcript
pfam TIL 54aa 4e-06
in ref transcript
Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.
pfam TIL 50aa 8e-06
in ref transcript
ZC3H14
refseq_ZC3H14.F2 refseq_ZC3H14.R2 101 494
NCBIGene 36.3 79882
Multiple exon skipping, size difference: 393
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_024824
ZCCHC4
refseq_ZCCHC4.F2 refseq_ZCCHC4.R2 303 397
NCBIGene 36.2 29063
Alternative 5-prime, size difference: 94
Exclusion in the protein causing a frameshift
Reference transcript: XM_376310
pfam zf-GRF 41aa 4e-05
in ref transcript
GRF zinc finger. This presumed zinc binding domain is found in a variety of DNA-binding proteins. It seems likely that this domain is involved in nucleic acid binding. It is named GRF after three conserved residues in the centre of the alignment of the domain. This zinc finger may be related to pfam01396.
ZDHHC13
refseq_ZDHHC13.F1 refseq_ZDHHC13.R1 129 275
NCBIGene 36.3 54503
Single exon skipping, size difference: 146
Exclusion in the protein causing a frameshift
Reference transcript: NM_019028
Changed!
cd ANK 114aa 2e-22
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
Changed!
cd ANK 152aa 4e-22
in ref transcript
Changed!
pfam zf-DHHC 53aa 5e-17
in ref transcript
DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions, and palmitoyltransferase activity.
Changed!
pfam Ank 29aa 1e-05
in ref transcript
Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
Changed!
pfam Ank 32aa 3e-04
in ref transcript
Changed!
COG COG5273 248aa 4e-23
in ref transcript
Uncharacterized protein containing DHHC-type Zn finger [General function prediction only].
Changed!
COG Arp 146aa 1e-14
in ref transcript
FOG: Ankyrin repeat [General function prediction only].
Changed!
COG Arp 140aa 6e-14
in ref transcript
ZDHHC14
refseq_ZDHHC14.F1 refseq_ZDHHC14.R1 236 281
NCBIGene 36.3 79683
Alternative 3-prime, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_024630
pfam zf-DHHC 63aa 6e-24
in ref transcript
DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions, and palmitoyltransferase activity.
COG COG5273 249aa 3e-31
in ref transcript
Uncharacterized protein containing DHHC-type Zn finger [General function prediction only].
ZDHHC16
refseq_ZDHHC16.F2 refseq_ZDHHC16.R2 272 320
NCBIGene 36.3 84287
Single exon skipping, size difference: 48
Exclusion in the protein (no frameshift)
Reference transcript: NM_032327
pfam zf-DHHC 54aa 2e-21
in ref transcript
DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions, and palmitoyltransferase activity.
Changed!
COG COG5273 88aa 1e-20
in ref transcript
Uncharacterized protein containing DHHC-type Zn finger [General function prediction only].
Changed!
COG COG5273 222aa 8e-22
in modified transcript
ZDHHC16
refseq_ZDHHC16.F3 refseq_ZDHHC16.R3 220 400
NCBIGene 36.3 84287
Single exon skipping, size difference: 180
Exclusion in 5'UTR
Reference transcript: NM_198046
pfam zf-DHHC 54aa 2e-21
in ref transcript
DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions, and palmitoyltransferase activity.
COG COG5273 88aa 1e-20
in ref transcript
Uncharacterized protein containing DHHC-type Zn finger [General function prediction only].
ZDHHC16
refseq_ZDHHC16.F6 refseq_ZDHHC16.R6 123 240
NCBIGene 36.3 84287
Exon skipping and alternative 3-prime or 5-prime, size difference: 117
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_032327
Changed!
pfam zf-DHHC 54aa 2e-21
in ref transcript
DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions, and palmitoyltransferase activity.
Changed!
COG COG5273 88aa 1e-20
in ref transcript
Uncharacterized protein containing DHHC-type Zn finger [General function prediction only].
Changed!
COG COG5273 50aa 0.001
in modified transcript
ZDHHC16
refseq_ZDHHC16.F7 refseq_ZDHHC16.R7 127 322
NCBIGene 36.3 84287
Single exon skipping, size difference: 195
Exclusion in the protein (no frameshift)
Reference transcript: NM_032327
pfam zf-DHHC 54aa 2e-21
in ref transcript
DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions, and palmitoyltransferase activity.
Changed!
COG COG5273 88aa 1e-20
in ref transcript
Uncharacterized protein containing DHHC-type Zn finger [General function prediction only].
Changed!
COG COG5273 63aa 2e-20
in modified transcript
ZFP64
refseq_ZFP64.F1 refseq_ZFP64.R1 173 335
NCBIGene 36.3 55734
Single exon skipping, size difference: 162
Exclusion in the protein (no frameshift)
Reference transcript: NM_018197
ZFYVE9
refseq_ZFYVE9.F2 refseq_ZFYVE9.R2 221 398
NCBIGene 36.3 9372
Single exon skipping, size difference: 177
Exclusion in the protein (no frameshift)
Reference transcript: NM_004799
cd FYVE 54aa 1e-11
in ref transcript
FYVE domain; Zinc-binding domain; targets proteins to membrane lipids via interaction with phosphatidylinositol-3-phosphate, PI3P; present in Fab1, YOTB, Vac1, and EEA1;.
pfam FYVE 66aa 8e-19
in ref transcript
FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positions. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. We have included members which do not conserve these histidine residues but are clearly related.
ZGPAT
refseq_ZGPAT.F1 refseq_ZGPAT.R1 185 255
NCBIGene 36.3 84619
Alternative 5-prime, size difference: 70
Exclusion in 5'UTR
Reference transcript: NM_032527
cd TUDOR 31aa 0.005
in ref transcript
Tudor domains are found in many eukaryotic organisms and have been implicated in protein-protein interactions in which methylated protein substrates bind to these domains. For example, the Tudor domain of Survival of Motor Neuron (SMN) binds to symmetrically dimethylated arginines of arginine-glycine (RG) rich sequences found in the C-terminal tails of Sm proteins. The SMN protein is linked to spinal muscular atrophy. Another example is the tandem tudor domains of 53BP1, which bind to histone H4 specifically dimethylated at Lys20 (H4-K20me2). 53BP1 is a key transducer of the DNA damage checkpoint signal.
pfam G-patch 42aa 2e-08
in ref transcript
G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines.
smart ZnF_C3H1 22aa 6e-04
in ref transcript
zinc finger.
smart TUDOR 36aa 0.006
in ref transcript
Tudor domain. Domain of unknown function present in several RNA-binding proteins. 10 copies in the Drosophila Tudor protein. Initial proposal that the survival motor neuron gene product contain a Tudor domain are corroborated by more recent database search techniques such as PSI-BLAST (unpublished).
ZHX1
refseq_ZHX1.F1 refseq_ZHX1.R1 121 216
NCBIGene 36.3 11244
Alternative 5-prime, size difference: 95
Exclusion in 5'UTR
Reference transcript: NM_001017926
cd homeodomain 56aa 4e-07
in ref transcript
Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
cd homeodomain 53aa 7e-07
in ref transcript
cd homeodomain 41aa 0.002
in ref transcript
cd homeodomain 48aa 0.002
in ref transcript
cd homeodomain 44aa 0.009
in ref transcript
smart HOX 52aa 5e-08
in ref transcript
Homeodomain. DNA-binding factors that are involved in the transcriptional regulation of key developmental processes.
smart HOX 53aa 1e-07
in ref transcript
smart HOX 37aa 2e-04
in ref transcript
smart HOX 41aa 3e-04
in ref transcript
smart HOX 44aa 0.003
in ref transcript
ZMAT1
refseq_ZMAT1.F2 refseq_ZMAT1.R2 152 273
NCBIGene 36.2 84460
Single exon skipping, size difference: 121
Exclusion of the protein initiation site
Reference transcript: NM_001011657
smart ZnF_U1 35aa 0.001
in ref transcript
U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
ZMAT3
refseq_ZMAT3.F1 refseq_ZMAT3.R1 154 260
NCBIGene 36.3 64393
Alternative 5-prime and 3-prime, size difference: 106
Inclusion in 5'UTR, Exclusion in 5'UTR
Reference transcript: NM_022470
smart ZnF_U1 35aa 2e-06
in ref transcript
U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
smart ZnF_U1 34aa 2e-05
in ref transcript
smart ZnF_U1 35aa 8e-05
in ref transcript
ZMAT5
refseq_ZMAT5.F1 refseq_ZMAT5.R1 194 309
NCBIGene 36.3 55954
Single exon skipping, size difference: 115
Exclusion in 5'UTR
Reference transcript: NM_019103
pfam zf-CCCH 23aa 6e-04
in ref transcript
Zinc finger C-x8-C-x5-C-x3-H type (and similar).
pfam zf-U1 37aa 0.003
in ref transcript
U1 zinc finger. This family consists of several U1 small nuclear ribonucleoprotein C (U1-C) proteins. The U1 small nuclear ribonucleoprotein (U1 snRNP) binds to the pre-mRNA 5' splice site (ss) at early stages of spliceosome assembly. Recruitment of U1 to a class of weak 5' ss is promoted by binding of the protein TIA-1 to uridine-rich sequences immediately downstream from the 5' ss. Binding of TIA-1 in the vicinity of a 5' ss helps to stabilise U1 snRNP recruitment, at least in part, via a direct interaction with U1-C, thus providing one molecular mechanism for the function of this splicing regulator. This domain is probably a zinc-binding. It is found in multiple copies in some members of the family.
COG COG5136 43aa 0.002
in ref transcript
U1 snRNP-specific protein C [RNA processing and modification].
ZMYM2
refseq_ZMYM2.F2 refseq_ZMYM2.R2 122 169
NCBIGene 36.3 7750
Single exon skipping, size difference: 47
Exclusion in 5'UTR
Reference transcript: NM_003453
pfam zf-FCS 41aa 6e-06
in ref transcript
MYM-type Zinc finger with FCS sequence motif. MYM-type zinc fingers were identified in MYM family proteins. Human zinc finger protein 261 is involved in a chromosomal translocation and may be responsible for X-linked retardation in XQ13.1. Human zinc finger protein 198 is also involved in disease. In myeloproliferative disorders it is fused to FGF receptor 1; in atypical myeloproliferative disorders it is rearranged. Members of the family generally are involved in development. This Zn-finger domain functions as a transcriptional trans-activator of late vaccinia viral genes, and orthologues are also found in all nucleocytoplasmic large DNA viruses, NCLDV. This domain is also found fused to the C termini of recombinases from certain prokaryotic transposons.
pfam zf-FCS 41aa 1e-05
in ref transcript
pfam zf-FCS 44aa 8e-05
in ref transcript
pfam zf-FCS 39aa 0.002
in ref transcript
pfam zf-FCS 40aa 0.002
in ref transcript
ZNF16
refseq_ZNF16.F1 refseq_ZNF16.R1 264 350
NCBIGene 36.3 7564
Single exon skipping, size difference: 86
Exclusion in 5'UTR
Reference transcript: NM_001029976
COG COG5048 349aa 2e-09
in ref transcript
FOG: Zn-finger [General function prediction only].
COG COG5048 210aa 6e-04
in ref transcript
ZNF160
refseq_ZNF160.F1 refseq_ZNF160.R1 105 190
NCBIGene 36.3 90338
Single exon skipping, size difference: 85
Exclusion in 5'UTR
Reference transcript: NM_198893
smart KRAB 61aa 1e-24
in ref transcript
krueppel associated box.
COG COG5048 458aa 8e-12
in ref transcript
FOG: Zn-finger [General function prediction only].
FOG: Zn-finger [General function prediction only].
ZNF189
refseq_ZNF189.F1 refseq_ZNF189.R1 190 284
NCBIGene 36.3 7743
Exon skipping and alternative 3-prime or 5-prime, size difference: 94
Inclusion in the protein causing a new stop codon, Exclusion in the protein (no frameshift)
Reference transcript: NM_003452
Changed!
pfam KRAB 41aa 2e-19
in ref transcript
KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B. The A box plays an important role in repression by binding to corepressors, while the B box is thought to enhance this repression brought about by the A box. KRAB-containing proteins are thought to have critical functions in cell proliferation and differentiation, apoptosis and neoplastic transformation.
Changed!
COG COG5048 368aa 5e-09
in ref transcript
FOG: Zn-finger [General function prediction only].
ZNF2
refseq_ZNF2.F2 refseq_ZNF2.R2 312 384
NCBIGene 36.3 7549
Single exon skipping, size difference: 72
Exclusion of the protein initiation site
Reference transcript: NM_021088
Changed!
smart KRAB 61aa 2e-23
in ref transcript
krueppel associated box.
COG COG5048 158aa 2e-05
in ref transcript
FOG: Zn-finger [General function prediction only].
Changed!
smart KRAB 32aa 2e-08
in modified transcript
ZNF200
refseq_ZNF200.F2 refseq_ZNF200.R2 107 458
NCBIGene 36.3 7752
Alternative 5-prime, size difference: 351
Exclusion in 5'UTR
Reference transcript: NM_003454
pfam zf-C2H2 23aa 0.008
in ref transcript
Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.
ZNF207
refseq_ZNF207.F1 refseq_ZNF207.R1 291 384
NCBIGene 36.3 7756
Single exon skipping, size difference: 93
Exclusion in the protein (no frameshift)
Reference transcript: NM_001098507
ZNF207
refseq_ZNF207.F4 refseq_ZNF207.R4 205 253
NCBIGene 36.3 7756
Single exon skipping, size difference: 48
Exclusion in the protein (no frameshift)
Reference transcript: NM_001098507
ZNF211
refseq_ZNF211.F1 refseq_ZNF211.R1 217 256
NCBIGene 36.3 10520
Single exon skipping, size difference: 39
Exclusion in the protein (no frameshift)
Reference transcript: NM_006385
pfam KRAB 41aa 4e-16
in ref transcript
KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B. The A box plays an important role in repression by binding to corepressors, while the B box is thought to enhance this repression brought about by the A box. KRAB-containing proteins are thought to have critical functions in cell proliferation and differentiation, apoptosis and neoplastic transformation.
FOG: Zn-finger [General function prediction only].
ZNF274
refseq_ZNF274.F1 refseq_ZNF274.R1 278 374
NCBIGene 36.3 10782
Single exon skipping, size difference: 96
Exclusion in 5'UTR
Reference transcript: NM_133502
pfam KRAB 41aa 1e-18
in ref transcript
KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B. The A box plays an important role in repression by binding to corepressors, while the B box is thought to enhance this repression brought about by the A box. KRAB-containing proteins are thought to have critical functions in cell proliferation and differentiation, apoptosis and neoplastic transformation.
pfam zf-C2H2 23aa 8e-04
in ref transcript
Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.
PATZ1
refseq_ZNF278.F1 refseq_ZNF278.R1 193 258
NCBIGene 36.3 23598
Alternative 3-prime, size difference: 65
Exclusion in the protein causing a frameshift
Reference transcript: NM_014323
pfam BTB 130aa 8e-22
in ref transcript
BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of pox virus proteins. The BTB/POZ domain mediates homomeric dimerisation and in some instances heteromeric dimerisation. The structure of the dimerised PLZF BTB/POZ domain has been solved and consists of a tightly intertwined homodimer. The central scaffolding of the protein is made up of a cluster of alpha-helices flanked by short beta-sheets at both the top and bottom of the molecule. POZ domains from several zinc finger proteins have been shown to mediate transcriptional repression and to interact with components of histone deacetylase co-repressor complexes including N-CoR and SMRT. The POZ or BTB domain is also known as BR-C/Ttk or ZiN.
Changed!
COG SFP1 58aa 0.004
in modified transcript
KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B. The A box plays an important role in repression by binding to corepressors, while the B box is thought to enhance this repression brought about by the A box. KRAB-containing proteins are thought to have critical functions in cell proliferation and differentiation, apoptosis and neoplastic transformation.
COG COG5048 405aa 4e-05
in ref transcript
FOG: Zn-finger [General function prediction only].
COG COG5048 341aa 6e-04
in ref transcript
ZNF484
refseq_ZNF484.F1 refseq_ZNF484.R1 133 178
NCBIGene 36.3 83744
Single exon skipping, size difference: 45
Exclusion of the protein initiation site
Reference transcript: NM_031486
Changed!
smart KRAB 60aa 1e-24
in ref transcript
krueppel associated box.
COG COG5048 402aa 8e-10
in ref transcript
FOG: Zn-finger [General function prediction only].
Changed!
smart KRAB 31aa 4e-07
in modified transcript
ZNF107
refseq_ZNF588.F2 refseq_ZNF588.R2 280 407
NCBIGene 36.3 51427
Single exon skipping, size difference: 127
Exclusion in 5'UTR
Reference transcript: NM_016220
COG COG5048 313aa 1e-07
in ref transcript
FOG: Zn-finger [General function prediction only].
COG COG5048 390aa 3e-07
in ref transcript
ZNF107
refseq_ZNF588.F3 refseq_ZNF588.R3 109 251
NCBIGene 36.3 51427
Single exon skipping, size difference: 142
Exclusion in 5'UTR
Reference transcript: NM_016220
COG COG5048 313aa 1e-07
in ref transcript
FOG: Zn-finger [General function prediction only].
COG COG5048 390aa 3e-07
in ref transcript
ZNF599
refseq_ZNF599.F1 refseq_ZNF599.R1 184 322
NCBIGene 36.2 148103
Single exon skipping, size difference: 138
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001007248
Changed!
smart KRAB 61aa 7e-23
in ref transcript
krueppel associated box.
Changed!
COG COG5048 313aa 3e-13
in ref transcript
FOG: Zn-finger [General function prediction only].
ZNF655
refseq_ZNF655.F1 refseq_ZNF655.R1 192 417
NCBIGene 36.2 79027
Multiple exon skipping, size difference: 225
Inclusion in the protein causing a new stop codon, Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_138494
Changed!
COG COG5048 106aa 6e-04
in ref transcript
FOG: Zn-finger [General function prediction only].
Changed!
PRK truB 85aa 0.007
in ref transcript
tRNA pseudouridine synthase B; Provisional.
IKZF3
refseq_ZNFN1A3.F2 refseq_ZNFN1A3.R2 190 307
NCBIGene 36.3 22806
Single exon skipping, size difference: 117
Exclusion in the protein (no frameshift)
Reference transcript: NM_012481
IKZF3
refseq_ZNFN1A3.F3 refseq_ZNFN1A3.R3 231 399
NCBIGene 36.3 22806
Single exon skipping, size difference: 168
Exclusion in the protein (no frameshift)
Reference transcript: NM_012481
IKZF3
refseq_ZNFN1A3.F5 refseq_ZNFN1A3.R5 146 263
NCBIGene 36.3 22806
Single exon skipping, size difference: 117
Exclusion in the protein (no frameshift)
Reference transcript: NM_012481
ZPBP2
refseq_ZPBP2.F1 refseq_ZPBP2.R1 184 250
NCBIGene 36.3 124626
Single exon skipping, size difference: 66
Exclusion in the protein (no frameshift)
Reference transcript: NM_199321
pfam Sp38 272aa 1e-142
in ref transcript
Zona-pellucida-binding protein (Sp38). This family contains a number of zona-pellucida-binding proteins that seem to be restricted to mammals. These are sperm proteins that bind to the 90-kDa family of zona pellucida glycoproteins in a calcium-dependent manner. These represent some of the specific molecules that mediate the first steps of gamete interaction, allowing fertilisation to occur.
ZRANB2
refseq_ZRANB2.F1 refseq_ZRANB2.R1 209 284
NCBIGene 36.3 9406
Single exon skipping, size difference: 75
Inclusion in the protein causing a new stop codon
Reference transcript: NM_203350
smart ZnF_RBZ 25aa 6e-04
in ref transcript
Zinc finger domain. Zinc finger domain in Ran-binding proteins (RanBPs), and other proteins. In RanBPs, this domain binds RanGDP.
smart ZnF_RBZ 25aa 0.009
in ref transcript
ZWINT
refseq_ZWINT.F2 refseq_ZWINT.R2 234 375
NCBIGene 36.3 11130
Alternative 5-prime and 3-prime, size difference: 141
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_007057
Changed!
PRK mukB 138aa 0.003
in ref transcript
cell division protein MukB; Provisional.
A26A1
rs.A26A1.F1 rs.A26A1.R1 141 279
NCBIGene 36.3 340441
Single exon skipping, size difference: 138
Exclusion in the protein (no frameshift)
Reference transcript: NM_001005365
cd ANK 126aa 2e-26
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
Changed!
cd ANK 125aa 3e-24
in ref transcript
Changed!
TIGR trp 175aa 6e-07
in ref transcript
after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
COG Arp 173aa 1e-11
in ref transcript
FOG: Ankyrin repeat [General function prediction only].
Changed!
TIGR trp 121aa 2e-05
in modified transcript
AADACL3
rs.AADACL3.F1 rs.AADACL3.R1 160 377
NCBIGene 36.3 126767
Single exon skipping, size difference: 217
Exclusion of the protein initiation site
Reference transcript: NM_001103170
Changed!
pfam Abhydrolase_3 264aa 2e-43
in ref transcript
alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes.
Changed!
COG Aes 342aa 2e-33
in ref transcript
Esterase/lipase [Lipid metabolism].
Changed!
pfam Abhydrolase_3 250aa 3e-35
in modified transcript
Changed!
COG Aes 277aa 7e-24
in modified transcript
ABCC1
rs.ABCC1.F1 rs.ABCC1.R1 229 424
NCBIGene 36.3 4363
Single exon skipping, size difference: 195
Exclusion in the protein (no frameshift)
Reference transcript: NM_004996
Changed!
cd ABCC_MRP_domain2 221aa 9e-96
in ref transcript
Domain 2 of the ABC subfamily C. This family is also known as MRP (mulrtidrug resisitance-associated protein). Some of the MRP members have five additional transmembrane segments in their N-terminus, but the function of these additional membrane-spanning domains is not clear. The MRP was found in the multidrug-resistance lung cancer cell in which p-glycoprotein was not overexpressed. MRP exports glutathione by drug stimulation, as well as, certain substrates in conjugated forms with anions, such as glutathione, glucuronate, and sulfate.
cd ABCC_MRP_domain1 202aa 6e-89
in ref transcript
Domain 1 of the ABC subfamily C. This family is also known as MRP (mulrtidrug resisitance-associated protein). Some of the MRP members have five additional transmembrane segments in their N-terminas, but the function of these additional membrane-spanning domains is not clear. The MRP was found in the multidrug-resisting lung cancer cell in which p-glycoprotein was not overexpressed. MRP exports glutathione by drug stimulation, as well as, certain substrates in conjugated forms with anions, such as glutathione, glucuronate, and sulfate.
Changed!
TIGR MRP_assoc_pro 1524aa 0.0
in ref transcript
This model describes multi drug resistance-associated protein (MRP) in eukaryotes. The multidrug resistance-associated protein is an integral membrane protein that causes multidrug resistance when overexpressed in mammalian cells. It belongs to ABC transporter superfamily. The protein topology and function was experimentally demonstrated by epitope tagging and immunofluorescence. Insertion of tags in the critical regions associated with drug efflux, abrogated its function. The C-terminal domain seem to highly conserved.
Changed!
PTZ PTZ00243 869aa 1e-161
in ref transcript
ABC transporter; Provisional.
COG MdlB 558aa 2e-56
in ref transcript
ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms].
Changed!
cd ABCC_MRP_domain2 142aa 2e-50
in modified transcript
Changed!
TIGR MRP_assoc_pro 1426aa 0.0
in modified transcript
Changed!
TIGR MRP_assoc_pro 35aa 5e-13
in modified transcript
Changed!
PTZ PTZ00243 772aa 1e-131
in modified transcript
ABCC1
rs.ABCC1.F2 rs.ABCC1.R2 119 296
NCBIGene 36.3 4363
Single exon skipping, size difference: 177
Exclusion in the protein (no frameshift)
Reference transcript: NM_004996
cd ABCC_MRP_domain2 221aa 9e-96
in ref transcript
Domain 2 of the ABC subfamily C. This family is also known as MRP (mulrtidrug resisitance-associated protein). Some of the MRP members have five additional transmembrane segments in their N-terminus, but the function of these additional membrane-spanning domains is not clear. The MRP was found in the multidrug-resistance lung cancer cell in which p-glycoprotein was not overexpressed. MRP exports glutathione by drug stimulation, as well as, certain substrates in conjugated forms with anions, such as glutathione, glucuronate, and sulfate.
Changed!
cd ABCC_MRP_domain1 202aa 6e-89
in ref transcript
Domain 1 of the ABC subfamily C. This family is also known as MRP (mulrtidrug resisitance-associated protein). Some of the MRP members have five additional transmembrane segments in their N-terminas, but the function of these additional membrane-spanning domains is not clear. The MRP was found in the multidrug-resisting lung cancer cell in which p-glycoprotein was not overexpressed. MRP exports glutathione by drug stimulation, as well as, certain substrates in conjugated forms with anions, such as glutathione, glucuronate, and sulfate.
Changed!
TIGR MRP_assoc_pro 1524aa 0.0
in ref transcript
This model describes multi drug resistance-associated protein (MRP) in eukaryotes. The multidrug resistance-associated protein is an integral membrane protein that causes multidrug resistance when overexpressed in mammalian cells. It belongs to ABC transporter superfamily. The protein topology and function was experimentally demonstrated by epitope tagging and immunofluorescence. Insertion of tags in the critical regions associated with drug efflux, abrogated its function. The C-terminal domain seem to highly conserved.
Changed!
PTZ PTZ00243 869aa 1e-161
in ref transcript
ABC transporter; Provisional.
Changed!
COG MdlB 558aa 2e-56
in ref transcript
ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms].
Changed!
cd ABCC_MRP_domain1 97aa 3e-36
in modified transcript
Changed!
cd ABCC_MRP_domain1 63aa 1e-15
in modified transcript
Changed!
TIGR MRP_assoc_pro 1465aa 0.0
in modified transcript
Changed!
PTZ PTZ00243 766aa 1e-133
in modified transcript
Changed!
PTZ PTZ00243 326aa 8e-39
in modified transcript
Changed!
PRK PRK11160 68aa 4e-10
in modified transcript
Changed!
COG COG1123 146aa 2e-07
in modified transcript
ATPase components of various ABC-type transport systems, contain duplicated ATPase [General function prediction only].
ABCC1
rs.ABCC1.F3 rs.ABCC1.R3 172 340
NCBIGene 36.3 4363
Single exon skipping, size difference: 168
Exclusion in the protein (no frameshift)
Reference transcript: NM_004996
cd ABCC_MRP_domain2 221aa 9e-96
in ref transcript
Domain 2 of the ABC subfamily C. This family is also known as MRP (mulrtidrug resisitance-associated protein). Some of the MRP members have five additional transmembrane segments in their N-terminus, but the function of these additional membrane-spanning domains is not clear. The MRP was found in the multidrug-resistance lung cancer cell in which p-glycoprotein was not overexpressed. MRP exports glutathione by drug stimulation, as well as, certain substrates in conjugated forms with anions, such as glutathione, glucuronate, and sulfate.
Changed!
cd ABCC_MRP_domain1 202aa 6e-89
in ref transcript
Domain 1 of the ABC subfamily C. This family is also known as MRP (mulrtidrug resisitance-associated protein). Some of the MRP members have five additional transmembrane segments in their N-terminas, but the function of these additional membrane-spanning domains is not clear. The MRP was found in the multidrug-resisting lung cancer cell in which p-glycoprotein was not overexpressed. MRP exports glutathione by drug stimulation, as well as, certain substrates in conjugated forms with anions, such as glutathione, glucuronate, and sulfate.
Changed!
TIGR MRP_assoc_pro 1524aa 0.0
in ref transcript
This model describes multi drug resistance-associated protein (MRP) in eukaryotes. The multidrug resistance-associated protein is an integral membrane protein that causes multidrug resistance when overexpressed in mammalian cells. It belongs to ABC transporter superfamily. The protein topology and function was experimentally demonstrated by epitope tagging and immunofluorescence. Insertion of tags in the critical regions associated with drug efflux, abrogated its function. The C-terminal domain seem to highly conserved.
Changed!
PTZ PTZ00243 869aa 1e-161
in ref transcript
ABC transporter; Provisional.
Changed!
COG MdlB 558aa 2e-56
in ref transcript
ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms].
Changed!
cd ABCC_MRP_domain1 146aa 2e-47
in modified transcript
Changed!
TIGR MRP_assoc_pro 758aa 0.0
in modified transcript
Changed!
TIGR MRP_assoc_pro 711aa 0.0
in modified transcript
Changed!
PTZ PTZ00243 813aa 1e-126
in modified transcript
Changed!
PTZ PTZ00243 326aa 9e-39
in modified transcript
Changed!
COG CydC 305aa 2e-19
in modified transcript
ABC-type transport system involved in cytochrome bd biosynthesis, fused ATPase and permease components [Energy production and conversion / Posttranslational modification, protein turnover, chaperones].
ABI1
rs.ABI1.F1 rs.ABI1.R1 100 115
NCBIGene 36.3 10006
Single exon skipping, size difference: 15
Exclusion in the protein (no frameshift)
Reference transcript: NM_005470
cd SH3 52aa 3e-15
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
Changed!
pfam Abi_HHR 79aa 4e-32
in ref transcript
Abl-interactor HHR. The region featured in this family is found towards the N-terminus of a number of adaptor proteins that interact with Abl-family tyrosine kinases. More specifically, it is termed the homeo-domain homologous region (HHR), as it is similar to the DNA-binding region of homeo-domain proteins. Other homeo-domain proteins have been implicated in specifying positional information during embryonic development, and in the regulation of the expression of cell-type specific genes. The Abl-interactor proteins are thought to coordinate the cytoplasmic and nuclear functions of the Abl-family kinases, and seem to be involved in cytoskeletal reorganisation, but their precise role remains unclear.
smart SH3 56aa 2e-17
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
Changed!
pfam Abi_HHR 74aa 6e-30
in modified transcript
ABL2
rs.ABL2.F1 rs.ABL2.R1 204 267
NCBIGene 36.3 27
Single exon skipping, size difference: 63
Exclusion in the protein (no frameshift)
Reference transcript: NM_001100108
cd PTKc_Abl 263aa 1e-158
in ref transcript
Catalytic Domain of the Protein Tyrosine Kinase, Abelson kinase. Protein Tyrosine Kinase (PTK) family; Abelson (Abl) kinase; catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. Abl (or c-Abl) is a ubiquitously-expressed cytoplasmic (or nonreceptor) tyr kinase that contains SH3, SH2, and tyr kinase domains in its N-terminal region, as well as nuclear localization motifs, a putative DNA-binding domain, and F- and G-actin binding domains in its C-terminal tail. It also contains a short autoinhibitory cap region in its N-terminus. Abl is normally inactive and requires phosphorylation and myristoylation for activation. Abl function depends on its subcellular localization. In the cytoplasm, Abl plays a role in cell proliferation and survival. In response to DNA damage or oxidative stress, Abl is transported to the nucleus where it induces apoptosis. In chronic myelogenous leukemia (CML) patients, an aberrant translocation results in the replacement of the first exon of Abl with the BCR (breakpoint cluster region) gene. The resulting BCR-Abl fusion protein is constitutively active and associates into tetramers, resulting in a hyperactive kinase sending a continuous signal. This leads to uncontrolled proliferation, morphological transformation and anti-apoptotic effects. BCR-Abl is the target of selective inhibitors, such as imatinib (Gleevec), used in the treatment of CML. Abl2, also known as ARG (Abelson-related gene), is thought to play a cooperative role with Abl in the proper development of the nervous system. The Tel-ARG fusion protein, resulting from reciprocal translocation between chromosomes 1 and 12, is associated with acute myeloid leukemia (AML). The TEL gene is a frequent fusion partner of other tyr kinase oncogenes, including Tel/Abl, Tel/PDGFRbeta, and Tel/Jak2, found in patients with leukemia and myeloproliferative disorders.
cd SH2 90aa 5e-24
in ref transcript
Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
cd SH3 53aa 2e-09
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
pfam Pkinase_Tyr 252aa 1e-112
in ref transcript
Protein tyrosine kinase.
pfam F_actin_bind 163aa 1e-48
in ref transcript
F-actin binding. The F-actin binding domain forms a compact bundle of four antiparallel alpha-helices, which are arranged in a left-handed topology. Binding of F-actin to the F-actin binding domain may result in cytoplasmic retention and subcellular distribution of the protein, as well as possible inhibition of protein function.
smart SH2 84aa 2e-27
in ref transcript
Src homology 2 domains. Src homology 2 domains bind phosphotyrosine-containing polypeptides via 2 surface pockets. Specificity is provided via interaction with residues that are distinct from the phosphotyrosine. Only a single occurrence of a SH2 domain has been found in S. cerevisiae.
pfam SH3_1 54aa 1e-12
in ref transcript
SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organisation. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel.
COG SPS1 331aa 1e-20
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
ACD
rs.ACD.F1 rs.ACD.R1 173 212
NCBIGene 36.3 65057
Alternative 3-prime, size difference: 39
Exclusion in the protein (no frameshift)
Reference transcript: NM_001082486
ACOX3
rs.ACOX3.F1 rs.ACOX3.R1 367 435
NCBIGene 36.3 8310
Single exon skipping, size difference: 68
Exclusion in the protein causing a frameshift
Reference transcript: NM_003501
Changed!
cd AXO 654aa 0.0
in ref transcript
Peroxisomal acyl-CoA oxidases (AXO) catalyze the first set in the peroxisomal fatty acid beta-oxidation, the alpha,beta dehydrogenation of the corresponding trans-enoyl-CoA by FAD, which becomes reduced. In a second oxidative half-reaction, the reduced FAD is reoxidized by molecular oxygen. AXO is generally a homodimer, but it has been reported to form a different type of oligomer in yeast. There are several subtypes of AXO's, based on substrate specificity. Palmitoyl-CoA oxidase acts on straight-chain fatty acids and prostanoids; whereas, the closely related Trihydroxycoprostanoly-CoA oxidase has the greatest activity for 2-methyl branched side chains of bile precursors. Pristanoyl-CoA oxidase, acts on 2-methyl branched fatty acids. AXO has an additional domain, C-terminal to the region with similarity to acyl-CoA dehydrogenases, which is included in this alignment.
Changed!
pfam ACOX 190aa 1e-51
in ref transcript
Acyl-CoA oxidase. This is a family of Acyl-CoA oxidases EC:1.3.3.6. Acyl-coA oxidase converts acyl-CoA into trans-2- enoyl-CoA.
pfam Acyl-CoA_dh_M 58aa 2e-09
in ref transcript
Acyl-CoA dehydrogenase, middle domain. Central domain of Acyl-CoA dehydrogenase has a beta-barrel fold.
pfam Acyl-CoA_dh_1 162aa 1e-05
in ref transcript
Acyl-CoA dehydrogenase, C-terminal domain. C-terminal domain of Acyl-CoA dehydrogenase is an all-alpha, four helical up-and-down bundle.
COG CaiA 373aa 6e-24
in ref transcript
Acyl-CoA dehydrogenases [Lipid metabolism].
Changed!
cd AXO 591aa 0.0
in modified transcript
Changed!
pfam ACOX 103aa 8e-24
in modified transcript
ACP1
rs.ACP1.F1 rs.ACP1.R1 233 388
NCBIGene 36.3 52
Alternative 3-prime, size difference: 155
Inclusion in the protein causing a frameshift
Reference transcript: NM_007099
Changed!
cd LMWPc 149aa 4e-43
in ref transcript
Low molecular weight phosphatase family;.
Changed!
pfam LMWPc 150aa 3e-41
in ref transcript
Low molecular weight phosphotyrosine protein phosphatase.
Changed!
cd LMWPc 78aa 4e-23
in modified transcript
Changed!
pfam LMWPc 78aa 2e-21
in modified transcript
Changed!
COG Wzb 80aa 6e-19
in modified transcript
ACPL2
rs.ACPL2.F1 rs.ACPL2.R1 206 380
NCBIGene 36.3 92370
Multiple exon skipping, size difference: 174
Exclusion in 5'UTR, Exclusion in 5'UTR
Reference transcript: NM_152282
cd HP_HAP_like 81aa 3e-14
in ref transcript
Histidine phosphatase domain found in histidine acid phosphatases and phytases; contains a His residue which is phosphorylated during the reaction. Catalytic domain of HAP (histidine acid phosphatases) and phytases (myo-inositol hexakisphosphate phosphohydrolases). The conserved catalytic core of this domain contains a His residue which is phosphorylated in the reaction. Functions in this subgroup include roles in metabolism, signaling, or regulation, for example Escherichia coli glucose-1-phosphatase functions to scavenge glucose from glucose-1-phosphate and the signaling molecules inositol 1,3,4,5,6-pentakisphosphate (InsP5) and inositol hexakisphosphate (InsP6) are in vivo substrates for eukaryotic multiple inositol polyphosphate phosphatase 1 (Minpp1). Phytases scavenge phosphate from extracellular sources and are added to animal feed while prostatic acid phosphatase (PAP) has been used for many years as a serum marker for prostate cancer. Recently PAP has been shown in mouse models to suppress pain by functioning as an ecto-5prime-nucleotidase. In vivo it dephosphorylates extracellular adenosine monophosphate (AMP) generating adenosine,and leading to the activation of A1-adenosine receptors in dorsal spinal cord.
cd HP_HAP_like 75aa 8e-13
in ref transcript
pfam Acid_phosphat_A 338aa 3e-45
in ref transcript
Histidine acid phosphatase.
ACSM2B
rs.ACSM2B.F1 rs.ACSM2B.R1 102 137
NCBIGene 36.3 348158
Single exon skipping, size difference: 35
Exclusion in 5'UTR
Reference transcript: NM_182617
pfam AMP-binding 411aa 2e-81
in ref transcript
AMP-binding enzyme.
COG Acs 526aa 1e-109
in ref transcript
Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases [Lipid metabolism].
ACTR2
rs.ACTR2.F1 rs.ACTR2.R1 102 117
NCBIGene 36.3 10097
Single exon skipping, size difference: 15
Exclusion in the protein (no frameshift)
Reference transcript: NM_001005386
Changed!
cd ACTIN 386aa 1e-122
in ref transcript
Actin; An ubiquitous protein involved in the formation of filaments that are a major component of the cytoskeleton. Interaction with myosin provides the basis of muscular contraction and many aspects of cell motility. Each actin protomer binds one molecule of ATP and either calcium or magnesium ions. Actin exists as a monomer in low salt concentrations, but filaments form rapidly as salt concentration rises, with the consequent hydrolysis of ATP. Polymerization is regulated by so-called capping proteins. The ATPase domain of actin shares similarity with ATPase domains of hexokinase and hsp70 proteins.
Changed!
smart ACTIN 388aa 1e-137
in ref transcript
Actin. ACTIN subfamily of ACTIN/mreB/sugarkinase/Hsp70 superfamily.
Changed!
PTZ PTZ00004 393aa 1e-106
in ref transcript
actin; Provisional.
Changed!
cd ACTIN 381aa 1e-122
in modified transcript
Changed!
smart ACTIN 383aa 1e-139
in modified transcript
Changed!
PTZ PTZ00004 388aa 1e-107
in modified transcript
ADAM15
rs.ADAM15.F1 rs.ADAM15.R1 100 172
NCBIGene 36.3 8751
Single exon skipping, size difference: 72
Exclusion in the protein (no frameshift)
Reference transcript: NM_207197
cd ZnMc_adamalysin_II_like 200aa 3e-63
in ref transcript
Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
pfam Reprolysin 199aa 2e-62
in ref transcript
Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endopeptidases, but there are also some mammalian proteins and fertilin. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes.
smart ACR 142aa 1e-38
in ref transcript
ADAM Cysteine-Rich Domain.
pfam Pep_M12B_propep 143aa 3e-29
in ref transcript
Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not well aligned.
pfam Disintegrin 75aa 4e-29
in ref transcript
Disintegrin.
ADAM9
rs.ADAM9.F1 rs.ADAM9.R1 106 212
NCBIGene 36.3 8754
Single exon skipping, size difference: 106
Exclusion in the protein causing a frameshift
Reference transcript: NM_003816
cd ZnMc_adamalysin_II_like 193aa 3e-68
in ref transcript
Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
pfam Reprolysin 195aa 2e-74
in ref transcript
Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endopeptidases, but there are also some mammalian proteins and fertilin. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes.
pfam Pep_M12B_propep 132aa 3e-52
in ref transcript
Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not well aligned.
smart ACR 136aa 1e-50
in ref transcript
ADAM Cysteine-Rich Domain.
pfam Disintegrin 77aa 4e-29
in ref transcript
Disintegrin.
ADAMTS13
rs.ADAMTS13.F1 rs.ADAMTS13.R1 145 238
NCBIGene 36.3 11093
Alternative 3-prime, size difference: 93
Exclusion in the protein (no frameshift)
Reference transcript: NM_139025
Changed!
cd ZnMc_ADAMTS_like 204aa 8e-68
in ref transcript
Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
Changed!
pfam Reprolysin 206aa 1e-11
in ref transcript
Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endopeptidases, but there are also some mammalian proteins and fertilin. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes.
smart TSP1 53aa 3e-11
in ref transcript
Thrombospondin type 1 repeats. Type 1 repeats in thrombospondin-1 bind and activate TGF-beta.
Changed!
cd ZnMc_ADAMTS_like 198aa 4e-65
in modified transcript
Changed!
pfam Reprolysin 200aa 3e-10
in modified transcript
ADORA1
rs.ADORA1.F1 rs.ADORA1.R1 138 293
NCBIGene 36.3 134
Single exon skipping, size difference: 155
Exclusion in 5'UTR
Reference transcript: NM_000674
pfam 7tm_1 257aa 1e-20
in ref transcript
7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
AFAP1L2
rs.AFAP1L2.F1 rs.AFAP1L2.R1 100 112
NCBIGene 36.3 84632
Alternative 5-prime, size difference: 12
Exclusion in the protein (no frameshift)
Reference transcript: NM_001001936
cd PH 89aa 8e-07
in ref transcript
Pleckstrin homology (PH) domain. PH domains are only found in eukaryotes. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.
cd PH 77aa 9e-07
in ref transcript
pfam PH 75aa 1e-08
in ref transcript
PH domain. PH stands for pleckstrin homology.
smart PH 88aa 6e-06
in ref transcript
Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
AFF3
rs.AFF3.F1 rs.AFF3.R1 113 188
NCBIGene 36.3 3899
Single exon skipping, size difference: 75
Exclusion in the protein (no frameshift)
Reference transcript: NM_001025108
pfam AF-4 1204aa 0.0
in ref transcript
AF-4 proto-oncoprotein. This family consists of AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental retardation syndrome) nuclear proteins. These proteins have been linked to human diseases such as acute lymphoblastic leukaemia and mental retardation. The family also contains a Drosophila AF4 protein homologue Lilliputian which contains an AT-hook domain. Lilliputian represents a novel pair-rule gene that acts in cytoskeleton regulation, segmentation and morphogenesis in Drosophila.
AGPAT2
rs.AGPAT2.F1 rs.AGPAT2.R1 251 347
NCBIGene 36.3 10555
Single exon skipping, size difference: 96
Exclusion in the protein (no frameshift)
Reference transcript: NM_006412
Changed!
pfam Acyltransferase 115aa 3e-28
in ref transcript
Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene.
Changed!
pfam Acyltransferase 88aa 3e-15
in modified transcript
Changed!
COG PlsC 129aa 5e-10
in modified transcript
AGTR1
rs.AGTR1.F1 rs.AGTR1.R1 200 357
NCBIGene 36.3 185
Single exon skipping, size difference: 157
Inclusion in the protein causing a new stop codon
Reference transcript: NM_031850
Changed!
pfam 7tm_1 252aa 2e-42
in ref transcript
7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
AGTR1
rs.AGTR1.F2 rs.AGTR1.R2 223 281
NCBIGene 36.3 185
Single exon skipping, size difference: 58
Exclusion of the protein initiation site
Reference transcript: NM_031850
pfam 7tm_1 252aa 2e-42
in ref transcript
7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
AGTR1
rs.AGTR1.F3 rs.AGTR1.R3 291 375
NCBIGene 36.3 185
Single exon skipping, size difference: 84
Exclusion in 5'UTR
Reference transcript: NM_031850
pfam 7tm_1 252aa 2e-42
in ref transcript
7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
Inclusion in the protein (no stop codon or frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_006738
cd RhoGEF 195aa 4e-37
in ref transcript
Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases; Also called Dbl-homologous (DH) domain. It appears that PH domains invariably occur C-terminal to RhoGEF/DH domains.
cd PH 91aa 2e-05
in ref transcript
Pleckstrin homology (PH) domain. PH domains are only found in eukaryotes. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.
smart RhoGEF 192aa 5e-40
in ref transcript
Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that PH domains invariably occur C-terminal to RhoGEF/DH domains. Improved coverage.
smart PH 102aa 5e-07
in ref transcript
Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
Pericentrin-AKAP-450 domain of centrosomal targeting protein. This domain is a coiled-coil region close to the C-terminus of centrosomal proteins that is directly responsible for recruiting AKAP-450 and pericentrin to the centrosome. Hence the suggested name for this region is a PACT domain (pericentrin-AKAP-450 centrosomal targeting). This domain is also present at the C-terminus of coiled-coil proteins from Drosophila and S. pombe, and that from the Drosophila protein is sufficient for targeting to the centrosome in mammalian cells. The function of these proteins is unknown but they seem good candidates for having a centrosomal or spindle pole body location. The final 22 residues of this domain in AKAP-450 appear specifically to be a calmodulin-binding domain indicating that this member at least is likely to contribute to centrosome assembly.
Changed!
TIGR SMC_prok_B 404aa 6e-07
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
pfam SMC_N 259aa 6e-05
in ref transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
TIGR SMC_prok_A 252aa 0.001
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
TIGR SMC_prok_B 709aa 0.002
in ref transcript
COG Smc 264aa 3e-04
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
Changed!
COG Smc 317aa 7e-04
in ref transcript
PRK PRK05771 168aa 0.009
in ref transcript
V-type ATP synthase subunit I; Validated.
Changed!
TIGR SMC_prok_B 386aa 2e-07
in modified transcript
Changed!
COG Smc 308aa 4e-04
in modified transcript
AKR1C2
rs.AKR1C2.F1 rs.AKR1C2.R1 245 345
NCBIGene 36.3 1646
Single exon skipping, size difference: 100
Exclusion in 5'UTR
Reference transcript: NM_001354
cd Aldo_ket_red 289aa 1e-71
in ref transcript
Aldo-keto reductases (AKRs) are a superfamily of soluble NAD(P)(H) oxidoreductases whose chief purpose is to reduce aldehydes and ketones to primary and secondary alcohols. AKRs are present in all phyla and are of importance to both health and industrial applications. Members have very distinct functions and include the prokaryotic 2,5-diketo-D-gluconic acid reductases and beta-keto ester reductases, the eukaryotic aldose reductases, aldehyde reductases, hydroxysteroid dehydrogenases, steroid 5beta-reductases, potassium channel beta-subunits and aflatoxin aldehyde reductases, among others.
pfam Aldo_ket_red 292aa 1e-106
in ref transcript
Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity.
COG ARA1 298aa 9e-81
in ref transcript
Aldo/keto reductases, related to diketogulonate reductase [General function prediction only].
AKT1
rs.AKT1.F1 rs.AKT1.R1 239 323
NCBIGene 36.3 207
Single exon skipping, size difference: 84
Exclusion in 5'UTR
Reference transcript: NM_001014432
cd STKc_PKB_alpha 324aa 0.0
in ref transcript
STKc_PKB_alpha: Serine/Threonine Kinases (STKs), Protein Kinase B (PKB) or Akt subfamily, alpha (or Akt1) isoform, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The PKB subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. There are three PKB isoforms from different genes, PKB-alpha (or Akt1), PKB-beta (or Akt2), and PKB-gamma (or Akt3). PKB contains an N-terminal pleckstrin homology (PH) domain and a C-terminal catalytic domain. PKB-alpha is predominantly expressed in endothelial cells. It is critical for the regulation of angiogenesis and the maintenance of vascular integrity. It also plays a role in adipocyte differentiation. Mice deficient in PKB-alpha exhibit perinatal morbidity, growth retardation, reduction in body weight accompanied by reduced sizes of multiple organs, and enhanced apoptosis in some cell types. PKB-alpha activity has been reported to be frequently elevated in breast and prostate cancers. In some cancer cells, PKB-alpha may act as a suppressor of metastasis.
cd PH_Akt 102aa 3e-52
in ref transcript
Akt pleckstrin homology (PH) domain. Akt (Protein Kinase B (PKB)) is a phosphatidylinositol 3'-kinase (PI3K)-dependent Ser/Thr kinase. The PH domain recruits Akt to the plasma membrane by binding to phosphoinositides (PtdIns-3,4-P2) and is required for activation. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
smart S_TKc 249aa 2e-86
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
smart S_TK_X 68aa 7e-15
in ref transcript
Extension to Ser/Thr-type protein kinases.
pfam PH 99aa 1e-11
in ref transcript
PH domain. PH stands for pleckstrin homology.
PTZ PTZ00263 309aa 4e-80
in ref transcript
protein kinase A catalytic subunit; Provisional.
AKT1S1
rs.AKT1S1.F1 rs.AKT1S1.R1 111 533
NCBIGene 36.3 84335
Alternative 5-prime, size difference: 422
Exclusion of the protein initiation site
Reference transcript: NM_032375
ALG9
rs.ALG9.F1 rs.ALG9.R1 117 138
NCBIGene 36.3 79796
Alternative 3-prime, size difference: 21
Exclusion in the protein (no frameshift)
Reference transcript: NM_024740
Changed!
pfam Glyco_transf_22 318aa 2e-57
in ref transcript
Alg9-like mannosyltransferase family. Members of this family are mannosyltransferase enzymes. At least some members are localised in endoplasmic reticulum and involved in GPI anchor biosynthesis.
Changed!
pfam Glyco_transf_22 311aa 8e-59
in modified transcript
ALPK1
rs.ALPK1.F1 rs.ALPK1.R1 209 261
NCBIGene 36.3 80216
Alternative 3-prime, size difference: 52
Inclusion in 5'UTR
Reference transcript: NM_001102406
pfam Alpha_kinase 205aa 4e-57
in ref transcript
Alpha-kinase family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a bifunctional ion channel. This family is known as the alpha-kinase family. The structure of the kinase domain revealed unexpected similarity to eukaryotic protein kinases in the catalytic core as well as to metabolic enzymes with ATP-grasp domains.
ALS2CR8
rs.ALS2CR8.F1 rs.ALS2CR8.R1 207 326
NCBIGene 36.3 79800
Single exon skipping, size difference: 119
Exclusion in 5'UTR
Reference transcript: NM_024744
AMELX
rs.AMELX.F1 rs.AMELX.R1 245 287
NCBIGene 36.3 265
Single exon skipping, size difference: 42
Exclusion in the protein (no frameshift)
Reference transcript: NM_182680
Changed!
pfam Amelogenin 66aa 4e-16
in ref transcript
Amelogenin. Amelogenins play a role in biomineralisation. They seem to regulate the formation of crystallites during the secretory stage of tooth enamel development. thought to play a major role in the structural organisation and mineralisation of developing enamel. They are found in the extracellular matrix. Mutations in X-chromosomal amelogenin can cause Amelogenesis imperfecta.
Changed!
pfam Amelogenin 52aa 1e-17
in modified transcript
AMPH
rs.AMPH.F1 rs.AMPH.R1 297 423
NCBIGene 36.3 273
Single exon skipping, size difference: 126
Exclusion in the protein (no frameshift)
Reference transcript: NM_001635
cd SH3 68aa 3e-06
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
pfam BAR 220aa 3e-55
in ref transcript
BAR domain. BAR domains are dimerisation, lipid binding and curvature sensing modules found in many different protein families. A BAR domain with an additional N-terminal amphipathic helix (an N-BAR) can drive membrane curvature. These N-BAR domains are found in amphiphysin, endophilin, BRAP and Nadrin. BAR domains are also frequently found alongside domains that determine lipid specificity, like pfam00169 and pfam00787 domains in beta centaurins and sorting nexins respectively.
smart SH3 70aa 7e-07
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
Changed!
PRK rne 148aa 0.009
in ref transcript
ribonuclease E; Reviewed.
Changed!
PRK PRK05648 145aa 0.005
in modified transcript
DNA polymerase III subunits gamma and tau; Reviewed.
ANK1
rs.ANK1.F1 rs.ANK1.R1 191 392
NCBIGene 36.3 286
Multiple exon skipping, size difference: 201
Exclusion in the protein (no frameshift), Exclusion of the stop codon
Reference transcript: NM_020478
ANKHD1
rs.ANKHD1.F1 rs.ANKHD1.R1 143 176
NCBIGene 36.3 54882
Alternative 3-prime, size difference: 33
Exclusion in the protein (no frameshift)
Reference transcript: NM_017747
cd ANK 125aa 5e-28
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
cd ANK 127aa 2e-26
in ref transcript
cd ANK 127aa 3e-26
in ref transcript
cd ANK 119aa 2e-25
in ref transcript
cd ANK 128aa 4e-25
in ref transcript
cd ANK 123aa 5e-23
in ref transcript
cd KH-I 63aa 2e-11
in ref transcript
K homology RNA-binding domain, type I. KH binds single-stranded RNA or DNA. It is found in a wide variety of proteins including ribosomal proteins, transcription factors and post-transcriptional modifiers of mRNA. There are two different KH domains that belong to different protein folds, but they share a single KH motif. The KH motif is folded into a beta alpha alpha beta unit. In addition to the core, type II KH domains (e.g. ribosomal protein S3) include N-terminal extension and type I KH domains (e.g. hnRNP K) contain C-terminal extension.
cd ANK 65aa 5e-08
in ref transcript
pfam KH_1 63aa 2e-11
in ref transcript
KH domain. KH motifs can bind RNA in vitro. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia.
TIGR trp 239aa 1e-06
in ref transcript
after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
TIGR trp 222aa 2e-05
in ref transcript
pfam Ank 33aa 3e-05
in ref transcript
Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
pfam Ank 29aa 9e-05
in ref transcript
pfam Ank 31aa 4e-04
in ref transcript
pfam Ank 29aa 0.001
in ref transcript
pfam Ank 27aa 0.004
in ref transcript
COG Arp 183aa 7e-16
in ref transcript
FOG: Ankyrin repeat [General function prediction only].
COG Arp 196aa 1e-15
in ref transcript
COG Arp 201aa 9e-15
in ref transcript
COG Arp 228aa 2e-11
in ref transcript
COG Arp 133aa 3e-11
in ref transcript
ANKRD12
rs.ANKRD12.F1 rs.ANKRD12.R1 250 319
NCBIGene 36.3 23253
Single exon skipping, size difference: 69
Exclusion in the protein (no frameshift)
Reference transcript: NM_015208
cd ANK 113aa 4e-28
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
pfam Ank 31aa 6e-06
in ref transcript
Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
pfam Ank 28aa 4e-05
in ref transcript
pfam Ank 33aa 1e-04
in ref transcript
COG Arp 172aa 2e-12
in ref transcript
FOG: Ankyrin repeat [General function prediction only].
ANKRD16
rs.ANKRD16.F1 rs.ANKRD16.R1 102 207
NCBIGene 36.3 54522
Alternative 5-prime and 3-prime, size difference: 105
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_019046
cd ANK 121aa 5e-22
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
Changed!
cd ANK 156aa 5e-20
in ref transcript
cd ANK 128aa 3e-19
in ref transcript
TIGR trp 142aa 4e-05
in ref transcript
after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
Changed!
TIGR trp 223aa 3e-04
in ref transcript
Changed!
COG Arp 184aa 7e-12
in ref transcript
FOG: Ankyrin repeat [General function prediction only].
COG Arp 122aa 1e-10
in ref transcript
Changed!
cd ANK 121aa 1e-23
in modified transcript
Changed!
TIGR trp 177aa 0.003
in modified transcript
Changed!
COG Arp 169aa 3e-12
in modified transcript
ANKRD7
rs.ANKRD7.F1 rs.ANKRD7.R1 177 237
NCBIGene 36.3 56311
Single exon skipping, size difference: 60
Exclusion in the protein (no frameshift)
Reference transcript: NM_001077708
Changed!
cd ANK 123aa 1e-26
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
cd ANK 65aa 7e-10
in ref transcript
pfam Ank 33aa 7e-07
in ref transcript
Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
TIGR trp 102aa 1e-05
in ref transcript
after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
Changed!
pfam Ank 30aa 4e-05
in ref transcript
Changed!
COG Arp 134aa 1e-10
in ref transcript
FOG: Ankyrin repeat [General function prediction only].
Changed!
cd ANK 126aa 2e-27
in modified transcript
Changed!
pfam Ank 30aa 5e-05
in modified transcript
Changed!
COG Arp 161aa 2e-11
in modified transcript
ANXA11
rs.ANXA11.F1 rs.ANXA11.R1 220 416
NCBIGene 36.3 311
Single exon skipping, size difference: 196
Exclusion in 5'UTR
Reference transcript: NM_145869
pfam Annexin 66aa 8e-22
in ref transcript
Annexin. This family of annexins also includes giardin that has been shown to function as an annexin.
pfam Annexin 66aa 1e-21
in ref transcript
pfam Annexin 66aa 4e-19
in ref transcript
pfam Annexin 67aa 2e-17
in ref transcript
AP1S1
rs.AP1S1.F1 rs.AP1S1.R1 139 416
NCBIGene 36.3 1174
Alternative 5-prime and 3-prime, size difference: 277
Exclusion in the protein (no frameshift), Exclusion of the stop codon
Reference transcript: NM_001283
Changed!
pfam Clat_adaptor_s 142aa 9e-58
in ref transcript
Clathrin adaptor complex small chain.
Changed!
COG APS2 150aa 7e-47
in ref transcript
Clathrin adaptor complex, small subunit [Intracellular trafficking and secretion].
Changed!
pfam Clat_adaptor_s 127aa 4e-54
in modified transcript
Changed!
COG APS2 129aa 2e-43
in modified transcript
AP3D1
rs.AP3D1.F1 rs.AP3D1.R1 199 349
NCBIGene 36.3 8943
Multiple exon skipping, size difference: 150
Inclusion in the protein (no stop codon or frameshift), Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_003938
Changed!
pfam BLVR 260aa 1e-141
in ref transcript
Bovine leukaemia virus receptor (BLVR). This family consists of several bovine specific leukaemia virus receptors which are thought to function as transmembrane proteins, although their exact function is unknown.
pfam Adaptin_N 550aa 1e-132
in ref transcript
Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pits and vesicles. The N-terminal region of the various adaptor proteins (APs) is constant by comparison to the C-terminal which is variable within members of the AP-2 family; and it has been proposed that this constant region interacts with another uniform component of the coated vesicles.
pfam BLVR 194aa 1e-79
in ref transcript
COG COG5096 407aa 1e-12
in ref transcript
Vesicle coat complex, various subunits [Intracellular trafficking and secretion].
Changed!
pfam BLVR 253aa 1e-139
in modified transcript
AP3D1
rs.AP3D1.F2 rs.AP3D1.R2 101 374
NCBIGene 36.3 8943
Exon skipping and alternative 3-prime or 5-prime, size difference: 273
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
Reference transcript: NM_003938
pfam BLVR 260aa 1e-141
in ref transcript
Bovine leukaemia virus receptor (BLVR). This family consists of several bovine specific leukaemia virus receptors which are thought to function as transmembrane proteins, although their exact function is unknown.
Changed!
pfam Adaptin_N 550aa 1e-132
in ref transcript
Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pits and vesicles. The N-terminal region of the various adaptor proteins (APs) is constant by comparison to the C-terminal which is variable within members of the AP-2 family; and it has been proposed that this constant region interacts with another uniform component of the coated vesicles.
pfam BLVR 194aa 1e-79
in ref transcript
Changed!
COG COG5096 407aa 1e-12
in ref transcript
Vesicle coat complex, various subunits [Intracellular trafficking and secretion].
Changed!
pfam Adaptin_N 333aa 3e-61
in modified transcript
Changed!
COG COG5096 212aa 1e-06
in modified transcript
APAF1
rs.APAF1.F1 rs.APAF1.R1 193 322
NCBIGene 36.3 317
Single exon skipping, size difference: 129
Exclusion in the protein (no frameshift)
Reference transcript: NM_181861
Changed!
cd WD40 304aa 4e-60
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
cd WD40 274aa 5e-47
in ref transcript
pfam NB-ARC 286aa 5e-73
in ref transcript
NB-ARC domain.
pfam CARD 85aa 4e-14
in ref transcript
Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold.
smart WD40 40aa 8e-08
in ref transcript
WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
smart WD40 40aa 2e-06
in ref transcript
smart WD40 40aa 8e-05
in ref transcript
smart WD40 42aa 1e-04
in ref transcript
smart WD40 37aa 1e-04
in ref transcript
Changed!
pfam eIF2A 114aa 0.003
in ref transcript
Eukaryotic translation initiation factor eIF2A. This is a family of eukaryotic translation initiation factors.
smart WD40 28aa 0.006
in ref transcript
smart WD40 34aa 0.008
in ref transcript
Changed!
COG COG2319 398aa 4e-33
in ref transcript
FOG: WD40 repeat [General function prediction only].
Changed!
COG COG2319 408aa 5e-25
in ref transcript
Changed!
cd WD40 264aa 1e-55
in modified transcript
Changed!
COG COG2319 390aa 1e-33
in modified transcript
APAF1
rs.APAF1.F2 rs.APAF1.R2 185 218
NCBIGene 36.3 317
Alternative 5-prime, size difference: 33
Exclusion in the protein (no frameshift)
Reference transcript: NM_181861
cd WD40 304aa 4e-60
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
cd WD40 274aa 5e-47
in ref transcript
pfam NB-ARC 286aa 5e-73
in ref transcript
NB-ARC domain.
pfam CARD 85aa 4e-14
in ref transcript
Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold.
smart WD40 40aa 8e-08
in ref transcript
WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
smart WD40 40aa 2e-06
in ref transcript
smart WD40 40aa 8e-05
in ref transcript
smart WD40 42aa 1e-04
in ref transcript
smart WD40 37aa 1e-04
in ref transcript
pfam eIF2A 114aa 0.003
in ref transcript
Eukaryotic translation initiation factor eIF2A. This is a family of eukaryotic translation initiation factors.
smart WD40 28aa 0.006
in ref transcript
smart WD40 34aa 0.008
in ref transcript
COG COG2319 398aa 4e-33
in ref transcript
FOG: WD40 repeat [General function prediction only].
COG COG2319 408aa 5e-25
in ref transcript
APP
rs.APP.F1 rs.APP.R1 245 302
NCBIGene 36.3 351
Single exon skipping, size difference: 57
Exclusion in the protein (no frameshift)
Reference transcript: NM_000484
cd KU 52aa 9e-18
in ref transcript
BPTI/Kunitz family of serine protease inhibitors; Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure.
pfam A4_EXTRA 165aa 8e-89
in ref transcript
Amyloid A4 extracellular domain.
pfam Kunitz_BPTI 52aa 6e-19
in ref transcript
Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure. Certain family members are similar to the tick anticoagulant peptide (TAP). This is a highly selective inhibitor of factor Xa in the blood coagulation pathways. TAP molecules are highly dipolar, and are arranged to form a twisted two- stranded antiparallel beta-sheet followed by an alpha helix.
pfam APP_amyloid 43aa 4e-18
in ref transcript
beta-amyloid precursor protein C-terminus. This is the amyloid, C-terminal, protein of the beta-Amyloid precursor protein (APP) which is a conserved and ubiquitous transmembrane glycoprotein strongly implicated in the pathogenesis of Alzheimer's disease but whose normal biological function is unknown. The C-terminal 100 residues are released and aggregate into amyloid deposits which are strongly implicated in the pathology of Alzheimer's disease plaque-formation. The domain is associated with family A4_EXTRA, pfam02177, further towards the N-terminus.
pfam Beta-APP 34aa 9e-07
in ref transcript
Beta-amyloid peptide (beta-APP).
pfam OmpH 81aa 0.005
in ref transcript
Outer membrane protein (OmpH-like). This family includes outer membrane proteins such as OmpH among others. Skp (OmpH) has been characterised as a molecular chaperone that interacts with unfolded proteins as they emerge in the periplasm from the Sec translocation machinery.
Changed!
COG SbcC 197aa 0.006
in ref transcript
ATPase involved in DNA repair [DNA replication, recombination, and repair].
Changed!
PRK xseA 173aa 0.001
in modified transcript
exodeoxyribonuclease VII large subunit; Reviewed.
ARFGAP1
rs.ARFGAP1.F1 rs.ARFGAP1.R1 169 193
NCBIGene 36.3 55738
Single exon skipping, size difference: 30
Exclusion in the protein (no frameshift)
Reference transcript: NM_175609
pfam ArfGap 118aa 3e-48
in ref transcript
Putative GTPase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs.
COG COG5347 159aa 4e-31
in ref transcript
GTPase-activating protein that regulates ARFs (ADP-ribosylation factors), involved in ARF-mediated vesicular transport [Intracellular trafficking and secretion].
ARHGAP9
rs.ARHGAP9.F1 rs.ARHGAP9.R1 295 401
NCBIGene 36.3 64333
Single exon skipping, size difference: 106
Exclusion in the protein causing a frameshift
Reference transcript: NM_032496
Changed!
cd RhoGAP_ARHGAP27_15_12_9 205aa 6e-82
in ref transcript
RhoGAP_ARHGAP27_15_12_9: GTPase-activator protein (GAP) domain for Rho-like GTPases found in ARHGAP27 (also called CAMGAP1), ARHGAP15, 12 and 9-like proteins; This subgroup of ARHGAPs are multidomain proteins that contain RhoGAP, PH, SH3 and WW domains. Most members that are studied show GAP activity towards Rac1, some additionally show activity towards Cdc42. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude.
cd PH 109aa 2e-09
in ref transcript
Pleckstrin homology (PH) domain. PH domains are only found in eukaryotes. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.
cd SH3 54aa 0.002
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
Changed!
pfam RhoGAP 171aa 7e-43
in ref transcript
RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases.
pfam PH 112aa 3e-08
in ref transcript
PH domain. PH stands for pleckstrin homology.
pfam SH3_1 54aa 3e-05
in ref transcript
SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organisation. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel.
Changed!
cd RhoGAP_ARHGAP27_15_12_9 119aa 2e-38
in modified transcript
Changed!
pfam RhoGAP 103aa 6e-16
in modified transcript
ARID4A
rs.ARID4A.F1 rs.ARID4A.R1 146 308
NCBIGene 36.3 5926
Alternative 5-prime, size difference: 162
Exclusion in the protein (no frameshift)
Reference transcript: NM_002892
cd CHROMO 38aa 0.003
in ref transcript
Chromatin organization modifier (chromo) domain is a conserved region of around 50 amino acids found in a variety of chromosomal proteins, which appear to play a role in the functional organization of the eukaryotic nucleus. Experimental evidence implicates the chromo domain in the binding activity of these proteins to methylated histone tails and maybe RNA. May occur as single instance, in a tandem arrangement or followd by a related "chromo shadow" domain.
pfam ARID 96aa 1e-31
in ref transcript
ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain.
pfam RBB1NT 93aa 8e-25
in ref transcript
RBB1NT (NUC162) domain. This domain is found N terminal to the ARID/BRIGHT domain in DNA-binding proteins of the Retinoblastoma-binding protein 1 family.
smart TUDOR 56aa 1e-05
in ref transcript
Tudor domain. Domain of unknown function present in several RNA-binding proteins. 10 copies in the Drosophila Tudor protein. Initial proposal that the survival motor neuron gene product contain a Tudor domain are corroborated by more recent database search techniques such as PSI-BLAST (unpublished).
smart CHROMO 42aa 1e-04
in ref transcript
Chromatin organization modifier domain.
ARL13B
rs.ARL13B.F1 rs.ARL13B.R1 136 457
NCBIGene 36.3 200894
Multiple exon skipping, size difference: 321
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_182896
Changed!
cd Arl2l1_Arl13_like 167aa 3e-79
in ref transcript
Arl2l1/Arl13 subfamily. Arl2l1 (Arl2-like protein 1) and Arl13 form a subfamily of the Arf family of small GTPases. Arl2l1 was identified in human cells during a search for the gene(s) responsible for Bardet-Biedl syndrome (BBS). Like Arl6, the identified BBS gene, Arl2l1 is proposed to have cilia-specific functions. Arl13 is found on the X chromosome, but its expression has not been confirmed; it may be a pseudogene.
Changed!
pfam Arf 171aa 4e-36
in ref transcript
ADP-ribosylation factor family. Pfam combines a number of different Prosite families together.
Changed!
pfam DUF1682 43aa 0.007
in ref transcript
Protein of unknown function (DUF1682). The members of this family are all hypothetical eukaryotic proteins of unknown function. One member is described as being an adipocyte-specific protein, but no evidence of this was found.
TIGR major_cap_HK97 94aa 0.009
in ref transcript
This model family represents the major capsid protein component of the heads (capsids) of bacteriophage HK97, phi-105, P27, and related phage. This model represents one of several analogous families lacking detectable sequence similarity. The gene encoding this component is typically located in an operon encoding the small and large terminase subunits, the portal protein and the prohead or maturation protease.
Changed!
PTZ PTZ00133 167aa 2e-23
in ref transcript
ADP-ribosylation factor; Provisional.
PRK rne 46aa 0.002
in ref transcript
ribonuclease E; Reviewed.
Changed!
cd Arl2l1_Arl13_like 62aa 4e-25
in modified transcript
Changed!
pfam Arf 63aa 6e-05
in modified transcript
ARL5A
rs.ARL5A.F1 rs.ARL5A.R1 100 358
NCBIGene 36.3 26225
Alternative 5-prime, size difference: 258
Exclusion of the protein initiation site
Reference transcript: NM_012097
Changed!
cd Arl5_Arl8 173aa 1e-94
in ref transcript
Arl5/Arl8 subfamily. Arl5 (Arf-like 5) and Arl8, like Arl4 and Arl7, are localized to the nucleus and nucleolus. Arl5 is developmentally regulated during embryogenesis in mice. Human Arl5 interacts with the heterochromatin protein 1-alpha (HP1alpha), a nonhistone chromosomal protein that is associated with heterochromatin and telomeres, and prevents telomere fusion. Arl5 may also play a role in embryonic nuclear dynamics and/or signaling cascades. Arl8 was identified from a fetal cartilage cDNA library. It is found in brain, heart, lung, cartilage, and kidney. No function has been assigned for Arl8 to date.
Changed!
pfam Arf 158aa 2e-61
in ref transcript
ADP-ribosylation factor family. Pfam combines a number of different Prosite families together.
Changed!
PTZ PTZ00133 179aa 5e-55
in ref transcript
ADP-ribosylation factor; Provisional.
Changed!
cd Arl5_Arl8 138aa 6e-76
in modified transcript
Changed!
pfam Arf 135aa 3e-50
in modified transcript
Changed!
PTZ PTZ00133 142aa 4e-43
in modified transcript
ARL6IP4
rs.ARL6IP4.F1 rs.ARL6IP4.R1 101 125
NCBIGene 36.3 51329
Alternative 3-prime, size difference: 24
Exclusion in the protein (no frameshift)
Reference transcript: NM_018694
pfam SR-25 73aa 2e-19
in ref transcript
Nuclear RNA-splicing-associated protein. SR-25, otherwise known as ADP-ribosylation factor-like factor 6-interacting protein 4, is expressed in virtually all tissues. At the N-terminus there is a repeat of serine-arginine (SR repeat), and towards the middle of the protein there are clusters of both serines and of basic amino acids. The presence of many nuclear localisation signals strongly implies that this is a nuclear protein that may contribute to RNA splicing. SR-25 is also implicated, along with heat-shock-protein-27, as a mediator in the Rac1 (GTPase ras-related C3 botulinum toxin substrate 1) signalling pathway.
ARPC4
rs.ARPC4.F1 rs.ARPC4.R1 103 522
NCBIGene 36.3 10093
Alternative 5-prime, size difference: 419
Exclusion of the protein initiation site
Reference transcript: NM_005718
Changed!
pfam ARPC4 168aa 1e-88
in ref transcript
ARP2/3 complex 20 kDa subunit (ARPC4). This family consists of several eukaryotic ARP2/3 complex 20 kDa subunit (P20-ARC) proteins. The Arp2/3 protein complex has been implicated in the control of actin polymerisation in cells. The human complex consists of seven subunits which include the actin related proteins Arp2 and Arp3 it has been suggested that the complex promotes actin assembly in lamellipodia and may participate in lamellipodial protrusion.
Changed!
PTZ PTZ00278 168aa 2e-59
in ref transcript
ARP2/3 complex subunit; Provisional.
ARTN
rs.ARTN.F1 rs.ARTN.R1 150 174
NCBIGene 36.3 9048
Alternative 3-prime, size difference: 24
Exclusion in the protein (no frameshift)
Reference transcript: NM_057090
pfam TGF_beta 96aa 8e-06
in ref transcript
Transforming growth factor beta like domain.
ASGR2
rs.ASGR2.F1 rs.ASGR2.R1 222 279
NCBIGene 36.3 433
Alternative 5-prime, size difference: 57
Exclusion in the protein (no frameshift)
Reference transcript: NM_080912
cd CLECT_DC-SIGN_like 126aa 1e-40
in ref transcript
CLECT_DC-SIGN_like: C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR). This group also contains proteins similar to hepatic asialoglycoprotein receptor (ASGP-R) and langerin in human. These proteins are type II membrane proteins with a CTLD ectodomain. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. DC-SIGN is thought to mediate the initial contact between dendritic cells and resting T cells, and may also mediate the rolling of DCs on epithelium. DC-SIGN and DC-SIGNR bind to oligosaccharides present on human tissues, as well as, on pathogens including parasites, bacteria, and viruses. DC-SIGN and DC-SIGNR bind to HIV enhancing viral infection of T cells. DC-SIGN and DC-SIGNR are homotetrameric, and contain four CTLDs stabilized by a coiled coil of alpha helices. The hepatic ASGP-R is an endocytic recycling receptor which binds and internalizes desialylated glycoproteins having a terminal galactose or N-acetylgalactosamine residues on their N-linked carbohydrate chains, via the clathrin-coated pit mediated endocytic pathway, and delivers them to lysosomes for degradation. It has been proposed that glycoproteins bearing terminal Sia (sialic acid) alpha2, 6GalNAc and Sia alpha2, 6Gal are endogenous ligands for ASGP-R and that ASGP-R participates in regulating the relative concentration of serum glycoproteins bearing alpha 2,6-linked Sia. The human ASGP-R is a hetero-oligomer composed of two subunits, both of which are found within this group. Langerin is expressed in a subset of dendritic leukocytes, the Langerhans cells (LC). Langerin induces the formation of Birbeck Granules (BGs) and associates with these BGs following internalization. Langerin binds, in a calcium-dependent manner, to glyco-conjugates containing mannose and related sugars mediating their uptake and degradation. Langerin molecules oligomerize as trimers with three CTLDs held together by a coiled-coil of alpha helices.
Changed!
pfam Lectin_N 143aa 5e-55
in ref transcript
Hepatic lectin, N-terminal domain.
pfam Lectin_C 109aa 4e-32
in ref transcript
Lectin C-type domain. This family includes both long and short form C-type.
Changed!
pfam Lectin_N 143aa 4e-51
in modified transcript
ASGR2
rs.ASGR2.F2 rs.ASGR2.R2 112 127
NCBIGene 36.3 433
Alternative 5-prime, size difference: 15
Exclusion in the protein (no frameshift)
Reference transcript: NM_080912
cd CLECT_DC-SIGN_like 126aa 1e-40
in ref transcript
CLECT_DC-SIGN_like: C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR). This group also contains proteins similar to hepatic asialoglycoprotein receptor (ASGP-R) and langerin in human. These proteins are type II membrane proteins with a CTLD ectodomain. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. DC-SIGN is thought to mediate the initial contact between dendritic cells and resting T cells, and may also mediate the rolling of DCs on epithelium. DC-SIGN and DC-SIGNR bind to oligosaccharides present on human tissues, as well as, on pathogens including parasites, bacteria, and viruses. DC-SIGN and DC-SIGNR bind to HIV enhancing viral infection of T cells. DC-SIGN and DC-SIGNR are homotetrameric, and contain four CTLDs stabilized by a coiled coil of alpha helices. The hepatic ASGP-R is an endocytic recycling receptor which binds and internalizes desialylated glycoproteins having a terminal galactose or N-acetylgalactosamine residues on their N-linked carbohydrate chains, via the clathrin-coated pit mediated endocytic pathway, and delivers them to lysosomes for degradation. It has been proposed that glycoproteins bearing terminal Sia (sialic acid) alpha2, 6GalNAc and Sia alpha2, 6Gal are endogenous ligands for ASGP-R and that ASGP-R participates in regulating the relative concentration of serum glycoproteins bearing alpha 2,6-linked Sia. The human ASGP-R is a hetero-oligomer composed of two subunits, both of which are found within this group. Langerin is expressed in a subset of dendritic leukocytes, the Langerhans cells (LC). Langerin induces the formation of Birbeck Granules (BGs) and associates with these BGs following internalization. Langerin binds, in a calcium-dependent manner, to glyco-conjugates containing mannose and related sugars mediating their uptake and degradation. Langerin molecules oligomerize as trimers with three CTLDs held together by a coiled-coil of alpha helices.
Changed!
pfam Lectin_N 143aa 5e-55
in ref transcript
Hepatic lectin, N-terminal domain.
pfam Lectin_C 109aa 4e-32
in ref transcript
Lectin C-type domain. This family includes both long and short form C-type.
Changed!
pfam Lectin_N 138aa 1e-56
in modified transcript
ASRGL1
rs.ASRGL1.F1 rs.ASRGL1.R1 213 290
NCBIGene 36.3 80150
Alternative 3-prime, size difference: 77
Inclusion in 5'UTR
Reference transcript: NM_025080
cd ASRGL1_like 295aa 1e-100
in ref transcript
ASRGL1_like domains, a subfamily of the L-Asparaginase type 2-like enzymes. The wider family includes Glycosylasparaginase, Taspase 1 and L-Asparaginase type 2 enzymes. The proenzymes undergo autoproteolytic cleavage before a threonine to generate alpha and beta subunits. The threonine becomes the N-terminal residue of the beta subunit and is the catalytic residue. ASRGL1, or asparaginase-like 1, has been cloned from mammalian testis cDNA libraries. It has been identified as a sperm antigen that may induce the production of autoantibodies following obstruction of the male reproductive tract, e.g. vasectomy.
pfam Asparaginase_2 250aa 5e-56
in ref transcript
Asparaginase.
COG COG1446 292aa 1e-63
in ref transcript
Asparaginase [Amino acid transport and metabolism].
ASS1
rs.ASS1.F1 rs.ASS1.R1 202 264
NCBIGene 36.3 445
Single exon skipping, size difference: 62
Exclusion in the protein causing a frameshift
Reference transcript: NM_000050
Changed!
cd Argininosuccinate_Synthase 392aa 1e-161
in ref transcript
Argininosuccinate synthase. The Argininosuccinate synthase is a urea cycle enzyme that catalyzes the penultimate step in arginine biosynthesis: the ATP-dependent ligation of citrulline to aspartate to form argininosuccinate, AMP and pyrophosphate . In humans, a defect in the AS gene causes citrullinemia, a genetic disease characterized by severe vomiting spells and mental retardation. AS is a homotetrameric enzyme of chains of about 400 amino-acid residues. An arginine seems to be important for the enzyme's catalytic mechanism. The sequences of AS from various prokaryotes, archaebacteria and eukaryotes show significant similarity.
Changed!
pfam Arginosuc_synth 398aa 1e-180
in ref transcript
Arginosuccinate synthase. This family contains a PP-loop motif.
Changed!
PRK PRK00509 402aa 1e-170
in ref transcript
argininosuccinate synthase; Provisional.
ASTN2
rs.ASTN2.F1 rs.ASTN2.R1 100 112
NCBIGene 36.3 23245
Alternative 3-prime, size difference: 12
Exclusion in the protein (no frameshift)
Reference transcript: NM_198187
smart MACPF 184aa 1e-36
in ref transcript
membrane-attack complex / perforin.
ATG4B
rs.ATG4B.F1 rs.ATG4B.R1 101 121
NCBIGene 36.3 23192
Alternative 5-prime, size difference: 20
Inclusion in the protein causing a frameshift
Reference transcript: NM_013325
pfam Peptidase_C54 301aa 1e-101
in ref transcript
Peptidase family C54.
ATG9A
rs.ATG9A.F1 rs.ATG9A.R1 125 177
NCBIGene 36.3 79065
Single exon skipping, size difference: 52
Exclusion in 5'UTR
Reference transcript: NM_001077198
pfam APG9 359aa 1e-143
in ref transcript
Autophagy protein Apg9. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg9 plays a direct role in the formation of the cytoplasm to vacuole targeting and autophagic vesicles, possibly serving as a marker for a specialised compartment essential for these vesicle-mediated alternative targeting pathways.
ATP11A
rs.ATP11A.F1 rs.ATP11A.R1 142 169
NCBIGene 36.3 23250
Exon skipping and alternative 3-prime or 5-prime, size difference: 27
Inclusion in the protein causing a new stop codon, Exclusion in the protein (no frameshift)
Reference transcript: NM_032189
TIGR ATPase-Plipid 1065aa 0.0
in ref transcript
This model describes the P-type ATPase responsible for transporting phospholipids from one leaflet of bilayer membranes to the other. These ATPases are found only in eukaryotes.
COG MgtA 1061aa 1e-106
in ref transcript
Cation transport ATPase [Inorganic ion transport and metabolism].
ATP2A3
rs.ATP2A3.F1 rs.ATP2A3.R1 104 117
NCBIGene 36.3 489
Alternative 5-prime, size difference: 13
Exclusion in the protein causing a frameshift
Reference transcript: NM_174953
cd HAD_like 118aa 0.002
in ref transcript
Haloacid dehalogenase-like hydrolases. The haloacid dehalogenase-like (HAD) superfamily includes L-2-haloacid dehalogenase, epoxide hydrolase, phosphoserine phosphatase, phosphomannomutase, phosphoglycolate phosphatase, P-type ATPase, and many others, all of which use a nucleophilic aspartate in their phosphoryl transfer reaction. All members possess a highly conserved alpha/beta core domain, and many also possess a small cap domain, the fold and function of which is variable. Members of this superfamily are sometimes referred to as belonging to the DDDD superfamily of phosphohydrolases.
TIGR ATPase-IIA1_Ca 937aa 0.0
in ref transcript
The calcium P-type ATPases have been characterized as Type IIA based on a phylogenetic analysis which distinguishes this group from the Type IIB PMCA calcium pump modelled by TIGR01517. A separate analysis divides Type IIA into sub-types, SERCA and PMR1 the latter of which is modelled by TIGR01522.
pfam Cation_ATPase_N 57aa 1e-12
in ref transcript
Cation transporter/ATPase, N-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport.
COG MgtA 980aa 0.0
in ref transcript
Cation transport ATPase [Inorganic ion transport and metabolism].
ATP2B1
rs.ATP2B1.F1 rs.ATP2B1.R1 232 386
NCBIGene 36.3 490
Single exon skipping, size difference: 154
Inclusion in the protein causing a frameshift
Reference transcript: NM_001682
TIGR ATPase-IIB_Ca 707aa 0.0
in ref transcript
The calcium P-type ATPases have been characterized as Type IIB based on a phylogenetic analysis which distinguishes this group from the Type IIA SERCA calcium pump. A separate analysis divides Type IIA into sub-types (SERCA and PMR1) which are modelled by two corresponding HMMs (TIGR01116 and TIGR01522). This model is well separated from those.
TIGR ATPase-IIB_Ca 281aa 1e-100
in ref transcript
TIGR ATPase-IID_K-Na 297aa 7e-18
in ref transcript
The Leishmania sequence (GP|3192903), which falls between trusted and noise in this model, may very well turn out to be an active potassium pump.
COG MgtA 703aa 1e-127
in ref transcript
Cation transport ATPase [Inorganic ion transport and metabolism].
COG MgtA 237aa 2e-26
in ref transcript
ATP2B2
rs.ATP2B2.F1 rs.ATP2B2.R1 176 311
NCBIGene 36.3 491
Multiple exon skipping, size difference: 135
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_001001331
Changed!
TIGR ATPase-IIB_Ca 706aa 0.0
in ref transcript
The calcium P-type ATPases have been characterized as Type IIB based on a phylogenetic analysis which distinguishes this group from the Type IIA SERCA calcium pump. A separate analysis divides Type IIA into sub-types (SERCA and PMR1) which are modelled by two corresponding HMMs (TIGR01116 and TIGR01522). This model is well separated from those.
Changed!
TIGR ATPase-IIB_Ca 278aa 2e-78
in ref transcript
Changed!
COG MgtA 698aa 1e-126
in ref transcript
Cation transport ATPase [Inorganic ion transport and metabolism].
Changed!
COG MgtA 238aa 2e-24
in ref transcript
Changed!
TIGR ATPase-IIB_Ca 1031aa 0.0
in modified transcript
Changed!
COG MgtA 981aa 1e-151
in modified transcript
ATP2C1
rs.ATP2C1.F1 rs.ATP2C1.R1 169 199
NCBIGene 36.3 27032
Alternative 5-prime, size difference: 30
Exclusion in the protein (no frameshift)
Reference transcript: NM_001001486
cd HAD_like 111aa 2e-04
in ref transcript
Haloacid dehalogenase-like hydrolases. The haloacid dehalogenase-like (HAD) superfamily includes L-2-haloacid dehalogenase, epoxide hydrolase, phosphoserine phosphatase, phosphomannomutase, phosphoglycolate phosphatase, P-type ATPase, and many others, all of which use a nucleophilic aspartate in their phosphoryl transfer reaction. All members possess a highly conserved alpha/beta core domain, and many also possess a small cap domain, the fold and function of which is variable. Members of this superfamily are sometimes referred to as belonging to the DDDD superfamily of phosphohydrolases.
TIGR ATPase-IIA2_Ca 881aa 0.0
in ref transcript
The calcium P-type ATPases have been characterized as Type IIA based on a phylogenetic analysis which distinguishes this group from the Type IIB PMCA calcium pump modelled by TIGR01517. A separate analysis divides Type IIA into sub-types, SERCA and PMR1 the former of which is modelled by TIGR01116.
COG MgtA 865aa 0.0
in ref transcript
Cation transport ATPase [Inorganic ion transport and metabolism].
ATP6V0E2
rs.ATP6V0E2.F1 rs.ATP6V0E2.R1 155 268
NCBIGene 36.3 155066
Single exon skipping, size difference: 113
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001100592
Changed!
pfam ATP_synt_H 43aa 8e-14
in ref transcript
ATP synthase subunit H. ATP synthase subunit H is an extremely hydrophobic of approximately 9 kDa. This subunit may be required for assembly of vacuolar ATPase.
Changed!
pfam ATP_synt_H 65aa 2e-22
in modified transcript
ATP6V1C2
rs.ATP6V1C2.F1 rs.ATP6V1C2.R1 139 277
NCBIGene 36.3 245973
Single exon skipping, size difference: 138
Exclusion in the protein (no frameshift)
Reference transcript: NM_001039362
Changed!
pfam V-ATPase_C 414aa 1e-129
in ref transcript
V-ATPase subunit C.
Changed!
COG COG5127 380aa 1e-54
in ref transcript
Vacuolar H+-ATPase V1 sector, subunit C [Energy production and conversion].
Changed!
pfam V-ATPase_C 368aa 1e-135
in modified transcript
Changed!
COG COG5127 334aa 3e-59
in modified transcript
ATP8A1
rs.ATP8A1.F1 rs.ATP8A1.R1 100 145
NCBIGene 36.3 10396
Single exon skipping, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_006095
Changed!
TIGR ATPase-Plipid 1038aa 0.0
in ref transcript
This model describes the P-type ATPase responsible for transporting phospholipids from one leaflet of bilayer membranes to the other. These ATPases are found only in eukaryotes.
Changed!
COG MgtA 1036aa 1e-121
in ref transcript
Cation transport ATPase [Inorganic ion transport and metabolism].
Changed!
TIGR ATPase-Plipid 1023aa 0.0
in modified transcript
Changed!
COG MgtA 1021aa 1e-122
in modified transcript
ATPAF1
rs.ATPAF1.F1 rs.ATPAF1.R1 136 340
NCBIGene 36.3 64756
Multiple exon skipping, size difference: 204
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_022745
Changed!
pfam ATP11 244aa 1e-102
in ref transcript
ATP11 protein. This family consists of several eukaryotic ATP11 proteins. In Saccharomyces cerevisiae, expression of functional F1-ATPase requires two proteins encoded by the ATP11 and ATP12 genes. Atp11p is a molecular chaperone of the mitochondrial matrix that participates in the biogenesis pathway to form F1, the catalytic unit of the ATP synthase.
Changed!
pfam ATP11 176aa 3e-49
in modified transcript
ATRX
rs.ATRX.F1 rs.ATRX.R1 101 215
NCBIGene 36.3 546
Single exon skipping, size difference: 114
Exclusion in the protein (no frameshift)
Reference transcript: NM_000489
cd HELICc 147aa 4e-16
in ref transcript
Helicase superfamily c-terminal domain; associated with DEXDc-, DEAD-, and DEAH-box proteins, yeast initiation factor 4A, Ski2p, and Hepatitis C virus NS3 helicases; this domain is found in a wide variety of helicases and helicase related proteins; may not be an autonomously folding unit, but an integral part of the helicase; 4 helicase superfamilies at present according to the organization of their signature motifs; all helicases share the ability to unwind nucleic acid duplexes with a distinct directional polarity; they utilize the free energy from nucleoside triphosphate hydrolysis to fuel their translocation along DNA, unwinding the duplex in the process.
cd DEXDc 163aa 4e-11
in ref transcript
DEAD-like helicases superfamily. A diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.
pfam SNF2_N 317aa 9e-84
in ref transcript
SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin unwinding (e.g., ISWI) as well as a variety of other proteins with little functional information (e.g., lodestar, ETL1).
pfam Helicase_C 77aa 4e-16
in ref transcript
Helicase conserved C-terminal domain. The Prosite family is restricted to DEAD/H helicases, whereas this domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase.
COG HepA 321aa 1e-27
in ref transcript
Superfamily II DNA/RNA helicases, SNF2 family [Transcription / DNA replication, recombination, and repair].
COG HepA 194aa 6e-22
in ref transcript
ATXN3
rs.ATXN3.F1 rs.ATXN3.R1 108 273
NCBIGene 36.3 4287
Single exon skipping, size difference: 165
Exclusion in the protein (no frameshift)
Reference transcript: NM_004993
Changed!
pfam Josephin 161aa 2e-69
in ref transcript
Josephin.
Changed!
pfam Josephin 105aa 2e-45
in modified transcript
ATXN7L3
rs.ATXN7L3.F1 rs.ATXN7L3.R1 107 128
NCBIGene 36.3 56970
Alternative 3-prime, size difference: 21
Exclusion in the protein (no frameshift)
Reference transcript: NM_020218
pfam Sgf11 45aa 1e-05
in ref transcript
Sgf11 (transcriptional regulation protein). The Sgf11 family is a SAGA complex subunit in Saccharomyces cerevisiae. The SAGA complex is a multisubunit protein complex involved in transcriptional regulation. SAGA combines proteins involved in interactions with DNA-bound activators and TATA-binding protein (TBP), as well as enzymes for histone acetylation and deubiquitylation.
pfam SCA7 30aa 0.006
in ref transcript
SCA7. This domain is found in the protein Sgf73/Sca7 which is a component of the multihistone acetyltransferase complexes SAGA and SILK. This domain is also found in Ataxin-7, a human protein which in its polyglutamine expanded pathological form, is responsible for the neurodegenerative disease spinocerebellar ataxia 7 (SCA7).
AURKA
rs.AURKA.F1 rs.AURKA.R1 119 217
NCBIGene 36.3 6790
Single exon skipping, size difference: 98
Exclusion in 5'UTR
Reference transcript: NM_198433
cd S_TKc 252aa 1e-80
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
smart S_TKc 240aa 2e-80
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
PTZ PTZ00263 244aa 4e-53
in ref transcript
protein kinase A catalytic subunit; Provisional.
AXIN1
rs.AXIN1.F1 rs.AXIN1.R1 297 405
NCBIGene 36.3 8312
Single exon skipping, size difference: 108
Exclusion in the protein (no frameshift)
Reference transcript: NM_003502
pfam RGS 123aa 2e-31
in ref transcript
Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits.
smart DAX 83aa 4e-27
in ref transcript
Domain present in Dishevelled and axin. Domain of unknown function.
pfam Axin_b-cat_bind 33aa 2e-08
in ref transcript
Axin beta-catenin binding domain. This domain is found on the scaffolding protein Axin which is a component of the beta-catenin destruction complex. It competes with the tumour suppressor adenomatous polyposis coli protein (APC) for binding to beta-catenin.
B3GALT5
rs.B3GALT5.F1 rs.B3GALT5.R1 237 397
NCBIGene 36.3 10317
Single exon skipping, size difference: 160
Inclusion in the protein causing a new stop codon
Reference transcript: NM_033172
Changed!
pfam Galactosyl_T 191aa 2e-44
in ref transcript
Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta- galactosyltrans ferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcNAc terminal chains in the synthesis of the lacto-series oligosaccharides types 1 and 2.
BACE2
rs.BACE2.F1 rs.BACE2.R1 114 264
NCBIGene 36.3 25825
Single exon skipping, size difference: 150
Exclusion in the protein (no frameshift)
Reference transcript: NM_012105
Changed!
cd beta_secretase_like 362aa 0.0
in ref transcript
Beta-secretase, aspartic-acid protease important in the pathogenesis of Alzheimer's disease. Beta-secretase also called BACE (beta-site of APP cleaving enzyme) or memapsin-2. Beta-secretase is an aspartic-acid protease important in the pathogenesis of Alzheimer's disease, and in the formation of myelin sheaths in peripheral nerve cells. It cleaves amyloid precursor protein (APP) to reveal the N-terminus of the beta-amyloid peptides. The beta-amyloid peptides are the major components of the amyloid plaques formed in the brain of patients with Alzheimer's disease (AD). Since BACE mediates one of the cleavages responsible for generation of AD, it is regarded as a potential target for pharmacological intervention in AD. Beta-secretase is a member of pepsin family of aspartic proteases. Same as other aspartic proteases, beta-secretase is a bilobal enzyme, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The enzymes specifically cleave bonds in peptides which have at least six residues in length with hydrophobic residues in both the P1 and P1' positions. The active site is located at the groove formed by the two lobes, with an extended loop projecting over the cleft to form an 11-residue flap, which encloses substrates and inhibitors in the active site. Specificity is determined by nearest-neighbor hydrophobic residues surrounding the catalytic aspartates, and by three residues in the flap. The enzymes are mostly secreted from cells as inactive proenzymes that activate autocatalytically at acidic pH. This family of aspartate proteases is classified by MEROPS as the peptidase family A1 (pepsin A, clan AA).
Changed!
pfam Asp 339aa 3e-34
in ref transcript
Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), which are much smaller and appear to be homologous to a single domain of the eukaryotic asp proteases.
Changed!
PTZ PTZ00147 338aa 6e-20
in ref transcript
histoaspartic protease (plasmepsin I or II); Provisional.
Changed!
cd beta_secretase_like 312aa 1e-165
in modified transcript
Changed!
pfam Asp 289aa 8e-33
in modified transcript
Changed!
PTZ PTZ00147 288aa 1e-17
in modified transcript
BACE2
rs.BACE2.F2 rs.BACE2.R2 136 305
NCBIGene 36.3 25825
Single exon skipping, size difference: 169
Exclusion in the protein causing a frameshift
Reference transcript: NM_012105
Changed!
cd beta_secretase_like 362aa 0.0
in ref transcript
Beta-secretase, aspartic-acid protease important in the pathogenesis of Alzheimer's disease. Beta-secretase also called BACE (beta-site of APP cleaving enzyme) or memapsin-2. Beta-secretase is an aspartic-acid protease important in the pathogenesis of Alzheimer's disease, and in the formation of myelin sheaths in peripheral nerve cells. It cleaves amyloid precursor protein (APP) to reveal the N-terminus of the beta-amyloid peptides. The beta-amyloid peptides are the major components of the amyloid plaques formed in the brain of patients with Alzheimer's disease (AD). Since BACE mediates one of the cleavages responsible for generation of AD, it is regarded as a potential target for pharmacological intervention in AD. Beta-secretase is a member of pepsin family of aspartic proteases. Same as other aspartic proteases, beta-secretase is a bilobal enzyme, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The enzymes specifically cleave bonds in peptides which have at least six residues in length with hydrophobic residues in both the P1 and P1' positions. The active site is located at the groove formed by the two lobes, with an extended loop projecting over the cleft to form an 11-residue flap, which encloses substrates and inhibitors in the active site. Specificity is determined by nearest-neighbor hydrophobic residues surrounding the catalytic aspartates, and by three residues in the flap. The enzymes are mostly secreted from cells as inactive proenzymes that activate autocatalytically at acidic pH. This family of aspartate proteases is classified by MEROPS as the peptidase family A1 (pepsin A, clan AA).
Changed!
pfam Asp 339aa 3e-34
in ref transcript
Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), which are much smaller and appear to be homologous to a single domain of the eukaryotic asp proteases.
Changed!
PTZ PTZ00147 338aa 6e-20
in ref transcript
histoaspartic protease (plasmepsin I or II); Provisional.
Changed!
cd beta_secretase_like 305aa 1e-163
in modified transcript
Changed!
pfam Asp 232aa 5e-29
in modified transcript
Changed!
PTZ PTZ00147 230aa 4e-16
in modified transcript
BAG5
rs.BAG5.F1 rs.BAG5.R1 191 378
NCBIGene 36.3 9529
Alternative 5-prime, size difference: 187
Exclusion of the protein initiation site
Reference transcript: NM_001015049
smart BAG 78aa 1e-14
in ref transcript
BAG domains, present in regulator of Hsp70 proteins. BAG domains, present in Bcl-2-associated athanogene 1 and silencer of death domains.
smart BAG 79aa 5e-14
in ref transcript
smart BAG 76aa 1e-10
in ref transcript
smart BAG 78aa 6e-09
in ref transcript
BANP
rs.BANP.F1 rs.BANP.R1 114 180
NCBIGene 36.3 54971
Single exon skipping, size difference: 66
Exclusion in the protein (no frameshift)
Reference transcript: NM_079837
pfam BEN 74aa 1e-13
in ref transcript
BEN domain. The BEN domain is found in diverse animal proteins such as BANP/SMAR1, NAC1 and the Drosophila mod(mdg4) isoform C, in the chordopoxvirus virosomal protein E5R and in several proteins of polydnaviruses. Computational analysis suggests that the BEN domain mediates protein-DNA and protein-protein interactions during chromatin organisation and transcription.
Changed!
cd Bcl-2_like 114aa 2e-40
in ref transcript
Apoptosis regulator proteins of the Bcl-2 family, named after B-cell lymphoma 2. This alignment model spans what have been described as Bcl-2 homology regions BH1, BH2, BH3, and BH4. Many members of this family have an additional C-terminal transmembrane segment. Some homologous proteins, which are not included in this model, may miss either the BH4 (Bax, Bak) or the BH2 (Bcl-X(S)) region, and some appear to only share the BH3 region (Bik, Bim, Bad, Bid, Egl-1). This family is involved in the regulation of the outer mitochondrial membrane's permeability and in promoting or preventing the release of apoptogenic factors, which in turn may trigger apoptosis by activating caspases. Bcl-2 and the closely related Bcl-X(L) are anti-apoptotic key regulators of programmed cell death. They are assumed to function via heterodimeric protein-protein interactions, binding pro-apoptotic proteins such as Bad (BCL2-antagonist of cell death), Bid, and Bim, by specifically interacting with their BH3 regions. Interfering with this heterodimeric interaction via small-molecule inhibitors may prove effective in targeting various cancers. This family also includes the Caenorhabditis elegans Bcl-2 homolog CED-9, which binds to CED-4, the C. Elegans homolog of mammalian Apaf-1. Apaf-1, however, does not seem to be inhibited by Bcl-2 directly.
Changed!
TIGR bcl-2 233aa 1e-89
in ref transcript
in artificial membranes at acidic pH, proapoptotic Bcl-2 family proteins (including Bax and Bak) probably induce the mitochondrial permeability transition and cytochrome c release by interacting with permeability transition pores, the most important component for pore fomation of which is VDAC.
Changed!
cd Bcl-2_like 45aa 1e-08
in modified transcript
Changed!
TIGR bcl-2 125aa 1e-35
in modified transcript
Changed!
TIGR bcl-2 45aa 6e-09
in modified transcript
BCLAF1
rs.BCLAF1.F1 rs.BCLAF1.R1 161 308
NCBIGene 36.3 9774
Single exon skipping, size difference: 147
Exclusion in the protein (no frameshift)
Reference transcript: NM_014739
BCS1L
rs.BCS1L.F1 rs.BCS1L.R1 116 325
NCBIGene 36.3 617
Single exon skipping, size difference: 209
Exclusion in 5'UTR
Reference transcript: NM_004328
cd AAA 157aa 3e-06
in ref transcript
The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
pfam BCS1_N 168aa 2e-50
in ref transcript
BCS1 N terminal. This domain is found at the N terminal of the mitochondrial ATPase BSC1. It encodes the import and intramitochondrial sorting for the protein.
pfam AAA 175aa 2e-19
in ref transcript
ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes.
COG HflB 405aa 1e-18
in ref transcript
ATP-dependent Zn proteases [Posttranslational modification, protein turnover, chaperones].
BEST3
rs.BEST3.F1 rs.BEST3.R1 323 476
NCBIGene 36.3 144453
Single exon skipping, size difference: 153
Exclusion in the protein (no frameshift)
Reference transcript: NM_032735
Changed!
pfam Bestrophin 369aa 1e-161
in ref transcript
Bestrophin. Bestrophin is a 68-kDa basolateral plasma membrane protein expressed in retinal pigment epithelial cells (RPE). It is encoded by the VMD2 gene, which is mutated in Best macular dystrophy, a disease characterised by a depressed light peak in the electrooculogram. VMD2 encodes a 585-amino acid protein with an approximate mass of 68 kDa which has been designated bestrophin. Bestrophin shares homology with the Caenorhabditis elegans RFP gene family, named for the presence of a conserved arginine (R), phenylalanine (F), proline (P), amino acid sequence motif. Bestrophin is a plasma membrane protein, localised to the basolateral surface of RPE cells consistent with a role for bestrophin in the generation or regulation of the EOG light peak. Bestrophin and other RFP family members represent a new class of chloride channels, indicating a direct role for bestrophin in generating the light peak. The VMD2 gene underlying Best disease was shown to represent the first human member of the RFP-TM protein family. More than 97% of the disease-causing mutations are located in the N-terminal RFP-TM domain implying important functional properties.
Changed!
pfam Bestrophin 318aa 1e-128
in modified transcript
BID
rs.BID.F1 rs.BID.R1 180 250
NCBIGene 36.3 637
Single exon skipping, size difference: 70
Exclusion of the protein initiation site
Reference transcript: NM_001196
Changed!
pfam BID 192aa 1e-76
in ref transcript
BH3 interacting domain (BID). BID is a member of the BCL-2 superfamily of proteins are key regulators of programmed cell death, hence this family is related to pfam00452. BID is a pro-apoptotic member of the Bcl-2 superfamily and as such posses the ability to target intracellular membranes and contains the BH3 death domain. The activity of BID is regulated by a Caspase 8-mediated cleavage event, exposing the BH3 domain and significantly changing the surface charge and hydrophobicity, which causes a change of cellular localisation.
Changed!
pfam BID 96aa 1e-33
in modified transcript
BIRC5
rs.BIRC5.F1 rs.BIRC5.R1 375 444
NCBIGene 36.3 332
Single exon skipping, size difference: 69
Exclusion in the protein (no frameshift)
Reference transcript: NM_001012271
Changed!
cd BIR 95aa 3e-14
in ref transcript
Baculoviral inhibition of apoptosis protein repeat domain; Found in inhibitors of apoptosis proteins (IAPs) and other proteins. In higher eukaryotes, BIR domains inhibit apoptosis by acting as direct inhibitors of the caspase family of protease enzymes. In yeast, BIR domains are involved in regulating cytokinesis. This novel fold is stabilized by zinc tetrahedrally coordinated by one histidine and three cysteine residues and resembles a classical zinc finger.
Changed!
smart BIR 97aa 4e-18
in ref transcript
Baculoviral inhibition of apoptosis protein repeat. Domain found in inhibitor of apoptosis proteins (IAPs) and other proteins. Acts as a direct inhibitor of caspase enzymes.
Changed!
cd BIR 72aa 1e-17
in modified transcript
Changed!
smart BIR 74aa 2e-21
in modified transcript
BIRC7
rs.BIRC7.F1 rs.BIRC7.R1 400 454
NCBIGene 36.3 79444
Alternative 3-prime, size difference: 54
Exclusion in the protein (no frameshift)
Reference transcript: NM_139317
cd BIR 68aa 5e-23
in ref transcript
Baculoviral inhibition of apoptosis protein repeat domain; Found in inhibitors of apoptosis proteins (IAPs) and other proteins. In higher eukaryotes, BIR domains inhibit apoptosis by acting as direct inhibitors of the caspase family of protease enzymes. In yeast, BIR domains are involved in regulating cytokinesis. This novel fold is stabilized by zinc tetrahedrally coordinated by one histidine and three cysteine residues and resembles a classical zinc finger.
Changed!
cd RING 39aa 0.007
in ref transcript
RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
smart BIR 71aa 4e-23
in ref transcript
Baculoviral inhibition of apoptosis protein repeat. Domain found in inhibitor of apoptosis proteins (IAPs) and other proteins. Acts as a direct inhibitor of caspase enzymes.
smart RING 34aa 0.003
in ref transcript
Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
BMP4
rs.BMP4.F1 rs.BMP4.R1 132 341
NCBIGene 36.3 652
Alternative 5-prime, size difference: 209
Exclusion in 5'UTR
Reference transcript: NM_001202
pfam TGFb_propeptide 236aa 9e-62
in ref transcript
TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein.
smart TGFB 101aa 9e-48
in ref transcript
Transforming growth factor-beta (TGF-beta) family. Family members are active as disulphide-linked homo- or heterodimers. TGFB is a multifunctional peptide that controls proliferation, differentiation, and other functions in many cell types.
BPTF
rs.BPTF.F1 rs.BPTF.R1 103 532
NCBIGene 36.3 2186
Alternative 5-prime, size difference: 429
Exclusion in the protein (no frameshift)
Reference transcript: NM_182641
cd Bromo_gcn5_like 101aa 1e-46
in ref transcript
Bromodomain; Gcn5_like subfamily. Gcn5p is a histone acetyltransferase (HAT) which mediates acetylation of histones at lysine residues; such acetylation is generally correlated with the activation of transcription. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
cd BAH_plant_2 27aa 8e-04
in ref transcript
BAH, or Bromo Adjacent Homology domain, plant-specific sub-family with unknown function. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
smart BROMO 102aa 1e-27
in ref transcript
bromo domain.
pfam DDT 61aa 1e-20
in ref transcript
DDT domain. This domain is approximately 60 residues in length, and is predicted to be a DNA binding domain. The DDT domain is named after (DNA binding homeobox and Different Transcription factors). It is exclusively associated with nuclear domains, and is thought to be arranged into three alpha helices.
pfam PHD 45aa 6e-10
in ref transcript
PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
smart PHD 47aa 5e-08
in ref transcript
PHD zinc finger. The plant homeodomain (PHD) finger is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in epigenetics and chromatin-mediated transcriptional regulation. The PHD finger binds two zinc ions using the so-called 'cross-brace' motif and is thus structurally related to the RI NG finger and the FYV E finger. It is not yet known if PHD fingers have a common molecular function. Several reports suggest that it can function as a protein-protein interacton domain and it was recently demonstrated that the PHD finger of p300 can cooperate with the adjacent BR OMO domain in nucleosome binding in vitro. Other reports suggesting that the PHD finger is a ubiquitin ligase have been refuted as these domains were RI NG fingers misidentified as PHD fingers.
COG COG5076 105aa 8e-14
in ref transcript
Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription].
BRCA1
rs.BRCA1.F1 rs.BRCA1.R1 188 311
NCBIGene 36.3 672
Multiple exon skipping, size difference: 123
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_007296
cd RING 46aa 1e-07
in ref transcript
RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
cd BRCT 80aa 2e-04
in ref transcript
Breast Cancer Suppressor Protein (BRCA1), carboxy-terminal domain. The BRCT domain is found within many DNA damage repair and cell cycle checkpoint proteins. The unique diversity of this domain superfamily allows BRCT modules to interact forming homo/hetero BRCT multimers, BRCT-non-BRCT interactions, and interactions within DNA strand breaks.
smart BRCT 84aa 6e-09
in ref transcript
breast cancer carboxy-terminal domain.
smart RING 41aa 9e-09
in ref transcript
Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
COG PEX10 43aa 1e-04
in ref transcript
RING-finger-containing E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones].
BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of pox virus proteins. The BTB/POZ domain mediates homomeric dimerisation and in some instances heteromeric dimerisation. The structure of the dimerised PLZF BTB/POZ domain has been solved and consists of a tightly intertwined homodimer. The central scaffolding of the protein is made up of a cluster of alpha-helices flanked by short beta-sheets at both the top and bottom of the molecule. POZ domains from several zinc finger proteins have been shown to mediate transcriptional repression and to interact with components of histone deacetylase co-repressor complexes including N-CoR and SMRT. The POZ or BTB domain is also known as BR-C/Ttk or ZiN.
pfam BACK 99aa 2e-20
in ref transcript
BTB And C-terminal Kelch. This domain is found associated with pfam00651 and pfam01344. The BACK domain is found juxtaposed to the BTB domain; they are separated by as little as two residues. This family appears to be closely related to the BTB domain (Finn RD, personal observation).
pfam F5_F8_type_C 115aa 4e-06
in ref transcript
F5/8 type C domain. This domain is also known as the discoidin (DS) domain family.
pfam F5_F8_type_C 102aa 1e-04
in ref transcript
BTLA
rs.BTLA.F1 rs.BTLA.R1 108 252
NCBIGene 36.3 151888
Single exon skipping, size difference: 144
Exclusion in the protein (no frameshift)
Reference transcript: NM_181780
cd IG 75aa 7e-04
in ref transcript
Immunoglobulin domain family; members are components of immunoglobulins, neuroglia, cell surface glycoproteins, such as, T-cell receptors, CD2, CD4, CD8, and membrane glycoproteins, such as, butyrophilin and chondroitin sulfate proteoglycan core protein. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
smart IG_like 92aa 7e-06
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
BTN2A2
rs.BTN2A2.F1 rs.BTN2A2.R1 100 448
NCBIGene 36.3 10385
Single exon skipping, size difference: 348
Exclusion in the protein (no frameshift)
Reference transcript: NM_006995
smart SPRY 119aa 3e-27
in ref transcript
Domain in SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
smart PRY 52aa 4e-21
in ref transcript
associated with SPRY domains.
Changed!
pfam V-set 97aa 3e-10
in ref transcript
Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
pfam C2-set_2 82aa 0.007
in ref transcript
CD80-like C2-set immunoglobulin domain. These domains belong to the immunoglobulin superfamily.
BTNL8
rs.BTNL8.F1 rs.BTNL8.R1 133 263
NCBIGene 36.3 79908
Alternative 5-prime, size difference: 130
Inclusion in the protein causing a frameshift
Reference transcript: NM_001040462
Changed!
smart SPRY 123aa 3e-16
in ref transcript
Domain in SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
Changed!
smart PRY 50aa 5e-11
in ref transcript
associated with SPRY domains.
pfam V-set 106aa 6e-07
in ref transcript
Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
C10orf125
rs.C10orf125.F1 rs.C10orf125.R1 397 466
NCBIGene 36.3 282969
Alternative 3-prime, size difference: 69
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001098483
Changed!
pfam RbsD_FucU 139aa 9e-28
in ref transcript
RbsD / FucU transport protein family. The Escherichia coli high-affinity ribose-transport system consists of six proteins encoded by the rbs operon (rbsD, rbsA, rbsC, rbsB, rbsK and rbsR). Of the six components, RbsD is the only one whose function is unknown although it is thought that it somehow plays a critical role in PtsG-mediated ribose transport. This family also includes FucU a protein from the fucose biosynthesis operon that is presumably also involved in fucose transport by similarity to RbsD.
Changed!
COG FucU 145aa 5e-40
in ref transcript
Fucose dissimilation pathway protein FucU [Carbohydrate transport and metabolism].
Changed!
pfam RbsD_FucU 125aa 2e-21
in modified transcript
Changed!
COG FucU 130aa 3e-33
in modified transcript
C10orf30
rs.C10orf30.F1 rs.C10orf30.R1 219 389
NCBIGene 36.3 222389
Single exon skipping, size difference: 170
Exclusion in 5'UTR
Reference transcript: NM_152751
pfam BEN 84aa 2e-09
in ref transcript
BEN domain. The BEN domain is found in diverse animal proteins such as BANP/SMAR1, NAC1 and the Drosophila mod(mdg4) isoform C, in the chordopoxvirus virosomal protein E5R and in several proteins of polydnaviruses. Computational analysis suggests that the BEN domain mediates protein-DNA and protein-protein interactions during chromatin organisation and transcription.
C10orf30
rs.C10orf30.F1 rs.C10orf30.R2 411 542
NCBIGene 36.3 222389
Single exon skipping, size difference: 170
Exclusion in 5'UTR
Reference transcript: NM_152751
pfam BEN 84aa 2e-09
in ref transcript
BEN domain. The BEN domain is found in diverse animal proteins such as BANP/SMAR1, NAC1 and the Drosophila mod(mdg4) isoform C, in the chordopoxvirus virosomal protein E5R and in several proteins of polydnaviruses. Computational analysis suggests that the BEN domain mediates protein-DNA and protein-protein interactions during chromatin organisation and transcription.
C11orf56
rs.C11orf56.F1 rs.C11orf56.R1 151 193
NCBIGene 36.3 84067
Alternative 5-prime, size difference: 42
Exclusion in the protein (no frameshift)
Reference transcript: NM_032127
pfam RAI16-like 331aa 1e-88
in ref transcript
Retinoic acid induced 16-like protein. This is the conserved N-terminal 450 residues of a family of proteins described as retinoic acid-induced protein 16-like proteins. The exact function is not known. The proteins are found from worms to humans.
C12orf42
rs.C12orf42.F1 rs.C12orf42.R1 380 515
NCBIGene 36.3 374470
Alternative 5-prime, size difference: 135
Exclusion in 5'UTR
Reference transcript: NM_001099336
C12orf44
rs.C12orf44.F1 rs.C12orf44.R1 164 335
NCBIGene 36.3 60673
Alternative 5-prime, size difference: 171
Exclusion in 5'UTR
Reference transcript: NM_021934
pfam DUF1649 167aa 1e-56
in ref transcript
Protein of unknown function (DUF1649). This family is made up of sequences derived from hypothetical eukaryotic proteins of unknown function.
C14orf100
rs.C14orf100.F1 rs.C14orf100.R1 131 149
NCBIGene 36.3 51528
Alternative 3-prime, size difference: 18
Exclusion in the protein (no frameshift)
Reference transcript: NM_016475
pfam DUF766 293aa 1e-113
in ref transcript
Protein of unknown function (DUF766). This family consists of several eukaryotic proteins of unknown function.
C14orf104
rs.C14orf104.F1 rs.C14orf104.R1 239 383
NCBIGene 36.3 55172
Single exon skipping, size difference: 144
Exclusion in the protein (no frameshift)
Reference transcript: NM_018139
pfam PIH1 299aa 3e-84
in ref transcript
pre-RNA processing PIH1/Nop17. This domain is involved in pre-rRNA processing. It has has been shown to be required either for nucleolar retention or correct assembly of the box C/D snoRNP in Saccharomyces cerevisiae.
Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_017926
C14orf130
rs.C14orf130.F1 rs.C14orf130.R1 313 447
NCBIGene 36.3 55148
Single exon skipping, size difference: 134
Exclusion in the protein causing a frameshift
Reference transcript: NM_175748
Changed!
pfam zf-UBR 68aa 0.001
in ref transcript
Putative zinc finger in N-recognin (UBR box). This region is found in E3 ubiquitin ligases that recognise N-recognins.
C14orf138
rs.C14orf138.F1 rs.C14orf138.R1 210 315
NCBIGene 36.3 79609
Single exon skipping, size difference: 105
Exclusion in the protein (no frameshift)
Reference transcript: NM_024558
Changed!
pfam Methyltransf_16 172aa 9e-52
in ref transcript
Putative methyltransferase.
COG COG3897 126aa 8e-04
in ref transcript
Predicted methyltransferase [General function prediction only].
Changed!
pfam Methyltransf_16 170aa 2e-51
in modified transcript
C14orf159
rs.C14orf159.F1 rs.C14orf159.R1 100 115
NCBIGene 36.3 80017
Alternative 5-prime, size difference: 15
Exclusion in the protein (no frameshift)
Reference transcript: NM_001102368
Changed!
pfam DUF1445 143aa 2e-57
in ref transcript
Protein of unknown function (DUF1445). This family represents a conserved region approximately 150 residues long within a number of hypothetical bacterial and eukaryotic proteins of unknown function.
Changed!
PRK PRK05463 263aa 7e-46
in ref transcript
hypothetical protein; Provisional.
Changed!
pfam DUF1445 138aa 1e-54
in modified transcript
Changed!
PRK PRK05463 258aa 3e-44
in modified transcript
C14orf159
rs.C14orf159.F2 rs.C14orf159.R2 124 202
NCBIGene 36.3 80017
Single exon skipping, size difference: 78
Exclusion in 5'UTR
Reference transcript: NM_001102366
pfam DUF1445 138aa 1e-54
in ref transcript
Protein of unknown function (DUF1445). This family represents a conserved region approximately 150 residues long within a number of hypothetical bacterial and eukaryotic proteins of unknown function.
PRK PRK05463 258aa 3e-44
in ref transcript
hypothetical protein; Provisional.
C14orf159
rs.C14orf159.F3 rs.C14orf159.R3 167 287
NCBIGene 36.3 80017
Single exon skipping, size difference: 120
Exclusion in the protein (no frameshift)
Reference transcript: NM_001102368
pfam DUF1445 143aa 2e-57
in ref transcript
Protein of unknown function (DUF1445). This family represents a conserved region approximately 150 residues long within a number of hypothetical bacterial and eukaryotic proteins of unknown function.
PRK PRK05463 263aa 7e-46
in ref transcript
hypothetical protein; Provisional.
C14orf159
rs.C14orf159.F4 rs.C14orf159.R4 211 382
NCBIGene 36.3 80017
Multiple exon skipping, size difference: 171
Exclusion in 5'UTR, Exclusion in 5'UTR
Reference transcript: NM_001102366
pfam DUF1445 138aa 1e-54
in ref transcript
Protein of unknown function (DUF1445). This family represents a conserved region approximately 150 residues long within a number of hypothetical bacterial and eukaryotic proteins of unknown function.
PRK PRK05463 258aa 3e-44
in ref transcript
hypothetical protein; Provisional.
C14orf159
rs.C14orf159.F5 rs.C14orf159.R5 109 145
NCBIGene 36.3 80017
Alternative 5-prime, size difference: 36
Exclusion in the protein (no frameshift)
Reference transcript: NM_001102368
Changed!
pfam DUF1445 143aa 2e-57
in ref transcript
Protein of unknown function (DUF1445). This family represents a conserved region approximately 150 residues long within a number of hypothetical bacterial and eukaryotic proteins of unknown function.
Changed!
PRK PRK05463 263aa 7e-46
in ref transcript
hypothetical protein; Provisional.
Changed!
pfam DUF1445 131aa 5e-48
in modified transcript
Changed!
PRK PRK05463 251aa 2e-39
in modified transcript
C14orf159
rs.C14orf159.F6 rs.C14orf159.R6 131 376
NCBIGene 36.3 80017
Alternative 5-prime, size difference: 245
Exclusion in 5'UTR
Reference transcript: NM_001102366
pfam DUF1445 138aa 1e-54
in ref transcript
Protein of unknown function (DUF1445). This family represents a conserved region approximately 150 residues long within a number of hypothetical bacterial and eukaryotic proteins of unknown function.
Inclusion in the protein (no stop codon or frameshift), Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_052873
C16orf13
rs.C16orf13.F1 rs.C16orf13.R1 102 170
NCBIGene 36.3 84326
Single exon skipping, size difference: 68
Inclusion in the protein causing a frameshift
Reference transcript: NM_001040160
Changed!
pfam DUF938 137aa 6e-48
in ref transcript
Protein of unknown function (DUF938). This family consists of several hypothetical proteins from both prokaryotes and eukaryotes. The function of this family is unknown.
Changed!
pfam DUF938 200aa 6e-81
in modified transcript
C16orf35
rs.C16orf35.F1 rs.C16orf35.R1 169 374
NCBIGene 36.3 8131
Multiple exon skipping, size difference: 205
Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
Reference transcript: NM_001077350
Changed!
pfam UPF0171 471aa 1e-161
in ref transcript
Uncharacterised protein family (UPF0171).
Changed!
pfam UPF0171 63aa 1e-16
in modified transcript
C16orf46
rs.C16orf46.F1 rs.C16orf46.R1 318 407
NCBIGene 36.3 123775
Single exon skipping, size difference: 89
Exclusion in 5'UTR
Reference transcript: NM_152337
C17orf62
rs.C17orf62.F1 rs.C17orf62.R1 114 164
NCBIGene 36.3 79415
Single exon skipping, size difference: 50
Exclusion in 5'UTR
Reference transcript: NM_001100407
C17orf62
rs.C17orf62.F2 rs.C17orf62.R2 113 155
NCBIGene 36.3 79415
Single exon skipping, size difference: 42
Exclusion in 5'UTR
Reference transcript: NM_001033046
C17orf63
rs.C17orf63.F1 rs.C17orf63.R1 175 209
NCBIGene 36.3 55731
Single exon skipping, size difference: 34
Exclusion in 5'UTR
Reference transcript: NM_018182
C17orf70
rs.C17orf70.F1 rs.C17orf70.R1 109 305
NCBIGene 36.3 80233
Alternative 5-prime, size difference: 196
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001109760
C17orf72
rs.C17orf72.F1 rs.C17orf72.R1 100 121
NCBIGene 36.3 92340
Alternative 3-prime, size difference: 21
Inclusion in 5'UTR
Reference transcript: XM_001726843
C17orf80
rs.C17orf80.F1 rs.C17orf80.R1 275 383
NCBIGene 36.3 55028
Single exon skipping, size difference: 108
Exclusion in the protein (no frameshift)
Reference transcript: NM_017941
C18orf19
rs.C18orf19.F1 rs.C18orf19.R1 223 363
NCBIGene 36.3 125228
Single exon skipping, size difference: 140
Exclusion in 5'UTR
Reference transcript: NM_001098801
pfam DUF1279 88aa 2e-32
in ref transcript
Protein of unknown function (DUF1279). This family represents the C-terminus (approx. 120 residues) of a number of eukaryotic proteins of unknown function.
C18orf25
rs.C18orf25.F1 rs.C18orf25.R1 202 385
NCBIGene 36.3 147339
Single exon skipping, size difference: 183
Exclusion in the protein (no frameshift)
Reference transcript: NM_145055
C18orf34
rs.C18orf34.F1 rs.C18orf34.R1 171 285
NCBIGene 36.3 374864
Single exon skipping, size difference: 114
Exclusion in the protein (no frameshift)
Reference transcript: NM_001105528
COG Smc 233aa 0.005
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
C18orf37
rs.C18orf37.F1 rs.C18orf37.R1 333 441
NCBIGene 36.3 125476
Multiple exon skipping, size difference: 108
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_001098817
pfam YL1_C 30aa 1e-06
in ref transcript
YL1 nuclear protein C-terminal domain. This domain is found in proteins of the YL1 family. These proteins have been shown to be DNA-binding and may be a transcription factor. This domain is found in proteins that are not YL1 proteins.
COG COG5195 88aa 7e-14
in ref transcript
Uncharacterized conserved protein [Function unknown].
C19orf28
rs.C19orf28.F1 rs.C19orf28.R1 193 395
NCBIGene 36.3 126321
Single exon skipping, size difference: 202
Exclusion of the stop codon
Reference transcript: NM_021731
TIGR gph 331aa 5e-08
in ref transcript
GPH:cation symporters catalyze uptake of sugars in symport with a monovalent cation (H+ or Na+). Members of this family includes transporters for melibiose, lactose, raffinose, glucuronides, pentosides and isoprimeverose. Mutants of two groups of these symporters (the melibiose permeases of enteric bacteria, and the lactose permease of Streptococcus thermophilus) have been isolated in which altered cation specificity is observed or in which sugar transport is uncoupled from cation symport (i.e., uniport is catalyzed). The various members of the family can use Na+, H+ or Li, Na+ or Li+, H+ or Li+, or only H+ as the symported cation. All of these proteins possess twelve putative transmembrane a-helical spanners.
COG MelB 411aa 1e-15
in ref transcript
Na+/melibiose symporter and related transporters [Carbohydrate transport and metabolism].
C19orf60
rs.C19orf60.F1 rs.C19orf60.R1 181 247
NCBIGene 36.3 55049
Alternative 3-prime, size difference: 66
Exclusion in the protein (no frameshift)
Reference transcript: NM_001100418
C19orf62
rs.C19orf62.F1 rs.C19orf62.R1 140 178
NCBIGene 36.3 29086
Alternative 3-prime, size difference: 38
Inclusion in 5'UTR
Reference transcript: NM_014173
TIGR acidobact_VWFA 74aa 0.004
in ref transcript
Members of this family are bacterial domains that include a region related to the von Willebrand factor type A (VWFA) domain (pfam00092). These domains are restricted to, and have undergone a large paralogous family expansion in, the Acidobacteria, including Solibacter usitatus and Acidobacterium capsulatum ATCC 51196.
C1D
rs.C1D.F1 rs.C1D.R1 100 113
NCBIGene 36.3 10438
Alternative 5-prime, size difference: 13
Exclusion in 5'UTR
Reference transcript: NM_006333
pfam Sas10_Utp3 80aa 4e-14
in ref transcript
Sas10/Utp3/C1D family. This family contains Utp3 and LCP5 which are components of the U3 ribonucleoprotein complex. It also includes the human C1D protein and Saccharomyces cerevisiae YHR081W (rrp47), an exosome-associated protein required for the 3' processing of stable RNAs, and Sas10 which has been identified as a regulator of chromatin silencing. This family also includes the human protein Neuroguidin an initiation factor 4E (eIF4E) binding protein.
C1orf116
rs.C1orf116.F1 rs.C1orf116.R1 129 315
NCBIGene 36.3 79098
Single exon skipping, size difference: 186
Exclusion of the protein initiation site
Reference transcript: NM_023938
C1orf43
rs.C1orf43.F1 rs.C1orf43.R1 291 393
NCBIGene 36.3 25912
Single exon skipping, size difference: 102
Exclusion in the protein (no frameshift)
Reference transcript: NM_001098616
Changed!
pfam NICE-3 189aa 1e-85
in ref transcript
NICE-3 protein. This family consists of several eukaryotic NICE-3 and related proteins. The gene coding for NICE-3 is part of the epidermal differentiation complex (EDC) which comprises a large number of genes that are of crucial importance for the maturation of the human epidermis. The function of NICE-3 is unknown.
Changed!
pfam NICE-3 155aa 6e-59
in modified transcript
C20orf71
rs.C20orf71.F1 rs.C20orf71.R1 309 417
NCBIGene 36.3 128861
Single exon skipping, size difference: 108
Exclusion in the protein (no frameshift)
Reference transcript: NM_178466
Changed!
pfam LBP_BPI_CETP 156aa 1e-15
in ref transcript
LBP / BPI / CETP family, N-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar.
Changed!
pfam LBP_BPI_CETP 79aa 2e-10
in modified transcript
C21orf33
rs.C21orf33.F1 rs.C21orf33.R1 205 298
NCBIGene 36.3 8209
Single exon skipping, size difference: 93
Exclusion in the protein (no frameshift)
Reference transcript: NM_004649
Changed!
cd GATase1_ES1 222aa 6e-89
in ref transcript
Type 1 glutamine amidotransferase (GATase1)-like domain found in zebrafish ES1. This group includes, proteins similar to ES1, Escherichia coli enhancing lycopene biosynthesis protein 2, Azospirillum brasilense iaaC and, human HES1. The catalytic triad typical of GATase1domains is not conserved in this GATase1-like domain. However, in common with GATase1domains a reactive cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. Zebrafish ES1 is expressed specifically in adult photoreceptor cells and appears to be a cytoplasmic protein. A. brasilense iaaC is involved in controlling IAA biosynthesis.
Changed!
pfam DJ-1_PfpI 168aa 2e-14
in ref transcript
DJ-1/PfpI family. The family includes the protease PfpI. This domain is also found in transcriptional regulators.
Changed!
PRK PRK11780 224aa 1e-83
in ref transcript
isoprenoid biosynthesis protein with amidotransferase-like domain; Provisional.
Changed!
cd GATase1_ES1 191aa 4e-69
in modified transcript
Changed!
pfam DJ-1_PfpI 137aa 7e-06
in modified transcript
Changed!
PRK PRK11780 193aa 6e-64
in modified transcript
C21orf34
rs.C21orf34.F1 rs.C21orf34.R1 240 283
NCBIGene 36.3 388815
Single exon skipping, size difference: 115
Inclusion in 3'UTR
Reference transcript: NM_001005734
C21orf91
rs.C21orf91.F1 rs.C21orf91.R1 209 272
NCBIGene 36.3 54149
Single exon skipping, size difference: 63
Exclusion in the protein (no frameshift)
Reference transcript: NM_001100420
Changed!
pfam EURL 291aa 1e-145
in ref transcript
EURL protein. This family consists of several animal EURL proteins. EURL is preferentially expressed in chick retinal precursor cells as well as in the anterior epithelial cells of the lens at early stages of development. EURL transcripts are found primarily in the peripheral dorsal retina, i.e., the most undifferentiated part of the dorsal retina. EURL transcripts are also detected in the lens at stage 18 and remain abundant in the proliferating epithelial cells of the lens until at least day 11. The distribution pattern of EURL in the developing retina and lens suggest a role before the events leading to cell determination and differentiation.
Changed!
pfam EURL 221aa 1e-108
in modified transcript
C2orf56
rs.C2orf56.F1 rs.C2orf56.R1 349 430
NCBIGene 36.3 55471
Single exon skipping, size difference: 81
Exclusion in the protein (no frameshift)
Reference transcript: NM_144736
pfam DUF185 247aa 5e-33
in ref transcript
Uncharacterized ACR, COG1565. This family contains several uncharacterised proteins. One member has been described as an ATP synthase beta subunit transcription termination factor rho protein.
Changed!
COG COG1565 357aa 4e-61
in ref transcript
Uncharacterized conserved protein [Function unknown].
Changed!
COG COG1565 330aa 1e-44
in modified transcript
C5orf5
rs.C5orf5.F1 rs.C5orf5.R1 225 291
NCBIGene 36.3 51306
Single exon skipping, size difference: 66
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_016603
cd RhoGAP_FAM13A1a 185aa 3e-78
in ref transcript
RhoGAP_FAM13A1a: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of FAM13A1, isoform a-like proteins. The function of FAM13A1a is unknown. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by up several orders of magnitude.
pfam RhoGAP 152aa 3e-37
in ref transcript
RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases.
C5orf5
rs.C5orf5.F2 rs.C5orf5.R2 302 494
NCBIGene 36.3 51306
Single exon skipping, size difference: 192
Exclusion of the protein initiation site
Reference transcript: NM_016603
Changed!
cd RhoGAP_FAM13A1a 185aa 3e-78
in ref transcript
RhoGAP_FAM13A1a: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of FAM13A1, isoform a-like proteins. The function of FAM13A1a is unknown. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by up several orders of magnitude.
Changed!
pfam RhoGAP 152aa 3e-37
in ref transcript
RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases.
Changed!
cd RhoGAP_FAM13A1a 85aa 4e-32
in modified transcript
Changed!
pfam RhoGAP 67aa 2e-12
in modified transcript
C5orf5
rs.C5orf5.F3 rs.C5orf5.R3 293 377
NCBIGene 36.3 51306
Single exon skipping, size difference: 84
Exclusion in the protein (no frameshift)
Reference transcript: NM_016603
cd RhoGAP_FAM13A1a 185aa 3e-78
in ref transcript
RhoGAP_FAM13A1a: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of FAM13A1, isoform a-like proteins. The function of FAM13A1a is unknown. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by up several orders of magnitude.
pfam RhoGAP 152aa 3e-37
in ref transcript
RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases.
C6orf162
rs.C6orf162.F1 rs.C6orf162.R1 101 122
NCBIGene 36.3 57150
Single exon skipping, size difference: 21
Exclusion in 5'UTR
Reference transcript: NM_001042493
C6orf25
rs.C6orf25.F1 rs.C6orf25.R1 107 126
NCBIGene 36.3 80739
Alternative 3-prime, size difference: 19
Inclusion in the protein causing a frameshift
Reference transcript: NM_138272
C6orf60
rs.C6orf60.F1 rs.C6orf60.R1 174 321
NCBIGene 36.3 79632
Single exon skipping, size difference: 147
Exclusion in the protein (no frameshift)
Reference transcript: NM_024581
TIGR SMC_prok_A 303aa 7e-06
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
Changed!
TIGR SMC_prok_B 226aa 2e-04
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
COG Smc 356aa 3e-04
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
C8orf59
rs.C8orf59.F1 rs.C8orf59.R1 136 223
NCBIGene 36.3 401466
Alternative 5-prime, size difference: 87
Exclusion in 5'UTR
Reference transcript: NM_001099670
C8orf79
rs.C8orf79.F1 rs.C8orf79.R1 278 342
NCBIGene 36.3 57604
Alternative 5-prime, size difference: 64
Exclusion in the protein causing a frameshift
Reference transcript: NM_020844
Changed!
cd AdoMet_MTases 92aa 6e-10
in ref transcript
S-adenosylmethionine-dependent methyltransferases (SAM or AdoMet-MTase), class I; AdoMet-MTases are enzymes that use S-adenosyl-L-methionine (SAM or AdoMet) as a substrate for methyltransfer, creating the product S-adenosyl-L-homocysteine (AdoHcy). There are at least five structurally distinct families of AdoMet-MTases, class I being the largest and most diverse. Within this class enzymes can be classified by different substrate specificities (small molecules, lipids, nucleic acids, etc.) and different target atoms for methylation (nitrogen, oxygen, carbon, sulfur, etc.).
Changed!
pfam Methyltransf_11 90aa 1e-18
in ref transcript
Methyltransferase domain. Members of this family are SAM dependent methyltransferases.
Changed!
COG UbiE 139aa 3e-19
in ref transcript
Methylase involved in ubiquinone/menaquinone biosynthesis [Coenzyme metabolism].
Changed!
pfam Methyltransf_11 38aa 0.006
in modified transcript
Changed!
COG UbiE 95aa 1e-06
in modified transcript
C9orf116
rs.C9orf116.F1 rs.C9orf116.R1 184 230
NCBIGene 36.3 138162
Alternative 3-prime, size difference: 46
Exclusion in the protein causing a frameshift
Reference transcript: NM_001048265
C9orf127
rs.C9orf127.F1 rs.C9orf127.R1 237 309
NCBIGene 36.3 51754
Alternative 5-prime, size difference: 72
Inclusion in 5'UTR
Reference transcript: NM_001042589
C9orf131
rs.C9orf131.F1 rs.C9orf131.R1 249 324
NCBIGene 36.3 138724
Single exon skipping, size difference: 75
Exclusion in the protein (no frameshift)
Reference transcript: NM_001040412
C9orf5
rs.C9orf5.F1 rs.C9orf5.R1 147 171
NCBIGene 36.3 23731
Single exon skipping, size difference: 24
Exclusion in the protein (no frameshift)
Reference transcript: NM_032012
COG PerM 163aa 0.009
in ref transcript
Predicted permease [General function prediction only].
CADM1
rs.CADM1.F1 rs.CADM1.R1 222 306
NCBIGene 36.3 23705
Single exon skipping, size difference: 84
Exclusion in the protein (no frameshift)
Reference transcript: NM_014333
cd IGcam 71aa 1e-07
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IG 72aa 0.004
in ref transcript
Immunoglobulin domain family; members are components of immunoglobulins, neuroglia, cell surface glycoproteins, such as, T-cell receptors, CD2, CD4, CD8, and membrane glycoproteins, such as, butyrophilin and chondroitin sulfate proteoglycan core protein. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
smart IG_like 80aa 3e-10
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
pfam C2-set_2 70aa 3e-08
in ref transcript
CD80-like C2-set immunoglobulin domain. These domains belong to the immunoglobulin superfamily.
smart IG_like 92aa 9e-08
in ref transcript
CADPS2
rs.CADPS2.F1 rs.CADPS2.R1 406 535
NCBIGene 36.3 93664
Single exon skipping, size difference: 120
Exclusion in the protein (no frameshift)
Reference transcript: NM_017954
cd PH_CADPS 110aa 2e-52
in ref transcript
CADPS (Ca2+-dependent activator protein) Pleckstrin homology (PH) domain. CADPS is a calcium-dependent activator involved in secretion. It contains a central PH domain that binds to phosphoinositide 4,5 bisphosphate containing liposomes. However, membrane association may also be mediated by binding to phosphatidlyserine via general electrostatic interactions. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
pfam DUF1041 95aa 1e-13
in ref transcript
Domain of Unknown Function (DUF1041). This family consists of several eukaryotic domains of unknown function. Members of this family are often found in tandem repeats and co-occur with pfam00168, pfam00130 and pfam00169 domains.
smart PH 102aa 2e-09
in ref transcript
Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
CALCA
rs.CALCA.F1 rs.CALCA.R1 104 128
NCBIGene 36.3 796
Alternative 5-prime, size difference: 24
Exclusion in 5'UTR
Reference transcript: NM_001033952
smart CALCITONIN 38aa 8e-14
in ref transcript
calcitonin. This family is formed by calcitonin, the calcitonin gene-related peptide, and amylin. They are short polypeptide hormones.
CAMK2A
rs.CAMK2A.F1 rs.CAMK2A.R1 114 147
NCBIGene 36.3 815
Single exon skipping, size difference: 33
Exclusion in the protein (no frameshift)
Reference transcript: NM_015981
cd S_TKc 260aa 1e-81
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
smart S_TKc 249aa 2e-84
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
pfam CaMKII_AD 128aa 8e-62
in ref transcript
Calcium/calmodulin dependent protein kinase II Association. This domain is found at the C-terminus of the Calcium/calmodulin dependent protein kinases II (CaMKII). These proteins also have a Ser/Thr protein kinase domain (pfam00069) at their N-terminus. The function of the CaMKII association domain is the assembly of the single proteins into large (8 to 14 subunits) multimers.
PTZ PTZ00263 261aa 3e-41
in ref transcript
protein kinase A catalytic subunit; Provisional.
CAMK2B
rs.CAMK2B.F1 rs.CAMK2B.R1 117 162
NCBIGene 36.3 816
Single exon skipping, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_001220
cd S_TKc 260aa 2e-82
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
smart S_TKc 249aa 1e-85
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
pfam CaMKII_AD 128aa 1e-58
in ref transcript
Calcium/calmodulin dependent protein kinase II Association. This domain is found at the C-terminus of the Calcium/calmodulin dependent protein kinases II (CaMKII). These proteins also have a Ser/Thr protein kinase domain (pfam00069) at their N-terminus. The function of the CaMKII association domain is the assembly of the single proteins into large (8 to 14 subunits) multimers.
PTZ PTZ00263 249aa 6e-40
in ref transcript
protein kinase A catalytic subunit; Provisional.
CAMK2G
rs.CAMK2G.F1 rs.CAMK2G.R1 111 153
NCBIGene 36.3 818
Alternative 5-prime, size difference: 42
Exclusion in the protein (no frameshift)
Reference transcript: NM_172171
cd S_TKc 260aa 5e-84
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
smart S_TKc 249aa 2e-86
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
pfam CaMKII_AD 128aa 5e-59
in ref transcript
Calcium/calmodulin dependent protein kinase II Association. This domain is found at the C-terminus of the Calcium/calmodulin dependent protein kinases II (CaMKII). These proteins also have a Ser/Thr protein kinase domain (pfam00069) at their N-terminus. The function of the CaMKII association domain is the assembly of the single proteins into large (8 to 14 subunits) multimers.
PTZ PTZ00263 249aa 3e-43
in ref transcript
protein kinase A catalytic subunit; Provisional.
CAP1
rs.CAP1.F1 rs.CAP1.R1 104 122
NCBIGene 36.3 10487
Alternative 5-prime, size difference: 18
Exclusion in 5'UTR
Reference transcript: NM_006367
pfam CAP_N 302aa 1e-109
in ref transcript
Adenylate cyclase associated (CAP) N terminal.
pfam CAP_C 156aa 5e-71
in ref transcript
DE Adenylate cyclase associated (CAP) C terminal.
CAPN3
rs.CAPN3.F1 rs.CAPN3.R1 104 122
NCBIGene 36.3 825
Single exon skipping, size difference: 18
Exclusion in the protein (no frameshift)
Reference transcript: NM_000070
cd CysPc 353aa 1e-106
in ref transcript
Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. Functions in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction.
cd Calpain_III 158aa 1e-54
in ref transcript
Calpain, subdomain III. Calpains are calcium-activated cytoplasmic cysteine proteinases, participate in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction. Catalytic domain and the two calmodulin-like domains are separated by C2-like domain III. Domain III plays an important role in calcium-induced activation of calpain involving electrostatic interactions with subdomain II. Proposed to mediate calpain's interaction with phospholipids and translocation to cytoplasmic/nuclear membranes. CD includes subdomain III of typical and atypical calpains.
cd EFh 55aa 2e-06
in ref transcript
EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
cd EFh 68aa 0.004
in ref transcript
cd EFh 56aa 0.008
in ref transcript
pfam Peptidase_C2 343aa 1e-158
in ref transcript
Calpain family cysteine protease.
pfam Calpain_III 155aa 8e-65
in ref transcript
Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated by calcium ions.
COG FRQ1 99aa 6e-06
in ref transcript
Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].
CAPN3
rs.CAPN3.F2 rs.CAPN3.R2 123 401
NCBIGene 36.3 825
Single exon skipping, size difference: 278
Inclusion in the protein causing a new stop codon
Reference transcript: NM_000070
Changed!
cd CysPc 353aa 1e-106
in ref transcript
Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. Functions in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction.
Changed!
cd Calpain_III 158aa 1e-54
in ref transcript
Calpain, subdomain III. Calpains are calcium-activated cytoplasmic cysteine proteinases, participate in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction. Catalytic domain and the two calmodulin-like domains are separated by C2-like domain III. Domain III plays an important role in calcium-induced activation of calpain involving electrostatic interactions with subdomain II. Proposed to mediate calpain's interaction with phospholipids and translocation to cytoplasmic/nuclear membranes. CD includes subdomain III of typical and atypical calpains.
Changed!
cd EFh 55aa 2e-06
in ref transcript
EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
Changed!
cd EFh 68aa 0.004
in ref transcript
Changed!
cd EFh 56aa 0.008
in ref transcript
Changed!
pfam Peptidase_C2 343aa 1e-158
in ref transcript
Calpain family cysteine protease.
Changed!
pfam Calpain_III 155aa 8e-65
in ref transcript
Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated by calcium ions.
Changed!
COG FRQ1 99aa 6e-06
in ref transcript
Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].
Changed!
cd CysPc 99aa 4e-15
in modified transcript
Changed!
smart CysPc 101aa 3e-27
in modified transcript
Calpain-like thiol protease family. Calpain-like thiol protease family (peptidase family C2). Calcium activated neutral protease (large subunit).
CAPN3
rs.CAPN3.F3 rs.CAPN3.R3 103 121
NCBIGene 36.3 825
Single exon skipping, size difference: 18
Exclusion in 5'UTR
Reference transcript: NM_173090
cd EFh 55aa 6e-06
in ref transcript
EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
COG FRQ1 91aa 2e-06
in ref transcript
Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].
CAPN3
rs.CAPN3.F4 rs.CAPN3.R4 188 332
NCBIGene 36.3 825
Single exon skipping, size difference: 144
Exclusion in the protein (no frameshift)
Reference transcript: NM_000070
Changed!
cd CysPc 353aa 1e-106
in ref transcript
Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. Functions in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction.
cd Calpain_III 158aa 1e-54
in ref transcript
Calpain, subdomain III. Calpains are calcium-activated cytoplasmic cysteine proteinases, participate in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction. Catalytic domain and the two calmodulin-like domains are separated by C2-like domain III. Domain III plays an important role in calcium-induced activation of calpain involving electrostatic interactions with subdomain II. Proposed to mediate calpain's interaction with phospholipids and translocation to cytoplasmic/nuclear membranes. CD includes subdomain III of typical and atypical calpains.
cd EFh 55aa 2e-06
in ref transcript
EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
cd EFh 68aa 0.004
in ref transcript
cd EFh 56aa 0.008
in ref transcript
Changed!
pfam Peptidase_C2 343aa 1e-158
in ref transcript
Calpain family cysteine protease.
pfam Calpain_III 155aa 8e-65
in ref transcript
Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated by calcium ions.
COG FRQ1 99aa 6e-06
in ref transcript
Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].
Changed!
cd CysPc 305aa 1e-111
in modified transcript
Changed!
pfam Peptidase_C2 295aa 1e-165
in modified transcript
CASC1
rs.CASC1.F1 rs.CASC1.R1 286 348
NCBIGene 36.3 55259
Alternative 3-prime, size difference: 62
Exclusion in the protein causing a frameshift
Reference transcript: NM_018272
CASC4
rs.CASC4.F1 rs.CASC4.R1 369 537
NCBIGene 36.3 113201
Single exon skipping, size difference: 168
Exclusion in the protein (no frameshift)
Reference transcript: NM_138423
TIGR SMC_prok_B 159aa 3e-06
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
COG Smc 157aa 9e-05
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
COG FlaD 98aa 0.003
in ref transcript
Putative archaeal flagellar protein D/E [Cell motility and secretion].
CASP1
rs.CASP1.F1 rs.CASP1.R1 333 477
NCBIGene 36.3 834
Single exon skipping, size difference: 144
Exclusion in the protein (no frameshift)
Reference transcript: NM_033292
Changed!
cd CASc 250aa 1e-72
in ref transcript
Caspase, interleukin-1 beta converting enzyme (ICE) homologues; Cysteine-dependent aspartate-directed proteases that mediate programmed cell death (apoptosis). Caspases are synthesized as inactive zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologs.
Changed!
smart CASc 250aa 5e-94
in ref transcript
Caspase, interleukin-1 beta converting enzyme (ICE) homologues. Cysteine aspartases that mediate programmed cell death (apoptosis). Caspases are synthesised as zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologues.
pfam CARD 88aa 5e-20
in ref transcript
Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold.
Changed!
cd CASc 202aa 7e-58
in modified transcript
Changed!
smart CASc 202aa 6e-76
in modified transcript
CASP10
rs.CASP10.F1 rs.CASP10.R1 165 294
NCBIGene 36.3 843
Multiple exon skipping, size difference: 129
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_032977
cd CASc 240aa 1e-82
in ref transcript
Caspase, interleukin-1 beta converting enzyme (ICE) homologues; Cysteine-dependent aspartate-directed proteases that mediate programmed cell death (apoptosis). Caspases are synthesized as inactive zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologs.
cd DED 79aa 2e-12
in ref transcript
Death effector domain. DED is part of a superfamily of death domains which also includes death-domain (DD) and caspase recruitment domain (CARD). Protein-protein interactions involving these domains occur through homotypic interactions, such as DED-DED. Caspases are the primary executioners of apoptosis via proteolytic cascades, and upstream caspases such as caspase-8 and caspase-9 are activated by signaling complexes such as the death inducing signaling complex (DISC) and the apoptosome. Binding of caspases to specific adaptor molecules via DED or CARD domains leads to autoactivation of caspases.
cd DED 74aa 1e-11
in ref transcript
smart CASc 239aa 4e-97
in ref transcript
Caspase, interleukin-1 beta converting enzyme (ICE) homologues. Cysteine aspartases that mediate programmed cell death (apoptosis). Caspases are synthesised as zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologues.
pfam DED 83aa 7e-17
in ref transcript
Death effector domain.
pfam DED 77aa 1e-15
in ref transcript
CASP2
rs.CASP2.F1 rs.CASP2.R1 296 514
NCBIGene 36.3 835
Exon skipping and alternative 3-prime or 5-prime, size difference: 218
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_032982
Changed!
cd CASc 256aa 3e-74
in ref transcript
Caspase, interleukin-1 beta converting enzyme (ICE) homologues; Cysteine-dependent aspartate-directed proteases that mediate programmed cell death (apoptosis). Caspases are synthesized as inactive zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologs.
Changed!
smart CASc 256aa 9e-86
in ref transcript
Caspase, interleukin-1 beta converting enzyme (ICE) homologues. Cysteine aspartases that mediate programmed cell death (apoptosis). Caspases are synthesised as zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologues.
Changed!
smart CARD 89aa 9e-16
in ref transcript
Caspase recruitment domain. Motif contained in proteins involved in apoptotic signalling. Mediates homodimerisation. Structure consists of six antiparallel helices arranged in a topology homologue to the DEATH and the DED domain.
Changed!
smart CARD 75aa 6e-13
in modified transcript
CASP3
rs.CASP3.F1 rs.CASP3.R1 119 286
NCBIGene 36.3 836
Single exon skipping, size difference: 167
Exclusion in 5'UTR
Reference transcript: NM_004346
cd CASc 239aa 3e-86
in ref transcript
Caspase, interleukin-1 beta converting enzyme (ICE) homologues; Cysteine-dependent aspartate-directed proteases that mediate programmed cell death (apoptosis). Caspases are synthesized as inactive zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologs.
smart CASc 241aa 3e-97
in ref transcript
Caspase, interleukin-1 beta converting enzyme (ICE) homologues. Cysteine aspartases that mediate programmed cell death (apoptosis). Caspases are synthesised as zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologues.
CASP4
rs.CASP4.F1 rs.CASP4.R1 110 209
NCBIGene 36.3 837
Single exon skipping, size difference: 99
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001225
Changed!
cd CASc 250aa 3e-69
in ref transcript
Caspase, interleukin-1 beta converting enzyme (ICE) homologues; Cysteine-dependent aspartate-directed proteases that mediate programmed cell death (apoptosis). Caspases are synthesized as inactive zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologs.
Changed!
smart CASc 250aa 2e-93
in ref transcript
Caspase, interleukin-1 beta converting enzyme (ICE) homologues. Cysteine aspartases that mediate programmed cell death (apoptosis). Caspases are synthesised as zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologues.
Changed!
smart CARD 89aa 2e-10
in ref transcript
Caspase recruitment domain. Motif contained in proteins involved in apoptotic signalling. Mediates homodimerisation. Structure consists of six antiparallel helices arranged in a topology homologue to the DEATH and the DED domain.
CASP7
rs.CASP7.F1 rs.CASP7.R1 111 144
NCBIGene 36.3 840
Alternative 3-prime, size difference: 33
Exclusion of the protein initiation site
Reference transcript: NM_033338
cd CASc 242aa 4e-84
in ref transcript
Caspase, interleukin-1 beta converting enzyme (ICE) homologues; Cysteine-dependent aspartate-directed proteases that mediate programmed cell death (apoptosis). Caspases are synthesized as inactive zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologs.
smart CASc 243aa 2e-90
in ref transcript
Caspase, interleukin-1 beta converting enzyme (ICE) homologues. Cysteine aspartases that mediate programmed cell death (apoptosis). Caspases are synthesised as zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologues.
CASP8
rs.CASP8.F1 rs.CASP8.R1 149 175
NCBIGene 36.3 841
Alternative 3-prime, size difference: 26
Exclusion in 5'UTR
Reference transcript: NM_001228
cd CASc 253aa 2e-84
in ref transcript
Caspase, interleukin-1 beta converting enzyme (ICE) homologues; Cysteine-dependent aspartate-directed proteases that mediate programmed cell death (apoptosis). Caspases are synthesized as inactive zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologs.
cd DED 75aa 6e-17
in ref transcript
Death effector domain. DED is part of a superfamily of death domains which also includes death-domain (DD) and caspase recruitment domain (CARD). Protein-protein interactions involving these domains occur through homotypic interactions, such as DED-DED. Caspases are the primary executioners of apoptosis via proteolytic cascades, and upstream caspases such as caspase-8 and caspase-9 are activated by signaling complexes such as the death inducing signaling complex (DISC) and the apoptosome. Binding of caspases to specific adaptor molecules via DED or CARD domains leads to autoactivation of caspases.
cd DED 75aa 2e-11
in ref transcript
smart CASc 252aa 4e-89
in ref transcript
Caspase, interleukin-1 beta converting enzyme (ICE) homologues. Cysteine aspartases that mediate programmed cell death (apoptosis). Caspases are synthesised as zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologues.
pfam DED 84aa 8e-23
in ref transcript
Death effector domain.
pfam DED 81aa 1e-16
in ref transcript
CASP8
rs.CASP8.F2 rs.CASP8.R2 107 175
NCBIGene 36.3 841
Alternative 5-prime, size difference: 68
Exclusion in 5'UTR
Reference transcript: NM_001228
cd CASc 253aa 2e-84
in ref transcript
Caspase, interleukin-1 beta converting enzyme (ICE) homologues; Cysteine-dependent aspartate-directed proteases that mediate programmed cell death (apoptosis). Caspases are synthesized as inactive zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologs.
cd DED 75aa 6e-17
in ref transcript
Death effector domain. DED is part of a superfamily of death domains which also includes death-domain (DD) and caspase recruitment domain (CARD). Protein-protein interactions involving these domains occur through homotypic interactions, such as DED-DED. Caspases are the primary executioners of apoptosis via proteolytic cascades, and upstream caspases such as caspase-8 and caspase-9 are activated by signaling complexes such as the death inducing signaling complex (DISC) and the apoptosome. Binding of caspases to specific adaptor molecules via DED or CARD domains leads to autoactivation of caspases.
cd DED 75aa 2e-11
in ref transcript
smart CASc 252aa 4e-89
in ref transcript
Caspase, interleukin-1 beta converting enzyme (ICE) homologues. Cysteine aspartases that mediate programmed cell death (apoptosis). Caspases are synthesised as zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologues.
pfam DED 84aa 8e-23
in ref transcript
Death effector domain.
pfam DED 81aa 1e-16
in ref transcript
CASP8
rs.CASP8.F3 rs.CASP8.R3 147 212
NCBIGene 36.3 841
Single exon skipping, size difference: 65
Exclusion in the protein causing a frameshift
Reference transcript: NM_001080125
Changed!
cd CASc 253aa 3e-85
in ref transcript
Caspase, interleukin-1 beta converting enzyme (ICE) homologues; Cysteine-dependent aspartate-directed proteases that mediate programmed cell death (apoptosis). Caspases are synthesized as inactive zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologs.
cd DED 75aa 3e-17
in ref transcript
Death effector domain. DED is part of a superfamily of death domains which also includes death-domain (DD) and caspase recruitment domain (CARD). Protein-protein interactions involving these domains occur through homotypic interactions, such as DED-DED. Caspases are the primary executioners of apoptosis via proteolytic cascades, and upstream caspases such as caspase-8 and caspase-9 are activated by signaling complexes such as the death inducing signaling complex (DISC) and the apoptosome. Binding of caspases to specific adaptor molecules via DED or CARD domains leads to autoactivation of caspases.
cd DED 78aa 7e-13
in ref transcript
Changed!
smart CASc 252aa 4e-90
in ref transcript
Caspase, interleukin-1 beta converting enzyme (ICE) homologues. Cysteine aspartases that mediate programmed cell death (apoptosis). Caspases are synthesised as zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologues.
pfam DED 84aa 5e-23
in ref transcript
Death effector domain.
pfam DED 82aa 2e-17
in ref transcript
CASP8
rs.CASP8.F4 rs.CASP8.R4 288 333
NCBIGene 36.3 841
Single exon skipping, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_001080125
cd CASc 253aa 3e-85
in ref transcript
Caspase, interleukin-1 beta converting enzyme (ICE) homologues; Cysteine-dependent aspartate-directed proteases that mediate programmed cell death (apoptosis). Caspases are synthesized as inactive zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologs.
cd DED 75aa 3e-17
in ref transcript
Death effector domain. DED is part of a superfamily of death domains which also includes death-domain (DD) and caspase recruitment domain (CARD). Protein-protein interactions involving these domains occur through homotypic interactions, such as DED-DED. Caspases are the primary executioners of apoptosis via proteolytic cascades, and upstream caspases such as caspase-8 and caspase-9 are activated by signaling complexes such as the death inducing signaling complex (DISC) and the apoptosome. Binding of caspases to specific adaptor molecules via DED or CARD domains leads to autoactivation of caspases.
cd DED 78aa 7e-13
in ref transcript
smart CASc 252aa 4e-90
in ref transcript
Caspase, interleukin-1 beta converting enzyme (ICE) homologues. Cysteine aspartases that mediate programmed cell death (apoptosis). Caspases are synthesised as zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologues.
pfam DED 84aa 5e-23
in ref transcript
Death effector domain.
pfam DED 82aa 2e-17
in ref transcript
CASP9
rs.CASP9.F1 rs.CASP9.R1 100 550
NCBIGene 36.3 842
Multiple exon skipping, size difference: 450
Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift), Exclusion in the protein causing a frameshift
Reference transcript: NM_001229
Changed!
cd CASc 263aa 5e-78
in ref transcript
Caspase, interleukin-1 beta converting enzyme (ICE) homologues; Cysteine-dependent aspartate-directed proteases that mediate programmed cell death (apoptosis). Caspases are synthesized as inactive zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologs.
Changed!
smart CASc 261aa 6e-81
in ref transcript
Caspase, interleukin-1 beta converting enzyme (ICE) homologues. Cysteine aspartases that mediate programmed cell death (apoptosis). Caspases are synthesised as zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologues.
smart CARD 91aa 7e-16
in ref transcript
Caspase recruitment domain. Motif contained in proteins involved in apoptotic signalling. Mediates homodimerisation. Structure consists of six antiparallel helices arranged in a topology homologue to the DEATH and the DED domain.
Changed!
cd CASc 125aa 2e-26
in modified transcript
Changed!
smart CASc 124aa 3e-26
in modified transcript
CAST
rs.CAST.F1 rs.CAST.R1 135 192
NCBIGene 36.3 831
Single exon skipping, size difference: 57
Exclusion in the protein (no frameshift)
Reference transcript: NM_001750
pfam Calpain_inhib 127aa 5e-19
in ref transcript
Calpain inhibitor. This region is found multiple times in calpain inhibitor proteins.
pfam Calpain_inhib 118aa 3e-16
in ref transcript
pfam Calpain_inhib 99aa 2e-13
in ref transcript
pfam Calpain_inhib 126aa 3e-11
in ref transcript
CAST
rs.CAST.F2 rs.CAST.R2 102 141
NCBIGene 36.3 831
Single exon skipping, size difference: 39
Exclusion in the protein (no frameshift)
Reference transcript: NM_001750
pfam Calpain_inhib 127aa 5e-19
in ref transcript
Calpain inhibitor. This region is found multiple times in calpain inhibitor proteins.
pfam Calpain_inhib 118aa 3e-16
in ref transcript
pfam Calpain_inhib 99aa 2e-13
in ref transcript
Changed!
pfam Calpain_inhib 126aa 3e-11
in ref transcript
Changed!
pfam Calpain_inhib 113aa 2e-09
in modified transcript
CAST
rs.CAST.F3 rs.CAST.R3 101 167
NCBIGene 36.3 831
Single exon skipping, size difference: 66
Exclusion in the protein (no frameshift)
Reference transcript: NM_001750
pfam Calpain_inhib 127aa 5e-19
in ref transcript
Calpain inhibitor. This region is found multiple times in calpain inhibitor proteins.
pfam Calpain_inhib 118aa 3e-16
in ref transcript
pfam Calpain_inhib 99aa 2e-13
in ref transcript
pfam Calpain_inhib 126aa 3e-11
in ref transcript
CBFA2T2
rs.CBFA2T2.F1 rs.CBFA2T2.R1 127 415
NCBIGene 36.3 9139
Single exon skipping, size difference: 288
Exclusion of the protein initiation site
Reference transcript: NM_005093
pfam TAFH 96aa 3e-38
in ref transcript
NHR1 homology to TAF. This corresponds to the region NHR1 that is conserved between the product of the nervy gene in Drosophila and the human mtg8b protein, which is hypothesised to be a transcription factor.
pfam NHR2 67aa 2e-32
in ref transcript
NHR2 domain like. The NHR2 (Nervy homology 2) domain is found in the ETO protein where it mediates oligomerisation and protein-protein interactions. It forms an alpha-helical tetramer.
pfam zf-MYND 37aa 5e-11
in ref transcript
MYND finger.
CBFA2T3
rs.CBFA2T3.F1 rs.CBFA2T3.R1 348 423
NCBIGene 36.3 863
Single exon skipping, size difference: 75
Exclusion in the protein (no frameshift)
Reference transcript: NM_005187
pfam TAFH 96aa 5e-38
in ref transcript
NHR1 homology to TAF. This corresponds to the region NHR1 that is conserved between the product of the nervy gene in Drosophila and the human mtg8b protein, which is hypothesised to be a transcription factor.
pfam NHR2 67aa 3e-24
in ref transcript
NHR2 domain like. The NHR2 (Nervy homology 2) domain is found in the ETO protein where it mediates oligomerisation and protein-protein interactions. It forms an alpha-helical tetramer.
pfam zf-MYND 37aa 5e-11
in ref transcript
MYND finger.
COG NtpE 60aa 0.003
in ref transcript
Archaeal/vacuolar-type H+-ATPase subunit E [Energy production and conversion].
CCDC32
rs.CCDC32.F1 rs.CCDC32.R1 121 146
NCBIGene 36.3 90416
Alternative 5-prime, size difference: 25
Exclusion of the protein initiation site
Reference transcript: NM_001080791
CCDC33
rs.CCDC33.F1 rs.CCDC33.R1 111 213
NCBIGene 36.3 80125
Single exon skipping, size difference: 102
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_025055
CCDC41
rs.CCDC41.F1 rs.CCDC41.R1 236 289
NCBIGene 36.3 51134
Single exon skipping, size difference: 53
Exclusion in 5'UTR
Reference transcript: NM_016122
TIGR SMC_prok_B 327aa 1e-11
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
COG Smc 324aa 5e-06
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
CCDC43
rs.CCDC43.F1 rs.CCDC43.R1 270 329
NCBIGene 36.3 124808
Single exon skipping, size difference: 59
Exclusion in the protein causing a frameshift
Reference transcript: NM_144609
CCHCR1
rs.CCHCR1.F1 rs.CCHCR1.R1 148 256
NCBIGene 36.3 54535
Alternative 3-prime, size difference: 108
Exclusion in the protein (no frameshift)
Reference transcript: NM_001105564
Changed!
pfam HCR 756aa 0.0
in ref transcript
Alpha helical coiled-coil rod protein (HCR). This family consists of several mammalian alpha helical coiled-coil rod HCR proteins. The function of HCR is unknown but it has been implicated in psoriasis in humans and is thought to affect keratinocyte proliferation.
Changed!
COG Smc 377aa 0.003
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
Changed!
pfam HCR 750aa 0.0
in modified transcript
CCNB1IP1
rs.CCNB1IP1.F1 rs.CCNB1IP1.R1 246 446
NCBIGene 36.3 57820
Single exon skipping, size difference: 200
Exclusion in 5'UTR
Reference transcript: NM_021178
CCNC
rs.CCNC.F1 rs.CCNC.R1 140 362
NCBIGene 36.3 892
Alternative 5-prime, size difference: 222
Exclusion of the protein initiation site
Reference transcript: NM_005190
Changed!
cd CYCLIN 52aa 4e-06
in ref transcript
Cyclin box fold. Protein binding domain functioning in cell-cycle and transcription control. Present in cyclins, TFIIB and Retinoblastoma (RB).The cyclins consist of 8 classes of cell cycle regulators that regulate cyclin dependent kinases (CDKs). TFIIB is a transcription factor that binds the TATA box. Cyclins, TFIIB and RB contain 2 copies of the domain.
cd CYCLIN 84aa 0.008
in ref transcript
Changed!
TIGR ccl1 250aa 7e-21
in ref transcript
University).
Changed!
COG CCL1 243aa 4e-30
in ref transcript
Cdk activating kinase (CAK)/RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH/TFIIK, cyclin H subunit [Cell division and chromosome partitioning / Transcription / DNA replication, recombination, and repair].
Changed!
TIGR ccl1 169aa 4e-12
in modified transcript
Changed!
COG CCL1 157aa 2e-18
in modified transcript
CCR5
rs.CCR5.F1 rs.CCR5.R1 100 335
NCBIGene 36.3 1234
Alternative 3-prime, size difference: 235
Inclusion in 5'UTR
Reference transcript: NM_001100168
pfam 7tm_1 239aa 8e-30
in ref transcript
7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
CD1E
rs.CD1E.F1 rs.CD1E.R1 129 233
NCBIGene 36.3 913
Alternative 5-prime, size difference: 104
Exclusion in the protein causing a frameshift
Reference transcript: NM_030893
Changed!
cd IGc 93aa 3e-11
in ref transcript
Immunoglobulin domain constant region subfamily; members of the IGc subfamily are components of immunoglobulins, T-cell receptors, CD1 cell surface glycoproteins, secretory glycoproteins A/C, and Major Histocompatibility Complex (MHC) class I/II molecules. In immunoglobulins, each chain is composed of one variable domain (IGv) and one or more constant domains (IGc); these names reflect the fact that the variability in sequences is higher in the variable domain than in the constant domain. T-cell receptors form heterodimers, pairing two chains (alpha/beta or gamma/delta), each with a IGv and IGc domain. MHCs form heterodimers pairing two chains (alpha/beta or delta/epsilon), each with a MHC and IGc domain. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
Changed!
smart IGc1 69aa 7e-16
in ref transcript
Immunoglobulin C-Type.
Changed!
cd IGc 59aa 1e-06
in modified transcript
Changed!
smart IGc1 45aa 8e-10
in modified transcript
CD1E
rs.CD1E.F2 rs.CD1E.R2 193 229
NCBIGene 36.3 913
Alternative 3-prime, size difference: 36
Exclusion in the protein (no frameshift)
Reference transcript: NM_030893
cd IGc 93aa 3e-11
in ref transcript
Immunoglobulin domain constant region subfamily; members of the IGc subfamily are components of immunoglobulins, T-cell receptors, CD1 cell surface glycoproteins, secretory glycoproteins A/C, and Major Histocompatibility Complex (MHC) class I/II molecules. In immunoglobulins, each chain is composed of one variable domain (IGv) and one or more constant domains (IGc); these names reflect the fact that the variability in sequences is higher in the variable domain than in the constant domain. T-cell receptors form heterodimers, pairing two chains (alpha/beta or gamma/delta), each with a IGv and IGc domain. MHCs form heterodimers pairing two chains (alpha/beta or delta/epsilon), each with a MHC and IGc domain. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
smart IGc1 69aa 7e-16
in ref transcript
Immunoglobulin C-Type.
CD33
rs.CD33.F1 rs.CD33.R1 125 506
NCBIGene 36.3 945
Single exon skipping, size difference: 381
Exclusion in the protein (no frameshift)
Reference transcript: NM_001772
Changed!
pfam V-set 114aa 7e-09
in ref transcript
Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
pfam C2-set_2 74aa 4e-04
in ref transcript
CD80-like C2-set immunoglobulin domain. These domains belong to the immunoglobulin superfamily.
CD3D
rs.CD3D.F1 rs.CD3D.R1 306 438
NCBIGene 36.3 915
Single exon skipping, size difference: 132
Exclusion in the protein (no frameshift)
Reference transcript: NM_000732
smart ITAM 21aa 0.001
in ref transcript
Immunoreceptor tyrosine-based activation motif. Motif that may be dually phosphorylated on tyrosine that links antigen receptors to downstream signalling machinery.
CD40
rs.CD40.F1 rs.CD40.R1 142 204
NCBIGene 36.3 958
Single exon skipping, size difference: 62
Exclusion in the protein causing a frameshift
Reference transcript: NM_001250
cd TNFR 98aa 8e-16
in ref transcript
Tumor necrosis factor receptor (TNFR) domain; superfamily of TNF-like receptor domains. When bound to TNF-like cytokines, TNFRs trigger multiple signal transduction pathways, they are involved in inflammation response, apoptosis, autoimmunity and organogenesis. TNFRs domains are elongated with generally three tandem repeats of cysteine-rich domains (CRDs). They fit in the grooves between protomers within the ligand trimer. Some TNFRs, such as NGFR and HveA, bind ligands with no structural similarity to TNF and do not bind ligand trimers.
Changed!
cd TNFR 63aa 3e-04
in ref transcript
smart TNFR 39aa 0.004
in ref transcript
Tumor necrosis factor receptor / nerve growth factor receptor repeats. Repeats in growth factor receptors that are involved in growth factor binding. TNF/TNFR.
CD44
rs.CD44.F1 rs.CD44.R1 225 354
NCBIGene 36.3 960
Single exon skipping, size difference: 129
Exclusion in the protein (no frameshift)
Reference transcript: NM_000610
cd Link_domain_CD44_like 145aa 3e-50
in ref transcript
This domain is a hyaluronan (HA)-binding domain. It is found in CD44 receptor and mediates adhesive interactions during inflammatory leukocyte homing and tumor metastasis. It also plays an important role in arteriogenesis. The functional HA-binding domain of CD44 is an extended domain comprised of a single link module flanked with N-and C- extensions. These extensions are essential for folding and for functional activity. This group also contains the cell surface retention sequence (CRS) binding protein-1 (CRSBP-1) and lymph vessel endothelial receptor-1 (LYVE-1). CRSBP-1 is a cell surface binding protein for the CRS motif of PDGF-BB (platelet-derived growth factor-BB) and is responsible for the cell surface retention of PDGF-BB in SSV-transformed cells. CRSBP-1 may play a role in autocrine regulation of cell growth mediated by CRS containing growth regulators. LYVE-1 is preferentially expressed on the lymphatic endothelium and is used as a molecular marker for the detection and characterization of lymphatic vessels in tumors.
smart LINK 91aa 1e-32
in ref transcript
Link (Hyaluronan-binding).
CD47
rs.CD47.F1 rs.CD47.R1 168 201
NCBIGene 36.3 961
Single exon skipping, size difference: 33
Exclusion in the protein (no frameshift)
Reference transcript: NM_001777
pfam CD47 158aa 2e-40
in ref transcript
CD47 transmembrane region. This family represents the transmembrane region of CD47 leukocyte antigen.
pfam V-set_CD47 135aa 2e-38
in ref transcript
CD47 immunoglobulin-like domain. This family represents the CD47 leukocyte antigen V-set like Ig domain.
CD59
rs.CD59.F1 rs.CD59.R1 105 221
NCBIGene 36.3 966
Single exon skipping, size difference: 116
Exclusion in 5'UTR
Reference transcript: NM_203330
cd LU 72aa 2e-08
in ref transcript
Ly-6 antigen / uPA receptor -like domain; occurs singly in GPI-linked cell-surface glycoproteins (Ly-6 family,CD59, thymocyte B cell antigen, Sgp-2) or as three-fold repeated domain in urokinase-type plasminogen activator receptor. Topology of these domains is similar to that of snake venom neurotoxins.
pfam UPAR_LY6 68aa 7e-12
in ref transcript
u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087.
CD82
rs.CD82.F1 rs.CD82.R1 475 550
NCBIGene 36.3 3732
Single exon skipping, size difference: 75
Exclusion in the protein (no frameshift)
Reference transcript: NM_002231
Changed!
cd CD37_CD82_like_LEL 122aa 5e-39
in ref transcript
Tetraspanin, extracellular domain or large extracellular loop (LEL), CD37_CD82_Like family. Tetraspanins are trans-membrane proteins with 4 trans-membrane segments. Both the N- and C-termini lie on the intracellular side of the membrane. This alignment model spans the extracellular domain between the 3rd and 4th trans-membrane segment. Tetraspanins are involved in diverse processes and their various functions may relate to their ability to act as molecular facilitators. Tetraspanins associate laterally with one another and cluster dynamically with numerous parnter domains in membrane microdomains, forming a network of multimolecular complexes, the "tetraspanin web". CD37 is a leukocyte-specific protein, and its restricted expression pattern suggests a role in the immune system. A regulatory role in T-cell proliferation has been suggested. CD82 is a metastasis suppressor implicated in biological processes ranging from fusion, adhesion, and migration to apoptosis and alterations of cell morphology.
Changed!
pfam Tetraspannin 250aa 4e-28
in ref transcript
Tetraspanin family.
Changed!
cd CD37_CD82_like_LEL 117aa 1e-36
in modified transcript
Changed!
pfam Tetraspannin 225aa 6e-24
in modified transcript
CDC2
rs.CDC2.F1 rs.CDC2.R1 111 282
NCBIGene 36.3 983
Single exon skipping, size difference: 171
Exclusion in the protein (no frameshift)
Reference transcript: NM_001786
Changed!
cd S_TKc 285aa 6e-76
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
Changed!
pfam Pkinase 284aa 4e-82
in ref transcript
Protein kinase domain.
Changed!
PTZ PTZ00024 278aa 3e-51
in ref transcript
cyclin-dependent protein kinase; Provisional.
Changed!
cd S_TKc 228aa 7e-41
in modified transcript
Changed!
pfam Pkinase 227aa 1e-48
in modified transcript
Changed!
PTZ PTZ00024 141aa 2e-27
in modified transcript
Changed!
PTZ PTZ00266 82aa 6e-09
in modified transcript
NIMA-related protein kinase; Provisional.
CDC2L1
rs.CDC2L1.F1 rs.CDC2L1.R1 107 154
NCBIGene 36.3 984
Single exon skipping, size difference: 47
Inclusion in the protein causing a frameshift
Reference transcript: NM_033486
Changed!
cd S_TKc 287aa 3e-68
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
Changed!
smart S_TKc 276aa 1e-68
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
Changed!
PTZ PTZ00024 295aa 4e-55
in ref transcript
cyclin-dependent protein kinase; Provisional.
CDC42BPA
rs.CDC42BPA.F1 rs.CDC42BPA.R1 172 415
NCBIGene 36.3 8476
Single exon skipping, size difference: 243
Exclusion in the protein (no frameshift)
Reference transcript: NM_003607
cd STKc_MRCK_alpha 332aa 0.0
in ref transcript
STKc_MRCK_alpha: Serine/Threonine Kinases (STKs), DMPK-like subfamily, DMPK-related cell division control protein 42 (Cdc42) binding kinase (MRCK) alpha isoform, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The DMPK-like subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. MRCK is activated via interaction with the small GTPase Cdc42. MRCK/Cdc42 signaling mediates myosin-dependent cell motility. MRCKalpha is expressed ubiquitously in many tissues. It plays a role in the regulation of peripheral actin reorganization and neurite outgrowth. It may also play a role in the transferrin iron uptake pathway.
cd PH_MRCK 122aa 2e-61
in ref transcript
MRCK (myotonic dystrophy-related Cdc42-binding kinase) pleckstrin homology (PH) domain. MRCK consists of a serine/threonine kinase domain, a cysteine rich (C1) region, a PH domain and a p21 binding motif. It has been shown to promote cytoskeletal reorganization, which affects many biological processes. The MRCK PH domain is responsible for its targeting to cell to cell junctions. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
cd C1 50aa 1e-09
in ref transcript
Protein kinase C conserved region 1 (C1) . Cysteine-rich zinc binding domain. Some members of this domain family bind phorbol esters and diacylglycerol, some are reported to bind RasGTP. May occur in tandem arrangement. Diacylglycerol (DAG) is a second messenger, released by activation of Phospholipase D. Phorbol Esters (PE) can act as analogues of DAG and mimic its downstream effects in, for example, tumor promotion. Protein Kinases C are activated by DAG/PE, this activation is mediated by their N-terminal conserved region (C1). DAG/PE binding may be phospholipid dependent. C1 domains may also mediate DAG/PE signals in chimaerins (a family of Rac GTPase activating proteins), RasGRPs (exchange factors for Ras/Rap1), and Munc13 isoforms (scaffolding proteins involved in exocytosis).
cd CRIB 36aa 3e-04
in ref transcript
PAK (p21 activated kinase) Binding Domain (PBD), binds Cdc42p- and/or Rho-like small GTPases; also known as the Cdc42/Rac interactive binding (CRIB) motif; has been shown to inhibit transcriptional activation and cell transformation mediated by the Ras-Rac pathway. CRIB-containing effector proteins are functionally diverse and include serine/threonine kinases, tyrosine kinases, actin-binding proteins, and adapter molecules.
smart S_TKc 257aa 2e-68
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
pfam CNH 264aa 6e-64
in ref transcript
CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations.
pfam DMPK_coil 61aa 1e-17
in ref transcript
DMPK coiled coil domain like. This domain is found in the myotonic dystrophy protein kinase (DMPK) and adopts a coiled coil structure. It plays a role in dimerisation.
Changed!
TIGR SMC_prok_B 306aa 4e-14
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
smart S_TK_X 60aa 1e-11
in ref transcript
Extension to Ser/Thr-type protein kinases.
pfam C1_1 51aa 8e-10
in ref transcript
Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain.
Changed!
pfam SMC_N 338aa 1e-08
in ref transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
smart PBD 36aa 3e-05
in ref transcript
P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB).
smart PH 117aa 2e-04
in ref transcript
Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
PTZ PTZ00263 335aa 3e-63
in ref transcript
protein kinase A catalytic subunit; Provisional.
Changed!
COG SbcC 545aa 5e-17
in ref transcript
ATPase involved in DNA repair [DNA replication, recombination, and repair].
Changed!
pfam SMC_N 329aa 6e-09
in modified transcript
Changed!
COG SbcC 464aa 3e-11
in modified transcript
CDKN1A
rs.CDKN1A.F1 rs.CDKN1A.R1 243 384
NCBIGene 36.3 1026
Alternative 5-prime, size difference: 141
Exclusion in 5'UTR
Reference transcript: NM_078467
pfam CDI 50aa 8e-13
in ref transcript
Cyclin-dependent kinase inhibitor. Cell cycle progression is negatively controlled by cyclin-dependent kinases inhibitors (CDIs). CDIs are involved in cell cycle arrest at the G1 phase.
CDKN2A
rs.CDKN2A.F1 rs.CDKN2A.R1 127 401
NCBIGene 36.3 1029
Alternative 5-prime, size difference: 274
Inclusion in the protein causing a new stop codon
Reference transcript: NM_000077
Changed!
cd ANK 100aa 1e-12
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
Changed!
PTZ PTZ00322 81aa 1e-05
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IGcam 80aa 3e-04
in ref transcript
cd IGcam 73aa 6e-04
in ref transcript
pfam I-set 84aa 4e-06
in ref transcript
Immunoglobulin I-set domain.
smart IGc2 59aa 6e-06
in ref transcript
Immunoglobulin C-2 Type.
smart IG_like 74aa 7e-06
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
CEACAM20
rs.CEACAM20.F2 rs.CEACAM20.R2 124 403
NCBIGene 36.3 125931
Single exon skipping, size difference: 279
Exclusion in the protein (no frameshift)
Reference transcript: NM_001102597
Changed!
cd IGcam 71aa 1e-05
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IGcam 80aa 3e-04
in ref transcript
cd IGcam 73aa 6e-04
in ref transcript
pfam I-set 84aa 4e-06
in ref transcript
Immunoglobulin I-set domain.
smart IGc2 59aa 6e-06
in ref transcript
Immunoglobulin C-2 Type.
Changed!
smart IG_like 74aa 7e-06
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
CENPA
rs.CENPA.F1 rs.CENPA.R1 390 468
NCBIGene 36.3 1058
Single exon skipping, size difference: 78
Exclusion in the protein (no frameshift)
Reference transcript: NM_001809
Changed!
smart H3 100aa 6e-34
in ref transcript
Histone H3.
Changed!
PTZ PTZ00018 87aa 2e-16
in ref transcript
histone 3; Provisional.
Changed!
smart H3 74aa 3e-25
in modified transcript
Changed!
PTZ PTZ00018 61aa 5e-14
in modified transcript
CEP170
rs.CEP170.F1 rs.CEP170.R1 250 358
NCBIGene 36.3 9859
Alternative 3-prime, size difference: 108
Exclusion in the protein (no frameshift)
Reference transcript: NM_014812
cd FHA 89aa 4e-10
in ref transcript
Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain. FHA domains may bind phosphothreonine, phosphoserine and sometimes phosphotyrosine. In eukaryotes, many FHA domain-containing proteins localize to the nucleus, where they participate in establishing or maintaining cell cycle checkpoints, DNA repair, or transcriptional regulation. Members of the FHA family include: Dun1, Rad53, Cds1, Mek1, KAPP(kinase-associated protein phosphatase),and Ki-67 (a human nuclear protein related to cell proliferation).
pfam FHA 68aa 3e-11
in ref transcript
FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif.
COG COG1716 79aa 1e-04
in ref transcript
FOG: FHA domain [Signal transduction mechanisms].
CEP170
rs.CEP170.F2 rs.CEP170.R2 148 178
NCBIGene 36.3 9859
Alternative 5-prime, size difference: 30
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_014812
cd FHA 89aa 4e-10
in ref transcript
Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain. FHA domains may bind phosphothreonine, phosphoserine and sometimes phosphotyrosine. In eukaryotes, many FHA domain-containing proteins localize to the nucleus, where they participate in establishing or maintaining cell cycle checkpoints, DNA repair, or transcriptional regulation. Members of the FHA family include: Dun1, Rad53, Cds1, Mek1, KAPP(kinase-associated protein phosphatase),and Ki-67 (a human nuclear protein related to cell proliferation).
pfam FHA 68aa 3e-11
in ref transcript
FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif.
COG COG1716 79aa 1e-04
in ref transcript
FOG: FHA domain [Signal transduction mechanisms].
CEP170
rs.CEP170.F3 rs.CEP170.R3 234 528
NCBIGene 36.3 9859
Single exon skipping, size difference: 294
Exclusion in the protein (no frameshift)
Reference transcript: NM_014812
cd FHA 89aa 4e-10
in ref transcript
Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain. FHA domains may bind phosphothreonine, phosphoserine and sometimes phosphotyrosine. In eukaryotes, many FHA domain-containing proteins localize to the nucleus, where they participate in establishing or maintaining cell cycle checkpoints, DNA repair, or transcriptional regulation. Members of the FHA family include: Dun1, Rad53, Cds1, Mek1, KAPP(kinase-associated protein phosphatase),and Ki-67 (a human nuclear protein related to cell proliferation).
pfam FHA 68aa 3e-11
in ref transcript
FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif.
COG COG1716 79aa 1e-04
in ref transcript
FOG: FHA domain [Signal transduction mechanisms].
CEP250
rs.CEP250.F1 rs.CEP250.R1 185 353
NCBIGene 36.3 11190
Single exon skipping, size difference: 168
Exclusion in the protein (no frameshift)
Reference transcript: NM_007186
TIGR SMC_prok_B 407aa 8e-12
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
TIGR SMC_prok_B 282aa 3e-07
in ref transcript
TIGR SMC_prok_B 209aa 2e-04
in ref transcript
pfam SMC_N 108aa 0.009
in ref transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Changed!
COG Smc 379aa 6e-10
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
COG Smc 261aa 5e-04
in ref transcript
Changed!
COG Smc 372aa 6e-10
in modified transcript
CEP63
rs.CEP63.F1 rs.CEP63.R1 122 260
NCBIGene 36.3 80254
Single exon skipping, size difference: 138
Exclusion in the protein (no frameshift)
Reference transcript: NM_025180
Changed!
TIGR SMC_prok_B 289aa 9e-09
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
Changed!
TIGR SMC_prok_A 224aa 7e-04
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
Changed!
COG Smc 345aa 3e-07
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
Changed!
TIGR SMC_prok_A 346aa 8e-09
in modified transcript
Changed!
TIGR SMC_prok_B 219aa 1e-07
in modified transcript
Changed!
COG Smc 299aa 2e-06
in modified transcript
Changed!
COG Smc 222aa 0.001
in modified transcript
CEP78
rs.CEP78.F1 rs.CEP78.R1 108 156
NCBIGene 36.3 84131
Single exon skipping, size difference: 48
Exclusion in the protein (no frameshift)
Reference transcript: NM_001098802
cd LRR_RI 170aa 1e-09
in ref transcript
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
CHCHD4
rs.CHCHD4.F1 rs.CHCHD4.R1 193 363
NCBIGene 36.3 131474
Single exon skipping, size difference: 170
Exclusion of the protein initiation site
Reference transcript: NM_144636
pfam CHCH 44aa 5e-06
in ref transcript
CHCH domain. we have identified a conserved motif in the LOC118487 protein that we have called the CHCH motif. Alignment of this protein with related members showed the presence of three subgroups of proteins, which are called the S (Small), N (N-terminal extended) and C (C-terminal extended) subgroups. All three sub-groups of proteins have in common that they contain a predicted conserved [coiled coil 1]-[helix 1]-[coiled coil 2]-[helix 2] domain (CHCH domain). Within each helix of the CHCH domain, there are two cysteines present in a C-X9-C motif. The N-group contains an additional double helix domain, and each helix contains the C-X9-C motif. This family contains a number of characterised proteins: Cox19 protein - a nuclear gene of Saccharomyces cerevisiae, codes for an 11-kDa protein (Cox19p) required for expression of cytochrome oxidase. Because cox19 mutants are able to synthesise the mitochondrial and nuclear gene products of cytochrome oxidase, Cox19p probably functions post-translationally during assembly of the enzyme. Cox19p is present in the cytoplasm and mitochondria, where it exists as a soluble intermembrane protein. This dual location is similar to what was previously reported for Cox17p, a low molecular weight copper protein thought to be required for maturation of the CuA centre of subunit 2 of cytochrome oxidase. Cox19p have four conserved potential metal ligands, these are three cysteines and one histidine. Mrp10 - belongs to the class of yeast mitochondrial ribosomal proteins that are essential for translation. Eukaryotic NADH-ubiquinone oxidoreductase 19 kDa (NDUFA8) subunit. The CHCH domain was previously called DUF657.
CHEK2
rs.CHEK2.F1 rs.CHEK2.R1 231 360
NCBIGene 36.3 11200
Single exon skipping, size difference: 129
Exclusion in the protein (no frameshift)
Reference transcript: NM_001005735
cd S_TKc 268aa 7e-68
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
Changed!
cd FHA 89aa 5e-07
in ref transcript
Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain. FHA domains may bind phosphothreonine, phosphoserine and sometimes phosphotyrosine. In eukaryotes, many FHA domain-containing proteins localize to the nucleus, where they participate in establishing or maintaining cell cycle checkpoints, DNA repair, or transcriptional regulation. Members of the FHA family include: Dun1, Rad53, Cds1, Mek1, KAPP(kinase-associated protein phosphatase),and Ki-67 (a human nuclear protein related to cell proliferation).
smart S_TKc 257aa 4e-75
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
pfam FHA 79aa 8e-08
in ref transcript
FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif.
PTZ PTZ00263 226aa 5e-43
in ref transcript
protein kinase A catalytic subunit; Provisional.
Changed!
cd FHA 109aa 1e-07
in modified transcript
CHEK2
rs.CHEK2.F2 rs.CHEK2.R2 157 244
NCBIGene 36.3 11200
Single exon skipping, size difference: 87
Exclusion in the protein (no frameshift)
Reference transcript: NM_001005735
Changed!
cd S_TKc 268aa 7e-68
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
cd FHA 89aa 5e-07
in ref transcript
Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain. FHA domains may bind phosphothreonine, phosphoserine and sometimes phosphotyrosine. In eukaryotes, many FHA domain-containing proteins localize to the nucleus, where they participate in establishing or maintaining cell cycle checkpoints, DNA repair, or transcriptional regulation. Members of the FHA family include: Dun1, Rad53, Cds1, Mek1, KAPP(kinase-associated protein phosphatase),and Ki-67 (a human nuclear protein related to cell proliferation).
Changed!
smart S_TKc 257aa 4e-75
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
pfam FHA 79aa 8e-08
in ref transcript
FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif.
Changed!
PTZ PTZ00263 226aa 5e-43
in ref transcript
protein kinase A catalytic subunit; Provisional.
Changed!
cd S_TKc 239aa 4e-50
in modified transcript
Changed!
smart S_TKc 228aa 6e-56
in modified transcript
Changed!
PTZ PTZ00263 197aa 9e-29
in modified transcript
CHMP1A
rs.CHMP1A.F1 rs.CHMP1A.R1 100 120
NCBIGene 36.3 5119
Single exon skipping, size difference: 20
Inclusion in the protein causing a frameshift
Reference transcript: NM_001083314
Changed!
pfam Snf7 140aa 3e-11
in modified transcript
Snf7. This family of proteins are involved in protein sorting and transport from the endosome to the vacuole/lysosome in eukaryotic cells. Vacuoles/lysosomes play an important role in the degradation of both lipids and cellular proteins. In order to perform this degradative function, vacuoles/lysosomes contain numerous hydrolases which have been transported in the form of inactive precursors via the biosynthetic pathway and are proteolytically activated upon delivery to the vacuole/lysosome. The delivery of transmembrane proteins, such as activated cell surface receptors to the lumen of the vacuole/lysosome, either for degradation/downregulation, or in the case of hydrolases, for proper localisation, requires the formation of multivesicular bodies (MVBs). These late endosomal structures are formed by invaginating and budding of the limiting membrane into the lumen of the compartment. During this process, a subset of the endosomal membrane proteins is sorted into the forming vesicles. Mature MVBs fuse with the vacuole/lysosome, thereby releasing cargo containing vesicles into its hydrolytic lumen for degradation. Endosomal proteins that are not sorted into the intralumenal MVB vesicles are either recycled back to the plasma membrane or Golgi complex, or remain in the limiting membrane of the MVB and are thereby transported to the limiting membrane of the vacuole/lysosome as a consequence of fusion. Therefore, the MVB sorting pathway plays a critical role in the decision between recycling and degradation of membrane proteins. A few archaeal sequences are also present within this family.
Changed!
COG VPS24 148aa 1e-06
in modified transcript
Conserved protein implicated in secretion [Cell motility and secretion].
CHRFAM7A
rs.CHRFAM7A.F1 rs.CHRFAM7A.R1 328 392
NCBIGene 36.3 89832
Single exon skipping, size difference: 64
Exclusion of the protein initiation site
Reference transcript: NM_139320
Changed!
TIGR LIC 373aa 1e-107
in ref transcript
selective while glycine receptors are anion selective).
Changed!
TIGR LIC 307aa 3e-78
in modified transcript
CLCC1
rs.CLCC1.F1 rs.CLCC1.R1 90 100
NCBIGene 36.3 23155
Alternative 3-prime, size difference: 10
Inclusion in 5'UTR
Reference transcript: NM_001048210
pfam MCLC 538aa 0.0
in ref transcript
Mid-1-related chloride channel (MCLC). This family consists of several mid-1-related chloride channels. mid-1-related chloride channel (MCLC) proteins function as a chloride channel when incorporated in the planar lipid bilayer.
CLCC1
rs.CLCC1.F2 rs.CLCC1.R2 101 251
NCBIGene 36.3 23155
Alternative 3-prime, size difference: 150
Exclusion in the protein (no frameshift)
Reference transcript: NM_001048210
Changed!
pfam MCLC 538aa 0.0
in ref transcript
Mid-1-related chloride channel (MCLC). This family consists of several mid-1-related chloride channels. mid-1-related chloride channel (MCLC) proteins function as a chloride channel when incorporated in the planar lipid bilayer.
Changed!
pfam MCLC 488aa 0.0
in modified transcript
CLCN6
rs.CLCN6.F1 rs.CLCN6.R1 90 100
NCBIGene 36.3 1185
Alternative 5-prime, size difference: 10
Inclusion in the protein causing a frameshift
Reference transcript: NM_001286
Changed!
cd ClC_6_like 322aa 1e-105
in ref transcript
ClC-6-like chloride channel proteins. This CD includes ClC-6, ClC-7 and ClC-B, C, D in plants. Proteins in this family are ubiquitous in eukarotes and their functions are unclear. They are expressed in intracellular organelles membranes. This family belongs to the ClC superfamily of chloride ion channels, which share the unique double-barreled architecture and voltage-dependent gating mechanism. The gating is conferred by the permeating anion itself, acting as the gating charge. ClC chloride ion channel superfamily perform a variety of functions including cellular excitability regulation, cell volume regulation, membrane potential stabilization, acidification of intracellular organelles, signal transduction, and transepithelial transport in animals.
Changed!
cd ClC_6_like 112aa 5e-39
in ref transcript
Changed!
cd CBS_pair_EriC_assoc_euk_bac 54aa 2e-13
in ref transcript
This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the EriC CIC-type chloride channels in eukaryotes and bacteria. These ion channels are proteins with a seemingly simple task of allowing the passive flow of chloride ions across biological membranes. CIC-type chloride channels come from all kingdoms of life, have several gene families, and can be gated by voltage. The members of the CIC-type chloride channel are double-barreled: two proteins forming homodimers at a broad interface formed by four helices from each protein. The two pores are not found at this interface, but are completely contained within each subunit, as deduced from the mutational analyses, unlike many other channels, in which four or five identical or structurally related subunits jointly form one pore. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain in CLC chloride channel family members have been associated with classic Bartter syndrome, Osteopetrosis, Dent's disease, idiopathic generalized epilepsy, and myotonia.
Changed!
pfam Voltage_CLC 231aa 4e-28
in ref transcript
Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. In terms of primary structure, they are unrelated to known cation channels or other types of anion channels. Three ClC subfamilies are found in animals. ClC-1 is involved in setting and restoring the resting membrane potential of skeletal muscle, while other channels play important parts in solute concentration mechanisms in the kidney. These proteins contain two pfam00571 domains.
Changed!
pfam Voltage_CLC 129aa 2e-15
in ref transcript
Changed!
pfam CBS 58aa 3e-08
in ref transcript
CBS domain pair. CBS domains are small intracellular modules that pair together to form a stable globular domain. This family represents a pair of CBS domains, that has been termed a Bateman domain. CBS domains have been shown to bind ligands with an adenosyl group such as AMP, ATP and S-AdoMet. CBS domains are found attached to a wide range of other protein domains suggesting that CBS domains may play a regulatory role making proteins sensitive to adenosyl carrying ligands. The region containing the CBS domains in Cystathionine-beta synthase is involved in regulation by S-AdoMet. CBS domain pairs from AMPK bind AMP or ATP. The CBS domains from IMPDH and the chloride channel CLC2 bind ATP.
Changed!
smart CBS 48aa 0.009
in ref transcript
Domain in cystathionine beta-synthase and other proteins. Domain present in all 3 forms of cellular life. Present in two copies in inosine monophosphate dehydrogenase, of which one is disordered in the crystal structure [3]. A number of disease states are associated with CBS-containing proteins including homocystinuria, Becker's and Thomsen disease.
Changed!
COG EriC 287aa 2e-13
in ref transcript
Chloride channel protein EriC [Inorganic ion transport and metabolism].
Changed!
COG EriC 111aa 3e-09
in ref transcript
Changed!
PRK PRK05567 58aa 3e-05
in ref transcript
Changed!
cd ClC_6_like 188aa 1e-59
in modified transcript
Changed!
pfam Voltage_CLC 78aa 1e-14
in modified transcript
Changed!
COG EriC 136aa 2e-07
in modified transcript
CLEC12A
rs.CLEC12A.F1 rs.CLEC12A.R1 211 310
NCBIGene 36.3 160364
Single exon skipping, size difference: 99
Exclusion in the protein (no frameshift)
Reference transcript: NM_138337
cd CLECT_NK_receptors_like 118aa 8e-30
in ref transcript
CLECT_NK_receptors_like: C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs), including proteins similar to oxidized low density lipoprotein (OxLDL) receptor (LOX-1), CD94, CD69, NKG2-A and -D, osteoclast inhibitory lectin (OCIL), dendritic cell-associated C-type lectin-1 (dectin-1), human myeloid inhibitory C-type lectin-like receptor (MICL), mast cell-associated functional antigen (MAFA), killer cell lectin-like receptors: subfamily F, member 1 (KLRF1) and subfamily B, member 1 (KLRB1), and lys49 receptors. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. NKRs are variously associated with activation or inhibition of natural killer (NK) cells. Activating NKRs stimulate cytolysis by NK cells of virally infected or transformed cells; inhibitory NKRs block cytolysis upon recognition of markers of healthy self cells. Most Lys49 receptors are inhibitory; some are stimulatory. OCIL inhibits NK cell function via binding to the receptor NKRP1D. Murine OCIL in addition to inhibiting NK cell function inhibits osteoclast differentiation. MAFA clusters with the type I Fc epsilon receptor (FcepsilonRI) and inhibits the mast cells secretory response to FcepsilonRI stimulus. CD72 is a negative regulator of B cell receptor signaling. NKG2D is an activating receptor for stress-induced antigens; human NKG2D ligands include the stress induced MHC-I homologs, MICA, MICB, and ULBP family of glycoproteins Several NKRs have a carbohydrate-binding capacity which is not mediated through calcium ions (e.g. OCIL binds a range of high molecular weight sulfated glycosaminoglycans including dextran sulfate, fucoidan, and gamma-carrageenan sugars). Dectin-1 binds fungal beta-glucans and in involved in the innate immune responses to fungal pathogens. MAFA binds saccharides having terminal alpha-D mannose residues in a calcium-dependent manner. LOX-1 is the major receptor for OxLDL in endothelial cells and thought to play a role in the pathology of atherosclerosis. Some NKRs exist as homodimers (e.g.Lys49, NKG2D, CD69, LOX-1) and some as heterodimers (e.g. CD94/NKG2A). Dectin-1 can function as a monomer in vitro.
smart CLECT 117aa 4e-15
in ref transcript
C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.
CLEC1B
rs.CLEC1B.F1 rs.CLEC1B.R1 268 367
NCBIGene 36.3 51266
Single exon skipping, size difference: 99
Exclusion in the protein (no frameshift)
Reference transcript: NM_016509
cd CLECT_NK_receptors_like 117aa 8e-29
in ref transcript
CLECT_NK_receptors_like: C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs), including proteins similar to oxidized low density lipoprotein (OxLDL) receptor (LOX-1), CD94, CD69, NKG2-A and -D, osteoclast inhibitory lectin (OCIL), dendritic cell-associated C-type lectin-1 (dectin-1), human myeloid inhibitory C-type lectin-like receptor (MICL), mast cell-associated functional antigen (MAFA), killer cell lectin-like receptors: subfamily F, member 1 (KLRF1) and subfamily B, member 1 (KLRB1), and lys49 receptors. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. NKRs are variously associated with activation or inhibition of natural killer (NK) cells. Activating NKRs stimulate cytolysis by NK cells of virally infected or transformed cells; inhibitory NKRs block cytolysis upon recognition of markers of healthy self cells. Most Lys49 receptors are inhibitory; some are stimulatory. OCIL inhibits NK cell function via binding to the receptor NKRP1D. Murine OCIL in addition to inhibiting NK cell function inhibits osteoclast differentiation. MAFA clusters with the type I Fc epsilon receptor (FcepsilonRI) and inhibits the mast cells secretory response to FcepsilonRI stimulus. CD72 is a negative regulator of B cell receptor signaling. NKG2D is an activating receptor for stress-induced antigens; human NKG2D ligands include the stress induced MHC-I homologs, MICA, MICB, and ULBP family of glycoproteins Several NKRs have a carbohydrate-binding capacity which is not mediated through calcium ions (e.g. OCIL binds a range of high molecular weight sulfated glycosaminoglycans including dextran sulfate, fucoidan, and gamma-carrageenan sugars). Dectin-1 binds fungal beta-glucans and in involved in the innate immune responses to fungal pathogens. MAFA binds saccharides having terminal alpha-D mannose residues in a calcium-dependent manner. LOX-1 is the major receptor for OxLDL in endothelial cells and thought to play a role in the pathology of atherosclerosis. Some NKRs exist as homodimers (e.g.Lys49, NKG2D, CD69, LOX-1) and some as heterodimers (e.g. CD94/NKG2A). Dectin-1 can function as a monomer in vitro.
smart CLECT 116aa 8e-19
in ref transcript
C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.
CLIP1
rs.CLIP1.F1 rs.CLIP1.R1 280 385
NCBIGene 36.3 6249
Single exon skipping, size difference: 105
Exclusion in the protein (no frameshift)
Reference transcript: NM_002956
pfam CAP_GLY 66aa 6e-27
in ref transcript
CAP-Gly domain. Cytoskeleton-associated proteins (CAPs) are involved in the organisation of microtubules and transportation of vesicles and organelles along the cytoskeletal network. A conserved motif, CAP-Gly, has been identified in a number of CAPs, including CLIP-170 and dynactins. The crystal structure of Caenorhabditis elegans F53F4.3 protein CAP-Gly domain was recently solved. The domain contains three beta-strands. The most conserved sequence, GKNDG, is located in two consecutive sharp turns on the surface, forming the entrance to a groove.
pfam CAP_GLY 66aa 6e-26
in ref transcript
TIGR SMC_prok_A 315aa 9e-14
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
Changed!
TIGR SMC_prok_B 178aa 8e-08
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
Changed!
TIGR SMC_prok_B 349aa 2e-07
in ref transcript
COG NIP100 61aa 2e-11
in ref transcript
Dynactin complex subunit involved in mitotic spindle partitioning in anaphase B [Cell division and chromosome partitioning].
COG Smc 306aa 4e-11
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
COG NIP100 62aa 2e-10
in ref transcript
Changed!
PRK PRK03918 578aa 5e-07
in ref transcript
chromosome segregation protein; Provisional.
Changed!
TIGR SMC_prok_B 144aa 9e-06
in modified transcript
Changed!
pfam SMC_N 392aa 1e-05
in modified transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Changed!
PRK PRK03918 543aa 3e-06
in modified transcript
CMTM1
rs.CMTM1.F1 rs.CMTM1.R1 109 460
NCBIGene 36.3 113540
Alternative 5-prime, size difference: 351
Exclusion in the protein (no frameshift)
Reference transcript: NM_052999
Changed!
pfam MARVEL 114aa 0.006
in ref transcript
Membrane-associating domain. MARVEL domain-containing proteins are often found in lipid-associating proteins - such as Occludin and MAL family proteins. It may be part of the machinery of membrane apposition events, such as transport vesicle biogenesis.
CNGA3
rs.CNGA3.F1 rs.CNGA3.R1 480 534
NCBIGene 36.3 1261
Single exon skipping, size difference: 54
Exclusion in the protein (no frameshift)
Reference transcript: NM_001298
cd CAP_ED 117aa 8e-17
in ref transcript
effector domain of the CAP family of transcription factors; members include CAP (or cAMP receptor protein (CRP)), which binds cAMP, FNR (fumarate and nitrate reduction), which uses an iron-sulfur cluster to sense oxygen) and CooA, a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. Cyclic nucleotide-binding domain similar to CAP are also present in cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) and vertebrate cyclic nucleotide-gated ion-channels. Cyclic nucleotide-monophosphate binding domain; proteins that bind cyclic nucleotides (cAMP or cGMP) share a structural domain of about 120 residues; the best studied is the prokaryotic catabolite gene activator, CAP, where such a domain is known to be composed of three alpha-helices and a distinctive eight-stranded, antiparallel beta-barrel structure; three conserved glycine residues are thought to be essential for maintenance of the structural integrity of the beta-barrel; CooA is a homodimeric transcription factor that belongs to CAP family; cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) contain two tandem copies of the cyclic nucleotide-binding domain; cAPK's are composed of two different subunits, a catalytic chain and a regulatory chain, which contains both copies of the domain; cGPK's are single chain enzymes that include the two copies of the domain in their N-terminal section; also found in vertebrate cyclic nucleotide-gated ion-channels.
smart cNMP 120aa 1e-17
in ref transcript
Cyclic nucleotide-monophosphate binding domain. Catabolite gene activator protein (CAP) is a prokaryotic homologue of eukaryotic cNMP-binding domains, present in ion channels, and cNMP-dependent kinases.
pfam Ion_trans 140aa 3e-09
in ref transcript
Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
COG Crp 123aa 1e-06
in ref transcript
cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms].
CNTN5
rs.CNTN5.F1 rs.CNTN5.R1 176 398
NCBIGene 36.3 53942
Single exon skipping, size difference: 222
Exclusion in the protein (no frameshift)
Reference transcript: NM_014361
cd IGcam 77aa 2e-14
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IGcam 86aa 3e-13
in ref transcript
cd FN3 92aa 1e-11
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd IGcam 87aa 4e-11
in ref transcript
cd IGcam 92aa 1e-10
in ref transcript
cd FN3 84aa 2e-09
in ref transcript
cd FN3 93aa 4e-05
in ref transcript
cd IGcam 84aa 5e-05
in ref transcript
cd FN3 86aa 2e-04
in ref transcript
cd IGcam 93aa 0.002
in ref transcript
pfam I-set 89aa 1e-16
in ref transcript
Immunoglobulin I-set domain.
pfam I-set 86aa 2e-15
in ref transcript
smart IGc2 62aa 2e-13
in ref transcript
Immunoglobulin C-2 Type.
pfam fn3 85aa 4e-10
in ref transcript
Fibronectin type III domain.
pfam I-set 93aa 1e-09
in ref transcript
pfam fn3 78aa 7e-07
in ref transcript
smart IG_like 74aa 7e-06
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
pfam I-set 91aa 1e-04
in ref transcript
pfam fn3 88aa 6e-04
in ref transcript
smart FN3 78aa 7e-04
in ref transcript
Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
CNTNAP4
rs.CNTNAP4.F1 rs.CNTNAP4.R1 173 317
NCBIGene 36.3 85445
Single exon skipping, size difference: 144
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_138994
cd FA58C 123aa 2e-24
in ref transcript
Coagulation factor 5/8 C-terminal domain, discoidin domain; Cell surface-attached carbohydrate-binding domain, present in eukaryotes and assumed to have horizontally transferred to eubacterial genomes.
cd LamG 145aa 1e-21
in ref transcript
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
cd LamG 151aa 1e-17
in ref transcript
Changed!
cd LamG 96aa 3e-10
in ref transcript
cd LamG 140aa 1e-09
in ref transcript
cd FReD 48aa 0.006
in ref transcript
Fibrinogen-related domains (FReDs); C terminal globular domain of fibrinogen. Fibrinogen is involved in blood clotting, being activated by thrombin to assemble into fibrin clots. The N-termini of 2 times 3 chains come together to form a globular arrangement called the disulfide knot. The C termini of fibrinogen chains end in globular domains, which are not completely equivalent. C terminal globular domains of the gamma chains (C-gamma) dimerize and bind to the GPR motif of the N-terminal domain of the alpha chain, while the GHR motif of N-terminal domain of the beta chain binds to the C terminal globular domains of another beta chain (C-beta), which leads to lattice formation.
smart LamG 127aa 6e-25
in ref transcript
Laminin G domain.
pfam F5_F8_type_C 121aa 1e-23
in ref transcript
F5/8 type C domain. This domain is also known as the discoidin (DS) domain family.
pfam Laminin_G_2 124aa 1e-20
in ref transcript
Laminin G domain. This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.
pfam Laminin_G_2 128aa 5e-13
in ref transcript
Changed!
pfam Laminin_G_2 98aa 9e-12
in ref transcript
smart FBG 45aa 4e-04
in ref transcript
Fibrinogen-related domains (FReDs). Domain present at the C-termini of fibrinogen beta and gamma chains, and a variety of fibrinogen-related proteins, including tenascin and Drosophila scabrous.
smart COLFI 84aa 0.003
in ref transcript
Fibrillar collagens C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc.
pfam EGF 33aa 0.005
in ref transcript
EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.
Changed!
cd LamG 146aa 7e-18
in modified transcript
Changed!
pfam Laminin_G_2 127aa 7e-20
in modified transcript
COL11A2
rs.COL11A2.F1 rs.COL11A2.R1 193 271
NCBIGene 36.3 1302
Single exon skipping, size difference: 78
Exclusion in the protein (no frameshift)
Reference transcript: NM_080680
cd LamG 152aa 1e-05
in ref transcript
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
smart COLFI 196aa 6e-85
in ref transcript
Fibrillar collagens C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc.
smart TSPN 183aa 7e-56
in ref transcript
Thrombospondin N-terminal -like domains. Heparin-binding and cell adhesion domain of thrombospondin.
COL13A1
rs.COL13A1.F1 rs.COL13A1.R1 123 165
NCBIGene 36.3 1305
Single exon skipping, size difference: 42
Exclusion in the protein (no frameshift)
Reference transcript: NM_005203
COL13A1
rs.COL13A1.F2 rs.COL13A1.R2 101 146
NCBIGene 36.3 1305
Single exon skipping, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_005203
COL13A1
rs.COL13A1.F3 rs.COL13A1.R3 134 170
NCBIGene 36.3 1305
Single exon skipping, size difference: 36
Exclusion in the protein (no frameshift)
Reference transcript: NM_005203
COLQ
rs.COLQ.F1 rs.COLQ.R1 137 239
NCBIGene 36.3 8292
Single exon skipping, size difference: 102
Exclusion in the protein (no frameshift)
Reference transcript: NM_005677
TIGR myxo_disulf_rpt 25aa 5e-04
in ref transcript
This model represents a sequence region shared between several proteins of Myxococcus xanthus DK 1622 and some eukaryotic proteins that include human pappalysin-1. The region of about 40 amino acids contains several conserved Cys residues presumed to form disulfide bonds. The region appears in up to 13 repeats in Myxococcus.
pfam Collagen 63aa 0.004
in ref transcript
Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxyproline. Collagens are post translationally modified by proline hydroxylase to form the hydroxyproline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure.
COMMD5
rs.COMMD5.F1 rs.COMMD5.R1 256 410
NCBIGene 36.3 28991
Alternative 5-prime, size difference: 154
Exclusion in 5'UTR
Reference transcript: NM_014066
cd Commd5_HCaRG 110aa 3e-46
in ref transcript
COMM_Domain containing protein 5, also called HCaRG (hypertension-related, calcium-regulated gene). HCaRG is a nuclear protein that might be involved in cell proliferation; it is negatively regulated by extracellular calcium concentration, and its basal mRNA levels are higher in hypertensive animals. The COMM Domain is found at the C-terminus of a variety of proteins; presumably all COMM_Domain containing proteins are located in the nucleus and the COMM domain plays a role in protein-protein interactions. Several family members have been shown to bind and inhibit NF-kappaB.
cd Commd10 147aa 3e-06
in ref transcript
COMM_Domain containing protein 10. The COMM Domain is found at the C-terminus of a variety of proteins; presumably all COMM_Domain containing proteins are located in the nucleus and the COMM domain plays a role in protein-protein interactions. Several family members have been shown to bind and inhibit NF-kappaB.
pfam HCaRG 176aa 1e-43
in ref transcript
HCaRG protein. This family consists of several mammalian HCaRG(hypertension-related, calcium-regulated gene) proteins. HCaRG is negatively regulated by extracellular calcium concentration, and its basal mRNA levels are higher in hypertensive animals. HCaRG is a nuclear protein potentially involved in the control of cell proliferation.
COMMD9
rs.COMMD9.F1 rs.COMMD9.R1 165 291
NCBIGene 36.3 29099
Single exon skipping, size difference: 126
Exclusion in the protein (no frameshift)
Reference transcript: NM_014186
cd Commd9 108aa 2e-42
in ref transcript
COMM_Domain containing protein 9. The COMM Domain is found at the C-terminus of a variety of proteins; presumably all COMM_Domain containing proteins are located in the nucleus and the COMM domain plays a role in protein-protein interactions. Several family members have been shown to bind and inhibit NF-kappaB.
Changed!
pfam HCaRG 183aa 9e-27
in ref transcript
HCaRG protein. This family consists of several mammalian HCaRG(hypertension-related, calcium-regulated gene) proteins. HCaRG is negatively regulated by extracellular calcium concentration, and its basal mRNA levels are higher in hypertensive animals. HCaRG is a nuclear protein potentially involved in the control of cell proliferation.
Changed!
pfam HCaRG 125aa 4e-24
in modified transcript
COPA
rs.COPA.F1 rs.COPA.R1 119 146
NCBIGene 36.3 1314
Alternative 3-prime, size difference: 27
Exclusion in the protein (no frameshift)
Reference transcript: NM_001098398
cd WD40 310aa 9e-60
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
pfam COPI_C 410aa 0.0
in ref transcript
Coatomer (COPI) alpha subunit C-terminus. This family represents the C-terminus (approximately 500 residues) of the eukaryotic coatomer alpha subunit. Coatomer (COPI) is a large cytosolic protein complex which forms a coat around vesicles budding from the Golgi apparatus. Such coatomer-coated vesicles have been proposed to play a role in many distinct steps of intracellular transport. Note that many family members also contain the pfam04053 domain.
Changed!
pfam Coatomer_WDAD 452aa 0.0
in ref transcript
Coatomer WD associated region.
smart WD40 40aa 3e-07
in ref transcript
WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
pfam WD40 34aa 6e-05
in ref transcript
WD domain, G-beta repeat.
pfam WD40 39aa 6e-05
in ref transcript
smart WD40 36aa 1e-04
in ref transcript
smart WD40 39aa 5e-04
in ref transcript
COG COG2319 316aa 4e-32
in ref transcript
FOG: WD40 repeat [General function prediction only].
Changed!
pfam Coatomer_WDAD 443aa 0.0
in modified transcript
COPE
rs.COPE.F1 rs.COPE.R1 101 254
NCBIGene 36.3 11316
Single exon skipping, size difference: 153
Exclusion in the protein (no frameshift)
Reference transcript: NM_007263
cd TPR 83aa 0.007
in ref transcript
Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
Changed!
pfam Coatomer_E 291aa 1e-109
in ref transcript
Coatomer epsilon subunit. This family represents the epsilon subunit of the coatomer complex, which is involved in the regulation of intracellular protein trafficking between the endoplasmic reticulum and the Golgi complex.
Changed!
pfam Coatomer_E 240aa 4e-84
in modified transcript
CPEB1
rs.CPEB1.F1 rs.CPEB1.R1 100 115
NCBIGene 36.3 64506
Alternative 5-prime, size difference: 15
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_030594
Changed!
cd RRM 68aa 5e-05
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
cd RRM 60aa 0.005
in ref transcript
Changed!
smart RRM_2 63aa 0.001
in ref transcript
RNA recognition motif.
Changed!
cd RRM 73aa 1e-04
in modified transcript
Changed!
smart RRM_2 68aa 0.005
in modified transcript
CPSF4
rs.CPSF4.F1 rs.CPSF4.R1 112 187
NCBIGene 36.3 10898
Alternative 3-prime, size difference: 75
Exclusion in the protein (no frameshift)
Reference transcript: NM_006693
smart ZnF_C3H1 26aa 0.002
in ref transcript
zinc finger.
COG YTH1 152aa 1e-21
in ref transcript
Cleavage and polyadenylation specificity factor (CPSF) Clipper subunit and related makorin family Zn-finger proteins [General function prediction only].
CPZ
rs.CPZ.F1 rs.CPZ.R1 132 395
NCBIGene 36.3 8532
Alternative 5-prime and 3-prime, size difference: 263
Inclusion in the protein causing a new stop codon, Inclusion in the protein causing a frameshift
Reference transcript: NM_001014447
Changed!
cd M14_CPZ 395aa 0.0
in ref transcript
Peptidase M14-like domain of carboxypeptidase (CP) Z (CPZ), CPZ belongs to the N/E subfamily of the M14 family of metallocarboxypeptidases (MCPs). The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPZ is a secreted Zn-dependent enzyme whose biological function is largely unknown. Unlike other members of the N/E subfamily, CPZ has a bipartite structure, which consists of an N-terminal cysteine-rich domain (CRD) whose sequence is similar to Wnt-binding proteins, and a C-terminal CP catalytic domain that removes C-terminal Arg residues from substrates. CPZ is enriched in the extracellular matrix and is widely distributed during early embryogenesis. That the CRD of CPZ can bind to Wnt4 suggests that CPZ plays a role in Wnt signaling.
Changed!
pfam Peptidase_M14 302aa 1e-62
in ref transcript
Zinc carboxypeptidase.
Changed!
smart FRI 118aa 2e-36
in ref transcript
Frizzled. Drosophila melanogaster frizzled mediates signalling that polarises a precursor cell along the anteroposterior axis. Homologues of the N-terminal region of frizzled exist either as transmembrane or secreted molecules. Frizzled homologues are reported to be receptors for the Wnt growth factors. (Not yet in MEDLINE: the FRI domain occurs in several receptor tyrosine kinases [Xu, Y.K. and Nusse, Curr. Biol. 8 R405-R406 (1998); Masiakowski, P. and Yanopoulos, G.D., Curr. Biol. 8, R407 (1998)].
Changed!
COG COG2866 185aa 2e-06
in ref transcript
Predicted carboxypeptidase [Amino acid transport and metabolism].
CREBBP
rs.CREBBP.F1 rs.CREBBP.R1 209 323
NCBIGene 36.3 1387
Single exon skipping, size difference: 114
Exclusion in the protein (no frameshift)
Reference transcript: NM_004380
cd Bromo_cbp_like 108aa 1e-63
in ref transcript
Bromodomain, cbp_like subfamily. Cbp (CREB binding protein or CREBBP) is an acetyltransferase acting on histone, which gives a specific tag for transcriptional activation and also acetylates non-histone proteins. CREBBP binds specifically to phosphorylated CREB protein and augments the activity of phosphorylated CREB to activate transcription of cAMP-responsive genes. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
cd ZZ_CBP 41aa 2e-19
in ref transcript
Zinc finger, ZZ type. Zinc finger present in CBP/p300 and related proteins. The ZZ motif coordinates two zinc ions and most likely participates in ligand binding or molecular scaffolding. CREB-binding protein (CBP) is a large multidomain protein that provides binding sites for transcriptional coactivators, the role of the ZZ domain in CBP/p300 is unclear.
pfam DUF906 234aa 1e-136
in ref transcript
Domain of Unknown Function (DUF906).
pfam KIX 81aa 4e-38
in ref transcript
KIX domain. CBP and P300 bind to the CREB via a domain known as KIX. The KIX domain of CBP also binds to transactivation domains of other nuclear factors including Myb and Jun.
Changed!
pfam zf-TAZ 85aa 1e-35
in ref transcript
TAZ zinc finger. The TAZ2 domain of CBP binds to other transcription factors such as the p53 tumour suppressor protein, E1A oncoprotein, MyoD, and GATA-1. The zinc coordinating motif that is necessary for binding to target DNA sequences consists of HCCC.
pfam DUF902 61aa 6e-31
in ref transcript
Domain of Unknown Function (DUF902).
smart BROMO 109aa 5e-29
in ref transcript
bromo domain.
pfam zf-TAZ 80aa 5e-25
in ref transcript
pfam Creb_binding 73aa 6e-16
in ref transcript
Creb binding. The Creb binding domain assumes a structure comprising of three alpha-helices which pack in a bundle, exposing a hydrophobic groove between alpha-1 and alpha-3 within which complimentary domains found in the protein 'activator for thyroid hormone and retinoid receptors' (ACTR) can dock. Docking of these domains is required for the recruitment of RNA polymerase II and the basal transcription machinery.
smart ZnF_ZZ 43aa 1e-12
in ref transcript
Zinc-binding domain, present in Dystrophin, CREB-binding protein. Putative zinc-binding domain present in dystrophin-like proteins, and CREB-binding protein/p300 homologues. The ZZ in dystrophin appears to bind calmodulin. A missense mutation of one of the conserved cysteines in dystrophin results in a patient with Duchenne muscular dystrophy [3].
pfam PAT1 136aa 2e-04
in ref transcript
Topoisomerase II-associated protein PAT1. Members of this family are necessary for accurate chromosome transmission during cell division.
COG COG5076 109aa 1e-12
in ref transcript
Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription].
PRK PRK08853 109aa 0.001
in ref transcript
DNA polymerase III subunits gamma and tau; Validated.
Changed!
pfam zf-TAZ 58aa 5e-20
in modified transcript
CREM
rs.CREM.F1 rs.CREM.R1 210 246
NCBIGene 36.3 1390
Single exon skipping, size difference: 36
Exclusion in the protein (no frameshift)
Reference transcript: NM_182717
pfam bZIP_1 53aa 4e-05
in ref transcript
bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region.
CRKRS
rs.CRKRS.F1 rs.CRKRS.R1 129 156
NCBIGene 36.3 51755
Alternative 3-prime, size difference: 27
Exclusion in the protein (no frameshift)
Reference transcript: NM_016507
cd S_TKc 295aa 5e-73
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
Changed!
pfam Pkinase 294aa 1e-74
in ref transcript
Protein kinase domain.
PTZ PTZ00024 288aa 1e-45
in ref transcript
cyclin-dependent protein kinase; Provisional.
Changed!
smart S_TKc 284aa 1e-74
in modified transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
CRTC1
rs.CRTC1.F1 rs.CRTC1.R1 186 234
NCBIGene 36.3 23373
Single exon skipping, size difference: 48
Exclusion in the protein (no frameshift)
Reference transcript: NM_001098482
CSHL1
rs.CSHL1.F1 rs.CSHL1.R1 129 305
NCBIGene 36.3 1444
Exon skipping and alternative 3-prime or 5-prime, size difference: 176
Inclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
Reference transcript: NM_022579
Changed!
pfam Hormone_1 190aa 1e-35
in ref transcript
Somatotropin hormone family.
CSHL1
rs.CSHL1.F2 rs.CSHL1.R2 187 256
NCBIGene 36.3 1444
Alternative 3-prime, size difference: 69
Exclusion in the protein (no frameshift)
Reference transcript: NM_022579
Changed!
pfam Hormone_1 190aa 1e-35
in ref transcript
Somatotropin hormone family.
Changed!
pfam Hormone_1 167aa 5e-31
in modified transcript
CSN1S1
rs.CSN1S1.F1 rs.CSN1S1.R1 98 122
NCBIGene 36.3 1446
Single exon skipping, size difference: 24
Exclusion in the protein (no frameshift)
Reference transcript: NM_001890
CSN1S1
rs.CSN1S1.F2 rs.CSN1S1.R2 243 270
NCBIGene 36.3 1446
Single exon skipping, size difference: 24
Exclusion in the protein (no frameshift)
Reference transcript: NM_001890
CSPP1
rs.CSPP1.F1 rs.CSPP1.R1 163 268
NCBIGene 36.3 79848
Multiple exon skipping, size difference: 105
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_001077204
CTDP1
rs.CTDP1.F1 rs.CTDP1.R1 210 373
NCBIGene 36.3 9150
Single exon skipping, size difference: 163
Exclusion in the protein causing a frameshift
Reference transcript: NM_004715
cd BRCT 81aa 1e-06
in ref transcript
Breast Cancer Suppressor Protein (BRCA1), carboxy-terminal domain. The BRCT domain is found within many DNA damage repair and cell cycle checkpoint proteins. The unique diversity of this domain superfamily allows BRCT modules to interact forming homo/hetero BRCT multimers, BRCT-non-BRCT interactions, and interactions within DNA strand breaks.
cd biotinyl_domain 91aa 1e-05
in ref transcript
The biotinyl-domain or biotin carboxyl carrier protein (BCCP) domain is present in all biotin-dependent enzymes, such as acetyl-CoA carboxylase, pyruvate carboxylase, propionyl-CoA carboxylase, methylcrotonyl-CoA carboxylase, geranyl-CoA carboxylase, oxaloacetate decarboxylase, methylmalonyl-CoA decarboxylase, transcarboxylase and urea amidolyase. This domain functions in transferring CO2 from one subsite to another, allowing carboxylation, decarboxylation, or transcarboxylation. During this process, biotin is covalently attached to a specific lysine.
Changed!
pfam FCP1_C 246aa 2e-83
in ref transcript
FCP1, C-terminal. The C-terminal domain of FCP-1 is required for interaction with the carboxy terminal domain of RAP74. Interaction relies extensively on van der Waals contacts between hydrophobic residues situated within alpha-helices in both domains.
pfam NIF 167aa 2e-68
in ref transcript
NLI interacting factor-like phosphatase. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase. This region has been identified as the minimal phosphatase domain.
smart BRCT 87aa 4e-07
in ref transcript
breast cancer carboxy-terminal domain.
COG FCP1 148aa 2e-13
in ref transcript
TFIIF-interacting CTD phosphatases, including NLI-interacting factor [Transcription].
PRK PRK09282 112aa 3e-05
in ref transcript
pyruvate carboxylase subunit B; Validated.
Changed!
pfam FCP1_C 91aa 6e-37
in modified transcript
CTH
rs.CTH.F1 rs.CTH.R1 208 340
NCBIGene 36.3 1491
Single exon skipping, size difference: 132
Exclusion in the protein (no frameshift)
Reference transcript: NM_001902
Changed!
cd CGS_like 360aa 1e-155
in ref transcript
CGS_like: Cystathionine gamma-synthase is a PLP dependent enzyme and catalyzes the committed step of methionine biosynthesis. This pathway is unique to microorganisms and plants, rendering the enzyme an attractive target for the development of antimicrobials and herbicides. This subgroup also includes cystathionine gamma-lyases (CGL), O-acetylhomoserine sulfhydrylases and O-acetylhomoserine thiol lyases. CGL's are very similar to CGS's. Members of this group are widely distributed among all three forms of life.
Changed!
pfam Cys_Met_Meta_PP 377aa 1e-165
in ref transcript
Cys/Met metabolism PLP-dependent enzyme. This family includes enzymes involved in cysteine and methionine metabolism. The following are members: Cystathionine gamma-lyase, Cystathionine gamma-synthase, Cystathionine beta-lyase, Methionine gamma-lyase, OAH/OAS sulfhydrylase, O-succinylhomoserine sulfhydrylase All of these members participate is slightly different reactions. All these enzymes use PLP (pyridoxal-5'-phosphate) as a cofactor.
Changed!
COG MetC 386aa 1e-128
in ref transcript
Cystathionine beta-lyases/cystathionine gamma-synthases [Amino acid transport and metabolism].
Changed!
cd CGS_like 316aa 1e-124
in modified transcript
Changed!
pfam Cys_Met_Meta_PP 333aa 1e-134
in modified transcript
Changed!
COG MetC 342aa 1e-102
in modified transcript
CTNND1
rs.CTNND1.F1 rs.CTNND1.R1 302 375
NCBIGene 36.3 1500
Alternative 5-prime, size difference: 73
Exclusion in 5'UTR
Reference transcript: NM_001085458
cd ARM 119aa 1e-19
in ref transcript
Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
cd ARM 103aa 9e-11
in ref transcript
cd ARM 142aa 5e-05
in ref transcript
smart ARM 42aa 5e-05
in ref transcript
Armadillo/beta-catenin-like repeats. Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
smart ARM 41aa 1e-04
in ref transcript
smart ARM 36aa 6e-04
in ref transcript
CTNND1
rs.CTNND1.F2 rs.CTNND1.R2 120 183
NCBIGene 36.3 1500
Single exon skipping, size difference: 63
Exclusion in the protein (no frameshift)
Reference transcript: NM_001085458
cd ARM 119aa 1e-19
in ref transcript
Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
cd ARM 103aa 9e-11
in ref transcript
cd ARM 142aa 5e-05
in ref transcript
smart ARM 42aa 5e-05
in ref transcript
Armadillo/beta-catenin-like repeats. Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
smart ARM 41aa 1e-04
in ref transcript
smart ARM 36aa 6e-04
in ref transcript
CTNND1
rs.CTNND1.F3 rs.CTNND1.R3 102 120
NCBIGene 36.3 1500
Single exon skipping, size difference: 18
Exclusion in the protein (no frameshift)
Reference transcript: NM_001085458
cd ARM 119aa 1e-19
in ref transcript
Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
cd ARM 103aa 9e-11
in ref transcript
cd ARM 142aa 5e-05
in ref transcript
smart ARM 42aa 5e-05
in ref transcript
Armadillo/beta-catenin-like repeats. Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
smart ARM 41aa 1e-04
in ref transcript
smart ARM 36aa 6e-04
in ref transcript
CTNND1
rs.CTNND1.F4 rs.CTNND1.R4 235 322
NCBIGene 36.3 1500
Single exon skipping, size difference: 87
Exclusion in the protein (no frameshift)
Reference transcript: NM_001085458
cd ARM 119aa 1e-19
in ref transcript
Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
cd ARM 103aa 9e-11
in ref transcript
cd ARM 142aa 5e-05
in ref transcript
smart ARM 42aa 5e-05
in ref transcript
Armadillo/beta-catenin-like repeats. Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
smart ARM 41aa 1e-04
in ref transcript
smart ARM 36aa 6e-04
in ref transcript
CTNND1
rs.CTNND1.F5 rs.CTNND1.R5 110 518
NCBIGene 36.3 1500
Multiple exon skipping, size difference: 408
Exclusion in 5'UTR, Exclusion of the protein initiation site
Reference transcript: NM_001085458
cd ARM 119aa 1e-19
in ref transcript
Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
cd ARM 103aa 9e-11
in ref transcript
cd ARM 142aa 5e-05
in ref transcript
smart ARM 42aa 5e-05
in ref transcript
Armadillo/beta-catenin-like repeats. Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
smart ARM 41aa 1e-04
in ref transcript
smart ARM 36aa 6e-04
in ref transcript
CTNS
rs.CTNS.F1 rs.CTNS.R1 179 302
NCBIGene 36.3 1497
Alternative 5-prime, size difference: 123
Inclusion in 5'UTR
Reference transcript: NM_001031681
TIGR 2A43 233aa 1e-77
in ref transcript
CTSB
rs.CTSB.F1 rs.CTSB.R1 190 264
NCBIGene 36.3 1508
Single exon skipping, size difference: 74
Exclusion in 5'UTR
Reference transcript: NM_147780
cd Peptidase_C1A_CathepsinB 248aa 1e-106
in ref transcript
Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane antibody-mediated interstitial nephritis. It plays a role in renal tubulogenesis and is defective in hereditary tubulointerstitial disorders. TIN-Ag is exclusively expressed in kidney tissues.
pfam Peptidase_C1 250aa 6e-69
in ref transcript
Papain family cysteine protease.
pfam Propeptide_C1 41aa 2e-15
in ref transcript
Peptidase family C1 propeptide. This motif is found at the N terminal of some members of the Peptidase_C1 family (pfam00112) and is involved in activation of this peptidase.
PTZ PTZ00200 232aa 4e-25
in ref transcript
cysteine proteinase precursor; Provisional.
CTSB
rs.CTSB.F2 rs.CTSB.R2 103 191
NCBIGene 36.3 1508
Single exon skipping, size difference: 88
Exclusion in 5'UTR
Reference transcript: NM_147780
cd Peptidase_C1A_CathepsinB 248aa 1e-106
in ref transcript
Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane antibody-mediated interstitial nephritis. It plays a role in renal tubulogenesis and is defective in hereditary tubulointerstitial disorders. TIN-Ag is exclusively expressed in kidney tissues.
pfam Peptidase_C1 250aa 6e-69
in ref transcript
Papain family cysteine protease.
pfam Propeptide_C1 41aa 2e-15
in ref transcript
Peptidase family C1 propeptide. This motif is found at the N terminal of some members of the Peptidase_C1 family (pfam00112) and is involved in activation of this peptidase.
PTZ PTZ00200 232aa 4e-25
in ref transcript
cysteine proteinase precursor; Provisional.
CTSB
rs.CTSB.F3 rs.CTSB.R3 399 444
NCBIGene 36.3 1508
Alternative 5-prime, size difference: 45
Exclusion in 5'UTR
Reference transcript: NM_147781
cd Peptidase_C1A_CathepsinB 248aa 1e-106
in ref transcript
Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane antibody-mediated interstitial nephritis. It plays a role in renal tubulogenesis and is defective in hereditary tubulointerstitial disorders. TIN-Ag is exclusively expressed in kidney tissues.
pfam Peptidase_C1 250aa 6e-69
in ref transcript
Papain family cysteine protease.
pfam Propeptide_C1 41aa 2e-15
in ref transcript
Peptidase family C1 propeptide. This motif is found at the N terminal of some members of the Peptidase_C1 family (pfam00112) and is involved in activation of this peptidase.
PTZ PTZ00200 232aa 4e-25
in ref transcript
cysteine proteinase precursor; Provisional.
CUGBP1
rs.CUGBP1.F1 rs.CUGBP1.R1 104 116
NCBIGene 36.3 10658
Alternative 3-prime, size difference: 12
Exclusion in the protein (no frameshift)
Reference transcript: NM_001025596
cd RRM 77aa 6e-17
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
cd RRM 75aa 3e-16
in ref transcript
cd RRM 80aa 3e-14
in ref transcript
TIGR SF-CC1 181aa 4e-18
in ref transcript
A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.
smart RRM 71aa 1e-15
in ref transcript
RNA recognition motif.
COG COG0724 93aa 3e-11
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
COG COG0724 163aa 6e-09
in ref transcript
CXXC1
rs.CXXC1.F1 rs.CXXC1.R1 100 112
NCBIGene 36.3 30827
Alternative 5-prime, size difference: 12
Exclusion in the protein (no frameshift)
Reference transcript: NM_001101654
pfam zf-CXXC 37aa 1e-08
in ref transcript
CXXC zinc finger domain. This domain contains eight conserved cysteine residues that bind to two zinc ions. The CXXC domain is found in a variety of chromatin-associated proteins. This domain binds to nonmethyl-CpG dinucleotides. The domain is characterised by two CGXCXXC repeats. The RecQ helicase has a single repeat that also binds to zinc, but this has not been included in this family. The DNA binding interface has been identified by NMR.
pfam PHD 48aa 5e-08
in ref transcript
PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
Cytoplasmic Fragile-X interacting family. CYFIP1/2 (Cytoplasmic fragile X mental retardation interacting protein) like proteins for a highly conserved protein family. The function of CYFIPs is unclear, but CYFIP interaction with fragile X mental retardation interacting protein (FMRP) involves the domain of FMRP which also mediating homo- and heteromerization.
CYLD
rs.CYLD.F1 rs.CYLD.R1 243 380
NCBIGene 36.3 1540
Single exon skipping, size difference: 137
Exclusion in 5'UTR
Reference transcript: NM_015247
cd Peptidase_C19N 272aa 3e-56
in ref transcript
A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
pfam CAP_GLY 69aa 1e-13
in ref transcript
CAP-Gly domain. Cytoskeleton-associated proteins (CAPs) are involved in the organisation of microtubules and transportation of vesicles and organelles along the cytoskeletal network. A conserved motif, CAP-Gly, has been identified in a number of CAPs, including CLIP-170 and dynactins. The crystal structure of Caenorhabditis elegans F53F4.3 protein CAP-Gly domain was recently solved. The domain contains three beta-strands. The most conserved sequence, GKNDG, is located in two consecutive sharp turns on the surface, forming the entrance to a groove.
pfam CAP_GLY 77aa 9e-12
in ref transcript
pfam CAP_GLY 72aa 7e-10
in ref transcript
COG NIP100 102aa 0.001
in ref transcript
Dynactin complex subunit involved in mitotic spindle partitioning in anaphase B [Cell division and chromosome partitioning].
DACH1
rs.DACH1.F1 rs.DACH1.R1 105 549
NCBIGene 36.3 1602
Multiple exon skipping, size difference: 444
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
Reference transcript: NM_080759
pfam Ski_Sno 115aa 1e-44
in ref transcript
SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and myogenic differentiation. Sno, a Ski proto-oncogene homologue, is expressed in two isoforms and plays a role in the response to proliferation stimuli. Dachshund also contains this domain. It is involved in various aspects of development.
DACT1
rs.DACT1.F1 rs.DACT1.R1 355 466
NCBIGene 36.3 51339
Alternative 3-prime, size difference: 111
Exclusion in the protein (no frameshift)
Reference transcript: NM_016651
DBI
rs.DBI.F1 rs.DBI.R1 172 412
NCBIGene 36.3 1622
Exon skipping and alternative 3-prime or 5-prime, size difference: 240
Exclusion in 5'UTR, Exclusion of the protein initiation site
Reference transcript: NM_020548
Changed!
cd ACBP 84aa 7e-29
in ref transcript
Acyl CoA binding protein (ACBP) binds thiol esters of long fatty acids and coenzyme A in a one-to-one binding mode with high specificity and affinity. Acyl-CoAs are important intermediates in fatty lipid synthesis and fatty acid degradation and play a role in regulation of intermediary metabolism and gene regulation. The suggested role of ACBP is to act as a intracellular acyl-CoA transporter and pool former. ACBPs are present in a large group of eukaryotic species and several tissue-specific isoforms have been detected.
Changed!
pfam ACBP 85aa 2e-21
in ref transcript
Acyl CoA binding protein.
Changed!
COG ACB 81aa 3e-21
in ref transcript
Acyl-CoA-binding protein [Lipid metabolism].
Changed!
cd ACBP 85aa 1e-27
in modified transcript
Changed!
pfam ACBP 85aa 1e-20
in modified transcript
Changed!
COG ACB 86aa 2e-20
in modified transcript
DBNDD2
rs.DBNDD2.F1 rs.DBNDD2.R1 378 476
NCBIGene 36.3 55861
Alternative 5-prime, size difference: 98
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001048221
Changed!
pfam Dysbindin 86aa 3e-25
in ref transcript
Dysbindin (Dystrobrevin binding protein 1). Dysbindin is an evolutionary conserved 40-kDa coiled-coil-containing protein that binds to alpha- and beta-dystrobrevin in muscle and brain. Dystrophin and alpha-dystrobrevin are co-immunoprecipitated with dysbindin, indicating that dysbindin is DPC-associated in muscle. Dysbindin co-localises with alpha-dystrobrevin at the sarcolemma and is up-regulated in dystrophin-deficient muscle. In the brain, dysbindin is found primarily in axon bundles and especially in certain axon terminals, notably mossy fibre synaptic terminals in the cerebellum and hippocampus. Dysbindin may have implications for the molecular pathology of Duchenne muscular dystrophy and may provide an alternative route for anchoring dystrobrevin and the DPC to the muscle membrane. Genetic variation in the human dysbindin gene is also thought to be associated with Schizophrenia.
Changed!
pfam Dysbindin 73aa 3e-23
in modified transcript
DCLRE1C
rs.DCLRE1C.F1 rs.DCLRE1C.R1 104 426
NCBIGene 36.3 64421
Single exon skipping, size difference: 322
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001033855
Changed!
pfam DRMBL 99aa 2e-16
in ref transcript
DNA repair metallo-beta-lactamase. The metallo-beta-lactamase fold contains five sequence motifs. The first four motifs are found in pfam00753 and are common to all metallo-beta-lactamases. The fifth motif appears to be specific to function. This entry represents the fifth motif from metallo-beta-lactamases involved in DNA repair.
Changed!
smart Lactamase_B 115aa 3e-08
in ref transcript
Metallo-beta-lactamase superfamily. Apart from the beta-lactamases a number of other proteins contain this domain PUBMED:7588620. These proteins include thiolesterases, members of the glyoxalase II family, that catalyse the hydrolysis of S-D-lactoyl-glutathione to form glutathione and D-lactic acid and a competence protein that is essential for natural transformation in Neisseria gonorrhoeae and could be a transporter involved in DNA uptake. Except for the competence protein these proteins bind two zinc ions per molecule as cofactor.
Changed!
COG YSH1 175aa 1e-11
in ref transcript
Predicted exonuclease of the beta-lactamase fold involved in RNA processing [Translation, ribosomal structure and biogenesis].
Changed!
smart Lactamase_B 71aa 2e-04
in modified transcript
DCX
rs.DCX.F1 rs.DCX.R1 100 115
NCBIGene 36.3 1641
Alternative 5-prime, size difference: 15
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_000555
cd DCX 83aa 9e-25
in ref transcript
DCX The ubiquitin-like DCX domain is present in tandem within the N-terminal half of the doublecortin protein. Doublecortin is expressed in migrating neurons. Mutations in the gene encoding doublecortin cause lissencephaly in males and 'double-cortex syndrome' in females.
cd DCX 80aa 1e-19
in ref transcript
smart DCX 91aa 5e-30
in ref transcript
Domain in the Doublecortin (DCX) gene product. Tandemly-repeated domain in doublin, the Doublecortin gene product. Proposed to bind tubulin. Doublecortin (DCX) is mutated in human X-linked neuronal migration defects.
smart DCX 89aa 2e-25
in ref transcript
DDR1
rs.DDR1.F1 rs.DDR1.R1 100 118
NCBIGene 36.3 780
Alternative 3-prime, size difference: 18
Exclusion in the protein (no frameshift)
Reference transcript: NM_013994
Changed!
cd PTKc_DDR1 310aa 1e-174
in ref transcript
Catalytic Domain of the Protein Tyrosine Kinase, Discoidin Domain Receptor 1. Protein Tyrosine Kinase (PTK) family; mammalian Discoidin domain receptor 1 (DDR1) and homologs; catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. DDR1 is a member of the DDR subfamily, which are receptor tyr kinases (RTKs) containing an extracellular discoidin homology domain, a transmembrane segment, an extended juxtamembrane region, and an intracellular catalytic domain. The binding of the ligand, collagen, to DDRs results in a slow but sustained receptor activation. DDR1 binds to all collagens tested to date (types I-IV). It is widely expressed in many tissues. It is abundant in the brain and is also found in keratinocytes, colonic mucosa epithelium, lung epithelium, thyroid follicles, and the islets of Langerhans. During embryonic development, it is found in the developing neuroectoderm. DDR1 is a key regulator of cell morphogenesis, differentiation and proliferation. It is important in the development of the mammary gland, the vasculator and the kidney. DDR1 is also found in human leukocytes, where it facilitates cell adhesion, migration, maturation, and cytokine production.
cd FA58C 117aa 9e-28
in ref transcript
Coagulation factor 5/8 C-terminal domain, discoidin domain; Cell surface-attached carbohydrate-binding domain, present in eukaryotes and assumed to have horizontally transferred to eubacterial genomes.
Changed!
smart TyrKc 302aa 8e-97
in ref transcript
Coagulation factor 5/8 C-terminal domain, discoidin domain. Cell surface-attached carbohydrate-binding domain, present in eukaryotes and assumed to have horizontally transferred to eubacterial genomes.
Changed!
COG SPS1 297aa 9e-23
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
Changed!
cd PTKc_DDR1 304aa 1e-176
in modified transcript
Changed!
smart TyrKc 296aa 9e-99
in modified transcript
Changed!
COG SPS1 291aa 2e-23
in modified transcript
DDR1
rs.DDR1.F2 rs.DDR1.R2 230 341
NCBIGene 36.3 780
Single exon skipping, size difference: 111
Exclusion in the protein (no frameshift)
Reference transcript: NM_013994
cd PTKc_DDR1 310aa 1e-174
in ref transcript
Catalytic Domain of the Protein Tyrosine Kinase, Discoidin Domain Receptor 1. Protein Tyrosine Kinase (PTK) family; mammalian Discoidin domain receptor 1 (DDR1) and homologs; catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. DDR1 is a member of the DDR subfamily, which are receptor tyr kinases (RTKs) containing an extracellular discoidin homology domain, a transmembrane segment, an extended juxtamembrane region, and an intracellular catalytic domain. The binding of the ligand, collagen, to DDRs results in a slow but sustained receptor activation. DDR1 binds to all collagens tested to date (types I-IV). It is widely expressed in many tissues. It is abundant in the brain and is also found in keratinocytes, colonic mucosa epithelium, lung epithelium, thyroid follicles, and the islets of Langerhans. During embryonic development, it is found in the developing neuroectoderm. DDR1 is a key regulator of cell morphogenesis, differentiation and proliferation. It is important in the development of the mammary gland, the vasculator and the kidney. DDR1 is also found in human leukocytes, where it facilitates cell adhesion, migration, maturation, and cytokine production.
cd FA58C 117aa 9e-28
in ref transcript
Coagulation factor 5/8 C-terminal domain, discoidin domain; Cell surface-attached carbohydrate-binding domain, present in eukaryotes and assumed to have horizontally transferred to eubacterial genomes.
Coagulation factor 5/8 C-terminal domain, discoidin domain. Cell surface-attached carbohydrate-binding domain, present in eukaryotes and assumed to have horizontally transferred to eubacterial genomes.
COG SPS1 297aa 9e-23
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
DGKG
rs.DGKG.F1 rs.DGKG.R1 142 217
NCBIGene 36.3 1608
Single exon skipping, size difference: 75
Exclusion in the protein (no frameshift)
Reference transcript: NM_001346
cd C1 50aa 7e-11
in ref transcript
Protein kinase C conserved region 1 (C1) . Cysteine-rich zinc binding domain. Some members of this domain family bind phorbol esters and diacylglycerol, some are reported to bind RasGTP. May occur in tandem arrangement. Diacylglycerol (DAG) is a second messenger, released by activation of Phospholipase D. Phorbol Esters (PE) can act as analogues of DAG and mimic its downstream effects in, for example, tumor promotion. Protein Kinases C are activated by DAG/PE, this activation is mediated by their N-terminal conserved region (C1). DAG/PE binding may be phospholipid dependent. C1 domains may also mediate DAG/PE signals in chimaerins (a family of Rac GTPase activating proteins), RasGRPs (exchange factors for Ras/Rap1), and Munc13 isoforms (scaffolding proteins involved in exocytosis).
cd EFh 70aa 4e-08
in ref transcript
EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
pfam DAGK_acc 175aa 8e-87
in ref transcript
Diacylglycerol kinase accessory domain. Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown.
Changed!
smart DAGKc 122aa 2e-47
in ref transcript
Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. DAG can be produced from the hydrolysis of phosphatidylinositol 4,5-bisphosphate (PIP2) by a phosphoinositide-specific phospholipase C and by the degradation of phosphatidylcholine (PC) by a phospholipase C or the concerted actions of phospholipase D and phosphatidate phosphohydrolase. This domain is presumed to be the catalytic domain. Bacterial homologues areknown.
pfam C1_1 50aa 1e-08
in ref transcript
Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain.
smart C1 41aa 4e-05
in ref transcript
Protein kinase C conserved region 1 (C1) domains (Cysteine-rich domains). Some bind phorbol esters and diacylglycerol. Some bind RasGTP. Zinc-binding domains.
Changed!
COG LCB5 327aa 4e-10
in ref transcript
Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase [Lipid metabolism / General function prediction only].
COG FRQ1 76aa 7e-04
in ref transcript
Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].
Changed!
smart DAGKc 97aa 1e-32
in modified transcript
Changed!
COG LCB5 277aa 1e-07
in modified transcript
DGKG
rs.DGKG.F2 rs.DGKG.R2 100 217
NCBIGene 36.3 1608
Single exon skipping, size difference: 117
Exclusion in the protein (no frameshift)
Reference transcript: NM_001346
cd C1 50aa 7e-11
in ref transcript
Protein kinase C conserved region 1 (C1) . Cysteine-rich zinc binding domain. Some members of this domain family bind phorbol esters and diacylglycerol, some are reported to bind RasGTP. May occur in tandem arrangement. Diacylglycerol (DAG) is a second messenger, released by activation of Phospholipase D. Phorbol Esters (PE) can act as analogues of DAG and mimic its downstream effects in, for example, tumor promotion. Protein Kinases C are activated by DAG/PE, this activation is mediated by their N-terminal conserved region (C1). DAG/PE binding may be phospholipid dependent. C1 domains may also mediate DAG/PE signals in chimaerins (a family of Rac GTPase activating proteins), RasGRPs (exchange factors for Ras/Rap1), and Munc13 isoforms (scaffolding proteins involved in exocytosis).
cd EFh 70aa 4e-08
in ref transcript
EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
pfam DAGK_acc 175aa 8e-87
in ref transcript
Diacylglycerol kinase accessory domain. Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown.
smart DAGKc 122aa 2e-47
in ref transcript
Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. DAG can be produced from the hydrolysis of phosphatidylinositol 4,5-bisphosphate (PIP2) by a phosphoinositide-specific phospholipase C and by the degradation of phosphatidylcholine (PC) by a phospholipase C or the concerted actions of phospholipase D and phosphatidate phosphohydrolase. This domain is presumed to be the catalytic domain. Bacterial homologues areknown.
pfam C1_1 50aa 1e-08
in ref transcript
Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain.
Changed!
smart C1 41aa 4e-05
in ref transcript
Protein kinase C conserved region 1 (C1) domains (Cysteine-rich domains). Some bind phorbol esters and diacylglycerol. Some bind RasGTP. Zinc-binding domains.
COG LCB5 327aa 4e-10
in ref transcript
Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase [Lipid metabolism / General function prediction only].
COG FRQ1 76aa 7e-04
in ref transcript
Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].
DHRS2
rs.DHRS2.F1 rs.DHRS2.R1 99 110
NCBIGene 36.3 10202
Alternative 5-prime, size difference: 11
Exclusion in the protein causing a frameshift
Reference transcript: NM_182908
cd NAD_bind_H4MPT_DH 53aa 2e-04
in ref transcript
NADP binding domain of methylene tetrahydromethanopterin dehydrogenase. Methylene Tetrahydromethanopterin Dehydrogenase (H4MPT DH) NADP binding domain. NADP-dependent H4MPT DH catalyzes the dehydrogenation of methylene- H4MPT and methylene-tetrahydrofolate (H4F) with NADP+ as cofactor. H4F and H4MPT are both cofactors that carry the one-carbon units between the formyl and methyl oxidation level. H4F and H4MPT are structurally analogous to each other with respect to the pterin moiety, but each has distinct side chain. H4MPT is present only in anaerobic methanogenic archaea and aerobic methylotrophic proteobacteria. H4MPT seems to have evolved independently from H4F and functions as a distinct carrier in C1 metabolism. Amino acid DH-like NAD(P)-binding domains are members of the Rossmann fold superfamily and include glutamate, leucine, and phenylalanine DHs, methylene tetrahydrofolate DH, methylene-tetrahydromethanopterin DH, methylene-tetrahydropholate DH/cyclohydrolase, Shikimate DH-like proteins, malate oxidoreductases, and glutamyl tRNA reductase. Amino acid DHs catalyze the deamination of amino acids to keto acids with NAD(P)+ as a cofactor. The NAD(P)-binding Rossmann fold superfamily includes a wide variety of protein families including NAD(P)- binding domains of alcohol DHs, tyrosine-dependent oxidoreductases, glyceraldehyde-3-phosphate DH, lactate/malate DHs, formate/glycerate DHs, siroheme synthases, 6-phosphogluconate DH, amino acid DHs, repressor rex, NAD-binding potassium channel domain, CoA-binding, and ornithine cyclodeaminase-like domains. These domains have an alpha-beta-alpha configuration. NAD binding involves numerous hydrogen and van der Waals contacts.
Changed!
TIGR 3oxo_ACP_reduc 187aa 2e-30
in ref transcript
This model represents 3-oxoacyl-[ACP] reductase, also called 3-ketoacyl-acyl carrier protein reductase, an enzyme of fatty acid biosynthesis.
Inclusion in the protein causing a new stop codon, Exclusion in the protein causing a frameshift
Reference transcript: NM_019887
Changed!
pfam Smac_DIABLO 234aa 1e-100
in ref transcript
Second Mitochondria-derived Activator of Caspases. Second Mitochondria-derived Activator of Caspases promotes apoptosis by activating caspases in the cytochrome c/Apaf-1/caspase-9 pathway, and by opposing the inhibitory activity of inhibitor of apoptosis proteins (XIAP-BIR3). The protein assumes an elongated three-helix bundle structure, and forms a dimer in solution.
DIAPH1
rs.DIAPH1.F1 rs.DIAPH1.R1 131 158
NCBIGene 36.3 1729
Single exon skipping, size difference: 27
Exclusion in the protein (no frameshift)
Reference transcript: NM_005219
smart FH2 438aa 1e-111
in ref transcript
Formin Homology 2 Domain. FH proteins control rearrangements of the actin cytoskeleton, especially in the context of cytokinesis and cell polarisation. Members of this family have been found to interact with Rho-GTPases, profilin and other actin-assoziated proteins. These interactions are mediated by the proline-rich FH1 domain, usually located in front of FH2 (but not listed in SMART). Despite this cytosolic function, vertebrate formins have been assigned functions within the nucleus. A set of Formin-Binding Proteins (FBPs) has been shown to bind FH1 with their WW domain.
pfam Drf_GBD 186aa 3e-52
in ref transcript
Diaphanous GTPase-binding Domain. This domain is bound to by GTP-attached Rho proteins, leading to activation of the Drf protein.
pfam Drf_FH3 193aa 1e-49
in ref transcript
Diaphanous FH3 Domain. This region is found in the Formin-like and and diaphanous proteins.
pfam Drf_FH1 29aa 9e-08
in ref transcript
Formin Homology Region 1. This region is found in some of the Diaphanous related formins (Drfs). It consists of low complexity repeats of around 12 residues.
Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_004087
cd GMPK 117aa 2e-24
in ref transcript
Guanosine monophosphate kinase (GMPK, EC 2.7.4.8), also known as guanylate kinase (GKase), catalyzes the reversible phosphoryl transfer from adenosine triphosphate (ATP) to guanosine monophosphate (GMP) to yield adenosine diphosphate (ADP) and guanosine diphosphate (GDP). It plays an essential role in the biosynthesis of guanosine triphosphate (GTP). This enzyme is also important for the activation of some antiviral and anticancer agents, such as acyclovir, ganciclovir, carbovir, and thiopurines.
cd PDZ_signaling 85aa 4e-14
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
cd PDZ_signaling 87aa 1e-13
in ref transcript
cd PDZ_signaling 81aa 2e-11
in ref transcript
cd SH3 59aa 2e-06
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
smart GuKc 179aa 2e-51
in ref transcript
Guanylate kinase homologues. Active enzymes catalyze ATP-dependent phosphorylation of GMP to GDP. Structure resembles that of adenylate kinase. So-called membrane-associated guanylate kinase homologues (MAGUKs) do not possess guanylate kinase activities; instead at least some possess protein-binding functions.
pfam MAGUK_N_PEST 117aa 2e-33
in ref transcript
Polyubiquitination (PEST) N-terminal domain of MAGUK. The residues upstream of this domain are the probable palmitoylation sites, particularly two cysteines. The domain has a putative PEST site at the very start that seems to be responsible for poly-ubiquitination. PEST domains are polypeptide sequences enriched in proline (P), glutamic acid (E), serine (S) and threonine (T) that target proteins for rapid destruction. The whole domain, in conjunction with a C-terminal domain of the longer protein, is necessary for dimerisation of the whole protein.
pfam L27_1 64aa 2e-28
in ref transcript
L27_1. The L27 domain is a protein interaction module that exists in a large family of scaffold proteins, functioning as an organisation centre of large protein assemblies required for the establishment and maintenance of cell polarity. L27 domains form specific heterotetrameric complexes, in which each domain contains three alpha-helices.
pfam PDZ_assoc 62aa 3e-24
in ref transcript
PDZ-associated domain of NMDA receptors. This domain is found in higher eukaryotes between the second and third PDZ domains, pfam00595, of glutamate receptor like proteins. Its exact function is not known.
smart PDZ 88aa 8e-17
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
smart PDZ 90aa 3e-16
in ref transcript
smart PDZ 85aa 5e-14
in ref transcript
smart SH3 59aa 3e-07
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
Inclusion in the protein causing a frameshift, Inclusion in the protein (no stop codon or frameshift), Exclusion in the protein causing a frameshift
Reference transcript: NM_021120
cd GMPK 117aa 2e-25
in ref transcript
Guanosine monophosphate kinase (GMPK, EC 2.7.4.8), also known as guanylate kinase (GKase), catalyzes the reversible phosphoryl transfer from adenosine triphosphate (ATP) to guanosine monophosphate (GMP) to yield adenosine diphosphate (ADP) and guanosine diphosphate (GDP). It plays an essential role in the biosynthesis of guanosine triphosphate (GTP). This enzyme is also important for the activation of some antiviral and anticancer agents, such as acyclovir, ganciclovir, carbovir, and thiopurines.
cd PDZ_signaling 81aa 3e-17
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
cd PDZ_signaling 87aa 3e-14
in ref transcript
cd PDZ_signaling 77aa 2e-11
in ref transcript
cd SH3 59aa 4e-07
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
smart GuKc 177aa 6e-55
in ref transcript
Guanylate kinase homologues. Active enzymes catalyze ATP-dependent phosphorylation of GMP to GDP. Structure resembles that of adenylate kinase. So-called membrane-associated guanylate kinase homologues (MAGUKs) do not possess guanylate kinase activities; instead at least some possess protein-binding functions.
pfam PDZ_assoc 75aa 1e-22
in ref transcript
PDZ-associated domain of NMDA receptors. This domain is found in higher eukaryotes between the second and third PDZ domains, pfam00595, of glutamate receptor like proteins. Its exact function is not known.
smart PDZ 85aa 2e-20
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
smart PDZ 90aa 7e-17
in ref transcript
smart PDZ 77aa 6e-13
in ref transcript
smart SH3 59aa 1e-08
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
pfam MAGUK_N_PEST 74aa 0.001
in ref transcript
Polyubiquitination (PEST) N-terminal domain of MAGUK. The residues upstream of this domain are the probable palmitoylation sites, particularly two cysteines. The domain has a putative PEST site at the very start that seems to be responsible for poly-ubiquitination. PEST domains are polypeptide sequences enriched in proline (P), glutamic acid (E), serine (S) and threonine (T) that target proteins for rapid destruction. The whole domain, in conjunction with a C-terminal domain of the longer protein, is necessary for dimerisation of the whole protein.
Alternative 5-prime and 3-prime, size difference: 15
Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_147192
cd homeodomain 54aa 8e-15
in ref transcript
Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
Dentin matrix protein 1 (DMP1). This family consists of several mammalian dentin matrix protein 1 (DMP1) sequences. The dentin matrix acidic phosphoprotein 1 (DMP1) gene has been mapped to human chromosome 4q21. DMP1 is a bone and teeth specific protein initially identified from mineralised dentin. DMP1 is primarily localised in the nuclear compartment of undifferentiated osteoblasts. In the nucleus, DMP1 acts as a transcriptional component for activation of osteoblast-specific genes like osteocalcin. During the early phase of osteoblast maturation, Ca(2+) surges into the nucleus from the cytoplasm, triggering the phosphorylation of DMP1 by a nuclear isoform of casein kinase II. This phosphorylated DMP1 is then exported out into the extracellular matrix, where it regulates nucleation of hydroxyapatite. DMP1 is a unique molecule that initiates osteoblast differentiation by transcription in the nucleus and orchestrates mineralised matrix formation extracellularly, at later stages of osteoblast maturation. The DMP1 gene has been found to be ectopically expressed in lung cancer although the reason for this is unknown.
pfam DMP1 35aa 1e-04
in ref transcript
Changed!
pfam DMP1 359aa 2e-61
in modified transcript
DMPK
rs.DMPK.F1 rs.DMPK.R1 107 122
NCBIGene 36.3 1760
Alternative 5-prime, size difference: 15
Exclusion in the protein (no frameshift)
Reference transcript: NM_001081563
Changed!
cd STKc_DMPK_like 338aa 1e-170
in ref transcript
STKc_DMPK_like: Serine/Threonine Kinases (STKs), Myotonic Dystrophy protein kinase (DMPK)-like subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The DMPK-like subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. The DMPK-like subfamily is composed of DMPK and DMPK-related cell division control protein 42 (Cdc42) binding kinase (MRCK). Three isoforms of MRCK are known, named alpha, beta and gamma. The DMPK gene is implicated in myotonic dystrophy 1 (DM1), an inherited multisystemic disorder with symptoms that include muscle hyperexcitability, progressive muscle weakness and wasting, cataract development, testicular atrophy, and cardiac conduction defects. The genetic basis for DM1 is the mutational expansion of a CTG repeat in the 3'-UTR of DMPK. DMPK is expressed in skeletal and cardiac muscles, and in central nervous tissues. The functional role of DMPK is not fully understood. It may play a role in the signal transduction and homeostasis of calcium. MRCK is activated via interaction with the small GTPase Cdc42. MRCK/Cdc42 signaling mediates myosin-dependent cell motility. MRCKgamma is expressed in heart and skeletal muscles, unlike MRCKalpha and MRCKbeta, which are expressed ubiquitously.
smart S_TKc 259aa 8e-69
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
pfam DMPK_coil 61aa 2e-22
in ref transcript
DMPK coiled coil domain like. This domain is found in the myotonic dystrophy protein kinase (DMPK) and adopts a coiled coil structure. It plays a role in dimerisation.
Changed!
smart S_TK_X 67aa 5e-08
in ref transcript
Extension to Ser/Thr-type protein kinases.
PTZ PTZ00263 299aa 1e-56
in ref transcript
protein kinase A catalytic subunit; Provisional.
COG Smc 63aa 0.008
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
Changed!
cd STKc_DMPK_like 333aa 1e-171
in modified transcript
Changed!
smart S_TK_X 62aa 9e-09
in modified transcript
DNM1L
rs.DNM1L.F1 rs.DNM1L.R1 207 285
NCBIGene 36.3 10059
Single exon skipping, size difference: 78
Exclusion in the protein (no frameshift)
Reference transcript: NM_012062
pfam Dynamin_M 292aa 1e-106
in ref transcript
Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169.
smart DYNc 255aa 1e-90
in ref transcript
Dynamin, GTPase. Large GTPases that mediate vesicle trafficking. Dynamin participates in the endocytic uptake of receptors, associated ligands, and plasma membrane following an exocytic event.
pfam GED 92aa 3e-23
in ref transcript
Dynamin GTPase effector domain.
Changed!
COG COG0699 414aa 2e-32
in ref transcript
Predicted GTPases (dynamin-related) [General function prediction only].
Changed!
COG COG0699 616aa 3e-36
in modified transcript
DNM2
rs.DNM2.F1 rs.DNM2.R1 103 115
NCBIGene 36.3 1785
Single exon skipping, size difference: 12
Exclusion in the protein (no frameshift)
Reference transcript: NM_001005361
cd PH_dynamin 110aa 2e-47
in ref transcript
Dynamin pleckstrin homology (PH) domain. Dynamin is a GTPase that regulates endocytic vesicle formation. It has an N-terminal GTPase domain, followed by a PH domain, a GTPase effector domain and a C-terminal proline arginine rich domain. Dynamin-like proteins, which are found in metazoa, plants and yeast have the same domain architecture as dynamin, but lack the PH domain. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
smart DYNc 240aa 1e-130
in ref transcript
Dynamin, GTPase. Large GTPases that mediate vesicle trafficking. Dynamin participates in the endocytic uptake of receptors, associated ligands, and plasma membrane following an exocytic event.
pfam Dynamin_M 294aa 1e-104
in ref transcript
Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169.
smart GED 92aa 2e-20
in ref transcript
Dynamin GTPase effector domain.
smart PH 104aa 3e-07
in ref transcript
Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
Changed!
COG COG0699 424aa 5e-31
in ref transcript
Predicted GTPases (dynamin-related) [General function prediction only].
Changed!
COG COG0699 455aa 3e-31
in modified transcript
DNMT3B
rs.DNMT3B.F1 rs.DNMT3B.R1 188 377
NCBIGene 36.3 1789
Multiple exon skipping, size difference: 189
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_006892
cd Dnmt3b_related 87aa 2e-35
in ref transcript
The PWWP domain is an essential component of DNA methyltransferase 3 B (Dnmt3b) which is responsible for establishing DNA methylation patterns during embryogenesis and gametogenesis. In tumorigenesis, DNA methylation by Dnmt3b is known to play a role in the inactivation of tumor suppressor genes. In addition, a point mutation in the PWWP domain of Dnmt3b has been identified in patients with ICF syndrome (immunodeficiency, centromeric instability, and facial anomalies), a rare autosomal recessive disorder characterized by hypomethylation of classical satellite DNA. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
Changed!
cd Cyt_C5_DNA_methylase 271aa 4e-13
in ref transcript
Cytosine-C5 specific DNA methylases; Methyl transfer reactions play an important role in many aspects of biology. Cytosine-specific DNA methylases are found both in prokaryotes and eukaryotes. DNA methylation, or the covalent addition of a methyl group to cytosine within the context of the CpG dinucleotide, has profound effects on the mammalian genome. These effects include transcriptional repression via inhibition of transcription factor binding or the recruitment of methyl-binding proteins and their associated chromatin remodeling factors, X chromosome inactivation, imprinting and the suppression of parasitic DNA sequences. DNA methylation is also essential for proper embryonic development and is an important player in both DNA repair and genome stability.
pfam PWWP 73aa 8e-24
in ref transcript
PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown.
pfam DNA_methylase 123aa 8e-08
in ref transcript
C-5 cytosine-specific DNA methylase.
COG Dcm 160aa 5e-13
in ref transcript
Site-specific DNA methylase [DNA replication, recombination, and repair].
Changed!
cd Cyt_C5_DNA_methylase 134aa 9e-09
in modified transcript
DNMT3B
rs.DNMT3B.F2 rs.DNMT3B.R2 135 195
NCBIGene 36.3 1789
Single exon skipping, size difference: 60
Exclusion in the protein (no frameshift)
Reference transcript: NM_006892
cd Dnmt3b_related 87aa 2e-35
in ref transcript
The PWWP domain is an essential component of DNA methyltransferase 3 B (Dnmt3b) which is responsible for establishing DNA methylation patterns during embryogenesis and gametogenesis. In tumorigenesis, DNA methylation by Dnmt3b is known to play a role in the inactivation of tumor suppressor genes. In addition, a point mutation in the PWWP domain of Dnmt3b has been identified in patients with ICF syndrome (immunodeficiency, centromeric instability, and facial anomalies), a rare autosomal recessive disorder characterized by hypomethylation of classical satellite DNA. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
cd Cyt_C5_DNA_methylase 271aa 4e-13
in ref transcript
Cytosine-C5 specific DNA methylases; Methyl transfer reactions play an important role in many aspects of biology. Cytosine-specific DNA methylases are found both in prokaryotes and eukaryotes. DNA methylation, or the covalent addition of a methyl group to cytosine within the context of the CpG dinucleotide, has profound effects on the mammalian genome. These effects include transcriptional repression via inhibition of transcription factor binding or the recruitment of methyl-binding proteins and their associated chromatin remodeling factors, X chromosome inactivation, imprinting and the suppression of parasitic DNA sequences. DNA methylation is also essential for proper embryonic development and is an important player in both DNA repair and genome stability.
pfam PWWP 73aa 8e-24
in ref transcript
PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown.
pfam DNA_methylase 123aa 8e-08
in ref transcript
C-5 cytosine-specific DNA methylase.
COG Dcm 160aa 5e-13
in ref transcript
Site-specific DNA methylase [DNA replication, recombination, and repair].
DPH3
rs.DPH3.F1 rs.DPH3.R1 260 335
NCBIGene 36.3 285381
Single exon skipping, size difference: 75
Exclusion in the protein (no frameshift)
Reference transcript: NM_206831
Changed!
pfam zf-CSL 55aa 3e-18
in ref transcript
CSL zinc finger. This is a zinc binding motif which contains four cysteine residues which chelate zinc. This domain is often found associated with a pfam00226 domain. This domain is named after the conserved motif of the final cysteine.
Changed!
COG COG5216 61aa 1e-16
in ref transcript
Uncharacterized conserved protein [Function unknown].
Changed!
pfam zf-CSL 40aa 1e-07
in modified transcript
Changed!
COG COG5216 42aa 1e-06
in modified transcript
DPH5
rs.DPH5.F1 rs.DPH5.R1 109 127
NCBIGene 36.3 51611
Alternative 5-prime, size difference: 18
Exclusion in 5'UTR
Reference transcript: NM_001077394
TIGR dph5 271aa 1e-80
in ref transcript
This protein participates in the modification of a specific His of elongation factor 2 of eukarotes and Archaea to diphthamide. The protein was characterized in Saccharomyces cerevisiae and designated DPH5.
PTZ PTZ00175 272aa 1e-122
in ref transcript
diphthine synthase; Provisional.
DSC3
rs.DSC3.F1 rs.DSC3.R1 233 276
NCBIGene 36.3 1825
Single exon skipping, size difference: 43
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001941
cd CA 218aa 9e-36
in ref transcript
Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here.
cd CA 209aa 8e-31
in ref transcript
cd CA 210aa 4e-24
in ref transcript
Changed!
cd CA 193aa 1e-19
in ref transcript
pfam Cadherin_pro 87aa 5e-18
in ref transcript
Cadherin prodomain like. Cadherins are a family of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This domain corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions.
pfam Cadherin 99aa 9e-17
in ref transcript
Cadherin domain.
pfam Cadherin 103aa 2e-11
in ref transcript
smart CA 77aa 4e-11
in ref transcript
Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
pfam Cadherin 95aa 1e-10
in ref transcript
Changed!
pfam Cadherin_C 63aa 1e-04
in ref transcript
Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic binding units. A key determinant to the strength of the binding that it is mediated by cadherins is the juxtamembrane region of the cadherin. This region induces clustering and also binds to the protein p120ctn.
Changed!
pfam Cadherin 74aa 0.009
in ref transcript
Changed!
cd CA 192aa 2e-19
in modified transcript
DTNB
rs.DTNB.F1 rs.DTNB.R1 119 140
NCBIGene 36.3 1838
Single exon skipping, size difference: 21
Exclusion in the protein (no frameshift)
Reference transcript: NM_021907
cd ZZ_dystrophin 49aa 4e-18
in ref transcript
Zinc finger, ZZ type. Zinc finger present in dystrophin and dystrobrevin. The ZZ motif coordinates two zinc ions and most likely participates in ligand binding or molecular scaffolding. Dystrophin attaches actin filaments to an integral membrane glycoprotein complex in muscle cells. The ZZ domain in dystrophin has been shown to be essential for binding to the membrane protein beta-dystroglycan.
pfam efhand_1 127aa 3e-47
in ref transcript
EF hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
pfam efhand_2 89aa 3e-37
in ref transcript
EF-hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
smart ZnF_ZZ 45aa 3e-11
in ref transcript
Zinc-binding domain, present in Dystrophin, CREB-binding protein. Putative zinc-binding domain present in dystrophin-like proteins, and CREB-binding protein/p300 homologues. The ZZ in dystrophin appears to bind calmodulin. A missense mutation of one of the conserved cysteines in dystrophin results in a patient with Duchenne muscular dystrophy [3].
Changed!
TIGR SMC_prok_B 66aa 0.008
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
Changed!
PRK PRK05431 65aa 0.003
in ref transcript
seryl-tRNA synthetase; Provisional.
Changed!
TIGR SMC_prok_A 90aa 4e-04
in modified transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
Changed!
PRK PRK05431 71aa 7e-05
in modified transcript
DTX2
rs.DTX2.F1 rs.DTX2.R1 153 273
NCBIGene 36.3 113878
Single exon skipping, size difference: 120
Exclusion in 5'UTR
Reference transcript: NM_020892
smart WWE 72aa 7e-20
in ref transcript
Domain in Deltex and TRIP12 homologues. Possibly involved in regulation of ubiquitin-mediated proteolysis.
smart WWE 85aa 7e-20
in ref transcript
DTX2
rs.DTX2.F2 rs.DTX2.R2 402 543
NCBIGene 36.3 113878
Single exon skipping, size difference: 141
Exclusion in the protein (no frameshift)
Reference transcript: NM_001102595
smart WWE 72aa 7e-20
in ref transcript
Domain in Deltex and TRIP12 homologues. Possibly involved in regulation of ubiquitin-mediated proteolysis.
smart WWE 85aa 7e-20
in ref transcript
DUSP13
rs.DUSP13.F1 rs.DUSP13.R1 100 526
NCBIGene 36.3 51207
Multiple exon skipping, size difference: 426
Exclusion in 5'UTR, Exclusion of the protein initiation site
Reference transcript: NM_001007271
cd DSPc 144aa 2e-32
in ref transcript
Dual specificity phosphatases (DSP); Ser/Thr and Tyr protein phosphatases. Structurally similar to tyrosine-specific phosphatases but with a shallower active site cleft and a distinctive active site signature motif, HCxxGxxR. Characterized as VHR- or Cdc25-like.
Dual specificity phosphatases (DSP); Ser/Thr and Tyr protein phosphatases. Structurally similar to tyrosine-specific phosphatases but with a shallower active site cleft and a distinctive active site signature motif, HCxxGxxR. Characterized as VHR- or Cdc25-like.
Changed!
cd DSP_MapKP 131aa 2e-35
in ref transcript
N-terminal regulatory rhodanese domain of dual specificity phosphatases (DSP), such as Mapk Phosphatase. This domain is believed to determine substrate specificity by binding the substrate, such as ERK2, and activating the C-terminal catalytic domain by inducing a conformational change. This domain has homology to the Rhodanese Homology Domain.
Changed!
smart DSPc 140aa 1e-52
in ref transcript
Dual specificity phosphatase, catalytic domain.
Changed!
smart RHOD 114aa 7e-09
in ref transcript
Rhodanese Homology Domain. An alpha beta fold found duplicated in the Rhodanese protein. The the Cysteine containing enzymatically active version of the domain is also found in the CDC25 class of protein phosphatases and a variety of proteins such as sulfide dehydrogenases and stress proteins such as Senesence specific protein 1 in plants, PspE and GlpE in bacteria and cyanide and arsenate resistance proteins. Inactive versions with a loss of the cysteine are also seen in Dual specificity phosphatases, ubiquitin hydrolases from yeast and in sulfuryltransferases. These are likely to play a role in protein interactions.
Rhodanese-related sulfurtransferase [Inorganic ion transport and metabolism].
Changed!
cd DSP_MapKP 118aa 2e-28
in modified transcript
Changed!
cd DSPc 102aa 3e-26
in modified transcript
Changed!
smart DSPc 67aa 4e-26
in modified transcript
Changed!
smart RHOD 76aa 3e-08
in modified transcript
Changed!
COG CDC14 77aa 6e-09
in modified transcript
DYNLL1
rs.DYNLL1.F1 rs.DYNLL1.R1 112 126
NCBIGene 36.3 8655
Alternative 3-prime, size difference: 14
Inclusion in 5'UTR
Reference transcript: NM_003746
pfam Dynein_light 89aa 2e-38
in ref transcript
Dynein light chain type 1.
PTZ PTZ00059 89aa 9e-47
in ref transcript
dynein light chain; Provisional.
E2F6
rs.E2F6.F1 rs.E2F6.R1 201 334
NCBIGene 36.3 1876
Single exon skipping, size difference: 133
Inclusion in the protein causing a new stop codon
Reference transcript: NM_198256
Changed!
pfam E2F_TDP 66aa 5e-22
in ref transcript
E2F/DP family winged-helix DNA-binding domain. This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in association with TDP1/2, the heterodimer having increased binding efficiency. The crystal structure of an E2F4-DP2-DNA complex shows that the DNA-binding domains of the E2F and DP proteins both have a fold related to the winged-helix DNA-binding motif. Recognition of the central c/gGCGCg/c sequence of the consensus DNA-binding site is symmetric, and amino acids that contact these bases are conserved among all known E2F and DP proteins.
E2F6
rs.E2F6.F2 rs.E2F6.R2 128 199
NCBIGene 36.3 1876
Single exon skipping, size difference: 71
Inclusion in the protein causing a new stop codon
Reference transcript: NM_198256
Changed!
pfam E2F_TDP 66aa 5e-22
in ref transcript
E2F/DP family winged-helix DNA-binding domain. This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in association with TDP1/2, the heterodimer having increased binding efficiency. The crystal structure of an E2F4-DP2-DNA complex shows that the DNA-binding domains of the E2F and DP proteins both have a fold related to the winged-helix DNA-binding motif. Recognition of the central c/gGCGCg/c sequence of the consensus DNA-binding site is symmetric, and amino acids that contact these bases are conserved among all known E2F and DP proteins.
ECE2
rs.ECE2.F1 rs.ECE2.R1 261 399
NCBIGene 36.3 9718
Alternative 5-prime, size difference: 138
Exclusion in the protein (no frameshift)
Reference transcript: NM_001100120
pfam Peptidase_M13_N 386aa 1e-124
in ref transcript
Peptidase family M13. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk.
pfam Peptidase_M13 204aa 2e-55
in ref transcript
Peptidase family M13. Mammalian enzymes are typically type-II membrane anchored enzymes which are known, or believed to activate or inactivate oligopeptide (pro)-hormones such as opioid peptides. The family also contains a bacterial member believed to be involved with milk protein cleavage.
COG PepO 654aa 1e-121
in ref transcript
Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones].
ECHDC1
rs.ECHDC1.F1 rs.ECHDC1.R1 150 203
NCBIGene 36.3 55862
Single exon skipping, size difference: 53
Exclusion in the protein causing a frameshift
Reference transcript: NM_001002030
Changed!
cd crotonase-like 192aa 3e-42
in ref transcript
Crotonase/Enoyl-Coenzyme A (CoA) hydratase superfamily. This superfamily contains a diverse set of enzymes including enoyl-CoA hydratase, napthoate synthase, methylmalonyl-CoA decarboxylase, 3-hydoxybutyryl-CoA dehydratase, and dienoyl-CoA isomerase. Many of these play important roles in fatty acid metabolism. In addition to a conserved structural core and the formation of trimers (or dimers of trimers), a common feature in this superfamily is the stabilization of an enolate anion intermediate derived from an acyl-CoA substrate. This is accomplished by two conserved backbone NH groups in active sites that form an oxyanion hole.
Changed!
pfam ECH 167aa 3e-22
in ref transcript
Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase.
Changed!
cd crotonase-like 71aa 8e-14
in modified transcript
Changed!
pfam ECH 52aa 2e-05
in modified transcript
Changed!
COG CaiD 71aa 6e-11
in modified transcript
ECHDC1
rs.ECHDC1.F2 rs.ECHDC1.R2 109 331
NCBIGene 36.3 55862
Single exon skipping, size difference: 222
Exclusion of the protein initiation site
Reference transcript: NM_001002030
Changed!
cd crotonase-like 192aa 3e-42
in ref transcript
Crotonase/Enoyl-Coenzyme A (CoA) hydratase superfamily. This superfamily contains a diverse set of enzymes including enoyl-CoA hydratase, napthoate synthase, methylmalonyl-CoA decarboxylase, 3-hydoxybutyryl-CoA dehydratase, and dienoyl-CoA isomerase. Many of these play important roles in fatty acid metabolism. In addition to a conserved structural core and the formation of trimers (or dimers of trimers), a common feature in this superfamily is the stabilization of an enolate anion intermediate derived from an acyl-CoA substrate. This is accomplished by two conserved backbone NH groups in active sites that form an oxyanion hole.
Changed!
pfam ECH 167aa 3e-22
in ref transcript
Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase.
Changed!
cd crotonase-like 170aa 4e-33
in modified transcript
Changed!
TIGR FadB 156aa 2e-18
in modified transcript
Members represent alpha subunit of multifunctional enzyme complex of the fatty acid degradation cycle. Activities include: enoyl-CoA hydratase (EC 4.2.1.17), dodecenoyl-CoA delta-isomerase activity (EC 5.3.3.8), 3-hydroxyacyl-CoA dehydrogenase (EC 1.1.1.35), 3-hydroxybutyryl-CoA epimerase (EC 5.1.2.3). A representative is E. coli FadB. This model excludes the FadJ family.
Changed!
COG CaiD 219aa 7e-30
in modified transcript
EDA
rs.EDA.F1 rs.EDA.R1 168 219
NCBIGene 36.3 1896
Single exon skipping, size difference: 51
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001399
Changed!
cd TNF 135aa 5e-13
in ref transcript
Tumor Necrosis Factor; TNF superfamily members include the cytokines: TNF (TNF-alpha), LT (lymphotoxin-alpha, TNF-beta), CD40 ligand, Apo2L (TRAIL), Fas ligand, and osteoprotegerin (OPG) ligand. These proteins generally have an intracellular N-terminal domain, a short transmembrane segment, an extracellular stalk, and a globular TNF-like extracellular domain of about 150 residues. They initiate apoptosis by binding to related receptors, some of which have intracellular death domains. They generally form homo- or hetero- trimeric complexes.TNF cytokines bind one elongated receptor molecule along each of three clefts formed by neighboring monomers of the trimer with ligand trimerization a requiste for receptor binding.
Changed!
pfam TNF 114aa 2e-18
in ref transcript
TNF(Tumour Necrosis Factor) family.
EDG2
rs.EDG2.F1 rs.EDG2.R1 313 391
NCBIGene 36.3 1902
Single exon skipping, size difference: 78
Exclusion in 5'UTR
Reference transcript: NM_057159
pfam 7tm_1 240aa 2e-28
in ref transcript
7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
EDN3
rs.EDN3.F1 rs.EDN3.R1 121 156
NCBIGene 36.3 1908
Alternative 5-prime, size difference: 35
Inclusion in the protein causing a frameshift
Reference transcript: NM_000114
pfam Endothelin 30aa 4e-11
in ref transcript
Endothelin family.
smart END 22aa 1e-04
in ref transcript
Endothelin.
EFCAB6
rs.EFCAB6.F1 rs.EFCAB6.R1 115 473
NCBIGene 36.3 64800
Multiple exon skipping, size difference: 358
Exclusion of the protein initiation site, Exclusion in the protein causing a frameshift
Reference transcript: NM_022785
cd EFh 60aa 8e-04
in ref transcript
EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
cd EFh 61aa 0.001
in ref transcript
cd EFh 59aa 0.007
in ref transcript
pfam DUF1880 118aa 1e-60
in ref transcript
Domain of unknown function (DUF1880). This domain is found predominantly in DJ binding protein. It has no known function.
Changed!
PTZ PTZ00184 162aa 3e-06
in ref transcript
calmodulin; Provisional.
COG FRQ1 61aa 8e-04
in ref transcript
Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].
COG FRQ1 176aa 0.001
in ref transcript
EFTUD1
rs.EFTUD1.F1 rs.EFTUD1.R1 251 404
NCBIGene 36.3 79631
Multiple exon skipping, size difference: 153
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_024580
Changed!
cd EF2 234aa 1e-108
in ref transcript
EF2 (for archaea and eukarya). Translocation requires hydrolysis of a molecule of GTP and is mediated by EF-G in bacteria and by eEF2 in eukaryotes. The eukaryotic elongation factor eEF2 is a GTPase involved in the translocation of the peptidyl-tRNA from the A site to the P site on the ribosome. The 95-kDa protein is highly conserved, with 60% amino acid sequence identity between the human and yeast proteins. Two major mechanisms are known to regulate protein elongation and both involve eEF2. First, eEF2 can be modulated by reversible phosphorylation. Increased levels of phosphorylated eEF2 reduce elongation rates presumably because phosphorylated eEF2 fails to bind the ribosomes. Treatment of mammalian cells with agents that raise the cytoplasmic Ca2+ and cAMP levels reduce elongation rates by activating the kinase responsible for phosphorylating eEF2. In contrast, treatment of cells with insulin increases elongation rates by promoting eEF2 dephosphorylation. Second, the protein can be post-translationally modified by ADP-ribosylation. Various bacterial toxins perform this reaction after modification of a specific histidine residue to diphthamide, but there is evidence for endogenous ADP ribosylase activity. Similar to the bacterial toxins, it is presumed that modification by the endogenous enzyme also inhibits eEF2 activity.
cd aeEF2_snRNP_like_IV 300aa 5e-37
in ref transcript
This family represents domain IV of archaeal and eukaryotic elongation factor 2 (aeEF-2) and of an evolutionarily conserved U5 snRNP-specific protein. U5 snRNP is a GTP-binding factor closely related to the ribosomal translocase EF-2. In complex with GTP, EF-2 promotes the translocation step of translation. During translocation the peptidyl-tRNA is moved from the A site to the P site of the small subunit of ribosome and the mRNA is shifted one codon relative to the ribosome. It has been shown that EF-2_IV domain mimics the shape of anticodon arm of the tRNA in the structurally homologous ternary complex of Phe-tRNA, EF-1 (another transcriptional elongation factor) and GTP analog. The tip portion of this domain is found in a position that overlaps the anticodon arm of the A-site tRNA, implying that EF-2 displaces the A-site tRNA to the P-site by physical interaction with the anticodon arm.
cd eEF2_snRNP_like_C 79aa 7e-31
in ref transcript
eEF2_snRNP_like_C: this family represents a C-terminal domain of eukaryotic elongation factor 2 (eEF-2) and a homologous domain of the spliceosomal human 116kD U5 small nuclear ribonucleoprotein (snRNP) protein (U5-116 kD) and, its yeast counterpart Snu114p. Yeast Snu114p is essential for cell viability and for splicing in vivo. U5-116 kD binds GTP. Experiments suggest that GTP binding and probably GTP hydrolysis is important for the function of the U5-116 kD/Snu114p. In complex with GTP, EF-2 promotes the translocation step of translation. During translocation the peptidyl-tRNA is moved from the A site to the P site, the uncharged tRNA from the P site to the E-site and, the mRNA is shifted one codon relative to the ribosome.
cd eEF2_snRNP_like_II 115aa 5e-20
in ref transcript
EF2_snRNP_like_II: this subfamily represents domain II of elongation factor (EF) EF-2 found eukaryotes and archaea and, the C-terminal portion of the spliceosomal human 116kD U5 small nuclear ribonucleoprotein (snRNP) protein (U5-116 kD) and, its yeast counterpart Snu114p. During the process of peptide synthesis and tRNA site changes, the ribosome is moved along the mRNA a distance equal to one codon with the addition of each amino acid. This translocation step is catalyzed by EF-2_GTP, which is hydrolyzed to provide the required energy. Thus, this action releases the uncharged tRNA from the P site and transfers the newly formed peptidyl-tRNA from the A site to the P site. Yeast Snu114p is essential for cell viability and for splicing in vivo. U5-116 kD binds GTP. Experiments suggest that GTP binding and probably GTP hydrolysis is important for the function of the U5-116 kD/Snu114p.
Changed!
pfam GTP_EFTU 149aa 2e-50
in ref transcript
Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel domains.
TIGR aEF-2 263aa 7e-19
in ref transcript
This model represents archaeal elongation factor 2, a protein more similar to eukaryotic EF-2 than to bacterial EF-G, both in sequence similarity and in sharing with eukaryotes the property of having a diphthamide (modified His) residue at a conserved position. The diphthamide can be ADP-ribosylated by diphtheria toxin in the presence of NAD.
pfam EFG_C 87aa 2e-16
in ref transcript
Elongation factor G C-terminus. This domain includes the carboxyl terminal regions of Elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopt a ferredoxin-like fold.
pfam EFG_IV 72aa 0.002
in ref transcript
Elongation factor G, domain IV. This domain is found in elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopts a ribosomal protein S5 domain 2-like fold.
Changed!
PRK PRK07560 277aa 6e-65
in ref transcript
elongation factor EF-2; Reviewed.
COG FusA 206aa 6e-34
in ref transcript
Translation elongation factors (GTPases) [Translation, ribosomal structure and biogenesis].
COG FusA 142aa 3e-23
in ref transcript
Changed!
cd EF2 172aa 3e-68
in modified transcript
Changed!
TIGR aEF-2 111aa 4e-30
in modified transcript
Changed!
PRK PRK07560 226aa 6e-40
in modified transcript
EIF4G2
rs.EIF4G2.F1 rs.EIF4G2.R1 151 265
NCBIGene 36.3 1982
Single exon skipping, size difference: 114
Exclusion in the protein (no frameshift)
Reference transcript: NM_001418
smart MA3 101aa 4e-21
in ref transcript
Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains Ponting (TIBS) "Novel eIF4G domain homologues" in press.
pfam MIF4G 102aa 1e-20
in ref transcript
MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.
smart eIF5C 76aa 4e-17
in ref transcript
Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5.
ELN
rs.ELN.F1 rs.ELN.R1 125 179
NCBIGene 36.3 2006
Single exon skipping, size difference: 54
Exclusion in the protein (no frameshift)
Reference transcript: NM_000501
ELN
rs.ELN.F2 rs.ELN.R2 124 181
NCBIGene 36.3 2006
Single exon skipping, size difference: 57
Exclusion in the protein (no frameshift)
Reference transcript: NM_000501
ELN
rs.ELN.F3 rs.ELN.R3 100 115
NCBIGene 36.3 2006
Alternative 3-prime, size difference: 15
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_000501
ELN
rs.ELN.F4 rs.ELN.R4 254 284
NCBIGene 36.3 2006
Single exon skipping, size difference: 30
Exclusion in the protein (no frameshift)
Reference transcript: NM_000501
ELOVL7
rs.ELOVL7.F1 rs.ELOVL7.R1 123 174
NCBIGene 36.3 79993
Single exon skipping, size difference: 51
Exclusion in 5'UTR
Reference transcript: NM_024930
pfam ELO 240aa 9e-53
in ref transcript
GNS1/SUR4 family. Members of this family are involved in long chain fatty acid elongation systems that produce the 26-carbon precursors for ceramide and sphingolipid synthesis. Predicted to be integral membrane proteins, in eukaryotes they are probably located on the endoplasmic reticulum. Yeast ELO3 affects plasma membrane H+-ATPase activity, and may act on a glucose-signaling pathway that controls the expression of several genes that are transcriptionally regulated by glucose such as PMA1.
PTZ PTZ00251 100aa 1e-04
in ref transcript
fatty acid elongase; Provisional.
ENPP2
rs.ENPP2.F1 rs.ENPP2.R1 148 304
NCBIGene 36.3 5168
Single exon skipping, size difference: 156
Exclusion in the protein (no frameshift)
Reference transcript: NM_006209
cd NUC 259aa 6e-47
in ref transcript
DNA/RNA non-specific endonuclease; prokaryotic and eukaryotic double- and single-stranded DNA and RNA endonucleases also present in phosphodiesterases. They exists as monomers and homodimers.
Changed!
pfam Phosphodiest 365aa 2e-83
in ref transcript
Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the cleavage of phosphodiester and phosphosulfate bonds in NAD, deoxynucleotides and nucleotide sugars. Also in this family is ATX an autotaxin, tumour cell motility-stimulating protein which exhibits type I phosphodiesterases activity. The alignment encompasses the active site. Also present with in this family is 60-kDa Ca2+-ATPase form F. odoratum.
smart NUC 231aa 4e-58
in ref transcript
DNA/RNA non-specific endonuclease. prokaryotic and eukaryotic double- and single-stranded DNA and RNA endonucleases also present in phosphodiesterases.
pfam Somatomedin_B 44aa 3e-13
in ref transcript
Somatomedin B domain.
pfam Somatomedin_B 45aa 1e-12
in ref transcript
Changed!
COG COG1524 425aa 2e-28
in ref transcript
Uncharacterized proteins of the AP superfamily [General function prediction only].
COG NUC1 231aa 1e-05
in ref transcript
DNA/RNA endonuclease G, NUC1 [Nucleotide transport and metabolism].
Changed!
pfam Phosphodiest 313aa 8e-91
in modified transcript
Changed!
COG COG1524 373aa 5e-34
in modified transcript
ENTPD8
rs.ENTPD8.F1 rs.ENTPD8.R1 248 359
NCBIGene 36.3 377841
Single exon skipping, size difference: 111
Exclusion in the protein (no frameshift)
Reference transcript: NM_001033113
Changed!
pfam GDA1_CD39 421aa 3e-78
in ref transcript
GDA1/CD39 (nucleoside phosphatase) family.
Changed!
COG COG5371 376aa 1e-12
in ref transcript
Golgi nucleoside diphosphatase [Carbohydrate transport and metabolism / Posttranslational modification, protein turnover, chaperones].
Changed!
pfam GDA1_CD39 384aa 2e-66
in modified transcript
Changed!
COG COG5371 304aa 6e-10
in modified transcript
EPB41
rs.EPB41.F1 rs.EPB41.R1 355 475
NCBIGene 36.3 2035
Multiple exon skipping, size difference: 120
Inclusion in the protein (no stop codon or frameshift), Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_203343
cd FERM_C 94aa 1e-32
in ref transcript
The FERM_C domain is the third structural domain within the FERM domain. The FERM domain is found in the cytoskeletal-associated proteins such as ezrin, moesin, radixin, 4.1R, and merlin. These proteins provide a link between the membrane and cytoskeleton and are involved in signal transduction pathways. The FERM_C domain is also found in protein tyrosine phosphatases (PTPs) , the tryosine kinases FAKand JAK, in addition to other proteins involved in signaling. This domain is structuraly similar to the PH and PTB domains and consequently is capable of binding to both peptides and phospholipids at different sites.
smart B41 157aa 1e-40
in ref transcript
Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
pfam 4_1_CTD 115aa 3e-26
in ref transcript
4.1 protein C-terminal domain (CTD). At the C-terminus of all known 4.1 proteins is a sequence domain unique to these proteins, known as the C-terminal domain (CTD). Mammalian CTDs are associated with a growing number of protein-protein interactions, although such activities have yet to be associated with invertebrate CTDs. Mammalian CTDs are generally defined by sequence alignment as encoded by exons 18-21. Comparison of known vertebrate 4.1 proteins with invertebrate 4.1 proteins indicates that mammalian 4.1 exon 19 represents a vertebrate adaptation that extends the sequence of the CTD with a Ser/Thr-rich sequence. The CTD was first described as a 22/24-kDa domain by chymotryptic digestion of erythrocyte 4.1 (4.1R). CTD is thought to represent an independent folding structure which has gained function since the divergence of vertebrates from invertebrates.
pfam FERM_C 85aa 2e-20
in ref transcript
FERM C-terminal PH-like domain.
Changed!
pfam SAB 47aa 5e-14
in ref transcript
SAB domain. This presumed domain is found in proteins containing FERM domains pfam00373. This domain is found to bind to both spectrin and actin, hence the name SAB (Spectrin and Actin Binding) domain.
pfam FA 45aa 5e-13
in ref transcript
FERM adjacent (FA). This region is found adjacent to Band 4.1 / FERM domains (pfam00373) in a subset of FERM containing protein. The region has been hypothesised to play a role in regulatory adaptation, based on similarity to other protein kinase substrates.
Changed!
pfam SAB 49aa 7e-15
in modified transcript
EPB41
rs.EPB41.F2 rs.EPB41.R2 184 241
NCBIGene 36.3 2035
Single exon skipping, size difference: 57
Exclusion in the protein (no frameshift)
Reference transcript: NM_203342
cd FERM_C 94aa 9e-32
in ref transcript
The FERM_C domain is the third structural domain within the FERM domain. The FERM domain is found in the cytoskeletal-associated proteins such as ezrin, moesin, radixin, 4.1R, and merlin. These proteins provide a link between the membrane and cytoskeleton and are involved in signal transduction pathways. The FERM_C domain is also found in protein tyrosine phosphatases (PTPs) , the tryosine kinases FAKand JAK, in addition to other proteins involved in signaling. This domain is structuraly similar to the PH and PTB domains and consequently is capable of binding to both peptides and phospholipids at different sites.
smart B41 192aa 1e-55
in ref transcript
Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
pfam 4_1_CTD 115aa 1e-27
in ref transcript
4.1 protein C-terminal domain (CTD). At the C-terminus of all known 4.1 proteins is a sequence domain unique to these proteins, known as the C-terminal domain (CTD). Mammalian CTDs are associated with a growing number of protein-protein interactions, although such activities have yet to be associated with invertebrate CTDs. Mammalian CTDs are generally defined by sequence alignment as encoded by exons 18-21. Comparison of known vertebrate 4.1 proteins with invertebrate 4.1 proteins indicates that mammalian 4.1 exon 19 represents a vertebrate adaptation that extends the sequence of the CTD with a Ser/Thr-rich sequence. The CTD was first described as a 22/24-kDa domain by chymotryptic digestion of erythrocyte 4.1 (4.1R). CTD is thought to represent an independent folding structure which has gained function since the divergence of vertebrates from invertebrates.
pfam FERM_C 85aa 4e-20
in ref transcript
FERM C-terminal PH-like domain.
pfam SAB 49aa 3e-16
in ref transcript
SAB domain. This presumed domain is found in proteins containing FERM domains pfam00373. This domain is found to bind to both spectrin and actin, hence the name SAB (Spectrin and Actin Binding) domain.
pfam FA 45aa 3e-13
in ref transcript
FERM adjacent (FA). This region is found adjacent to Band 4.1 / FERM domains (pfam00373) in a subset of FERM containing protein. The region has been hypothesised to play a role in regulatory adaptation, based on similarity to other protein kinase substrates.
EPHA6
rs.EPHA6.F1 rs.EPHA6.R1 137 201
NCBIGene 36.3 285220
Single exon skipping, size difference: 64
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001080448
Changed!
cd PTKc_EphR_A 309aa 1e-160
in ref transcript
Catalytic Domain of the Protein Tyrosine Kinases, Class EphA Ephrin Receptors. Protein Tyrosine Kinase (PTK) family; Ephrin Receptor (EphR) subfamily; most class EphA receptors including EphA3, EphA4, EphA5, and EphA7, but excluding EphA1, EphA2 and EphA10; catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. EphRs comprise the largest subfamily of receptor tyr kinases (RTKs). In general, class EphA receptors bind GPI-anchored ephrin-A ligands. There are ten vertebrate EphA receptors (EphA1-10), which display promiscuous interactions with six ephrin-A ligands. One exception is EphA4, which also binds ephrins-B2/B3. EphRs contain an ephrin-binding domain and two fibronectin repeats extracellularly, a transmembrane segment, and a cytoplasmic tyr kinase domain. Binding of the ephrin ligand to EphR requires cell-cell contact since both are anchored to the plasma membrane. The resulting downstream signals occur bidirectionally in both EphR-expressing cells (forward signaling) and ephrin-expressing cells (reverse signaling). Ephrin/EphR interaction mainly results in cell-cell repulsion or adhesion, making it important in neural development and plasticity, cell morphogenesis, cell-fate determination, embryonic development, tissue patterning, and angiogenesis. EphARs and ephrin-A ligands are expressed in multiple areas of the developing brain, especially in the retina and tectum. They are part of a system controlling retinotectal mapping.
cd FN3 92aa 1e-11
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Changed!
cd SAM 59aa 1e-07
in ref transcript
Sterile alpha motif.; Widespread domain in signalling and nuclear proteins. In EPH-related tyrosine kinases, appears to mediate cell-cell initiated signal transduction via the binding of SH2-containing proteins to a conserved tyrosine that is phosphorylated. In many cases mediates homodimerization.
cd FN3 101aa 0.002
in ref transcript
Changed!
pfam Pkinase_Tyr 300aa 1e-111
in ref transcript
Protein tyrosine kinase.
pfam Ephrin_lbd 173aa 7e-98
in ref transcript
Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand.
Changed!
smart SAM 68aa 4e-12
in ref transcript
Sterile alpha motif. Widespread domain in signalling and nuclear proteins. In EPH-related tyrosine kinases, appears to mediate cell-cell initiated signal transduction via the binding of SH2-containing proteins to a conserved tyrosine that is phosphorylated. In many cases mediates homodimerisation.
pfam fn3 87aa 2e-10
in ref transcript
Fibronectin type III domain.
pfam fn3 96aa 3e-06
in ref transcript
pfam GCC2_GCC3 37aa 0.002
in ref transcript
GCC2 and GCC3.
Changed!
COG SPS1 317aa 9e-18
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
EPSTI1
rs.EPSTI1.F1 rs.EPSTI1.R1 143 176
NCBIGene 36.3 94240
Single exon skipping, size difference: 33
Exclusion in the protein (no frameshift)
Reference transcript: NM_001002264
Changed!
pfam Borrelia_P83 133aa 0.007
in modified transcript
Borrelia P83/100 protein. This family consists of several Borrelia P83/P100 antigen proteins.
Changed!
COG TolA 169aa 0.009
in modified transcript
Membrane protein involved in colicin uptake [Cell envelope biogenesis, outer membrane].
ERBB4
rs.ERBB4.F1 rs.ERBB4.R1 114 162
NCBIGene 36.3 2066
Single exon skipping, size difference: 48
Exclusion in the protein (no frameshift)
Reference transcript: NM_005235
cd PTKc_HER4 297aa 0.0
in ref transcript
Catalytic Domain of the Protein Tyrosine Kinase, HER4. Protein Tyrosine Kinase (PTK) family; HER4 (ErbB4); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. HER4 is a member of the EGFR (HER, ErbB) subfamily of proteins, which are receptor tyr kinases (RTKs) containing an extracellular EGF-related ligand-binding region, a transmembrane helix, and a cytoplasmic region with a tyr kinase domain and a regulatory C-terminal tail. Unlike other tyr kinases, phosphorylation of the activation loop of EGFR proteins is not critical to their activation. Instead, they are activated by ligand-induced dimerization, leading to the phosphorylation of tyr residues in the C-terminal tail, which serve as binding sites for downstream signaling molecules. Ligands that bind HER4 fall into two groups, the neuregulins (or heregulins) and some EGFR (HER1) ligands including betacellulin, HBEGF, and epiregulin. All four neuregulins (NRG1-4) interact with HER4. Upon ligand binding, HER4 forms homo- or heterodimers with other HER proteins. HER4 is essential in embryonic development. It is implicated in mammary gland, cardiac, and neural development. As a postsynaptic receptor of NRG1, HER4 plays an important role in synaptic plasticity and maturation. The impairment of NRG1/HER4 signaling may contribute to schizophrenia.
cd FU 46aa 4e-05
in ref transcript
Furin-like repeats. Cysteine rich region. Exact function of the domain is not known. Furin is a serine-kinase dependent proprotein processor. Other members of this family include endoproteases and cell surface receptors.
cd FU 43aa 1e-04
in ref transcript
pfam Pkinase_Tyr 257aa 1e-112
in ref transcript
Protein tyrosine kinase.
pfam Furin-like 149aa 2e-41
in ref transcript
Furin-like cysteine rich region.
pfam Recep_L_domain 113aa 7e-30
in ref transcript
Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain.
pfam Recep_L_domain 121aa 4e-22
in ref transcript
smart FU 48aa 1e-06
in ref transcript
Furin-like repeats.
pfam Furin-like 101aa 2e-06
in ref transcript
COG SPS1 230aa 4e-21
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
ERC1
rs.ERC1.F1 rs.ERC1.R1 108 120
NCBIGene 36.3 23085
Single exon skipping, size difference: 12
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_178040
Changed!
pfam Cast 829aa 1e-148
in ref transcript
RIM-binding protein of the cytomatrix active zone. This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
pfam RBD-FIP 41aa 1e-07
in ref transcript
FIP domain. The FIP domain is the Rab11-binding domain (RBD) at the C-terminus of a family of Rab11-interacting proteins (FIPs). The Rab proteins constitute the largest family of small GTPases (>60 members in mammals). Among them Rab11 is a well characterised regulator of endocytic and recycling pathways. Rab11 associates with a broad range of post-Golgi organelles, including recycling endosomes.
COG Smc 340aa 3e-09
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
Changed!
COG Smc 318aa 3e-08
in ref transcript
COG Smc 298aa 5e-06
in ref transcript
Changed!
pfam Cast 833aa 1e-149
in modified transcript
Changed!
COG Smc 264aa 6e-07
in modified transcript
ERCC8
rs.ERCC8.F1 rs.ERCC8.R1 129 347
NCBIGene 36.3 1161
Single exon skipping, size difference: 218
Inclusion in the protein causing a frameshift
Reference transcript: NM_000082
Changed!
cd WD40 322aa 3e-30
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Changed!
smart WD40 41aa 2e-05
in ref transcript
WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Changed!
smart WD40 32aa 0.007
in ref transcript
Changed!
COG COG2319 358aa 2e-16
in ref transcript
FOG: WD40 repeat [General function prediction only].
ERGIC3
rs.ERGIC3.F1 rs.ERGIC3.R1 100 115
NCBIGene 36.3 51614
Single exon skipping, size difference: 15
Exclusion in the protein (no frameshift)
Reference transcript: NM_198398
Changed!
pfam DUF1692 188aa 1e-70
in ref transcript
Protein of unknown function (DUF1692). This family contains many hypothetical proteins. It also contains proteins which are involved in transport between the ER and Golgi complex protein transport from ER to Golgi.
Changed!
pfam DUF1692 183aa 1e-72
in modified transcript
EVI1
rs.EVI1.F1 rs.EVI1.R1 114 176
NCBIGene 36.3 2122
Single exon skipping, size difference: 62
Exclusion of the protein initiation site
Reference transcript: NM_001105077
Changed!
smart SET 57aa 1e-04
in ref transcript
SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain. Putative methyl transferase, based on outlier plant homologues.
pfam zf-C2H2 23aa 0.007
in ref transcript
Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.
EXOC1
rs.EXOC1.F1 rs.EXOC1.R1 254 299
NCBIGene 36.3 55763
Single exon skipping, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_001024924
Changed!
pfam Sec3 703aa 0.0
in ref transcript
Exocyst complex component Sec3. This entry is the conserved middle and C-terminus of the Sec3 protein. Sec3 binds to the C-terminal cytoplasmic domain of GLYT1 (glycine transporter protein 1). Sec3 is the exocyst component that is closest to the plasma membrane docking site and it serves as a spatial landmark in the plasma membrane for incoming secretory vesicles. Sec3 is recruited to the sites of polarised membrane growth through its interaction with Rho1p, a small GTP-binding protein.
Changed!
pfam Sec3 688aa 0.0
in modified transcript
EYA1
rs.EYA1.F1 rs.EYA1.R1 100 115
NCBIGene 36.3 2138
Alternative 3-prime, size difference: 15
Exclusion in the protein (no frameshift)
Reference transcript: NM_000503
TIGR EYA-cons_domain 271aa 1e-126
in ref transcript
This domain is common to all eyes absent (EYA) homologs. Metazoan EYA's also contain a variable N-terminal domain consisting largely of low-complexity sequences.
EYA2
rs.EYA2.F1 rs.EYA2.R1 125 197
NCBIGene 36.3 2139
Alternative 5-prime and 3-prime, size difference: 72
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_172113
TIGR EYA-cons_domain 272aa 1e-118
in ref transcript
This domain is common to all eyes absent (EYA) homologs. Metazoan EYA's also contain a variable N-terminal domain consisting largely of low-complexity sequences.
EZH2
rs.EZH2.F1 rs.EZH2.R1 100 115
NCBIGene 36.3 2146
Alternative 5-prime, size difference: 15
Exclusion in the protein (no frameshift)
Reference transcript: NM_004456
pfam SET 124aa 3e-36
in ref transcript
SET domain. SET domains are protein lysine methyltransferase enzymes. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains have been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as SET-N and SET-C. SET-C forms an unusual and conserved knot-like structure of probably functional importance. Additionally to SET-N and SET-C, an insert region (SET-I) and flanking regions of high structural variability form part of the overall structure.
COG COG2940 208aa 4e-15
in ref transcript
Proteins containing SET domain [General function prediction only].
F7
rs.F7.F1 rs.F7.R1 139 205
NCBIGene 36.3 2155
Single exon skipping, size difference: 66
Exclusion in the protein (no frameshift)
Reference transcript: NM_000131
cd Tryp_SPc 237aa 3e-57
in ref transcript
Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.
cd EGF_CA 37aa 9e-06
in ref transcript
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
smart Tryp_SPc 235aa 1e-57
in ref transcript
Trypsin-like serine protease. Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.
Changed!
smart GLA 63aa 1e-21
in ref transcript
Domain containing Gla (gamma-carboxyglutamate) residues. A hyaluronan-binding domain found in proteins associated with the extracellular matrix, cell adhesion and cell migration.
smart EGF_CA 37aa 2e-04
in ref transcript
Calcium-binding EGF-like domain.
COG COG5640 252aa 1e-14
in ref transcript
Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones].
Changed!
smart GLA 62aa 3e-21
in modified transcript
FAM104A
rs.FAM104A.F1 rs.FAM104A.R1 219 282
NCBIGene 36.3 84923
Single exon skipping, size difference: 63
Exclusion in the protein (no frameshift)
Reference transcript: NM_001098832
FAM107A
rs.FAM107A.F1 rs.FAM107A.R1 294 392
NCBIGene 36.3 11170
Single exon skipping, size difference: 98
Exclusion in 5'UTR
Reference transcript: NM_007177
pfam DUF1151 118aa 1e-24
in ref transcript
Protein of unknown function (DUF1151). This family consists of several hypothetical eukaryotic proteins of unknown function.
FAM123C
rs.FAM123C.F1 rs.FAM123C.R1 239 370
NCBIGene 36.3 205147
Alternative 5-prime, size difference: 131
Exclusion in 5'UTR
Reference transcript: NM_001105193
pfam WTX 371aa 1e-103
in ref transcript
WTX protein. The WTX protein is found to be inactivated in one third of Wilms tumours. The WTX protein is functionally uncharacterised.
FAM135A
rs.FAM135A.F1 rs.FAM135A.R1 116 245
NCBIGene 36.3 57579
Exon skipping and alternative 3-prime or 5-prime, size difference: 129
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_001105531
pfam DUF676 197aa 2e-59
in ref transcript
Putative serine esterase (DUF676). This family of proteins are probably serine esterase type enzymes with an alpha/beta hydrolase fold.
FAM135A
rs.FAM135A.F2 rs.FAM135A.R2 101 179
NCBIGene 36.3 57579
Single exon skipping, size difference: 78
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_001105531
pfam DUF676 197aa 2e-59
in ref transcript
Putative serine esterase (DUF676). This family of proteins are probably serine esterase type enzymes with an alpha/beta hydrolase fold.
FAM135A
rs.FAM135A.F3 rs.FAM135A.R3 171 257
NCBIGene 36.3 57579
Single exon skipping, size difference: 86
Exclusion in 5'UTR
Reference transcript: NM_001105531
pfam DUF676 197aa 2e-59
in ref transcript
Putative serine esterase (DUF676). This family of proteins are probably serine esterase type enzymes with an alpha/beta hydrolase fold.
FAM33A
rs.FAM33A.F1 rs.FAM33A.R1 222 399
NCBIGene 36.3 348235
Single exon skipping, size difference: 177
Exclusion in the protein (no frameshift)
Reference transcript: NM_182620
FAM33A
rs.FAM33A.F2 rs.FAM33A.R2 202 377
NCBIGene 36.3 348235
Alternative 5-prime, size difference: 175
Inclusion in the protein causing a new stop codon
Reference transcript: NM_182620
FAM48A
rs.FAM48A.F1 rs.FAM48A.R1 350 432
NCBIGene 36.3 55578
Alternative 3-prime, size difference: 3
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_001014286
FAM54A
rs.FAM54A.F1 rs.FAM54A.R1 197 485
NCBIGene 36.3 113115
Alternative 5-prime, size difference: 288
Exclusion in 5'UTR
Reference transcript: NM_001099286
pfam DUF729 266aa 1e-74
in ref transcript
Protein of unknown function (DUF729). This family consists of several uncharacterised eukaryotic proteins of unknown function.
FAM54B
rs.FAM54B.F1 rs.FAM54B.R1 181 290
NCBIGene 36.3 56181
Alternative 3-prime, size difference: 109
Exclusion in the protein causing a frameshift
Reference transcript: NM_001099625
Changed!
pfam DUF729 248aa 9e-51
in ref transcript
Protein of unknown function (DUF729). This family consists of several uncharacterised eukaryotic proteins of unknown function.
Changed!
pfam DUF729 148aa 6e-39
in modified transcript
FAM70A
rs.FAM70A.F1 rs.FAM70A.R1 217 289
NCBIGene 36.3 55026
Single exon skipping, size difference: 72
Exclusion in the protein (no frameshift)
Reference transcript: NM_017938
FAM72A
rs.FAM72A.F1 rs.FAM72A.R1 237 357
NCBIGene 36.3 729533
Alternative 5-prime, size difference: 120
Exclusion in the protein (no frameshift)
Reference transcript: XM_001133365
FAM89B
rs.FAM89B.F1 rs.FAM89B.R1 90 100
NCBIGene 36.3 23625
Alternative 3-prime, size difference: 10
Inclusion in 5'UTR
Reference transcript: NM_001098785
FAS
rs.FAS.F1 rs.FAS.R1 187 296
NCBIGene 36.3 355
Single exon skipping, size difference: 109
Exclusion in the protein causing a frameshift
Reference transcript: NM_000043
Changed!
cd TNFR 95aa 5e-16
in ref transcript
Tumor necrosis factor receptor (TNFR) domain; superfamily of TNF-like receptor domains. When bound to TNF-like cytokines, TNFRs trigger multiple signal transduction pathways, they are involved in inflammation response, apoptosis, autoimmunity and organogenesis. TNFRs domains are elongated with generally three tandem repeats of cysteine-rich domains (CRDs). They fit in the grooves between protomers within the ligand trimer. Some TNFRs, such as NGFR and HveA, bind ligands with no structural similarity to TNF and do not bind ligand trimers.
Changed!
smart DEATH 88aa 2e-16
in ref transcript
DEATH domain, found in proteins involved in cell death (apoptosis). Alpha-helical domain present in a variety of proteins with apoptotic functions. Some (but not all) of these domains form homotypic and heterotypic dimers.
Changed!
pfam TNFR_c6 37aa 4e-05
in ref transcript
TNFR/NGFR cysteine-rich region.
Changed!
cd TNFR 59aa 2e-06
in modified transcript
FAS
rs.FAS.F2 rs.FAS.R2 104 167
NCBIGene 36.3 355
Single exon skipping, size difference: 63
Exclusion in the protein (no frameshift)
Reference transcript: NM_000043
cd TNFR 95aa 5e-16
in ref transcript
Tumor necrosis factor receptor (TNFR) domain; superfamily of TNF-like receptor domains. When bound to TNF-like cytokines, TNFRs trigger multiple signal transduction pathways, they are involved in inflammation response, apoptosis, autoimmunity and organogenesis. TNFRs domains are elongated with generally three tandem repeats of cysteine-rich domains (CRDs). They fit in the grooves between protomers within the ligand trimer. Some TNFRs, such as NGFR and HveA, bind ligands with no structural similarity to TNF and do not bind ligand trimers.
smart DEATH 88aa 2e-16
in ref transcript
DEATH domain, found in proteins involved in cell death (apoptosis). Alpha-helical domain present in a variety of proteins with apoptotic functions. Some (but not all) of these domains form homotypic and heterotypic dimers.
pfam TNFR_c6 37aa 4e-05
in ref transcript
TNFR/NGFR cysteine-rich region.
FAS
rs.FAS.F3 rs.FAS.R3 100 125
NCBIGene 36.3 355
Single exon skipping, size difference: 25
Exclusion in the protein causing a frameshift
Reference transcript: NM_000043
cd TNFR 95aa 5e-16
in ref transcript
Tumor necrosis factor receptor (TNFR) domain; superfamily of TNF-like receptor domains. When bound to TNF-like cytokines, TNFRs trigger multiple signal transduction pathways, they are involved in inflammation response, apoptosis, autoimmunity and organogenesis. TNFRs domains are elongated with generally three tandem repeats of cysteine-rich domains (CRDs). They fit in the grooves between protomers within the ligand trimer. Some TNFRs, such as NGFR and HveA, bind ligands with no structural similarity to TNF and do not bind ligand trimers.
Changed!
smart DEATH 88aa 2e-16
in ref transcript
DEATH domain, found in proteins involved in cell death (apoptosis). Alpha-helical domain present in a variety of proteins with apoptotic functions. Some (but not all) of these domains form homotypic and heterotypic dimers.
pfam TNFR_c6 37aa 4e-05
in ref transcript
TNFR/NGFR cysteine-rich region.
FASTK
rs.FASTK.F1 rs.FASTK.R1 125 548
NCBIGene 36.3 10922
Single exon skipping, size difference: 423
Exclusion in the protein (no frameshift)
Reference transcript: NM_006712
pfam FAST_1 67aa 1e-13
in ref transcript
FAST kinase-like protein, subdomain 1. This family represents a conserved region of eukaryotic Fas-activated serine/threonine (FAST) kinases (EC:2.7.1.-) that contains several conserved leucine residues. FAST kinase is rapidly activated during Fas-mediated apoptosis, when it phosphorylates TIA-1, a nuclear RNA-binding protein that has been implicated as an effector of apoptosis. Note that many family members are hypothetical proteins. This region is often found immediately N-terminal to the FAST kinase-like protein, subdomain 2.
pfam FAST_2 92aa 8e-13
in ref transcript
FAST kinase-like protein, subdomain 2. This family represents a conserved region of eukaryotic Fas-activated serine/threonine (FAST) kinases (EC:2.7.1.-) that contains several conserved leucine residues. FAST kinase is rapidly activated during Fas-mediated apoptosis, when it phosphorylates TIA-1, a nuclear RNA-binding protein that has been implicated as an effector of apoptosis. Note that many family members are hypothetical proteins. This subdomain is often found associated with the FAST kinase-like protein, subdomain 2.
pfam RAP 58aa 5e-12
in ref transcript
RAP domain. This domain is found in various eukaryotic species, particularly in apicomplexans such as Plasmodium falciparum, where it is found in proteins that are important in various parasite-host cell interactions. It is thought to be an RNA-binding domain.
FBXO43
rs.FBXO43.F1 rs.FBXO43.R1 209 253
NCBIGene 36.3 286151
Alternative 5-prime, size difference: 44
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001029860
Changed!
pfam IBR 43aa 0.002
in ref transcript
IBR domain. The IBR (In Between Ring fingers) domain is often found to occur between pairs of ring fingers (pfam00097). This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain. Proteins that contain two Ring fingers and an IBR domain (these proteins are also termed RBR family proteins) are thought to exist in all eukaryotic organisms. RBR family members play roles in protein quality control and can indirectly regulate transcription. Evidence suggests that RBR proteins are often parts of cullin-containing ubiquitin ligase complexes. The ubiquitin ligase Parkin is an RBR family protein whose mutations are involved in forms of familial Parkinson's disease.
FCGR1B
rs.FCGR1B.F1 rs.FCGR1B.R1 108 384
NCBIGene 36.3 2210
Multiple exon skipping, size difference: 276
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_001017986
Changed!
cd IGcam 64aa 8e-04
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
smart IGc2 59aa 4e-04
in ref transcript
Immunoglobulin C-2 Type.
Changed!
smart IG_like 74aa 6e-04
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
FCGR2C
rs.FCGR2C.F1 rs.FCGR2C.R1 103 117
NCBIGene 36.3 9103
Alternative 3-prime, size difference: 14
Inclusion in the protein causing a frameshift
Reference transcript: NM_201563
FERMT3
rs.FERMT3.F1 rs.FERMT3.R1 100 112
NCBIGene 36.3 83706
Alternative 3-prime, size difference: 12
Exclusion in the protein (no frameshift)
Reference transcript: NM_178443
Changed!
cd Unc112 110aa 3e-45
in ref transcript
Unc-112 pleckstrin homology (PH) domain. Unc-112 and related proteins contain two FERM domains with a PH domain between them. Both the PH and FERM domains have a PH-like fold. The FERM domains are likely responsible for the role of Unc-112 in organizing beta-integrin. The specific role of the Unc-112 PH domain is not known, but it is predicted to be involved in mediating membrane interactions. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
smart B41 90aa 2e-14
in ref transcript
Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
pfam FERM_M 49aa 2e-10
in ref transcript
FERM central domain. This domain is the central structural domain of the FERM domain.
Changed!
pfam PH 95aa 2e-09
in ref transcript
PH domain. PH stands for pleckstrin homology.
Changed!
cd Unc112 106aa 5e-47
in modified transcript
Changed!
pfam PH 96aa 1e-09
in modified transcript
FEZ2
rs.FEZ2.F1 rs.FEZ2.R1 241 322
NCBIGene 36.3 9637
Single exon skipping, size difference: 81
Exclusion in the protein (no frameshift)
Reference transcript: NM_001042548
pfam FEZ 216aa 1e-70
in ref transcript
FEZ-like protein. This is a family of eukaryotic proteins thought to be involved in axonal outgrowth and fasciculation. The N-terminal regions of these sequences are less conserved than the C-terminal regions, and are highly acidic. The Caenorhabditis elegans homolog, UNC-76, may play structural and signalling roles in the control of axonal extension and adhesion (particularly in the presence of adjacent neuronal cells) and these roles have also been postulated for other FEZ family proteins. Certain homologs have been definitively found to interact with the N-terminal variable region (V1) of PKC-zeta, and this interaction causes cytoplasmic translocation of the FEZ family protein in mammalian neuronal cells. The C-terminal region probably participates in the association with the regulatory domain of PKC-zeta. The members of this family are predicted to form coiled-coil structures, which may interact with members of the RhoA family of signalling proteins, but are not thought to contain other characteristic protein motifs. Certain members of this family are expressed almost exclusively in the brain, whereas others (such as FEZ2) are expressed in other tissues, and are thought to perform similar but unknown functions in these tissues.
FGF8
rs.FGF8.F2 rs.FGF8.R2 123 156
NCBIGene 36.3 2253
Alternative 3-prime, size difference: 33
Exclusion in the protein (no frameshift)
Reference transcript: NM_033163
cd FGF 125aa 2e-28
in ref transcript
Acidic and basic fibroblast growth factor family; FGFs are mitogens, which stimulate growth or differentiation of cells of mesodermal or neuroectodermal origin. The family plays essential roles in patterning and differentiation during vertebrate embryogenesis, and has neurotrophic activities. FGFs have a high affinity for heparan sulfate proteoglycans and require heparan sulfate to activate one of four cell surface FGF receptors. Upon binding to FGF, the receptors dimerize and their intracellular tyrosine kinase domains become active. FGFs have internal pseudo-threefold symmetry (beta-trefoil topology).
pfam FGF 123aa 2e-38
in ref transcript
Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. They are found in a wide range of organisms, from nematodes to humans. Most share an internal core region of high similarity, conserved residues in which are involved in binding with their receptors. On binding, they cause dimerisation of their tyrosine kinase receptors leading to intracellular signalling. There are currently four known tyrosine kinase receptors for fibroblast growth factors. These receptors can each bind several different members of this family. Members of this family have a beta trefoil structure. Most have N-terminal signal peptides and are secreted. A few lack signal sequences but are secreted anyway; still others also lack the signal peptide but are found on the cell surface and within the extracellular matrix. A third group remain intracellular. They have central roles in development, regulating cell proliferation, migration and differentiation. On the other hand, they are important in tissue repair following injury in adult organisms.
FGFR1
rs.FGFR1.F1 rs.FGFR1.R1 107 218
NCBIGene 36.3 2260
Alternative 5-prime and 3-prime, size difference: 111
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_023110
Changed!
cd PTKc_FGFR1 307aa 0.0
in ref transcript
Catalytic Domain of the Protein Tyrosine Kinase, Fibroblast Growth Factor Receptor 1. Protein Tyrosine Kinase (PTK) family; Fibroblast Growth Factor Receptor 1 (FGFR1); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. FGFR1 is part of the FGFR subfamily, which are receptor tyr kinases (RTKs) containing an extracellular ligand-binding region with three immunoglobulin-like domains, a transmembrane segment, and an intracellular catalytic domain. The binding of FGFRs to their ligands, the FGFs, results in receptor dimerization and activation, and intracellular signaling. The binding of FGFs to FGFRs is promiscuous, in that a receptor may be activated by several ligands and a ligand may bind to more that one type of receptor. Alternative splicing of FGFR1 transcripts produces a variety of isoforms, which are differentially expressed in cells. FGFR1 binds the ligands, FGF1 and FGF2, with high affinity and has also been reported to bind FGF4, FGF6, and FGF9. FGFR1 signaling is critical in the control of cell migration during embryo development. It promotes cell proliferation in fibroblasts. Nuclear FGFR1 plays a role in the regulation of transcription. Mutations, insertions or deletions of FGFR1 have been identified in patients with Kallman's syndrome (KS), an inherited disorder characterized by hypogonadotropic hypogonadism and loss of olfaction. Aberrant FGFR1 expression has been found in some human cancers including 8P11 myeloproliferative syndrome (EMS), breast cancer, and pancreatic adenocarcinoma.
cd IGcam 93aa 6e-15
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IGcam 103aa 2e-09
in ref transcript
cd IGcam 55aa 0.004
in ref transcript
Changed!
pfam Pkinase_Tyr 277aa 1e-116
in ref transcript
Protein tyrosine kinase.
pfam I-set 88aa 2e-13
in ref transcript
Immunoglobulin I-set domain.
smart IG_like 98aa 4e-13
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
smart IGc2 48aa 4e-07
in ref transcript
Immunoglobulin C-2 Type.
Changed!
COG SPS1 343aa 2e-21
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
Changed!
cd PTKc_FGFR1 270aa 1e-158
in modified transcript
Changed!
pfam Pkinase_Tyr 236aa 2e-95
in modified transcript
Changed!
COG SPS1 238aa 4e-21
in modified transcript
FGFR1
rs.FGFR1.F2 FGFR1.R42 228 495
NCBIGene 36.3 2260
Single exon skipping, size difference: 267
Exclusion in the protein (no frameshift)
Reference transcript: NM_023110
cd PTKc_FGFR1 307aa 0.0
in ref transcript
Catalytic Domain of the Protein Tyrosine Kinase, Fibroblast Growth Factor Receptor 1. Protein Tyrosine Kinase (PTK) family; Fibroblast Growth Factor Receptor 1 (FGFR1); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. FGFR1 is part of the FGFR subfamily, which are receptor tyr kinases (RTKs) containing an extracellular ligand-binding region with three immunoglobulin-like domains, a transmembrane segment, and an intracellular catalytic domain. The binding of FGFRs to their ligands, the FGFs, results in receptor dimerization and activation, and intracellular signaling. The binding of FGFs to FGFRs is promiscuous, in that a receptor may be activated by several ligands and a ligand may bind to more that one type of receptor. Alternative splicing of FGFR1 transcripts produces a variety of isoforms, which are differentially expressed in cells. FGFR1 binds the ligands, FGF1 and FGF2, with high affinity and has also been reported to bind FGF4, FGF6, and FGF9. FGFR1 signaling is critical in the control of cell migration during embryo development. It promotes cell proliferation in fibroblasts. Nuclear FGFR1 plays a role in the regulation of transcription. Mutations, insertions or deletions of FGFR1 have been identified in patients with Kallman's syndrome (KS), an inherited disorder characterized by hypogonadotropic hypogonadism and loss of olfaction. Aberrant FGFR1 expression has been found in some human cancers including 8P11 myeloproliferative syndrome (EMS), breast cancer, and pancreatic adenocarcinoma.
cd IGcam 93aa 6e-15
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IGcam 103aa 2e-09
in ref transcript
Changed!
cd IGcam 55aa 0.004
in ref transcript
pfam Pkinase_Tyr 277aa 1e-116
in ref transcript
Protein tyrosine kinase.
pfam I-set 88aa 2e-13
in ref transcript
Immunoglobulin I-set domain.
smart IG_like 98aa 4e-13
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
Changed!
smart IGc2 48aa 4e-07
in ref transcript
Immunoglobulin C-2 Type.
COG SPS1 343aa 2e-21
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
FGFR4
rs.FGFR4.F1 FGFR4.R4 261 381
NCBIGene 36.3 2264
Exon skipping and alternative 3-prime or 5-prime, size difference: 120
Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_213647
cd PTKc_FGFR4 311aa 0.0
in ref transcript
Catalytic Domain of the Protein Tyrosine Kinase, Fibroblast Growth Factor Receptor 4. Protein Tyrosine Kinase (PTK) family; Fibroblast Growth Factor Receptor 4 (FGFR4); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. FGFR4 is part of the FGFR subfamily, which are receptor tyr kinases (RTKs) containing an extracellular ligand-binding region with three immunoglobulin-like domains, a transmembrane segment, and an intracellular catalytic domain. The binding of FGFRs to their ligands, the FGFs, results in receptor dimerization and activation, and intracellular signaling. The binding of FGFs to FGFRs is promiscuous, in that a receptor may be activated by several ligands and a ligand may bind to more that one type of receptor. Unlike other FGFRs, there is only one splice form of FGFR4. It binds FGF1, FGF2, FGF6, FGF19, and FGF23. FGF19 is a selective ligand for FGFR4. Although disruption of FGFR4 in mice causes no obvious phenotype, in vivo inhibition of FGFR4 in cultured skeletal muscle cells resulted in an arrest of muscle progenitor differentiation. FGF6 and FGFR4 are uniquely expressed in myofibers and satellite cells. FGF6/FGFR4 signaling appears to play a key role in the regulation of muscle regeneration. A polymorphism in FGFR4 is found in head and neck squamous cell carcinoma.
cd IGcam 93aa 7e-17
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IGcam 101aa 2e-07
in ref transcript
cd IGcam 67aa 0.003
in ref transcript
pfam Pkinase_Tyr 277aa 1e-114
in ref transcript
Protein tyrosine kinase.
smart IGc2 67aa 3e-15
in ref transcript
Immunoglobulin C-2 Type.
smart IG_like 96aa 5e-12
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
smart IGc2 58aa 7e-09
in ref transcript
COG SPS1 251aa 8e-23
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
FGFR4
rs.FGFR4.F2 rs.FGFR4.R2 167 211
NCBIGene 36.3 2264
Alternative 5-prime, size difference: 44
Exclusion in 5'UTR
Reference transcript: NM_213647
cd PTKc_FGFR4 311aa 0.0
in ref transcript
Catalytic Domain of the Protein Tyrosine Kinase, Fibroblast Growth Factor Receptor 4. Protein Tyrosine Kinase (PTK) family; Fibroblast Growth Factor Receptor 4 (FGFR4); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. FGFR4 is part of the FGFR subfamily, which are receptor tyr kinases (RTKs) containing an extracellular ligand-binding region with three immunoglobulin-like domains, a transmembrane segment, and an intracellular catalytic domain. The binding of FGFRs to their ligands, the FGFs, results in receptor dimerization and activation, and intracellular signaling. The binding of FGFs to FGFRs is promiscuous, in that a receptor may be activated by several ligands and a ligand may bind to more that one type of receptor. Unlike other FGFRs, there is only one splice form of FGFR4. It binds FGF1, FGF2, FGF6, FGF19, and FGF23. FGF19 is a selective ligand for FGFR4. Although disruption of FGFR4 in mice causes no obvious phenotype, in vivo inhibition of FGFR4 in cultured skeletal muscle cells resulted in an arrest of muscle progenitor differentiation. FGF6 and FGFR4 are uniquely expressed in myofibers and satellite cells. FGF6/FGFR4 signaling appears to play a key role in the regulation of muscle regeneration. A polymorphism in FGFR4 is found in head and neck squamous cell carcinoma.
cd IGcam 93aa 7e-17
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IGcam 101aa 2e-07
in ref transcript
cd IGcam 67aa 0.003
in ref transcript
pfam Pkinase_Tyr 277aa 1e-114
in ref transcript
Protein tyrosine kinase.
smart IGc2 67aa 3e-15
in ref transcript
Immunoglobulin C-2 Type.
smart IG_like 96aa 5e-12
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
smart IGc2 58aa 7e-09
in ref transcript
COG SPS1 251aa 8e-23
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
FKTN
rs.FKTN.F1 rs.FKTN.R1 173 265
NCBIGene 36.3 2218
Single exon skipping, size difference: 92
Exclusion in 5'UTR
Reference transcript: NM_001079802
pfam LicD 136aa 1e-08
in ref transcript
LICD Protein Family. The LICD family of proteins show high sequence similarity and are involved in phosphorylcholine metabolism. There is evidence to show that LicD2 mutants have a reduced ability to take up choline, have decreased ability to adhere to host cells and are less virulent.
FLJ11151
rs.FLJ11151.F1 rs.FLJ11151.R1 114 540
NCBIGene 36.3 55313
Single exon skipping, size difference: 426
Exclusion in the protein (no frameshift)
Reference transcript: NM_018340
Changed!
pfam Metallophos 172aa 3e-04
in ref transcript
Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacterial SbcD or yeast MRE11. The most conserved regions in this superfamily centre around the metal chelating residues.
Changed!
COG Icc 203aa 1e-09
in ref transcript
Predicted phosphohydrolases [General function prediction only].
FLJ11171
rs.FLJ11171.F1 rs.FLJ11171.R1 181 231
NCBIGene 36.3 55783
Alternative 5-prime, size difference: 50
Exclusion in 5'UTR
Reference transcript: NM_001099642
pfam FtsJ 209aa 2e-18
in ref transcript
FtsJ-like methyltransferase. This family consists of FtsJ from various bacterial and archaeal sources FtsJ is a methyltransferase, but actually has no effect on cell division. FtsJ's substrate is the 23S rRNA. The 1.5 A crystal structure of FtsJ in complex with its cofactor S-adenosylmethionine revealed that FtsJ has a methyltransferase fold. This family also includes the N terminus of flaviviral NS5 protein. It has been hypothesised that the N-terminal domain of NS5 is a methyltransferase involved in viral RNA capping.
COG FtsJ 117aa 2e-08
in ref transcript
23S rRNA methylase [Translation, ribosomal structure and biogenesis].
FLJ13611
rs.FLJ13611.F1 rs.FLJ13611.R1 110 128
NCBIGene 36.3 80006
Single exon skipping, size difference: 18
Exclusion in the protein (no frameshift)
Reference transcript: NM_001093755
Changed!
pfam DUF974 238aa 1e-113
in ref transcript
Protein of unknown function (DUF974). Family of uncharacterised eukaryotic proteins.
Changed!
pfam DUF974 232aa 1e-115
in modified transcript
FLJ14154
rs.FLJ14154.F1 rs.FLJ14154.R1 100 122
NCBIGene 36.3 79903
Alternative 5-prime, size difference: 22
Exclusion in 3'UTR
Reference transcript: NM_024845
cd GNAT 103aa 2e-07
in ref transcript
GCN5-related N-acetyltransferases (GNAT) represent a large superfamily of functionally diverse enzymes that catalyze the transfer of an acetyl group from acetyl-Coenzyme A to the primary amine of a wide range of acceptor substrates. Members of this superfamily include aminoglycoside N-acetyltransferases, serotonin N-acetyltransferase, glucosamine-6-phosphate N-acetyltransferase, the histone acetyltransferases, mycothiol synthase, and the Fem family of amino acyl transferases.
pfam Acetyltransf_1 78aa 2e-09
in ref transcript
Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions such as Elp3-related proteins.
COG RimI 71aa 4e-08
in ref transcript
Acetyltransferases [General function prediction only].
FLJ20254
rs.FLJ20254.F1 rs.FLJ20254.R1 175 310
NCBIGene 36.3 54867
Single exon skipping, size difference: 135
Exclusion in the protein (no frameshift)
Reference transcript: NM_017727
pfam DUF2359 469aa 0.0
in ref transcript
Uncharacterised conserved protein (DUF2359). This is a 450 amino acid region of a family of proteins conserved from insects to humans. The mouse protein, Q8BM55, is annotated as being a putative Vitamin K-dependent carboxylation gamma-carboxyglutamic (GLA) domain containing protein, but this could not be confirmed. The function is not known.
FLJ22167
rs.FLJ22167.F1 rs.FLJ22167.R1 100 223
NCBIGene 36.3 79583
Alternative 3-prime, size difference: 123
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001077416
Changed!
pfam NAcGluc_Transf 261aa 1e-100
in ref transcript
Carbohydrate (N-acetylglucosamine 6-O) sulfotransferase 5. Members of this family are involved in the biosynthesis of 6-Sulfo Sialyl Lewis X molecules, by catalysing the transfer of sulphate from 3'-phosphoadenosine 5'-phosphosulfate to position 6 of a non-reducing N-acetylglucosamine (GlcNAc) residue.
Changed!
pfam NAcGluc_Transf 71aa 8e-19
in modified transcript
FLJ25404
rs.FLJ25404.F1 rs.FLJ25404.R1 200 280
NCBIGene 36.3 146378
Alternative 3-prime, size difference: 80
Exclusion in the protein causing a frameshift
Reference transcript: NM_001109660
FLJ30428
rs.FLJ30428.F1 rs.FLJ30428.R1 306 529
NCBIGene 36.3 150519
Multiple exon skipping, size difference: 223
Exclusion in 5'UTR, Exclusion of the protein initiation site
Reference transcript: XM_001723916
pfam DUF1886 59aa 0.001
in ref transcript
Domain of unknown function (DUF1886). This domain is predominantly found in the Archaeal protein N-glycosylase/DNA lyase.
FLJ42562
rs.FLJ42562.F1 rs.FLJ42562.R1 101 113
NCBIGene 36.3 400954
Alternative 5-prime, size difference: 12
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: XM_001725001
cd WD40 306aa 8e-30
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
cd WD40 297aa 3e-29
in ref transcript
cd WD40 351aa 5e-19
in ref transcript
Changed!
cd WD40 274aa 1e-17
in ref transcript
cd WD40 308aa 2e-17
in ref transcript
cd WD40 274aa 3e-17
in ref transcript
pfam HELP 47aa 7e-08
in ref transcript
HELP motif. The HELP (Hydrophobic ELP) motif is found in EMAP and EMAP-like proteins (ELPs). The HELP motif contains a predicted transmembrane helix so probably does not form a globular domain. It is also not clear if these proteins localise to membranes. A preliminary study has shown that the N terminus of Sea urchin EMAP containing HELP is sufficient for microtubule binding in vitro (Eichenmuller et al In press).
pfam HELP 29aa 3e-06
in ref transcript
pfam HELP 41aa 2e-05
in ref transcript
pfam WD40 34aa 0.010
in ref transcript
WD domain, G-beta repeat.
Changed!
COG COG2319 403aa 1e-26
in ref transcript
FOG: WD40 repeat [General function prediction only].
COG COG2319 372aa 3e-21
in ref transcript
COG COG2319 399aa 5e-19
in ref transcript
COG COG2319 280aa 2e-14
in ref transcript
COG COG2319 324aa 2e-10
in ref transcript
Changed!
cd WD40 278aa 6e-16
in modified transcript
Changed!
COG COG2319 407aa 8e-26
in modified transcript
FLJ42562
rs.FLJ42562.F2 rs.FLJ42562.R2 249 345
NCBIGene 36.3 400954
Alternative 5-prime, size difference: 96
Exclusion in the protein (no frameshift)
Reference transcript: XM_001725001
cd WD40 306aa 8e-30
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
cd WD40 297aa 3e-29
in ref transcript
Changed!
cd WD40 351aa 5e-19
in ref transcript
cd WD40 274aa 1e-17
in ref transcript
Changed!
cd WD40 308aa 2e-17
in ref transcript
cd WD40 274aa 3e-17
in ref transcript
pfam HELP 47aa 7e-08
in ref transcript
HELP motif. The HELP (Hydrophobic ELP) motif is found in EMAP and EMAP-like proteins (ELPs). The HELP motif contains a predicted transmembrane helix so probably does not form a globular domain. It is also not clear if these proteins localise to membranes. A preliminary study has shown that the N terminus of Sea urchin EMAP containing HELP is sufficient for microtubule binding in vitro (Eichenmuller et al In press).
pfam HELP 29aa 3e-06
in ref transcript
pfam HELP 41aa 2e-05
in ref transcript
pfam WD40 34aa 0.010
in ref transcript
WD domain, G-beta repeat.
COG COG2319 403aa 1e-26
in ref transcript
FOG: WD40 repeat [General function prediction only].
COG COG2319 372aa 3e-21
in ref transcript
Changed!
COG COG2319 399aa 5e-19
in ref transcript
COG COG2319 280aa 2e-14
in ref transcript
COG COG2319 324aa 2e-10
in ref transcript
Changed!
cd WD40 306aa 8e-18
in modified transcript
Changed!
cd WD40 300aa 4e-17
in modified transcript
Changed!
COG COG2319 361aa 1e-18
in modified transcript
FLJ46266
rs.FLJ46266.F1 rs.FLJ46266.R1 144 225
NCBIGene 36.3 399949
Single exon skipping, size difference: 81
Exclusion in the protein (no frameshift)
Reference transcript: NM_207430
FLJ77644
rs.FLJ77644.F1 rs.FLJ77644.R1 258 399
NCBIGene 36.3 728772
Single exon skipping, size difference: 141
Exclusion in the protein (no frameshift)
Reference transcript: XM_001133074
Changed!
pfam DUF1356 241aa 1e-109
in ref transcript
Protein of unknown function (DUF1356). This family consists of several hypothetical mammalian proteins of around 250 residues in length. The function of this family is unknown.
Changed!
pfam DUF1356 194aa 8e-81
in modified transcript
FLJ77644
rs.FLJ77644.F2 rs.FLJ77644.R2 134 302
NCBIGene 36.3 728772
Single exon skipping, size difference: 168
Exclusion in 5'UTR
Reference transcript: XM_001133074
pfam DUF1356 241aa 1e-109
in ref transcript
Protein of unknown function (DUF1356). This family consists of several hypothetical mammalian proteins of around 250 residues in length. The function of this family is unknown.
FN1
rs.FN1.F1 rs.FN1.R1 99 369
NCBIGene 36.3 2335
Single exon skipping, size difference: 270
Exclusion in the protein (no frameshift)
Reference transcript: NM_212482
cd FN2 48aa 2e-16
in ref transcript
Fibronectin Type II domain: FN2 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. Fibronectin is composed of 3 types of modules, FN1,FN2 and FN3. The collagen binding domain contains four FN1 and two FN2 repeats.
cd FN2 48aa 3e-16
in ref transcript
cd FN3 83aa 4e-10
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd FN1 44aa 4e-09
in ref transcript
Fibronectin type 1 domain, approximately 40 residue long with two conserved disulfide bridges. FN1 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. FN1 domains also found in coagulation factor XII, HGF activator, and tissue-type plasminogen activator. In tissue plasminogen activator, FN1 domains may form functional fibrin-binding units with EGF-like domains C-terminal to FN1.
cd FN3 81aa 8e-09
in ref transcript
cd FN3 80aa 1e-08
in ref transcript
cd FN1 40aa 4e-08
in ref transcript
cd FN1 42aa 5e-08
in ref transcript
cd FN1 45aa 8e-08
in ref transcript
cd FN3 88aa 1e-07
in ref transcript
cd FN3 88aa 2e-07
in ref transcript
cd FN1 44aa 2e-07
in ref transcript
cd FN3 87aa 3e-07
in ref transcript
cd FN1 38aa 4e-07
in ref transcript
Changed!
cd FN3 81aa 9e-07
in ref transcript
cd FN3 82aa 1e-06
in ref transcript
Changed!
cd FN3 88aa 1e-06
in ref transcript
cd FN1 41aa 1e-06
in ref transcript
cd FN3 93aa 2e-06
in ref transcript
cd FN1 45aa 2e-06
in ref transcript
cd FN1 42aa 2e-06
in ref transcript
cd FN3 81aa 6e-06
in ref transcript
cd FN3 90aa 8e-06
in ref transcript
cd FN3 73aa 9e-06
in ref transcript
cd FN1 44aa 1e-05
in ref transcript
cd FN1 43aa 3e-05
in ref transcript
cd FN1 39aa 0.003
in ref transcript
cd FN3 73aa 0.006
in ref transcript
smart FN2 49aa 9e-20
in ref transcript
Fibronectin type 2 domain. One of three types of internal repeat within the plasma protein, fibronectin. Also occurs in coagulation factor XII, 2 type IV collagenases, PDC-109, and cation-independent mannose-6-phosphate and secretory phospholipase A2 receptors. In fibronectin, PDC-109, and the collagenases, this domain contributes to collagen-binding function.
smart FN2 49aa 4e-19
in ref transcript
pfam fn3 82aa 8e-17
in ref transcript
Fibronectin type III domain.
smart FN1 45aa 6e-13
in ref transcript
Fibronectin type 1 domain. One of three types of internal repeat within the plasma protein, fibronectin. Found also in coagulation factor XII, HGF activator and tissue-type plasminogen activator. In t-PA and fibronectin, this domain type contributes to fibrin-binding.
Changed!
pfam fn1 39aa 1e-12
in ref transcript
Fibronectin type I domain.
pfam fn3 81aa 3e-12
in ref transcript
pfam fn3 81aa 3e-12
in ref transcript
smart FN1 45aa 3e-12
in ref transcript
smart FN1 42aa 3e-12
in ref transcript
smart FN1 45aa 5e-12
in ref transcript
smart FN1 39aa 2e-11
in ref transcript
pfam fn3 81aa 4e-11
in ref transcript
pfam fn3 83aa 5e-11
in ref transcript
pfam fn3 67aa 8e-11
in ref transcript
pfam fn1 39aa 8e-11
in ref transcript
pfam fn3 81aa 1e-10
in ref transcript
pfam fn3 80aa 1e-10
in ref transcript
smart FN1 43aa 2e-10
in ref transcript
pfam fn3 80aa 5e-10
in ref transcript
smart FN1 44aa 7e-10
in ref transcript
pfam fn3 81aa 8e-09
in ref transcript
pfam fn3 85aa 9e-09
in ref transcript
pfam fn3 70aa 1e-08
in ref transcript
pfam fn3 65aa 2e-08
in ref transcript
smart FN1 41aa 3e-08
in ref transcript
Changed!
pfam fn3 81aa 3e-07
in ref transcript
smart FN1 41aa 5e-07
in ref transcript
pfam fn3 54aa 5e-05
in ref transcript
pfam fn1 31aa 9e-05
in ref transcript
Changed!
cd FN3 89aa 3e-07
in modified transcript
Changed!
smart FN1 44aa 2e-12
in modified transcript
FN1
rs.FN1.F2 rs.FN1.R2 156 231
NCBIGene 36.3 2335
Alternative 3-prime, size difference: 75
Exclusion in the protein (no frameshift)
Reference transcript: NM_212482
cd FN2 48aa 2e-16
in ref transcript
Fibronectin Type II domain: FN2 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. Fibronectin is composed of 3 types of modules, FN1,FN2 and FN3. The collagen binding domain contains four FN1 and two FN2 repeats.
cd FN2 48aa 3e-16
in ref transcript
cd FN3 83aa 4e-10
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd FN1 44aa 4e-09
in ref transcript
Fibronectin type 1 domain, approximately 40 residue long with two conserved disulfide bridges. FN1 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. FN1 domains also found in coagulation factor XII, HGF activator, and tissue-type plasminogen activator. In tissue plasminogen activator, FN1 domains may form functional fibrin-binding units with EGF-like domains C-terminal to FN1.
cd FN3 81aa 8e-09
in ref transcript
cd FN3 80aa 1e-08
in ref transcript
cd FN1 40aa 4e-08
in ref transcript
cd FN1 42aa 5e-08
in ref transcript
cd FN1 45aa 8e-08
in ref transcript
cd FN3 88aa 1e-07
in ref transcript
cd FN3 88aa 2e-07
in ref transcript
cd FN1 44aa 2e-07
in ref transcript
cd FN3 87aa 3e-07
in ref transcript
cd FN1 38aa 4e-07
in ref transcript
cd FN3 81aa 9e-07
in ref transcript
cd FN3 82aa 1e-06
in ref transcript
cd FN3 88aa 1e-06
in ref transcript
cd FN1 41aa 1e-06
in ref transcript
cd FN3 93aa 2e-06
in ref transcript
cd FN1 45aa 2e-06
in ref transcript
cd FN1 42aa 2e-06
in ref transcript
cd FN3 81aa 6e-06
in ref transcript
cd FN3 90aa 8e-06
in ref transcript
cd FN3 73aa 9e-06
in ref transcript
cd FN1 44aa 1e-05
in ref transcript
cd FN1 43aa 3e-05
in ref transcript
cd FN1 39aa 0.003
in ref transcript
cd FN3 73aa 0.006
in ref transcript
smart FN2 49aa 9e-20
in ref transcript
Fibronectin type 2 domain. One of three types of internal repeat within the plasma protein, fibronectin. Also occurs in coagulation factor XII, 2 type IV collagenases, PDC-109, and cation-independent mannose-6-phosphate and secretory phospholipase A2 receptors. In fibronectin, PDC-109, and the collagenases, this domain contributes to collagen-binding function.
smart FN2 49aa 4e-19
in ref transcript
pfam fn3 82aa 8e-17
in ref transcript
Fibronectin type III domain.
smart FN1 45aa 6e-13
in ref transcript
Fibronectin type 1 domain. One of three types of internal repeat within the plasma protein, fibronectin. Found also in coagulation factor XII, HGF activator and tissue-type plasminogen activator. In t-PA and fibronectin, this domain type contributes to fibrin-binding.
pfam fn1 39aa 1e-12
in ref transcript
Fibronectin type I domain.
pfam fn3 81aa 3e-12
in ref transcript
pfam fn3 81aa 3e-12
in ref transcript
smart FN1 45aa 3e-12
in ref transcript
smart FN1 42aa 3e-12
in ref transcript
smart FN1 45aa 5e-12
in ref transcript
smart FN1 39aa 2e-11
in ref transcript
pfam fn3 81aa 4e-11
in ref transcript
pfam fn3 83aa 5e-11
in ref transcript
pfam fn3 67aa 8e-11
in ref transcript
pfam fn1 39aa 8e-11
in ref transcript
pfam fn3 81aa 1e-10
in ref transcript
pfam fn3 80aa 1e-10
in ref transcript
smart FN1 43aa 2e-10
in ref transcript
pfam fn3 80aa 5e-10
in ref transcript
smart FN1 44aa 7e-10
in ref transcript
pfam fn3 81aa 8e-09
in ref transcript
pfam fn3 85aa 9e-09
in ref transcript
pfam fn3 70aa 1e-08
in ref transcript
pfam fn3 65aa 2e-08
in ref transcript
smart FN1 41aa 3e-08
in ref transcript
pfam fn3 81aa 3e-07
in ref transcript
smart FN1 41aa 5e-07
in ref transcript
pfam fn3 54aa 5e-05
in ref transcript
pfam fn1 31aa 9e-05
in ref transcript
FNTA
rs.FNTA.F1 rs.FNTA.R1 297 383
NCBIGene 36.3 2339
Single exon skipping, size difference: 86
Exclusion in the protein causing a frameshift
Reference transcript: NM_002027
Changed!
pfam PPTA 30aa 1e-05
in ref transcript
Protein prenyltransferase alpha subunit repeat. Both farnesyltransferase (FT) and geranylgeranyltransferase 1 (GGT1) recognise a CaaX motif on their substrates where 'a' stands for preferably aliphatic residues, whereas GGT2 recognises a completely different motif. Important substrates for FT include, amongst others, many members of the Ras superfamily. GGT1 substrates include some of the other small GTPases and GGT2 substrates include the Rab family.
Changed!
pfam PPTA 30aa 2e-05
in ref transcript
Changed!
pfam PPTA 30aa 0.002
in ref transcript
Changed!
pfam PPTA 29aa 0.003
in ref transcript
Changed!
COG BET4 211aa 1e-31
in ref transcript
Protein prenyltransferase, alpha subunit [Posttranslational modification, protein turnover, chaperones].
FOXN3
rs.FOXN3.F1 rs.FOXN3.R1 129 195
NCBIGene 36.3 1112
Single exon skipping, size difference: 66
Exclusion in the protein (no frameshift)
Reference transcript: NM_001085471
cd FH 79aa 1e-31
in ref transcript
Forkhead (FH), also known as a "winged helix". FH is named for the Drosophila fork head protein, a transcription factor which promotes terminal rather than segmental development. This family of transcription factor domains, which bind to B-DNA as monomers, are also found in the Hepatocyte nuclear factor (HNF) proteins, which provide tissue-specific gene regulation. The structure contains 2 flexible loops or "wings" in the C-terminal region, hence the term winged helix.
smart FH 88aa 4e-34
in ref transcript
FORKHEAD. FORKHEAD, also known as a "winged helix".
COG COG5025 106aa 5e-15
in ref transcript
Transcription factor of the Forkhead/HNF3 family [Transcription].
FOXP4
rs.FOXP4.F1 rs.FOXP4.R1 175 211
NCBIGene 36.3 116113
Alternative 3-prime, size difference: 36
Exclusion in the protein (no frameshift)
Reference transcript: NM_001012426
cd FH 73aa 8e-26
in ref transcript
Forkhead (FH), also known as a "winged helix". FH is named for the Drosophila fork head protein, a transcription factor which promotes terminal rather than segmental development. This family of transcription factor domains, which bind to B-DNA as monomers, are also found in the Hepatocyte nuclear factor (HNF) proteins, which provide tissue-specific gene regulation. The structure contains 2 flexible loops or "wings" in the C-terminal region, hence the term winged helix.
smart FH 73aa 5e-29
in ref transcript
FORKHEAD. FORKHEAD, also known as a "winged helix".
Changed!
COG COG5025 197aa 5e-12
in ref transcript
Transcription factor of the Forkhead/HNF3 family [Transcription].
Changed!
COG COG5025 98aa 2e-10
in modified transcript
Changed!
COG COG5025 234aa 2e-08
in modified transcript
FOXRED2
rs.FOXRED2.F1 rs.FOXRED2.R1 255 352
NCBIGene 36.3 80020
Alternative 5-prime, size difference: 97
Exclusion in 5'UTR
Reference transcript: NM_024955
pfam HI0933_like 154aa 4e-07
in ref transcript
HI0933-like protein.
TIGR TRX_reduct 86aa 0.005
in ref transcript
This model describes thioredoxin-disulfide reductase, a member of the pyridine nucleotide-disulphide oxidoreductases (pfam00070).
COG TrkA 219aa 6e-18
in ref transcript
Predicted flavoprotein involved in K+ transport [Inorganic ion transport and metabolism].
FRS2
rs.FRS2.F1 rs.FRS2.R1 117 185
NCBIGene 36.3 10818
Single exon skipping, size difference: 68
Exclusion in 5'UTR
Reference transcript: NM_006654
cd FRS2 97aa 4e-43
in ref transcript
Fibroblast growth factor receptor substrate 2 (FRS2/SNT1) Phosphotyrosine-binding domain (IRS1-like). FRS2 mediates signaling downstream of the FGF receptor. It has an N-terminal PTBi domain, which has a PH-like fold and is similiar to the PTB domain that is found in insulin receptor substrate molecules. This PTBi domain is shorter than the PTB domain which is found in SHC, Numb and other proteins. The PTBi domain binds to phosphotyrosines which are in NPXpY motifs.
smart PTBI 91aa 1e-24
in ref transcript
Phosphotyrosine-binding domain (IRS1-like).
FRS2
rs.FRS2.F2 rs.FRS2.R2 226 287
NCBIGene 36.3 10818
Single exon skipping, size difference: 61
Exclusion in 5'UTR
Reference transcript: NM_001042555
cd FRS2 97aa 4e-43
in ref transcript
Fibroblast growth factor receptor substrate 2 (FRS2/SNT1) Phosphotyrosine-binding domain (IRS1-like). FRS2 mediates signaling downstream of the FGF receptor. It has an N-terminal PTBi domain, which has a PH-like fold and is similiar to the PTB domain that is found in insulin receptor substrate molecules. This PTBi domain is shorter than the PTB domain which is found in SHC, Numb and other proteins. The PTBi domain binds to phosphotyrosines which are in NPXpY motifs.
smart PTBI 91aa 1e-24
in ref transcript
Phosphotyrosine-binding domain (IRS1-like).
FSCN2
rs.FSCN2.F1 rs.FSCN2.R1 141 213
NCBIGene 36.3 25794
Alternative 3-prime, size difference: 72
Inclusion in 3'UTR
Reference transcript: NM_012418
cd Fascin 121aa 2e-24
in ref transcript
Fascin-like domain; members include actin-bundling/crosslinking proteins facsin, histoactophilin and singed; identified in sea urchin, Drosophila, Xenopus, rodents, and humans; The fascin-like domain adopts a beta-trefoil topology and contains an internal threefold repeat; the fascin subgroup contains four copies of the domain; Structurally similar to fibroblast growth factor (FGF).
cd Fascin 117aa 3e-20
in ref transcript
pfam Fascin 113aa 4e-30
in ref transcript
Fascin domain. This family consists of several eukaryotic fascin or singed proteins. The fascins are a structurally unique and evolutionarily conserved group of actin cross-linking proteins. Fascins function in the organisation of two major forms of actin-based structures: dynamic, cortical cell protrusions and cytoplasmic microfilament bundles. The cortical structures, which include filopodia, spikes, lamellipodial ribs, oocyte microvilli and the dendrites of dendritic cells, have roles in cell-matrix adhesion, cell interactions and cell migration, whereas the cytoplasmic actin bundles appear to participate in cell architecture. Dictyostelium hisactophilin, another actin-binding protein, is a submembranous pH sensor that signals slight changes of the H+ concentration to actin by inducing actin polymerisation and binding to microfilaments only at pH values below seven. Members of this family are histidine rich, typically contain the repeated motif of HHXH.
pfam Fascin 114aa 8e-22
in ref transcript
FSHR
rs.FSHR.F1 rs.FSHR.R1 210 396
NCBIGene 36.3 2492
Single exon skipping, size difference: 186
Exclusion in the protein (no frameshift)
Reference transcript: NM_000145
pfam 7tm_1 242aa 1e-30
in ref transcript
7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
pfam LRRNT 26aa 0.001
in ref transcript
Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is often found at the N-terminus of tandem leucine rich repeats.
Changed!
COG COG4886 86aa 0.009
in modified transcript
Leucine-rich repeat (LRR) protein [Function unknown].
FUT2
rs.FUT2.F1 rs.FUT2.R1 90 100
NCBIGene 36.3 2524
Alternative 5-prime, size difference: 10
Exclusion in 5'UTR
Reference transcript: NM_000511
pfam Glyco_transf_11 314aa 1e-133
in ref transcript
Glycosyl transferase family 11. This family contains several fucosyl transferase enzymes.
FUT3
rs.FUT3.F1 rs.FUT3.R1 112 460
NCBIGene 36.3 2525
Alternative 3-prime, size difference: 348
Inclusion in 5'UTR
Reference transcript: NM_001097639
pfam Glyco_transf_10 291aa 1e-102
in ref transcript
Glycosyltransferase family 10 (fucosyltransferase). This family of Fucosyltransferases are the enzymes transferring fucose from GDP-Fucose to GlcNAc in an alpha1,3 linkage. This family is know as glycosyltransferase family 10.
FUT6
rs.FUT6.F1 rs.FUT6.R1 187 315
NCBIGene 36.3 2528
Single exon skipping, size difference: 128
Exclusion in 5'UTR
Reference transcript: NM_000150
pfam Glyco_transf_10 354aa 1e-113
in ref transcript
Glycosyltransferase family 10 (fucosyltransferase). This family of Fucosyltransferases are the enzymes transferring fucose from GDP-Fucose to GlcNAc in an alpha1,3 linkage. This family is know as glycosyltransferase family 10.
FUT8
rs.FUT8.F1 rs.FUT8.R1 120 544
NCBIGene 36.3 2530
Multiple exon skipping, size difference: 424
Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
Reference transcript: NM_178155
Changed!
cd SH3 53aa 5e-05
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
Changed!
smart SH3 54aa 1e-06
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
FUT8
rs.FUT8.F2 rs.FUT8.R2 116 546
NCBIGene 36.3 2530
Single exon skipping, size difference: 430
Exclusion of the protein initiation site
Reference transcript: NM_178155
cd SH3 53aa 5e-05
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
smart SH3 54aa 1e-06
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
G3BP2
rs.G3BP2.F1 rs.G3BP2.R1 149 248
NCBIGene 36.3 9908
Single exon skipping, size difference: 99
Exclusion in the protein (no frameshift)
Reference transcript: NM_203505
cd NTF2 129aa 4e-31
in ref transcript
Nuclear transport factor 2 (NTF2) domain plays an important role in the trafficking of macromolecules, ions and small molecules between the cytoplasm and nucleus. This bi-directional transport of macromolecules across the nuclear envelope requires many soluble factors that includes GDP-binding protein Ran (RanGDP). RanGDP is required for both import and export of proteins and poly(A) RNA. RanGDP also has been implicated in cell cycle control, specifically in mitotic spindle assembly. In interphase cells, RanGDP is predominately nuclear and thought to be GTP bound, but it is also present in the cytoplasm, probably in the GDP-bound state. NTF2 mediates the nuclear import of RanGDP. NTF2 binds to both RanGDP and FxFG repeat-containing nucleoporins.
cd RRM 73aa 3e-10
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
pfam NTF2 123aa 1e-23
in ref transcript
Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins.
pfam RRM_1 66aa 4e-10
in ref transcript
RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternative splicing, and protein components of snRNPs. The motif also appears in a few single stranded DNA binding proteins. The RRM structure consists of four strands and two helices arranged in an alpha/beta sandwich, with a third helix present during RNA binding in some cases The C-terminal beta strand (4th strand) and final helix are hard to align and have been omitted in the SEED alignment The LA proteins have a N terminus rrm which is included in the seed. There is a second region towards the C terminus that has some features of a rrm but does not appear to have the important structural core of a rrm. The LA proteins are one of the main autoantigens in Systemic lupus erythematosus (SLE), an autoimmune disease.
COG COG0724 71aa 4e-06
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
G6PC2
rs.G6PC2.F1 rs.G6PC2.R1 276 392
NCBIGene 36.3 57818
Single exon skipping, size difference: 116
Exclusion in the protein causing a frameshift
Reference transcript: NM_021176
Changed!
cd PAP2_glucose_6_phosphatase 241aa 5e-91
in ref transcript
PAP2_like proteins, glucose-6-phosphatase subfamily. Glucose-6-phosphatase converts glucose-6-phosphate into free glucose and is active in the lumen of the endoplasmic reticulum, where it is bound to the membrane. The generation of free glucose is an important control point in metabolism, and stands at the end of gluconeogenesis and the release of glucose from glycogen. Deficiency of glucose-6-phosphatase leads to von Gierke's disease.
Changed!
smart acidPPc 135aa 4e-14
in ref transcript
Acid phosphatase homologues.
Changed!
cd PAP2_glucose_6_phosphatase 103aa 8e-40
in modified transcript
Changed!
smart acidPPc 70aa 2e-08
in modified transcript
GAGE1
rs.GAGE1.F1 rs.GAGE1.R1 255 398
NCBIGene 36.3 2543
Single exon skipping, size difference: 143
Exclusion of the stop codon
Reference transcript: NM_001468
Changed!
pfam GAGE 109aa 4e-09
in ref transcript
GAGE protein. This family consists of several GAGE and XAGE proteins which are found exclusively in humans. The function of this family is unknown although they have been implicated in human cancers.
Changed!
pfam GAGE 116aa 3e-08
in modified transcript
GARNL4
rs.GARNL4.F1 rs.GARNL4.R1 218 263
NCBIGene 36.3 23108
Single exon skipping, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_015085
pfam Rap_GAP 188aa 3e-80
in ref transcript
Rap/ran-GAP.
GCC2
rs.GCC2.F1 rs.GCC2.R1 198 283
NCBIGene 36.3 9648
Single exon skipping, size difference: 85
Exclusion in the protein causing a frameshift
Reference transcript: NM_181453
Changed!
pfam GRIP 45aa 1e-08
in ref transcript
GRIP domain. The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi. The GRIP domain contains a completely conserved tyrosine residue. At least some of these domains have been shown to bind to GTPase Arl1, see structures.
Changed!
TIGR SMC_prok_B 793aa 2e-07
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
Changed!
TIGR SMC_prok_B 252aa 0.001
in ref transcript
Changed!
TIGR SMC_prok_B 305aa 0.001
in ref transcript
Changed!
COG Smc 299aa 5e-05
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
Changed!
COG Smc 609aa 9e-05
in ref transcript
Changed!
COG Smc 295aa 0.006
in ref transcript
GCK
rs.GCK.F1 rs.GCK.R1 169 293
NCBIGene 36.3 2645
Single exon skipping, size difference: 124
Inclusion in the protein causing a new stop codon
Reference transcript: NM_033507
Changed!
pfam Hexokinase_2 238aa 1e-97
in ref transcript
Hexokinase. Hexokinase (EC:2.7.1.1) contains two structurally similar domains represented by this family and pfam00349. Some members of the family have two copies of each of these domains.
Changed!
pfam Hexokinase_1 203aa 5e-75
in ref transcript
Hexokinase. Hexokinase (EC:2.7.1.1) contains two structurally similar domains represented by this family and pfam03727. Some members of the family have two copies of each of these domains.
Changed!
COG COG5026 455aa 1e-75
in ref transcript
Hexokinase [Carbohydrate transport and metabolism].
GCNT1
rs.GCNT1.F1 rs.GCNT1.R1 229 347
NCBIGene 36.3 2650
Single exon skipping, size difference: 118
Exclusion in 5'UTR
Reference transcript: NM_001490
pfam Branch 210aa 1e-69
in ref transcript
Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryonic development. Core-2 branching enzyme forms crucial side-chain branches in O-glycans.
GDAP1
rs.GDAP1.F1 rs.GDAP1.R1 116 246
NCBIGene 36.3 54332
Alternative 5-prime and 3-prime, size difference: 130
Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
Reference transcript: NM_018972
Changed!
cd GST_C_GDAP1 111aa 9e-48
in ref transcript
GST_C family, Ganglioside-induced differentiation-associated protein 1 (GDAP1) subfamily; GDAP1 was originally identified as a highly expressed gene at the differentiated stage of GD3 synthase-transfected cells. More recently, mutations in GDAP1 have been reported to cause both axonal and demyelinating autosomal-recessive Charcot-Marie-Tooth (CMT) type 4A neuropathy. CMT is characterized by slow and progressive weakness and atrophy of muscles. Sequence analysis of GDAP1 shows similarities and differences with GSTs; it appears to contain both N-terminal thioredoxin-fold and C-terminal alpha helical domains of GSTs, however, it also contains additional C-terminal transmembrane domains unlike GSTs. GDAP1 is mainly expressed in neuronal cells and is localized in the mitochondria through its transmembrane domains. It does not exhibit GST activity using standard substrates.
Changed!
cd GST_N_GDAP1 73aa 1e-36
in ref transcript
GST_N family, Ganglioside-induced differentiation-associated protein 1 (GDAP1) subfamily; GDAP1 was originally identified as a highly expressed gene at the differentiated stage of GD3 synthase-transfected cells. More recently, mutations in GDAP1 have been reported to cause both axonal and demyelinating autosomal-recessive Charcot-Marie-Tooth (CMT) type 4A neuropathy. CMT is characterized by slow and progressive weakness and atrophy of muscles. Sequence analysis of GDAP1 shows similarities and differences with GSTs; it appears to contain both N-terminal TRX-fold and C-terminal alpha helical domains of GSTs, however, it also contains additional C-terminal transmembrane domains unlike GSTs. GDAP1 is mainly expressed in neuronal cells and is localized in the mitochondria through its transmembrane domains. It does not exhibit GST activity using standard substrates.
Changed!
TIGR maiA 118aa 7e-09
in ref transcript
Maleylacetoacetate isomerase is an enzyme of tyrosine and phenylalanine catabolism. It requires glutathione and belongs by homology to the zeta family of glutathione S-transferases. The enzyme (EC 5.2.1.2) is described as active also on maleylpyruvate, and the example from a Ralstonia sp. catabolic plasmid is described as a maleylpyruvate isomerase involved in gentisate catabolism.
Changed!
pfam GST_C 84aa 0.001
in ref transcript
Glutathione S-transferase, C-terminal domain. GST conjugates reduced glutathione to a variety of targets including S-crystallin from squid, the eukaryotic elongation factor 1-gamma, the HSP26 family of stress-related proteins and auxin-regulated proteins in plants. Stringent starvation proteins in Escherichia coli are also included in the alignment but are not known to have GST activity. The glutathione molecule binds in a cleft between N and C-terminal domains. The catalytically important residues are proposed to reside in the N-terminal domain. In plants, GSTs are encoded by a large gene family (48 GST genes in Arabidopsis) and can be divided into the phi, tau, theta, zeta, and lambda classes.
Changed!
COG Gst 267aa 6e-15
in ref transcript
Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones].
GEMIN8
rs.GEMIN8.F1 rs.GEMIN8.R1 116 204
NCBIGene 36.3 54960
Alternative 5-prime, size difference: 88
Exclusion in 5'UTR
Reference transcript: NM_017856
GFRA1
rs.GFRA1.F1 rs.GFRA1.R1 100 115
NCBIGene 36.3 2674
Single exon skipping, size difference: 15
Exclusion in the protein (no frameshift)
Reference transcript: NM_005264
pfam GDNF 95aa 3e-16
in ref transcript
GDNF/GAS1 domain. This cysteine rich domain is found in multiple copies in GNDF and GAS1 proteins. GDNF and neurturin (NTN) receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuronal survival by signaling through similar multicomponent receptors that consist of a common receptor tyrosine kinase and a member of a GPI-linked family of receptors that determines ligand specificity.
pfam GDNF 83aa 2e-11
in ref transcript
pfam GDNF 80aa 9e-10
in ref transcript
GGNBP1
rs.GGNBP1.F1 rs.GGNBP1.R1 107 170
NCBIGene 36.3 449520
Alternative 3-prime, size difference: 63
Exclusion in the protein (no frameshift)
Reference transcript: XM_001724455
GGT2
rs.GGT2.F1 rs.GGT2.R1 108 123
NCBIGene 36.3 728441
Alternative 5-prime and 3-prime, size difference: 15
Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: XM_001129377
Changed!
pfam G_glu_transpept 513aa 1e-162
in ref transcript
Gamma-glutamyltranspeptidase.
Changed!
COG Ggt 548aa 1e-116
in ref transcript
Gamma-glutamyltransferase [Amino acid transport and metabolism].
Changed!
pfam G_glu_transpept 508aa 1e-162
in modified transcript
Changed!
COG Ggt 543aa 1e-117
in modified transcript
GGTLA1
rs.GGTLA1.F1 rs.GGTLA1.R1 205 301
NCBIGene 36.3 2687
Single exon skipping, size difference: 96
Exclusion in the protein (no frameshift)
Reference transcript: NM_001099781
Changed!
pfam G_glu_transpept 523aa 1e-143
in ref transcript
Gamma-glutamyltranspeptidase.
Changed!
COG Ggt 546aa 3e-86
in ref transcript
Gamma-glutamyltransferase [Amino acid transport and metabolism].
Changed!
pfam G_glu_transpept 491aa 1e-130
in modified transcript
Changed!
COG Ggt 514aa 2e-78
in modified transcript
GH1
rs.GH1.F1 rs.GH1.R1 167 212
NCBIGene 36.3 2688
Alternative 3-prime, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_000515
Changed!
pfam Hormone_1 207aa 1e-48
in ref transcript
Somatotropin hormone family.
Changed!
pfam Hormone_1 192aa 2e-44
in modified transcript
GIPC1
rs.GIPC1.F1 rs.GIPC1.R1 165 221
NCBIGene 36.3 10755
Single exon skipping, size difference: 56
Exclusion in 5'UTR
Reference transcript: NM_005716
cd PDZ_signaling 79aa 8e-08
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
smart PDZ 82aa 5e-09
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
GIT1
rs.GIT1.F1 rs.GIT1.R1 92 119
NCBIGene 36.3 28964
Single exon skipping, size difference: 27
Exclusion in the protein (no frameshift)
Reference transcript: NM_001085454
cd ANK 98aa 6e-13
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
smart ArfGap 119aa 3e-36
in ref transcript
Putative GTP-ase activating proteins for the small GTPase, ARF. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs.
pfam GIT_SHD 31aa 3e-07
in ref transcript
Spa2 homology domain (SHD) of GIT. GIT proteins are signaling integrators with GTPase-activating function which may be involved in the organisation of the cytoskeletal matrix assembled at active zones (CAZ). The function of the CAZ might be to define sites of neurotransmitter release. Mutations in the Spa2 homology domain (SHD) domain of GIT1 described here interfere with the association of GIT1 with Piccolo, beta-PIX, and focal adhesion kinase.
pfam GIT_SHD 22aa 0.001
in ref transcript
COG COG5347 115aa 3e-14
in ref transcript
GTPase-activating protein that regulates ARFs (ADP-ribosylation factors), involved in ARF-mediated vesicular transport [Intracellular trafficking and secretion].
COG Arp 185aa 1e-06
in ref transcript
FOG: Ankyrin repeat [General function prediction only].
GLRA3
rs.GLRA3.F1 rs.GLRA3.R1 119 164
NCBIGene 36.3 8001
Single exon skipping, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_006529
Changed!
TIGR LIC 440aa 1e-144
in ref transcript
selective while glycine receptors are anion selective).
Changed!
TIGR LIC 425aa 1e-145
in modified transcript
GLT8D3
rs.GLT8D3.F1 rs.GLT8D3.R1 231 324
NCBIGene 36.3 283464
Single exon skipping, size difference: 93
Exclusion in the protein (no frameshift)
Reference transcript: NM_173601
cd GT8_like_2 302aa 1e-178
in ref transcript
GT8_like_2 represents a subfamily of GT8 with unknown function. A subfamily of glycosyltransferase family 8 with unknown function: Glycosyltransferase family 8 comprises enzymes with a number of known activities; lipopolysaccharide galactosyltransferase lipopolysaccharide glucosyltransferase 1, glycogenin glucosyltransferase and inositol 1-alpha-galactosyltransferase. It is classified as a retaining glycosyltransferase, based on the relative anomeric stereochemistry of the substrate and product in the reaction catalyzed.
pfam Glyco_transf_8 165aa 2e-11
in ref transcript
Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosyltransferase, lipopolysaccharide glucosyltransferase 1, and glycogenin glucosyltransferase.
SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation.
GNG4
rs.GNG4.F1 rs.GNG4.R1 100 212
NCBIGene 36.3 2786
Single exon skipping, size difference: 112
Exclusion in 5'UTR
Reference transcript: NM_001098721
GOLSYN
rs.GOLSYN.F1 rs.GOLSYN.R1 138 259
NCBIGene 36.3 55638
Single exon skipping, size difference: 121
Exclusion in 5'UTR
Reference transcript: NM_001099750
GOLSYN
rs.GOLSYN.F2 rs.GOLSYN.R2 264 550
NCBIGene 36.3 55638
Multiple exon skipping, size difference: 286
Exclusion of the protein initiation site, Exclusion in the protein causing a frameshift
Reference transcript: NM_001099745
GOSR1
rs.GOSR1.F1 rs.GOSR1.R1 100 122
NCBIGene 36.3 9527
Alternative 5-prime, size difference: 22
Inclusion in the protein causing a frameshift
Reference transcript: NM_004871
Changed!
pfam V-SNARE 156aa 7e-32
in ref transcript
Vesicle transport v-SNARE protein. V-SNARE proteins are required for protein traffic between eukaryotic organelles. The v-SNAREs on transport vesicles interact with t-SNAREs on target membranes in order to facilitate this.
GPR172B
rs.GPR172B.F1 rs.GPR172B.R1 148 182
NCBIGene 36.3 55065
Alternative 3-prime, size difference: 34
Inclusion in 5'UTR
Reference transcript: NM_017986
pfam DUF1011 95aa 1e-29
in ref transcript
Protein of unknown function (DUF1011). Family of uncharacterised eukaryotic proteins.
GPR18
rs.GPR18.F1 rs.GPR18.R1 100 292
NCBIGene 36.3 2841
Single exon skipping, size difference: 192
Exclusion in 5'UTR
Reference transcript: NM_005292
pfam 7tm_1 242aa 7e-25
in ref transcript
7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
GPR34
rs.GPR34.F1 rs.GPR34.R1 256 332
NCBIGene 36.3 2857
Alternative 3-prime, size difference: 3
Inclusion in 5'UTR
Reference transcript: NM_005300
pfam 7tm_1 251aa 6e-29
in ref transcript
7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
GPR64
rs.GPR64.F1 rs.GPR64.R1 134 176
NCBIGene 36.3 10149
Single exon skipping, size difference: 42
Exclusion in the protein (no frameshift)
Reference transcript: NM_001079858
pfam 7tm_2 262aa 3e-62
in ref transcript
7 transmembrane receptor (Secretin family). This family is known as Family B, the secretin-receptor family or family 2 of the G-protein-coupled receptors (GCPRs).They have been described in many animal species, but not in plants, fungi or prokaryotes. Three distinct sub-families are recognised. Subfamily B1 contains classical hormone receptors, such as receptors for secretin and glucagon, that are all involved in cAMP-mediated signalling pathways. Subfamily B2 contains receptors with long extracellular N-termini, such as the leukocyte cell-surface antigen CD97; calcium-independent receptors for latrotoxin, and brain-specific angiogenesis inhibitors amongst others. Subfamily B3 includes Methuselah and other Drosophila proteins. Other than the typical seven-transmembrane region, characteristic structural features include an amino-terminal extracellular domain involved in ligand binding, and an intracellular loop (IC3) required for specific G-protein coupling.
smart GPS 51aa 1e-10
in ref transcript
G-protein-coupled receptor proteolytic site domain. Present in latrophilin/CL-1, sea urchin REJ and polycystin.
GPR89A
rs.GPR89A.F1 rs.GPR89A.R1 104 124
NCBIGene 36.3 653519
Alternative 3-prime, size difference: 20
Inclusion in the protein causing a frameshift
Reference transcript: NM_001097612
GPR98
rs.GPR98.F1 rs.GPR98.R1 132 215
NCBIGene 36.3 84059
Alternative 5-prime, size difference: 83
Exclusion in the protein causing a frameshift
Reference transcript: NM_032119
cd PTX 156aa 5e-06
in ref transcript
Pentraxins are plasma proteins characterized by their pentameric discoid assembly and their Ca2+ dependent ligand binding, such as Serum amyloid P component (SAP) and C-reactive Protein (CRP), which are cytokine-inducible acute-phase proteins implicated in innate immunity. CRP binds to ligands containing phosphocholine, SAP binds to amyloid fibrils, DNA, chromatin, fibronectin, C4-binding proteins and glycosaminoglycans. "Long" pentraxins have N-terminal extensions to the common pentraxin domain; one group, the neuronal pentraxins, may be involved in synapse formation and remodeling, and they may also be able to form heteromultimers.
Changed!
TIGR caca 203aa 1e-17
in ref transcript
This Hmm is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family.
TIGR caca 219aa 2e-15
in ref transcript
TIGR caca 183aa 3e-13
in ref transcript
TIGR caca 208aa 1e-12
in ref transcript
Changed!
TIGR caca 213aa 2e-12
in ref transcript
TIGR caca 207aa 1e-11
in ref transcript
TIGR caca 184aa 1e-11
in ref transcript
TIGR caca 167aa 2e-11
in ref transcript
Changed!
TIGR caca 161aa 3e-11
in ref transcript
Changed!
TIGR caca 184aa 7e-11
in ref transcript
Changed!
TIGR caca 162aa 1e-10
in ref transcript
Changed!
TIGR caca 142aa 1e-10
in ref transcript
Changed!
pfam 7tm_2 245aa 2e-10
in ref transcript
7 transmembrane receptor (Secretin family). This family is known as Family B, the secretin-receptor family or family 2 of the G-protein-coupled receptors (GCPRs).They have been described in many animal species, but not in plants, fungi or prokaryotes. Three distinct sub-families are recognised. Subfamily B1 contains classical hormone receptors, such as receptors for secretin and glucagon, that are all involved in cAMP-mediated signalling pathways. Subfamily B2 contains receptors with long extracellular N-termini, such as the leukocyte cell-surface antigen CD97; calcium-independent receptors for latrotoxin, and brain-specific angiogenesis inhibitors amongst others. Subfamily B3 includes Methuselah and other Drosophila proteins. Other than the typical seven-transmembrane region, characteristic structural features include an amino-terminal extracellular domain involved in ligand binding, and an intracellular loop (IC3) required for specific G-protein coupling.
Changed!
TIGR caca 213aa 5e-10
in ref transcript
TIGR caca 234aa 2e-07
in ref transcript
TIGR caca 191aa 2e-07
in ref transcript
Changed!
TIGR caca 137aa 2e-05
in ref transcript
Changed!
pfam Calx-beta 94aa 5e-05
in ref transcript
Calx-beta domain.
Changed!
TIGR caca 184aa 1e-04
in ref transcript
Changed!
TIGR caca 169aa 2e-04
in ref transcript
Changed!
TIGR caca 95aa 2e-04
in ref transcript
TIGR caca 158aa 2e-04
in ref transcript
smart LamGL 137aa 7e-04
in ref transcript
LamG-like jellyroll fold domain.
Changed!
pfam EPTP 44aa 9e-04
in ref transcript
EPTP domain. Mutations in the LGI/Epitempin gene can result in a special form of epilepsy, autosomal dominant lateral temporal epilepsy. The Epitempin protein contains a large repeat in its C terminal section. The architecture and structural features of this repeat make it a likely member 7-bladed beta-propeller fold.
Changed!
pfam GPS 43aa 0.002
in ref transcript
Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin.
TIGR caca 223aa 0.004
in ref transcript
Changed!
TIGR caca 145aa 3e-09
in modified transcript
GPRASP1
rs.GPRASP1.F1 rs.GPRASP1.R1 162 253
NCBIGene 36.3 9737
Single exon skipping, size difference: 91
Exclusion in 5'UTR
Reference transcript: NM_014710
pfam DUF634 224aa 1e-58
in ref transcript
Protein of unknown function (DUF634). Mammalian protein of unknown function.
GPS1
rs.GPS1.F1 rs.GPS1.R1 100 112
NCBIGene 36.3 2873
Alternative 3-prime, size difference: 12
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_212492
pfam RPN7 183aa 2e-54
in ref transcript
26S proteasome subunit RPN7. RPN7 (known as the non ATPase regulatory subunit 6 in higher eukaryotes) is one of the lid subunits of the 26S proteasome and has been shown in Saccharomyces cerevisiae to be required for structural integrity. The 26S proteasome is is involved in the ATP-dependent degradation of ubiquitinated proteins.
pfam PCI 105aa 7e-19
in ref transcript
PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15).
Changed!
cd PBP1_iGluR_AMPA_GluR2 370aa 0.0
in ref transcript
N-terminal leucine/isoleucine/valine-binding protein (LIVBP)-like domain of the GluR2 subunit of the AMPA (alpha-amino-3-hydroxy-5-methyl-4-isoxazolepropionic acid) receptor. The AMPA receptor is a member of the glutamate-receptor ion channels (iGluRs) which are the major mediators of excitatory synaptic transmission in the central nervous system. AMPA receptors are composed of four types of subunits (GluR1, GluR2, GluR3, and GluR4) which combine to form a tetramer and play an important role in mediating the rapid excitatory synaptic current. Furthermore, this N-terminal domain of the iGluRs has homology with LIVBP, a bacterial periplasmic biding protein, as well as with the structurally related glutamate-binding domain of the G-protein-coupled metabotropic receptors (mGluRs).
cd PBPb 112aa 3e-13
in ref transcript
Bacterial periplasmic transport systems use membrane-bound complexes and substrate-bound, membrane-associated, periplasmic binding proteins (PBPs) to transport a wide variety of substrates, such as, amino acids, peptides, sugars, vitamins and inorganic ions. PBPs have two cell-membrane translocation functions: bind substrate, and interact with the membrane bound complex. A diverse group of periplasmic transport receptors for lysine/arginine/ornithine (LAO), glutamine, histidine, sulfate, phosphate, molybdate, and methanol are included in the PBPb CD.
cd PBPb 137aa 1e-04
in ref transcript
pfam Lig_chan 275aa 3e-98
in ref transcript
Ligand-gated ion channel. This family includes the four transmembrane regions of the ionotropic glutamate receptors and NMDA receptors.
pfam ANF_receptor 324aa 2e-39
in ref transcript
Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure.
pfam Lig_chan-Glu_bd 66aa 6e-25
in ref transcript
Ligated ion channel L-glutamate- and glycine-binding site. This region, sometimes called the S1 domain, is the luminal domain just upstream of the first, M1, transmembrane region of transmembrane ion-channel proteins, and it binds L-glutamate and glycine. It is found in association with Lig_chan, pfam00060.
COG HisJ 96aa 5e-07
in ref transcript
ABC-type amino acid transport/signal transduction systems, periplasmic component/domain [Amino acid transport and metabolism / Signal transduction mechanisms].
ABC-type branched-chain amino acid transport systems, periplasmic component [Amino acid transport and metabolism].
Changed!
cd PBP1_iGluR_AMPA_GluR2 350aa 0.0
in modified transcript
GRIA4
rs.GRIA4.F1 rs.GRIA4.R1 357 470
NCBIGene 36.3 2893
Alternative 5-prime, size difference: 113
Inclusion in the protein causing a new stop codon
Reference transcript: NM_000829
cd PBP1_iGluR_AMPA_GluR4 371aa 0.0
in ref transcript
N-terminal leucine/isoleucine/valine-binding protein (LIVBP)-like domain of the GluR4 subunit of the AMPA (alpha-amino-3-hydroxy-5-methyl-4-isoxazolepropionic acid) receptor. The AMPA receptor is a member of the glutamate-receptor ion channels (iGluRs) which are the major mediators of excitatory synaptic transmission in the central nervous system. AMPA receptors are composed of four types of subunits (GluR1, GluR2, GluR3, and GluR4) which combine to form a tetramer and play an important role in mediating the rapid excitatory synaptic current. Furthermore, this N-terminal domain of the iGluRs has homology with LIVBP, a bacterial periplasmic biding protein, as well as with the structurally related glutamate-binding domain of the G-protein-coupled metabotropic receptors (mGluRs).
cd PBPb 118aa 6e-13
in ref transcript
Bacterial periplasmic transport systems use membrane-bound complexes and substrate-bound, membrane-associated, periplasmic binding proteins (PBPs) to transport a wide variety of substrates, such as, amino acids, peptides, sugars, vitamins and inorganic ions. PBPs have two cell-membrane translocation functions: bind substrate, and interact with the membrane bound complex. A diverse group of periplasmic transport receptors for lysine/arginine/ornithine (LAO), glutamine, histidine, sulfate, phosphate, molybdate, and methanol are included in the PBPb CD.
pfam Lig_chan 275aa 1e-99
in ref transcript
Ligand-gated ion channel. This family includes the four transmembrane regions of the ionotropic glutamate receptors and NMDA receptors.
pfam ANF_receptor 338aa 5e-50
in ref transcript
Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure.
pfam Lig_chan-Glu_bd 66aa 1e-24
in ref transcript
Ligated ion channel L-glutamate- and glycine-binding site. This region, sometimes called the S1 domain, is the luminal domain just upstream of the first, M1, transmembrane region of transmembrane ion-channel proteins, and it binds L-glutamate and glycine. It is found in association with Lig_chan, pfam00060.
COG HisJ 126aa 6e-08
in ref transcript
ABC-type amino acid transport/signal transduction systems, periplasmic component/domain [Amino acid transport and metabolism / Signal transduction mechanisms].
COG LivK 329aa 6e-07
in ref transcript
ABC-type branched-chain amino acid transport systems, periplasmic component [Amino acid transport and metabolism].
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_002092
cd RRM 73aa 6e-09
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
cd RRM 69aa 1e-07
in ref transcript
cd RRM 71aa 2e-05
in ref transcript
smart RRM 68aa 6e-08
in ref transcript
RNA recognition motif.
smart RRM 69aa 6e-06
in ref transcript
smart RRM_2 69aa 7e-06
in ref transcript
RNA recognition motif.
TIGR PABP-1234 120aa 6e-04
in ref transcript
There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, and of unknown tissue range.
GSDML
rs.GSDML.F1 rs.GSDML.R1 115 142
NCBIGene 36.3 55876
Single exon skipping, size difference: 27
Exclusion in the protein (no frameshift)
Reference transcript: NM_001042471
Changed!
pfam Gasdermin 369aa 1e-101
in ref transcript
Gasdermin family. The precise function of this protein is unknown. A deletion/insertion mutation is associated with an autosomal dominant non-syndromic hearing impairment form. In addition, this protein has also been found to contribute to acquired etoposide resistance in melanoma cells. This family also includes the gasdermin protein.
Changed!
pfam Gasdermin 360aa 1e-101
in modified transcript
GSTCD
rs.GSTCD.F1 rs.GSTCD.R1 164 425
NCBIGene 36.3 79807
Alternative 5-prime, size difference: 261
Exclusion in the protein (no frameshift)
Reference transcript: NM_001031720
cd GST_C_family 72aa 4e-05
in ref transcript
Glutathione S-transferase (GST) family, C-terminal alpha helical domain; a large, diverse group of cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. In addition, GSTs also show GSH peroxidase activity and are involved in the synthesis of prostaglandins and leukotrienes. This family, also referred to as soluble GSTs, is the largest family of GSH transferases and is only distantly related to the mitochondrial GSTs (GSTK). Soluble GSTs bear no structural similarity to microsomal GSTs (MAPEG family) and display additional activities unique to their group, such as catalyzing thiolysis, reduction and isomerization of certain compounds. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. Based on sequence similarity, different classes of GSTs have been identified, which display varying tissue distribution, substrate specificities and additional specific activities. In humans, GSTs display polymorphisms which may influence individual susceptibility to diseases such as cancer, arthritis, allergy and sclerosis. Some GST family members with non-GST functions include glutaredoxin 2, the CLIC subfamily of anion channels, prion protein Ure2p, crystallins, metaxins and stringent starvation protein A.
TIGR RF_mod_HemK 69aa 6e-05
in ref transcript
Members of this protein family are HemK, a protein once thought to be involved in heme biosynthesis but now recognized to be a protein-glutamine methyltransferase that modifies the peptide chain release factors. All members of the seed alignment are encoded next to the release factor 1 gene (prfA) and confirmed by phylogenetic analysis. However, the family is diverse enough that even many members of the seed alignment do not score above the seed alignment, which was set high enough to exclude all instances of PrmB.
COG UbiE 65aa 9e-06
in ref transcript
Methylase involved in ubiquinone/menaquinone biosynthesis [Coenzyme metabolism].
GSTM1
rs.GSTM1.F1 rs.GSTM1.R1 243 354
NCBIGene 36.3 2944
Single exon skipping, size difference: 111
Exclusion in the protein (no frameshift)
Reference transcript: NM_000561
Changed!
cd GST_C_Mu 121aa 3e-41
in ref transcript
GST_C family, Class Mu subfamily; GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins, and products of oxidative stress. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. The class Mu subfamily is composed of eukaryotic GSTs. In rats, at least six distinct class Mu subunits have been identified, with homologous genes in humans for five of these subunits. Class Mu GSTs can form homodimers and heterodimers, giving a large number of possible isoenzymes that can be formed, all with overlapping activities but different substrate specificities. They are the most abundant GSTs in human liver, skeletal muscle and brain, and are believed to provide protection against diseases including cancer and neurodegenerative disorders. Some isoenzymes have additional specific functions. Human GST M1-1 acts as an endogenous inhibitor of ASK1 (apoptosis signal-regulating kinase 1) thereby suppressing ASK1-mediated cell death. Human GSTM2-2 and 3-3 have been identified as prostaglandin E2 synthases in the brain and may play crucial roles in temperature and sleep-wake regulation.
cd GST_N_Mu 82aa 3e-38
in ref transcript
GST_N family, Class Mu subfamily; GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. The GST fold contains an N-terminal TRX-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. The class Mu subfamily is composed of eukaryotic GSTs. In rats, at least six distinct class Mu subunits have been identified, with homologous genes in humans for five of these subunits. Class Mu GSTs can form homodimers and heterodimers, giving a large number of possible isoenzymes that can be formed, all with overlapping activities but different substrate specificities. They are the most abundant GSTs in human liver, skeletal muscle and brain, and are believed to provide protection against diseases including cancer and neurodegenerative disorders. Some isoenzymes have additional specific functions. Human GST M1-1 acts as an endogenous inhibitor of ASK1 (apoptosis signal-regulating kinase 1), thereby suppressing ASK1-mediated cell death. Human GSTM2-2 and 3-3 have been identified as prostaglandin E2 synthases in the brain and may play crucial roles in temperature and sleep-wake regulation.
pfam GST_N 80aa 1e-17
in ref transcript
Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation factors 1-gamma. Not known to have GST activity; similarity not previously recognised. * HSP26 family of stress-related proteins. including auxin-regulated proteins in plants and stringent starvation proteins in Escherichia coli. Not known to have GST activity. Similarity not previously recognised. The glutathione molecule binds in a cleft between N and C-terminal domains - the catalytically important residues are proposed to reside in the N-terminal domain.
Changed!
pfam GST_C 51aa 5e-10
in ref transcript
Glutathione S-transferase, C-terminal domain. GST conjugates reduced glutathione to a variety of targets including S-crystallin from squid, the eukaryotic elongation factor 1-gamma, the HSP26 family of stress-related proteins and auxin-regulated proteins in plants. Stringent starvation proteins in Escherichia coli are also included in the alignment but are not known to have GST activity. The glutathione molecule binds in a cleft between N and C-terminal domains. The catalytically important residues are proposed to reside in the N-terminal domain. In plants, GSTs are encoded by a large gene family (48 GST genes in Arabidopsis) and can be divided into the phi, tau, theta, zeta, and lambda classes.
Changed!
PTZ PTZ00057 201aa 3e-09
in ref transcript
glutathione s-transferase; Provisional.
Changed!
cd GST_C_Mu 84aa 4e-16
in modified transcript
Changed!
PTZ PTZ00057 109aa 8e-07
in modified transcript
GTF2I
rs.GTF2I.F1 rs.GTF2I.R1 110 170
NCBIGene 36.3 2969
Single exon skipping, size difference: 60
Exclusion in the protein (no frameshift)
Reference transcript: NM_032999
pfam GTF2I 73aa 6e-31
in ref transcript
GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain.
pfam GTF2I 73aa 1e-30
in ref transcript
pfam GTF2I 73aa 4e-30
in ref transcript
pfam GTF2I 73aa 1e-29
in ref transcript
pfam GTF2I 73aa 3e-26
in ref transcript
pfam GTF2I 73aa 3e-26
in ref transcript
GTPBP10
rs.GTPBP10.F1 rs.GTPBP10.R1 127 364
NCBIGene 36.3 85865
Multiple exon skipping, size difference: 237
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_033107
Changed!
cd Obg 195aa 2e-58
in ref transcript
Obg subfamily. The Obg nucleotide binding protein subfamily has been implicated in stress response, chromosome partitioning, replication initiation, mycelium development, and sporulation. Obg proteins are among a large group of GTP binding proteins conserved from bacteria to humans. The E. coli homolog, ObgE is believed to function in ribosomal biogenesis. Members of the subfamily contain two equally and highly conserved domains, a C-terminal GTP binding domain and an N-terminal glycine-rich domain.
Changed!
TIGR Obg_CgtA 330aa 5e-73
in ref transcript
This model describes a univeral, mostly one-gene-per-genome GTP-binding protein that associates with ribosomal subunits and appears to play a role in ribosomal RNA maturation. This GTPase, related to the nucleolar protein Obg, is designated CgtA in bacteria. Mutations in this gene are pleiotropic, but it appears that effects on cellular functions such as chromosome partition may be secondary to the effect on ribosome structure. Recent work done in Vibrio cholerae shows an essential role in the stringent response, in which RelA-dependent ability to synthesize the alarmone ppGpp is required for deletion of this GTPase to be lethal.
Changed!
PRK obgE 335aa 1e-72
in ref transcript
GTPase ObgE; Reviewed.
Changed!
cd Obg 188aa 3e-54
in modified transcript
Changed!
TIGR Obg_CgtA 242aa 3e-46
in modified transcript
Changed!
PRK obgE 247aa 1e-47
in modified transcript
GYG2
rs.GYG2.F1 rs.GYG2.R1 307 400
NCBIGene 36.3 8908
Single exon skipping, size difference: 93
Exclusion in the protein (no frameshift)
Reference transcript: NM_003918
cd GT8_Glycogenin 252aa 2e-87
in ref transcript
Glycogenin belongs the GT 8 family and initiates the biosynthesis of glycogen. Glycogenin initiates the biosynthesis of glycogen by incorporating glucose residues through a self-glucosylation reaction at a Tyr residue, and then acts as substrate for chain elongation by glycogen synthase and branching enzyme. It contains a conserved DxD motif and an N-terminal beta-alpha-beta Rossmann-like fold that are common to the nucleotide-binding domains of most glycosyltransferases. The DxD motif is essential for coordination of the catalytic divalent cation, most commonly Mn2+. Glycogenin can be classified as a retaining glycosyltransferase, based on the relative anomeric stereochemistry of the substrate and product in the reaction catalyzed. It is placed in glycosyltransferase family 8 which includes lipopolysaccharide glucose and galactose transferases and galactinol synthases.
pfam Glyco_transf_8 217aa 2e-51
in ref transcript
Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosyltransferase, lipopolysaccharide glucosyltransferase 1, and glycogenin glucosyltransferase.
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
COG Smc 266aa 4e-05
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
HAP1
rs.HAP1.F1 rs.HAP1.R1 102 126
NCBIGene 36.3 9001
Alternative 5-prime, size difference: 24
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_003949
Changed!
pfam HAP1_N 354aa 1e-71
in ref transcript
HAP1 N-terminal conserved region. This family represents an N-terminal conserved region found in several huntingtin-associated protein 1 (HAP1) homologues. HAP1 binds to huntingtin in a polyglutamine repeat-length-dependent manner. However, its possible role in the pathogenesis of Huntington's disease is unclear. This family also includes a similar N-terminal conserved region from hypothetical protein products of ALS2CR3 genes found in the human juvenile amyotrophic lateral sclerosis critical region 2q33-2q34.
COG Smc 148aa 4e-07
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
Changed!
pfam HAP1_N 362aa 3e-71
in modified transcript
HEY1
rs.HEY1.F1 rs.HEY1.R1 103 115
NCBIGene 36.3 23462
Alternative 5-prime, size difference: 12
Exclusion in the protein (no frameshift)
Reference transcript: NM_001040708
Changed!
cd HLH 46aa 9e-06
in ref transcript
Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
smart ORANGE 47aa 5e-09
in ref transcript
Orange domain. This domain confers specificity among members of the Hairy/E(SPL) family.
Changed!
pfam HLH 44aa 6e-07
in ref transcript
Helix-loop-helix DNA-binding domain.
Changed!
cd HLH 42aa 3e-06
in modified transcript
Changed!
pfam HLH 40aa 2e-07
in modified transcript
HGF
rs.HGF.F1 rs.HGF.R1 110 125
NCBIGene 36.3 3082
Alternative 3-prime, size difference: 15
Exclusion in the protein (no frameshift)
Reference transcript: NM_000601
cd Tryp_SPc 225aa 2e-51
in ref transcript
Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.
Changed!
cd KR 82aa 6e-28
in ref transcript
Kringle domain; Kringle domains are believed to play a role in binding mediators, such as peptides, other proteins, membranes, or phospholipids. They are autonomous structural domains, found in a varying number of copies, in blood clotting and fibrinolytic proteins, some serine proteases and plasma proteins. Plasminogen-like kringles possess affinity for free lysine and lysine-containing peptides.
cd KR 82aa 9e-28
in ref transcript
cd KR 83aa 9e-27
in ref transcript
cd KR 83aa 5e-25
in ref transcript
cd PAN_APPLE 84aa 8e-20
in ref transcript
PAN/APPLE-like domain; present in N-terminal (N) domains of plasminogen/ hepatocyte growth factor proteins, plasma prekallikrein/coagulation factor XI and microneme antigen proteins, plant receptor-like protein kinases, and various nematode and leech anti-platelet proteins. Common structural features include two disulfide bonds that link the alpha-helix to the central region of the protein. PAN domains have significant functional versatility, fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
smart Tryp_SPc 223aa 3e-56
in ref transcript
Trypsin-like serine protease. Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.
pfam Kringle 78aa 2e-32
in ref transcript
Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta.
pfam Kringle 79aa 6e-31
in ref transcript
Changed!
smart KR 83aa 1e-30
in ref transcript
Kringle domain. Named after a Danish pastry. Found in several serine proteases and in ROR-like receptors. Can occur in up to 38 copies (in apolipoprotein(a)). Plasminogen-like kringles possess affinity for free lysine and lysine- containing peptides.
pfam Kringle 79aa 2e-26
in ref transcript
smart PAN_AP 86aa 4e-07
in ref transcript
divergent subfamily of APPLE domains. Apple-like domains present in Plasminogen, C. elegans hypothetical ORFs and the extracellular portion of plant receptor-like protein kinases. Predicted to possess protein- and/or carbohydrate-binding functions.
COG COG5640 234aa 3e-10
in ref transcript
Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones].
Changed!
cd KR 77aa 7e-25
in modified transcript
Changed!
smart KR 78aa 2e-27
in modified transcript
HIF1A
rs.HIF1A.F1 rs.HIF1A.R1 262 389
NCBIGene 36.3 3091
Single exon skipping, size difference: 127
Exclusion in the protein causing a frameshift
Reference transcript: NM_001530
cd PAS 100aa 1e-12
in ref transcript
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.
cd PAS 55aa 4e-07
in ref transcript
pfam PAS_3 86aa 1e-15
in ref transcript
PAS fold. The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs. The PAS fold appears in archaea, eubacteria and eukarya.
Changed!
pfam HIF-1a_CTAD 40aa 1e-14
in ref transcript
HIF-1 alpha C terminal transactivation domain. Hypoxia inducible factor-1 alpha (HIF-1 alpha) is the regulatory subunit of the heterodimeric transcription factor HIF-1. It plays a key role in cellular response to low oxygen tension. This region corresponds to the C terminal transactivation domain.
smart PAS 56aa 8e-08
in ref transcript
PAS domain. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels ([1]; Ponting & Aravind, in press).
HIG2
rs.HIG2.F1 rs.HIG2.R1 252 366
NCBIGene 36.3 29923
Alternative 5-prime, size difference: 114
Exclusion in 5'UTR
Reference transcript: NM_013332
HIPK3
rs.HIPK3.F1 rs.HIPK3.R1 180 243
NCBIGene 36.3 10114
Single exon skipping, size difference: 63
Exclusion in the protein (no frameshift)
Reference transcript: NM_005734
cd S_TKc 224aa 3e-51
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
cd S_TKc 63aa 0.007
in ref transcript
smart S_TKc 202aa 3e-52
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
smart S_TKc 30aa 0.007
in ref transcript
PTZ PTZ00284 310aa 8e-34
in ref transcript
protein kinase; Provisional.
HLA-F
rs.HLA-F.F1 rs.HLA-F.R1 137 413
NCBIGene 36.3 3134
Single exon skipping, size difference: 276
Exclusion in the protein (no frameshift)
Reference transcript: NM_001098479
Changed!
cd IGc 92aa 2e-19
in ref transcript
Immunoglobulin domain constant region subfamily; members of the IGc subfamily are components of immunoglobulins, T-cell receptors, CD1 cell surface glycoproteins, secretory glycoproteins A/C, and Major Histocompatibility Complex (MHC) class I/II molecules. In immunoglobulins, each chain is composed of one variable domain (IGv) and one or more constant domains (IGc); these names reflect the fact that the variability in sequences is higher in the variable domain than in the constant domain. T-cell receptors form heterodimers, pairing two chains (alpha/beta or gamma/delta), each with a IGv and IGc domain. MHCs form heterodimers pairing two chains (alpha/beta or delta/epsilon), each with a MHC and IGc domain. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
pfam MHC_I 178aa 9e-84
in ref transcript
Class I Histocompatibility antigen, domains alpha 1 and 2.
Changed!
pfam C1-set 81aa 3e-22
in ref transcript
Immunoglobulin C1-set domain.
HMGA1
rs.HMGA1.F1 rs.HMGA1.R1 145 178
NCBIGene 36.3 3159
Alternative 5-prime, size difference: 33
Exclusion in 5'UTR
Reference transcript: NM_145899
HMGA1
rs.HMGA1.F2 rs.HMGA1.R2 194 308
NCBIGene 36.3 3159
Single exon skipping, size difference: 114
Exclusion in 5'UTR
Reference transcript: NM_002131
HMGCLL1
rs.HMGCLL1.F1 rs.HMGCLL1.R1 319 409
NCBIGene 36.3 54511
Single exon skipping, size difference: 90
Exclusion in the protein (no frameshift)
Reference transcript: NM_019036
pfam HMGL-like 271aa 3e-58
in ref transcript
HMGL-like. This family contains a diverse set of enzymes. These include various aldolases and a region of pyruvate carboxylase.
PRK PRK05692 286aa 1e-132
in ref transcript
hydroxymethylglutaryl-CoA lyase; Provisional.
HMGCS1
rs.HMGCS1.F1 rs.HMGCS1.R1 328 387
NCBIGene 36.3 3157
Single exon skipping, size difference: 59
Exclusion in 5'UTR
Reference transcript: NM_001098272
cd init_cond_enzymes 371aa 9e-63
in ref transcript
"initiating" condensing enzymes are a subclass of decarboxylating condensing enzymes, including beta-ketoacyl [ACP] synthase, type III and polyketide synthases, type III, which include chalcone synthase and related enzymes. They are characterized by the utlization of CoA substrate primers, as well as the nature of their active site residues.
TIGR HMG-CoA-S_euk 457aa 0.0
in ref transcript
Hydroxymethylglutaryl(HMG)-CoA synthase is the first step of isopentenyl pyrophosphate (IPP) biosynthesis via the mevalonate pathway. This pathway is found mainly in eukaryotes, but also in archaea and some bacteria. This model is specific for eukaryotes.
COG PksG 452aa 2e-75
in ref transcript
3-hydroxy-3-methylglutaryl CoA synthase [Lipid metabolism].
HMMR
rs.HMMR.F1 rs.HMMR.R1 138 183
NCBIGene 36.3 3161
Single exon skipping, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_012484
pfam SMC_N 239aa 3e-05
in ref transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Changed!
TIGR SMC_prok_A 213aa 9e-05
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
COG Smc 236aa 4e-05
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
PRK PRK05771 196aa 0.002
in ref transcript
V-type ATP synthase subunit I; Validated.
Changed!
TIGR SMC_prok_A 223aa 3e-04
in modified transcript
HN1
rs.HN1.F1 rs.HN1.R1 124 143
NCBIGene 36.3 51155
Single exon skipping, size difference: 19
Inclusion in the protein causing a frameshift
Reference transcript: NM_001002032
HNF4A
rs.HNF4A.F1 rs.HNF4A.R1 123 153
NCBIGene 36.3 3172
Alternative 5-prime, size difference: 30
Exclusion in the protein (no frameshift)
Reference transcript: NM_000457
cd NR_LBD_HNF4_like 223aa 1e-120
in ref transcript
The ligand binding domain of heptocyte nuclear factor 4, which is explosively expanded in nematodes. The ligand binding domain of hepatocyte nuclear factor 4 (HNF4) like proteins: HNF4 is a member of the nuclear receptor superfamily. HNF4 plays a key role in establishing and maintenance of hepatocyte differentiation in the liver. It is also expressed in gut, kidney, and pancreatic beta cells. HNF4 was originally classified as an orphan receptor, but later it is found that HNF4 binds with very high affinity to a variety of fatty acids. However, unlike other nuclear receptors, the ligands do not act as a molecular switch for HNF4. They seem to constantly bind to the receptor, which is constitutively active as a transcription activator. Like other members of the nuclear receptor (NR) superfamily of ligand-activated transcription factors, HNF4 has a central well conserved DNA binding domain (DBD), a variable N-terminal domain, a flexible hinge and a C-terminal ligand binding domain (LBD). The LBD domain is also responsible for recruiting co-activator proteins. More than 280 nuclear receptors are found in C. ele gans, most of which are originated from an explosive burst of duplications of HNF4.
pfam Hormone_recep 182aa 2e-41
in ref transcript
Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors.
pfam zf-C4 75aa 2e-38
in ref transcript
Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other.
HNRNPC
rs.HNRNPC.F1 rs.HNRNPC.R1 168 207
NCBIGene 36.3 3183
Alternative 5-prime, size difference: 39
Exclusion in the protein (no frameshift)
Reference transcript: NM_031314
cd RRM 68aa 1e-09
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
smart RRM_2 67aa 5e-10
in ref transcript
RNA recognition motif.
HNRNPC
rs.HNRNPC.F2 rs.HNRNPC.R2 144 170
NCBIGene 36.3 3183
Single exon skipping, size difference: 26
Exclusion in 5'UTR
Reference transcript: NM_031314
cd RRM 68aa 1e-09
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
smart RRM_2 67aa 5e-10
in ref transcript
RNA recognition motif.
HNRNPR
rs.HNRNPR.F1 rs.HNRNPR.R1 147 313
NCBIGene 36.3 10236
Single exon skipping, size difference: 166
Exclusion of the protein initiation site
Reference transcript: NM_001102398
cd RRM 73aa 1e-11
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
cd RRM 56aa 1e-11
in ref transcript
cd RRM 82aa 2e-08
in ref transcript
TIGR hnRNP-R-Q 313aa 1e-156
in ref transcript
Sequences in this subfamily include the human heterogeneous nuclear ribonucleoproteins (hnRNP) R, Q and APOBEC-1 complementation factor (aka APOBEC-1 stimulating protein). These proteins contain three RNA recognition domains (rrm: pfam00076) and a somewhat variable C-terminal domain.
TIGR hnRNP-R-Q 30aa 3e-05
in ref transcript
COG COG0724 65aa 3e-06
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
COG COG0724 138aa 7e-06
in ref transcript
HNRPF
rs.HNRPF.F1 rs.HNRPF.R1 232 273
NCBIGene 36.3 3185
Alternative 5-prime, size difference: 41
Exclusion in 5'UTR
Reference transcript: NM_004966
cd RRM 73aa 3e-09
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
cd RRM 72aa 1e-08
in ref transcript
cd RRM 76aa 3e-06
in ref transcript
pfam zf-RNPHF 36aa 7e-14
in ref transcript
RNPHF zinc finger. This domain is a putative zinc-binding domain (CHHC motif) in RNP H and F. The domain is often associated with pfam00076.
smart RRM_2 70aa 7e-10
in ref transcript
RNA recognition motif.
smart RRM_2 71aa 9e-10
in ref transcript
smart RRM_2 73aa 6e-04
in ref transcript
TIGR SF-CC1 121aa 0.007
in ref transcript
A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.
COG COG0724 159aa 5e-04
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
COG COG0724 57aa 0.002
in ref transcript
HNRPH2
rs.HNRPH2.F1 rs.HNRPH2.R1 101 113
NCBIGene 36.3 3188
Alternative 5-prime, size difference: 12
Exclusion in 5'UTR
Reference transcript: NM_019597
cd RRM 73aa 4e-09
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
cd RRM 72aa 5e-08
in ref transcript
pfam zf-RNPHF 34aa 4e-13
in ref transcript
RNPHF zinc finger. This domain is a putative zinc-binding domain (CHHC motif) in RNP H and F. The domain is often associated with pfam00076.
smart RRM_2 71aa 7e-10
in ref transcript
RNA recognition motif.
smart RRM_2 70aa 3e-09
in ref transcript
HRAS
rs.HRAS.F1 rs.HRAS.R1 176 258
NCBIGene 36.3 3265
Single exon skipping, size difference: 82
Inclusion in the protein causing a new stop codon
Reference transcript: NM_005343
Changed!
cd H_N_K_Ras_like 162aa 8e-91
in ref transcript
H-Ras/N-Ras/K-Ras subfamily. H-Ras, N-Ras, and K-Ras4A/4B are the prototypical members of the Ras family. These isoforms generate distinct signal outputs despite interacting with a common set of activators and effectors, and are strongly associated with oncogenic progression in tumor initiation. Mutated versions of Ras that are insensitive to GAP stimulation (and are therefore constitutively active) are found in a significant fraction of human cancers. Many Ras guanine nucleotide exchange factors (GEFs) have been identified. They are sequestered in the cytosol until activation by growth factors triggers recruitment to the plasma membrane or Golgi, where the GEF colocalizes with Ras. Active (GTP-bound) Ras interacts with several effector proteins that stimulate a variety of diverse cytoplasmic signaling activities. Some are known to positively mediate the oncogenic properties of Ras, including Raf, phosphatidylinositol 3-kinase (PI3K), RalGEFs, and Tiam1. Others are proposed to play negative regulatory roles in oncogenesis, including RASSF and NORE/MST1. Most Ras proteins contain a lipid modification site at the C-terminus, with a typical sequence motif CaaX, where a = an aliphatic amino acid and X = any amino acid. Lipid binding is essential for membrane attachment, a key feature of most Ras proteins. Due to the presence of truncated sequences in this CD, the lipid modification site is not available for annotation.
Changed!
smart RAS 163aa 6e-80
in ref transcript
Ras subfamily of RAS small GTPases. Similar in fold and function to the bacterial EF-Tu GTPase. p21Ras couples receptor Tyr kinases and G protein receptors to protein kinase cascades.
Changed!
COG COG1100 165aa 2e-19
in ref transcript
GTPase SAR1 and related small G proteins [General function prediction only].
Changed!
cd H_N_K_Ras_like 148aa 1e-81
in modified transcript
Changed!
smart RAS 147aa 2e-72
in modified transcript
Changed!
COG COG1100 147aa 2e-17
in modified transcript
HS6ST2
rs.HS6ST2.F1 rs.HS6ST2.R1 200 320
NCBIGene 36.3 90161
Multiple exon skipping, size difference: 120
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_001077188
Changed!
pfam HS6ST 353aa 1e-177
in ref transcript
Heparan sulfate 6-sulfotransferase (HS6ST). This family consists of several heparan sulfate 6-sulfotransferase (HS6ST) proteins. Heparan sulphate 6- O -sulphotransferase (HS6ST) catalyses the transfer of sulphate from adenosine 3'-phosphate, 5'-phosphosulphate to the 6th position of the N -sulphoglucosamine residue in heparan sulphate.
Changed!
pfam HS6ST 313aa 0.0
in modified transcript
HSF4
rs.HSF4.F1 rs.HSF4.R1 100 114
NCBIGene 36.3 3299
Alternative 3-prime, size difference: 14
Inclusion in the protein causing a frameshift
Reference transcript: NM_001040667
pfam HSF_DNA-bind 194aa 1e-59
in ref transcript
HSF-type DNA-binding.
COG HSF1 182aa 1e-29
in ref transcript
Heat shock transcription factor [Transcription].
HSF4
rs.HSF4.F2 rs.HSF4.R2 111 215
NCBIGene 36.3 3299
Alternative 3-prime, size difference: 104
Exclusion in the protein causing a frameshift
Reference transcript: NM_001040667
pfam HSF_DNA-bind 194aa 1e-59
in ref transcript
HSF-type DNA-binding.
COG HSF1 182aa 1e-29
in ref transcript
Heat shock transcription factor [Transcription].
HSPA8
rs.HSPA8.F1 rs.HSPA8.R1 91 550
NCBIGene 36.3 3312
Exon skipping and alternative 3-prime or 5-prime, size difference: 459
Exclusion in the protein (no frameshift), Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_006597
Changed!
pfam HSP70 607aa 0.0
in ref transcript
Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase domain and the carboxyl terminus is the substrate binding region.
Changed!
PTZ PTZ00009 612aa 0.0
in ref transcript
heat shock 70 kDa protein; Provisional.
Changed!
pfam HSP70 457aa 0.0
in modified transcript
Changed!
PTZ PTZ00009 462aa 0.0
in modified transcript
HTATIP2
rs.HTATIP2.F1 rs.HTATIP2.R1 189 303
NCBIGene 36.3 10553
Alternative 5-prime, size difference: 114
Exclusion of the protein initiation site
Reference transcript: NM_001098520
Changed!
TIGR hemA 97aa 0.004
in ref transcript
This enzyme, together with glutamate-1-semialdehyde-2,1-aminomutase (TIGR00713), leads to the production of delta-amino-levulinic acid from Glu-tRNA.
COG COG0702 173aa 2e-08
in ref transcript
Predicted nucleoside-diphosphate-sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism].
Changed!
TIGR hemA 78aa 0.006
in modified transcript
IARS
rs.IARS.F1 rs.IARS.R1 127 300
NCBIGene 36.3 3376
Alternative 5-prime, size difference: 173
Exclusion of the protein initiation site
Reference transcript: NM_013417
cd IleRS_core 235aa 5e-89
in ref transcript
This is the catalytic core domain of isoleucine amino-acyl tRNA synthetases (IleRS) . This class I enzyme is a monomer, which aminoacylates the 2'-OH of the nucleotide at the 3' of the appropriate tRNA. The core domain is based on the Rossman fold and is responsible for the ATP-dependent formation of the enzyme bound aminoacyl-adenylate. It contains the characteristic class I HIGH and KMSKS motifs, which are involved in ATP binding. IleRS has an insertion in the core domain, which is subject to both deletions and rearrangements. This editing region hydrolyzes mischarged cognate tRNAs and thus prevents the incorporation of chemically similar amino acids.
cd IleRS_core 143aa 2e-68
in ref transcript
TIGR ileS 860aa 0.0
in ref transcript
The isoleucyl tRNA synthetase (IleS) is a class I amino acyl-tRNA ligase and is particularly closely related to the valyl tRNA synthetase. This model may recognize IleS from every species, including eukaryotic cytosolic and mitochondrial forms.
PRK ileS 1029aa 0.0
in ref transcript
isoleucyl-tRNA synthetase; Reviewed.
ICA1L
rs.ICA1L.F1 rs.ICA1L.R1 246 316
NCBIGene 36.3 130026
Single exon skipping, size difference: 70
Inclusion in 5'UTR
Reference transcript: NM_138468
cd Arfaptin 205aa 3e-54
in ref transcript
Arfaptin domain; arfaptin is a ubiquitously expressed protein implicated in mediating cross-talk between Rac, a member of the Rho family, and Arf small GTPases; Arfaptin binds to GTP-bound Arf1 and Arf6, but binds Rac.GTP and Rac.GDP with similar affinities. Structures of Arfaptin with Rac bound to either GDP or the slowly hydrolysable analogue GMPPNP show that the switch regions adopt similar conformations in both complexes. Arf1 and Arf6 are thought to bind to the same surface as Rac.
pfam Arfaptin 206aa 3e-73
in ref transcript
Arfaptin-like domain. Arfaptin interacts with ARF1, a small GTPase involved in vesicle budding at the Golgi complex and immature secretory granules. The structure of arfaptin shows that upon binding to a small GTPase, arfaptin forms a an elongated, crescent-shaped dimer of three-helix coiled-coils. The N-terminal region of ICA69 is similar to arfaptin.
pfam ICA69 229aa 5e-54
in ref transcript
Islet cell autoantigen ICA69, C-terminal domain. This family includes a 69 kD protein which has been identified as an islet cell autoantigen in type I diabetes mellitus. Its precise function is unknown.
ICAM2
rs.ICAM2.F1 rs.ICAM2.R1 178 260
NCBIGene 36.3 3384
Single exon skipping, size difference: 82
Exclusion in 5'UTR
Reference transcript: NM_001099786
pfam ICAM_N 111aa 2e-45
in ref transcript
Intercellular adhesion molecule (ICAM), N-terminal domain. ICAMs normally functions to promote intercellular adhesion and signalling. However, The N-terminal domain of the receptor binds to the rhinovirus 'canyon' surrounding the icosahedral 5-fold axes, during the viral attachment process. This family is a family that is part of the Ig superfamily and is therefore related to the family ig (pfam00047).
ICAM2
rs.ICAM2.F2 rs.ICAM2.R2 112 229
NCBIGene 36.3 3384
Alternative 5-prime, size difference: 117
Exclusion in 5'UTR
Reference transcript: NM_001099786
pfam ICAM_N 111aa 2e-45
in ref transcript
Intercellular adhesion molecule (ICAM), N-terminal domain. ICAMs normally functions to promote intercellular adhesion and signalling. However, The N-terminal domain of the receptor binds to the rhinovirus 'canyon' surrounding the icosahedral 5-fold axes, during the viral attachment process. This family is a family that is part of the Ig superfamily and is therefore related to the family ig (pfam00047).
IFI6
rs.IFI6.F1 rs.IFI6.R1 100 112
NCBIGene 36.3 2537
Alternative 5-prime, size difference: 12
Exclusion in the protein (no frameshift)
Reference transcript: NM_022873
pfam Ifi-6-16 59aa 2e-11
in ref transcript
Interferon-induced 6-16 family.
IGFBP3
rs.IGFBP3.F1 rs.IGFBP3.R1 100 118
NCBIGene 36.3 3486
Alternative 5-prime, size difference: 18
Exclusion in the protein (no frameshift)
Reference transcript: NM_001013398
cd TY 70aa 3e-10
in ref transcript
Thyroglobulin type I repeats.; The N-terminal region of human thyroglobulin contains 11 type-1 repeats TY repeats are proposed to be inhibitors of cysteine proteases.
smart IB 78aa 4e-23
in ref transcript
Insulin growth factor-binding protein homologues. High affinity binding partners of insulin-like growth factors.
pfam Thyroglobulin_1 73aa 7e-15
in ref transcript
Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 and 5 with 6.
IHPK1
rs.IHPK1.F1 rs.IHPK1.R1 138 489
NCBIGene 36.3 9807
Single exon skipping, size difference: 351
Exclusion of the protein initiation site
Reference transcript: NM_153273
Changed!
pfam IPK 272aa 2e-68
in ref transcript
Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase.
Changed!
pfam IPK 261aa 1e-67
in modified transcript
IKZF2
rs.IKZF2.F1 rs.IKZF2.R1 158 236
NCBIGene 36.3 22807
Alternative 5-prime, size difference: 78
Exclusion in the protein (no frameshift)
Reference transcript: NM_016260
Changed!
COG COG5048 83aa 0.007
in ref transcript
FOG: Zn-finger [General function prediction only].
Changed!
COG COG5048 88aa 0.008
in modified transcript
IKZF2
rs.IKZF2.F2 rs.IKZF2.R2 103 140
NCBIGene 36.3 22807
Alternative 5-prime, size difference: 37
Inclusion in the protein causing a new stop codon
Reference transcript: NM_016260
Changed!
COG COG5048 83aa 0.007
in ref transcript
FOG: Zn-finger [General function prediction only].
IL17RE
rs.IL17RE.F1 rs.IL17RE.R1 154 290
NCBIGene 36.3 132014
Multiple exon skipping, size difference: 136
Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
Reference transcript: NM_153483
Changed!
pfam SEFIR 75aa 1e-06
in ref transcript
SEFIR domain. This family comprises IL17 receptors (IL17Rs) and SEF proteins. The latter are feedback inhibitors of FGF signalling and are also thought to be receptors. Due to its similarity to the TIR domain (pfam01582), the SEFIR region is thought to be involved in homotypic interactions with other SEFIR/TIR-domain-containing proteins. Thus, SEFs and IL17Rs may be involved in TOLL/IL1R-like signalling pathways.
IL32
rs.IL32.F1 rs.IL32.R1 102 125
NCBIGene 36.3 9235
Alternative 5-prime, size difference: 23
Exclusion in 5'UTR
Reference transcript: NM_001012632
ILF3
rs.ILF3.F1 rs.ILF3.R1 100 543
NCBIGene 36.3 3609
Alternative 3-prime, size difference: 443
Exclusion of the stop codon
Reference transcript: NM_004516
cd DSRM 64aa 9e-09
in ref transcript
Double-stranded RNA binding motif. Binding is not sequence specific but is highly specific for double stranded RNA. Found in a variety of proteins including dsRNA dependent protein kinase PKR, RNA helicases, Drosophila staufen protein, E. coli RNase III, RNases H1, and dsRNA dependent adenosine deaminases.
cd DSRM 46aa 3e-05
in ref transcript
smart DZF 255aa 1e-102
in ref transcript
domain in DSRM or ZnF_C2H2 domain containing proteins.
smart DSRM 65aa 2e-11
in ref transcript
Double-stranded RNA binding motif.
smart DSRM 47aa 2e-07
in ref transcript
PRK rnc 37aa 3e-05
in ref transcript
ribonuclease III; Reviewed.
PRK rnc 32aa 0.006
in ref transcript
ILK
rs.ILK.F1 rs.ILK.R1 145 191
NCBIGene 36.3 3611
Alternative 3-prime, size difference: 46
Inclusion in 5'UTR
Reference transcript: NM_001014794
cd PTKc 243aa 1e-32
in ref transcript
Catalytic Domain of Protein Tyrosine Kinases. Protein Tyrosine Kinase (PTK) family, catalytic domain. This PTKc family is part of a larger superfamily that includes the catalytic domains of protein serine/threonine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. They can be classified into receptor and non-receptor tyr kinases. PTKs play important roles in many cellular processes including, lymphocyte activation, epithelium growth and maintenance, metabolism control, organogenesis regulation, survival, proliferation, differentiation, migration, adhesion, motility, and morphogenesis. Receptor tyr kinases (RTKs) are integral membrane proteins which contain an extracellular ligand-binding region, a transmembrane segment, and an intracellular tyr kinase domain. RTKs are usually activated through ligand binding, which causes dimerization and autophosphorylation of the intracellular tyr kinase catalytic domain, leading to intracellular signaling. Some RTKs are orphan receptors with no known ligands. Non-receptor (or cytoplasmic) tyr kinases are distributed in different intracellular compartments and are usually multi-domain proteins containing a catalytic tyr kinase domain as well as various regulatory domains such as SH3 and SH2. PTKs are usually autoinhibited and require a mechanism for activation. In many PTKs, the phosphorylation of tyr residues in the activation loop is essential for optimal activity. Aberrant expression of PTKs is associated with many development abnormalities and cancers.
cd ANK 113aa 2e-27
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
pfam Pkinase_Tyr 254aa 3e-36
in ref transcript
Protein tyrosine kinase.
pfam Ank 30aa 2e-06
in ref transcript
Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
pfam Ank 33aa 4e-05
in ref transcript
TIGR trp 115aa 0.001
in ref transcript
after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
COG Arp 139aa 1e-14
in ref transcript
FOG: Ankyrin repeat [General function prediction only].
PTZ PTZ00283 134aa 9e-11
in ref transcript
serine/threonine protein kinase; Provisional.
IMMT
rs.IMMT.F1 rs.IMMT.R1 142 175
NCBIGene 36.3 10989
Alternative 3-prime, size difference: 33
Exclusion in the protein (no frameshift)
Reference transcript: NM_006839
Changed!
pfam Mitofilin 753aa 0.0
in ref transcript
Mitochondrial inner membrane protein. Mitofilin controls mitochondrial cristae morphology. Mitofilin is enriched in the narrow space between the inner boundary and the outer membranes, where it forms a homotypic interaction and assembles into a large multimeric protein complex. The first 78 amino acids contain a typical amino-terminal-cleavable mitochondrial presequence (residues 1-43) rich in positive-charged and hydroxylated residues and a membrane anchor domain (residues 47-66). In addition, it has three centrally located coiled coil domains (residues 200-240,280-310 and 400-420).
Changed!
pfam Mitofilin 742aa 0.0
in modified transcript
INCENP
rs.INCENP.F1 rs.INCENP.R1 107 119
NCBIGene 36.3 3619
Single exon skipping, size difference: 12
Exclusion in the protein (no frameshift)
Reference transcript: NM_001040694
pfam INCENP_ARK-bind 59aa 2e-19
in ref transcript
Inner centromere protein, ARK binding region. This region of the inner centromere protein has been found to be necessary and sufficient for binding to aurora-related kinase. This interaction has been implicated in the coordination of chromosome segregation with cell division in yeast.
INPP4B
rs.INPP4B.F1 rs.INPP4B.R1 97 196
NCBIGene 36.3 8821
Single exon skipping, size difference: 99
Exclusion in 5'UTR
Reference transcript: NM_003866
IQCB1
rs.IQCB1.F1 rs.IQCB1.R1 103 502
NCBIGene 36.3 9657
Multiple exon skipping, size difference: 399
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_001023570
Changed!
smart IQ 22aa 0.005
in modified transcript
Short calmodulin-binding motif containing conserved Ile and Gln residues. Calmodulin-binding motif.
IQCE
rs.IQCE.F1 rs.IQCE.R1 255 303
NCBIGene 36.3 23288
Single exon skipping, size difference: 48
Exclusion in the protein (no frameshift)
Reference transcript: NM_152558
TIGR SMC_prok_A 282aa 6e-05
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
COG Smc 283aa 2e-05
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
IQCH
rs.IQCH.F1 rs.IQCH.R1 102 375
NCBIGene 36.3 64799
Single exon skipping, size difference: 273
Exclusion in the protein (no frameshift)
Reference transcript: NM_001031715
IRAK1
rs.IRAK1.F1 rs.IRAK1.R1 125 362
NCBIGene 36.3 3654
Single exon skipping, size difference: 237
Exclusion in the protein (no frameshift)
Reference transcript: NM_001569
cd S_TKc 205aa 4e-35
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
pfam Pkinase 205aa 6e-36
in ref transcript
Protein kinase domain.
pfam Death 75aa 3e-07
in ref transcript
Death domain.
COG SPS1 206aa 6e-24
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
IRAK1
rs.IRAK1.F2 rs.IRAK1.R2 242 332
NCBIGene 36.3 3654
Alternative 3-prime, size difference: 90
Exclusion in the protein (no frameshift)
Reference transcript: NM_001569
cd S_TKc 205aa 4e-35
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
pfam Pkinase 205aa 6e-36
in ref transcript
Protein kinase domain.
pfam Death 75aa 3e-07
in ref transcript
Death domain.
COG SPS1 206aa 6e-24
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
IRF2BP2
rs.IRF2BP2.F1 rs.IRF2BP2.R1 221 269
NCBIGene 36.3 359948
Alternative 5-prime, size difference: 48
Exclusion in the protein (no frameshift)
Reference transcript: NM_182972
ITGA6
rs.ITGA6.F1 rs.ITGA6.R1 102 232
NCBIGene 36.3 3655
Single exon skipping, size difference: 130
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001079818
pfam Integrin_alpha2 476aa 1e-110
in ref transcript
Integrin alpha. This domain is found in integrin alpha and integrin alpha precursors to the C terminus of a number of pfam01839 repeats and to the N-terminus of the pfam00357 cytoplasmic region.
smart Int_alpha 52aa 1e-11
in ref transcript
Integrin alpha (beta-propellor repeats). Integrins are cell adhesion molecules that mediate cell-extracellular matrix and cell-cell interactions. They contain both alpha and beta subunits. Alpha integrins are proposed to contain a domain containing a 7-fold repeat that adopts a beta-propellor fold. Some of these domains contain an inserted von Willebrand factor type-A domain. Some repeats contain putative calcium-binding sites. The 7-fold repeat domain is homologous to a similar domain in phosphatidylinositol-glycan-specific phospholipase D.
smart Int_alpha 55aa 1e-07
in ref transcript
smart Int_alpha 62aa 2e-05
in ref transcript
ITGB1
rs.ITGB1.F1 rs.ITGB1.R1 113 131
NCBIGene 36.3 3688
Alternative 3-prime, size difference: 18
Exclusion in the protein (no frameshift)
Reference transcript: NM_033667
pfam Integrin_beta 431aa 0.0
in ref transcript
Integrin, beta chain. Integrins have been found in animals and their homologues have also been found in cyanobacteria, probably due to horizontal gene transfer. The sequences repeats have been trimmed due to an overlap with EGF.
pfam Integrin_B_tail 89aa 4e-24
in ref transcript
Integrin beta tail domain. This is the beta tail domain of the Integrin protein. Integrins are receptors which are involved in cell-cell and cell-extracellular matrix interactions.
Changed!
pfam Integrin_b_cyt 21aa 5e-06
in ref transcript
Integrin beta cytoplasmic domain. Integrins are a group of transmembrane proteins which function as extracellular matrix receptors and in cell adhesion. Integrins are ubiquitously expressed and are heterodimeric, each composed of an alpha and beta subunit. Several variations of the the alpha and beta subunits exist, and association of different alpha and beta subunits can have different a different binding specificity. This domain corresponds to the cytoplasmic domain of the beta subunit.
Changed!
pfam Integrin_b_cyt 20aa 7e-06
in modified transcript
ITPR1
rs.ITPR1.F1 rs.ITPR1.R1 201 246
NCBIGene 36.3 3708
Single exon skipping, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_001099952
pfam Ins145_P3_rec 222aa 4e-95
in ref transcript
Inositol 1,4,5-trisphosphate/ryanodine receptor. This domain corresponds to the ligand binding region on inositol 1,4,5-trisphosphate receptor, and the N terminal region of the ryanodine receptor. Both receptors are involved in Ca2+ release. They can couple to the activation of neurotransmitter-gated receptors and voltage-gated Ca2+ channels on the plasma membrane, thus allowing the endoplasmic reticulum discriminate between different types of neuronal activity.
pfam RYDR_ITPR 206aa 9e-67
in ref transcript
RIH domain. The RIH (RyR and IP3R Homology) domain is an extracellular domain from two types of calcium channels. This region is found in the ryanodine receptor and the inositol-1,4,5- trisphosphate receptor. This domain may form a binding site for IP3.
Changed!
pfam MIR 202aa 3e-66
in ref transcript
MIR domain. The MIR (protein mannosyltransferase, IP3R and RyR) domain is a domain that may have a ligand transferase function.
pfam RIH_assoc 120aa 4e-55
in ref transcript
RyR and IP3R Homology associated. This eukaryotic domain is found in ryanodine receptors (RyR) and inositol 1,4,5-trisphosphate receptors (IP3R) which together form a superfamily of homotetrameric ligand-gated intracellular Ca2+ channels. There seems to be no known function for this domain. Also see the IP3-binding domain pfam01365 and pfam02815.
pfam RYDR_ITPR 180aa 8e-55
in ref transcript
pfam Ion_trans 224aa 5e-06
in ref transcript
Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
COG PMT1 142aa 0.004
in ref transcript
Dolichyl-phosphate-mannose--protein O-mannosyl transferase [Posttranslational modification, protein turnover, chaperones].
Changed!
pfam MIR 187aa 5e-68
in modified transcript
ITSN2
rs.ITSN2.F1 rs.ITSN2.R1 426 507
NCBIGene 36.3 50618
Single exon skipping, size difference: 81
Exclusion in the protein (no frameshift)
Reference transcript: NM_006277
cd RhoGEF 184aa 2e-35
in ref transcript
Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases; Also called Dbl-homologous (DH) domain. It appears that PH domains invariably occur C-terminal to RhoGEF/DH domains.
cd C2 89aa 2e-18
in ref transcript
Protein kinase C conserved region 2 (CalB); Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands.
cd EH 67aa 3e-17
in ref transcript
Eps15 homology domain; found in proteins implicated in endocytosis, vesicle transport, and signal transduction. The alignment contains a pair of EF-hand motifs, typically one of them is canonical and binds to Ca2+, while the other may not bind to Ca2+. A hydrophobic binding pocket is formed by residues from both EF-hand motifs. The EH domain binds to proteins containing NPF (class I), [WF]W or SWG (class II), or H[TS]F (class III) sequence motifs.
cd EH 58aa 2e-14
in ref transcript
cd SH3 52aa 4e-14
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
cd SH3 54aa 2e-11
in ref transcript
cd SH3 51aa 5e-11
in ref transcript
cd SH3 52aa 6e-10
in ref transcript
cd SH3 58aa 9e-08
in ref transcript
cd PH 106aa 7e-04
in ref transcript
Pleckstrin homology (PH) domain. PH domains are only found in eukaryotes. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.
smart RhoGEF 181aa 2e-40
in ref transcript
Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that PH domains invariably occur C-terminal to RhoGEF/DH domains. Improved coverage.
smart EH 93aa 4e-22
in ref transcript
Eps15 homology domain. Pair of EF hand motifs that recognise proteins containing Asn-Pro-Phe (NPF) sequences.
smart EH 80aa 6e-22
in ref transcript
smart C2 97aa 1e-20
in ref transcript
Protein kinase C conserved region 2 (CalB). Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotamins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates, and intracellular proteins. Unusual occurrence in perforin. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands. SMART detects C2 domains using one or both of two profiles.
smart SH3 55aa 2e-16
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
smart SH3 55aa 4e-13
in ref transcript
smart SH3 56aa 1e-12
in ref transcript
smart SH3 53aa 3e-12
in ref transcript
smart SH3 61aa 5e-09
in ref transcript
Changed!
pfam SMC_N 227aa 3e-05
in ref transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
smart PH 109aa 7e-05
in ref transcript
Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
Ca2+-dependent lipid-binding protein, contains C2 domain [General function prediction only].
Changed!
COG Smc 227aa 1e-05
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
Changed!
TIGR SMC_prok_B 200aa 5e-08
in modified transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
Changed!
COG Smc 200aa 2e-08
in modified transcript
JARID1A
rs.JARID1A.F1 rs.JARID1A.R1 241 323
NCBIGene 36.3 5927
Single exon skipping, size difference: 82
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001042603
cd BAH_plant_2 30aa 0.003
in ref transcript
BAH, or Bromo Adjacent Homology domain, plant-specific sub-family with unknown function. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
pfam PLU-1 333aa 1e-106
in ref transcript
PLU-1-like protein. Sequences in this family bear similarity to the central region of PLU-1. This is a nuclear protein that may have a role in DNA-binding and transcription, and is closely associated with the malignant phenotype of breast cancer. This region is found in various other Jumonji/ARID domain-containing proteins (see pfam02373, pfam01388).
pfam JmjC 117aa 2e-47
in ref transcript
JmjC domain. The JmjC domain belongs to the Cupin superfamily. JmjC-domain proteins may be protein hydroxylases that catalyse a novel histone modification.
pfam ARID 109aa 2e-33
in ref transcript
ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain.
pfam zf-C5HC2 54aa 1e-19
in ref transcript
C5HC2 zinc finger. Predicted zinc finger with eight potential zinc ligand binding residues. This domain is found in Jumonji. This domain may have a DNA binding function.
pfam PHD 49aa 1e-11
in ref transcript
PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
smart JmjN 35aa 2e-09
in ref transcript
Small domain found in the jumonji family of transcription factors. To date, this domain always co-occurs with the JmjC domain (although the reverse is not true).
Changed!
pfam PHD 47aa 3e-06
in ref transcript
smart PHD 52aa 4e-05
in ref transcript
PHD zinc finger. The plant homeodomain (PHD) finger is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in epigenetics and chromatin-mediated transcriptional regulation. The PHD finger binds two zinc ions using the so-called 'cross-brace' motif and is thus structurally related to the RI NG finger and the FYV E finger. It is not yet known if PHD fingers have a common molecular function. Several reports suggest that it can function as a protein-protein interacton domain and it was recently demonstrated that the PHD finger of p300 can cooperate with the adjacent BR OMO domain in nucleosome binding in vitro. Other reports suggesting that the PHD finger is a ubiquitin ligase have been refuted as these domains were RI NG fingers misidentified as PHD fingers.
JARID1A
rs.JARID1A.F2 rs.JARID1A.R2 100 115
NCBIGene 36.3 5927
Alternative 5-prime and 3-prime, size difference: 15
Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_001042603
cd BAH_plant_2 30aa 0.003
in ref transcript
BAH, or Bromo Adjacent Homology domain, plant-specific sub-family with unknown function. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
pfam PLU-1 333aa 1e-106
in ref transcript
PLU-1-like protein. Sequences in this family bear similarity to the central region of PLU-1. This is a nuclear protein that may have a role in DNA-binding and transcription, and is closely associated with the malignant phenotype of breast cancer. This region is found in various other Jumonji/ARID domain-containing proteins (see pfam02373, pfam01388).
pfam JmjC 117aa 2e-47
in ref transcript
JmjC domain. The JmjC domain belongs to the Cupin superfamily. JmjC-domain proteins may be protein hydroxylases that catalyse a novel histone modification.
pfam ARID 109aa 2e-33
in ref transcript
ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain.
pfam zf-C5HC2 54aa 1e-19
in ref transcript
C5HC2 zinc finger. Predicted zinc finger with eight potential zinc ligand binding residues. This domain is found in Jumonji. This domain may have a DNA binding function.
pfam PHD 49aa 1e-11
in ref transcript
PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
smart JmjN 35aa 2e-09
in ref transcript
Small domain found in the jumonji family of transcription factors. To date, this domain always co-occurs with the JmjC domain (although the reverse is not true).
pfam PHD 47aa 3e-06
in ref transcript
smart PHD 52aa 4e-05
in ref transcript
PHD zinc finger. The plant homeodomain (PHD) finger is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in epigenetics and chromatin-mediated transcriptional regulation. The PHD finger binds two zinc ions using the so-called 'cross-brace' motif and is thus structurally related to the RI NG finger and the FYV E finger. It is not yet known if PHD fingers have a common molecular function. Several reports suggest that it can function as a protein-protein interacton domain and it was recently demonstrated that the PHD finger of p300 can cooperate with the adjacent BR OMO domain in nucleosome binding in vitro. Other reports suggesting that the PHD finger is a ubiquitin ligase have been refuted as these domains were RI NG fingers misidentified as PHD fingers.
JMJD1C
rs.JMJD1C.F1 rs.JMJD1C.R1 260 314
NCBIGene 36.3 221037
Alternative 3-prime, size difference: 54
Exclusion in the protein (no frameshift)
Reference transcript: NM_032776
pfam JmjC 93aa 6e-08
in ref transcript
JmjC domain. The JmjC domain belongs to the Cupin superfamily. JmjC-domain proteins may be protein hydroxylases that catalyse a novel histone modification.
KCNG1
rs.KCNG1.F1 rs.KCNG1.R1 103 377
NCBIGene 36.3 3755
Exon skipping and alternative 3-prime or 5-prime, size difference: 274
Inclusion in 5'UTR, Inclusion in 5'UTR
Reference transcript: NM_002237
pfam K_tetra 97aa 6e-27
in ref transcript
K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerisation domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly related to the BTB/POZ domain pfam00651.
pfam Ion_trans 190aa 7e-13
in ref transcript
Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
PRK PRK10537 91aa 1e-09
in ref transcript
voltage-gated potassium channel; Provisional.
KCNIP2
rs.KCNIP2.F1 rs.KCNIP2.R1 102 147
NCBIGene 36.3 30819
Alternative 3-prime, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_014591
cd EFh 63aa 6e-08
in ref transcript
EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
cd EFh 72aa 1e-07
in ref transcript
smart EH 87aa 7e-04
in ref transcript
Eps15 homology domain. Pair of EF hand motifs that recognise proteins containing Asn-Pro-Phe (NPF) sequences.
COG FRQ1 167aa 3e-18
in ref transcript
Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].
KCNIP2
rs.KCNIP2.F2 rs.KCNIP2.R2 119 140
NCBIGene 36.3 30819
Alternative 3-prime, size difference: 21
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_014591
cd EFh 63aa 6e-08
in ref transcript
EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
cd EFh 72aa 1e-07
in ref transcript
smart EH 87aa 7e-04
in ref transcript
Eps15 homology domain. Pair of EF hand motifs that recognise proteins containing Asn-Pro-Phe (NPF) sequences.
Changed!
COG FRQ1 167aa 3e-18
in ref transcript
Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].
Changed!
COG FRQ1 174aa 1e-17
in modified transcript
KCNJ16
rs.KCNJ16.F1 rs.KCNJ16.R1 159 240
NCBIGene 36.3 3773
Exon skipping and alternative 3-prime or 5-prime, size difference: 81
Inclusion in 5'UTR, Exclusion in 5'UTR
Reference transcript: NM_018658
pfam IRK 315aa 1e-152
in ref transcript
Inward rectifier potassium channel.
KCNK2
rs.KCNK2.F1 rs.KCNK2.R1 149 273
NCBIGene 36.3 3776
Alternative 5-prime, size difference: 124
Exclusion of the protein initiation site
Reference transcript: NM_001017425
pfam Ion_trans_2 52aa 2e-13
in ref transcript
Ion channel. This family includes the two membrane helix type ion channels found in bacteria.
pfam Ion_trans_2 60aa 7e-09
in ref transcript
KCNMA1
rs.KCNMA1.F1 rs.KCNMA1.R1 100 112
NCBIGene 36.3 3778
Single exon skipping, size difference: 12
Exclusion in the protein (no frameshift)
Reference transcript: NM_001014797
pfam BK_channel_a 99aa 5e-29
in ref transcript
Calcium-activated BK potassium channel alpha subunit.
pfam Ion_trans 172aa 8e-14
in ref transcript
Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
pfam TrkA_N 114aa 0.004
in ref transcript
TrkA-N domain. This domain is found in a wide variety of proteins. These protein include potassium channels, phosphoesterases, and various other transporters. This domain binds to NAD.
PRK PRK10537 76aa 7e-11
in ref transcript
voltage-gated potassium channel; Provisional.
KDELR2
rs.KDELR2.F1 rs.KDELR2.R1 280 533
NCBIGene 36.3 11014
Single exon skipping, size difference: 253
Exclusion in the protein causing a frameshift
Reference transcript: NM_006854
Changed!
pfam ER_lumen_recept 143aa 3e-50
in ref transcript
ER lumen protein retaining receptor.
Changed!
COG ERD2 212aa 3e-51
in ref transcript
ER lumen protein retaining receptor [Intracellular trafficking and secretion].
Changed!
pfam ER_lumen_recept 90aa 6e-30
in modified transcript
Changed!
COG ERD2 117aa 2e-24
in modified transcript
KIAA0232
rs.KIAA0232.F1 rs.KIAA0232.R1 198 282
NCBIGene 36.3 9778
Single exon skipping, size difference: 84
Exclusion in 5'UTR
Reference transcript: NM_014743
KIAA1191
rs.KIAA1191.F1 rs.KIAA1191.R1 195 282
NCBIGene 36.3 57179
Single exon skipping, size difference: 87
Exclusion of the protein initiation site
Reference transcript: NM_020444
KIAA1191
rs.KIAA1191.F2 rs.KIAA1191.R2 205 313
NCBIGene 36.3 57179
Single exon skipping, size difference: 108
Exclusion in 5'UTR
Reference transcript: NM_020444
KIAA1217
rs.KIAA1217.F1 rs.KIAA1217.R1 297 402
NCBIGene 36.3 56243
Single exon skipping, size difference: 105
Exclusion in the protein (no frameshift)
Reference transcript: NM_019590
KIAA1543
rs.KIAA1543.F1 rs.KIAA1543.R1 196 277
NCBIGene 36.3 57662
Multiple exon skipping, size difference: 81
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_001080429
pfam DUF1781 127aa 2e-68
in ref transcript
Protein of unknown function (DUF1781). This is a family of uncharacterised proteins. The structure of a murine hypothetical protein from RIKEN cDNA has shown it to adopt a mainly beta barrel structure with an alpha hairpin.
TIGR PDHac_trf_mito 71aa 0.004
in ref transcript
This model represents one of several closely related clades of the dihydrolipoamide acetyltransferase subunit of the pyruvate dehydrogenase complex. It includes sequences from mitochondria and from alpha and beta branches of the proteobacteria, as well as from some other bacteria. Sequences from Gram-positive bacteria are not included. The non-enzymatic homolog protein X, which serves as an E3 component binding protein, falls within the clade phylogenetically but is rejected by its low score.
KIAA1833
rs.KIAA1833.F1 rs.KIAA1833.R1 107 227
NCBIGene 36.3 727957
Single exon skipping, size difference: 120
Exclusion in 5'UTR
Reference transcript: NM_032450
KIF13A
rs.KIF13A.F1 rs.KIF13A.R1 134 173
NCBIGene 36.3 63971
Single exon skipping, size difference: 39
Exclusion in the protein (no frameshift)
Reference transcript: NM_022113
cd KISc_KIF1A_KIF1B 356aa 1e-152
in ref transcript
Kinesin motor domain, KIF1_like proteins. KIF1A (Unc104) transports synaptic vesicles to the nerve terminal, KIF1B has been implicated in transport of mitochondria. Both proteins are expressed in neurons. This catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Kinesins are microtubule-dependent molecular motors that play important roles in intracellular transport and in cell division. In most kinesins, the motor domain is found at the N-terminus (N-type). N-type kinesins are (+) end-directed motors, i.e. they transport cargo towards the (+) end of the microtubule. In contrast to the majority of dimeric kinesins, most KIF1A/Unc104 kinesins are monomeric motors. A lysine-rich loop in KIF1A binds to the negatively charged C-terminus of tubulin and compensates for the lack of a second motor domain, allowing KIF1A to move processively.
cd FHA 98aa 8e-08
in ref transcript
Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain. FHA domains may bind phosphothreonine, phosphoserine and sometimes phosphotyrosine. In eukaryotes, many FHA domain-containing proteins localize to the nucleus, where they participate in establishing or maintaining cell cycle checkpoints, DNA repair, or transcriptional regulation. Members of the FHA family include: Dun1, Rad53, Cds1, Mek1, KAPP(kinase-associated protein phosphatase),and Ki-67 (a human nuclear protein related to cell proliferation).
smart KISc 355aa 1e-129
in ref transcript
Kinesin motor, catalytic domain. ATPase. Microtubule-dependent molecular motors that play important roles in intracellular transport of organelles and in cell division.
pfam FHA 65aa 3e-07
in ref transcript
FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif.
TIGR SMC_prok_B 219aa 0.002
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
COG KIP1 359aa 1e-69
in ref transcript
Kinesin-like protein [Cytoskeleton].
COG Smc 166aa 0.001
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
KIF13A
rs.KIF13A.F2 rs.KIF13A.R2 232 337
NCBIGene 36.3 63971
Single exon skipping, size difference: 105
Exclusion in the protein (no frameshift)
Reference transcript: NM_022113
cd KISc_KIF1A_KIF1B 356aa 1e-152
in ref transcript
Kinesin motor domain, KIF1_like proteins. KIF1A (Unc104) transports synaptic vesicles to the nerve terminal, KIF1B has been implicated in transport of mitochondria. Both proteins are expressed in neurons. This catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Kinesins are microtubule-dependent molecular motors that play important roles in intracellular transport and in cell division. In most kinesins, the motor domain is found at the N-terminus (N-type). N-type kinesins are (+) end-directed motors, i.e. they transport cargo towards the (+) end of the microtubule. In contrast to the majority of dimeric kinesins, most KIF1A/Unc104 kinesins are monomeric motors. A lysine-rich loop in KIF1A binds to the negatively charged C-terminus of tubulin and compensates for the lack of a second motor domain, allowing KIF1A to move processively.
cd FHA 98aa 8e-08
in ref transcript
Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain. FHA domains may bind phosphothreonine, phosphoserine and sometimes phosphotyrosine. In eukaryotes, many FHA domain-containing proteins localize to the nucleus, where they participate in establishing or maintaining cell cycle checkpoints, DNA repair, or transcriptional regulation. Members of the FHA family include: Dun1, Rad53, Cds1, Mek1, KAPP(kinase-associated protein phosphatase),and Ki-67 (a human nuclear protein related to cell proliferation).
smart KISc 355aa 1e-129
in ref transcript
Kinesin motor, catalytic domain. ATPase. Microtubule-dependent molecular motors that play important roles in intracellular transport of organelles and in cell division.
pfam FHA 65aa 3e-07
in ref transcript
FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif.
TIGR SMC_prok_B 219aa 0.002
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
COG KIP1 359aa 1e-69
in ref transcript
Kinesin-like protein [Cytoskeleton].
COG Smc 166aa 0.001
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
KIT
rs.KIT.F1 rs.KIT.R1 100 112
NCBIGene 36.3 3815
Alternative 5-prime, size difference: 12
Exclusion in the protein (no frameshift)
Reference transcript: NM_000222
cd PTKc_Kit 376aa 0.0
in ref transcript
Catalytic Domain of the Protein Tyrosine Kinase, Kit. Protein Tyrosine Kinase (PTK) family; Kit (or c-Kit); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. Kit is a member of the Platelet Derived Growth Factor Receptor (PDGFR) subfamily of proteins, which are receptor tyr kinases (RTKs) containing an extracellular ligand-binding region with five immunoglobulin-like domains, a transmembrane segment, and an intracellular catalytic domain. The binding of Kit to its ligand, the stem-cell factor (SCF), leads to receptor dimerization, trans phosphorylation and activation, and intracellular signaling. Kit is important in the development of melanocytes, germ cells, mast cells, hematopoietic stem cells, the interstitial cells of Cajal, and the pacemaker cells of the GI tract. Kit signaling is involved in major cellular functions including cell survival, proliferation, differentiation, adhesion, and chemotaxis. Mutations in Kit, which result in constitutive ligand-independent activation, are found in human cancers such as gastrointestinal stromal tumor (GIST) and testicular germ cell tumor (TGCT). The aberrant expression of Kit and/or SCF is associated with other tumor types such as systemic mastocytosis and cancers of the breast, neurons, lung, prostate, colon, and rectum. Although the structure of the human Kit catalytic domain is known, it is excluded from this specific alignment model because it contains a deletion in its sequence.
cd IG 75aa 6e-06
in ref transcript
Immunoglobulin domain family; members are components of immunoglobulins, neuroglia, cell surface glycoproteins, such as, T-cell receptors, CD2, CD4, CD8, and membrane glycoproteins, such as, butyrophilin and chondroitin sulfate proteoglycan core protein. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
pfam Pkinase_Tyr 154aa 4e-73
in ref transcript
Protein tyrosine kinase.
smart STYKc 304aa 1e-30
in ref transcript
Protein kinase; unclassified specificity. Phosphotransferases. The specificity of this class of kinases can not be predicted. Possible dual-specificity Ser/Thr/Tyr kinase.
smart IG_like 87aa 8e-09
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
COG SPS1 206aa 2e-12
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
COG SPS1 136aa 2e-05
in ref transcript
KITLG
rs.KITLG.F1 rs.KITLG.R1 452 536
NCBIGene 36.3 4254
Single exon skipping, size difference: 84
Exclusion in the protein (no frameshift)
Reference transcript: NM_000899
Changed!
pfam SCF 273aa 1e-134
in ref transcript
Stem cell factor. Stem cell factor (SCF) is a homodimer involved in hematopoiesis. SCF binds to and activates the SCF receptor (SCFR), a receptor tyrosine kinase. The crystal structure of human SCF has been resolved and a potential receptor-binding site identified.
Changed!
pfam SCF 245aa 1e-121
in modified transcript
KLK15
rs.KLK15.F1 rs.KLK15.R1 186 323
NCBIGene 36.3 55554
Single exon skipping, size difference: 137
Exclusion in the protein causing a frameshift
Reference transcript: NM_017509
Changed!
cd Tryp_SPc 228aa 3e-65
in ref transcript
Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.
Changed!
smart Tryp_SPc 225aa 9e-73
in ref transcript
Trypsin-like serine protease. Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.
Changed!
COG COG5640 235aa 4e-17
in ref transcript
Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones].
Changed!
cd Tryp_SPc 127aa 5e-36
in modified transcript
Changed!
smart Tryp_SPc 127aa 2e-37
in modified transcript
Changed!
COG COG5640 99aa 0.001
in modified transcript
KLK2
rs.KLK2.F1 rs.KLK2.R1 193 230
NCBIGene 36.3 3817
Alternative 5-prime, size difference: 37
Inclusion in the protein causing a frameshift
Reference transcript: NM_005551
Changed!
cd Tryp_SPc 232aa 1e-59
in ref transcript
Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.
Changed!
smart Tryp_SPc 230aa 2e-67
in ref transcript
Trypsin-like serine protease. Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.
Changed!
COG COG5640 235aa 1e-12
in ref transcript
Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones].
Changed!
cd Tryp_SPc 185aa 3e-48
in modified transcript
Changed!
smart Tryp_SPc 186aa 1e-54
in modified transcript
Changed!
COG COG5640 51aa 6e-06
in modified transcript
KLK3
rs.KLK3.F1 rs.KLK3.R1 107 549
NCBIGene 36.3 354
Alternative 3-prime, size difference: 442
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001648
Changed!
cd Tryp_SPc 232aa 3e-64
in ref transcript
Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.
Changed!
smart Tryp_SPc 230aa 5e-71
in ref transcript
Trypsin-like serine protease. Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.
Changed!
COG COG5640 250aa 1e-21
in ref transcript
Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones].
Changed!
cd Tryp_SPc 186aa 8e-45
in modified transcript
Changed!
smart Tryp_SPc 186aa 9e-50
in modified transcript
Changed!
COG COG5640 205aa 2e-08
in modified transcript
KLK6
rs.KLK6.F1 rs.KLK6.R1 150 198
NCBIGene 36.3 5653
Single exon skipping, size difference: 48
Exclusion of the protein initiation site
Reference transcript: NM_002774
Changed!
cd Tryp_SPc 218aa 6e-64
in ref transcript
Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.
Changed!
smart Tryp_SPc 215aa 9e-72
in ref transcript
Trypsin-like serine protease. Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.
Changed!
COG COG5640 239aa 1e-22
in ref transcript
Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones].
KLK8
rs.KLK8.F1 rs.KLK8.R1 264 399
NCBIGene 36.3 11202
Alternative 3-prime, size difference: 135
Exclusion in the protein (no frameshift)
Reference transcript: NM_144505
cd Tryp_SPc 223aa 2e-55
in ref transcript
Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.
smart Tryp_SPc 221aa 3e-63
in ref transcript
Trypsin-like serine protease. Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.
Changed!
COG COG5640 236aa 4e-13
in ref transcript
Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones].
Changed!
COG COG5640 244aa 1e-13
in modified transcript
KRAS
rs.KRAS.F1 rs.KRAS.R1 182 306
NCBIGene 36.3 3845
Single exon skipping, size difference: 124
Exclusion of the stop codon
Reference transcript: NM_033360
Changed!
cd H_N_K_Ras_like 162aa 3e-89
in ref transcript
H-Ras/N-Ras/K-Ras subfamily. H-Ras, N-Ras, and K-Ras4A/4B are the prototypical members of the Ras family. These isoforms generate distinct signal outputs despite interacting with a common set of activators and effectors, and are strongly associated with oncogenic progression in tumor initiation. Mutated versions of Ras that are insensitive to GAP stimulation (and are therefore constitutively active) are found in a significant fraction of human cancers. Many Ras guanine nucleotide exchange factors (GEFs) have been identified. They are sequestered in the cytosol until activation by growth factors triggers recruitment to the plasma membrane or Golgi, where the GEF colocalizes with Ras. Active (GTP-bound) Ras interacts with several effector proteins that stimulate a variety of diverse cytoplasmic signaling activities. Some are known to positively mediate the oncogenic properties of Ras, including Raf, phosphatidylinositol 3-kinase (PI3K), RalGEFs, and Tiam1. Others are proposed to play negative regulatory roles in oncogenesis, including RASSF and NORE/MST1. Most Ras proteins contain a lipid modification site at the C-terminus, with a typical sequence motif CaaX, where a = an aliphatic amino acid and X = any amino acid. Lipid binding is essential for membrane attachment, a key feature of most Ras proteins. Due to the presence of truncated sequences in this CD, the lipid modification site is not available for annotation.
Changed!
smart RAS 163aa 2e-78
in ref transcript
Ras subfamily of RAS small GTPases. Similar in fold and function to the bacterial EF-Tu GTPase. p21Ras couples receptor Tyr kinases and G protein receptors to protein kinase cascades.
Changed!
COG COG1100 185aa 3e-18
in ref transcript
GTPase SAR1 and related small G proteins [General function prediction only].
Changed!
cd H_N_K_Ras_like 162aa 9e-90
in modified transcript
Changed!
smart RAS 161aa 6e-78
in modified transcript
Changed!
COG COG1100 163aa 5e-18
in modified transcript
KRT80
rs.KRT80.F1 rs.KRT80.R1 107 142
NCBIGene 36.3 144501
Alternative 3-prime, size difference: 35
Inclusion in the protein causing a new stop codon
Reference transcript: NM_182507
pfam Filament 312aa 1e-69
in ref transcript
Intermediate filament protein.
COG Smc 253aa 2e-05
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
KTN1
rs.KTN1.F1 rs.KTN1.R1 121 208
NCBIGene 36.3 3895
Single exon skipping, size difference: 87
Exclusion in the protein (no frameshift)
Reference transcript: NM_001079521
TIGR SMC_prok_B 226aa 1e-05
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
Changed!
TIGR SMC_prok_B 278aa 2e-05
in ref transcript
Changed!
COG Smc 222aa 0.002
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
Changed!
TIGR SMC_prok_B 272aa 3e-05
in modified transcript
KTN1
rs.KTN1.F2 rs.KTN1.R2 163 247
NCBIGene 36.3 3895
Single exon skipping, size difference: 84
Exclusion in the protein (no frameshift)
Reference transcript: NM_001079521
TIGR SMC_prok_B 226aa 1e-05
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
TIGR SMC_prok_B 278aa 2e-05
in ref transcript
COG Smc 222aa 0.002
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
KTN1
rs.KTN1.F3 rs.KTN1.R3 148 204
NCBIGene 36.3 3895
Single exon skipping, size difference: 56
Exclusion of the stop codon
Reference transcript: NM_182926
TIGR SMC_prok_B 226aa 1e-05
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
TIGR SMC_prok_B 278aa 2e-05
in ref transcript
COG Smc 222aa 0.002
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
KTN1
rs.KTN1.F4 FOX.KTN1.R1 117 186
NCBIGene 36.3 3895
Single exon skipping, size difference: 69
Exclusion in the protein (no frameshift)
Reference transcript: NM_001079521
TIGR SMC_prok_B 226aa 1e-05
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
Changed!
TIGR SMC_prok_B 278aa 2e-05
in ref transcript
Changed!
COG Smc 222aa 0.002
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
Changed!
TIGR SMC_prok_B 255aa 1e-04
in modified transcript
KTN1
rs.KTN1.F5 rs.KTN1.R5 277 401
NCBIGene 36.3 3895
Single exon skipping, size difference: 124
Exclusion in 5'UTR
Reference transcript: NM_182926
TIGR SMC_prok_B 226aa 1e-05
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
TIGR SMC_prok_B 278aa 2e-05
in ref transcript
COG Smc 222aa 0.002
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
L1CAM
rs.L1CAM.F1 rs.L1CAM.R1 100 112
NCBIGene 36.3 3897
Single exon skipping, size difference: 12
Exclusion in the protein (no frameshift)
Reference transcript: NM_000425
cd IGcam 85aa 4e-14
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IGcam 88aa 9e-14
in ref transcript
cd FN3 97aa 8e-12
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd IGcam 92aa 6e-11
in ref transcript
cd FN3 86aa 7e-10
in ref transcript
cd IGcam 87aa 1e-09
in ref transcript
cd IGcam 86aa 6e-08
in ref transcript
cd FN3 88aa 1e-06
in ref transcript
cd FN3 82aa 7e-06
in ref transcript
smart IGc2 64aa 1e-14
in ref transcript
Immunoglobulin C-2 Type.
smart IG_like 82aa 1e-10
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
pfam fn3 96aa 1e-10
in ref transcript
Fibronectin type III domain.
smart IG_like 83aa 2e-10
in ref transcript
pfam I-set 90aa 2e-10
in ref transcript
Immunoglobulin I-set domain.
pfam fn3 90aa 9e-10
in ref transcript
smart FN3 85aa 1e-09
in ref transcript
Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
pfam fn3 82aa 5e-08
in ref transcript
smart IG_like 85aa 2e-07
in ref transcript
LAMA2
rs.LAMA2.F1 rs.LAMA2.R1 100 112
NCBIGene 36.3 3908
Single exon skipping, size difference: 12
Exclusion in the protein (no frameshift)
Reference transcript: NM_000426
cd LamG 150aa 1e-29
in ref transcript
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
cd LamG 153aa 2e-28
in ref transcript
Changed!
cd LamG 162aa 2e-25
in ref transcript
cd LamG 163aa 4e-22
in ref transcript
cd LamG 160aa 1e-19
in ref transcript
cd EGF_Lam 47aa 1e-08
in ref transcript
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies.
cd EGF_Lam 45aa 1e-07
in ref transcript
cd EGF_Lam 49aa 5e-07
in ref transcript
cd EGF_Lam 53aa 7e-07
in ref transcript
cd EGF_Lam 43aa 2e-06
in ref transcript
cd EGF_Lam 50aa 3e-06
in ref transcript
cd EGF_Lam 49aa 7e-06
in ref transcript
cd EGF_Lam 58aa 1e-05
in ref transcript
cd EGF_Lam 46aa 2e-05
in ref transcript
cd EGF_Lam 53aa 1e-04
in ref transcript
cd EGF_Lam 48aa 0.001
in ref transcript
cd EGF_Lam 42aa 0.002
in ref transcript
cd EGF_Lam 58aa 0.007
in ref transcript
pfam Laminin_N 247aa 1e-88
in ref transcript
Laminin N-terminal (Domain VI).
pfam Laminin_I 266aa 1e-65
in ref transcript
Laminin Domain I. coiled-coil structure. It has been suggested that the domains I and II from laminin A, B1 and B2 may come together to form a triple helical coiled-coil structure.
pfam Laminin_G_1 129aa 8e-40
in ref transcript
Laminin G domain.
smart LamB 133aa 5e-38
in ref transcript
Laminin B domain.
smart LamB 136aa 1e-37
in ref transcript
pfam Laminin_II 137aa 1e-36
in ref transcript
Laminin Domain II. It has been suggested that the domains I and II from laminin A, B1 and B2 may come together to form a triple helical coiled-coil structure.
Changed!
pfam Laminin_G_1 140aa 3e-34
in ref transcript
pfam Laminin_G_1 141aa 4e-30
in ref transcript
smart LamG 130aa 8e-29
in ref transcript
Laminin G domain.
pfam Laminin_G_1 142aa 1e-26
in ref transcript
TIGR SMC_prok_B 521aa 1e-14
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
smart EGF_Lam 45aa 8e-10
in ref transcript
Laminin-type epidermal growth factor-like domai.
smart EGF_Lam 47aa 1e-09
in ref transcript
smart EGF_Lam 44aa 9e-08
in ref transcript
pfam Laminin_EGF 47aa 1e-07
in ref transcript
Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6.
smart EGF_Lam 58aa 3e-07
in ref transcript
smart EGF_Lam 44aa 1e-06
in ref transcript
pfam Laminin_EGF 49aa 1e-06
in ref transcript
smart EGF_Lam 44aa 2e-06
in ref transcript
pfam Laminin_EGF 47aa 5e-06
in ref transcript
pfam Laminin_EGF 41aa 8e-06
in ref transcript
smart EGF_Lam 53aa 3e-05
in ref transcript
pfam Laminin_EGF 56aa 3e-04
in ref transcript
pfam Laminin_EGF 56aa 9e-04
in ref transcript
COG SbcC 608aa 2e-20
in ref transcript
ATPase involved in DNA repair [DNA replication, recombination, and repair].
Changed!
cd LamG 158aa 2e-26
in modified transcript
Changed!
pfam Laminin_G_1 136aa 3e-32
in modified transcript
LAMA4
rs.LAMA4.F1 rs.LAMA4.R1 99 116
NCBIGene 36.3 3910
Alternative 5-prime, size difference: 17
Inclusion in 5'UTR
Reference transcript: NM_001105206
cd LamG 136aa 8e-30
in ref transcript
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
cd LamG 151aa 7e-25
in ref transcript
cd LamG 159aa 6e-17
in ref transcript
cd LamG 140aa 7e-15
in ref transcript
cd LamG 178aa 8e-13
in ref transcript
cd EGF_Lam 52aa 3e-11
in ref transcript
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies.
cd EGF_Lam 44aa 8e-08
in ref transcript
cd EGF_Lam 46aa 5e-07
in ref transcript
cd SPEC 197aa 3e-04
in ref transcript
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here.
pfam Laminin_I 248aa 1e-72
in ref transcript
Laminin Domain I. coiled-coil structure. It has been suggested that the domains I and II from laminin A, B1 and B2 may come together to form a triple helical coiled-coil structure.
smart LamG 134aa 5e-31
in ref transcript
Laminin G domain.
pfam Laminin_II 128aa 3e-30
in ref transcript
Laminin Domain II. It has been suggested that the domains I and II from laminin A, B1 and B2 may come together to form a triple helical coiled-coil structure.
smart LamG 133aa 1e-26
in ref transcript
smart LamG 138aa 6e-19
in ref transcript
smart LamG 155aa 6e-16
in ref transcript
pfam Laminin_G_2 114aa 4e-13
in ref transcript
Laminin G domain. This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.
pfam Laminin_EGF 47aa 6e-11
in ref transcript
Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6.
TIGR SMC_prok_B 556aa 2e-09
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
smart EGF_Lam 37aa 3e-07
in ref transcript
Laminin-type epidermal growth factor-like domai.
pfam Laminin_EGF 41aa 8e-07
in ref transcript
pfam VSP 201aa 8e-04
in ref transcript
Giardia variant-specific surface protein.
COG Smc 304aa 1e-11
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
PRK PRK02224 345aa 2e-06
in ref transcript
chromosome segregation protein; Provisional.
LAMA4
rs.LAMA4.F2 rs.LAMA4.R2 127 148
NCBIGene 36.3 3910
Alternative 5-prime, size difference: 21
Exclusion in the protein (no frameshift)
Reference transcript: NM_001105206
cd LamG 136aa 8e-30
in ref transcript
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
cd LamG 151aa 7e-25
in ref transcript
cd LamG 159aa 6e-17
in ref transcript
cd LamG 140aa 7e-15
in ref transcript
cd LamG 178aa 8e-13
in ref transcript
cd EGF_Lam 52aa 3e-11
in ref transcript
Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies.
cd EGF_Lam 44aa 8e-08
in ref transcript
cd EGF_Lam 46aa 5e-07
in ref transcript
cd SPEC 197aa 3e-04
in ref transcript
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here.
pfam Laminin_I 248aa 1e-72
in ref transcript
Laminin Domain I. coiled-coil structure. It has been suggested that the domains I and II from laminin A, B1 and B2 may come together to form a triple helical coiled-coil structure.
smart LamG 134aa 5e-31
in ref transcript
Laminin G domain.
pfam Laminin_II 128aa 3e-30
in ref transcript
Laminin Domain II. It has been suggested that the domains I and II from laminin A, B1 and B2 may come together to form a triple helical coiled-coil structure.
smart LamG 133aa 1e-26
in ref transcript
smart LamG 138aa 6e-19
in ref transcript
smart LamG 155aa 6e-16
in ref transcript
pfam Laminin_G_2 114aa 4e-13
in ref transcript
Laminin G domain. This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.
pfam Laminin_EGF 47aa 6e-11
in ref transcript
Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6.
TIGR SMC_prok_B 556aa 2e-09
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
smart EGF_Lam 37aa 3e-07
in ref transcript
Laminin-type epidermal growth factor-like domai.
pfam Laminin_EGF 41aa 8e-07
in ref transcript
pfam VSP 201aa 8e-04
in ref transcript
Giardia variant-specific surface protein.
COG Smc 304aa 1e-11
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
PRK PRK02224 345aa 2e-06
in ref transcript
chromosome segregation protein; Provisional.
LARP4
rs.LARP4.F1 rs.LARP4.R1 99 117
NCBIGene 36.3 113251
Alternative 3-prime, size difference: 18
Exclusion in the protein (no frameshift)
Reference transcript: NM_199188
cd RRM 67aa 0.004
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
smart LA 79aa 5e-29
in ref transcript
Domain in the RNA-binding Lupus La protein; unknown function.
LDB3
rs.LDB3.F1 rs.LDB3.R1 100 289
NCBIGene 36.3 11155
Single exon skipping, size difference: 189
Exclusion in the protein (no frameshift)
Reference transcript: NM_007078
cd PDZ_signaling 77aa 3e-16
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
smart PDZ 81aa 5e-17
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
pfam LIM 56aa 8e-12
in ref transcript
LIM domain. This family represents two copies of the LIM structural domain.
pfam LIM 53aa 9e-10
in ref transcript
pfam LIM 56aa 2e-08
in ref transcript
smart ZM 26aa 3e-05
in ref transcript
ZASP-like motif. Short motif (26 amino acids) present in an alpha-actinin-binding protein, ZASP, and similar molecules.
Inclusion in the protein causing a frameshift, Inclusion in the protein (no stop codon or frameshift), Exclusion in the protein causing a frameshift
Reference transcript: NM_007078
cd PDZ_signaling 77aa 3e-16
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
smart PDZ 81aa 5e-17
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
pfam LIM 56aa 8e-12
in ref transcript
LIM domain. This family represents two copies of the LIM structural domain.
pfam LIM 53aa 9e-10
in ref transcript
pfam LIM 56aa 2e-08
in ref transcript
Changed!
smart ZM 26aa 3e-05
in ref transcript
ZASP-like motif. Short motif (26 amino acids) present in an alpha-actinin-binding protein, ZASP, and similar molecules.
Changed!
smart ZM 26aa 3e-06
in modified transcript
LGALS9
rs.LGALS9.F1 rs.LGALS9.R1 111 207
NCBIGene 36.3 3965
Single exon skipping, size difference: 96
Exclusion in the protein (no frameshift)
Reference transcript: NM_009587
cd GLECT 131aa 2e-36
in ref transcript
Galectin/galactose-binding lectin. This domain exclusively binds beta-galactosides, such as lactose, and does not require metal ions for activity. GLECT domains occur as homodimers or tandemly repeated domains. They are developmentally regulated and may be involved in differentiation, cell-cell interaction and cellular regulation.
cd GLECT 128aa 5e-33
in ref transcript
smart GLECT 131aa 4e-40
in ref transcript
Galectin. Galectin - galactose-binding lectin.
pfam Gal-bind_lectin 128aa 3e-35
in ref transcript
Galactoside-binding lectin. This family contains galactoside binding lectins. The family also includes enzymes such as human eosinophil lysophospholipase (EC:3.1.1.5).
LILRB5
rs.LILRB5.F1 rs.LILRB5.R1 125 425
NCBIGene 36.3 10990
Single exon skipping, size difference: 300
Exclusion in the protein (no frameshift)
Reference transcript: NM_001081442
smart IG_like 67aa 0.010
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
LINS1
rs.LINS1.F1 rs.LINS1.R1 199 308
NCBIGene 36.3 55180
Single exon skipping, size difference: 109
Exclusion in 5'UTR
Reference transcript: NM_018148
LOC100129583
rs.LOC100129583.F1 rs.LOC100129583.R1 404 449
NCBIGene 36.3 100129583
Alternative 5-prime, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: XM_001721012
LOC100129720
rs.LOC100129720.F1 rs.LOC100129720.R1 286 391
NCBIGene 36.3 100129720
Alternative 3-prime, size difference: 105
Exclusion in the protein (no frameshift)
Reference transcript: XM_001725772
LOC100130008
rs.LOC100130008.F1 rs.LOC100130008.R1 199 316
NCBIGene 36.3 100130008
Alternative 5-prime, size difference: 117
Exclusion in 5'UTR
Reference transcript: XM_001725170
LOC100130771
rs.LOC100130771.F1 rs.LOC100130771.R1 131 299
NCBIGene 36.3 100130771
Alternative 3-prime, size difference: 168
Exclusion of the stop codon
Reference transcript: XM_001725809
LOC100130890
rs.LOC100130890.F1 rs.LOC100130890.R1 180 266
NCBIGene 36.3 100130890
Single exon skipping, size difference: 86
Exclusion in 5'UTR
Reference transcript: XM_001721588
cd RHOD_HSP67B2 67aa 9e-15
in ref transcript
Member of the Rhodanese Homology Domain superfamily. This CD includes the heat shock protein 67B2 of Drosophila melanogaster and other similar proteins, many of which are uncharacterized.
pfam Rhodanese 63aa 9e-06
in ref transcript
Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases.
PRK PRK07878 58aa 4e-05
in ref transcript
molybdopterin biosynthesis-like protein MoeZ; Validated.
LOC100131626
rs.LOC100131626.F1 rs.LOC100131626.R1 287 340
NCBIGene 36.3 100131626
Alternative 3-prime, size difference: 53
Exclusion in the protein causing a frameshift
Reference transcript: XM_001726222
LOC100132134
rs.LOC100132134.F1 rs.LOC100132134.R1 112 173
NCBIGene 36.3 100132134
Alternative 5-prime, size difference: 61
Exclusion in the protein causing a frameshift
Reference transcript: XM_001719469
Changed!
pfam Glyco_hydro_2 61aa 8e-04
in ref transcript
Glycosyl hydrolases family 2, immunoglobulin-like beta-sandwich domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities.
LOC100132215
rs.LOC100132215.F1 rs.LOC100132215.R1 144 259
NCBIGene 36.3 100132215
Alternative 5-prime and 3-prime, size difference: 115
Exclusion in 5'UTR, Exclusion in 5'UTR
Reference transcript: XM_001725462
LOC100132352
rs.LOC100132352.F1 rs.LOC100132352.R1 138 177
NCBIGene 36.3 100132352
Alternative 3-prime, size difference: 39
Exclusion of the stop codon
Reference transcript: XM_001720717
LOC100132565
rs.LOC100132565.F1 rs.LOC100132565.R1 149 242
NCBIGene 36.3 100132565
Exon skipping and alternative 3-prime or 5-prime, size difference: 93
Inclusion in the protein (no stop codon or frameshift), Inclusion in the protein (no stop codon or frameshift), Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: XM_001724162
TIGR SMC_prok_A 151aa 0.003
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
LOC100132771
rs.LOC100132771.F1 rs.LOC100132771.R1 408 487
NCBIGene 36.3 100132771
Alternative 3-prime, size difference: 79
Inclusion in 3'UTR
Reference transcript: XM_001719907
LOC116412
rs.LOC116412.F1 rs.LOC116412.R1 158 260
NCBIGene 36.3 116412
Alternative 5-prime, size difference: 102
Exclusion in 5'UTR
Reference transcript: XM_375665
pfam zf-C2H2 23aa 0.006
in ref transcript
Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.
COG COG5048 156aa 5e-06
in ref transcript
FOG: Zn-finger [General function prediction only].
COG COG5048 217aa 3e-05
in ref transcript
LOC26010
rs.LOC26010.F1 rs.LOC26010.R1 175 382
NCBIGene 36.3 26010
Single exon skipping, size difference: 207
Exclusion in the protein (no frameshift)
Reference transcript: NM_015535
Changed!
pfam DUF1387 310aa 1e-124
in ref transcript
Protein of unknown function (DUF1387). This family represents a conserved region approximately 300 residues long within a number of hypothetical proteins of unknown function that seem to be restricted to mammals.
Changed!
pfam DUF1387 163aa 1e-71
in modified transcript
Changed!
pfam DUF1387 90aa 2e-19
in modified transcript
LOC284701
rs.LOC284701.F1 rs.LOC284701.R1 128 350
NCBIGene 36.3 284701
Alternative 5-prime, size difference: 222
Exclusion in 5'UTR
Reference transcript: XM_001723603
LOC391322
rs.LOC391322.F1 rs.LOC391322.R1 227 266
NCBIGene 36.3 391322
Alternative 5-prime, size difference: 39
Exclusion in the protein (no frameshift)
Reference transcript: XM_001726974
Changed!
pfam MIF 108aa 5e-34
in ref transcript
Macrophage migration inhibitory factor (MIF).
Changed!
pfam MIF 95aa 8e-37
in modified transcript
LOC439992
rs.LOC439992.F1 rs.LOC439992.R1 262 412
NCBIGene 36.3 439992
Alternative 5-prime and 3-prime, size difference: 150
Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: XM_926528
Changed!
pfam Ribosomal_S3Ae 199aa 4e-89
in ref transcript
Ribosomal S3Ae family.
Changed!
COG RPS1A 217aa 2e-48
in ref transcript
Ribosomal protein S3AE [Translation, ribosomal structure and biogenesis].
Changed!
pfam Ribosomal_S3Ae 149aa 2e-55
in modified transcript
Changed!
COG RPS1A 167aa 6e-30
in modified transcript
LOC440434
rs.LOC440434.F1 rs.LOC440434.R1 111 191
NCBIGene 36.3 440434
Single exon skipping, size difference: 80
Inclusion in the protein causing a frameshift
Reference transcript: XM_001725426
Changed!
pfam Peptidase_M1 285aa 3e-88
in ref transcript
Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase activity.
Changed!
COG PepN 273aa 4e-44
in ref transcript
Aminopeptidase N [Amino acid transport and metabolism].
Changed!
pfam Peptidase_M1 93aa 5e-10
in modified transcript
LOC51149
rs.LOC51149.F1 rs.LOC51149.R1 146 311
NCBIGene 36.3 51149
Multiple exon skipping, size difference: 165
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_016175
LOC642393
rs.LOC642393.F1 rs.LOC642393.R1 212 305
NCBIGene 36.3 642393
Alternative 3-prime, size difference: 93
Inclusion in the protein causing a new stop codon
Reference transcript: XM_001724428
Changed!
TIGR rplT_bact 73aa 4e-11
in ref transcript
This protein binds directly to 23s ribosomal RNA and is necessary for the in vitro assembly process of the 50s ribosomal subunit. It is not involved in the protein synthesizing functions of that subunit. GO process changed accordingly (SS 5/09/03).
Changed!
PRK rplT 69aa 1e-11
in ref transcript
50S ribosomal protein L20; Provisional.
Changed!
TIGR rplT_bact 75aa 3e-12
in modified transcript
Changed!
PRK rplT 71aa 1e-12
in modified transcript
LOC642393
rs.LOC642393.F2 rs.LOC642393.R2 144 489
NCBIGene 36.3 642393
Alternative 3-prime, size difference: 345
Inclusion in the protein causing a new stop codon
Reference transcript: XM_001724426
Changed!
TIGR rplT_bact 98aa 2e-19
in ref transcript
This protein binds directly to 23s ribosomal RNA and is necessary for the in vitro assembly process of the 50s ribosomal subunit. It is not involved in the protein synthesizing functions of that subunit. GO process changed accordingly (SS 5/09/03).
Changed!
PRK rplT 97aa 2e-22
in ref transcript
50S ribosomal protein L20; Provisional.
Changed!
TIGR rplT_bact 74aa 5e-12
in modified transcript
Changed!
PRK rplT 70aa 2e-12
in modified transcript
LOC643680
rs.LOC643680.F1 rs.LOC643680.R1 227 427
NCBIGene 36.3 643680
Alternative 3-prime, size difference: 200
Exclusion of the stop codon
Reference transcript: XM_931901
pfam CD20 40aa 2e-04
in ref transcript
CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a single beta subunit, and two disulfide-linked gamma subunits. The alpha subunit of Fc epsilon RI and most Fc receptors are homologous members of the Ig superfamily. By contrast, the beta and gamma subunits from Fc epsilon RI are not homologous to the Ig superfamily. Both molecules have four putative transmembrane segments and a probably topology where both amino- and carboxy termini protrude into the cytoplasm.
LOC653117
rs.LOC653117.F1 rs.LOC653117.R1 101 449
NCBIGene 36.3 653117
Single exon skipping, size difference: 348
Exclusion in the protein (no frameshift)
Reference transcript: XM_001722839
smart SPRY 123aa 3e-16
in ref transcript
Domain in SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
smart PRY 50aa 2e-10
in ref transcript
associated with SPRY domains.
Changed!
pfam V-set 106aa 6e-07
in ref transcript
Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
LOC653188
rs.LOC653188.F1 rs.LOC653188.R1 250 311
NCBIGene 36.3 653188
Alternative 5-prime, size difference: 61
Exclusion in the protein causing a frameshift
Reference transcript: XM_001721330
Changed!
pfam Glyco_hydro_2 61aa 9e-05
in ref transcript
Glycosyl hydrolases family 2, immunoglobulin-like beta-sandwich domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities.
Changed!
PRK PRK10150 62aa 0.006
in ref transcript
beta-D-glucuronidase; Provisional.
LOC653550
rs.LOC653550.F1 rs.LOC653550.R1 191 315
NCBIGene 36.3 653550
Alternative 5-prime, size difference: 124
Exclusion in the protein causing a frameshift
Reference transcript: XM_001725815
LOC727737
rs.LOC727737.F1 rs.LOC727737.R1 99 115
NCBIGene 36.3 727737
Alternative 5-prime, size difference: 16
Exclusion in the protein causing a frameshift
Reference transcript: XM_001126088
pfam Peptidase_C54 301aa 1e-102
in ref transcript
Peptidase family C54.
LOC727761
rs.LOC727761.F1 rs.LOC727761.R1 114 243
NCBIGene 36.3 727761
Exon skipping and alternative 3-prime or 5-prime, size difference: 129
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: XM_001126211
Changed!
cd TMPK 185aa 3e-23
in ref transcript
Thymidine monophosphate kinase (TMPK), also known as thymidylate kinase, catalyzes the phosphorylation of thymidine monophosphate (TMP) to thymidine diphosphate (TDP) utilizing ATP as its preferred phophoryl donor. TMPK represents the rate-limiting step in either de novo or salvage biosynthesis of thymidine triphosphate (TTP).
Changed!
pfam Thymidylate_kin 181aa 3e-54
in ref transcript
Thymidylate kinase.
Changed!
COG Tmk 188aa 2e-32
in ref transcript
Thymidylate kinase [Nucleotide transport and metabolism].
Changed!
cd TMPK 142aa 2e-11
in modified transcript
Changed!
pfam Thymidylate_kin 138aa 2e-31
in modified transcript
Changed!
COG Tmk 145aa 4e-16
in modified transcript
LOC727761
rs.LOC727761.F2 rs.LOC727761.R2 111 183
NCBIGene 36.3 727761
Alternative 3-prime, size difference: 72
Exclusion in the protein (no frameshift)
Reference transcript: XM_001126211
Changed!
cd TMPK 185aa 3e-23
in ref transcript
Thymidine monophosphate kinase (TMPK), also known as thymidylate kinase, catalyzes the phosphorylation of thymidine monophosphate (TMP) to thymidine diphosphate (TDP) utilizing ATP as its preferred phophoryl donor. TMPK represents the rate-limiting step in either de novo or salvage biosynthesis of thymidine triphosphate (TTP).
Changed!
pfam Thymidylate_kin 181aa 3e-54
in ref transcript
Thymidylate kinase.
Changed!
COG Tmk 188aa 2e-32
in ref transcript
Thymidylate kinase [Nucleotide transport and metabolism].
Changed!
cd TMPK 161aa 4e-17
in modified transcript
Changed!
pfam Thymidylate_kin 157aa 7e-41
in modified transcript
Changed!
COG Tmk 164aa 2e-23
in modified transcript
LOC729870
rs.LOC729870.F1 rs.LOC729870.R1 253 392
NCBIGene 36.3 729870
Alternative 3-prime, size difference: 139
Inclusion in 5'UTR
Reference transcript: XM_001725930
LOC92482
rs.LOC92482.F1 rs.LOC92482.R1 147 304
NCBIGene 36.3 92482
Single exon skipping, size difference: 157
Exclusion in the protein causing a frameshift
Reference transcript: XM_001726024
LPHN1
rs.LPHN1.F1 rs.LPHN1.R1 100 115
NCBIGene 36.3 22859
Single exon skipping, size difference: 15
Exclusion in the protein (no frameshift)
Reference transcript: NM_001008701
pfam OLF 257aa 1e-105
in ref transcript
Olfactomedin-like domain.
pfam Latrophilin 362aa 1e-100
in ref transcript
Latrophilin Cytoplasmic C-terminal region. This family consists of the cytoplasmic C-terminal region in latrophilin. Latrophilin is a synaptic Ca2+ independent alpha- latrotoxin (LTX) receptor and is a novel member of the secretin family of G-protein coupled receptors that are involved in secretion. Latrophilin mRNA is present only in neuronal tissue. Lactrophillin interacts with G-alpha O.
pfam 7tm_2 247aa 5e-74
in ref transcript
7 transmembrane receptor (Secretin family). This family is known as Family B, the secretin-receptor family or family 2 of the G-protein-coupled receptors (GCPRs).They have been described in many animal species, but not in plants, fungi or prokaryotes. Three distinct sub-families are recognised. Subfamily B1 contains classical hormone receptors, such as receptors for secretin and glucagon, that are all involved in cAMP-mediated signalling pathways. Subfamily B2 contains receptors with long extracellular N-termini, such as the leukocyte cell-surface antigen CD97; calcium-independent receptors for latrotoxin, and brain-specific angiogenesis inhibitors amongst others. Subfamily B3 includes Methuselah and other Drosophila proteins. Other than the typical seven-transmembrane region, characteristic structural features include an amino-terminal extracellular domain involved in ligand binding, and an intracellular loop (IC3) required for specific G-protein coupling.
pfam Gal_Lectin 81aa 2e-25
in ref transcript
Galactose binding lectin domain.
smart GPS 53aa 3e-15
in ref transcript
G-protein-coupled receptor proteolytic site domain. Present in latrophilin/CL-1, sea urchin REJ and polycystin.
smart HormR 64aa 7e-14
in ref transcript
Domain present in hormone receptors.
LRRCC1
rs.LRRCC1.F1 rs.LRRCC1.R1 122 328
NCBIGene 36.3 85444
Single exon skipping, size difference: 206
Exclusion in the protein causing a frameshift
Reference transcript: NM_033402
Changed!
pfam SMC_N 259aa 3e-05
in ref transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Changed!
TIGR SMC_prok_B 251aa 0.002
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
Changed!
COG Smc 351aa 0.002
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
LRRFIP2
rs.LRRFIP2.F1 rs.LRRFIP2.R1 142 337
NCBIGene 36.3 9209
Multiple exon skipping, size difference: 195
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_006309
Changed!
pfam DUF2051 354aa 1e-54
in ref transcript
Double stranded RNA binding protein (DUF2051). This is a novel protein identified as interacting with the leucine-rich repeat domain of human flightless-I, FliI protein.
Changed!
COG SbcC 333aa 3e-07
in ref transcript
ATPase involved in DNA repair [DNA replication, recombination, and repair].
Changed!
pfam DUF2051 289aa 1e-80
in modified transcript
Changed!
COG Smc 255aa 2e-06
in modified transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
LRRN3
rs.LRRN3.F1 rs.LRRN3.R1 267 374
NCBIGene 36.3 54674
Single exon skipping, size difference: 107
Exclusion in 5'UTR
Reference transcript: NM_001099660
cd IGcam 89aa 2e-10
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd FN3 68aa 0.006
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
pfam I-set 88aa 9e-11
in ref transcript
Immunoglobulin I-set domain.
pfam fn3 68aa 2e-05
in ref transcript
Fibronectin type III domain.
smart LRRCT 52aa 2e-04
in ref transcript
Leucine rich repeat C-terminal domain.
smart LRRNT 45aa 0.009
in ref transcript
Leucine rich repeat N-terminal domain.
COG COG4886 201aa 4e-05
in ref transcript
Leucine-rich repeat (LRR) protein [Function unknown].
LSP1
rs.LSP1.F1 rs.LSP1.R1 111 550
NCBIGene 36.3 4046
Alternative 5-prime, size difference: 439
Exclusion in 5'UTR
Reference transcript: NM_001013254
pfam Caldesmon 92aa 4e-04
in ref transcript
Caldesmon.
LTBP4
rs.LTBP4.F1 rs.LTBP4.R1 100 119
NCBIGene 36.3 8425
Alternative 3-prime, size difference: 19
Inclusion in the protein causing a frameshift
Reference transcript: NM_001042544
Changed!
cd EGF_CA 34aa 0.003
in ref transcript
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Changed!
cd EGF_CA 33aa 0.007
in ref transcript
Changed!
smart EGF_CA 37aa 7e-05
in ref transcript
Calcium-binding EGF-like domain.
Changed!
smart EGF_CA 34aa 1e-04
in ref transcript
Changed!
smart EGF_CA 36aa 4e-04
in ref transcript
Changed!
pfam TB 41aa 6e-04
in ref transcript
TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin.
Changed!
smart EGF_CA 31aa 0.001
in ref transcript
Changed!
smart EGF_CA 38aa 0.004
in ref transcript
Changed!
pfam EGF_CA 41aa 0.004
in ref transcript
Calcium binding EGF domain.
Changed!
smart EGF_CA 37aa 0.008
in ref transcript
MAGEA6
rs.MAGEA6.F1 rs.MAGEA6.R1 123 187
NCBIGene 36.3 4105
Alternative 3-prime, size difference: 64
Inclusion in 5'UTR
Reference transcript: NM_005363
pfam MAGE 171aa 5e-69
in ref transcript
MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.
MAGEC3
rs.MAGEC3.F1 rs.MAGEC3.R1 143 536
NCBIGene 36.3 139081
Exon skipping and alternative 3-prime or 5-prime, size difference: 393
Inclusion in the protein causing a new stop codon, Inclusion in the protein causing a frameshift, Inclusion in the protein (no stop codon or frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_138702
Changed!
pfam MAGE 114aa 2e-35
in ref transcript
MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.
pfam MAGE 113aa 4e-35
in ref transcript
MAGED4B
rs.MAGED4B.F1 rs.MAGED4B.R1 102 114
NCBIGene 36.3 81557
Alternative 5-prime, size difference: 12
Inclusion in the protein causing a new stop codon
Reference transcript: NM_177535
pfam MAGE 170aa 8e-75
in ref transcript
MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.
PRK PRK07003 168aa 0.002
in ref transcript
DNA polymerase III subunits gamma and tau; Validated.
MAGIX
rs.MAGIX.F1 rs.MAGIX.R1 160 388
NCBIGene 36.3 79917
Single exon skipping, size difference: 228
Exclusion in the protein (no frameshift)
Reference transcript: NM_024859
Changed!
cd PDZ_signaling 59aa 7e-10
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
Changed!
smart PDZ 56aa 2e-11
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
Changed!
cd PDZ_signaling 38aa 3e-06
in modified transcript
Changed!
smart PDZ 40aa 7e-08
in modified transcript
MAGIX
rs.MAGIX.F2 rs.MAGIX.R2 129 174
NCBIGene 36.3 79917
Alternative 3-prime, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_024859
cd PDZ_signaling 59aa 7e-10
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
smart PDZ 56aa 2e-11
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
smart PDZ 56aa 2e-11
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
Changed!
cd PKc_MKK5 279aa 1e-172
in ref transcript
Protein kinases (PKs), MAP kinase kinase 5 (MKK5) subfamily, catalytic (c) domain. PKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine or tyrosine residues on protein substrates. The MKK5 subfamily is part of a larger superfamily that includes the catalytic domains of other protein serine/threonine kinases, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. The mitogen-activated protein (MAP) kinase signaling pathways are important mediators of cellular responses to extracellular signals. The pathways involve a triple kinase core cascade comprising of the MAP kinase (MAPK), which is phosphorylated and activated by a MAPK kinase (MAPKK or MKK), which itself is phosphorylated and activated by a MAPK kinase kinase (MAPKKK or MKKK). MKK5, also referred to as MEK5, is a dual-specificity PK that phosphorylates its downstream target, extracellular signal-regulated kinase 5 (ERK5), on specific threonine and tyrosine residues. MKK5 is activated by MEKK2 and MEKK3 in response to mitogenic and stress stimuli. The ERK5 cascade promotes cell proliferation, differentiation, neuronal survival, and neuroprotection. This cascade plays an essential role in heart development. Mice deficient in either ERK5 or MKK5 die around embryonic day 10 due to cardiovascular defects including underdevelopment of the myocardium. In addition, MKK5 is associated with metastasis and unfavorable prognosis in prostate cancer.
cd PB1_Map2k5 91aa 3e-47
in ref transcript
PB1 domain is essential part of the mitogen-activated protein kinase kinase 5 (Map2k5, alias MEK5) one of the key member of the signaling kinases cascade which involved in angiogenesis and early cardiovascular development. The PB1 domain of Map2k5 interacts with the PB1 domain of another members of kinase cascade MEKK2 (or MEKK3). A canonical PB1-PB1 interaction, involving heterodimerization of two PB1 domain, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. The Map2k5 protein contains a type I PB1 domain.
Changed!
smart S_TKc 242aa 1e-62
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
pfam PB1 67aa 7e-10
in ref transcript
PB1 domain.
Changed!
COG SPS1 255aa 6e-32
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
Changed!
cd PKc_MKK5 269aa 1e-163
in modified transcript
Changed!
smart S_TKc 232aa 1e-58
in modified transcript
Changed!
COG SPS1 245aa 3e-29
in modified transcript
MAP4K1
rs.MAP4K1.F1 rs.MAP4K1.R1 154 252
NCBIGene 36.3 11184
Single exon skipping, size difference: 98
Exclusion in the protein causing a frameshift
Reference transcript: NM_007181
cd STKc_MAP4K3_like 260aa 1e-149
in ref transcript
Serine/threonine kinases (STKs), mitogen-activated protein kinase (MAPK) kinase kinase kinase 3 (MAPKKKK3 or MAP4K3)-like subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The MAP4K3-like subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. This subfamily includes MAP4K3, MAP4K1, MAP4K2, MAP4K5, and related proteins. Vertebrate members contain an N-terminal catalytic domain and a C-terminal citron homology (CNH) regulatory domain, similar to MAP4K4/6. MAP4Ks are involved in some MAPK signaling pathways that are important in mediating cellular responses to extracellular signals by activating a MAPK kinase kinase (MAPKKK or MAP3K or MKKK). Each MAPK cascade is activated either by a small GTP-binding protein or by an adaptor protein, which transmits the signal either directly to a MAP3K to start the triple kinase core cascade or indirectly through a mediator kinase, a MAP4K. MAP4K1, also called haematopoietic progenitor kinase 1 (HPK1), is a hematopoietic-specific STK involved in many cellular signaling cascades including MAPK, antigen receptor, apoptosis, growth factor, and cytokine signaling. It participates in the regulation of T cell receptor signaling and T cell-mediated immune responses. MAP4K2 was referred to as germinal center (GC) kinase because of its preferred location in GC B cells. MAP4K3 plays a role in the nutrient-responsive pathway of mTOR (mammalian target of rapamycin) signaling. It is required in the activation of S6 kinase by amino acids and for the phosphorylation of the mTOR-regulated inhibitor of eukaryotic initiation factor 4E. MAP4K5, also called germinal center kinase-related enzyme (GCKR), has been shown to activate the MAPK c-Jun N-terminal kinase (JNK).
Changed!
smart CNH 308aa 9e-84
in ref transcript
Domain found in NIK1-like kinases, mouse citron and yeast ROM1, ROM2. Unpublished observations.
smart S_TKc 248aa 6e-69
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
COG SPS1 354aa 8e-37
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
Changed!
smart CNH 308aa 5e-85
in modified transcript
MAPK10
rs.MAPK10.F1 rs.MAPK10.R1 148 175
NCBIGene 36.3 5602
Alternative 5-prime, size difference: 27
Inclusion in the protein causing a new stop codon
Reference transcript: NM_138982
Changed!
cd S_TKc 297aa 1e-66
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
Changed!
smart S_TKc 284aa 1e-67
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
Changed!
PTZ PTZ00024 292aa 1e-34
in ref transcript
cyclin-dependent protein kinase; Provisional.
MAPK14
rs.MAPK14.F1 rs.MAPK14.R1 159 238
NCBIGene 36.3 1432
Single exon skipping, size difference: 79
Exclusion in the protein causing a frameshift
Reference transcript: NM_139012
Changed!
cd S_TKc 276aa 7e-66
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
Changed!
smart S_TKc 265aa 4e-64
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
Changed!
PTZ PTZ00024 269aa 5e-38
in ref transcript
cyclin-dependent protein kinase; Provisional.
Changed!
cd S_TKc 220aa 1e-62
in modified transcript
Changed!
smart S_TKc 209aa 2e-60
in modified transcript
Changed!
PTZ PTZ00024 213aa 3e-37
in modified transcript
MAPK14
rs.MAPK14.F2 rs.MAPK14.R2 101 180
NCBIGene 36.3 1432
Single exon skipping, size difference: 79
Exclusion in the protein causing a frameshift
Reference transcript: NM_139012
Changed!
cd S_TKc 276aa 7e-66
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
Changed!
smart S_TKc 265aa 4e-64
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
Changed!
PTZ PTZ00024 269aa 5e-38
in ref transcript
cyclin-dependent protein kinase; Provisional.
Changed!
cd S_TKc 220aa 1e-62
in modified transcript
Changed!
smart S_TKc 209aa 2e-60
in modified transcript
Changed!
PTZ PTZ00024 213aa 3e-37
in modified transcript
MAPK3
rs.MAPK3.F1 rs.MAPK3.R1 155 287
NCBIGene 36.3 5595
Single exon skipping, size difference: 132
Exclusion in the protein (no frameshift)
Reference transcript: NM_002746
Changed!
cd S_TKc 290aa 4e-68
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
Changed!
smart S_TKc 279aa 1e-69
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
Changed!
PTZ PTZ00024 300aa 5e-44
in ref transcript
cyclin-dependent protein kinase; Provisional.
Changed!
cd S_TKc 246aa 9e-68
in modified transcript
Changed!
smart S_TKc 235aa 1e-69
in modified transcript
Changed!
PTZ PTZ00024 256aa 3e-37
in modified transcript
MATK
rs.MATK.F1 rs.MATK.R1 316 539
NCBIGene 36.3 4145
Single exon skipping, size difference: 223
Exclusion of the protein initiation site
Reference transcript: NM_139355
cd PTKc_Chk 254aa 1e-146
in ref transcript
Catalytic Domain of the Protein Tyrosine Kinase, Csk homologous kinase. Protein Tyrosine Kinase (PTK) family; Csk homologous kinase (Chk); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. Csk subfamily kinases are cytoplasmic (or nonreceptor) tyr kinases containing the Src homology domains, SH3 and SH2, N-terminal to the catalytic tyr kinase domain. They negatively regulate the activity of Src kinases that are anchored to the plasma membrane. Chk is also referred to as megakaryocyte-associated tyrosine kinase (Matk). To inhibit Src kinases, Chk is translocated to the membrane via binding to specific transmembrane proteins, G-proteins, or adaptor proteins near the membrane. Chk inhibit Src kinases using a noncatalytic mechanism by simply binding to them. As a negative regulator of Src kinases, Chk may play important roles in cell proliferation, survival, and differentiation, and consequently, in cancer development and progression. Chk is expressed in brain and hematopoietic cells. Studies in mice reveal that Chk is not functionally redundant with Csk and that it plays an important role as a regulator of immune responses. Chk also plays a role in neural differentiation in a manner independent of Src by enhancing Mapk activation via Ras-mediated signaling.
cd SH2 90aa 1e-19
in ref transcript
Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
cd SH3 43aa 0.009
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
Src homology 2 domains. Src homology 2 domains bind phosphotyrosine-containing polypeptides via 2 surface pockets. Specificity is provided via interaction with residues that are distinct from the phosphotyrosine. Only a single occurrence of a SH2 domain has been found in S. cerevisiae.
smart SH3 45aa 1e-04
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
COG SPS1 218aa 4e-22
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
Changed!
pfam Myelin_MBP 154aa 1e-59
in ref transcript
Myelin basic protein.
Changed!
pfam Myelin_MBP 143aa 8e-52
in modified transcript
MCL1
rs.MCL1.F1 rs.MCL1.R1 133 381
NCBIGene 36.3 4170
Single exon skipping, size difference: 248
Exclusion in the protein causing a frameshift
Reference transcript: NM_021960
Changed!
cd Bcl-2_like 144aa 7e-37
in ref transcript
Apoptosis regulator proteins of the Bcl-2 family, named after B-cell lymphoma 2. This alignment model spans what have been described as Bcl-2 homology regions BH1, BH2, BH3, and BH4. Many members of this family have an additional C-terminal transmembrane segment. Some homologous proteins, which are not included in this model, may miss either the BH4 (Bax, Bak) or the BH2 (Bcl-X(S)) region, and some appear to only share the BH3 region (Bik, Bim, Bad, Bid, Egl-1). This family is involved in the regulation of the outer mitochondrial membrane's permeability and in promoting or preventing the release of apoptogenic factors, which in turn may trigger apoptosis by activating caspases. Bcl-2 and the closely related Bcl-X(L) are anti-apoptotic key regulators of programmed cell death. They are assumed to function via heterodimeric protein-protein interactions, binding pro-apoptotic proteins such as Bad (BCL2-antagonist of cell death), Bid, and Bim, by specifically interacting with their BH3 regions. Interfering with this heterodimeric interaction via small-molecule inhibitors may prove effective in targeting various cancers. This family also includes the Caenorhabditis elegans Bcl-2 homolog CED-9, which binds to CED-4, the C. Elegans homolog of mammalian Apaf-1. Apaf-1, however, does not seem to be inhibited by Bcl-2 directly.
Changed!
pfam Bcl-2 100aa 5e-32
in ref transcript
Apoptosis regulator proteins, Bcl-2 family.
Changed!
cd Bcl-2_like 56aa 2e-04
in modified transcript
MED24
rs.MED24.F1 rs.MED24.R1 124 146
NCBIGene 36.3 9862
Alternative 3-prime, size difference: 22
Inclusion in 5'UTR
Reference transcript: NM_014815
MED24
rs.MED24.F2 rs.MED24.R2 108 147
NCBIGene 36.3 9862
Single exon skipping, size difference: 39
Exclusion in the protein (no frameshift)
Reference transcript: NM_014815
MEN1
rs.MEN1.F1 rs.MEN1.R1 103 118
NCBIGene 36.3 4221
Alternative 5-prime, size difference: 15
Exclusion in the protein (no frameshift)
Reference transcript: NM_130803
Changed!
pfam Menin 615aa 0.0
in ref transcript
Menin. MEN1, the gene responsible for multiple endocrine neoplasia type 1, is a tumour suppressor gene that encodes a protein called Menin which may be an atypical GTPase stimulated by nm23.
Changed!
pfam Menin 610aa 0.0
in modified transcript
MGC13057
rs.MGC13057.F1 rs.MGC13057.R1 139 158
NCBIGene 36.3 84281
Alternative 5-prime, size difference: 19
Exclusion in 5'UTR
Reference transcript: NM_032321
MGC45438
rs.MGC45438.F1 rs.MGC45438.R1 102 460
NCBIGene 36.3 146556
Alternative 3-prime, size difference: 358
Exclusion of the stop codon
Reference transcript: NM_152459
MID1IP1
rs.MID1IP1.F1 rs.MID1IP1.R1 143 247
NCBIGene 36.3 58526
Single exon skipping, size difference: 104
Exclusion in 5'UTR
Reference transcript: NM_001098790
pfam Spot_14 182aa 9e-66
in ref transcript
Thyroid hormone-inducible hepatic protein Spot 14. This family consists of several thyroid hormone-inducible hepatic protein (Spot 14 or S14) sequences. Mainly expressed in tissues that synthesise triglycerides, the mRNA coding for Spot 14 has been shown to be increased in rat liver by insulin, dietary carbohydrates, glucose in hepatocyte culture medium, as well as thyroid hormone. In contrast, dietary fats and polyunsaturated fatty acids, have been shown to decrease the amount of Spot 14 mRNA, while an elevated level of cAMP acts as a dominant negative factor. In addition, liver-specific factors or chromatin organisation of the gene have been shown to contribute to the regulation of its expression. Spot 14 protein is thought to be required for induction of hepatic lipogenesis.
MIER1
rs.MIER1.F1 rs.MIER1.R1 115 189
NCBIGene 36.3 57708
Single exon skipping, size difference: 74
Inclusion in the protein causing a frameshift
Reference transcript: NM_001077700
Changed!
cd SANT 44aa 7e-04
in ref transcript
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.
Changed!
pfam ELM2 54aa 2e-09
in ref transcript
ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. ELM2 is also found associated with an ARID DNA binding domain pfam01388 in a protein from Arabidopsis thaliana. This suggests that ELM2 may also be involved in DNA binding, or perhaps is a protein-protein interaction domain.
Changed!
smart SANT 46aa 6e-05
in ref transcript
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains.
MITF
rs.MITF.F1 rs.MITF.R1 97 115
NCBIGene 36.3 4286
Alternative 3-prime, size difference: 18
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_198159
cd HLH 61aa 4e-11
in ref transcript
Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
pfam HLH 54aa 3e-11
in ref transcript
Helix-loop-helix DNA-binding domain.
MIZF
rs.MIZF.F1 rs.MIZF.R1 336 453
NCBIGene 36.3 25988
Single exon skipping, size difference: 117
Exclusion in 5'UTR
Reference transcript: NM_015517
COG COG5236 133aa 0.002
in ref transcript
Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only].
MLLT4
rs.MLLT4.F1 rs.MLLT4.R1 187 226
NCBIGene 36.3 4301
Exon skipping and alternative 3-prime or 5-prime, size difference: 39
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_001040001
cd AF6_RA_repeat1 112aa 3e-53
in ref transcript
The AF-6 protein (also known as afadin and canoe) is a multidomain cell junction protein that contains two N-terminal Ras-associating (RA) domains in addition to FHA (forkhead-associated), DIL (class V myosin homology region), and PDZ domains and a proline-rich region. AF6 acts downstream of the Egfr (Epidermal Growth Factor-receptor)/Ras signalling pathway and provides a link from Egfr to cytoskeletal elements.
cd AF6_RA_repeat2 100aa 2e-42
in ref transcript
The AF-6 protein (also known as afadin and canoe) is a multidomain cell junction protein that contains two N-terminal Ras-associating (RA) domains in addition to FHA (forkhead-associated), DIL (class V myosin homology region), and PDZ domains and a proline-rich region. AF6 acts downstream of the Egfr (Epidermal Growth Factor-receptor)/Ras signalling pathway and provides a link from Egfr to cytoskeletal elements.
cd PDZ_signaling 84aa 3e-12
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
cd FHA 90aa 2e-05
in ref transcript
Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain. FHA domains may bind phosphothreonine, phosphoserine and sometimes phosphotyrosine. In eukaryotes, many FHA domain-containing proteins localize to the nucleus, where they participate in establishing or maintaining cell cycle checkpoints, DNA repair, or transcriptional regulation. Members of the FHA family include: Dun1, Rad53, Cds1, Mek1, KAPP(kinase-associated protein phosphatase),and Ki-67 (a human nuclear protein related to cell proliferation).
pfam DIL 107aa 5e-26
in ref transcript
DIL domain. The DIL domain has no known function.
smart RA 94aa 7e-17
in ref transcript
Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS); putative RasGTP effectors in other cases. Kalhammer et al. have shown that not all RA domains bind RasGTP. Predicted structure similar to that determined, and that of the RasGTP-binding domain of Raf kinase. Predicted RA domains in PLC210 and nore1 found to bind RasGTP. Included outliers (Grb7, Grb14, adenylyl cyclases etc.).
smart PDZ 88aa 4e-14
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
smart RA 100aa 2e-13
in ref transcript
smart FHA 52aa 4e-08
in ref transcript
Forkhead associated domain. Found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain.
Myelin-associated oligodendrocytic basic protein (MOBP). MOBP is abundantly expressed in central nervous system myelin, and shares several characteristics with myelin basic protein (MBP), in terms of regional distribution and function. MOBP has been shown to be essential for normal arrangement of the radial component in central nervous system myelin.
MMD2
rs.MMD2.F1 rs.MMD2.R1 109 181
NCBIGene 36.3 221938
Alternative 3-prime, size difference: 72
Exclusion in the protein (no frameshift)
Reference transcript: NM_001100600
Changed!
TIGR hlyIII 225aa 1e-47
in ref transcript
This family includes proteins from pathogenic and non-pathogenic bacteria, Homo sapiens and Drosophila. In Bacillus cereus, a pathogen, it has been show to function as a channel-forming cytolysin. The human protein is expressed preferentially in mature macrophages, consistent with a role cytolytic role.
Changed!
COG COG1272 243aa 8e-22
in ref transcript
Predicted membrane protein, hemolysin III homolog [General function prediction only].
Changed!
TIGR hlyIII 201aa 2e-50
in modified transcript
Changed!
COG COG1272 219aa 6e-24
in modified transcript
MOBKL3
rs.MOBKL3.F1 rs.MOBKL3.R1 123 186
NCBIGene 36.3 25843
Single exon skipping, size difference: 63
Exclusion in the protein (no frameshift)
Reference transcript: NM_015387
pfam Mob1_phocein 167aa 3e-61
in ref transcript
Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known structural motifs; however MOB1 is a member of a conserved gene family and shares sequence similarity with a nonessential yeast gene, MOB2. Mob1 is a phosphoprotein in vivo and a substrate for the Mps1p kinase in vitro. Conditional alleles of MOB1 cause a late nuclear division arrest at restrictive temperature. This family also includes phocein, a rat protein that by yeast two hybrid interacts with striatin.
MOCS1
rs.MOCS1.F1 rs.MOCS1.R1 100 115
NCBIGene 36.3 4337
Alternative 5-prime, size difference: 15
Inclusion in the protein causing a new stop codon
Reference transcript: NM_005942
Changed!
cd MoaC_PE 140aa 2e-40
in ref transcript
MoaC family, prokaryotic and eukaryotic. Members of this family are involved in molybdenum cofactor (Moco) biosynthesis, an essential cofactor of a diverse group of redox enzymes. MoaC, a small hexameric protein, converts, together with MoaA, a guanosine derivative to the precursor Z by inserting the carbon-8 of the purine between the 2' and 3' ribose carbon atoms, which is the first of three phases of Moco biosynthesis.
cd Radical_SAM 198aa 2e-19
in ref transcript
Radical SAM superfamily. Enzymes of this family generate radicals by combining a 4Fe-4S cluster and S-adenosylmethionine (SAM) in close proximity. They are characterized by a conserved CxxxCxxC motif, which coordinates the conserved iron-sulfur cluster. Mechanistically, they share the transfer of a single electron from the iron-sulfur cluster to SAM, which leads to its reductive cleavage to methionine and a 5'-deoxyadenosyl radical, which, in turn, abstracts a hydrogen from the appropriately positioned carbon atom. Depending on the enzyme, SAM is consumed during this process or it is restored and reused. Radical SAM enzymes catalyze steps in metabolism, DNA repair, the biosynthesis of vitamins and coenzymes, and the biosynthesis of many antibiotics. Examples are biotin synthase (BioB), lipoyl synthase (LipA), pyruvate formate-lyase (PFL), coproporphyrinogen oxidase (HemN), lysine 2,3-aminomutase (LAM), anaerobic ribonucleotide reductase (ARR), and MoaA, an enzyme of the biosynthesis of molybdopterin.
Changed!
TIGR moaA 306aa 1e-114
in ref transcript
The model for this family describes molybdenum cofactor biosynthesis protein A, or MoaA, as found in bacteria. It does not include the family of probable functional equivalent proteins from the archaea. MoaA works together with MoaC to synthesize precursor Z from guanine.
Changed!
pfam MoaC 136aa 2e-43
in ref transcript
MoaC family. Members of this family are involved in molybdenum cofactor biosynthesis. However their molecular function is not known.
Changed!
PRK moaA 327aa 1e-116
in ref transcript
molybdenum cofactor biosynthesis protein A; Reviewed.
Changed!
PRK moaC 157aa 5e-48
in ref transcript
molybdenum cofactor biosynthesis protein C; Provisional.
Changed!
TIGR moaA 325aa 1e-114
in modified transcript
Changed!
PRK moaA 332aa 1e-117
in modified transcript
MORC4
rs.MORC4.F1 rs.MORC4.R1 200 238
NCBIGene 36.3 79710
Alternative 5-prime, size difference: 38
Exclusion in the protein causing a frameshift
Reference transcript: NM_024657
cd HATPase_c 99aa 3e-04
in ref transcript
Histidine kinase-like ATPases; This family includes several ATP-binding proteins for example: histidine kinase, DNA gyrase B, topoisomerases, heat shock protein HSP90, phytochrome-like ATPases and DNA mismatch repair proteins.
pfam zf-CW 46aa 3e-17
in ref transcript
CW-type Zinc Finger. This domain appears to be a zinc finger. The alignment shows four conserved cysteine residues and a conserved tryptophan. It was first identified by, and is predicted to be a "highly specialised mononuclear four-cysteine zinc finger...that plays a role in DNA binding and/or promoting protein-protein interactions in complicated eukaryotic processes including...chromatin methylation status and early embryonic development." Weak homology to pfam00628 further evidences these predictions (personal obs: C Yeats). Twelve different CW-domain-containing protein subfamilies are described, with different subfamilies being characteristic of vertebrates, higher plants and other animals in which these domain is found.
pfam HATPase_c 101aa 3e-06
in ref transcript
Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90.
COG MutL 63aa 0.001
in ref transcript
DNA mismatch repair enzyme (predicted ATPase) [DNA replication, recombination, and repair].
MPP3
rs.MPP3.F1 rs.MPP3.R1 99 120
NCBIGene 36.3 4356
Single exon skipping, size difference: 21
Exclusion in the protein (no frameshift)
Reference transcript: NM_001932
Changed!
cd GMPK 119aa 1e-24
in ref transcript
Guanosine monophosphate kinase (GMPK, EC 2.7.4.8), also known as guanylate kinase (GKase), catalyzes the reversible phosphoryl transfer from adenosine triphosphate (ATP) to guanosine monophosphate (GMP) to yield adenosine diphosphate (ADP) and guanosine diphosphate (GDP). It plays an essential role in the biosynthesis of guanosine triphosphate (GTP). This enzyme is also important for the activation of some antiviral and anticancer agents, such as acyclovir, ganciclovir, carbovir, and thiopurines.
cd PDZ_signaling 81aa 1e-11
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
cd SH3 59aa 2e-05
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
Changed!
smart GuKc 188aa 4e-51
in ref transcript
Guanylate kinase homologues. Active enzymes catalyze ATP-dependent phosphorylation of GMP to GDP. Structure resembles that of adenylate kinase. So-called membrane-associated guanylate kinase homologues (MAGUKs) do not possess guanylate kinase activities; instead at least some possess protein-binding functions.
smart PDZ 82aa 2e-12
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
smart L27 54aa 1e-09
in ref transcript
domain in receptor targeting proteins Lin-2 and Lin-7.
pfam L27 51aa 1e-06
in ref transcript
L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7.
smart SH3 62aa 1e-05
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
Changed!
COG Gmk 187aa 9e-22
in ref transcript
Guanylate kinase [Nucleotide transport and metabolism].
All proteins in this family for which functions are known are subunits of a nuclease complex made up of multiple proteins including MRE11 and RAD50 homologs. The functions of this nuclease complex include recombinational repair and non-homolgous end joining. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). The proteins in this family are distantly related to proteins in the SbcCD complex of bacteria.
COG SbcD 309aa 2e-20
in ref transcript
DNA repair exonuclease [DNA replication, recombination, and repair].
MRGPRF
rs.MRGPRF.F1 rs.MRGPRF.R1 99 110
NCBIGene 36.3 219928
Alternative 5-prime and 3-prime, size difference: 11
Inclusion in 5'UTR, Exclusion in 5'UTR
Reference transcript: NM_145015
pfam 7tm_1 225aa 2e-11
in ref transcript
7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
MRPL21
rs.MRPL21.F1 rs.MRPL21.R1 109 122
NCBIGene 36.3 219927
Alternative 5-prime, size difference: 13
Inclusion in the protein causing a new stop codon
Reference transcript: NM_181514
Changed!
pfam Ribosomal_L21p 73aa 1e-15
in ref transcript
Ribosomal prokaryotic L21 protein.
Changed!
COG RplU 68aa 2e-06
in ref transcript
Ribosomal protein L21 [Translation, ribosomal structure and biogenesis].
MRPL39
rs.MRPL39.F1 rs.MRPL39.R1 101 190
NCBIGene 36.3 54148
Single exon skipping, size difference: 89
Exclusion in the protein causing a frameshift
Reference transcript: NM_080794
cd TGS_ThrRS_N 55aa 1e-13
in ref transcript
TGS _ThrRS_N: ThrRS (threonyl-tRNA Synthetase) is a class II tRNA synthetase that couples threonine to its cognate tRNA. In addition to its catalytic and anticodon-binding domains, ThrRS has an N-terminal TGS domain, named after the ThrRS, GTPase, and SpoT proteins where it occurs. The TGS domain is thought to interact with the tRNA acceptor arm along with an adjacent N-terminal domain. The specific function of TGS is not well understood.
TIGR thrS 133aa 2e-05
in ref transcript
This model represents the threonyl-tRNA synthetase found in most organisms. This protein is a class II tRNA synthetase, and is recognized by the pfam HMM tRNA-synt_2b. Note that B. subtilis has closely related isozymes thrS and thrZ. The N-terminal regions are quite dissimilar between archaeal and eubacterial forms, while some eukaryotic forms are missing sequence there altogether.
PRK thrS 197aa 2e-12
in ref transcript
threonyl-tRNA synthetase; Reviewed.
MRPL42
rs.MRPL42.F1 rs.MRPL42.R1 115 135
NCBIGene 36.3 28977
Single exon skipping, size difference: 20
Exclusion in the protein causing a frameshift
Reference transcript: NM_172178
Changed!
pfam MRP-S32 99aa 3e-42
in ref transcript
Mitochondrial 28S ribosomal protein S32. This entry is of a family of short, approximately 100 amino acid residues, proteins which are mitochondrial 28S ribosomal proteins named as MRP-S32. Their exact function could not be confirmed.
MRPL55
rs.MRPL55.F1 rs.MRPL55.R1 104 216
NCBIGene 36.3 128308
Alternative 5-prime, size difference: 112
Exclusion in 5'UTR
Reference transcript: NM_181462
pfam Mitoc_L55 118aa 1e-32
in ref transcript
Mitochondrial ribosomal protein L55. Members of this family are involved in mitochondrial biogenesis and G2/M phase cell cycle progression. They form a component of the mitochondrial ribosome large subunit (39S) which comprises a 16S rRNA and about 50 distinct proteins.
MRPS11
rs.MRPS11.F1 rs.MRPS11.R1 265 364
NCBIGene 36.3 64963
Single exon skipping, size difference: 99
Exclusion in the protein (no frameshift)
Reference transcript: NM_022839
Changed!
TIGR bact_S11 106aa 3e-18
in ref transcript
This model describes the bacterial 30S ribosomal protein S11. Cutoffs are set such that the model excludes archaeal and eukaryotic ribosomal proteins, but many chloroplast and mitochondrial equivalents of S11 are detected.
Changed!
PRK PRK05309 109aa 2e-19
in ref transcript
30S ribosomal protein S11; Validated.
Changed!
TIGR bact_S11 84aa 3e-13
in modified transcript
Changed!
PRK PRK05309 87aa 5e-14
in modified transcript
MRVI1 protein. This family consists of mammalian MRVI1 proteins which are related to the lymphoid-restricted membrane protein (JAW1) and the IP3 receptor associated cGMP kinase substrates A and B (IRAGA and IRAGB). The function of MRVI1 is unknown although mutations in the Mrvi1 gene induces myeloid leukaemia by altering the expression of a gene important for myeloid cell growth and/or differentiation so it has been speculated that Mrvi1 is a tumour suppressor gene. IRAG is very similar in sequence to MRVI1 and is an essential NO/cGKI-dependent regulator of IP3-induced calcium release. Activation of cGKI decreases IP3-stimulated elevations in intracellular calcium, induces smooth muscle relaxation and contributes to the antiproliferative and pro-apoptotic effects of NO/cGMP. Jaw1 is a member of a class of proteins with COOH-terminal hydrophobic membrane anchors and is structurally similar to proteins involved in vesicle targeting and fusion. This suggests that the function and/or the structure of the ER in lymphocytes may be modified by lymphoid-restricted resident ER proteins.
MRVI1
rs.MRVI1.F2 rs.MRVI1.R2 257 362
NCBIGene 36.3 10335
Single exon skipping, size difference: 105
Exclusion in 5'UTR
Reference transcript: NM_001100163
pfam MRVI1 593aa 1e-141
in ref transcript
MRVI1 protein. This family consists of mammalian MRVI1 proteins which are related to the lymphoid-restricted membrane protein (JAW1) and the IP3 receptor associated cGMP kinase substrates A and B (IRAGA and IRAGB). The function of MRVI1 is unknown although mutations in the Mrvi1 gene induces myeloid leukaemia by altering the expression of a gene important for myeloid cell growth and/or differentiation so it has been speculated that Mrvi1 is a tumour suppressor gene. IRAG is very similar in sequence to MRVI1 and is an essential NO/cGKI-dependent regulator of IP3-induced calcium release. Activation of cGKI decreases IP3-stimulated elevations in intracellular calcium, induces smooth muscle relaxation and contributes to the antiproliferative and pro-apoptotic effects of NO/cGMP. Jaw1 is a member of a class of proteins with COOH-terminal hydrophobic membrane anchors and is structurally similar to proteins involved in vesicle targeting and fusion. This suggests that the function and/or the structure of the ER in lymphocytes may be modified by lymphoid-restricted resident ER proteins.
MRVI1
rs.MRVI1.F3 rs.MRVI1.R3 414 518
NCBIGene 36.3 10335
Single exon skipping, size difference: 104
Exclusion in the protein causing a frameshift
Reference transcript: NM_001098579
Changed!
pfam MRVI1 593aa 1e-144
in ref transcript
MRVI1 protein. This family consists of mammalian MRVI1 proteins which are related to the lymphoid-restricted membrane protein (JAW1) and the IP3 receptor associated cGMP kinase substrates A and B (IRAGA and IRAGB). The function of MRVI1 is unknown although mutations in the Mrvi1 gene induces myeloid leukaemia by altering the expression of a gene important for myeloid cell growth and/or differentiation so it has been speculated that Mrvi1 is a tumour suppressor gene. IRAG is very similar in sequence to MRVI1 and is an essential NO/cGKI-dependent regulator of IP3-induced calcium release. Activation of cGKI decreases IP3-stimulated elevations in intracellular calcium, induces smooth muscle relaxation and contributes to the antiproliferative and pro-apoptotic effects of NO/cGMP. Jaw1 is a member of a class of proteins with COOH-terminal hydrophobic membrane anchors and is structurally similar to proteins involved in vesicle targeting and fusion. This suggests that the function and/or the structure of the ER in lymphocytes may be modified by lymphoid-restricted resident ER proteins.
MS4A13
rs.MS4A13.F1 rs.MS4A13.R1 108 228
NCBIGene 36.3 503497
Single exon skipping, size difference: 120
Exclusion in the protein (no frameshift)
Reference transcript: NM_001012417
Changed!
pfam CD20 139aa 7e-14
in ref transcript
CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a single beta subunit, and two disulfide-linked gamma subunits. The alpha subunit of Fc epsilon RI and most Fc receptors are homologous members of the Ig superfamily. By contrast, the beta and gamma subunits from Fc epsilon RI are not homologous to the Ig superfamily. Both molecules have four putative transmembrane segments and a probably topology where both amino- and carboxy termini protrude into the cytoplasm.
Changed!
pfam CD20 99aa 2e-09
in modified transcript
MS4A15
rs.MS4A15.F1 rs.MS4A15.R1 149 436
NCBIGene 36.3 219995
Exon skipping and alternative 3-prime or 5-prime, size difference: 287
Exclusion of the protein initiation site, Exclusion in the protein causing a frameshift
Reference transcript: NM_001098835
Changed!
pfam CD20 118aa 3e-30
in ref transcript
CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a single beta subunit, and two disulfide-linked gamma subunits. The alpha subunit of Fc epsilon RI and most Fc receptors are homologous members of the Ig superfamily. By contrast, the beta and gamma subunits from Fc epsilon RI are not homologous to the Ig superfamily. Both molecules have four putative transmembrane segments and a probably topology where both amino- and carboxy termini protrude into the cytoplasm.
MSLN
rs.MSLN.F1 rs.MSLN.R1 118 142
NCBIGene 36.3 10232
Alternative 3-prime, size difference: 24
Exclusion in the protein (no frameshift)
Reference transcript: NM_013404
Changed!
pfam Mesothelin 606aa 0.0
in ref transcript
Pre-pro-megakaryocyte potentiating factor precursor (Mesothelin). This family consists of several mammalian pre-pro-megakaryocyte potentiating factor precursor (MPF) or mesothelin proteins. Mesothelin is a glycosylphosphatidylinositol-linked glycoprotein highly expressed in mesothelial cells, mesotheliomas, and ovarian cancer, but the biological function of the protein is not known.
Changed!
pfam Mesothelin 598aa 0.0
in modified transcript
MSR1
rs.MSR1.F1 rs.MSR1.R1 142 331
NCBIGene 36.3 4481
Single exon skipping, size difference: 189
Exclusion in the protein (no frameshift)
Reference transcript: NM_138715
Changed!
smart SR 101aa 1e-39
in ref transcript
Scavenger receptor Cys-rich. The sea ucrhin egg peptide speract contains 4 repeats of SR domains that contain 6 conserved cysteines. May bind bacterial antigens in the protein MARCO.
pfam Macscav_rec 49aa 3e-18
in ref transcript
Macrophage scavenger receptor.
pfam Collagen 63aa 3e-13
in ref transcript
Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxyproline. Collagens are post translationally modified by proline hydroxylase to form the hydroxyproline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure.
Changed!
pfam SMC_N 83aa 0.007
in ref transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Changed!
pfam SMC_N 173aa 0.009
in ref transcript
Changed!
smart SR 42aa 3e-12
in modified transcript
MST4
rs.MST4.F1 rs.MST4.R1 102 333
NCBIGene 36.3 51765
Single exon skipping, size difference: 231
Exclusion in the protein (no frameshift)
Reference transcript: NM_016542
Changed!
cd STKc_MST4 277aa 1e-146
in ref transcript
Serine/threonine kinases (STKs), mammalian Ste20-like protein kinase 4 (MST4) subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The MST4 subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. MST4 is sometimes referred to as MASK (MST3 and SOK1-related kinase). It plays a role in mitogen-activated protein kinase (MAPK) signaling during cytoskeletal rearrangement, morphogenesis, and apoptosis. It influences cell growth and transformation by modulating the extracellular signal-regulated kinase (ERK) pathway. MST4 may also play a role in tumor formation and progression. It localizes in the Golgi apparatus by interacting with the Golgi matrix protein GM130 and may play a role in cell migration.
Changed!
smart S_TKc 241aa 4e-70
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
Changed!
COG SPS1 266aa 3e-32
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
Changed!
cd STKc_MST3_like 204aa 1e-119
in modified transcript
Serine/threonine kinases (STKs), mammalian Ste20-like protein kinase 3 (MST3)-like subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The MST3-like subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. This subfamily is composed of MST3, MST4, STK25, Schizosaccharomyces pombe Nak1 and Sid1, Saccharomyces cerevisiae sporulation-specific protein 1 (SPS1), and related proteins. Nak1 is required by fission yeast for polarizing the tips of actin cytoskeleton and is involved in cell growth, cell separation, cell morphology and cell-cycle progression. Sid1 is a component in the septation initiation network (SIN) signaling pathway, and plays a role in cytokinesis. SPS1 plays a role in regulating proteins required for spore wall formation. MST4 plays a role in mitogen-activated protein kinase (MAPK) signaling during cytoskeletal rearrangement, morphogenesis, and apoptosis. MST3 phosphorylates the STK NDR and may play a role in cell cycle progression and cell morphology. STK25 may play a role in the regulation of cell migration and polarization.
Changed!
smart S_TKc 182aa 1e-60
in modified transcript
Changed!
COG SPS1 201aa 2e-27
in modified transcript
MST4
rs.MST4.F2 rs.MST4.R2 325 511
NCBIGene 36.3 51765
Single exon skipping, size difference: 186
Exclusion in the protein (no frameshift)
Reference transcript: NM_016542
Changed!
cd STKc_MST4 277aa 1e-146
in ref transcript
Serine/threonine kinases (STKs), mammalian Ste20-like protein kinase 4 (MST4) subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The MST4 subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. MST4 is sometimes referred to as MASK (MST3 and SOK1-related kinase). It plays a role in mitogen-activated protein kinase (MAPK) signaling during cytoskeletal rearrangement, morphogenesis, and apoptosis. It influences cell growth and transformation by modulating the extracellular signal-regulated kinase (ERK) pathway. MST4 may also play a role in tumor formation and progression. It localizes in the Golgi apparatus by interacting with the Golgi matrix protein GM130 and may play a role in cell migration.
Changed!
smart S_TKc 241aa 4e-70
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
Changed!
COG SPS1 266aa 3e-32
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
Changed!
cd STKc_MST3_like 212aa 6e-90
in modified transcript
Serine/threonine kinases (STKs), mammalian Ste20-like protein kinase 3 (MST3)-like subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The MST3-like subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. This subfamily is composed of MST3, MST4, STK25, Schizosaccharomyces pombe Nak1 and Sid1, Saccharomyces cerevisiae sporulation-specific protein 1 (SPS1), and related proteins. Nak1 is required by fission yeast for polarizing the tips of actin cytoskeleton and is involved in cell growth, cell separation, cell morphology and cell-cycle progression. Sid1 is a component in the septation initiation network (SIN) signaling pathway, and plays a role in cytokinesis. SPS1 plays a role in regulating proteins required for spore wall formation. MST4 plays a role in mitogen-activated protein kinase (MAPK) signaling during cytoskeletal rearrangement, morphogenesis, and apoptosis. MST3 phosphorylates the STK NDR and may play a role in cell cycle progression and cell morphology. STK25 may play a role in the regulation of cell migration and polarization.
Changed!
smart S_TKc 166aa 5e-47
in modified transcript
Changed!
COG SPS1 208aa 2e-21
in modified transcript
MTERFD3
rs.MTERFD3.F1 rs.MTERFD3.R1 90 100
NCBIGene 36.3 80298
Alternative 3-prime, size difference: 10
Inclusion in 5'UTR
Reference transcript: NM_025198
pfam mTERF 282aa 3e-10
in ref transcript
mTERF. This family contains one sequence of known function Human mitochondrial transcription termination factor (mTERF) the rest of the family consists of hypothetical proteins none of which have any functional information. mTERF is a multizipper protein possessing three putative leucine zippers one of which is bipartite. The protein binds DNA as a monomer. The leucine zippers are not implicated in a dimerisation role as in other leucine zippers.
MTMR14
rs.MTMR14.F1 rs.MTMR14.R1 267 423
NCBIGene 36.3 64419
Single exon skipping, size difference: 156
Exclusion in the protein (no frameshift)
Reference transcript: NM_001077525
MTUS1
rs.MTUS1.F1 rs.MTUS1.R1 287 449
NCBIGene 36.3 57509
Single exon skipping, size difference: 162
Exclusion in the protein (no frameshift)
Reference transcript: NM_001001924
TIGR SMC_prok_B 297aa 3e-14
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
COG Smc 320aa 2e-11
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
MUC1
rs.MUC1.F1 rs.MUC1.R1 101 128
NCBIGene 36.3 4582
Alternative 3-prime, size difference: 27
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_002456
smart SEA 116aa 2e-31
in ref transcript
Domain found in sea urchin sperm protein, enterokinase, agrin. Proposed function of regulating or binding carbohydrate sidechains.
MUC1
rs.MUC1.F3 rs.MUC1.R3 137 191
NCBIGene 36.3 4582
Alternative 3-prime, size difference: 54
Exclusion in the protein (no frameshift)
Reference transcript: NM_002456
Changed!
smart SEA 116aa 2e-31
in ref transcript
Domain found in sea urchin sperm protein, enterokinase, agrin. Proposed function of regulating or binding carbohydrate sidechains.
Changed!
smart SEA 117aa 1e-26
in modified transcript
MVP
rs.MVP.F1 rs.MVP.R1 130 171
NCBIGene 36.3 9961
Alternative 3-prime, size difference: 41
Inclusion in 5'UTR
Reference transcript: NM_005115
pfam Vault 53aa 8e-12
in ref transcript
Major Vault Protein repeat. The vault is a ubiquitous and highly conserved ribonucleoprotein particle of approximately 13 mDa of unknown function. This family corresponds to a repeat found in the amino terminal half of the major vault protein.
pfam Vault 53aa 2e-09
in ref transcript
pfam Vault 58aa 9e-09
in ref transcript
pfam Vault 49aa 9e-08
in ref transcript
pfam Vault 62aa 3e-06
in ref transcript
pfam Vault 48aa 6e-05
in ref transcript
MYH14
rs.MYH14.F1 rs.MYH14.R1 124 148
NCBIGene 36.3 79784
Single exon skipping, size difference: 24
Exclusion in the protein (no frameshift)
Reference transcript: NM_001077186
Changed!
cd MYSc_type_II 707aa 0.0
in ref transcript
Myosin motor domain, type II myosins. Myosin II mediates cortical contraction in cell motility, and is the motor in smooth and skeletal muscle. This catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle.
Changed!
smart MYSc 709aa 0.0
in ref transcript
Myosin. Large ATPases. ATPase; molecular motor. Muscle contraction consists of a cyclical interaction between myosin and actin. The core of the myosin structure is similar in fold to that of kinesin.
pfam Myosin_tail_1 499aa 1e-30
in ref transcript
Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
pfam Myosin_tail_1 238aa 4e-14
in ref transcript
Changed!
COG COG5022 780aa 0.0
in ref transcript
Myosin heavy chain [Cytoskeleton].
Changed!
cd MYSc_type_II 699aa 0.0
in modified transcript
Changed!
smart MYSc 701aa 0.0
in modified transcript
Changed!
COG COG5022 772aa 0.0
in modified transcript
MYO3B
rs.MYO3B.F1 rs.MYO3B.R1 134 215
NCBIGene 36.3 140469
Single exon skipping, size difference: 81
Exclusion in the protein (no frameshift)
Reference transcript: NM_138995
cd MYSc_type_III 699aa 0.0
in ref transcript
Myosin motor domain, type III myosins. Myosin III has been shown to play a role in the vision process in insects and in hearing in mammals. Myosin III, an unconventional myosin, does not form dimers. This catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle.
cd STKc_myosinIIIB 291aa 0.0
in ref transcript
Serine/threonine kinases (STKs), class IIIB myosin subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The class III myosin subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. Class III myosins are motor proteins containing an N-terminal kinase catalytic domain and a C-terminal actin-binding domain. Class III myosins may play an important role in maintaining the structural integrity of photoreceptor cell microvilli. They may also function as cargo carriers during light-dependent translocation, in photoreceptor cells, of proteins such as transducin and arrestin. Class IIIB myosin is expressed highly in retina. It is also present in the brain and testis. The human class IIIB myosin gene maps to a region that overlaps the locus for Bardet-Biedl syndrome, which is characterized by dysmorphic extremities, retinal dystrophy, obesity, male hypogenitalism, and renal abnormalities.
smart MYSc 696aa 0.0
in ref transcript
Myosin. Large ATPases. ATPase; molecular motor. Muscle contraction consists of a cyclical interaction between myosin and actin. The core of the myosin structure is similar in fold to that of kinesin.
smart S_TKc 257aa 2e-67
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
Changed!
COG COG5022 755aa 1e-168
in ref transcript
Myosin heavy chain [Cytoskeleton].
COG SPS1 268aa 3e-33
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
Changed!
COG COG5022 688aa 1e-168
in modified transcript
MYOM1
rs.MYOM1.F1 rs.MYOM1.R1 102 390
NCBIGene 36.3 8736
Single exon skipping, size difference: 288
Exclusion in the protein (no frameshift)
Reference transcript: NM_003803
cd FN3 94aa 2e-15
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd FN3 90aa 8e-14
in ref transcript
cd FN3 94aa 9e-14
in ref transcript
cd IGcam 92aa 9e-14
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd FN3 91aa 1e-12
in ref transcript
cd FN3 95aa 3e-12
in ref transcript
cd IGcam 89aa 2e-11
in ref transcript
cd IGcam 80aa 1e-05
in ref transcript
cd IGcam 78aa 0.005
in ref transcript
pfam I-set 85aa 4e-21
in ref transcript
Immunoglobulin I-set domain.
pfam fn3 89aa 1e-12
in ref transcript
Fibronectin type III domain.
smart FN3 84aa 4e-12
in ref transcript
Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Changed!
smart IG_like 88aa 9e-12
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
pfam fn3 87aa 2e-11
in ref transcript
pfam fn3 90aa 2e-10
in ref transcript
pfam I-set 69aa 5e-10
in ref transcript
pfam fn3 85aa 2e-09
in ref transcript
pfam I-set 70aa 2e-05
in ref transcript
pfam I-set 70aa 3e-04
in ref transcript
Changed!
pfam I-set 93aa 9e-12
in modified transcript
MYST3
rs.MYST3.F1 rs.MYST3.R1 255 299
NCBIGene 36.3 7994
Alternative 5-prime, size difference: 44
Exclusion in 5'UTR
Reference transcript: NM_001099412
cd BAH_plant_2 34aa 8e-05
in ref transcript
BAH, or Bromo Adjacent Homology domain, plant-specific sub-family with unknown function. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
pfam MOZ_SAS 188aa 1e-101
in ref transcript
MOZ/SAS family. This region of these proteins has been suggested to be homologous to acetyltransferases.
pfam PHD 52aa 3e-10
in ref transcript
PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
pfam PHD 58aa 7e-06
in ref transcript
smart H15 78aa 8e-06
in ref transcript
Domain in histone families 1 and 5.
COG SAS2 331aa 3e-90
in ref transcript
Histone acetyltransferase (MYST family) [Chromatin structure and dynamics].
NAE1
rs.NAE1.F1 rs.NAE1.R1 190 288
NCBIGene 36.3 8883
Single exon skipping, size difference: 98
Inclusion in the protein causing a frameshift
Reference transcript: NM_003905
Changed!
cd APPBP1_RUB 389aa 1e-178
in ref transcript
Ubiquitin activating enzyme (E1) subunit APPBP1. APPBP1 is part of the heterodimeric activating enzyme (E1), specific for the Rub family of ubiquitin-like proteins (Ubls). E1 enzymes are part of a conjugation cascade to attach Ub or Ubls, covalently to substrate proteins consisting of activating (E1), conjugating (E2), and/or ligating (E3) enzymes. E1 activates ubiquitin(-like) by C-terminal adenylation, and subsequently forms a highly reactive thioester bond between its catalytic cysteine and Ubls C-terminus. E1 also associates with E2 and promotes ubiquitin transfer to the E2's catalytic cysteine. Post-translational modification by Rub family of ubiquitin-like proteins (Ublps) activates SCF ubiquitin ligases and is involved in cell cycle control, signaling and embryogenesis. ABPP1 contains part of the adenylation domain.
Changed!
cd APPBP1_RUB 84aa 3e-17
in ref transcript
Changed!
TIGR Ube1 141aa 4e-16
in ref transcript
This model represents the full length, over a thousand amino acids, of a multicopy family of eukaryotic proteins, many of which are designated ubiquitin-activating enzyme E1. Members have two copies of the ThiF family domain (pfam00899), a repeat found in ubiquitin-activating proteins (pfam02134), and other regions.
Changed!
TIGR Ube1 104aa 7e-07
in ref transcript
Changed!
COG ThiF 185aa 4e-22
in ref transcript
Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 [Coenzyme metabolism].
NARF
rs.NARF.F1 rs.NARF.R1 109 247
NCBIGene 36.3 26502
Single exon skipping, size difference: 138
Inclusion in the protein causing a new stop codon
Reference transcript: NM_012336
Changed!
pfam Fe_hyd_lg_C 307aa 1e-107
in ref transcript
Iron only hydrogenase large subunit, C-terminal domain.
Changed!
pfam Fe_hyd_SSU 54aa 2e-06
in ref transcript
Iron hydrogenase small subunit. This family represents the small subunit of the Fe-only hydrogenases EC:1.18.99.1. The subunit is comprised of alternating random coil and alpha helical structures that encompasses the large subunit in a novel protein fold.
Changed!
COG COG4624 412aa 6e-50
in ref transcript
Iron only hydrogenase large subunit, C-terminal domain [General function prediction only].
Changed!
pfam Fe_hyd_lg_C 182aa 2e-64
in modified transcript
Changed!
COG COG4624 224aa 7e-36
in modified transcript
NARF
rs.NARF.F2 rs.NARF.R2 102 246
NCBIGene 36.3 26502
Single exon skipping, size difference: 144
Exclusion in the protein (no frameshift)
Reference transcript: NM_012336
Changed!
pfam Fe_hyd_lg_C 307aa 1e-107
in ref transcript
Iron only hydrogenase large subunit, C-terminal domain.
pfam Fe_hyd_SSU 54aa 2e-06
in ref transcript
Iron hydrogenase small subunit. This family represents the small subunit of the Fe-only hydrogenases EC:1.18.99.1. The subunit is comprised of alternating random coil and alpha helical structures that encompasses the large subunit in a novel protein fold.
Changed!
COG COG4624 412aa 6e-50
in ref transcript
Iron only hydrogenase large subunit, C-terminal domain [General function prediction only].
Changed!
pfam Fe_hyd_lg_C 302aa 1e-106
in modified transcript
Changed!
COG COG4624 387aa 1e-43
in modified transcript
NARG1L
rs.NARG1L.F1 rs.NARG1L.R1 108 215
NCBIGene 36.3 79612
Single exon skipping, size difference: 107
Exclusion in the protein causing a frameshift
Reference transcript: NM_024561
cd TPR 100aa 6e-06
in ref transcript
Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
TIGR PEP_TPR_lipo 219aa 6e-06
in ref transcript
This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
COG COG4783 165aa 3e-06
in ref transcript
Putative Zn-dependent protease, contains TPR repeats [General function prediction only].
NARG2
rs.NARG2.F1 rs.NARG2.R1 317 477
NCBIGene 36.3 79664
Alternative 3-prime, size difference: 160
Exclusion in the protein causing a frameshift
Reference transcript: NM_024611
Changed!
pfam NARG2_C 214aa 3e-74
in ref transcript
NMDA receptor-regulated gene protein 2. The transition of neuronal cells from pre-cursor to mature state is regulated by the N-methyl-d-aspartate (NMDA) receptor, a glutamate-gated ion channel that is permeable to Ca2+. NMDA receptors probably mediate this activity by permitting expression of NARG2. NARG2 is transiently expressed, being a regulatory protein that is present in the nucleus of dividing cells and then down-regulated as progenitors exit the cell cycle and begin to differentiate. NARG2 contains repeats of (S/T)PXX, (11 in mouse, six in human), a putative DNA-binding motif that is found in many gene-regulatory proteins including Kruppel, Hunchback and Antennapedi.
NBEAL1
rs.NBEAL1.F1 rs.NBEAL1.R1 129 336
NCBIGene 36.3 65065
Multiple exon skipping, size difference: 207
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_001099273
Changed!
cd Beach 281aa 1e-128
in ref transcript
BEACH (Beige and Chediak-Higashi) domains, implicated in membrane trafficking, are present in a family of proteins conserved throughout eukaryotes. This group contains human lysosomal trafficking regulator (LYST), LPS-responsive and beige-like anchor (LRBA) and neurobeachin. Disruption of LYST leads to Chediak-Higashi syndrome, characterized by severe immunodeficiency, albinism, poor blood coagulation and neurologic problems. Neurobeachin is a candidate gene linked to autism. LBRA seems to be upregulated in several cancer types. It has been shown that the BEACH domain itself is important for the function of these proteins.
cd WD40 274aa 2e-15
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
cd Neurobeachin 84aa 9e-11
in ref transcript
Neurobeachin Pleckstrin homology-like domain. This domain is found in the large multi-domain eukaryotic protein Nerubeachin, N-terminal to the BEACH domain. This PH-like domain interacts with the BEACH domain in the same manner used by other PH-like domains to bind peptides.
Changed!
pfam Beach 281aa 1e-132
in ref transcript
Beige/BEACH domain.
COG COG2319 285aa 4e-08
in ref transcript
FOG: WD40 repeat [General function prediction only].
Changed!
cd Beach 275aa 1e-124
in modified transcript
Changed!
pfam Beach 275aa 1e-128
in modified transcript
NBN
rs.NBN.F1 rs.NBN.R1 345 395
NCBIGene 36.3 4683
Single exon skipping, size difference: 50
Inclusion in the protein causing a new stop codon
Reference transcript: NM_002485
Changed!
cd FHA 109aa 2e-14
in ref transcript
Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain. FHA domains may bind phosphothreonine, phosphoserine and sometimes phosphotyrosine. In eukaryotes, many FHA domain-containing proteins localize to the nucleus, where they participate in establishing or maintaining cell cycle checkpoints, DNA repair, or transcriptional regulation. Members of the FHA family include: Dun1, Rad53, Cds1, Mek1, KAPP(kinase-associated protein phosphatase),and Ki-67 (a human nuclear protein related to cell proliferation).
Changed!
cd BRCT 69aa 4e-04
in ref transcript
Breast Cancer Suppressor Protein (BRCA1), carboxy-terminal domain. The BRCT domain is found within many DNA damage repair and cell cycle checkpoint proteins. The unique diversity of this domain superfamily allows BRCT modules to interact forming homo/hetero BRCT multimers, BRCT-non-BRCT interactions, and interactions within DNA strand breaks.
Changed!
pfam Nbs1_C 65aa 3e-30
in ref transcript
DNA damage repair protein Nbs1. This C terminal region of the DNA damage repair protein Nbs1 has been identified to be necessary for the binding of Mre11 and Tel1.
Changed!
pfam FHA 77aa 2e-08
in ref transcript
FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif.
Changed!
pfam BRCT 67aa 0.008
in ref transcript
BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not been included in this family. The BRCT domain of XRCC1 forms a homodimer in the crystal structure. This suggests that pairs of BRCT domains associate as homo- or heterodimers.
Changed!
cd FHA 49aa 3e-05
in modified transcript
NCALD
rs.NCALD.F1 rs.NCALD.R1 114 201
NCBIGene 36.3 83988
Single exon skipping, size difference: 87
Exclusion in 5'UTR
Reference transcript: NM_001040625
cd EFh 62aa 1e-10
in ref transcript
EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
cd EFh 75aa 4e-10
in ref transcript
smart EFh 29aa 0.001
in ref transcript
EF-hand, calcium binding motif. EF-hands are calcium-binding motifs that occur at least in pairs. Links between disease states and genes encoding EF-hands, particularly the S100 subclass, are emerging. Each motif consists of a 12 residue loop flanked on either side by a 12 residue alpha-helix. EF-hands undergo a conformational change unpon binding calcium ions.
COG FRQ1 166aa 2e-26
in ref transcript
Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].
NCAM1
rs.NCAM1.F1 rs.NCAM1.R1 170 275
NCBIGene 36.3 4684
Multiple exon skipping, size difference: 105
Inclusion in the protein (no stop codon or frameshift), Inclusion in the protein (no stop codon or frameshift), Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_181351
cd IGcam 91aa 5e-12
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IGcam 87aa 8e-10
in ref transcript
cd IGcam 104aa 1e-09
in ref transcript
cd IGcam 73aa 2e-09
in ref transcript
cd FN3 98aa 5e-07
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd FN3 88aa 2e-04
in ref transcript
pfam I-set 83aa 1e-15
in ref transcript
Immunoglobulin I-set domain.
smart IG_like 91aa 2e-11
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
smart IGc2 69aa 8e-10
in ref transcript
Immunoglobulin C-2 Type.
smart IG_like 85aa 2e-08
in ref transcript
pfam fn3 89aa 7e-08
in ref transcript
Fibronectin type III domain.
pfam fn3 83aa 1e-06
in ref transcript
NCBP2
rs.NCBP2.F1 rs.NCBP2.R1 227 386
NCBIGene 36.3 22916
Alternative 5-prime and 3-prime, size difference: 159
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_007362
Changed!
cd RRM 75aa 5e-18
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
Changed!
smart RRM_2 74aa 7e-18
in ref transcript
RNA recognition motif.
Changed!
COG COG0724 83aa 1e-11
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
NCOA3
rs.NCOA3.F1 rs.NCOA3.R1 108 120
NCBIGene 36.3 8202
Alternative 5-prime, size difference: 12
Exclusion in the protein (no frameshift)
Reference transcript: NM_181659
cd PAS 66aa 2e-08
in ref transcript
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.
cd HLH 54aa 0.002
in ref transcript
Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
pfam Nuc_rec_co-act 48aa 2e-16
in ref transcript
Nuclear receptor coactivator. This region is found on eukaryotic nuclear receptor coactivators and forms an alpha helical structure.
pfam SRC-1 87aa 6e-15
in ref transcript
Steroid receptor coactivator. This domain is found in steroid/nuclear receptor coactivators and contains two LXXLL motifs that are involved in receptor binding. The family includes SRC-1/NcoA-1, NcoA-2/TIF2, pCIP/ACTR/GRIP-1/AIB1.
smart PAS 58aa 8e-10
in ref transcript
PAS domain. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels ([1]; Ponting & Aravind, in press).
pfam DUF1518 58aa 4e-08
in ref transcript
Domain of unknown function (DUF1518). This domain, which is usually found tandemly repeated, is found various receptor co-activating proteins.
smart HLH 55aa 3e-06
in ref transcript
helix loop helix domain.
NCOR2
rs.NCOR2.F1 rs.NCOR2.R1 102 153
NCBIGene 36.3 9612
Single exon skipping, size difference: 51
Exclusion in the protein (no frameshift)
Reference transcript: NM_006312
cd SANT 44aa 2e-07
in ref transcript
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.
cd SANT 43aa 0.003
in ref transcript
pfam Myb_DNA-binding 44aa 1e-09
in ref transcript
Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.
pfam Myb_DNA-binding 43aa 6e-04
in ref transcript
NCOR2
rs.NCOR2.F2 rs.NCOR2.R2 173 311
NCBIGene 36.3 9612
Alternative 5-prime, size difference: 138
Exclusion in the protein (no frameshift)
Reference transcript: NM_006312
cd SANT 44aa 2e-07
in ref transcript
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.
cd SANT 43aa 0.003
in ref transcript
pfam Myb_DNA-binding 44aa 1e-09
in ref transcript
Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.
pfam Myb_DNA-binding 43aa 6e-04
in ref transcript
NF1
rs.NF1.F1 rs.NF1.R1 212 275
NCBIGene 36.3 4763
Single exon skipping, size difference: 63
Exclusion in the protein (no frameshift)
Reference transcript: NM_001042492
Changed!
cd RasGAP_Neurofibromin 347aa 0.0
in ref transcript
Neurofibromin is the product of the neurofibromatosis type 1 gene (NF1) and shares a region of similarity with catalytic domain of the mammalian p120RasGAP protein and an extended similarity with the Saccharomyces cerevisiae RasGAP proteins Ira1 and Ira2. Neurofibromin has been shown to function as a GAP (GTPase-activating protein) which inhibits low molecular weight G proteins such as Ras by stimulating their intrinsic GTPase activity. NF1 is a common genetic disorder characterized by various symptoms ranging from predisposition for the development of tumors to learning disability or mental retardation. Loss of neurofibromin activity can be correlated to the increase in Ras-GTP concentration in neurofibromas of NF1 of patients, supporting the notion that unregulated Ras signaling may contribute to their development.
cd SEC14 147aa 1e-09
in ref transcript
Sec14p-like lipid-binding domain. Found in secretory proteins, such as S. cerevisiae phosphatidylinositol transfer protein (Sec14p), and in lipid regulated proteins such as RhoGAPs, RhoGEFs and neurofibromin (NF1). SEC14 domain of Dbl is known to associate with G protein beta/gamma subunits.
Changed!
smart RasGAP 371aa 2e-95
in ref transcript
GTPase-activator protein for Ras-like GTPases. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position. Improved domain limits from structure.
smart SEC14 141aa 1e-17
in ref transcript
Domain in homologues of a S. cerevisiae phosphatidylinositol transfer protein (Sec14p). Domain in homologues of a S. cerevisiae phosphatidylinositol transfer protein (Sec14p) and in RhoGAPs, RhoGEFs and the RasGAP, neurofibromin (NF1). Lipid-binding domain. The SEC14 domain of Dbl is known to associate with G protein beta/gamma subunits.
Changed!
COG IQG1 219aa 2e-07
in ref transcript
Protein involved in regulation of cellular morphogenesis/cytokinesis [Cell division and chromosome partitioning / Signal transduction mechanisms].
Changed!
cd RasGAP_Neurofibromin 326aa 0.0
in modified transcript
Changed!
smart RasGAP 350aa 1e-98
in modified transcript
Changed!
COG IQG1 198aa 5e-08
in modified transcript
NF2
rs.NF2.F1 rs.NF2.R1 125 140
NCBIGene 36.3 4771
Alternative 5-prime, size difference: 15
Exclusion in 3'UTR
Reference transcript: NM_181832
cd FERM_C 93aa 1e-22
in ref transcript
The FERM_C domain is the third structural domain within the FERM domain. The FERM domain is found in the cytoskeletal-associated proteins such as ezrin, moesin, radixin, 4.1R, and merlin. These proteins provide a link between the membrane and cytoskeleton and are involved in signal transduction pathways. The FERM_C domain is also found in protein tyrosine phosphatases (PTPs) , the tryosine kinases FAKand JAK, in addition to other proteins involved in signaling. This domain is structuraly similar to the PH and PTB domains and consequently is capable of binding to both peptides and phospholipids at different sites.
smart B41 204aa 3e-53
in ref transcript
Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
pfam FERM_C 87aa 9e-33
in ref transcript
FERM C-terminal PH-like domain.
pfam ERM 218aa 2e-31
in ref transcript
Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins.
NFKB2
rs.NFKB2.F1 rs.NFKB2.R1 102 276
NCBIGene 36.3 4791
Alternative 5-prime, size difference: 174
Exclusion in 5'UTR
Reference transcript: NM_001077494
cd IPT_NFkappaB 100aa 9e-45
in ref transcript
IPT domain of the transcription factor NFkappaB and related transcription factors. NFkappaB is considered a central regulator of stress responses, activated by different stressful conditions, including physical stress, oxidative stress, and exposure to certain chemicals. NFkappaB blocking cell apoptosis in several cell types, gives it an important role in cell proliferation and differentiation.
cd ANK 138aa 1e-21
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
cd ANK 132aa 1e-18
in ref transcript
pfam RHD 181aa 4e-66
in ref transcript
Rel homology domain (RHD). Proteins containing the Rel homology domain (RHD) are eukaryotic transcription factors. The RHD is composed of two structural domains. This is the N-terminal domain that is similar to that found in P53. The C-terminal domain has an immunoglobulin-like fold (See pfam01833) that binds to DNA.
smart IPT 100aa 4e-13
in ref transcript
ig-like, plexins, transcription factors.
TIGR trp 177aa 3e-06
in ref transcript
after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
COG Arp 108aa 7e-11
in ref transcript
FOG: Ankyrin repeat [General function prediction only].
COG Arp 167aa 8e-07
in ref transcript
NLRP1
rs.NLRP1.F1 rs.NLRP1.R1 101 113
NCBIGene 36.3 22861
Alternative 5-prime, size difference: 12
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_033004
cd LRR_RI 179aa 8e-30
in ref transcript
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
cd AAA 95aa 0.009
in ref transcript
The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
pfam NACHT 170aa 5e-45
in ref transcript
NACHT domain. This NTPase domain is found in apoptosis proteins as well as those involved in MHC transcription activation. This family is closely related to pfam00931.
pfam CARD 82aa 2e-16
in ref transcript
Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold.
pfam PAAD_DAPIN 58aa 2e-08
in ref transcript
PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain.
smart LRR_RI 28aa 0.002
in ref transcript
Leucine rich repeat, ribonuclease inhibitor type.
COG RNA1 142aa 0.004
in ref transcript
Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Signal transduction mechanisms / RNA processing and modification].
NOVA1
rs.NOVA1.F1 rs.NOVA1.R1 361 433
NCBIGene 36.3 4857
Single exon skipping, size difference: 72
Exclusion in the protein (no frameshift)
Reference transcript: NM_002515
Changed!
cd PCBP_like_KH 66aa 1e-12
in ref transcript
K homology RNA-binding domain, PCBP_like. Members of this group possess KH domains in a tandem arrangement. Most members, similar to the poly(C) binding proteins (PCBPs) and Nova, containing three KH domains, with the first and second domains, which are represented here, in tandem arrangement, followed by a large spacer region, with the third domain near the C-terminal end of the protein. The poly(C) binding proteins (PCBPs) can be divided into two groups, hnRNPs K/J and the alphaCPs, which share a triple KH domain configuration and poly(C) binding specificity. They play roles in mRNA stabilization, translational activation, and translational silencing. Nova-1 and Nova-2 are nuclear RNA-binding proteins that regulate splicing. This group also contains plant proteins that seem to have two tandem repeat arrrangements, like Hen4, a protein that plays a role in AGAMOUS (AG) pre-mRNA processing and important step in plant development. In general, KH binds single-stranded RNA or DNA. It is found in a wide variety of proteins including ribosomal proteins, transcription factors and post-transcriptional modifiers of mRNA.
cd PCBP_like_KH 67aa 3e-12
in ref transcript
cd KH-I 66aa 3e-09
in ref transcript
K homology RNA-binding domain, type I. KH binds single-stranded RNA or DNA. It is found in a wide variety of proteins including ribosomal proteins, transcription factors and post-transcriptional modifiers of mRNA. There are two different KH domains that belong to different protein folds, but they share a single KH motif. The KH motif is folded into a beta alpha alpha beta unit. In addition to the core, type II KH domains (e.g. ribosomal protein S3) include N-terminal extension and type I KH domains (e.g. hnRNP K) contain C-terminal extension.
pfam KH_1 64aa 2e-10
in ref transcript
KH domain. KH motifs can bind RNA in vitro. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia.
pfam KH_1 66aa 1e-09
in ref transcript
smart KH 70aa 3e-08
in ref transcript
K homology RNA-binding domain.
Changed!
COG Pnp 73aa 0.007
in ref transcript
Polyribonucleotide nucleotidyltransferase (polynucleotide phosphorylase) [Translation, ribosomal structure and biogenesis].
Changed!
cd PCBP_like_KH 66aa 1e-12
in modified transcript
Changed!
TIGR arCOG04150 160aa 0.005
in modified transcript
This family of proteins is universal among the 41 archaeal genomes analyzed in and is not observed outside of the archaea. The proteins contain a single KH domain (pfam00013) which is likely to confer the ability to bind RNA.
Changed!
COG COG1094 176aa 1e-04
in modified transcript
Predicted RNA-binding protein (contains KH domains) [General function prediction only].
NOXO1
rs.NOXO1.F1 rs.NOXO1.R1 100 115
NCBIGene 36.3 124056
Alternative 5-prime, size difference: 15
Exclusion in the protein (no frameshift)
Reference transcript: NM_172168
Changed!
cd PX_NoxO1 125aa 3e-45
in ref transcript
The phosphoinositide binding Phox Homology domain of Nox Organizing protein 1. The PX domain is a phosphoinositide (PI) binding module present in many proteins with diverse functions such as cell signaling, vesicular trafficking, protein sorting, and lipid modification, among others. Nox Organizing protein 1 (NoxO1) is a critical regulator of enzyme kinetics of the nonphagocytic NADPH oxidase Nox1, which catalyzes the transfer of electrons from NADPH to molecular oxygen to form superoxide. Nox1 is expressed in colon, stomach, uterus, prostate, and vascular smooth muscle cells. NoxO1, a homolog of the p47phox subunit of phagocytic NADPH oxidase, is involved in targeting activator subunits (such as NoxA1) to Nox1. It is co-localized with Nox1 in the membranes of resting cells and directs the subcellular localization of Nox1. The PX domain is involved in targeting of proteins to PI-enriched membranes, and may also be involved in protein-protein interaction. The PX domain of NoxO1 preferentially binds phosphatidylinositol-3,5-bisphosphate [PI(3,5)P2], PI5P, and PI4P.
cd SH3 50aa 4e-06
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
cd SH3 54aa 2e-04
in ref transcript
smart SH3 45aa 2e-06
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
Changed!
smart PX 101aa 2e-04
in ref transcript
PhoX homologous domain, present in p47phox and p40phox. Eukaryotic domain of unknown function present in phox proteins, PLD isoforms, a PI3K isoform.
smart SH3 33aa 0.001
in ref transcript
Changed!
cd PX_NoxO1 120aa 4e-47
in modified transcript
Changed!
smart PX 96aa 9e-06
in modified transcript
NPC1L1
rs.NPC1L1.F1 rs.NPC1L1.R1 298 379
NCBIGene 36.3 29881
Single exon skipping, size difference: 81
Exclusion in the protein (no frameshift)
Reference transcript: NM_013389
Changed!
TIGR 2A060601 1261aa 0.0
in ref transcript
The model describes Niemann-Pick C type protein in eukaryotes. The defective protein has been associated with Niemann-Pick disease which is described in humans as autosomal recessive lipidosis. It is characterized by the lysosomal accumulation of unestrified cholesterol. It is an integral membrane protein, which indicates that this protein is most likely involved in cholesterol transport or acts as some component of cholesterol homeostasis.
COG COG1033 119aa 5e-05
in ref transcript
Predicted exporters of the RND superfamily [General function prediction only].
Changed!
TIGR 2A060601 1234aa 0.0
in modified transcript
NQO1
rs.NQO1.F1 rs.NQO1.R1 104 206
NCBIGene 36.3 1728
Single exon skipping, size difference: 102
Exclusion in the protein (no frameshift)
Reference transcript: NM_000903
Changed!
pfam Flavodoxin_2 213aa 5e-56
in ref transcript
Flavodoxin-like fold. This family consists of a domain with a flavodoxin-like fold. The family includes bacterial and eukaryotic NAD(P)H dehydrogenase (quinone) EC:1.6.99.2. These enzymes catalyse the NAD(P)H-dependent two-electron reductions of quinones and protect cells against damage by free radicals and reactive oxygen species. This enzyme uses a FAD co-factor. The equation for this reaction is:- NAD(P)H + acceptor <=> NAD(P)(+) + reduced acceptor. This enzyme is also involved in the bioactivation of prodrugs used in chemotherapy. The family also includes acyl carrier protein phosphodiesterase EC:3.1.4.14. This enzyme converts holo-ACP to apo-ACP by hydrolytic cleavage of the phosphopantetheine residue from ACP. This family is related to pfam03358 and pfam00258.
Changed!
COG MdaB 212aa 7e-32
in ref transcript
Putative NADPH-quinone reductase (modulator of drug activity B) [General function prediction only].
Changed!
pfam Flavodoxin_2 179aa 2e-38
in modified transcript
Changed!
COG MdaB 178aa 2e-21
in modified transcript
NQO1
rs.NQO1.F2 rs.NQO1.R2 110 224
NCBIGene 36.3 1728
Single exon skipping, size difference: 114
Exclusion in the protein (no frameshift)
Reference transcript: NM_000903
Changed!
pfam Flavodoxin_2 213aa 5e-56
in ref transcript
Flavodoxin-like fold. This family consists of a domain with a flavodoxin-like fold. The family includes bacterial and eukaryotic NAD(P)H dehydrogenase (quinone) EC:1.6.99.2. These enzymes catalyse the NAD(P)H-dependent two-electron reductions of quinones and protect cells against damage by free radicals and reactive oxygen species. This enzyme uses a FAD co-factor. The equation for this reaction is:- NAD(P)H + acceptor <=> NAD(P)(+) + reduced acceptor. This enzyme is also involved in the bioactivation of prodrugs used in chemotherapy. The family also includes acyl carrier protein phosphodiesterase EC:3.1.4.14. This enzyme converts holo-ACP to apo-ACP by hydrolytic cleavage of the phosphopantetheine residue from ACP. This family is related to pfam03358 and pfam00258.
Changed!
COG MdaB 212aa 7e-32
in ref transcript
Putative NADPH-quinone reductase (modulator of drug activity B) [General function prediction only].
Changed!
pfam Flavodoxin_2 175aa 2e-34
in modified transcript
Changed!
COG MdaB 174aa 2e-13
in modified transcript
NR1I3
rs.NR1I3.F1 rs.NR1I3.R1 195 335
NCBIGene 36.3 9970
Single exon skipping, size difference: 140
Exclusion of the protein initiation site
Reference transcript: NM_001077482
cd NR_LBD_PXR_like 250aa 9e-91
in ref transcript
The ligand binding domain of xenobiotic receptors:pregnane X receptor and constitutive androstane receptor. The ligand binding domain of xenobiotic receptors: This xenobiotic receptor family includes pregnane X receptor (PXR), constitutive androstane receptor (CAR) and other related nuclear receptors. They function as sensors of toxic byproducts of cell metabolism and of exogenous chemicals, to facilitate their elimination. The nuclear receptor pregnane X receptor (PXR) is a ligand-regulated transcription factor that responds to a diverse array of chemically distinct ligands, including many endogenous compounds and clinical drugs. The ligand binding domain of PXR shows remarkable flexibility to accommodate both large and small molecules. PXR functions as a heterodimer with retinoic X receptor-alpha (RXRa) and binds to a variety of response elements in the promoter regions of a diverse set of target genes involved in the metabolism, transport, and elimination of these molecules from the cell. Constitutive androstane receptor (CAR) is a closest mammalian relative of PXR, which has also been proposed to function as a xenosensor. CAR is activated by some of the same ligands as PXR and regulates a subset of common genes. The sequence homology and functional similarity suggests that the CAR gene arose from a duplication of an ancestral PXR gene. Like other nuclear receptors, xenobiotic receptors have a central well conserved DNA binding domain (DBD), a variable N-terminal domain, a flexible hinge and a C-terminal ligand binding domain (LBD).
Changed!
pfam zf-C4 71aa 2e-29
in ref transcript
Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other.
pfam Hormone_recep 190aa 5e-23
in ref transcript
Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors.
Changed!
pfam zf-C4 46aa 9e-13
in modified transcript
NR1I3
rs.NR1I3.F2 rs.NR1I3.R2 105 117
NCBIGene 36.3 9970
Alternative 3-prime, size difference: 12
Exclusion in the protein (no frameshift)
Reference transcript: NM_001077482
Changed!
cd NR_LBD_PXR_like 250aa 9e-91
in ref transcript
The ligand binding domain of xenobiotic receptors:pregnane X receptor and constitutive androstane receptor. The ligand binding domain of xenobiotic receptors: This xenobiotic receptor family includes pregnane X receptor (PXR), constitutive androstane receptor (CAR) and other related nuclear receptors. They function as sensors of toxic byproducts of cell metabolism and of exogenous chemicals, to facilitate their elimination. The nuclear receptor pregnane X receptor (PXR) is a ligand-regulated transcription factor that responds to a diverse array of chemically distinct ligands, including many endogenous compounds and clinical drugs. The ligand binding domain of PXR shows remarkable flexibility to accommodate both large and small molecules. PXR functions as a heterodimer with retinoic X receptor-alpha (RXRa) and binds to a variety of response elements in the promoter regions of a diverse set of target genes involved in the metabolism, transport, and elimination of these molecules from the cell. Constitutive androstane receptor (CAR) is a closest mammalian relative of PXR, which has also been proposed to function as a xenosensor. CAR is activated by some of the same ligands as PXR and regulates a subset of common genes. The sequence homology and functional similarity suggests that the CAR gene arose from a duplication of an ancestral PXR gene. Like other nuclear receptors, xenobiotic receptors have a central well conserved DNA binding domain (DBD), a variable N-terminal domain, a flexible hinge and a C-terminal ligand binding domain (LBD).
pfam zf-C4 71aa 2e-29
in ref transcript
Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other.
Changed!
pfam Hormone_recep 190aa 5e-23
in ref transcript
Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors.
Changed!
cd NR_LBD_PXR_like 246aa 2e-92
in modified transcript
Changed!
pfam Hormone_recep 186aa 1e-22
in modified transcript
NR1I3
rs.NR1I3.F3 rs.NR1I3.R3 103 257
NCBIGene 36.3 9970
Alternative 3-prime, size difference: 154
Exclusion of the stop codon
Reference transcript: NM_001077482
Changed!
cd NR_LBD_PXR_like 250aa 9e-91
in ref transcript
The ligand binding domain of xenobiotic receptors:pregnane X receptor and constitutive androstane receptor. The ligand binding domain of xenobiotic receptors: This xenobiotic receptor family includes pregnane X receptor (PXR), constitutive androstane receptor (CAR) and other related nuclear receptors. They function as sensors of toxic byproducts of cell metabolism and of exogenous chemicals, to facilitate their elimination. The nuclear receptor pregnane X receptor (PXR) is a ligand-regulated transcription factor that responds to a diverse array of chemically distinct ligands, including many endogenous compounds and clinical drugs. The ligand binding domain of PXR shows remarkable flexibility to accommodate both large and small molecules. PXR functions as a heterodimer with retinoic X receptor-alpha (RXRa) and binds to a variety of response elements in the promoter regions of a diverse set of target genes involved in the metabolism, transport, and elimination of these molecules from the cell. Constitutive androstane receptor (CAR) is a closest mammalian relative of PXR, which has also been proposed to function as a xenosensor. CAR is activated by some of the same ligands as PXR and regulates a subset of common genes. The sequence homology and functional similarity suggests that the CAR gene arose from a duplication of an ancestral PXR gene. Like other nuclear receptors, xenobiotic receptors have a central well conserved DNA binding domain (DBD), a variable N-terminal domain, a flexible hinge and a C-terminal ligand binding domain (LBD).
pfam zf-C4 71aa 2e-29
in ref transcript
Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other.
Changed!
pfam Hormone_recep 190aa 5e-23
in ref transcript
Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors.
Changed!
cd NR_LBD_PXR_like 210aa 1e-81
in modified transcript
Changed!
pfam Hormone_recep 149aa 1e-23
in modified transcript
NR1I3
rs.NR1I3.F4 rs.NR1I3.R4 100 115
NCBIGene 36.3 9970
Alternative 3-prime, size difference: 15
Exclusion in the protein (no frameshift)
Reference transcript: NM_001077482
Changed!
cd NR_LBD_PXR_like 250aa 9e-91
in ref transcript
The ligand binding domain of xenobiotic receptors:pregnane X receptor and constitutive androstane receptor. The ligand binding domain of xenobiotic receptors: This xenobiotic receptor family includes pregnane X receptor (PXR), constitutive androstane receptor (CAR) and other related nuclear receptors. They function as sensors of toxic byproducts of cell metabolism and of exogenous chemicals, to facilitate their elimination. The nuclear receptor pregnane X receptor (PXR) is a ligand-regulated transcription factor that responds to a diverse array of chemically distinct ligands, including many endogenous compounds and clinical drugs. The ligand binding domain of PXR shows remarkable flexibility to accommodate both large and small molecules. PXR functions as a heterodimer with retinoic X receptor-alpha (RXRa) and binds to a variety of response elements in the promoter regions of a diverse set of target genes involved in the metabolism, transport, and elimination of these molecules from the cell. Constitutive androstane receptor (CAR) is a closest mammalian relative of PXR, which has also been proposed to function as a xenosensor. CAR is activated by some of the same ligands as PXR and regulates a subset of common genes. The sequence homology and functional similarity suggests that the CAR gene arose from a duplication of an ancestral PXR gene. Like other nuclear receptors, xenobiotic receptors have a central well conserved DNA binding domain (DBD), a variable N-terminal domain, a flexible hinge and a C-terminal ligand binding domain (LBD).
pfam zf-C4 71aa 2e-29
in ref transcript
Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other.
Changed!
pfam Hormone_recep 190aa 5e-23
in ref transcript
Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors.
Changed!
cd NR_LBD_PXR_like 245aa 1e-92
in modified transcript
Changed!
pfam Hormone_recep 185aa 9e-25
in modified transcript
NR6A1
rs.NR6A1.F1 rs.NR6A1.R1 106 118
NCBIGene 36.3 2649
Alternative 3-prime, size difference: 12
Exclusion in the protein (no frameshift)
Reference transcript: NM_033334
cd NR_LBD_DHR4_like 212aa 1e-103
in ref transcript
The ligand binding domain of orphan nuclear receptor Ecdysone-induced receptor DHR4. The ligand binding domain of Ecdysone-induced receptor DHR4: Ecdysone-induced orphan receptor DHR4 is a member of the nuclear receptor family. DHR4 is expressed during the early Drosophila larval development and is induced by ecdysone. DHR4 coordinates growth and maturation in Drosophila by mediating endocrine response to the attainment of proper body size during larval development. Mutations in DHR4 result in shorter larval development which translates into smaller and lighter flies. Like other members of the nuclear receptor (NR) superfamily of ligand-activated transcription factors, DHR4 has a central well conserved DNA binding domain (DBD), a variable N-terminal domain, a flexible hinge and a C-terminal ligand binding domain (LBD).
pfam zf-C4 76aa 3e-32
in ref transcript
Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other.
pfam Hormone_recep 165aa 1e-26
in ref transcript
Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors.
NRD1
rs.NRD1.F1 rs.NRD1.R1 339 543
NCBIGene 36.3 4898
Multiple exon skipping, size difference: 204
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_002525
Changed!
pfam Peptidase_M16 127aa 4e-37
in ref transcript
Insulinase (Peptidase family M16).
pfam Peptidase_M16_C 176aa 3e-20
in ref transcript
Peptidase M16 inactive domain. Peptidase M16 consists of two structurally related domains. One is the active peptidase, whereas the other is inactive. The two domains hold the substrate like a clamp.
pfam Peptidase_M16_C 183aa 1e-08
in ref transcript
COG Ptr 879aa 1e-148
in ref transcript
Secreted/periplasmic Zn-dependent peptidases, insulinase-like [Posttranslational modification, protein turnover, chaperones].
Changed!
pfam Peptidase_M16 139aa 3e-38
in modified transcript
NRG1
rs.NRG1.F1 rs.NRG1.R1 191 293
NCBIGene 36.3 3084
Multiple exon skipping, size difference: 102
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_013956
cd IGcam 89aa 2e-05
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
pfam Neuregulin 396aa 1e-109
in ref transcript
Neuregulin family.
smart IG_like 87aa 1e-07
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
pfam EGF 32aa 0.006
in ref transcript
EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.
NRG1
rs.NRG1.F2 rs.NRG1.R2 251 393
NCBIGene 36.3 3084
Single exon skipping, size difference: 142
Inclusion in the protein causing a new stop codon
Reference transcript: NM_013956
cd IGcam 89aa 2e-05
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
Changed!
pfam Neuregulin 396aa 1e-109
in ref transcript
Neuregulin family.
smart IG_like 87aa 1e-07
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
Changed!
pfam EGF 32aa 0.006
in ref transcript
EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.
Changed!
pfam Neuregulin 190aa 2e-57
in modified transcript
NRG1
rs.NRG1.F3 rs.NRG1.R3 119 143
NCBIGene 36.3 3084
Single exon skipping, size difference: 24
Exclusion in the protein (no frameshift)
Reference transcript: NM_013956
cd IGcam 89aa 2e-05
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
pfam Neuregulin 396aa 1e-109
in ref transcript
Neuregulin family.
smart IG_like 87aa 1e-07
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
pfam EGF 32aa 0.006
in ref transcript
EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.
NRG2
rs.NRG2.F1 rs.NRG2.R1 127 151
NCBIGene 36.3 9542
Single exon skipping, size difference: 24
Exclusion in the protein (no frameshift)
Reference transcript: NM_013982
cd IGcam 88aa 2e-11
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
pfam Neuregulin 321aa 1e-39
in ref transcript
Neuregulin family.
pfam I-set 90aa 3e-15
in ref transcript
Immunoglobulin I-set domain.
NRG2
rs.NRG2.F2 rs.NRG2.R2 110 134
NCBIGene 36.3 9542
Single exon skipping, size difference: 24
Exclusion in the protein (no frameshift)
Reference transcript: NM_013983
cd IGcam 88aa 2e-11
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
pfam Neuregulin 321aa 8e-40
in ref transcript
Neuregulin family.
pfam I-set 90aa 3e-15
in ref transcript
Immunoglobulin I-set domain.
NRP1
rs.NRP1.F1 rs.NRP1.R1 112 217
NCBIGene 36.3 8829
Single exon skipping, size difference: 105
Exclusion in the protein (no frameshift)
Reference transcript: NM_003873
cd MAM 154aa 2e-45
in ref transcript
Meprin, A5 protein, and protein tyrosine phosphatase Mu (MAM) domain. MAM is an extracellular domain which mediates protein-protein interactions and is found in a diverse set of proteins, many of which are known to function in cell adhesion. Members include: type IIB receptor protein tyrosine phosphatases (such as RPTPmu), meprins (plasma membrane metalloproteases), neuropilins (receptors of secreted semaphorins), and zonadhesins (sperm-specific membrane proteins which bind to the extracellular matrix of the egg). In meprin A and neuropilin-1 and -2, MAM is involved in homo-oligomerization. In RPTPmu, it has been associated with both homophilic adhesive (trans) interactions and lateral (cis) receptor oligomerization. In a GPI-anchored protein that is expressed in cells in the embryonic chicken spinal chord, MDGA1, the MAM domain has been linked to heterophilic interactions with axon-rich region.
cd FA58C 147aa 1e-36
in ref transcript
Coagulation factor 5/8 C-terminal domain, discoidin domain; Cell surface-attached carbohydrate-binding domain, present in eukaryotes and assumed to have horizontally transferred to eubacterial genomes.
cd CUB 114aa 5e-29
in ref transcript
CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
cd CUB 118aa 1e-26
in ref transcript
cd FA58C 148aa 2e-23
in ref transcript
smart MAM 160aa 2e-46
in ref transcript
Domain in meprin, A5, receptor protein tyrosine phosphatase mu (and others). Likely to have an adhesive function. Mutations in the meprin MAM domain affect noncovalent associations within meprin oligomers. In receptor tyrosine phosphatase mu-like molecules the MAM domain is important for homophilic cell-cell interactions.
pfam CUB 116aa 5e-33
in ref transcript
CUB domain.
smart FA58C 151aa 9e-33
in ref transcript
Coagulation factor 5/8 C-terminal domain, discoidin domain. Cell surface-attached carbohydrate-binding domain, present in eukaryotes and assumed to have horizontally transferred to eubacterial genomes.
pfam CUB 112aa 2e-31
in ref transcript
pfam F5_F8_type_C 135aa 1e-18
in ref transcript
F5/8 type C domain. This domain is also known as the discoidin (DS) domain family.
NRP2
rs.NRP2.F1 rs.NRP2.R1 100 115
NCBIGene 36.3 8828
Alternative 5-prime, size difference: 15
Exclusion in the protein (no frameshift)
Reference transcript: NM_018534
cd FA58C 147aa 4e-39
in ref transcript
Coagulation factor 5/8 C-terminal domain, discoidin domain; Cell surface-attached carbohydrate-binding domain, present in eukaryotes and assumed to have horizontally transferred to eubacterial genomes.
cd MAM 148aa 3e-36
in ref transcript
Meprin, A5 protein, and protein tyrosine phosphatase Mu (MAM) domain. MAM is an extracellular domain which mediates protein-protein interactions and is found in a diverse set of proteins, many of which are known to function in cell adhesion. Members include: type IIB receptor protein tyrosine phosphatases (such as RPTPmu), meprins (plasma membrane metalloproteases), neuropilins (receptors of secreted semaphorins), and zonadhesins (sperm-specific membrane proteins which bind to the extracellular matrix of the egg). In meprin A and neuropilin-1 and -2, MAM is involved in homo-oligomerization. In RPTPmu, it has been associated with both homophilic adhesive (trans) interactions and lateral (cis) receptor oligomerization. In a GPI-anchored protein that is expressed in cells in the embryonic chicken spinal chord, MDGA1, the MAM domain has been linked to heterophilic interactions with axon-rich region.
cd CUB 114aa 4e-31
in ref transcript
CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
cd CUB 117aa 3e-27
in ref transcript
cd FA58C 154aa 2e-24
in ref transcript
smart MAM 154aa 9e-40
in ref transcript
Domain in meprin, A5, receptor protein tyrosine phosphatase mu (and others). Likely to have an adhesive function. Mutations in the meprin MAM domain affect noncovalent associations within meprin oligomers. In receptor tyrosine phosphatase mu-like molecules the MAM domain is important for homophilic cell-cell interactions.
smart FA58C 151aa 4e-34
in ref transcript
Coagulation factor 5/8 C-terminal domain, discoidin domain. Cell surface-attached carbohydrate-binding domain, present in eukaryotes and assumed to have horizontally transferred to eubacterial genomes.
pfam CUB 112aa 6e-31
in ref transcript
CUB domain.
pfam CUB 116aa 3e-29
in ref transcript
pfam F5_F8_type_C 141aa 3e-21
in ref transcript
F5/8 type C domain. This domain is also known as the discoidin (DS) domain family.
NRP2
rs.NRP2.F2 rs.NRP2.R2 100 115
NCBIGene 36.3 8828
Alternative 5-prime, size difference: 15
Exclusion in the protein (no frameshift)
Reference transcript: NM_201266
cd FA58C 147aa 2e-39
in ref transcript
Coagulation factor 5/8 C-terminal domain, discoidin domain; Cell surface-attached carbohydrate-binding domain, present in eukaryotes and assumed to have horizontally transferred to eubacterial genomes.
cd MAM 148aa 4e-36
in ref transcript
Meprin, A5 protein, and protein tyrosine phosphatase Mu (MAM) domain. MAM is an extracellular domain which mediates protein-protein interactions and is found in a diverse set of proteins, many of which are known to function in cell adhesion. Members include: type IIB receptor protein tyrosine phosphatases (such as RPTPmu), meprins (plasma membrane metalloproteases), neuropilins (receptors of secreted semaphorins), and zonadhesins (sperm-specific membrane proteins which bind to the extracellular matrix of the egg). In meprin A and neuropilin-1 and -2, MAM is involved in homo-oligomerization. In RPTPmu, it has been associated with both homophilic adhesive (trans) interactions and lateral (cis) receptor oligomerization. In a GPI-anchored protein that is expressed in cells in the embryonic chicken spinal chord, MDGA1, the MAM domain has been linked to heterophilic interactions with axon-rich region.
cd CUB 114aa 5e-31
in ref transcript
CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
cd CUB 117aa 4e-27
in ref transcript
cd FA58C 154aa 8e-25
in ref transcript
smart MAM 154aa 8e-40
in ref transcript
Domain in meprin, A5, receptor protein tyrosine phosphatase mu (and others). Likely to have an adhesive function. Mutations in the meprin MAM domain affect noncovalent associations within meprin oligomers. In receptor tyrosine phosphatase mu-like molecules the MAM domain is important for homophilic cell-cell interactions.
smart FA58C 151aa 2e-34
in ref transcript
Coagulation factor 5/8 C-terminal domain, discoidin domain. Cell surface-attached carbohydrate-binding domain, present in eukaryotes and assumed to have horizontally transferred to eubacterial genomes.
pfam CUB 112aa 7e-31
in ref transcript
CUB domain.
pfam CUB 116aa 4e-29
in ref transcript
pfam F5_F8_type_C 141aa 2e-21
in ref transcript
F5/8 type C domain. This domain is also known as the discoidin (DS) domain family.
NRXN3
rs.NRXN3.F1 rs.NRXN3.R1 143 233
NCBIGene 36.3 9369
Single exon skipping, size difference: 90
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_004796
cd LamG 154aa 2e-23
in ref transcript
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
cd LamG 170aa 2e-22
in ref transcript
Changed!
cd LamG 150aa 3e-18
in ref transcript
cd LamG 156aa 2e-12
in ref transcript
cd EGF_CA 34aa 3e-04
in ref transcript
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
smart LamG 150aa 5e-28
in ref transcript
Laminin G domain.
smart LamG 137aa 2e-26
in ref transcript
Changed!
smart LamG 127aa 2e-20
in ref transcript
smart LamG 136aa 4e-19
in ref transcript
smart LamG 40aa 0.001
in ref transcript
pfam EGF 33aa 0.008
in ref transcript
EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.
Changed!
cd LamG 120aa 3e-15
in modified transcript
Changed!
smart LamG 157aa 7e-17
in modified transcript
NSL1
rs.NSL1.F1 rs.NSL1.R1 327 412
NCBIGene 36.3 25936
Single exon skipping, size difference: 85
Inclusion in the protein causing a frameshift
Reference transcript: NM_015471
Changed!
pfam Mis14 145aa 5e-30
in ref transcript
Kinetochore protein Mis14 like. Mis14 is a kinetochore protein which is known to be recruited to kinetochores independently of CENP-A.
Changed!
pfam Mis14 100aa 1e-19
in modified transcript
NTRK1
rs.NTRK1.F1 NTRK1.R1 100 118
NCBIGene 36.3 4914
Single exon skipping, size difference: 18
Exclusion in the protein (no frameshift)
Reference transcript: NM_002529
cd PTKc_TrkA 280aa 1e-164
in ref transcript
Catalytic Domain of the Protein Tyrosine Kinase, Tropomyosin Related Kinase A. Protein Tyrosine Kinase (PTK) family; Tropomyosin related kinase A (TrkA); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. TrkA is a member of the Trk subfamily of proteins, which are receptor tyr kinases (RTKs) containing an extracellular region with arrays of leucine-rich motifs flanked by two cysteine-rich clusters followed by two immunoglobulin-like domains, a transmembrane segment, and an intracellular catalytic domain. Binding of TrkA to its ligand, nerve growth factor (NGF), results in receptor oligomerization and activation of the catalytic domain. TrkA is expressed mainly in neural-crest-derived sensory and sympathetic neurons of the peripheral nervous system, and in basal forebrain cholinergic neurons of the central nervous system. It is critical for neuronal growth, differentiation and survival. Alternative TrkA splicing has been implicated as a pivotal regulator of neuroblastoma (NB) behavior. Normal TrkA expression is associated with better NB prognosis, while the hypoxia-regulated TrkAIII splice variant promotes NB pathogenesis and progression. Aberrant TrkA expression has also been demonstrated in non-neural tumors including prostate, breast, lung, and pancreatic cancers.
cd IGcam 71aa 0.002
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
pfam Pkinase_Tyr 272aa 1e-104
in ref transcript
Protein tyrosine kinase.
TIGR PCC 67aa 3e-05
in ref transcript
Note: this model is restricted to the amino half because a full-length model is incompatible with the HMM software package.
pfam I-set 85aa 4e-04
in ref transcript
Immunoglobulin I-set domain.
pfam I-set 70aa 0.002
in ref transcript
COG SPS1 265aa 1e-23
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
NUMB
rs.NUMB.F1 rs.NUMB.R1 104 137
NCBIGene 36.3 8650
Single exon skipping, size difference: 33
Exclusion in the protein (no frameshift)
Reference transcript: NM_001005743
Changed!
cd Numb 149aa 2e-72
in ref transcript
Numb Phosphotyrosine-binding (PTB) domain. Numb is a membrane associated adaptor protein, which is a determinant of asymmetric cell division. Numb has an N-terminal PTB domain. PTB domains have a PH-like fold and are found in various eukaryotic signaling molecules. They were initially identified based upon their ability to recognize phosphorylated tyrosine residues. In contrast to SH2 domains, which recognize phosphotyrosine and adjacent carboxy-terminal residues, PTB-domain binding specificity is conferred by residues amino-terminal to the phosphotyrosine. More recent studies have found that some types of PTB domains can bind to peptides which are not tyrosine phosphorylated or lack tyrosine residues altogether.
Changed!
pfam PID 134aa 7e-34
in ref transcript
Phosphotyrosine interaction domain (PTB/PID).
pfam NumbF 82aa 8e-32
in ref transcript
NUMB domain. This presumed domain is found in the Numb family of proteins adjacent to the PTB domain.
Changed!
cd Numb 138aa 5e-75
in modified transcript
Changed!
pfam PID 123aa 3e-33
in modified transcript
NUP50
rs.NUP50.F1 rs.NUP50.R1 123 313
NCBIGene 36.3 10762
Single exon skipping, size difference: 190
Inclusion in the protein causing a new stop codon
Reference transcript: NM_007172
Changed!
cd RanBD 112aa 2e-16
in ref transcript
Ran-binding domain; This domain of approximately 150 residues shares structural similarity to the PH domain, but lacks detectable sequence similarity. Ran is a Ras-like nuclear small GTPase, which regulates receptor-mediated transport between the nucleus and the cytoplasm. RanGTP hydrolysis is stimulated by RanGAP together with the Ran-binding domain containing acessory proteins RanBP1 and RanBP2. These accessory proteins stabilize the active GTP-bound form of Ran . The Ran-binding domain is found in multiple copies in Nuclear pore complex proteins.
Changed!
pfam NUP50 303aa 5e-71
in ref transcript
NUP50 (Nucleoporin 50 kDa). Nucleoporin 50 kDa (NUP50) acts as a cofactor for the importin-alpha:importin-beta heterodimer, which in turn allows for transportation of many nuclear-targeted proteins through nuclear pore complexes. The C terminus of NUP50 binds importin-beta through RAN-GTP, the N terminus binds the C terminus of importin-alpha, while a central domain binds importin-beta. NUP50:importin-alpha:importin-beta then binds cargo and can stimulate nuclear import. The N-terminal domain of NUP50 is also able to actively displace nuclear localisation signals from importin-alpha.
Changed!
pfam Ran_BP1 110aa 3e-05
in ref transcript
RanBP1 domain.
Changed!
pfam NUP50 23aa 1e-07
in modified transcript
NUP98
rs.NUP98.F1 rs.NUP98.R1 143 365
NCBIGene 36.3 4928
Single exon skipping, size difference: 222
Exclusion in the protein (no frameshift)
Reference transcript: NM_016320
pfam Nucleoporin2 156aa 2e-63
in ref transcript
Nucleoporin autopeptidase.
NUP98
rs.NUP98.F2 rs.NUP98.R2 256 307
NCBIGene 36.3 4928
Alternative 5-prime, size difference: 51
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_016320
pfam Nucleoporin2 156aa 2e-63
in ref transcript
Nucleoporin autopeptidase.
NUPR1
rs.NUPR1.F1 rs.NUPR1.R1 231 285
NCBIGene 36.3 26471
Alternative 3-prime, size difference: 54
Exclusion in the protein (no frameshift)
Reference transcript: NM_001042483
pfam Phospho_p8 29aa 8e-10
in ref transcript
DNA-binding nuclear phosphoprotein p8. P8 is a short 80-82 amino acid protein that is conserved from nematodes to humans. It carries at least one protein kinase C domain suggesting a possible role in signal transduction and it is thought to be a phosphoprotein, but the sites of phosphorylation and the kinases involved remain to be determined.
NYD-SP21
rs.NYD-SP21.F1 rs.NYD-SP21.R1 248 299
NCBIGene 36.3 84689
Single exon skipping, size difference: 51
Exclusion in the protein (no frameshift)
Reference transcript: NM_032597
Changed!
pfam CD20 127aa 1e-33
in ref transcript
CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a single beta subunit, and two disulfide-linked gamma subunits. The alpha subunit of Fc epsilon RI and most Fc receptors are homologous members of the Ig superfamily. By contrast, the beta and gamma subunits from Fc epsilon RI are not homologous to the Ig superfamily. Both molecules have four putative transmembrane segments and a probably topology where both amino- and carboxy termini protrude into the cytoplasm.
Changed!
pfam CD20 110aa 1e-24
in modified transcript
OCIAD1
rs.OCIAD1.F1 rs.OCIAD1.R1 153 400
NCBIGene 36.3 54940
Alternative 5-prime, size difference: 247
Exclusion in 5'UTR
Reference transcript: NM_001079839
pfam OCIA 108aa 4e-48
in ref transcript
Ovarian carcinoma immunoreactive antigen (OCIA). This family consists of several ovarian carcinoma immunoreactive antigen (OCIA) and related eukaryotic sequences. The function of this family is unknown.
OGG1
rs.OGG1.F1 rs.OGG1.R1 98 145
NCBIGene 36.3 4968
Alternative 3-prime, size difference: 47
Exclusion in the protein causing a frameshift
Reference transcript: NM_016828
Changed!
cd ENDO3c 179aa 3e-18
in ref transcript
endonuclease III; includes endonuclease III (DNA-(apurinic or apyrimidinic site) lyase), alkylbase DNA glycosidases (Alka-family) and other DNA glycosidases.
TIGR ogg 306aa 1e-137
in ref transcript
All proteins in this family for which functions are known are 8-oxo-guanaine DNA glycosylases that function in base excision repair. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). This family is distantly realted to the Nth-MutY superfamily.
COG AlkA 229aa 3e-25
in ref transcript
3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase [DNA replication, recombination, and repair].
Changed!
cd ENDO3c 179aa 2e-17
in modified transcript
OGN
rs.OGN.F1 rs.OGN.R1 122 404
NCBIGene 36.3 4969
Alternative 3-prime, size difference: 282
Inclusion in 5'UTR
Reference transcript: NM_014057
OPA1
rs.OPA1.F1 rs.OPA1.R1 100 208
NCBIGene 36.3 4976
Single exon skipping, size difference: 108
Exclusion in the protein (no frameshift)
Reference transcript: NM_130837
pfam Dynamin_N 179aa 4e-38
in ref transcript
Dynamin family.
OPA1
rs.OPA1.F2 rs.OPA1.R2 105 216
NCBIGene 36.3 4976
Single exon skipping, size difference: 111
Exclusion in the protein (no frameshift)
Reference transcript: NM_130837
pfam Dynamin_N 179aa 4e-38
in ref transcript
Dynamin family.
OPTN
rs.OPTN.F1 rs.OPTN.R1 101 116
NCBIGene 36.3 10133
Alternative 3-prime, size difference: 15
Inclusion in 5'UTR
Reference transcript: NM_001008213
pfam SMC_N 178aa 0.001
in ref transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
PRK PRK05771 248aa 0.010
in ref transcript
V-type ATP synthase subunit I; Validated.
OSBPL2
rs.OSBPL2.F1 rs.OSBPL2.R1 121 157
NCBIGene 36.3 9885
Alternative 3-prime, size difference: 36
Exclusion in the protein (no frameshift)
Reference transcript: NM_144498
pfam Oxysterol_BP 351aa 4e-64
in ref transcript
Oxysterol-binding protein.
OSBPL3
rs.OSBPL3.F1 rs.OSBPL3.R1 148 256
NCBIGene 36.3 26031
Single exon skipping, size difference: 108
Exclusion in the protein (no frameshift)
Reference transcript: NM_015550
cd PH_oxysterol_bp 91aa 2e-18
in ref transcript
Oxysterol binding protein (OSBP) Pleckstrin homology (PH) domain. Oxysterol binding proteins are a multigene family that is conserved in yeast, flies, worms, mammals and plants. They all contain a C-terminal oxysterol binding domain, and most contain an N-terminal PH domain. OSBP PH domains bind to membrane phosphoinositides and thus likely play an important role in intracellular targeting. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
pfam Oxysterol_BP 300aa 2e-56
in ref transcript
Oxysterol-binding protein.
smart PH 91aa 3e-09
in ref transcript
Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
OSBPL9
rs.OSBPL9.F1 rs.OSBPL9.R1 102 171
NCBIGene 36.3 114883
Single exon skipping, size difference: 69
Exclusion in the protein (no frameshift)
Reference transcript: NM_148909
cd PH_oxysterol_bp 93aa 1e-28
in ref transcript
Oxysterol binding protein (OSBP) Pleckstrin homology (PH) domain. Oxysterol binding proteins are a multigene family that is conserved in yeast, flies, worms, mammals and plants. They all contain a C-terminal oxysterol binding domain, and most contain an N-terminal PH domain. OSBP PH domains bind to membrane phosphoinositides and thus likely play an important role in intracellular targeting. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
pfam Oxysterol_BP 342aa 2e-49
in ref transcript
Oxysterol-binding protein.
smart PH 95aa 6e-13
in ref transcript
Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
OSCAR
rs.OSCAR.F1 rs.OSCAR.R1 105 117
NCBIGene 36.3 126014
Single exon skipping, size difference: 12
Exclusion in the protein (no frameshift)
Reference transcript: NM_206818
PACRG
rs.PACRG.F1 rs.PACRG.R1 276 393
NCBIGene 36.3 135138
Single exon skipping, size difference: 117
Exclusion in the protein (no frameshift)
Reference transcript: NM_152410
Changed!
pfam ParcG 223aa 2e-83
in ref transcript
Parkin co-regulated protein. This family of proteins is transcribed anti-sense along the DNA to the Parkin gene product and the two appear to be transcribed under the same promoter. The protein has predicted alpha-helical and beta-sheet domains which suggest its function is in the ubiquitin/proteasome system. Mutations in parkin are the genetic cause of early-onset and autosomal recessive juvenile parkinsonism.
Changed!
pfam ParcG 184aa 9e-89
in modified transcript
PACS2
rs.PACS2.F1 rs.PACS2.R1 102 114
NCBIGene 36.3 23241
Alternative 5-prime, size difference: 12
Exclusion in the protein (no frameshift)
Reference transcript: NM_001100913
pfam Pacs-1 213aa 3e-93
in ref transcript
PACS-1 cytosolic sorting protein. PACS-1 is a cytosolic sorting protein that directs the localisation of membrane proteins in the trans-Golgi network (TGN)/endosomal system. PACS-1 connects the clathrin adaptor AP-1 to acidic cluster sorting motifs contained in the cytoplasmic domain of cargo proteins such as furin, the cation-independent mannose-6-phosphate receptor and in viral proteins such as human immunodeficiency virus type 1 Nef.
pfam Pacs-1 149aa 3e-61
in ref transcript
PAICS
rs.PAICS.F1 rs.PAICS.R1 110 131
NCBIGene 36.3 10606
Single exon skipping, size difference: 21
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_006452
Changed!
cd SAICAR_synt_Ade5 252aa 1e-142
in ref transcript
Ade5_like 5-aminoimidazole-4-(N-succinylcarboxamide) ribonucleotide (SAICAR) synthase. Eukaryotic group of SAICAR synthetases represented by the Drosophila melanogaster, N-terminal, SAICAR synthetase domain of bifunctional Ade5. The Ade5 gene product (CAIR-SAICARs) catalyzes the sixth and seventh steps of the de novo biosynthesis of purine nucleotides (also reported as seventh and eighth steps). SAICAR synthetase converts 5-aminoimidazole-4-carboxyribonucleotide (CAIR), ATP, and L-aspartate into 5-aminoimidazole-4-(N-succinylcarboxamide) ribonucleotide (SAICAR), ADP, and phosphate.
Changed!
pfam SAICAR_synt 245aa 8e-92
in ref transcript
SAICAR synthetase. Also known as Phosphoribosylaminoimidazole-succinocarboxamide synthase.
TIGR purE 154aa 1e-57
in ref transcript
Phosphoribosylaminoimidazole carboxylase is a fusion protein in plants and fungi, but consists of two non-interacting proteins in bacteria, PurK and PurE. This model represents PurK, an N5-CAIR mutase.
Changed!
PRK PRK09362 252aa 5e-54
in ref transcript
Phosphoribosylcarboxyaminoimidazole (NCAIR) mutase [Nucleotide transport and metabolism].
Changed!
cd SAICAR_synt_Ade5 259aa 1e-139
in modified transcript
Changed!
pfam SAICAR_synt 252aa 5e-89
in modified transcript
Changed!
PRK PRK09362 259aa 1e-50
in modified transcript
PAK4
rs.PAK4.F1 rs.PAK4.R1 91 550
NCBIGene 36.3 10298
Single exon skipping, size difference: 459
Exclusion in the protein (no frameshift)
Reference transcript: NM_005884
cd STKc_PAK_II 285aa 1e-173
in ref transcript
Serine/threonine kinases (STKs), p21-activated kinase (PAK) subfamily, Group II, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The PAK subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. PAKs are Rho family GTPase-regulated kinases that serve as important mediators in the function of Cdc42 (cell division cycle 42) and Rac. PAKs from higher eukaryotes are classified into two groups (I and II), according to their biochemical and structural features. Group II PAKs, also called non-conventional PAKs, include PAK4, PAK5, and PAK6. Group II PAKs contain PBD (p21-binding domain) and catalytic domains, but lack other motifs found in group I PAKs, such as an AID (autoinhibitory domain) and SH3 binding sites. Since group II PAKs do not contain an obvious AID, they may be regulated differently from group I PAKs. While group I PAKs interact with the SH3 containing proteins Nck, Grb2 and PIX, no such binding has been demonstrated for group II PAKs. Some known substrates of group II PAKs are also substrates of group I PAKs such as Raf, BAD, LIMK and GEFH1. Unique group II substrates include MARK/Par-1 and PDZ-RhoGEF. Group II PAKs play important roles in filopodia formation, neuron extension, cytoskeletal organization, and cell survival.
cd CRIB_PAK_like 39aa 4e-11
in ref transcript
PAK (p21 activated kinase) Binding Domain (PBD), binds Cdc42p- and/or Rho-like small GTPases; also known as the Cdc42/Rac interactive binding (CRIB) motif; has been shown to inhibit transcriptional activation and cell transformation mediated by the Ras-Rac pathway. This subgroup of CRIB/PBD-domains is found N-terminal of Serine/Threonine kinase domains in PAK and PAK-like proteins.
smart S_TKc 242aa 5e-76
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
pfam PBD 55aa 1e-11
in ref transcript
P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB).
COG SPS1 248aa 2e-35
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
PAP2D
rs.PAP2D.F1 rs.PAP2D.R1 113 128
NCBIGene 36.3 163404
Alternative 5-prime, size difference: 15
Exclusion in the protein (no frameshift)
Reference transcript: NM_001037317
cd PAP2_wunen 150aa 4e-52
in ref transcript
PAP2, wunen subfamily. Most likely a family of membrane associated phosphatidic acid phosphatases. Wunen is a drosophila protein expressed in the central nervous system, which provides repellent activity towards primordial germ cells (PGCs), controls the survival of PGCs and is essential in the migration process of these cells towards the somatic gonadal precursors.
smart acidPPc 91aa 8e-11
in ref transcript
Acid phosphatase homologues.
PAPSS2
rs.PAPSS2.F1 rs.PAPSS2.R1 100 115
NCBIGene 36.3 9060
Single exon skipping, size difference: 15
Exclusion in the protein (no frameshift)
Reference transcript: NM_001015880
Changed!
cd ATPS 369aa 1e-140
in ref transcript
ATP-sulfurylase (ATPS), also known as sulfate adenylate transferase, catalyzes the transfer of an adenylyl group from ATP to sulfate, forming adenosine 5'-phosphosulfate (APS). This reaction is generally accompanied by a further reaction, catalyzed by APS kinase, in which APS is phosphorylated to yield 3'-phospho-APS (PAPS). In some organisms the APS kinase is a separate protein, while in others it is incorporated with ATP sulfurylase in a bifunctional enzyme that catalyzes both reactions. In bifunctional proteins, the domain that performs the kinase activity can be attached at the N-terminal end of the sulfurylase unit or at the C-terminal end, depending on the organism. While the reaction is ubiquitous among organisms, the physiological role of the reaction varies. In some organisms it is used to generate APS from sulfate and ATP, while in others it proceeds in the opposite direction to generate ATP from APS and pyrophosphate. ATP sulfurylase can be a monomer, a homodimer, or a homo-oligomer, depending on the organism. ATPS belongs to a large superfamily of nucleotidyltransferases that includes pantothenate synthetase (PanC), phosphopantetheine adenylyltransferase (PPAT), and the amino-acyl tRNA synthetases. The enzymes of this family are structurally similar and share a dinucleotide-binding domain.
cd APSK 151aa 9e-73
in ref transcript
Adenosine 5'-phosphosulfate kinase (APSK) catalyzes the phosphorylation of adenosine 5'-phosphosulfate to form 3'-phosphoadenosine 5'-phosphosulfate (PAPS). The end-product PAPS is a biologically "activated" sulfate form important for the assimilation of inorganic sulfate.
Changed!
pfam ATP-sulfurylase 324aa 1e-120
in ref transcript
ATP-sulfurylase. This family consists of ATP-sulfurylase or sulfate adenylyltransferase EC:2.7.7.4 some of which are part of a bifunctional polypeptide chain associated with adenosyl phosphosulphate (APS) kinase pfam01583. Both enzymes are required for PAPS (phosphoadenosine-phosphosulfate) synthesis from inorganic sulphate. ATP sulfurylase catalyses the synthesis of adenosine-phosphosulfate APS from ATP and inorganic sulphate.
pfam APS_kinase 159aa 4e-78
in ref transcript
Adenylylsulphate kinase. Enzyme that catalyses the phosphorylation of adenylylsulphate to 3'-phosphoadenylylsulfate. This domain contains an ATP binding P-loop motif.
Changed!
cd ATPS 364aa 1e-140
in modified transcript
Changed!
pfam ATP-sulfurylase 322aa 1e-119
in modified transcript
Changed!
PRK sat 384aa 1e-54
in modified transcript
PARK2
rs.PARK2.F1 rs.PARK2.R1 183 267
NCBIGene 36.3 5071
Single exon skipping, size difference: 84
Exclusion in the protein (no frameshift)
Reference transcript: NM_004562
cd parkin_N 70aa 9e-31
in ref transcript
parkin_N parkin protein is a RING-type E3 ubiquitin ligase with an amino-terminal ubiquitin-like (Ubl) domain and an RBR signature consisting of two RING finger domains separated by an IBR/DRIL domain. Naturally occurring mutations in parkin are thought to cause the disease AR_JP (autosomal-recessive juvenile parkinsonism). Parkin binds the Rpn10 subunit of 26S proteasomes through its Ubl domain.
pfam ubiquitin 61aa 2e-15
in ref transcript
Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1, and Parkin. A number of them are thought to carry a distinctive five-residue motif termed the proteasome-interacting motif (PIM), which may have a biologically significant role in protein delivery to proteasomes and recruitment of proteasomes to transcription sites.
pfam IBR 63aa 2e-06
in ref transcript
IBR domain. The IBR (In Between Ring fingers) domain is often found to occur between pairs of ring fingers (pfam00097). This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain. Proteins that contain two Ring fingers and an IBR domain (these proteins are also termed RBR family proteins) are thought to exist in all eukaryotic organisms. RBR family members play roles in protein quality control and can indirectly regulate transcription. Evidence suggests that RBR proteins are often parts of cullin-containing ubiquitin ligase complexes. The ubiquitin ligase Parkin is an RBR family protein whose mutations are involved in forms of familial Parkinson's disease.
PTZ PTZ00044 54aa 7e-06
in ref transcript
ubiquitin; Provisional.
PARP2
rs.PARP2.F1 rs.PARP2.R1 258 297
NCBIGene 36.3 10038
Alternative 5-prime, size difference: 39
Exclusion in the protein (no frameshift)
Reference transcript: NM_005484
cd parp_like 344aa 1e-129
in ref transcript
Poly(ADP-ribose) polymerase (parp) catalytic domain catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ribose) polymerase is a regulatory component induced by DNA damage. The carboxyl-terminal region is the most highly conserved region of the protein. Experiments have shown that a carboxyl 40 kDa fragment is still catalytically active. Poly(ADP-ribose)-like polymerases (PARPS 1-3, VPARP, tankyrase) catalyze the addition of up to 100 ADP_ribose units from NAD+. PARPs 1 and 2 are localized in the nucleaus, bind DNA, and are activated by DNA damage. VPARP is part of the vault ribonucleoprotein complex. Tankyrases regulates telomere length through interactions with telomere repeat binding factor 1.
pfam PARP 213aa 1e-80
in ref transcript
Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ribose) polymerase is a regulatory component induced by DNA damage. The carboxyl-terminal region is the most highly conserved region of the protein. Experiments have shown that a carboxyl 40 kDa fragment is still catalytically active.
pfam PARP_reg 130aa 5e-30
in ref transcript
Poly(ADP-ribose) polymerase, regulatory domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ribose) polymerase is a regulatory component induced by DNA damage. The carboxyl-terminal region is the most highly conserved region of the protein. Experiments have shown that a carboxyl 40 kDa fragment is still catalytically active.
smart WGR 86aa 3e-25
in ref transcript
Proposed nucleic acid binding domain. This domain is named after its most conserved central motif. It is found in a variety of polyA polymerases as well as in molybdate metabolism regulators (e.g. in E.coli) and other proteins of unknown function. The domain is found in isolation in some proteins and is between 70 and 80 residues in length. It is proposed that it may be a nucleic acid binding domain.
PAX3
rs.PAX3.F1 rs.PAX3.R1 110 133
NCBIGene 36.3 5077
Alternative 5-prime and 3-prime, size difference: 23
Inclusion in the protein causing a new stop codon, Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_181459
cd PAX 128aa 2e-61
in ref transcript
Paired Box domain.
cd homeodomain 58aa 6e-18
in ref transcript
Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
The biotinyl-domain or biotin carboxyl carrier protein (BCCP) domain is present in all biotin-dependent enzymes, such as acetyl-CoA carboxylase, pyruvate carboxylase, propionyl-CoA carboxylase, methylcrotonyl-CoA carboxylase, geranyl-CoA carboxylase, oxaloacetate decarboxylase, methylmalonyl-CoA decarboxylase, transcarboxylase and urea amidolyase. This domain functions in transferring CO2 from one subsite to another, allowing carboxylation, decarboxylation, or transcarboxylation. During this process, biotin is covalently attached to a specific lysine.
TIGR pyruv_carbox 1140aa 0.0
in ref transcript
This enzyme plays a role in gluconeogensis but not glycolysis.
PRK PRK12999 1146aa 0.0
in ref transcript
pyruvate carboxylase; Reviewed.
PCBP2
rs.PCBP2.F1 rs.PCBP2.R1 113 206
NCBIGene 36.3 5094
Single exon skipping, size difference: 93
Exclusion in the protein (no frameshift)
Reference transcript: NM_005016
cd PCBP_like_KH 65aa 5e-16
in ref transcript
K homology RNA-binding domain, PCBP_like. Members of this group possess KH domains in a tandem arrangement. Most members, similar to the poly(C) binding proteins (PCBPs) and Nova, containing three KH domains, with the first and second domains, which are represented here, in tandem arrangement, followed by a large spacer region, with the third domain near the C-terminal end of the protein. The poly(C) binding proteins (PCBPs) can be divided into two groups, hnRNPs K/J and the alphaCPs, which share a triple KH domain configuration and poly(C) binding specificity. They play roles in mRNA stabilization, translational activation, and translational silencing. Nova-1 and Nova-2 are nuclear RNA-binding proteins that regulate splicing. This group also contains plant proteins that seem to have two tandem repeat arrrangements, like Hen4, a protein that plays a role in AGAMOUS (AG) pre-mRNA processing and important step in plant development. In general, KH binds single-stranded RNA or DNA. It is found in a wide variety of proteins including ribosomal proteins, transcription factors and post-transcriptional modifiers of mRNA.
cd PCBP_like_KH 62aa 4e-14
in ref transcript
cd KH-I 63aa 7e-12
in ref transcript
K homology RNA-binding domain, type I. KH binds single-stranded RNA or DNA. It is found in a wide variety of proteins including ribosomal proteins, transcription factors and post-transcriptional modifiers of mRNA. There are two different KH domains that belong to different protein folds, but they share a single KH motif. The KH motif is folded into a beta alpha alpha beta unit. In addition to the core, type II KH domains (e.g. ribosomal protein S3) include N-terminal extension and type I KH domains (e.g. hnRNP K) contain C-terminal extension.
pfam KH_1 61aa 2e-11
in ref transcript
KH domain. KH motifs can bind RNA in vitro. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia.
pfam KH_1 63aa 3e-11
in ref transcript
pfam KH_1 64aa 6e-10
in ref transcript
COG NusA 122aa 6e-04
in ref transcript
Transcription elongation factor [Transcription].
PCBP2
rs.PCBP2.F2 rs.PCBP2.R2 97 109
NCBIGene 36.3 5094
Alternative 5-prime and 3-prime, size difference: 12
Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_005016
cd PCBP_like_KH 65aa 5e-16
in ref transcript
K homology RNA-binding domain, PCBP_like. Members of this group possess KH domains in a tandem arrangement. Most members, similar to the poly(C) binding proteins (PCBPs) and Nova, containing three KH domains, with the first and second domains, which are represented here, in tandem arrangement, followed by a large spacer region, with the third domain near the C-terminal end of the protein. The poly(C) binding proteins (PCBPs) can be divided into two groups, hnRNPs K/J and the alphaCPs, which share a triple KH domain configuration and poly(C) binding specificity. They play roles in mRNA stabilization, translational activation, and translational silencing. Nova-1 and Nova-2 are nuclear RNA-binding proteins that regulate splicing. This group also contains plant proteins that seem to have two tandem repeat arrrangements, like Hen4, a protein that plays a role in AGAMOUS (AG) pre-mRNA processing and important step in plant development. In general, KH binds single-stranded RNA or DNA. It is found in a wide variety of proteins including ribosomal proteins, transcription factors and post-transcriptional modifiers of mRNA.
cd PCBP_like_KH 62aa 4e-14
in ref transcript
cd KH-I 63aa 7e-12
in ref transcript
K homology RNA-binding domain, type I. KH binds single-stranded RNA or DNA. It is found in a wide variety of proteins including ribosomal proteins, transcription factors and post-transcriptional modifiers of mRNA. There are two different KH domains that belong to different protein folds, but they share a single KH motif. The KH motif is folded into a beta alpha alpha beta unit. In addition to the core, type II KH domains (e.g. ribosomal protein S3) include N-terminal extension and type I KH domains (e.g. hnRNP K) contain C-terminal extension.
pfam KH_1 61aa 2e-11
in ref transcript
KH domain. KH motifs can bind RNA in vitro. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia.
pfam KH_1 63aa 3e-11
in ref transcript
pfam KH_1 64aa 6e-10
in ref transcript
COG NusA 122aa 6e-04
in ref transcript
Transcription elongation factor [Transcription].
PCMTD2
rs.PCMTD2.F1 rs.PCMTD2.R1 263 344
NCBIGene 36.3 55251
Alternative 3-prime, size difference: 81
Exclusion in the protein (no frameshift)
Reference transcript: NM_018257
cd AdoMet_MTases 110aa 6e-04
in ref transcript
S-adenosylmethionine-dependent methyltransferases (SAM or AdoMet-MTase), class I; AdoMet-MTases are enzymes that use S-adenosyl-L-methionine (SAM or AdoMet) as a substrate for methyltransfer, creating the product S-adenosyl-L-homocysteine (AdoHcy). There are at least five structurally distinct families of AdoMet-MTases, class I being the largest and most diverse. Within this class enzymes can be classified by different substrate specificities (small molecules, lipids, nucleic acids, etc.) and different target atoms for methylation (nitrogen, oxygen, carbon, sulfur, etc.).
Protein-L-isoaspartate carboxylmethyltransferase [Posttranslational modification, protein turnover, chaperones].
Changed!
pfam PCMT 183aa 4e-20
in modified transcript
Changed!
COG Pcm 172aa 1e-13
in modified transcript
PCSK6
rs.PCSK6.F1 rs.PCSK6.R1 171 210
NCBIGene 36.3 5046
Single exon skipping, size difference: 39
Exclusion in the protein (no frameshift)
Reference transcript: NM_138320
cd FU 51aa 6e-07
in ref transcript
Furin-like repeats. Cysteine rich region. Exact function of the domain is not known. Furin is a serine-kinase dependent proprotein processor. Other members of this family include endoproteases and cell surface receptors.
cd FU 43aa 2e-06
in ref transcript
cd FU 47aa 3e-06
in ref transcript
cd FU 53aa 1e-05
in ref transcript
pfam Peptidase_S8 301aa 2e-94
in ref transcript
Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold containing a 7-stranded parallel beta sheet, order 2314567.
pfam P_proprotein 91aa 4e-34
in ref transcript
Proprotein convertase P-domain. A unique feature of the eukaryotic subtilisin-like proprotein convertases is the presence of an additional highly conserved sequence of approximately 150 residues (P domain) located immediately downstream of the catalytic domain.
smart FU 48aa 2e-08
in ref transcript
Furin-like repeats.
smart FU 44aa 2e-08
in ref transcript
smart FU 44aa 1e-06
in ref transcript
pfam Furin-like 112aa 4e-06
in ref transcript
Furin-like cysteine rich region.
pfam VSP 115aa 5e-04
in ref transcript
Giardia variant-specific surface protein.
COG AprE 324aa 1e-20
in ref transcript
Subtilisin-like serine proteases [Posttranslational modification, protein turnover, chaperones].
COG COG4935 91aa 1e-15
in ref transcript
Regulatory P domain of the subtilisin-like proprotein convertases and other proteases [Posttranslational modification, protein turnover, chaperones].
COG NapH 83aa 0.009
in ref transcript
Polyferredoxin [Energy production and conversion].
PCSK6
rs.PCSK6.F2 rs.PCSK6.R2 227 273
NCBIGene 36.3 5046
Single exon skipping, size difference: 46
Exclusion in the protein causing a frameshift
Reference transcript: NM_138324
pfam Peptidase_S8 301aa 3e-91
in ref transcript
Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold containing a 7-stranded parallel beta sheet, order 2314567.
Changed!
pfam P_proprotein 84aa 2e-28
in ref transcript
Proprotein convertase P-domain. A unique feature of the eukaryotic subtilisin-like proprotein convertases is the presence of an additional highly conserved sequence of approximately 150 residues (P domain) located immediately downstream of the catalytic domain.
COG AprE 324aa 4e-16
in ref transcript
Subtilisin-like serine proteases [Posttranslational modification, protein turnover, chaperones].
COG COG4935 69aa 4e-12
in ref transcript
Regulatory P domain of the subtilisin-like proprotein convertases and other proteases [Posttranslational modification, protein turnover, chaperones].
Changed!
pfam P_proprotein 85aa 7e-29
in modified transcript
PCTK3
rs.PCTK3.F1 rs.PCTK3.R1 282 364
NCBIGene 36.3 5129
Alternative 5-prime, size difference: 82
Inclusion in 5'UTR
Reference transcript: NM_212503
cd S_TKc 282aa 3e-64
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
smart S_TKc 272aa 1e-68
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
PTZ PTZ00024 284aa 1e-44
in ref transcript
cyclin-dependent protein kinase; Provisional.
PCTP
rs.PCTP.F1 rs.PCTP.R1 99 332
NCBIGene 36.3 58488
Alternative 5-prime, size difference: 233
Inclusion in the protein causing a new stop codon
Reference transcript: NM_021213
Changed!
cd START 204aa 4e-30
in ref transcript
START(STeroidogenic Acute Regulatory (STAR) related lipid Transfer) Domain. These domains are 200-210 amino acid in length and occur in proteins involved in lipid transport (phosphatidylcholine) and metabolism, signal transduction, and transcriptional regulation. The most striking feature of the START domain structure is a predominantly hydrophobic tunnel extending nearly the entire protein and used to binding a single molecule of large lipophilic compounds, like cholesterol.
Changed!
smart START 204aa 5e-40
in ref transcript
in StAR and phosphatidylcholine transfer protein. putative lipid-binding domain in StAR and phosphatidylcholine transfer protein.
PDE8A
rs.PDE8A.F1 rs.PDE8A.R1 120 258
NCBIGene 36.3 5151
Single exon skipping, size difference: 138
Exclusion in the protein (no frameshift)
Reference transcript: NM_002605
Changed!
cd PAS 99aa 3e-06
in ref transcript
PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.
cd HDc 195aa 4e-05
in ref transcript
Metal dependent phosphohydrolases with conserved 'HD' motif.
pfam PDEase_I 255aa 2e-55
in ref transcript
3'5'-cyclic nucleotide phosphodiesterase.
Changed!
pfam PAS 106aa 2e-07
in ref transcript
PAS fold. The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs. The PAS fold appears in archaea, eubacteria and eukarya.
pfam Response_reg 117aa 5e-07
in ref transcript
Response regulator receiver domain. This domain receives the signal from the sensor partner in bacterial two-component systems. It is usually found N-terminal to a DNA binding effector domain.
Metal dependent phosphohydrolases with conserved 'HD' motif.
Changed!
pfam PDEase_I 223aa 8e-56
in ref transcript
3'5'-cyclic nucleotide phosphodiesterase.
PDS5A
rs.PDS5A.F1 rs.PDS5A.R1 100 471
NCBIGene 36.3 23244
Alternative 5-prime and 3-prime, size difference: 371
Exclusion in 5'UTR, Exclusion of the protein initiation site
Reference transcript: NM_001100399
PELI3
rs.PELI3.F1 rs.PELI3.R1 338 410
NCBIGene 36.3 246330
Single exon skipping, size difference: 72
Exclusion in the protein (no frameshift)
Reference transcript: NM_145065
Changed!
pfam Pellino 394aa 1e-175
in ref transcript
Pellino. Pellino is involved in Toll-like signalling pathways, and associates with the kinase domain of the Pelle Ser/Thr kinase.
Changed!
pfam Pellino 416aa 0.0
in modified transcript
PFAAP5
rs.PFAAP5.F1 rs.PFAAP5.R1 102 152
NCBIGene 36.3 10443
Alternative 5-prime, size difference: 50
Inclusion in the protein causing a frameshift
Reference transcript: NM_033111
PH-4
rs.PH-4.F1 rs.PH-4.R1 142 325
NCBIGene 36.3 54681
Alternative 5-prime and 3-prime, size difference: 183
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_177938
cd EFh 61aa 3e-05
in ref transcript
EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
Changed!
smart P4Hc 98aa 3e-12
in ref transcript
Prolyl 4-hydroxylase alpha subunit homologues. Mammalian enzymes catalyse hydroxylation of collagen, for example. Prokaryotic enzymes might catalyse hydroxylation of antibiotic peptides. These are 2-oxoglutarate-dependent dioxygenases, requiring 2-oxoglutarate and dioxygen as cosubstrates and ferrous iron as a cofactor.
Changed!
smart P4Hc 85aa 4e-07
in ref transcript
COG FRQ1 127aa 8e-05
in ref transcript
Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].
Changed!
smart P4Hc 213aa 1e-23
in modified transcript
PHACTR2
rs.PHACTR2.F1 rs.PHACTR2.R1 310 550
NCBIGene 36.3 9749
Single exon skipping, size difference: 240
Exclusion in the protein (no frameshift)
Reference transcript: NM_001100164
pfam RPEL 26aa 0.003
in ref transcript
RPEL repeat. The RPEL repeat is named after four conserved amino acids it contains. The function of the RPEL repeat is unknown however it might be a DNA binding repeat based on the observation that a member from Drosophila melanogaster contains a pfam02037 domain that is also implicated in DNA binding.
pfam RPEL 26aa 0.006
in ref transcript
PHF20L1
rs.PHF20L1.F1 rs.PHF20L1.R1 108 337
NCBIGene 36.3 51105
Alternative 3-prime, size difference: 229
Inclusion in the protein causing a new stop codon
Reference transcript: NM_016018
cd TUDOR 46aa 2e-06
in ref transcript
Tudor domains are found in many eukaryotic organisms and have been implicated in protein-protein interactions in which methylated protein substrates bind to these domains. For example, the Tudor domain of Survival of Motor Neuron (SMN) binds to symmetrically dimethylated arginines of arginine-glycine (RG) rich sequences found in the C-terminal tails of Sm proteins. The SMN protein is linked to spinal muscular atrophy. Another example is the tandem tudor domains of 53BP1, which bind to histone H4 specifically dimethylated at Lys20 (H4-K20me2). 53BP1 is a key transducer of the DNA damage checkpoint signal.
smart TUDOR 54aa 3e-06
in ref transcript
Tudor domain. Domain of unknown function present in several RNA-binding proteins. 10 copies in the Drosophila Tudor protein. Initial proposal that the survival motor neuron gene product contain a Tudor domain are corroborated by more recent database search techniques such as PSI-BLAST (unpublished).
smart MBT 66aa 3e-05
in ref transcript
Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. These proteins are involved in transcriptional regulation.
Changed!
pfam PHD 44aa 1e-04
in ref transcript
PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_001101802
cd BAH_plant_2 39aa 3e-04
in ref transcript
BAH, or Bromo Adjacent Homology domain, plant-specific sub-family with unknown function. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
pfam PHD 44aa 1e-12
in ref transcript
PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
Phytanoyl-CoA dioxygenase (PhyH). This family is made up of several eukaryotic phytanoyl-CoA dioxygenase (PhyH) proteins, ectoine hydroxylases and a number of bacterial deoxygenases. PhyH is a peroxisomal enzyme catalysing the first step of phytanic acid alpha-oxidation. PhyH deficiency causes Refsum's disease (RD) which is an inherited neurological syndrome biochemically characterised by the accumulation of phytanic acid in plasma and tissues.
Changed!
pfam PhyH 250aa 9e-47
in modified transcript
Changed!
COG COG5285 258aa 1e-11
in modified transcript
Protein involved in biosynthesis of mitomycin antibiotics/polyketide fumonisin [Secondary metabolites biosynthesis, transport, and catabolism].
PHYHD1
rs.PHYHD1.F2 rs.PHYHD1.R2 124 187
NCBIGene 36.3 254295
Single exon skipping, size difference: 63
Exclusion in the protein (no frameshift)
Reference transcript: NM_001100876
Changed!
pfam PhyH 250aa 9e-47
in ref transcript
Phytanoyl-CoA dioxygenase (PhyH). This family is made up of several eukaryotic phytanoyl-CoA dioxygenase (PhyH) proteins, ectoine hydroxylases and a number of bacterial deoxygenases. PhyH is a peroxisomal enzyme catalysing the first step of phytanic acid alpha-oxidation. PhyH deficiency causes Refsum's disease (RD) which is an inherited neurological syndrome biochemically characterised by the accumulation of phytanic acid in plasma and tissues.
Changed!
COG COG5285 258aa 1e-11
in ref transcript
Protein involved in biosynthesis of mitomycin antibiotics/polyketide fumonisin [Secondary metabolites biosynthesis, transport, and catabolism].
Changed!
pfam PhyH 229aa 5e-44
in modified transcript
Changed!
COG COG5285 237aa 1e-12
in modified transcript
PHYHIP
rs.PHYHIP.F1 rs.PHYHIP.R1 280 402
NCBIGene 36.3 9796
Single exon skipping, size difference: 122
Exclusion in 5'UTR
Reference transcript: NM_001099335
cd FN3 108aa 7e-04
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
PICALM
rs.PICALM.F1 rs.PICALM.R1 116 266
NCBIGene 36.3 8301
Single exon skipping, size difference: 150
Exclusion in the protein (no frameshift)
Reference transcript: NM_007166
cd ANTH_AP180_CALM 117aa 1e-37
in ref transcript
ANTH domain family; composed of adaptor protein 180 (AP180), clathrin assembly lymphoid myeloid leukemia protein (CALM) and similar proteins. A set of proteins previously designated as harboring an ENTH domain in fact contains a highly similar, yet unique module referred to as an AP180 N-terminal homology (ANTH) domain. AP180 and CALM play important roles in clathrin-mediated endocytosis. AP180 is a brain-specific clathrin-binding protein which stimulates clathrin assembly during the recycling of synaptic vesicles. The ANTH domain is structurally similar to the VHS domain and is composed of a superhelix of eight alpha helices. ANTH domains bind both inositol phospholipids and proteins, and contribute to the nucleation and formation of clathrin coats on membranes. ANTH-bearing proteins have recently been shown to function with adaptor protein-1 and GGA adaptors at the trans-Golgi network, which suggests that the ANTH domain is a universal component of the machinery for clathrin-mediated membrane budding.
pfam ANTH 265aa 7e-80
in ref transcript
ANTH domain. AP180 is an endocytotic accessory proteins that has been implicated in the formation of clathrin-coated pits. The domain is involved in phosphatidylinositol 4,5-bisphosphate binding and is a universal adaptor for nucleation of clathrin coats.
PID1
rs.PID1.F1 rs.PID1.R1 123 334
NCBIGene 36.3 55022
Single exon skipping, size difference: 211
Exclusion of the protein initiation site
Reference transcript: NM_017933
cd CG8312 134aa 5e-10
in ref transcript
CG8312 Phosphotyrosine-binding (PTB) domain. PTB domains have a PH-like fold and are found in various eukaryotic signaling molecules. They were initially identified based upon their ability to recognize phosphorylated tyrosine residues. In contrast to SH2 domains, which recognize phosphotyrosine and adjacent carboxy-terminal residues, PTB-domain binding specificity is conferred by residues amino-terminal to the phosphotyrosine. More recent studies have found that some types of PTB domains can bind to peptides which are not tyrosine phosphorylated or lack tyrosine residues altogether.
PILRB
rs.PILRB.F1 rs.PILRB.R1 146 355
NCBIGene 36.3 29990
Alternative 3-prime, size difference: 209
Exclusion in the protein causing a frameshift
Reference transcript: NM_178238
Changed!
pfam V-set 92aa 1e-05
in ref transcript
Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
PIP5K3
rs.PIP5K3.F1 rs.PIP5K3.R1 146 437
NCBIGene 36.3 200576
Multiple exon skipping, size difference: 291
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_015040
cd Fab1_TCP 245aa 1e-104
in ref transcript
TCP-1 like domain of the eukaryotic phosphatidylinositol 3-phosphate (PtdIns3P) 5-kinase Fab1. Fab1p is important for vacuole size regulation, presumably by modulating PtdIns(3,5)P2 effector activity. In the human homolog p235/PIKfyve deletion of this domain leads to loss of catalytic activity. However no exact function this domain has been defined. In general, chaperonins are involved in productive folding of proteins.
cd PIPKc 314aa 5e-98
in ref transcript
Phosphatidylinositol phosphate kinases (PIPK) catalyze the phosphorylation of phosphatidylinositol phosphate on the fourth or fifth hydroxyl of the inositol ring, to form phosphatidylinositol bisphosphate. CD alignment includes type II phosphatidylinositol phosphate kinases (PIPKII-beta), type I andII PIPK (-alpha, -beta, and -gamma) kinases and related yeast Fab1p and Mss4p kinases. Signaling by phosphorylated species of phosphatidylinositol regulates secretion, vesicular trafficking, membrane translocation, cell adhesion, chemotaxis, DNA synthesis, and cell cycling. The catalytic core domains of PIPKs are structurally similar to PI3K, PI4K, and cAMP-dependent protein kinases (PKA), the dimerization region is a unique feature of the PIPKs.
cd DEP_PIKfyve 82aa 1e-38
in ref transcript
DEP (Dishevelled, Egl-10, and Pleckstrin) domain found in fungal RhoGEF (GDP/GTP exchange factor) PIKfyve-like proteins. PIKfyve contains N-terminal Fyve finger and DEP domains, a central chaperonin-like domain and a C-terminal PIPK (phosphatidylinositol phosphate kinase) domain. PIKfyve-like proteins are important phosphatidylinositol (3)-monophosphate (PtdIns(3)P)-5-kinases, producing PtdIns(3,5)P2, which plays a major role in multivesicular body (MVB) sorting and control of retrograde traffic from the vacuole back to the endosome and/or Golgi. PIKfyve itself has been shown to be play a role in regulating early-endosome-to-trans-Golgi network (TGN) retrograde trafficking.
Changed!
cd FYVE 53aa 2e-15
in ref transcript
FYVE domain; Zinc-binding domain; targets proteins to membrane lipids via interaction with phosphatidylinositol-3-phosphate, PI3P; present in Fab1, YOTB, Vac1, and EEA1;.
smart PIPKc 238aa 5e-72
in ref transcript
Phosphatidylinositol phosphate kinases.
TIGR chap_CCT_gamma 224aa 8e-32
in ref transcript
Members of this family, all eukaryotic, are part of the group II chaperonin complex called CCT (chaperonin containing TCP-1) or TRiC. The archaeal equivalent group II chaperonin is often called the thermosome. Both are somewhat related to the group I chaperonin of bacterial, GroEL/GroES. This family consists exclusively of the CCT gamma chain (part of a paralogous family) from animals, plants, fungi, and other eukaryotes.
Changed!
pfam FYVE 60aa 9e-20
in ref transcript
FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positions. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. We have included members which do not conserve these histidine residues but are clearly related.
smart DEP 73aa 3e-17
in ref transcript
Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signalling proteins that contain PH, rasGEF, rhoGEF, rhoGAP, RGS, PDZ domains. DEP domain in Drosophila dishevelled is essential to rescue planar polarity defects and induce JNK signalling (Cell 94, 109-118).
Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.
cd KR 86aa 4e-20
in ref transcript
Kringle domain; Kringle domains are believed to play a role in binding mediators, such as peptides, other proteins, membranes, or phospholipids. They are autonomous structural domains, found in a varying number of copies, in blood clotting and fibrinolytic proteins, some serine proteases and plasma proteins. Plasminogen-like kringles possess affinity for free lysine and lysine-containing peptides.
cd KR 85aa 5e-17
in ref transcript
Changed!
cd FN1 43aa 3e-04
in ref transcript
Fibronectin type 1 domain, approximately 40 residue long with two conserved disulfide bridges. FN1 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. FN1 domains also found in coagulation factor XII, HGF activator, and tissue-type plasminogen activator. In tissue plasminogen activator, FN1 domains may form functional fibrin-binding units with EGF-like domains C-terminal to FN1.
smart Tryp_SPc 247aa 1e-71
in ref transcript
Trypsin-like serine protease. Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.
pfam Kringle 82aa 3e-26
in ref transcript
Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta.
pfam Kringle 82aa 4e-21
in ref transcript
Changed!
smart FN1 43aa 4e-06
in ref transcript
Fibronectin type 1 domain. One of three types of internal repeat within the plasma protein, fibronectin. Found also in coagulation factor XII, HGF activator and tissue-type plasminogen activator. In t-PA and fibronectin, this domain type contributes to fibrin-binding.
COG COG5640 251aa 2e-23
in ref transcript
Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones].
PLAUR
rs.PLAUR.F1 rs.PLAUR.R1 206 341
NCBIGene 36.3 5329
Single exon skipping, size difference: 135
Exclusion in the protein (no frameshift)
Reference transcript: NM_002659
cd LU 82aa 8e-10
in ref transcript
Ly-6 antigen / uPA receptor -like domain; occurs singly in GPI-linked cell-surface glycoproteins (Ly-6 family,CD59, thymocyte B cell antigen, Sgp-2) or as three-fold repeated domain in urokinase-type plasminogen activator receptor. Topology of these domains is similar to that of snake venom neurotoxins.
Changed!
cd LU 87aa 1e-06
in ref transcript
smart LU 81aa 8e-12
in ref transcript
Ly-6 antigen / uPA receptor -like domain. Three-fold repeated domain in urokinase-type plasminogen activator receptor; occurs singly in other GPI-linked cell-surface glycoproteins (Ly-6 family, CD59, thymocyte B cell antigen, Sgp-2). Topology of these domains is similar to that of snake venom neurotoxins.
Changed!
smart LU 85aa 4e-11
in ref transcript
smart LU 77aa 3e-06
in ref transcript
PLCB4
rs.PLCB4.F1 rs.PLCB4.R1 122 159
NCBIGene 36.3 5332
Single exon skipping, size difference: 37
Exclusion in the protein causing a frameshift
Reference transcript: NM_000933
cd PLCc 142aa 3e-41
in ref transcript
Phospholipase C, catalytic domain; Phosphoinositide-specific phospholipases C catalyze hydrolysis of phosphatidylinositol-4,5-bisphosphate (PIP2) to D-myo-inositol-1,4,5-trisphosphate (1,4,5-IP3) and sn-1,2-diacylglycerol (DAG). Both products function as second messengers in eukaryotic signal transduction cascades. 1,4,5-IP3 triggers inflow of calcium from intracellular stores; the membrane resident product DAG controls cellular protein phosphorylation states by activating various protein kinase C isozymes. The enzyme comprises 2 regions (X and Y) connected via a linker which may contain inserted domains, X and Y together form a TIM barrel-like structure containing the active site residues.
cd PH_PLC 121aa 3e-26
in ref transcript
Phospholipase C (PLC) pleckstrin homology (PH) domain. There are several isozymes of PLC (beta, gamma, delta, epsilon. zeta). While, PLC beta, gamma and delta all have N-terminal PH domains, lipid binding specificity is not conserved between them. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
cd PLCc 99aa 2e-17
in ref transcript
cd C2_2 117aa 7e-14
in ref transcript
Protein kinase C conserved region 2, subgroup 2; C2 Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (amongst others); some PKCs lack calcium dependence. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Two distinct C2 topologies generated by permutation of the sequence with respect to the N- and C-terminal beta strands are seen. In this subgroup, containing phospholipases C and D( PLC-1, PLD) and specific protein kinases C (PKC) subtypes, the C-terminal beta strand occupies the position of what is the N-terminal strand in subgroup 1.
smart PLCYc 115aa 7e-55
in ref transcript
Phospholipase C, catalytic domain (part); domain Y. Phosphoinositide-specific phospholipases C. These enzymes contain 2 regions (X and Y) which together form a TIM barrel-like structure containing the active site residues. Phospholipase C enzymes (PI-PLC) act as signal transducers that generate two second messengers, inositol-1,4,5-trisphosphate and diacylglycerol. The bacterial enzyme [6] appears to be a homologue of the mammalian PLCs.
pfam PI-PLC-X 139aa 4e-51
in ref transcript
Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit.
pfam efhand_like 92aa 4e-23
in ref transcript
Phosphoinositide-specific phospholipase C, efhand-like. Members of this family are predominantly found in phosphoinositide-specific phospholipase C. They adopt a structure consisting of a core of four alpha helices, in an EF like fold, and are required for functioning of the enzyme.
pfam DUF1154 47aa 6e-13
in ref transcript
Protein of unknown function (DUF1154). This family represents a small conserved region of unknown function within eukaryotic phospholipase C (EC:3.1.4.3). All members also contain pfam00387 and pfam00388.
smart C2 98aa 1e-11
in ref transcript
Protein kinase C conserved region 2 (CalB). Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotamins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates, and intracellular proteins. Unusual occurrence in perforin. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands. SMART detects C2 domains using one or both of two profiles.
Changed!
pfam SMC_N 256aa 7e-06
in ref transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Changed!
COG Smc 258aa 5e-05
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
Changed!
pfam SMC_N 243aa 4e-07
in modified transcript
Changed!
COG Smc 249aa 1e-04
in modified transcript
PNMA5
rs.PNMA5.F1 rs.PNMA5.R1 100 114
NCBIGene 36.3 114824
Alternative 5-prime, size difference: 14
Exclusion in 5'UTR
Reference transcript: NM_001103151
PNMAL1
rs.PNMAL1.F1 rs.PNMAL1.R1 106 311
NCBIGene 36.3 55228
Alternative 5-prime, size difference: 205
Exclusion in the protein causing a frameshift
Reference transcript: NM_018215
PNPLA7
rs.PNPLA7.F1 rs.PNPLA7.R1 332 407
NCBIGene 36.3 375775
Single exon skipping, size difference: 75
Exclusion in the protein (no frameshift)
Reference transcript: NM_001098537
cd Pat_PNPLA6_PNPLA7 306aa 1e-176
in ref transcript
Patatin-like phospholipase domain containing protein 6 and protein 7. Patatin-like phospholipase domain containing protein 6 (PNPLA6) and protein 7 (PNPLA7) are 60% identical to each other. PNPLA6 is commonly known as Neuropathy Target Esterase (NTE). NTE has at least two functional domains: the N-terminal domain putatively regulatory domain and the C-terminal catalytic domain which shows esterase activity. NTE shows phospholipase activity for lysophosphatidylcholine (LPC) and phosphatidylcholine (PC). Exposure of NTE to organophosphates leads to organophosphate-induced delayed neurotoxicity (OPIDN). OPIDN is a progressive neurological condition that is characterized by weakness, paralysis, pain, and paresthesia. PNPLA7 is an insulin-regulated phospholipase that is homologous to Neuropathy Target Esterase (NTE or PNPLA6) and is also known as NTE-related esterase (NRE). Human NRE is predominantly expressed in prostate, white adipose, and pancreatic tissue. NRE hydrolyzes sn-1 esters in lysophosphatidylcholine and lysophosphatidic acid, but shows no lipase activity with substrates like triacylglycerols (TG), cholesteryl esters, retinyl esters (RE), phosphatidylcholine (PC), or monoacylglycerol (MG). This family includes PNPLA6 and PNPLA7 from Homo sapiens, YMF9 from Yeast, and Swiss Cheese protein (sws) from Drosophila melanogaster.
cd CAP_ED 109aa 1e-15
in ref transcript
effector domain of the CAP family of transcription factors; members include CAP (or cAMP receptor protein (CRP)), which binds cAMP, FNR (fumarate and nitrate reduction), which uses an iron-sulfur cluster to sense oxygen) and CooA, a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. Cyclic nucleotide-binding domain similar to CAP are also present in cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) and vertebrate cyclic nucleotide-gated ion-channels. Cyclic nucleotide-monophosphate binding domain; proteins that bind cyclic nucleotides (cAMP or cGMP) share a structural domain of about 120 residues; the best studied is the prokaryotic catabolite gene activator, CAP, where such a domain is known to be composed of three alpha-helices and a distinctive eight-stranded, antiparallel beta-barrel structure; three conserved glycine residues are thought to be essential for maintenance of the structural integrity of the beta-barrel; CooA is a homodimeric transcription factor that belongs to CAP family; cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) contain two tandem copies of the cyclic nucleotide-binding domain; cAPK's are composed of two different subunits, a catalytic chain and a regulatory chain, which contains both copies of the domain; cGPK's are single chain enzymes that include the two copies of the domain in their N-terminal section; also found in vertebrate cyclic nucleotide-gated ion-channels.
cd CAP_ED 109aa 1e-12
in ref transcript
cd CAP_ED 122aa 6e-11
in ref transcript
pfam Patatin 151aa 3e-28
in ref transcript
Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity of lipid acyl hydrolase, catalysing the cleavage of fatty acids from membrane lipids. Members of this family have been found also in vertebrates.
pfam cNMP_binding 91aa 4e-14
in ref transcript
Cyclic nucleotide-binding domain.
pfam cNMP_binding 91aa 1e-13
in ref transcript
pfam cNMP_binding 98aa 4e-09
in ref transcript
COG RssA 274aa 1e-35
in ref transcript
Predicted esterase of the alpha-beta hydrolase superfamily [General function prediction only].
COG Crp 116aa 1e-11
in ref transcript
cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms].
COG Crp 130aa 3e-11
in ref transcript
COG Crp 143aa 2e-10
in ref transcript
PODXL
rs.PODXL.F1 rs.PODXL.R1 305 401
NCBIGene 36.3 5420
Single exon skipping, size difference: 96
Exclusion in the protein (no frameshift)
Reference transcript: NM_001018111
pfam CD34_antigen 201aa 2e-62
in ref transcript
CD34/Podocalyxin family. This family consists of several mammalian CD34 antigen proteins. The CD34 antigen is a human leukocyte membrane protein expressed specifically by lymphohematopoietic progenitor cells. CD34 is a phosphoprotein. Activation of protein kinase C (PKC) has been found to enhance CD34 phosphorylation. This family contains several eukaryotic podocalyxin proteins. Podocalyxin is a major membrane protein of the glomerular epithelium and is thought to be involved in maintenance of the architecture of the foot processes and filtration slits characteristic of this unique epithelium by virtue of its high negative charge. Podocalyxin functions as an anti-adhesin that maintains an open filtration pathway between neighbouring foot processes in the glomerular epithelium by charge repulsion.
POLR3H
rs.POLR3H.F1 rs.POLR3H.R1 123 210
NCBIGene 36.3 171568
Single exon skipping, size difference: 87
Exclusion in 5'UTR
Reference transcript: NM_001018050
POMT1
rs.POMT1.F1 rs.POMT1.R1 214 366
NCBIGene 36.3 10585
Single exon skipping, size difference: 152
Exclusion of the protein initiation site
Reference transcript: NM_007171
Changed!
pfam PMT 215aa 1e-58
in ref transcript
Dolichyl-phosphate-mannose-protein mannosyltransferase. This is a family of Dolichyl-phosphate-mannose-protein mannosyltransferase proteins EC:2.4.1.109. These proteins are responsible for O-linked glycosylation of proteins, they catalyse the reaction:- Dolichyl phosphate D-mannose + protein <=> dolichyl phosphate + O-D-mannosyl-protein. Also in this family is Drosophila rotated abdomen protein which is a putative mannosyltransferase. This family appears to be distantly related to pfam02516 (A Bateman pers. obs.).
pfam MIR 173aa 2e-10
in ref transcript
MIR domain. The MIR (protein mannosyltransferase, IP3R and RyR) domain is a domain that may have a ligand transferase function.
Changed!
COG PMT1 729aa 2e-97
in ref transcript
Dolichyl-phosphate-mannose--protein O-mannosyl transferase [Posttranslational modification, protein turnover, chaperones].
Changed!
pfam PMT 180aa 7e-49
in modified transcript
Changed!
COG PMT1 683aa 1e-88
in modified transcript
POMT1
rs.POMT1.F2 rs.POMT1.R2 157 223
NCBIGene 36.3 10585
Alternative 5-prime, size difference: 66
Exclusion in the protein (no frameshift)
Reference transcript: NM_007171
Changed!
pfam PMT 215aa 1e-58
in ref transcript
Dolichyl-phosphate-mannose-protein mannosyltransferase. This is a family of Dolichyl-phosphate-mannose-protein mannosyltransferase proteins EC:2.4.1.109. These proteins are responsible for O-linked glycosylation of proteins, they catalyse the reaction:- Dolichyl phosphate D-mannose + protein <=> dolichyl phosphate + O-D-mannosyl-protein. Also in this family is Drosophila rotated abdomen protein which is a putative mannosyltransferase. This family appears to be distantly related to pfam02516 (A Bateman pers. obs.).
pfam MIR 173aa 2e-10
in ref transcript
MIR domain. The MIR (protein mannosyltransferase, IP3R and RyR) domain is a domain that may have a ligand transferase function.
Changed!
COG PMT1 729aa 2e-97
in ref transcript
Dolichyl-phosphate-mannose--protein O-mannosyl transferase [Posttranslational modification, protein turnover, chaperones].
Changed!
pfam PMT 215aa 1e-57
in modified transcript
Changed!
COG PMT1 707aa 2e-98
in modified transcript
PON2
rs.PON2.F1 rs.PON2.R1 152 188
NCBIGene 36.3 5445
Alternative 3-prime, size difference: 36
Exclusion in the protein (no frameshift)
Reference transcript: NM_000305
pfam Arylesterase 86aa 6e-38
in ref transcript
Arylesterase. This family consists of arylesterases (Also known as serum paraoxonase) EC:3.1.1.2. These enzymes hydrolyse organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity. Human arylesterase (PON1) is associated with HDL and may protect against LDL oxidation.
Changed!
COG COG3386 192aa 4e-09
in ref transcript
Gluconolactonase [Carbohydrate transport and metabolism].
Changed!
COG COG3386 138aa 1e-08
in modified transcript
POT1
rs.POT1.F1 rs.POT1.R1 256 353
NCBIGene 36.3 25913
Single exon skipping, size difference: 97
Inclusion in the protein causing a new stop codon
Reference transcript: NM_015450
cd hPOT1_OB2 118aa 2e-39
in ref transcript
hPOT1_OB2: A subfamily of OB folds similar to the second OB fold (OB2) of human protection of telomeres 1 protein (hPOT1). POT1 proteins bind to the single-stranded (ss) 3-prime ends of the telomere. hPOT1 binds specifically to ss telomeric DNA repeats ending with the sequence GGTTAG. The hPOT1 monomer consists of two closely connected OB folds (OB1-OB2) which cooperate to bind telomeric ssDNA. OB1 makes more extensive contact with the ssDNA than OB2. OB2 protects the 3' end of the ssDNA. hPOT1 is implicated in telomere length regulation.
cd hPOT1_OB1_like 133aa 2e-33
in ref transcript
hPOT1_OB1_like: A subfamily of OB folds similar to the first OB fold (OB1) of human protection of telomeres 1 protein (hPOT1), the single OB fold of the N-terminal domain of Schizosaccharomyces pombe POT1 (SpPOT1), and the first OB fold of the N-terminal domain of the alpha subunit (OB1Nalpha) of Oxytricha nova telomere end binding protein (OnTEBP). POT1 proteins recognize single-stranded (ss) 3-prime ends of the telomere. A 3-prime ss overhang is conserved in ciliated protozoa, yeast, and mammals. SpPOT1 is essential for telomere maintenance. It binds specifically to the ss G-rich telomeric sequence (GGTTAC) of S. pombe. hPOT1 binds specifically to ss telomeric DNA repeats ending with the sequence GGTTAG. Deletion of the S. pombe pot1+ gene results in a rapid loss of telomere sequences, chromosome mis-segregation and chromosome circularization. hPOT1 is implicated in telomere length regulation. The hPOT1 monomer consists of two closely connected OB folds (OB1-OB2) which cooperate to bind telomeric ssDNA. OB1 makes more extensive contact with the ssDNA than OB2. OB2 protects the 3' end of the ssDNA. A second OB fold has not been predicted in S. pombe POT1. OnTEBP binds the extreme 3-prime end of telomeric DNA. It is heterodimeric and contains four OB folds - three in the alpha subunit (two in the N-terminal domain and one in the C-terminal domain) and one in the beta subunit. OB1Nalpha, together with the second OB fold of the N-terminal domain of OnTEBP alpha subunit and the beta subunit OB fold, forms a deep cleft that binds ssDNA.
pfam Telo_bind 103aa 4e-14
in ref transcript
Telomere-binding protein alpha subunit, central domain. The telomere-binding protein forms a heterodimer in ciliates consisting of an alpha and a beta subunit. This complex may function as a protective cap for the single-stranded telomeric overhang. Alpha subunit consists of 3 structural domains, all with the same beta-barrel OB fold.
POT1
rs.POT1.F2 rs.POT1.R2 107 238
NCBIGene 36.3 25913
Single exon skipping, size difference: 131
Exclusion in the protein causing a frameshift
Reference transcript: NM_015450
Changed!
cd hPOT1_OB2 118aa 2e-39
in ref transcript
hPOT1_OB2: A subfamily of OB folds similar to the second OB fold (OB2) of human protection of telomeres 1 protein (hPOT1). POT1 proteins bind to the single-stranded (ss) 3-prime ends of the telomere. hPOT1 binds specifically to ss telomeric DNA repeats ending with the sequence GGTTAG. The hPOT1 monomer consists of two closely connected OB folds (OB1-OB2) which cooperate to bind telomeric ssDNA. OB1 makes more extensive contact with the ssDNA than OB2. OB2 protects the 3' end of the ssDNA. hPOT1 is implicated in telomere length regulation.
Changed!
cd hPOT1_OB1_like 133aa 2e-33
in ref transcript
hPOT1_OB1_like: A subfamily of OB folds similar to the first OB fold (OB1) of human protection of telomeres 1 protein (hPOT1), the single OB fold of the N-terminal domain of Schizosaccharomyces pombe POT1 (SpPOT1), and the first OB fold of the N-terminal domain of the alpha subunit (OB1Nalpha) of Oxytricha nova telomere end binding protein (OnTEBP). POT1 proteins recognize single-stranded (ss) 3-prime ends of the telomere. A 3-prime ss overhang is conserved in ciliated protozoa, yeast, and mammals. SpPOT1 is essential for telomere maintenance. It binds specifically to the ss G-rich telomeric sequence (GGTTAC) of S. pombe. hPOT1 binds specifically to ss telomeric DNA repeats ending with the sequence GGTTAG. Deletion of the S. pombe pot1+ gene results in a rapid loss of telomere sequences, chromosome mis-segregation and chromosome circularization. hPOT1 is implicated in telomere length regulation. The hPOT1 monomer consists of two closely connected OB folds (OB1-OB2) which cooperate to bind telomeric ssDNA. OB1 makes more extensive contact with the ssDNA than OB2. OB2 protects the 3' end of the ssDNA. A second OB fold has not been predicted in S. pombe POT1. OnTEBP binds the extreme 3-prime end of telomeric DNA. It is heterodimeric and contains four OB folds - three in the alpha subunit (two in the N-terminal domain and one in the C-terminal domain) and one in the beta subunit. OB1Nalpha, together with the second OB fold of the N-terminal domain of OnTEBP alpha subunit and the beta subunit OB fold, forms a deep cleft that binds ssDNA.
Changed!
pfam Telo_bind 103aa 4e-14
in ref transcript
Telomere-binding protein alpha subunit, central domain. The telomere-binding protein forms a heterodimer in ciliates consisting of an alpha and a beta subunit. This complex may function as a protective cap for the single-stranded telomeric overhang. Alpha subunit consists of 3 structural domains, all with the same beta-barrel OB fold.
Changed!
cd hPOT1_OB1_like 37aa 4e-06
in modified transcript
POT1
rs.POT1.F3 rs.POT1.R3 102 191
NCBIGene 36.3 25913
Single exon skipping, size difference: 89
Exclusion in the protein causing a frameshift
Reference transcript: NM_015450
cd hPOT1_OB2 118aa 2e-39
in ref transcript
hPOT1_OB2: A subfamily of OB folds similar to the second OB fold (OB2) of human protection of telomeres 1 protein (hPOT1). POT1 proteins bind to the single-stranded (ss) 3-prime ends of the telomere. hPOT1 binds specifically to ss telomeric DNA repeats ending with the sequence GGTTAG. The hPOT1 monomer consists of two closely connected OB folds (OB1-OB2) which cooperate to bind telomeric ssDNA. OB1 makes more extensive contact with the ssDNA than OB2. OB2 protects the 3' end of the ssDNA. hPOT1 is implicated in telomere length regulation.
cd hPOT1_OB1_like 133aa 2e-33
in ref transcript
hPOT1_OB1_like: A subfamily of OB folds similar to the first OB fold (OB1) of human protection of telomeres 1 protein (hPOT1), the single OB fold of the N-terminal domain of Schizosaccharomyces pombe POT1 (SpPOT1), and the first OB fold of the N-terminal domain of the alpha subunit (OB1Nalpha) of Oxytricha nova telomere end binding protein (OnTEBP). POT1 proteins recognize single-stranded (ss) 3-prime ends of the telomere. A 3-prime ss overhang is conserved in ciliated protozoa, yeast, and mammals. SpPOT1 is essential for telomere maintenance. It binds specifically to the ss G-rich telomeric sequence (GGTTAC) of S. pombe. hPOT1 binds specifically to ss telomeric DNA repeats ending with the sequence GGTTAG. Deletion of the S. pombe pot1+ gene results in a rapid loss of telomere sequences, chromosome mis-segregation and chromosome circularization. hPOT1 is implicated in telomere length regulation. The hPOT1 monomer consists of two closely connected OB folds (OB1-OB2) which cooperate to bind telomeric ssDNA. OB1 makes more extensive contact with the ssDNA than OB2. OB2 protects the 3' end of the ssDNA. A second OB fold has not been predicted in S. pombe POT1. OnTEBP binds the extreme 3-prime end of telomeric DNA. It is heterodimeric and contains four OB folds - three in the alpha subunit (two in the N-terminal domain and one in the C-terminal domain) and one in the beta subunit. OB1Nalpha, together with the second OB fold of the N-terminal domain of OnTEBP alpha subunit and the beta subunit OB fold, forms a deep cleft that binds ssDNA.
pfam Telo_bind 103aa 4e-14
in ref transcript
Telomere-binding protein alpha subunit, central domain. The telomere-binding protein forms a heterodimer in ciliates consisting of an alpha and a beta subunit. This complex may function as a protective cap for the single-stranded telomeric overhang. Alpha subunit consists of 3 structural domains, all with the same beta-barrel OB fold.
POT1
rs.POT1.F4 rs.POT1.R4 106 226
NCBIGene 36.3 25913
Single exon skipping, size difference: 120
Inclusion in the protein causing a new stop codon
Reference transcript: NM_015450
cd hPOT1_OB2 118aa 2e-39
in ref transcript
hPOT1_OB2: A subfamily of OB folds similar to the second OB fold (OB2) of human protection of telomeres 1 protein (hPOT1). POT1 proteins bind to the single-stranded (ss) 3-prime ends of the telomere. hPOT1 binds specifically to ss telomeric DNA repeats ending with the sequence GGTTAG. The hPOT1 monomer consists of two closely connected OB folds (OB1-OB2) which cooperate to bind telomeric ssDNA. OB1 makes more extensive contact with the ssDNA than OB2. OB2 protects the 3' end of the ssDNA. hPOT1 is implicated in telomere length regulation.
cd hPOT1_OB1_like 133aa 2e-33
in ref transcript
hPOT1_OB1_like: A subfamily of OB folds similar to the first OB fold (OB1) of human protection of telomeres 1 protein (hPOT1), the single OB fold of the N-terminal domain of Schizosaccharomyces pombe POT1 (SpPOT1), and the first OB fold of the N-terminal domain of the alpha subunit (OB1Nalpha) of Oxytricha nova telomere end binding protein (OnTEBP). POT1 proteins recognize single-stranded (ss) 3-prime ends of the telomere. A 3-prime ss overhang is conserved in ciliated protozoa, yeast, and mammals. SpPOT1 is essential for telomere maintenance. It binds specifically to the ss G-rich telomeric sequence (GGTTAC) of S. pombe. hPOT1 binds specifically to ss telomeric DNA repeats ending with the sequence GGTTAG. Deletion of the S. pombe pot1+ gene results in a rapid loss of telomere sequences, chromosome mis-segregation and chromosome circularization. hPOT1 is implicated in telomere length regulation. The hPOT1 monomer consists of two closely connected OB folds (OB1-OB2) which cooperate to bind telomeric ssDNA. OB1 makes more extensive contact with the ssDNA than OB2. OB2 protects the 3' end of the ssDNA. A second OB fold has not been predicted in S. pombe POT1. OnTEBP binds the extreme 3-prime end of telomeric DNA. It is heterodimeric and contains four OB folds - three in the alpha subunit (two in the N-terminal domain and one in the C-terminal domain) and one in the beta subunit. OB1Nalpha, together with the second OB fold of the N-terminal domain of OnTEBP alpha subunit and the beta subunit OB fold, forms a deep cleft that binds ssDNA.
pfam Telo_bind 103aa 4e-14
in ref transcript
Telomere-binding protein alpha subunit, central domain. The telomere-binding protein forms a heterodimer in ciliates consisting of an alpha and a beta subunit. This complex may function as a protective cap for the single-stranded telomeric overhang. Alpha subunit consists of 3 structural domains, all with the same beta-barrel OB fold.
PPARA
rs.PPARA.F1 rs.PPARA.R1 127 211
NCBIGene 36.3 5465
Single exon skipping, size difference: 84
Exclusion in 5'UTR
Reference transcript: NM_005036
cd NR_LBD_PPAR 267aa 1e-108
in ref transcript
The ligand binding domain of peroxisome proliferator-activated receptors. The ligand binding domain (LBD) of peroxisome proliferator-activated receptors (PPAR): Peroxisome proliferator-activated receptors (PPARs) are members of the nuclear receptor superfamily of ligand-activated transcription factors. PPARs play important roles in regulating cellular differentiation, development and lipid metabolism. Activated PPAR forms a heterodimer with the retinoid X receptor (RXR) that binds to the hormone response element located upstream of the peroxisome proliferator responsive genes and interacts with co-activators. There are three subtypes of peroxisome proliferator activated receptors, alpha, beta (or delta), and gamma, each with a distinct tissue distribution. Several essential fatty acids, oxidized lipids and prostaglandin J derivatives can bind and activate PPAR. Like other members of the nuclear receptor (NR) superfamily of ligand-activated transcription factors, PPAR has a central well conserved DNA binding domain (DBD), a variable N-terminal regulatory domain, a flexible hinge a nd a C-terminal ligand binding domain (LBD).
pfam zf-C4 75aa 1e-33
in ref transcript
Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other.
pfam Hormone_recep 182aa 5e-29
in ref transcript
Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors.
PPARG
rs.PPARG.F1 rs.PPARG.R1 143 217
NCBIGene 36.3 5468
Single exon skipping, size difference: 74
Exclusion in 5'UTR
Reference transcript: NM_138712
cd NR_LBD_PPAR 268aa 1e-111
in ref transcript
The ligand binding domain of peroxisome proliferator-activated receptors. The ligand binding domain (LBD) of peroxisome proliferator-activated receptors (PPAR): Peroxisome proliferator-activated receptors (PPARs) are members of the nuclear receptor superfamily of ligand-activated transcription factors. PPARs play important roles in regulating cellular differentiation, development and lipid metabolism. Activated PPAR forms a heterodimer with the retinoid X receptor (RXR) that binds to the hormone response element located upstream of the peroxisome proliferator responsive genes and interacts with co-activators. There are three subtypes of peroxisome proliferator activated receptors, alpha, beta (or delta), and gamma, each with a distinct tissue distribution. Several essential fatty acids, oxidized lipids and prostaglandin J derivatives can bind and activate PPAR. Like other members of the nuclear receptor (NR) superfamily of ligand-activated transcription factors, PPAR has a central well conserved DNA binding domain (DBD), a variable N-terminal regulatory domain, a flexible hinge a nd a C-terminal ligand binding domain (LBD).
pfam zf-C4 75aa 1e-35
in ref transcript
Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other.
pfam Hormone_recep 182aa 4e-27
in ref transcript
Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors.
PPCS
rs.PPCS.F1 rs.PPCS.R1 100 545
NCBIGene 36.3 79717
Alternative 5-prime, size difference: 445
Exclusion in the protein causing a frameshift
Reference transcript: NM_024664
Changed!
pfam DFP 244aa 2e-54
in ref transcript
DNA / pantothenate metabolism flavoprotein. The DNA/pantothenate metabolism flavoprotein (EC:4.1.1.36) affects synthesis of DNA, and pantothenate metabolism.
Serine/threonine phosphatases, family 2C, catalytic domain; The protein architecture and deduced catalytic mechanism of PP2C phosphatases are similar to the PP1, PP2A, PP2B family of protein Ser/Thr phosphatases, with which PP2C shares no sequence similarity.
smart PP2Cc 277aa 6e-76
in ref transcript
Serine/threonine phosphatases, family 2C, catalytic domain. The protein architecture and deduced catalytic mechanism of PP2C phosphatases are similar to the PP1, PP2A, PP2B family of protein Ser/Thr phosphatases, with which PP2C shares no sequence similarity.
pfam PP2C_C 81aa 3e-26
in ref transcript
Protein serine/threonine phosphatase 2C, C-terminal domain. Protein phosphatase 2C (PP2C) is involved in regulating cellular responses to stress in various eukaryotes. It consists of two domains: an N-terminal catalytic domain and a C-terminal domain characteristic of mammalian PP2Cs. This domain consists of three antiparallel alpha helices, one of which packs against two corresponding alpha-helices of the N-terminal domain. The C-terminal domain does not seem to play a role in catalysis, but it may provide protein substrate specificity due to the cleft that is created between it and the catalytic domain.
PTZ PTZ00224 297aa 2e-46
in ref transcript
protein phosphatase 2C; Provisional.
PPP2R5D
rs.PPP2R5D.F1 rs.PPP2R5D.R1 195 291
NCBIGene 36.3 5528
Alternative 5-prime and 3-prime, size difference: 96
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_006245
Changed!
pfam B56 408aa 0.0
in ref transcript
Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzyme to act on a diverse array of substrates is largely controlled by the nature of its regulatory B subunit. There are multiple families of B subunits (See also pfam01240), this family is called the B56 family.
Changed!
pfam B56 400aa 0.0
in modified transcript
PPP4R1
rs.PPP4R1.F1 rs.PPP4R1.R1 254 305
NCBIGene 36.3 9989
Alternative 3-prime, size difference: 51
Exclusion in the protein (no frameshift)
Reference transcript: NM_001042388
PQBP1
rs.PQBP1.F1 rs.PQBP1.R1 99 110
NCBIGene 36.3 10084
Alternative 5-prime, size difference: 11
Exclusion in 5'UTR
Reference transcript: NM_001032383
pfam WW 31aa 0.004
in ref transcript
WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro.
PRDM10
rs.PRDM10.F1 rs.PRDM10.R1 103 115
NCBIGene 36.3 56980
Single exon skipping, size difference: 12
Exclusion in the protein (no frameshift)
Reference transcript: NM_020228
PRDM7
rs.PRDM7.F1 rs.PRDM7.R1 100 383
NCBIGene 36.3 11105
Single exon skipping, size difference: 283
Exclusion in the protein causing a frameshift
Reference transcript: NM_001098173
smart KRAB 59aa 4e-08
in ref transcript
krueppel associated box.
pfam SSXRD 33aa 1e-05
in ref transcript
SSXRD motif. SSX1 can repress transcription, and this has been attributed to a putative Kruppel associated box (KRAB) repression domain at the N-terminus. However, from the analysis of these deletion constructs further repression activity was found at the C-terminus of SSX1. Which has been called the SSXRD (SSX Repression Domain). The potent repression exerted by full-length SSX1 appears to localise to this region.
Changed!
smart SET 111aa 1e-04
in ref transcript
SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain. Putative methyl transferase, based on outlier plant homologues.
PRELP
rs.PRELP.F1 rs.PRELP.R1 90 100
NCBIGene 36.3 5549
Alternative 5-prime, size difference: 10
Exclusion in 5'UTR
Reference transcript: NM_002725
cd LRR_RI 220aa 4e-04
in ref transcript
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
smart LRRNT 36aa 2e-04
in ref transcript
Leucine rich repeat N-terminal domain.
COG COG4886 219aa 9e-07
in ref transcript
Leucine-rich repeat (LRR) protein [Function unknown].
PREPL
rs.PREPL.F1 rs.PREPL.R1 133 331
NCBIGene 36.3 9581
Single exon skipping, size difference: 198
Exclusion in the protein (no frameshift)
Reference transcript: NM_006036
pfam Peptidase_S9 162aa 1e-20
in ref transcript
Prolyl oligopeptidase family.
Changed!
pfam Peptidase_S9_N 320aa 2e-18
in ref transcript
Prolyl oligopeptidase, N-terminal beta-propeller domain. This unusual 7-stranded beta-propeller domain protects the catalytic triad of prolyl oligopeptidase (see pfam00326), excluding larger peptides and proteins from proteolysis in the cytosol.
Changed!
COG PtrB 551aa 7e-83
in ref transcript
Protease II [Amino acid transport and metabolism].
Changed!
pfam Peptidase_S9_N 255aa 4e-17
in modified transcript
Changed!
COG PtrB 213aa 3e-49
in modified transcript
Changed!
COG PtrB 273aa 3e-25
in modified transcript
Changed!
COG DAP2 451aa 6e-07
in modified transcript
Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [Amino acid transport and metabolism].
PREPL
rs.PREPL.F2 rs.PREPL.R2 162 348
NCBIGene 36.3 9581
Single exon skipping, size difference: 186
Exclusion in the protein (no frameshift)
Reference transcript: NM_006036
pfam Peptidase_S9 162aa 1e-20
in ref transcript
Prolyl oligopeptidase family.
Changed!
pfam Peptidase_S9_N 320aa 2e-18
in ref transcript
Prolyl oligopeptidase, N-terminal beta-propeller domain. This unusual 7-stranded beta-propeller domain protects the catalytic triad of prolyl oligopeptidase (see pfam00326), excluding larger peptides and proteins from proteolysis in the cytosol.
Changed!
COG PtrB 551aa 7e-83
in ref transcript
Protease II [Amino acid transport and metabolism].
Changed!
pfam Peptidase_S9_N 212aa 5e-14
in modified transcript
Changed!
COG PtrB 267aa 7e-52
in modified transcript
Changed!
COG PtrB 212aa 6e-20
in modified transcript
PRF1
rs.PRF1.F1 rs.PRF1.R1 103 129
NCBIGene 36.3 5551
Alternative 3-prime, size difference: 26
Inclusion in 5'UTR
Reference transcript: NM_005041
cd C2 82aa 1e-11
in ref transcript
Protein kinase C conserved region 2 (CalB); Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands.
smart MACPF 203aa 4e-44
in ref transcript
membrane-attack complex / perforin.
pfam C2 81aa 5e-15
in ref transcript
C2 domain.
COG COG5038 97aa 6e-04
in ref transcript
Ca2+-dependent lipid-binding protein, contains C2 domain [General function prediction only].
PRKCD
rs.PRKCD.F1 rs.PRKCD.R1 180 292
NCBIGene 36.3 5580
Single exon skipping, size difference: 112
Exclusion in 5'UTR
Reference transcript: NM_006254
cd STKc_nPKC_delta 316aa 1e-179
in ref transcript
STKc_nPKC_delta: Serine/Threonine Kinases (STKs), Novel Protein Kinase C (nPKC), delta isoform, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The nPKC subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. PKCs are classified into three groups (classical, atypical, and novel) depending on their mode of activation and the structural characteristics of their regulatory domain. nPKCs are calcium-independent, but require DAG (1,2-diacylglycerol) and phosphatidylserine (PS) for activity. There are four nPKC isoforms, delta, epsilon, eta, and theta. PKC-delta plays a role in cell cycle regulation and programmed cell death in many cell types. It slows down cell proliferation, inducing cell cycle arrest and enhancing cell differentiation. PKC-delta is also involved in the regulation of transcription as well as immune and inflammatory responses. It plays a central role in the genotoxic stress response that leads to DNA damaged-induced apoptosis.
cd C1 50aa 2e-13
in ref transcript
Protein kinase C conserved region 1 (C1) . Cysteine-rich zinc binding domain. Some members of this domain family bind phorbol esters and diacylglycerol, some are reported to bind RasGTP. May occur in tandem arrangement. Diacylglycerol (DAG) is a second messenger, released by activation of Phospholipase D. Phorbol Esters (PE) can act as analogues of DAG and mimic its downstream effects in, for example, tumor promotion. Protein Kinases C are activated by DAG/PE, this activation is mediated by their N-terminal conserved region (C1). DAG/PE binding may be phospholipid dependent. C1 domains may also mediate DAG/PE signals in chimaerins (a family of Rac GTPase activating proteins), RasGRPs (exchange factors for Ras/Rap1), and Munc13 isoforms (scaffolding proteins involved in exocytosis).
cd C1 50aa 8e-10
in ref transcript
smart S_TKc 245aa 8e-73
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
pfam C1_1 53aa 6e-18
in ref transcript
Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain.
smart S_TK_X 64aa 2e-17
in ref transcript
Extension to Ser/Thr-type protein kinases.
pfam C1_1 50aa 2e-11
in ref transcript
PTZ PTZ00263 323aa 1e-68
in ref transcript
protein kinase A catalytic subunit; Provisional.
PRKDC
rs.PRKDC.F1 rs.PRKDC.R1 305 398
NCBIGene 36.3 5591
Single exon skipping, size difference: 93
Exclusion in the protein (no frameshift)
Reference transcript: NM_006904
Changed!
cd PIKKc_DNA-PK 297aa 1e-115
in ref transcript
DNA-dependent protein kinase (DNA-PK), catalytic domain; The DNA-PK catalytic domain subfamily is part of a larger superfamily that includes the catalytic domains of other kinases such as the typical serine/threonine/tyrosine protein kinases (PKs), aminoglycoside phosphotransferase, choline kinase, and RIO kinases. DNA-PK is a member of the phosphoinositide 3-kinase-related protein kinase (PIKK) subfamily. PIKKs have intrinsic serine/threonine kinase activity and are distinguished from other PKs by their unique catalytic domain, similar to that of lipid PI3K, and their large molecular weight (240-470 kDa). DNA-PK is comprised of a regulatory subunit, containing the Ku70/80 subunit, and a catalytic subunit, which contains a NUC194 domain of unknown function, a FAT (FRAP, ATM and TRRAP) domain, a catalytic domain, and a FATC domain at the C-terminus. It is part of a multi-component system involved in non-homologous end joining (NHEJ), a process of repairing double strand breaks (DSBs) by joining together two free DNA ends of little homology. DNA-PK functions as a molecular sensor for DNA damage that enhances the signal via phosphorylation of downstream targets. It may also act as a protein scaffold that aids the localization of DNA repair proteins to the site of DNA damage. DNA-PK also plays a role in the maintenance of telomeric stability and the prevention of chromosomal end fusion.
pfam NUC194 398aa 0.0
in ref transcript
NUC194 domain. This is domain B in the catalytic subunit of DNA-dependent protein kinases.
Changed!
pfam PI3_PI4_kinase 268aa 1e-57
in ref transcript
Phosphatidylinositol 3- and 4-kinase. Some members of this family probably do not have lipid kinase activity and are protein kinases.
pfam FAT 448aa 2e-52
in ref transcript
FAT domain. The FAT domain is named after FRAP, ATM and TRRAP.
pfam FATC 32aa 5e-05
in ref transcript
FATC domain. The FATC domain is named after FRAP, ATM, TRRAP C-terminal. The solution structure of the FATC domain suggests it plays a role in redox-dependent structural and cellular stability.
Changed!
COG TEL1 579aa 2e-48
in ref transcript
Phosphatidylinositol kinase and protein kinases of the PI-3 kinase family [Signal transduction mechanisms / Cell division and chromosome partitioning / Chromatin structure and dynamics / DNA replication, recombination, and repair / Intracellular trafficking and secretion].
Changed!
cd PIKKc_DNA-PK 266aa 1e-114
in modified transcript
Changed!
pfam PI3_PI4_kinase 237aa 1e-51
in modified transcript
Changed!
COG TEL1 548aa 7e-44
in modified transcript
PRR3
rs.PRR3.F1 rs.PRR3.R1 194 257
NCBIGene 36.3 80742
Single exon skipping, size difference: 63
Exclusion in the protein (no frameshift)
Reference transcript: NM_025263
PRR5
rs.PRR5.F1 rs.PRR5.R1 128 245
NCBIGene 36.3 55615
Single exon skipping, size difference: 117
Exclusion of the protein initiation site
Reference transcript: NM_001017528
Changed!
pfam HbrB 132aa 6e-40
in ref transcript
HbrB-like. HbrB is involved hyphal growth and polarity.
Changed!
pfam HbrB 71aa 4e-22
in modified transcript
PSAT1
rs.PSAT1.F1 rs.PSAT1.R1 208 346
NCBIGene 36.3 29968
Single exon skipping, size difference: 138
Exclusion in the protein (no frameshift)
Reference transcript: NM_058179
Changed!
cd PSAT_like 359aa 1e-175
in ref transcript
Phosphoserine aminotransferase (PSAT) family. This family belongs to pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I). The major group in this CD corresponds to phosphoserine aminotransferase (PSAT). PSAT is active as a dimer and catalyzes the conversion of phosphohydroxypyruvate to phosphoserine.
Changed!
TIGR serC_1 354aa 1e-174
in ref transcript
This model represents the common form of the phosphoserine aminotransferase SerC. The phosphoserine aminotransferase of the archaeon Methanosarcina barkeri and putative phosphoserine aminotransferase of Mycobacterium tuberculosis are represented by separate models. All are members of the class V aminotransferases (pfam00266).
Changed!
PRK PRK05355 359aa 1e-161
in ref transcript
phosphoserine aminotransferase; Provisional.
Changed!
cd PSAT_like 313aa 1e-148
in modified transcript
Changed!
TIGR serC_1 308aa 1e-147
in modified transcript
Changed!
PRK PRK05355 313aa 1e-137
in modified transcript
PSG11
rs.PSG11.F1 rs.PSG11.R1 195 263
NCBIGene 36.3 5680
Alternative 3-prime, size difference: 68
Inclusion in 3'UTR
Reference transcript: NM_002785
cd IGcam 79aa 3e-07
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IGcam 81aa 3e-04
in ref transcript
pfam V-set 93aa 3e-10
in ref transcript
Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
smart IG_like 74aa 1e-08
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
smart IG_like 61aa 7e-04
in ref transcript
PSMA4
rs.PSMA4.F1 rs.PSMA4.R1 104 180
NCBIGene 36.3 5685
Alternative 5-prime, size difference: 76
Exclusion in 5'UTR
Reference transcript: NM_002789
cd proteasome_alpha_type_4 214aa 1e-108
in ref transcript
proteasome_alpha_type_4. The 20S proteasome, multisubunit proteolytic complex, is the central enzyme of nonlysosomal protein degradation in both the cytosol and nucleus. It is composed of 28 subunits arranged as four homoheptameric rings that stack on top of one another forming an elongated alpha-beta-beta-alpha cylinder with a central cavity. The proteasome alpha and beta subunits are members of the N-terminal nucleophile (Ntn)-hydrolase superfamily. Their N-terminal threonine residues are exposed as a nucleophile in peptide bond hydrolysis. Mammals have 7 alpha and 7 beta proteasome subunits while archaea have one of each.
TIGR arc_protsome_A 233aa 8e-62
in ref transcript
This protein family describes the archaeal proteasome alpha subunit, homologous to both the beta subunit and to the alpha and beta subunits of eukaryotic proteasome subunits. This family is universal in the first 29 complete archaeal genomes but occasionally is duplicated.
PTZ PTZ00246 237aa 7e-86
in ref transcript
proteasome subunit alpha; Provisional.
PSMA4
rs.PSMA4.F2 rs.PSMA4.R2 120 283
NCBIGene 36.3 5685
Single exon skipping, size difference: 163
Exclusion in the protein causing a frameshift
Reference transcript: NM_001102667
Changed!
cd proteasome_alpha_type_4 214aa 1e-108
in ref transcript
proteasome_alpha_type_4. The 20S proteasome, multisubunit proteolytic complex, is the central enzyme of nonlysosomal protein degradation in both the cytosol and nucleus. It is composed of 28 subunits arranged as four homoheptameric rings that stack on top of one another forming an elongated alpha-beta-beta-alpha cylinder with a central cavity. The proteasome alpha and beta subunits are members of the N-terminal nucleophile (Ntn)-hydrolase superfamily. Their N-terminal threonine residues are exposed as a nucleophile in peptide bond hydrolysis. Mammals have 7 alpha and 7 beta proteasome subunits while archaea have one of each.
Changed!
TIGR arc_protsome_A 233aa 8e-62
in ref transcript
This protein family describes the archaeal proteasome alpha subunit, homologous to both the beta subunit and to the alpha and beta subunits of eukaryotic proteasome subunits. This family is universal in the first 29 complete archaeal genomes but occasionally is duplicated.
Changed!
PTZ PTZ00246 237aa 7e-86
in ref transcript
proteasome subunit alpha; Provisional.
Changed!
cd proteasome_alpha_type_4 14aa 6e-04
in modified transcript
Changed!
PTZ PTZ00246 16aa 9e-05
in modified transcript
PSMD13
rs.PSMD13.F1 rs.PSMD13.R1 217 296
NCBIGene 36.3 5719
Single exon skipping, size difference: 79
Inclusion in the protein causing a frameshift
Reference transcript: NM_175932
Changed!
pfam PCI 105aa 5e-14
in ref transcript
PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15).
PTBP1
rs.PTBP1.F1 rs.PTBP1.R1 103 124
NCBIGene 36.3 5725
Alternative 3-prime, size difference: 21
Exclusion in the protein (no frameshift)
Reference transcript: NM_002819
cd RRM 71aa 4e-10
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
cd RRM 72aa 2e-08
in ref transcript
cd RRM 73aa 6e-07
in ref transcript
cd RRM 71aa 7e-05
in ref transcript
Changed!
TIGR hnRNP-L_PTB 501aa 1e-180
in ref transcript
Included in this family of heterogeneous ribonucleoproteins are PTB (polypyrimidine tract binding protein ) and hnRNP-L. These proteins contain four RNA recognition motifs (rrm: pfam00067).
COG COG0724 171aa 9e-07
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
Changed!
TIGR hnRNP-L_PTB 494aa 1e-175
in modified transcript
PTCH1
rs.PTCH1.F1 PTCH.R5 100 117
NCBIGene 36.3 5727
Alternative 5-prime, size difference: 17
Exclusion in 5'UTR
Reference transcript: NM_001083604
TIGR 2A060602 1055aa 0.0
in ref transcript
COG COG1033 173aa 2e-09
in ref transcript
Predicted exporters of the RND superfamily [General function prediction only].
COG COG1033 108aa 1e-04
in ref transcript
PTCH1
rs.PTCH1.F2 rs.PTCH1.R2 216 370
NCBIGene 36.3 5727
Alternative 5-prime, size difference: 154
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001083603
Changed!
TIGR 2A060602 1141aa 0.0
in ref transcript
Changed!
COG COG1033 173aa 2e-09
in ref transcript
Predicted exporters of the RND superfamily [General function prediction only].
Changed!
COG COG1033 108aa 1e-04
in ref transcript
Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_000957
pfam 7tm_1 252aa 1e-09
in ref transcript
7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
PTGER3
rs.PTGER3.F2 rs.PTGER3.R2 163 255
NCBIGene 36.3 5733
Single exon skipping, size difference: 92
Inclusion in the protein causing a frameshift
Reference transcript: NM_000957
pfam 7tm_1 252aa 1e-09
in ref transcript
7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
PTGS1
rs.PTGS1.F1 rs.PTGS1.R1 135 246
NCBIGene 36.3 5742
Alternative 5-prime, size difference: 111
Exclusion in the protein (no frameshift)
Reference transcript: NM_000962
cd EGF_CA 38aa 0.005
in ref transcript
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Changed!
pfam An_peroxidase 356aa 2e-86
in ref transcript
Animal haem peroxidase.
pfam EGF 31aa 0.001
in ref transcript
EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.
Changed!
pfam An_peroxidase 319aa 9e-78
in modified transcript
PTPN13
rs.PTPN13.F1 PTPN13.R3 104 119
NCBIGene 36.3 5783
Alternative 5-prime, size difference: 15
Exclusion in the protein (no frameshift)
Reference transcript: NM_080685
cd PTPc 228aa 3e-79
in ref transcript
Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
cd PDZ_signaling 85aa 7e-16
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
cd PDZ_signaling 80aa 1e-15
in ref transcript
Changed!
cd PDZ_signaling 90aa 2e-13
in ref transcript
cd FERM_C 94aa 2e-13
in ref transcript
The FERM_C domain is the third structural domain within the FERM domain. The FERM domain is found in the cytoskeletal-associated proteins such as ezrin, moesin, radixin, 4.1R, and merlin. These proteins provide a link between the membrane and cytoskeleton and are involved in signal transduction pathways. The FERM_C domain is also found in protein tyrosine phosphatases (PTPs) , the tryosine kinases FAKand JAK, in addition to other proteins involved in signaling. This domain is structuraly similar to the PH and PTB domains and consequently is capable of binding to both peptides and phospholipids at different sites.
cd PDZ_signaling 79aa 2e-08
in ref transcript
cd PDZ_signaling 88aa 6e-08
in ref transcript
smart PTPc 253aa 5e-91
in ref transcript
Protein tyrosine phosphatase, catalytic domain.
smart KIND 188aa 2e-42
in ref transcript
kinase non-catalytic C-lobe domain. It is an interaction domain identified as being similar to the C-terminal protein kinase catalytic fold (C lobe). Its presence at the N terminus of signalling proteins and the absence of the active-site residues in the catalytic and activation loops suggest that it folds independently and is likely to be non-catalytic. The occurrence of KIND only in metazoa implies that it has evolved from the catalytic protein kinase domain into an interaction domain possibly by keeping the substrate-binding features.
smart B41 210aa 4e-40
in ref transcript
Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
pfam FERM_C 88aa 2e-17
in ref transcript
FERM C-terminal PH-like domain.
smart PDZ 90aa 1e-16
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
Changed!
smart PDZ 92aa 2e-15
in ref transcript
smart PDZ 83aa 1e-13
in ref transcript
smart PDZ 92aa 2e-11
in ref transcript
smart PDZ 84aa 9e-08
in ref transcript
COG PTP2 267aa 1e-40
in ref transcript
Protein tyrosine phosphatase [Signal transduction mechanisms].
Changed!
cd PDZ_signaling 85aa 3e-15
in modified transcript
Changed!
smart PDZ 87aa 4e-17
in modified transcript
PTPN13
rs.PTPN13.F2 rs.PTPN13.R2 137 194
NCBIGene 36.3 5783
Single exon skipping, size difference: 57
Exclusion in the protein (no frameshift)
Reference transcript: NM_080685
cd PTPc 228aa 3e-79
in ref transcript
Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
cd PDZ_signaling 85aa 7e-16
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
cd PDZ_signaling 80aa 1e-15
in ref transcript
cd PDZ_signaling 90aa 2e-13
in ref transcript
cd FERM_C 94aa 2e-13
in ref transcript
The FERM_C domain is the third structural domain within the FERM domain. The FERM domain is found in the cytoskeletal-associated proteins such as ezrin, moesin, radixin, 4.1R, and merlin. These proteins provide a link between the membrane and cytoskeleton and are involved in signal transduction pathways. The FERM_C domain is also found in protein tyrosine phosphatases (PTPs) , the tryosine kinases FAKand JAK, in addition to other proteins involved in signaling. This domain is structuraly similar to the PH and PTB domains and consequently is capable of binding to both peptides and phospholipids at different sites.
cd PDZ_signaling 79aa 2e-08
in ref transcript
cd PDZ_signaling 88aa 6e-08
in ref transcript
smart PTPc 253aa 5e-91
in ref transcript
Protein tyrosine phosphatase, catalytic domain.
smart KIND 188aa 2e-42
in ref transcript
kinase non-catalytic C-lobe domain. It is an interaction domain identified as being similar to the C-terminal protein kinase catalytic fold (C lobe). Its presence at the N terminus of signalling proteins and the absence of the active-site residues in the catalytic and activation loops suggest that it folds independently and is likely to be non-catalytic. The occurrence of KIND only in metazoa implies that it has evolved from the catalytic protein kinase domain into an interaction domain possibly by keeping the substrate-binding features.
smart B41 210aa 4e-40
in ref transcript
Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
pfam FERM_C 88aa 2e-17
in ref transcript
FERM C-terminal PH-like domain.
smart PDZ 90aa 1e-16
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
smart PDZ 92aa 2e-15
in ref transcript
smart PDZ 83aa 1e-13
in ref transcript
smart PDZ 92aa 2e-11
in ref transcript
smart PDZ 84aa 9e-08
in ref transcript
COG PTP2 267aa 1e-40
in ref transcript
Protein tyrosine phosphatase [Signal transduction mechanisms].
Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
smart PTPc 253aa 9e-87
in ref transcript
Protein tyrosine phosphatase, catalytic domain.
COG PTP2 237aa 1e-45
in ref transcript
Protein tyrosine phosphatase [Signal transduction mechanisms].
PTPRA
rs.PTPRA.F1 rs.PTPRA.R1 101 191
NCBIGene 36.3 5786
Single exon skipping, size difference: 90
Inclusion in 5'UTR
Reference transcript: NM_002836
cd PTPc 231aa 3e-89
in ref transcript
Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
cd PTPc 231aa 5e-85
in ref transcript
smart PTPc 259aa 1e-97
in ref transcript
Protein tyrosine phosphatase, catalytic domain.
smart PTPc 261aa 1e-93
in ref transcript
COG PTP2 231aa 2e-43
in ref transcript
Protein tyrosine phosphatase [Signal transduction mechanisms].
COG PTP2 241aa 3e-43
in ref transcript
PTPRD
rs.PTPRD.F1 rs.PTPRD.R1 102 114
NCBIGene 36.3 5789
Single exon skipping, size difference: 12
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_002839
cd PTPc 230aa 5e-92
in ref transcript
Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
cd PTPc 231aa 8e-90
in ref transcript
cd FN3 92aa 3e-13
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd FN3 90aa 7e-13
in ref transcript
cd IGcam 88aa 2e-12
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IGcam 91aa 4e-12
in ref transcript
cd FN3 95aa 8e-12
in ref transcript
cd FN3 93aa 2e-11
in ref transcript
cd FN3 89aa 3e-11
in ref transcript
cd FN3 107aa 4e-09
in ref transcript
cd FN3 82aa 2e-06
in ref transcript
cd IGcam 78aa 1e-05
in ref transcript
smart PTPc 256aa 1e-101
in ref transcript
Protein tyrosine phosphatase, catalytic domain.
smart PTPc 260aa 1e-100
in ref transcript
pfam I-set 92aa 5e-16
in ref transcript
Immunoglobulin I-set domain.
pfam fn3 85aa 8e-15
in ref transcript
Fibronectin type III domain.
pfam fn3 88aa 2e-13
in ref transcript
pfam fn3 83aa 2e-13
in ref transcript
smart FN3 79aa 3e-12
in ref transcript
Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
smart IGc2 74aa 7e-11
in ref transcript
Immunoglobulin C-2 Type.
smart FN3 83aa 2e-10
in ref transcript
pfam fn3 100aa 5e-10
in ref transcript
smart IG_like 79aa 7e-08
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
smart FN3 81aa 4e-05
in ref transcript
COG PTP2 268aa 1e-48
in ref transcript
Protein tyrosine phosphatase [Signal transduction mechanisms].
COG PTP2 281aa 9e-45
in ref transcript
PTPRM
rs.PTPRM.F1 rs.PTPRM.R1 326 365
NCBIGene 36.3 5797
Multiple exon skipping, size difference: 39
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_001105244
cd PTPc 228aa 3e-89
in ref transcript
Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
cd PTPc 233aa 7e-68
in ref transcript
cd MAM 156aa 8e-41
in ref transcript
Meprin, A5 protein, and protein tyrosine phosphatase Mu (MAM) domain. MAM is an extracellular domain which mediates protein-protein interactions and is found in a diverse set of proteins, many of which are known to function in cell adhesion. Members include: type IIB receptor protein tyrosine phosphatases (such as RPTPmu), meprins (plasma membrane metalloproteases), neuropilins (receptors of secreted semaphorins), and zonadhesins (sperm-specific membrane proteins which bind to the extracellular matrix of the egg). In meprin A and neuropilin-1 and -2, MAM is involved in homo-oligomerization. In RPTPmu, it has been associated with both homophilic adhesive (trans) interactions and lateral (cis) receptor oligomerization. In a GPI-anchored protein that is expressed in cells in the embryonic chicken spinal chord, MDGA1, the MAM domain has been linked to heterophilic interactions with axon-rich region.
cd FN3 83aa 1e-06
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd FN3 95aa 8e-04
in ref transcript
cd FN3 93aa 0.004
in ref transcript
smart PTPc 255aa 1e-102
in ref transcript
Protein tyrosine phosphatase, catalytic domain.
smart PTPc 263aa 3e-74
in ref transcript
smart MAM 160aa 2e-49
in ref transcript
Domain in meprin, A5, receptor protein tyrosine phosphatase mu (and others). Likely to have an adhesive function. Mutations in the meprin MAM domain affect noncovalent associations within meprin oligomers. In receptor tyrosine phosphatase mu-like molecules the MAM domain is important for homophilic cell-cell interactions.
pfam fn3 93aa 4e-07
in ref transcript
Fibronectin type III domain.
smart FN3 85aa 4e-05
in ref transcript
Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
smart IG_like 88aa 3e-04
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
smart FN3 80aa 0.002
in ref transcript
COG PTP2 228aa 3e-45
in ref transcript
Protein tyrosine phosphatase [Signal transduction mechanisms].
COG PTP2 247aa 2e-23
in ref transcript
PTPRS
rs.PTPRS.F1 rs.PTPRS.R1 101 113
NCBIGene 36.3 5802
Single exon skipping, size difference: 12
Exclusion in the protein (no frameshift)
Reference transcript: NM_002850
cd PTPc 230aa 5e-95
in ref transcript
Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
cd PTPc 231aa 2e-90
in ref transcript
cd FN3 90aa 6e-14
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd FN3 89aa 2e-12
in ref transcript
cd IGcam 87aa 4e-12
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd FN3 93aa 4e-11
in ref transcript
cd IGcam 91aa 5e-11
in ref transcript
cd FN3 90aa 2e-07
in ref transcript
cd FN3 80aa 2e-07
in ref transcript
cd FN3 106aa 2e-07
in ref transcript
cd IGcam 83aa 2e-06
in ref transcript
cd FN3 83aa 6e-05
in ref transcript
cd FN3 64aa 0.001
in ref transcript
smart PTPc 256aa 1e-106
in ref transcript
Protein tyrosine phosphatase, catalytic domain.
smart PTPc 260aa 1e-100
in ref transcript
pfam I-set 92aa 2e-17
in ref transcript
Immunoglobulin I-set domain.
smart FN3 79aa 1e-12
in ref transcript
Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
smart FN3 80aa 2e-12
in ref transcript
smart FN3 83aa 5e-11
in ref transcript
smart IGc2 74aa 1e-10
in ref transcript
Immunoglobulin C-2 Type.
pfam fn3 80aa 7e-09
in ref transcript
Fibronectin type III domain.
smart IG_like 79aa 7e-09
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
pfam fn3 91aa 4e-08
in ref transcript
pfam fn3 99aa 2e-07
in ref transcript
smart FN3 79aa 6e-05
in ref transcript
smart FN3 62aa 3e-04
in ref transcript
COG PTP2 268aa 6e-51
in ref transcript
Protein tyrosine phosphatase [Signal transduction mechanisms].
COG PTP2 266aa 2e-48
in ref transcript
PUS1
rs.PUS1.F1 rs.PUS1.R1 137 529
NCBIGene 36.3 80324
Alternative 5-prime, size difference: 392
Exclusion of the protein initiation site
Reference transcript: NM_025215
cd PseudoU_synth_PUS1_PUS2 253aa 8e-91
in ref transcript
PseudoU_synth_PUS1_PUS2: Pseudouridine synthase, PUS1/ PUS2 like. This group consists of eukaryotic pseudouridine synthases similar to Saccharomyces cerevisiae Pus1p, S. cerevisiae Pus2p, Caenorhabditis elegans Pus1p and human PUS1. Pseudouridine synthases catalyze the isomerization of specific uridines in an RNA molecule to pseudouridines (5-ribosyluracil, psi). No cofactors are required. S. cerevisiae Pus1p catalyzes the formation of psi34 and psi36 in the intron-containing tRNAIle, psi35 in the intron-containing tRNATyr, psi27 and/or psi28 in several yeast cytoplasmic tRNAs and, psi44 in U2 small nuclear RNA (U2 snRNA). The presence of the intron is required for the formation of psi 34, 35 and 36. In addition S. cerevisiae PUS1 makes are psi 26, 65 and 67. C. elegans Pus1p does not modify psi44 in U2 snRNA. Mouse Pus1p makes psi27/28 in pre- tRNASer , tRNAVal and tRNAIle, psi 34/36 in tRNAIle and, psi 32 and potentially 67 in tRNAVal. Psi44 in U2 snRNA and psi32 in tRNAs are highly phylogenetically conserved. Psi 26,27,28,34,35,36,65 and 67 in tRNAs are less highly conserved. Mouse Pus1p regulates nuclear receptor activity through pseudouridylation of Steroid Receptor RNA Activator. Missense mutation in human PUS1 causes mitochondrial myopathy and sideroblastic anemia (MLASA).
TIGR hisT_truA 252aa 1e-36
in ref transcript
universal so far, single copy in all prokaryotes, 3 in yeast. Trusted cutoff for orthology is about 100 based on 1 match only in complete prokaryote with length > 200.
COG TruA 280aa 2e-44
in ref transcript
Pseudouridylate synthase [Translation, ribosomal structure and biogenesis].
PUS7L
rs.PUS7L.F1 rs.PUS7L.R1 122 537
NCBIGene 36.3 83448
Alternative 5-prime, size difference: 415
Exclusion in 5'UTR
Reference transcript: NM_001098615
cd PseudoU_synth_ScPUS7 401aa 1e-111
in ref transcript
PseudoU_synth_ScPUS7: Pseudouridine synthase, TruD family. This group consists of eukaryotic pseudouridine synthases similar to Saccharomyces cerevisiae Pus7. Pseudouridine synthases catalyze the isomerization of specific uridines in an RNA molecule to pseudouridines (5-ribosyluracil, psi). Saccharomyces cerevisiae Pus7 makes psi35 in U2 small nuclear RNA (U2 snRNA), psi13 in cytoplasmic tRNAs and psi35 in pre-tRNATyr. Psi35 in yeast U2 snRNA and psi13 in tRNAs are highly phylogenetically conserved. Psi34 is the mammalian U2 snRNA counterpart of yeast U2 snRNA psi35.
cd PseudoU_synth_ScPUS7 28aa 0.002
in ref transcript
TIGR TIGR00094 400aa 1e-33
in ref transcript
MJ11364 is a strong partial match from 50 to 230 aa.
COG COG0585 405aa 1e-50
in ref transcript
Uncharacterized conserved protein [Function unknown].
PWWP2B
rs.PWWP2B.F1 rs.PWWP2B.R1 127 430
NCBIGene 36.3 170394
Alternative 5-prime, size difference: 303
Exclusion of the stop codon
Reference transcript: NM_138499
Changed!
cd Dnmt3b_related 87aa 5e-32
in ref transcript
The PWWP domain is an essential component of DNA methyltransferase 3 B (Dnmt3b) which is responsible for establishing DNA methylation patterns during embryogenesis and gametogenesis. In tumorigenesis, DNA methylation by Dnmt3b is known to play a role in the inactivation of tumor suppressor genes. In addition, a point mutation in the PWWP domain of Dnmt3b has been identified in patients with ICF syndrome (immunodeficiency, centromeric instability, and facial anomalies), a rare autosomal recessive disorder characterized by hypomethylation of classical satellite DNA. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
Changed!
pfam PWWP 71aa 4e-10
in ref transcript
PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown.
PXMP3
rs.PXMP3.F1 rs.PXMP3.R1 209 241
NCBIGene 36.3 5828
Single exon skipping, size difference: 32
Exclusion in 5'UTR
Reference transcript: NM_000318
pfam Pex2_Pex12 207aa 4e-47
in ref transcript
Pex2 / Pex12 amino terminal region. This region is found at the N terminal of a number of known and predicted peroxins including Pex2, Pex10 and Pex12. This conserved region is usually associated with a C terminal ring finger (pfam00097) domain.
smart RING 40aa 0.001
in ref transcript
Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
COG PEX10 46aa 0.002
in ref transcript
RING-finger-containing E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones].
PYHIN1
rs.PYHIN1.F1 rs.PYHIN1.R1 139 166
NCBIGene 36.3 149628
Alternative 3-prime, size difference: 27
Exclusion in the protein (no frameshift)
Reference transcript: NM_152501
pfam HIN 159aa 3e-51
in ref transcript
HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758.
pfam PAAD_DAPIN 79aa 6e-17
in ref transcript
PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain.
RABEP1
rs.RABEP1.F1 rs.RABEP1.R1 283 382
NCBIGene 36.3 9135
Single exon skipping, size difference: 99
Exclusion in the protein (no frameshift)
Reference transcript: NM_004703
Changed!
pfam Rab5-bind 196aa 3e-25
in ref transcript
Rab5 binding. Members of this family are predominantly found in Rabaptin and allow for binding to the GTPase Rab5. This interaction is necessary and sufficient for Rab5-dependent recruitment of Rabaptin5 to early endosomal membranes.
pfam Rabaptin 151aa 6e-20
in ref transcript
Rabaptin.
pfam Rabaptin 111aa 2e-17
in ref transcript
TIGR SMC_prok_B 296aa 4e-06
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
Changed!
COG Smc 302aa 5e-08
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
COG Smc 264aa 5e-04
in ref transcript
Changed!
pfam Rab5-bind 163aa 5e-20
in modified transcript
Changed!
TIGR SMC_prok_B 280aa 1e-04
in modified transcript
Changed!
COG Smc 289aa 1e-06
in modified transcript
RAC1
rs.RAC1.F2 rs.RAC1.R2 115 172
NCBIGene 36.3 5879
Single exon skipping, size difference: 57
Exclusion in the protein (no frameshift)
Reference transcript: NM_018890
Changed!
cd Rac1_like 192aa 1e-107
in ref transcript
Rac1-like subfamily. The Rac1-like subfamily consists of Rac1, Rac2, and Rac3 proteins, plus the splice variant Rac1b that contains a 19-residue insertion near switch II relative to Rac1. While Rac1 is ubiquitously expressed, Rac2 and Rac3 are largely restricted to hematopoietic and neural tissues respectively. Rac1 stimulates the formation of actin lamellipodia and membrane ruffles. It also plays a role in cell-matrix adhesion and cell anoikis. In intestinal epithelial cells, Rac1 is an important regulator of migration and mediates apoptosis. Rac1 is also essential for RhoA-regulated actin stress fiber and focal adhesion complex formation. In leukocytes, Rac1 and Rac2 have distinct roles in regulating cell morphology, migration, and invasion, but are not essential for macrophage migration or chemotaxis. Rac3 has biochemical properties that are closely related to Rac1, such as effector interaction, nucleotide binding, and hydrolysis; Rac2 has a slower nucleotide association and is more efficiently activated by the RacGEF Tiam1. Both Rac1 and Rac3 have been implicated in the regulation of cell migration and invasion in human metastatic breast cancer. Most Rho proteins contain a lipid modification site at the C-terminus, with a typical sequence motif CaaX, where a = an aliphatic amino acid and X = any amino acid. Lipid binding is essential for membrane attachment, a key feature of most Rho proteins. Due to the presence of truncated sequences in this CD, the lipid modification site is not available for annotation.
Changed!
smart RHO 189aa 1e-98
in ref transcript
Rho (Ras homology) subfamily of Ras-like small GTPases. Members of this subfamily of Ras-like small GTPases include Cdc42 and Rac, as well as Rho isoforms.
Changed!
COG COG1100 193aa 1e-27
in ref transcript
GTPase SAR1 and related small G proteins [General function prediction only].
Changed!
cd Rac1_like 173aa 1e-111
in modified transcript
Changed!
smart RHO 170aa 1e-102
in modified transcript
Changed!
COG COG1100 174aa 3e-31
in modified transcript
RAD17
rs.RAD17.F1 rs.RAD17.R1 154 338
NCBIGene 36.3 5884
Single exon skipping, size difference: 184
Inclusion in the protein causing a new stop codon
Reference transcript: NM_133339
Changed!
cd AAA 37aa 0.001
in ref transcript
The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
Changed!
TIGR rad24 640aa 0.0
in ref transcript
This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Changed!
PRK PRK04195 423aa 1e-16
in ref transcript
replication factor C large subunit; Provisional.
RAD17
rs.RAD17.F2 rs.RAD17.R2 114 372
NCBIGene 36.3 5884
Single exon skipping, size difference: 258
Exclusion in the protein (no frameshift)
Reference transcript: NM_133343
cd AAA 37aa 0.001
in ref transcript
The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
Changed!
TIGR rad24 640aa 0.0
in ref transcript
This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Changed!
PRK PRK04195 423aa 8e-17
in ref transcript
replication factor C large subunit; Provisional.
Changed!
TIGR rad24 553aa 0.0
in modified transcript
Changed!
PRK PRK04195 406aa 9e-12
in modified transcript
RAD17
rs.RAD17.F3 rs.RAD17.R3 271 375
NCBIGene 36.3 5884
Single exon skipping, size difference: 104
Exclusion in 5'UTR
Reference transcript: NM_133338
cd AAA 37aa 0.001
in ref transcript
The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
TIGR rad24 640aa 0.0
in ref transcript
This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
PRK PRK04195 423aa 8e-17
in ref transcript
replication factor C large subunit; Provisional.
RAD51
rs.RAD51.F1 rs.RAD51.R1 122 413
NCBIGene 36.3 5888
Exon skipping and alternative 3-prime or 5-prime, size difference: 291
Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_002875
Changed!
cd Rad51_DMC1_radA 234aa 1e-98
in ref transcript
Rad51_DMC1_radA,B. This group of recombinases includes the eukaryotic proteins RAD51, RAD55/57 and the meiosis-specific protein DMC1, and the archaeal proteins radA and radB. They are closely related to the bacterial RecA group. Rad51 proteins catalyze a similiar recombination reaction as RecA, using ATP-dependent DNA binding activity and a DNA-dependent ATPase. However, this reaction is less efficient and requires accessory proteins such as RAD55/57.
Changed!
TIGR recomb_RAD51 315aa 1e-178
in ref transcript
This eukaryotic sequence family consists of RAD51, a protein involved in DNA homologous recombination and repair. It is similar in sequence the exclusively meiotic recombinase DMC1 (TIGR02238), to archaeal families RadA (TIGR02236) and RadB (TIGR02237), and to bacterial RecA (TIGR02012).
Changed!
PTZ PTZ00035 332aa 1e-125
in ref transcript
Rad51; Provisional.
Changed!
cd Rad51_DMC1_radA 168aa 1e-59
in modified transcript
Changed!
TIGR recomb_RAD51 170aa 4e-89
in modified transcript
Changed!
TIGR recomb_RAD51 56aa 2e-20
in modified transcript
Changed!
PTZ PTZ00035 173aa 3e-64
in modified transcript
Changed!
PTZ PTZ00035 67aa 1e-10
in modified transcript
RAD51L3
rs.RAD51L3.F1 rs.RAD51L3.R1 98 434
NCBIGene 36.3 5892
Multiple exon skipping, size difference: 336
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
Reference transcript: NM_002878
Changed!
cd Rad51_DMC1_radA 236aa 6e-43
in ref transcript
Rad51_DMC1_radA,B. This group of recombinases includes the eukaryotic proteins RAD51, RAD55/57 and the meiosis-specific protein DMC1, and the archaeal proteins radA and radB. They are closely related to the bacterial RecA group. Rad51 proteins catalyze a similiar recombination reaction as RecA, using ATP-dependent DNA binding activity and a DNA-dependent ATPase. However, this reaction is less efficient and requires accessory proteins such as RAD55/57.
Changed!
TIGR recomb_radA 306aa 2e-14
in ref transcript
This family consists exclusively of archaeal RadA protein, a homolog of bacterial RecA (TIGR02012), eukaryotic RAD51 (TIGR02239), and archaeal RadB (TIGR02237). This protein is involved in DNA repair and recombination. The member from Pyrococcus horikoshii contains an intein.
Changed!
PRK radB 223aa 3e-17
in ref transcript
DNA repair and recombination protein RadB; Provisional.
Changed!
cd Rad51_DMC1_radA 170aa 4e-19
in modified transcript
Changed!
PRK radB 84aa 4e-04
in modified transcript
RANBP3
rs.RANBP3.F1 rs.RANBP3.R1 111 126
NCBIGene 36.3 8498
Alternative 5-prime, size difference: 15
Exclusion in the protein (no frameshift)
Reference transcript: NM_007322
cd RanBD 125aa 3e-26
in ref transcript
Ran-binding domain; This domain of approximately 150 residues shares structural similarity to the PH domain, but lacks detectable sequence similarity. Ran is a Ras-like nuclear small GTPase, which regulates receptor-mediated transport between the nucleus and the cytoplasm. RanGTP hydrolysis is stimulated by RanGAP together with the Ran-binding domain containing acessory proteins RanBP1 and RanBP2. These accessory proteins stabilize the active GTP-bound form of Ran . The Ran-binding domain is found in multiple copies in Nuclear pore complex proteins.
smart RanBD 111aa 3e-15
in ref transcript
Ran-binding domain. Domain of apporximately 150 residues that stabilises the GTP-bound form of Ran (the Ras-like nuclear small GTPase).
COG YRB1 71aa 1e-04
in ref transcript
Ran GTPase-activating protein (Ran-binding protein) [Intracellular trafficking and secretion].
RAP1A
rs.RAP1A.F1 rs.RAP1A.R1 176 270
NCBIGene 36.3 5906
Single exon skipping, size difference: 94
Exclusion in 5'UTR
Reference transcript: NM_001010935
cd Rap_like 149aa 3e-84
in ref transcript
Rap-like subfamily. The Rap subfamily consists of the Rap1, Rap2, and RSR1. Rap subfamily proteins perform different cellular functions, depending on the isoform and its subcellular localization. For example, in rat salivary gland, neutrophils, and platelets, Rap1 localizes to secretory granules and is believed to regulate exocytosis or the formation of secretory granules. Rap1 has also been shown to localize in the Golgi of rat fibroblasts, zymogen granules, plasma membrane, and microsomal membrane of the pancreatic acini, as well as in the endocytic compartment of skeletal muscle cells and fibroblasts. Rap1 localizes in the nucleus of human oropharyngeal squamous cell carcinomas (SCCs) and cell lines. Rap1 plays a role in phagocytosis by controlling the binding of adhesion receptors (typically integrins) to their ligands. In yeast, Rap1 has been implicated in multiple functions, including activation and silencing of transcription and maintenance of telomeres. Rap2 is involved in multiple functions, including activation of c-Jun N-terminal kinase (JNK) to regulate the actin cytoskeleton and activation of the Wnt/beta-catenin signaling pathway in embryonic Xenopus. A number of effector proteins for Rap2 have been identified, including isoform 3 of the human mitogen-activated protein kinase kinase kinase kinase 4 (MAP4K4) and Traf2- and Nck-interacting kinase (TNIK), and the RalGEFs RalGDS, RGL, and Rlf, which also interact with Rap1 and Ras. RSR1 is the fungal homolog of Rap1 and Rap2. In budding yeasts, it is involved in selecting a site for bud growth, which directs the establishment of cell polarization. The Rho family GTPase Cdc42 and its GEF, Cdc24, then establish an axis of polarized growth. It is believed that Cdc42 interacts directly with RSR1 in vivo. In filamentous fungi such as Ashbya gossypii, RSR1 is a key regulator of polar growth in the hypha. Most Ras proteins contain a lipid modification site at the C-terminus, with a typical sequence motif CaaX, where a = an aliphatic amino acid and X = any amino acid. Lipid binding is essential for membrane attachment, a key feature of most Ras proteins. Due to the presence of truncated sequences in this CD, the lipid modification site is not available for annotation.
smart RAS 151aa 8e-78
in ref transcript
Ras subfamily of RAS small GTPases. Similar in fold and function to the bacterial EF-Tu GTPase. p21Ras couples receptor Tyr kinases and G protein receptors to protein kinase cascades.
COG COG1100 150aa 1e-16
in ref transcript
GTPase SAR1 and related small G proteins [General function prediction only].
RAP1GDS1
rs.RAP1GDS1.F1 rs.RAP1GDS1.R1 193 340
NCBIGene 36.3 5910
Single exon skipping, size difference: 147
Exclusion in the protein (no frameshift)
Reference transcript: NM_001100426
cd ARM 114aa 1e-08
in ref transcript
Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
Changed!
cd ARM 124aa 3e-07
in ref transcript
pfam Arm 32aa 0.003
in ref transcript
Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
Changed!
COG SRP1 57aa 0.008
in ref transcript
Karyopherin (importin) alpha [Intracellular trafficking and secretion].
Changed!
cd ARM 120aa 3e-09
in modified transcript
Changed!
COG SRP1 318aa 9e-04
in modified transcript
RARB
rs.RARB.F1 rs.RARB.R1 137 494
NCBIGene 36.3 5915
Alternative 5-prime, size difference: 357
Exclusion of the protein initiation site
Reference transcript: NM_000965
cd NR_LBD_RAR 231aa 1e-131
in ref transcript
The ligand binding domain (LBD) of retinoic acid receptor (RAR), a members of the nuclear receptor superfamily. The ligand binding domain (LBD) of retinoic acid receptor (RAR): Retinoic acid receptors are members of the nuclear receptor (NR) superfamily of ligand-regulated transcription factors. RARs mediate the biological effect of retinoids, including both naturally dietary vitamin A (retinol) metabolites and active synthetic analogs. Retinoids play key roles in a wide variety of essential biological processes, such as vertebrate embryonic morphogenesis and organogenesis, differentiation and apoptosis, and homeostasis. RARs function as heterodimers with retinoic X receptors by binding to specific RAR response elements (RAREs) found in the promoter regions of retinoid target genes. In the absence of ligand, the RAR-RXR heterodimer recruits the corepressor proteins NCoR or AMRT, and associated factors such as histone deacetylases or DNA-methyltransferases, leading to an inactive condensed chromatin structure, preventing transcription. Upon ligand binding, the corepressors are released, and coactivator complexes such as histone acetyltransferase or histone arginine methyltransferases are recruited to activate transcription. There are three RAR subtypes (alpha, beta, gamma), originating from three distinct genes. For each subtype, several isoforms exist that differ in their N-terminal region, allowing retinoids to exert their pleiotropic effects. Like other members of the nuclear receptor (NR) superfamily of ligand-activated transcription factors, retinoic acid receptors have a central well conserved DNA binding domain (DBD), a variable N-terminal domain, a non-conserved hinge and a C-terminal ligand binding domain (LBD).
Changed!
pfam zf-C4 76aa 6e-38
in ref transcript
Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other.
pfam Hormone_recep 181aa 5e-31
in ref transcript
Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors.
Changed!
pfam zf-C4 42aa 8e-17
in modified transcript
RASA4
rs.RASA4.F1 rs.RASA4.R1 141 279
NCBIGene 36.3 10156
Single exon skipping, size difference: 138
Exclusion in the protein (no frameshift)
Reference transcript: NM_006989
cd RasGAP_RASA4 331aa 1e-172
in ref transcript
Ras GTPase activating-like 4 protein (RASAL4), also known as Ca2+ -promoted Ras inactivator (CAPRI), is a member of the GAP1 family. Members of the GAP1 family are characterized by a conserved domain structure comprising N-terminal tandem C2 domains, a highly conserved central RasGAP domain, and a C-terminal pleckstrin-homology domain that is associated with a Bruton's tyrosine kinase motif. RASAL4, like RASAL, is a cytosolic protein that undergoes a rapid translocation to the plasma membrane in response to a receptor-mediated elevation in the concentration of intracellular free Ca2+ ([Ca2+]i). However, unlike RASAL, RASAL4 does not sense oscillations in [Ca2+]i.
Changed!
cd PH_RasGAP_CG9209 103aa 1e-34
in ref transcript
RAS_GTPase activating protein (GAP)_CG9209 pleckstrin homology (PH) domain. This protein consists of two C2 domains, followed by a RasGAP domain, a PH domain and a BTK domain. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinsases, regulators of G-proteins, endocytotic GTPAses, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.
cd C2_1 118aa 2e-29
in ref transcript
Protein kinase C conserved region 2, subgroup 1; C2 Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (amongst others); some PKCs lack calcium dependence. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Two distinct C2 topologies generated by permutation of the sequence with respect to the N- and C-terminal beta strands are seen. In this subgroup, containing synaptotagmins, specific protein kinases C (PKC) subtypes and other proteins, the N-terminal beta strand occupies the position of what is the C-terminal strand in subgroup 2.
cd C2 102aa 3e-20
in ref transcript
Protein kinase C conserved region 2 (CalB); Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands.
smart RasGAP 362aa 3e-97
in ref transcript
GTPase-activator protein for Ras-like GTPases. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position. Improved domain limits from structure.
pfam C2 82aa 2e-21
in ref transcript
C2 domain.
pfam C2 82aa 8e-20
in ref transcript
Changed!
smart PH 106aa 6e-08
in ref transcript
Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
pfam BTK 28aa 2e-07
in ref transcript
BTK motif. Zinc-binding motif containing conserved cysteines and a histidine. Always found C-terminal to PH domains. The crystal structure shows this motif packs against the PH domain. The PH+Btk module pair has been called the Tec homology (TH) region.
COG COG5038 88aa 3e-10
in ref transcript
Ca2+-dependent lipid-binding protein, contains C2 domain [General function prediction only].
COG COG5038 81aa 4e-09
in ref transcript
COG IQG1 246aa 4e-05
in ref transcript
Protein involved in regulation of cellular morphogenesis/cytokinesis [Cell division and chromosome partitioning / Signal transduction mechanisms].
RASSF1
rs.RASSF1.F1 rs.RASSF1.R1 100 112
NCBIGene 36.3 11186
Alternative 3-prime, size difference: 12
Exclusion in the protein (no frameshift)
Reference transcript: NM_170714
cd RASSF1_RA 96aa 4e-39
in ref transcript
RASSF1 (also known as RASSF3 and NORE1) is a tumour suppressor protein with a C-terminal Ras-associating (RA) domain that binds Ras. RASSF1 also binds the proapoptotic protein kinase MST1 and is thus thought to regulate the proapoptotic signalling pathway. RASSF1 also associates with microtubule-associated proteins like MAP1B and regulates tubulin polymerization. RASSF1 also binds CDC20 and regulates mitosis by inhibiting the anaphase-promoting complex and preventing degradation of cyclin A and cyclin B until the spindle checkpoint becomes fully operational.
Changed!
cd C1 54aa 1e-06
in ref transcript
Protein kinase C conserved region 1 (C1) . Cysteine-rich zinc binding domain. Some members of this domain family bind phorbol esters and diacylglycerol, some are reported to bind RasGTP. May occur in tandem arrangement. Diacylglycerol (DAG) is a second messenger, released by activation of Phospholipase D. Phorbol Esters (PE) can act as analogues of DAG and mimic its downstream effects in, for example, tumor promotion. Protein Kinases C are activated by DAG/PE, this activation is mediated by their N-terminal conserved region (C1). DAG/PE binding may be phospholipid dependent. C1 domains may also mediate DAG/PE signals in chimaerins (a family of Rac GTPase activating proteins), RasGRPs (exchange factors for Ras/Rap1), and Munc13 isoforms (scaffolding proteins involved in exocytosis).
smart RA 88aa 6e-17
in ref transcript
Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS); putative RasGTP effectors in other cases. Kalhammer et al. have shown that not all RA domains bind RasGTP. Predicted structure similar to that determined, and that of the RasGTP-binding domain of Raf kinase. Predicted RA domains in PLC210 and nore1 found to bind RasGTP. Included outliers (Grb7, Grb14, adenylyl cyclases etc.).
Changed!
pfam C1_1 54aa 2e-07
in ref transcript
Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain.
Changed!
cd C1 50aa 2e-08
in modified transcript
Changed!
pfam C1_1 50aa 2e-08
in modified transcript
RBM23
rs.RBM23.F1 rs.RBM23.R1 107 155
NCBIGene 36.3 55147
Single exon skipping, size difference: 48
Exclusion in the protein (no frameshift)
Reference transcript: NM_001077351
cd RRM 73aa 2e-21
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
cd RRM 73aa 9e-10
in ref transcript
TIGR SF-CC1 292aa 3e-92
in ref transcript
A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.
COG COG0724 162aa 3e-15
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
RBM23
rs.RBM23.F2 rs.RBM23.R2 217 271
NCBIGene 36.3 55147
Single exon skipping, size difference: 54
Exclusion in the protein (no frameshift)
Reference transcript: NM_001077351
cd RRM 73aa 2e-21
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
cd RRM 73aa 9e-10
in ref transcript
Changed!
TIGR SF-CC1 292aa 3e-92
in ref transcript
A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.
COG COG0724 162aa 3e-15
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
Changed!
TIGR SF-CC1 286aa 1e-90
in modified transcript
RBM33
rs.RBM33.F1 rs.RBM33.R1 173 203
NCBIGene 36.3 155435
Alternative 3-prime, size difference: 30
Exclusion in the protein (no frameshift)
Reference transcript: NM_001008408
cd RRM 64aa 0.001
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
smart RRM_2 64aa 3e-04
in ref transcript
RNA recognition motif.
RBM35A
rs.RBM35A.F1 rs.RBM35A.R1 100 112
NCBIGene 36.3 54845
Alternative 5-prime, size difference: 12
Exclusion in the protein (no frameshift)
Reference transcript: NM_017697
cd RRM 76aa 1e-07
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
cd RRM 75aa 3e-07
in ref transcript
cd ERI-1_3'hExo_like 117aa 3e-07
in ref transcript
This subfamily is composed of Caenorhabditis elegans ERI-1, human 3' exonuclease (3'hExo), Drosophila exonuclease snipper (snp), and similar proteins from eukaryotes and bacteria. These are DEDDh-type DnaQ-like 3'-5' exonucleases containing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. ERI-1 has been implicated in the degradation of small interfering RNAs (RNAi). 3'hExo participates in the degradation of histone mRNAs. Snp is a non-essential exonuclease that efficiently degrades structured RNA and DNA substrates as long as there is a minimum of 2 nucleotides in the 3' overhang to initiate degradation. Snp is not a functional homolog of either ERI-1 or 3'hExo.
smart RRM_2 75aa 6e-05
in ref transcript
RNA recognition motif.
smart RRM_2 74aa 3e-04
in ref transcript
RBM47
rs.RBM47.F1 rs.RBM47.R1 123 330
NCBIGene 36.3 54502
Single exon skipping, size difference: 207
Exclusion in the protein (no frameshift)
Reference transcript: NM_001098634
cd RRM 72aa 1e-13
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
cd RRM 68aa 8e-12
in ref transcript
cd RRM 79aa 1e-06
in ref transcript
Changed!
TIGR hnRNP-R-Q 573aa 0.0
in ref transcript
Sequences in this subfamily include the human heterogeneous nuclear ribonucleoproteins (hnRNP) R, Q and APOBEC-1 complementation factor (aka APOBEC-1 stimulating protein). These proteins contain three RNA recognition domains (rrm: pfam00076) and a somewhat variable C-terminal domain.
COG COG0724 143aa 4e-09
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
COG COG0724 71aa 2e-07
in ref transcript
Changed!
TIGR hnRNP-R-Q 354aa 1e-178
in modified transcript
Changed!
TIGR hnRNP-R-Q 175aa 6e-25
in modified transcript
RBMY1A1
rs.RBMY1A1.F1 rs.RBMY1A1.R1 345 523
NCBIGene 36.3 5940
Single exon skipping, size difference: 178
Exclusion in the protein causing a frameshift
Reference transcript: NM_005058
Changed!
cd RRM 73aa 1e-22
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
Changed!
pfam RRM_1 70aa 2e-20
in ref transcript
RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternative splicing, and protein components of snRNPs. The motif also appears in a few single stranded DNA binding proteins. The RRM structure consists of four strands and two helices arranged in an alpha/beta sandwich, with a third helix present during RNA binding in some cases The C-terminal beta strand (4th strand) and final helix are hard to align and have been omitted in the SEED alignment The LA proteins have a N terminus rrm which is included in the seed. There is a second region towards the C terminus that has some features of a rrm but does not appear to have the important structural core of a rrm. The LA proteins are one of the main autoantigens in Systemic lupus erythematosus (SLE), an autoimmune disease.
Changed!
pfam RBM1CTR 44aa 2e-08
in ref transcript
RBM1CTR (NUC064) family. This C-terminal region is found in RBM1-like RNA binding hnRNPs.
Changed!
COG COG0724 84aa 1e-09
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
Changed!
cd RRM 65aa 2e-17
in modified transcript
Changed!
pfam RRM_1 63aa 2e-16
in modified transcript
Changed!
COG COG0724 79aa 2e-08
in modified transcript
RCC1
rs.RCC1.F1 rs.RCC1.R1 111 153
NCBIGene 36.3 1104
Alternative 5-prime, size difference: 42
Exclusion in the protein (no frameshift)
Reference transcript: NM_001048194
pfam RCC1 50aa 5e-11
in ref transcript
Regulator of chromosome condensation (RCC1).
pfam RCC1 49aa 1e-10
in ref transcript
pfam RCC1 51aa 5e-10
in ref transcript
pfam RCC1 52aa 8e-10
in ref transcript
pfam RCC1 66aa 1e-08
in ref transcript
pfam RCC1 31aa 2e-04
in ref transcript
pfam RCC1 51aa 4e-04
in ref transcript
Changed!
COG ATS1 350aa 5e-42
in ref transcript
Alpha-tubulin suppressor and related RCC1 domain-containing proteins [Cell division and chromosome partitioning / Cytoskeleton].
Changed!
COG ATS1 415aa 3e-42
in modified transcript
RCP9
rs.RCP9.F1 rs.RCP9.R1 222 321
NCBIGene 36.3 27297
Single exon skipping, size difference: 99
Exclusion in the protein (no frameshift)
Reference transcript: NM_014478
Changed!
pfam RNA_pol_III_C17 120aa 5e-36
in ref transcript
RNA polymerase III subunit C17. C17 (aka CGRP-RCP) is an essential subunit of RNA polymerase III. C17 forms a subcomplex with C25 which is likely to be the counterpart of subcomplex Rpb4/7 in Pol II.
Changed!
pfam RNA_pol_III_C17 87aa 1e-22
in modified transcript
REG3A
rs.REG3A.F1 rs.REG3A.R1 107 130
NCBIGene 36.3 5068
Alternative 3-prime, size difference: 23
Inclusion in 5'UTR
Reference transcript: NM_138937
cd CLECT_REG-1_like 134aa 7e-41
in ref transcript
CLECT_REG-1_like: C-type lectin-like domain (CTLD) of the type found in Human REG-1 (lithostathine), REG-4, and avian eggshell-specific proteins: ansocalcin, structhiocalcin-1(SCA-1), and -2(SCA-2). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. REG-1 is a proliferating factor which participates in various kinds of tissue regeneration including pancreatic beta-cell regeneration, regeneration of intestinal mucosa, regeneration of motor neurons, and perhaps in tissue regeneration of damaged heart. REG-1 may play a role on the pathophysiology of Alzheimer's disease and in the development of gastric cancers. Its expression is correlated with reduced survival from early-stage colorectal cancer. REG-1 also binds and aggregates several bacterial strains from the intestinal flora and it has been suggested that it is involved in the control of the intestinal bacterial ecosystem. Rat lithostathine has calcium carbonate crystal inhibitor activity in vitro. REG-IV is unregulated in pancreatic, gastric, hepatocellular, and prostrate adenocarcinomas. REG-IV activates the EGF receptor/Akt/AP-1 signaling pathway in colorectal carcinoma. Ansocalcin, SCA-1 and -2 are found at high concentration in the calcified egg shell layer of goose and ostrich, respectively and tend to form aggregates. Ansocalcin nucleates calcite crystal aggregates in vitro.
smart CLECT 133aa 2e-23
in ref transcript
C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.
REG3G
rs.REG3G.F1 rs.REG3G.R1 280 380
NCBIGene 36.3 130120
Alternative 3-prime, size difference: 100
Inclusion in 5'UTR
Reference transcript: NM_198448
cd CLECT_REG-1_like 134aa 5e-38
in ref transcript
CLECT_REG-1_like: C-type lectin-like domain (CTLD) of the type found in Human REG-1 (lithostathine), REG-4, and avian eggshell-specific proteins: ansocalcin, structhiocalcin-1(SCA-1), and -2(SCA-2). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. REG-1 is a proliferating factor which participates in various kinds of tissue regeneration including pancreatic beta-cell regeneration, regeneration of intestinal mucosa, regeneration of motor neurons, and perhaps in tissue regeneration of damaged heart. REG-1 may play a role on the pathophysiology of Alzheimer's disease and in the development of gastric cancers. Its expression is correlated with reduced survival from early-stage colorectal cancer. REG-1 also binds and aggregates several bacterial strains from the intestinal flora and it has been suggested that it is involved in the control of the intestinal bacterial ecosystem. Rat lithostathine has calcium carbonate crystal inhibitor activity in vitro. REG-IV is unregulated in pancreatic, gastric, hepatocellular, and prostrate adenocarcinomas. REG-IV activates the EGF receptor/Akt/AP-1 signaling pathway in colorectal carcinoma. Ansocalcin, SCA-1 and -2 are found at high concentration in the calcified egg shell layer of goose and ostrich, respectively and tend to form aggregates. Ansocalcin nucleates calcite crystal aggregates in vitro.
smart CLECT 133aa 2e-21
in ref transcript
C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.
REPIN1
rs.REPIN1.F1 rs.REPIN1.R1 274 399
NCBIGene 36.3 29803
Single exon skipping, size difference: 125
Inclusion in the protein causing a frameshift
Reference transcript: NM_001099695
Changed!
COG COG5048 275aa 2e-06
in ref transcript
FOG: Zn-finger [General function prediction only].
RERE
rs.RERE.F1 rs.RERE.R1 207 392
NCBIGene 36.3 473
Single exon skipping, size difference: 185
Exclusion in 5'UTR
Reference transcript: NM_012102
cd BAH_MTA 206aa 4e-60
in ref transcript
BAH, or Bromo Adjacent Homology domain, as present in MTA1 and similar proteins. The Metastasis-associated protein MTA1 is part of the NURD (nucleosome remodeling and deacetylating) complex and plays a role in cellular transformation and metastasis. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
cd ZnF_GATA 55aa 4e-09
in ref transcript
Zinc finger DNA binding domain; binds specifically to DNA consensus sequence [AT]GATA[AG] promoter elements; a subset of family members may also bind protein; zinc-finger consensus topology is C-X(2)-C-X(17)-C-X(2)-C.
cd SANT 44aa 0.001
in ref transcript
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.
pfam Atrophin-1 622aa 1e-154
in ref transcript
Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
pfam Atrophin-1 134aa 4e-20
in ref transcript
smart ZnF_GATA 50aa 4e-11
in ref transcript
zinc finger binding to DNA consensus sequence [AT]GATA[AG].
smart BAH 177aa 3e-08
in ref transcript
Bromo adjacent homology domain.
pfam ELM2 53aa 2e-07
in ref transcript
ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. ELM2 is also found associated with an ARID DNA binding domain pfam01388 in a protein from Arabidopsis thaliana. This suggests that ELM2 may also be involved in DNA binding, or perhaps is a protein-protein interaction domain.
smart SANT 46aa 2e-04
in ref transcript
SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains.
RGR
rs.RGR.F1 rs.RGR.R1 100 112
NCBIGene 36.3 5995
Alternative 3-prime, size difference: 12
Exclusion in the protein (no frameshift)
Reference transcript: NM_002921
Changed!
pfam 7tm_1 229aa 7e-11
in ref transcript
7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
Changed!
pfam 7tm_1 225aa 3e-12
in modified transcript
RGS11
rs.RGS11.F1 rs.RGS11.R1 94 105
NCBIGene 36.3 8786
Alternative 5-prime, size difference: 11
Inclusion in the protein causing a frameshift
Reference transcript: NM_183337
Changed!
cd DEP_RGS7-like 88aa 5e-25
in ref transcript
DEP (Dishevelled, Egl-10, and Pleckstrin) domain found in RGS (regulator of G-protein signaling) proteins of the subfamily R7. This subgroup contains RGS7, RGS6, RGS9 and RGS11. They share a common domain architecture, containing, beside the RGS domain, a DEP domain and a GGL (G-protein gamma subunit-like ) domain. RGS proteins are GTPase-activating (GAP) proteins of heterotrimeric G proteins by increasing the rate of GTP hydrolysis of the alpha subunit. The fungal homologs, like yeast Sst2, share a related common domain architecture, containing RGS and DEP domains. Sst2 has been identified as the principal regulator of mating pheromone signaling and recently the DEP domain of Sst2 has been shown to be necessary and sufficient to mediate receptor interaction.
Changed!
cd GGL 56aa 3e-13
in ref transcript
G protein gamma subunit-like motifs, the alpha-helical G-gamma chain dimerizes with the G-beta propeller subunit as part of the heterotrimeric G-protein complex; involved in signal transduction via G-protein-coupled receptors.
Changed!
pfam RGS 115aa 7e-35
in ref transcript
Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits.
Changed!
smart GGL 62aa 2e-18
in ref transcript
G protein gamma subunit-like motifs.
Changed!
pfam DEP 80aa 1e-16
in ref transcript
Domain found in Dishevelled, Egl-10, and Pleckstrin. The DEP domain is responsible for mediating intracellular protein targeting and regulation of protein stability in the cell. The DEP domain is present in a number of signaling molecules, including Regulator of G protein Signaling (RGS) proteins, and has been implicated in membrane targeting. New findings in yeast, however, demonstrate a major role for a DEP domain in mediating the interaction of an RGS protein to the C-terminal tail of a GPCR, thus placing RGS in close proximity with its substrate G protein alpha subunit.
RGS8
rs.RGS8.F1 rs.RGS8.R1 153 282
NCBIGene 36.3 85397
Single exon skipping, size difference: 129
Inclusion in the protein causing a new stop codon
Reference transcript: NM_033345
Changed!
pfam RGS 115aa 2e-38
in ref transcript
Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits.
RHBDD2
rs.RHBDD2.F1 rs.RHBDD2.R1 295 417
NCBIGene 36.3 57414
Single exon skipping, size difference: 122
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001040456
Changed!
pfam Rhomboid 146aa 6e-11
in ref transcript
Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth factor Spitz, allowing it to activate the Drosophila EGF receptor. Analysis has shown that Rhomboid-1 is an intramembrane serine protease (EC:3.4.21.105). Parasite-encoded rhomboid enzymes are also important for invasion of host cells by Toxoplasma and the malaria parasite.
Changed!
COG GlpG 79aa 0.005
in ref transcript
Uncharacterized membrane protein (homolog of Drosophila rhomboid) [General function prediction only].
RHOC
rs.RHOC.F1 rs.RHOC.R1 293 362
NCBIGene 36.3 389
Single exon skipping, size difference: 69
Exclusion in 5'UTR
Reference transcript: NM_175744
cd RhoA_like 175aa 1e-106
in ref transcript
RhoA-like subfamily. The RhoA subfamily consists of RhoA, RhoB, and RhoC. RhoA promotes the formation of stress fibers and focal adhesions, regulating cell shape, attachment, and motility. RhoA can bind to multiple effector proteins, thereby triggering different downstream responses. In many cell types, RhoA mediates local assembly of the contractile ring, which is necessary for cytokinesis. RhoA is vital for muscle contraction; in vascular smooth muscle cells, RhoA plays a key role in cell contraction, differentiation, migration, and proliferation. RhoA activities appear to be elaborately regulated in a time- and space-dependent manner to control cytoskeletal changes. Most Rho proteins contain a lipid modification site at the C-terminus, with a typical sequence motif CaaX, where a = an aliphatic amino acid and X = any amino acid. Lipid binding is essential for membrane attachment, a key feature of most Rho proteins. RhoA and RhoC are observed only in geranylgeranylated forms; however, RhoB can be present in palmitoylated, farnesylated, and geranylgeranylated forms. RhoA and RhoC are highly relevant for tumor progression and invasiveness; however, RhoB has recently been suggested to be a tumor suppressor. Due to the presence of truncated sequences in this CD, the lipid modification site is not available for annotation.
smart RHO 173aa 7e-99
in ref transcript
Rho (Ras homology) subfamily of Ras-like small GTPases. Members of this subfamily of Ras-like small GTPases include Cdc42 and Rac, as well as Rho isoforms.
COG COG1100 184aa 6e-32
in ref transcript
GTPase SAR1 and related small G proteins [General function prediction only].
RIMS2
rs.RIMS2.F1 rs.RIMS2.R1 154 220
NCBIGene 36.3 9699
Single exon skipping, size difference: 66
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_001100117
cd C2_1 121aa 3e-28
in ref transcript
Protein kinase C conserved region 2, subgroup 1; C2 Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (amongst others); some PKCs lack calcium dependence. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Two distinct C2 topologies generated by permutation of the sequence with respect to the N- and C-terminal beta strands are seen. In this subgroup, containing synaptotagmins, specific protein kinases C (PKC) subtypes and other proteins, the N-terminal beta strand occupies the position of what is the C-terminal strand in subgroup 2.
cd C2_1 124aa 2e-22
in ref transcript
cd PDZ_signaling 82aa 2e-11
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
smart C2 104aa 5e-14
in ref transcript
Protein kinase C conserved region 2 (CalB). Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotamins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates, and intracellular proteins. Unusual occurrence in perforin. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands. SMART detects C2 domains using one or both of two profiles.
smart C2 108aa 2e-12
in ref transcript
smart PDZ 91aa 3e-11
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
pfam RPH3A_effector 132aa 2e-08
in ref transcript
Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas.
Protein kinase C conserved region 2, subgroup 1; C2 Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (amongst others); some PKCs lack calcium dependence. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Two distinct C2 topologies generated by permutation of the sequence with respect to the N- and C-terminal beta strands are seen. In this subgroup, containing synaptotagmins, specific protein kinases C (PKC) subtypes and other proteins, the N-terminal beta strand occupies the position of what is the C-terminal strand in subgroup 2.
Changed!
cd C2_1 124aa 2e-22
in ref transcript
cd PDZ_signaling 82aa 2e-11
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
smart C2 104aa 5e-14
in ref transcript
Protein kinase C conserved region 2 (CalB). Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotamins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates, and intracellular proteins. Unusual occurrence in perforin. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands. SMART detects C2 domains using one or both of two profiles.
smart C2 108aa 2e-12
in ref transcript
smart PDZ 91aa 3e-11
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
pfam RPH3A_effector 132aa 2e-08
in ref transcript
Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas.
Changed!
cd C2_1 125aa 6e-23
in modified transcript
RLN2
rs.RLN2.F1 rs.RLN2.R1 271 372
NCBIGene 36.3 6019
Single exon skipping, size difference: 101
Inclusion in the protein causing a frameshift
Reference transcript: NM_134441
Changed!
cd IlGF_relaxin_like 40aa 3e-10
in ref transcript
IlGF_like family, relaxin_like subgroup, specific to vertebrates. Members include a number of active peptides including (pro)relaxin, mammalian Leydig cell-specific insulin-like peptide (gene INSL3), early placenta insulin-like peptide (ELIP; gene INSL4), and insulin-like peptides 5 (INSL5) and 6 (INSL6). Members of this subgroup are widely expressed in testes (INSL3, INSL6), decidua, placenta, prostate, corpus luteum, brain (various relaxins), GI tract, and kidney (INSL5) where they serve a variety of functions in parturition and development. Typically, the active forms of these peptide hormones are composed of two chains (A and B) linked by two disulfide bonds; the arrangement of four cysteines is conserved in the "A" chain: Cys1 is linked by a disulfide bond to Cys3, Cys2 and Cys4 are linked by interchain disulfide bonds to cysteines in the "B" chain. This alignment contains both chains, plus the intervening linker region, arranged as found in the propeptide form. Propeptides are cleaved to yield two separate chains linked covalently by the two disulfide bonds.
cd IlGF_relaxin_like 24aa 5e-05
in ref transcript
Changed!
pfam Insulin 50aa 3e-08
in ref transcript
Insulin/IGF/Relaxin family. Superfamily includes insulins; relaxins; insulin-like growth factor; and bombyxin. All are secreted regulatory hormones. Disulfide rich, all-alpha fold. Alignment includes B chain, linker (which is processed out of the final product), and A chain.
Changed!
smart IlGF 40aa 2e-06
in ref transcript
Insulin / insulin-like growth factor / relaxin family. Family of proteins including insulin, relaxin, and IGFs. Insulin decreases blood glucose concentration.
Changed!
smart IlGF 37aa 3e-06
in modified transcript
RNASE1
rs.RNASE1.F1 rs.RNASE1.R1 100 112
NCBIGene 36.3 6035
Alternative 3-prime, size difference: 12
Inclusion in 5'UTR
Reference transcript: NM_198234
cd RNase_A_canonical 120aa 1e-39
in ref transcript
Canonical RNase A family includes all vertebrate homologues to the bovine pancreatic ribonuclease A (RNase A) that contain the catalytic site, necessary for RNase activity. In the human genome 8 RNases , refered to as "canonical" RNases, have been identified, pancreatic RNase (RNase 1), Eosinophil Derived Neurotoxin (SEDN/RNASE 2), Eosinophil Cationic Protein (ECP/RNase 3), RNase 4, Angiogenin (RNase 5), RNase 6 or k6, the skin derived RNase (RNase 7) and RNase 8. The eight human genes are all located in a cluster on chromosome 14. Canonical RNase A enzymes have special biological activities; for example, some stimulate the development of vascular endothelial cells, dendritic cells, and neurons, are cytotoxic/anti-tumoral and/or anti-pathogenic. RNase A is involved in endonucleolytic cleavage of 3'-phosphomononucleotides and 3'-phosphooligonucleotides ending in C-P or U-P with 2',3'-cyclic phosphate intermediates. The catalytic mechanism is a transphosphorylation of P-O 5' bonds on the 3' side of pyrimidines and subsequent hydrolysis to generate 3' phosphate groups. The canonical RNase A family proteins have a conserved catalytic triad (two histidines and one lysine). They also share 6 to 8 cysteines that form three to four disulfide bonds. Two disulfide bonds that are close to the N and C termini contribute most significantly to conformational stability. Angiogenin or RNAse 5 has been implicated in tumor-associated angiogenesis. Comparative analysis in mammals and birds indicates that the whole family may have originated from a RNase 5-like gene. This hypothesis is supported by the fact that only RNase 5-like RNases have been reported outside the mammalian class. The RNase 5 group would therefore be the most ancient form of this family, and all other members would have arisen during mammalian evolution.
smart RNAse_Pc 124aa 1e-54
in ref transcript
Pancreatic ribonuclease.
RNASEN
rs.RNASEN.F1 rs.RNASEN.R1 172 248
NCBIGene 36.3 29102
Single exon skipping, size difference: 76
Inclusion in 5'UTR
Reference transcript: NM_013235
cd RIBOc 133aa 1e-32
in ref transcript
RIBOc. Ribonuclease III C terminal domain. This group consists of eukaryotic, bacterial and archeal ribonuclease III (RNAse III) proteins. RNAse III is a double stranded RNA-specific endonuclease. Prokaryotic RNAse III is important in post-transcriptional control of mRNA stability and translational efficiency. It is involved in the processing of ribosomal RNA precursors. Prokaryotic RNAse III also plays a role in the maturation of tRNA precursors and in the processing of phage and plasmid transcripts. Eukaryotic RNase III's participate (through direct cleavage) in rRNA processing, in processing of small nucleolar RNAs (snoRNAs) and snRNA's (components of the spliceosome). In eukaryotes RNase III or RNaseIII like enzymes such as Dicer are involved in RNAi (RNA interference) and miRNA (micro-RNA) gene silencing.
cd RIBOc 115aa 2e-29
in ref transcript
cd DSRM 72aa 1e-11
in ref transcript
Double-stranded RNA binding motif. Binding is not sequence specific but is highly specific for double stranded RNA. Found in a variety of proteins including dsRNA dependent protein kinase PKR, RNA helicases, Drosophila staufen protein, E. coli RNase III, RNases H1, and dsRNA dependent adenosine deaminases.
TIGR RNaseIII 222aa 2e-47
in ref transcript
This family consists of bacterial examples of ribonuclease III. This enzyme cleaves double-stranded rRNA. It is involved in processing ribosomal RNA precursors. It is found even in minimal genones such as Mycoplasma genitalium and Buchnera aphidicola, and in some cases has been shown to be an essential gene. These bacterial proteins contain a double-stranded RNA binding motif (pfam00035) and a ribonuclease III domain (pfam00636). Eukaryotic homologs tend to be much longer proteins with additional domains, localized to the nucleus, and not included in this family.
smart RIBOc 133aa 7e-33
in ref transcript
Ribonuclease III family.
PRK rnc 229aa 1e-51
in ref transcript
ribonuclease III; Reviewed.
PRK rnc 109aa 8e-27
in ref transcript
RNASEN
rs.RNASEN.F2 rs.RNASEN.R2 131 242
NCBIGene 36.3 29102
Single exon skipping, size difference: 111
Exclusion in the protein (no frameshift)
Reference transcript: NM_013235
cd RIBOc 133aa 1e-32
in ref transcript
RIBOc. Ribonuclease III C terminal domain. This group consists of eukaryotic, bacterial and archeal ribonuclease III (RNAse III) proteins. RNAse III is a double stranded RNA-specific endonuclease. Prokaryotic RNAse III is important in post-transcriptional control of mRNA stability and translational efficiency. It is involved in the processing of ribosomal RNA precursors. Prokaryotic RNAse III also plays a role in the maturation of tRNA precursors and in the processing of phage and plasmid transcripts. Eukaryotic RNase III's participate (through direct cleavage) in rRNA processing, in processing of small nucleolar RNAs (snoRNAs) and snRNA's (components of the spliceosome). In eukaryotes RNase III or RNaseIII like enzymes such as Dicer are involved in RNAi (RNA interference) and miRNA (micro-RNA) gene silencing.
cd RIBOc 115aa 2e-29
in ref transcript
cd DSRM 72aa 1e-11
in ref transcript
Double-stranded RNA binding motif. Binding is not sequence specific but is highly specific for double stranded RNA. Found in a variety of proteins including dsRNA dependent protein kinase PKR, RNA helicases, Drosophila staufen protein, E. coli RNase III, RNases H1, and dsRNA dependent adenosine deaminases.
TIGR RNaseIII 222aa 2e-47
in ref transcript
This family consists of bacterial examples of ribonuclease III. This enzyme cleaves double-stranded rRNA. It is involved in processing ribosomal RNA precursors. It is found even in minimal genones such as Mycoplasma genitalium and Buchnera aphidicola, and in some cases has been shown to be an essential gene. These bacterial proteins contain a double-stranded RNA binding motif (pfam00035) and a ribonuclease III domain (pfam00636). Eukaryotic homologs tend to be much longer proteins with additional domains, localized to the nucleus, and not included in this family.
smart RIBOc 133aa 7e-33
in ref transcript
Ribonuclease III family.
PRK rnc 229aa 1e-51
in ref transcript
ribonuclease III; Reviewed.
PRK rnc 109aa 8e-27
in ref transcript
RNF13
rs.RNF13.F1 rs.RNF13.R1 90 100
NCBIGene 36.3 11342
Alternative 5-prime, size difference: 10
Inclusion in the protein causing a frameshift
Reference transcript: NM_007282
Changed!
cd PA_C_RZF_like 158aa 8e-50
in ref transcript
PA_C-RZF_ like: Protease-associated (PA) domain C_RZF-like. This group includes various PA domain-containing proteins similar to C-RZF (chicken embryo RING zinc finger) protein. These proteins contain a C3H2C3 RING finger. C-RZF is expressed in embryo cells and is restricted mainly to brain and heart, it is localized to both the nucleus and endosomes. Additional C3H2C3 RING finger proteins belonging to this group, include Arabidopsis ReMembR-H2 protein and mouse sperizin. ReMembR-H2 is likely to be an integral membrane protein, and to traffic through the endosomal pathway. Sperizin is expressed in haploid germ cells and localized in the cytoplasm, it may participate in spermatogenesis. The significance of the PA domain to these proteins has not been ascertained. It may be a protein-protein interaction domain. At peptidase active sites, the PA domain may participate in substrate binding and/or promoting conformational changes, which influence the stability and accessibility of the site to substrate.
Changed!
cd RING 46aa 1e-10
in ref transcript
RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
Changed!
pfam PA 92aa 2e-11
in ref transcript
PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family. It has been suggested that this domain forms a lid-like structure that covers the active site in active proteases, and is involved in protein recognition in vacuolar sorting receptors.
Changed!
smart RING 42aa 7e-09
in ref transcript
Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
Changed!
COG COG5540 253aa 1e-11
in ref transcript
RING-finger-containing ubiquitin ligase [Posttranslational modification, protein turnover, chaperones].
Changed!
cd PA_C_RZF_like 116aa 1e-37
in modified transcript
Changed!
pfam PA 66aa 7e-08
in modified transcript
Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_134260
cd NR_LBD_ROR_like 241aa 1e-127
in ref transcript
The ligand binding domain of Retinoid-related orphan receptors, of the nuclear receptor superfamily. The ligand binding domain (LBD) of Retinoid-related orphan receptors (RORs): Retinoid-related orphan receptors (RORs) are transcription factors belonging to the nuclear receptor superfamily. RORs are key regulators of many physiological processes during embryonic development. RORs bind as monomers to specific ROR response elements (ROREs) consisting of the consensus core motif AGGTCA preceded by a 5-bp A/T-rich sequence. Transcription regulation by RORs is mediated through certain corepressors, as well as coactivators. There are three subtypes of retinoid-related orphan receptors (RORs), alpha, beta, and gamma that differ only in N-terminal sequence and are distributed in distinct tissues. RORalpha plays a key role in the development of the cerebellum, particularly in the regulation of the maturation and survival of Purkinje cells. RORbeta expression is largely restricted to several regions of the brain, the retina, and pineal gland. RORgamma is essential for lymph node organogenesis. Recently, it has been su ggested that cholesterol or a cholesterol derivative is the natural ligand of RORalpha. Like other members of the nuclear receptor (NR) superfamily of ligand-activated transcription factors, retinoid-related orphan receptors have a central well conserved DNA binding domain (DBD), a variable N-terminal domain, a non-conserved hinge and a C-terminal ligand binding domain (LBD).
pfam zf-C4 76aa 6e-39
in ref transcript
Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other.
pfam Hormone_recep 183aa 3e-33
in ref transcript
Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors.
RPL14
rs.RPL14.F1 rs.RPL14.R1 122 233
NCBIGene 36.3 9045
Alternative 5-prime, size difference: 111
Exclusion of the protein initiation site
Reference transcript: NM_001034996
pfam Ribosomal_L14e 77aa 1e-20
in ref transcript
Ribosomal protein L14. This family includes the eukaryotic ribosomal protein L14.
PTZ PTZ00065 123aa 3e-19
in ref transcript
60S ribosomal protein L14; Provisional.
RPP14
rs.RPP14.F1 rs.RPP14.R1 135 369
NCBIGene 36.3 11102
Alternative 5-prime, size difference: 234
Exclusion in 5'UTR
Reference transcript: NM_001098783
cd R_hydratase 127aa 1e-37
in ref transcript
(R)-hydratase [(R)-specific enoyl-CoA hydratase] catalyzes the hydration of trans-2-enoyl CoA to (R)-3-hydroxyacyl-CoA as part of the PHA (polyhydroxyalkanoate) biosynthetic pathway. (R)-hydratase contains a hot-dog fold similar to those of thioesterase II, and beta-hydroxydecanoyl-ACP dehydratase, MaoC dehydratase, Hydratase-Dehydrogenase-Epimerase protein (HDE), and the fatty acid synthase beta subunit. The active site lies within a substrate-binding tunnel formed by the (R)-hydratase homodimer. A subset of the bacterial (R)-hydratases contain a C-terminal phosphotransacetylase (PTA) domain.
pfam MaoC_dehydratas 108aa 2e-15
in ref transcript
MaoC like domain. The maoC gene is part of a operon with maoA which is involved in the synthesis of monoamine oxidase. The MaoC protein is found to share similarity with a wide variety of enzymes; estradiol 17 beta-dehydrogenase 4, peroxisomal hydratase-dehydrogenase-epimerase, fatty acid synthase beta subunit. Several bacterial proteins that are composed solely of this domain have (R)-specific enoyl-CoA hydratase activity. This domain is also present in the NodN nodulation protein N.
Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from eukaryotes and archaebacteria. Gadd45 and MyD118.
RPS15A
rs.RPS15A.F1 rs.RPS15A.R1 474 540
NCBIGene 36.3 6210
Alternative 5-prime, size difference: 66
Exclusion in 5'UTR
Reference transcript: NM_001019
pfam Ribosomal_S8 125aa 3e-37
in ref transcript
Ribosomal protein S8.
PTZ PTZ00158 130aa 8e-68
in ref transcript
40S ribosomal protein S15A; Provisional.
RPS24
rs.RPS24.F1 rs.RPS24.R1 112 134
NCBIGene 36.3 6229
Single exon skipping, size difference: 22
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001026
pfam Ribosomal_S24e 83aa 4e-16
in ref transcript
Ribosomal protein S24e.
PTZ PTZ00071 130aa 9e-29
in ref transcript
40S ribosomal protein S24; Provisional.
RRBP1
rs.RRBP1.F1 rs.RRBP1.R1 112 189
NCBIGene 36.3 6238
Single exon skipping, size difference: 77
Exclusion in 5'UTR
Reference transcript: NM_001042576
pfam Rib_recp_KP_reg 144aa 7e-12
in ref transcript
Ribosome receptor lysine/proline rich region. This highly conserved region is found towards the C-terminus of the transmembrane domain. The function is unclear.
TIGR SMC_prok_A 286aa 1e-04
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
TIGR SMC_prok_B 340aa 5e-04
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
PRK PRK00409 94aa 0.003
in ref transcript
recombination and DNA strand exchange inhibitor protein; Reviewed.
PRK PRK02224 456aa 0.009
in ref transcript
chromosome segregation protein; Provisional.
RSRC2
rs.RSRC2.F1 rs.RSRC2.R1 100 122
NCBIGene 36.3 65117
Alternative 3-prime, size difference: 22
Exclusion in the protein causing a frameshift
Reference transcript: NM_198261
RTKN
rs.RTKN.F1 rs.RTKN.R1 102 529
NCBIGene 36.3 6242
Single exon skipping, size difference: 427
Exclusion of the protein initiation site
Reference transcript: NM_033046
cd PH 99aa 5e-06
in ref transcript
Pleckstrin homology (PH) domain. PH domains are only found in eukaryotes. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.
Changed!
smart Hr1 56aa 1e-08
in ref transcript
Rho effector or protein kinase C-related kinase homology region 1 homologues. Alpha-helical domain found in vertebrate PRK1 and yeast PKC1 protein kinases C. The HR1 in rhophilin bind RhoGTP; those in PRK1 bind RhoA and RhoB. Also called RBD - Rho-binding domain.
smart PH 99aa 1e-07
in ref transcript
Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
Changed!
smart Hr1 41aa 5e-04
in modified transcript
RTN2
rs.RTN2.F1 rs.RTN2.R1 157 376
NCBIGene 36.3 6253
Single exon skipping, size difference: 219
Exclusion in the protein (no frameshift)
Reference transcript: NM_005619
Changed!
pfam Reticulon 171aa 1e-30
in ref transcript
Reticulon. Reticulon, also know as neuroendocrine-specific protein (NSP), is a protein of unknown function which associates with the endoplasmic reticulum. This family represents the C-terminal domain of the three reticulon isoforms and their homologues.
Changed!
pfam Reticulon 171aa 6e-32
in modified transcript
RUFY2
rs.RUFY2.F1 rs.RUFY2.R1 160 262
NCBIGene 36.3 55680
Single exon skipping, size difference: 102
Exclusion in the protein (no frameshift)
Reference transcript: NM_017987
cd FYVE 53aa 2e-19
in ref transcript
FYVE domain; Zinc-binding domain; targets proteins to membrane lipids via interaction with phosphatidylinositol-3-phosphate, PI3P; present in Fab1, YOTB, Vac1, and EEA1;.
pfam FYVE 62aa 2e-24
in ref transcript
FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positions. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. We have included members which do not conserve these histidine residues but are clearly related.
TIGR SMC_prok_B 285aa 3e-13
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
Changed!
COG Smc 308aa 1e-12
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
PTZ PTZ00303 65aa 2e-04
in ref transcript
phosphatidylinositol kinase; Provisional.
Changed!
COG Smc 305aa 2e-12
in modified transcript
RUNX2
rs.RUNX2.F1 rs.RUNX2.R1 310 376
NCBIGene 36.3 860
Single exon skipping, size difference: 66
Exclusion in the protein (no frameshift)
Reference transcript: NM_001024630
pfam Runt 134aa 9e-74
in ref transcript
Runt domain.
pfam RunxI 78aa 2e-32
in ref transcript
Runx inhibition domain. This domain lies to the C-terminus of Runx-related transcription factors and homologous proteins (AML, CBF-alpha, PEBP2). Its function might be to interact with functional cofactors.
RUSC1
rs.RUSC1.F1 rs.RUSC1.R1 121 439
NCBIGene 36.3 23623
Alternative 3-prime, size difference: 318
Exclusion in the protein (no frameshift)
Reference transcript: NM_001105203
cd SH3 50aa 3e-08
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
Changed!
pfam RUN 137aa 5e-21
in ref transcript
RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signalling pathways. The domain is comprises six conserved regions, which in some proteins have considerable insertions between them. The domain core is thought to take up a predominantly alpha fold, with basic amino acids in regions A and D possibly playing a functional role in interactions with Ras GTPases.
smart SH3 53aa 6e-10
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
Changed!
pfam RUN 92aa 5e-16
in modified transcript
RUSC1
rs.RUSC1.F2 rs.RUSC1.R2 138 181
NCBIGene 36.3 23623
Alternative 5-prime, size difference: 43
Inclusion in the protein causing a frameshift
Reference transcript: NM_001105205
Changed!
cd SH3 50aa 7e-09
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
Changed!
pfam RUN 137aa 1e-20
in ref transcript
RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signalling pathways. The domain is comprises six conserved regions, which in some proteins have considerable insertions between them. The domain core is thought to take up a predominantly alpha fold, with basic amino acids in regions A and D possibly playing a functional role in interactions with Ras GTPases.
Changed!
smart SH3 53aa 1e-10
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
RYR1
rs.RYR1.F1 rs.RYR1.R1 125 140
NCBIGene 36.3 6261
Single exon skipping, size difference: 15
Exclusion in the protein (no frameshift)
Reference transcript: NM_000540
pfam Ins145_P3_rec 199aa 1e-71
in ref transcript
Inositol 1,4,5-trisphosphate/ryanodine receptor. This domain corresponds to the ligand binding region on inositol 1,4,5-trisphosphate receptor, and the N terminal region of the ryanodine receptor. Both receptors are involved in Ca2+ release. They can couple to the activation of neurotransmitter-gated receptors and voltage-gated Ca2+ channels on the plasma membrane, thus allowing the endoplasmic reticulum discriminate between different types of neuronal activity.
pfam RYDR_ITPR 204aa 1e-62
in ref transcript
RIH domain. The RIH (RyR and IP3R Homology) domain is an extracellular domain from two types of calcium channels. This region is found in the ryanodine receptor and the inositol-1,4,5- trisphosphate receptor. This domain may form a binding site for IP3.
pfam MIR 179aa 3e-51
in ref transcript
MIR domain. The MIR (protein mannosyltransferase, IP3R and RyR) domain is a domain that may have a ligand transferase function.
pfam RYDR_ITPR 209aa 2e-49
in ref transcript
pfam RIH_assoc 133aa 7e-42
in ref transcript
RyR and IP3R Homology associated. This eukaryotic domain is found in ryanodine receptors (RyR) and inositol 1,4,5-trisphosphate receptors (IP3R) which together form a superfamily of homotetrameric ligand-gated intracellular Ca2+ channels. There seems to be no known function for this domain. Also see the IP3-binding domain pfam01365 and pfam02815.
pfam RyR 95aa 4e-41
in ref transcript
RyR domain. This domain is called RyR for Ryanodine receptor. The domain is found in four copies in the ryanodine receptor. The function of this domain is unknown.
pfam RyR 95aa 3e-39
in ref transcript
pfam RR_TM4-6 225aa 3e-39
in ref transcript
Ryanodine Receptor TM 4-6. This region covers TM regions 4-6 of the ryanodine receptor 1 family.
pfam RyR 90aa 3e-38
in ref transcript
pfam RyR 89aa 4e-31
in ref transcript
smart SPRY 123aa 1e-29
in ref transcript
Domain in SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
pfam SPRY 137aa 9e-24
in ref transcript
SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
pfam SPRY 141aa 6e-23
in ref transcript
S100A4
rs.S100A4.F1 rs.S100A4.R1 136 185
NCBIGene 36.3 6275
Single exon skipping, size difference: 49
Exclusion in 5'UTR
Reference transcript: NM_019554
cd S-100 87aa 6e-25
in ref transcript
S-100: S-100 domain, which represents the largest family within the superfamily of proteins carrying the Ca-binding EF-hand motif. Note that this S-100 hierarchy contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins are expressed exclusively in vertebrates, and are implicated in intracellular and extracellular regulatory activities. Intracellularly, S100 proteins act as Ca-signaling or Ca-buffering proteins. The most unusual characteristic of certain S100 proteins is their occurrence in extracellular space, where they act in a cytokine-like manner through RAGE, the receptor for advanced glycation products. Structural data suggest that many S100 members exist within cells as homo- or heterodimers and even oligomers; oligomerization contributes to their functional diversification. Upon binding calcium, most S100 proteins change conformation to a more open structure exposing a hydrophobic cleft. This hydrophobic surface represents the interaction site of S100 proteins with their target proteins. There is experimental evidence showing that many S100 proteins have multiple binding partners with diverse mode of interaction with different targets. In addition to S100 proteins (such as S100A1,-3,-4,-6,-7,-10,-11,and -13), this group includes the ''fused'' gene family, a group of calcium binding S100-related proteins. The ''fused'' gene family includes multifunctional epidermal differentiation proteins - profilaggrin, trichohyalin, repetin, hornerin, and cornulin; functionally these proteins are associated with keratin intermediate filaments and partially crosslinked to the cell envelope. These ''fused'' gene proteins contain N-terminal sequence with two Ca-binding EF-hands motif, which may be associated with calcium signaling in epidermal cells and autoprocessing in a calcium-dependent manner. In contrast to S100 proteins, "fused" gene family proteins contain an extraordinary high number of almost perfect peptide repeats with regular array of polar and charged residues similar to many known cell envelope proteins.
pfam S_100 44aa 2e-13
in ref transcript
S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins.
SCARB1
rs.SCARB1.F1 rs.SCARB1.R1 184 313
NCBIGene 36.3 949
Single exon skipping, size difference: 129
Exclusion of the stop codon
Reference transcript: NM_005505
pfam CD36 414aa 1e-168
in ref transcript
CD36 family. The CD36 family is thought to be a novel class of scavenger receptors. There is also evidence suggesting a possible role in signal transduction. CD36 is involved in cell adhesion.
SCARF2
rs.SCARF2.F1 rs.SCARF2.R1 100 115
NCBIGene 36.3 91179
Alternative 3-prime, size difference: 15
Exclusion in the protein (no frameshift)
Reference transcript: NM_153334
SCN3A
rs.SCN3A.F1 rs.SCN3A.R1 163 310
NCBIGene 36.3 6328
Alternative 5-prime, size difference: 147
Exclusion in the protein (no frameshift)
Reference transcript: NM_006922
pfam Na_trans_assoc 221aa 1e-65
in ref transcript
Sodium ion transport-associated. Members of this family contain a region found exclusively in eukaryotic sodium channels or their subunits, many of which are voltage-gated. Members very often also contain between one and four copies of pfam00520 and, less often, one copy of pfam00612.
pfam Ion_trans 226aa 9e-34
in ref transcript
Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
pfam Ion_trans 159aa 5e-25
in ref transcript
pfam Ion_trans 141aa 5e-25
in ref transcript
pfam Ion_trans 178aa 1e-24
in ref transcript
pfam Ion_trans 72aa 2e-13
in ref transcript
SCN5A
rs.SCN5A.F1 SCN5A.R5 271 325
NCBIGene 36.3 6331
Single exon skipping, size difference: 54
Exclusion in the protein (no frameshift)
Reference transcript: NM_198056
pfam Na_trans_assoc 265aa 7e-60
in ref transcript
Sodium ion transport-associated. Members of this family contain a region found exclusively in eukaryotic sodium channels or their subunits, many of which are voltage-gated. Members very often also contain between one and four copies of pfam00520 and, less often, one copy of pfam00612.
pfam Ion_trans 209aa 2e-29
in ref transcript
Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
Changed!
pfam Ion_trans 229aa 8e-26
in ref transcript
pfam Ion_trans 123aa 2e-25
in ref transcript
pfam Ion_trans 176aa 5e-23
in ref transcript
pfam Ion_trans 74aa 2e-13
in ref transcript
Changed!
pfam Ion_trans 211aa 9e-21
in modified transcript
SCYL1
rs.SCYL1.F1 rs.SCYL1.R1 123 174
NCBIGene 36.3 57410
Alternative 3-prime, size difference: 51
Exclusion in the protein (no frameshift)
Reference transcript: NM_020680
cd S_TKc 228aa 3e-16
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
pfam Pkinase 189aa 1e-09
in ref transcript
Protein kinase domain.
PTZ PTZ00267 251aa 5e-04
in ref transcript
NIMA-related protein kinase; Provisional.
SDCBP
rs.SDCBP.F1 rs.SDCBP.R1 102 120
NCBIGene 36.3 6386
Alternative 5-prime, size difference: 18
Exclusion in the protein (no frameshift)
Reference transcript: NM_001007067
cd PDZ_signaling 80aa 4e-13
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
cd PDZ_signaling 74aa 3e-09
in ref transcript
smart PDZ 83aa 2e-10
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
Sec24-like: Protein and membrane traffic in eukaryotes is mediated by at least in part by the budding and fusion of intracellular transport vesicles that selectively carry cargo proteins and lipids from donor to acceptor organelles. The two main classes of vesicular carriers within the endocytic and the biosynthetic pathways are COP- and clathrin-coated vesicles. Formation of COPII vesicles requires the ordered assembly of the coat built from several cytosolic components GTPase Sar1, complexes of Sec23-Sec24 and Sec13-Sec31. The process is initiated by the conversion of GDP to GTP by the GTPase Sar1 which then recruits the heterodimeric complex of Sec23 and Sec24. This heterodimeric complex generates the pre-budding complex. The final step leading to membrane deformation and budding of COPII-coated vesicles is carried by the heterodimeric complex Sec13-Sec31. The members of this CD belong to the Sec23-like family. Sec 24 is very similar to Sec23. The Sec23 and Sec24 polypeptides fold into five distinct domains: a beta-barrel, a zinc finger, a vWA or trunk, an all helical region and a carboxy Gelsolin domain. The members of this subgroup carry a partial MIDAS motif and have the overall Para-Rossmann type fold that is characteristic of this superfamily.
pfam Sec23_trunk 240aa 5e-90
in ref transcript
Sec23/Sec24 trunk domain. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is known as the trunk domain and has an alpha/beta vWA fold and forms the dimer interface.
pfam Sec23_helical 103aa 2e-29
in ref transcript
Sec23/Sec24 helical domain. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is composed of five alpha helices.
pfam Sec23_BS 85aa 1e-28
in ref transcript
Sec23/Sec24 beta-sandwich domain.
pfam zf-Sec23_Sec24 39aa 2e-12
in ref transcript
Sec23/Sec24 zinc finger. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is found to be zinc binding domain.
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
COG COG2319 306aa 6e-15
in ref transcript
FOG: WD40 repeat [General function prediction only].
SEMA6D
rs.SEMA6D.F1 rs.SEMA6D.R1 152 209
NCBIGene 36.3 80031
Single exon skipping, size difference: 57
Exclusion in the protein (no frameshift)
Reference transcript: NM_153618
pfam Sema 418aa 1e-121
in ref transcript
Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor and the human Plexin-A3 precursor.
pfam PSI 45aa 7e-07
in ref transcript
Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found Plexin. Two copies of the repeat are found in mahogany protein. A related Caenorhabditis elegans protein contains four copies of the repeat. The Met receptor contains a single copy of the repeat. The Pfam alignment shows 6 conserved cysteine residues that may form three conserved disulphide bridges, whereas shows 8 conserved cysteines. The pattern of conservation suggests that cysteines 5 and 7 (that are not absolutely conserved) form a disulphide bridge (Personal observation. A Bateman).
pfam DUF1043 111aa 0.007
in ref transcript
Protein of unknown function (DUF1043). This family consists of several hypothetical bacterial proteins of unknown function.
SEMA6D
rs.SEMA6D.F2 rs.SEMA6D.R2 342 475
NCBIGene 36.3 80031
Alternative 5-prime, size difference: 133
Exclusion in the protein causing a frameshift
Reference transcript: NM_153618
pfam Sema 418aa 1e-121
in ref transcript
Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor and the human Plexin-A3 precursor.
pfam PSI 45aa 7e-07
in ref transcript
Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found Plexin. Two copies of the repeat are found in mahogany protein. A related Caenorhabditis elegans protein contains four copies of the repeat. The Met receptor contains a single copy of the repeat. The Pfam alignment shows 6 conserved cysteine residues that may form three conserved disulphide bridges, whereas shows 8 conserved cysteines. The pattern of conservation suggests that cysteines 5 and 7 (that are not absolutely conserved) form a disulphide bridge (Personal observation. A Bateman).
Changed!
pfam DUF1043 111aa 0.007
in ref transcript
Protein of unknown function (DUF1043). This family consists of several hypothetical bacterial proteins of unknown function.
SENP6
rs.SENP6.F1 rs.SENP6.R1 121 142
NCBIGene 36.3 26054
Single exon skipping, size difference: 21
Exclusion in the protein (no frameshift)
Reference transcript: NM_015571
pfam Peptidase_C48 120aa 9e-10
in ref transcript
Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn.
pfam Peptidase_C48 95aa 5e-08
in ref transcript
COG ULP1 112aa 3e-18
in ref transcript
Protease, Ulp1 family [Posttranslational modification, protein turnover, chaperones].
COG ULP1 91aa 2e-06
in ref transcript
SENP7
rs.SENP7.F1 rs.SENP7.R1 191 386
NCBIGene 36.3 57337
Single exon skipping, size difference: 195
Exclusion in the protein (no frameshift)
Reference transcript: NM_020654
pfam Peptidase_C48 242aa 3e-20
in ref transcript
Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn.
COG ULP1 114aa 7e-21
in ref transcript
Protease, Ulp1 family [Posttranslational modification, protein turnover, chaperones].
COG ULP1 81aa 0.009
in ref transcript
SEPT8
rs.SEPT8.F1 rs.SEPT8.R1 137 405
NCBIGene 36.3 23176
Alternative 3-prime, size difference: 268
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001098812
cd CDC_Septin 269aa 1e-110
in ref transcript
CDC/Septin. Septins are a conserved family of GTP-binding proteins associated with diverse processes in dividing and non-dividing cells. They were first discovered in the budding yeast S. cerevisiae as a set of genes (CDC3, CDC10, CDC11 and CDC12) required for normal bud morphology. Septins are also present in metazoan cells, where they are required for cytokinesis in some systems, and implicated in a variety of other processes involving organization of the cell cortex and exocytosis. In humans, 12 septin genes generate dozens of polypeptides, many of which comprise heterooligomeric complexes. Since septin mutants are commonly defective in cytokinesis and formation of the neck formation of the neck filaments/septin rings, septins have been considered to be the primary constituents of the neck filaments. Septins belong to the GTPase superfamily for their conserved GTPase motifs and enzymatic activities.
pfam Septin 268aa 3e-84
in ref transcript
Septin. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP. As regards the septins, these are polypeptides of 30-65kDa with three characteristic GTPase motifs (G-1, G-3 and G-4) that are similar to those of the Ras family. The G-4 motif is strictly conserved with a unique septin consensus of AKAD. Most septins are thought to have at least one coiled-coil region, which in some cases is necessary for intermolecular interactions that allow septins to polymerise to form rod-shaped complexes. In turn, these are arranged into tandem arrays to form filaments. They are multifunctional proteins, with roles in cytokinesis, sporulation, germ cell development, exocytosis and apoptosis.
COG CDC3 322aa 6e-74
in ref transcript
Septin family protein [Cell division and chromosome partitioning / Cytoskeleton].
SERBP1
rs.SERBP1.F1 rs.SERBP1.R1 103 148
NCBIGene 36.3 26135
Alternative 3-prime, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_001018067
Changed!
pfam HABP4_PAI-RBP1 125aa 2e-16
in ref transcript
Hyaluronan / mRNA binding family. This family includes the HABP4 family of hyaluronan-binding proteins, and the PAI-1 mRNA-binding protein, PAI-RBP1. HABP4 has been observed to bind hyaluronan (a glucosaminoglycan), but it is not known whether this is its primary role in vivo. It has also been observed to bind RNA, but with a lower affinity than that for hyaluronan. PAI-1 mRNA-binding protein specifically binds the mRNA of type-1 plasminogen activator inhibitor (PAI-1), and is thought to be involved in regulation of mRNA stability. However, in both cases, the sequence motifs predicted to be important for ligand binding are not conserved throughout the family, so it is not known whether members of this family share a common function.
Changed!
pfam HABP4_PAI-RBP1 110aa 9e-18
in modified transcript
SERPINB8
rs.SERPINB8.F1 rs.SERPINB8.R1 100 117
NCBIGene 36.3 5271
Alternative 5-prime, size difference: 17
Exclusion in 5'UTR
Reference transcript: NM_198833
cd PAI-2 371aa 1e-137
in ref transcript
Plasminogen Activator Inhibitor-2 (PAI-2). PAI-2 is a serine protease inhibitor that belongs to the ov-serpin branch of the serpin superfamily. It is is an effective inhibitor of urinary plasminogen activator (urokinase or uPA) and is involved in cell differentiation, tissue growth and regeneration.
smart SERPIN 362aa 1e-116
in ref transcript
SERine Proteinase INhibitors.
COG COG4826 372aa 1e-60
in ref transcript
Serine protease inhibitor [Posttranslational modification, protein turnover, chaperones].
SEZ6
rs.SEZ6.F1 rs.SEZ6.R1 103 119
NCBIGene 36.3 124925
Alternative 3-prime, size difference: 16
Exclusion in the protein causing a frameshift
Reference transcript: NM_178860
cd CUB 111aa 9e-19
in ref transcript
CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
cd CUB 109aa 6e-16
in ref transcript
cd CCP 57aa 1e-10
in ref transcript
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
cd CCP 59aa 2e-08
in ref transcript
cd CCP 60aa 1e-07
in ref transcript
cd CCP 56aa 2e-06
in ref transcript
cd CCP 57aa 3e-04
in ref transcript
smart CUB 99aa 1e-15
in ref transcript
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein. This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.
smart CUB 100aa 1e-14
in ref transcript
pfam Sushi 56aa 2e-12
in ref transcript
Sushi domain (SCR repeat).
smart CCP 60aa 7e-08
in ref transcript
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.
smart CCP 58aa 7e-08
in ref transcript
pfam Sushi 55aa 5e-07
in ref transcript
smart CCP 56aa 4e-05
in ref transcript
SEZ6L2
rs.SEZ6L2.F1 rs.SEZ6L2.R1 123 333
NCBIGene 36.3 26470
Alternative 3-prime, size difference: 210
Exclusion in the protein (no frameshift)
Reference transcript: NM_201575
cd CUB 91aa 6e-11
in ref transcript
CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
cd CUB 69aa 7e-11
in ref transcript
cd CCP 57aa 4e-09
in ref transcript
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
cd CUB 113aa 3e-08
in ref transcript
cd CCP 59aa 3e-08
in ref transcript
cd CCP 60aa 2e-07
in ref transcript
cd CCP 56aa 2e-06
in ref transcript
pfam CUB 91aa 2e-08
in ref transcript
CUB domain.
smart CCP 56aa 3e-08
in ref transcript
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.
smart CCP 56aa 5e-08
in ref transcript
pfam Sushi 58aa 1e-07
in ref transcript
Sushi domain (SCR repeat).
smart CCP 60aa 3e-07
in ref transcript
smart CUB 59aa 3e-06
in ref transcript
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein. This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.
smart CUB 102aa 5e-04
in ref transcript
pfam Sushi 57aa 0.003
in ref transcript
SFRS12
rs.SFRS12.F1 rs.SFRS12.R1 372 496
NCBIGene 36.3 140890
Single exon skipping, size difference: 124
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001077199
Changed!
cd RRM 73aa 2e-14
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
cd RRM 74aa 2e-05
in ref transcript
Changed!
smart RRM_2 72aa 5e-14
in ref transcript
RNA recognition motif.
Changed!
smart RRM 71aa 1e-04
in ref transcript
RNA recognition motif.
Changed!
COG COG0724 106aa 7e-05
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
Changed!
pfam RRM_1 69aa 1e-05
in modified transcript
RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternative splicing, and protein components of snRNPs. The motif also appears in a few single stranded DNA binding proteins. The RRM structure consists of four strands and two helices arranged in an alpha/beta sandwich, with a third helix present during RNA binding in some cases The C-terminal beta strand (4th strand) and final helix are hard to align and have been omitted in the SEED alignment The LA proteins have a N terminus rrm which is included in the seed. There is a second region towards the C terminus that has some features of a rrm but does not appear to have the important structural core of a rrm. The LA proteins are one of the main autoantigens in Systemic lupus erythematosus (SLE), an autoimmune disease.
Changed!
COG COG0724 86aa 7e-04
in modified transcript
SFRS18
rs.SFRS18.F1 rs.SFRS18.R1 203 283
NCBIGene 36.3 25957
Single exon skipping, size difference: 80
Exclusion in 5'UTR
Reference transcript: NM_032870
SGCE
rs.SGCE.F1 rs.SGCE.R1 163 198
NCBIGene 36.3 8910
Single exon skipping, size difference: 35
Inclusion in the protein causing a frameshift
Reference transcript: NM_001099401
pfam Sarcoglycan_2 389aa 1e-168
in ref transcript
Sarcoglycan alpha/epsilon. Sarcoglycans are a subcomplex of transmembrane proteins which are part of the dystrophin-glycoprotein complex. They are expressed in the skeletal, cardiac and smooth muscle. Although numerous studies have been conducted on the sarcoglycan subcomplex in skeletal and cardiac muscle, the manner of the distribution and localisation of these proteins along the nonjunctional sarcolemma is not clear. This family contains alpha and epsilon members.
SGCE
rs.SGCE.F2 rs.SGCE.R2 147 222
NCBIGene 36.3 8910
Single exon skipping, size difference: 75
Exclusion in the protein (no frameshift)
Reference transcript: NM_001099401
Changed!
pfam Sarcoglycan_2 389aa 1e-168
in ref transcript
Sarcoglycan alpha/epsilon. Sarcoglycans are a subcomplex of transmembrane proteins which are part of the dystrophin-glycoprotein complex. They are expressed in the skeletal, cardiac and smooth muscle. Although numerous studies have been conducted on the sarcoglycan subcomplex in skeletal and cardiac muscle, the manner of the distribution and localisation of these proteins along the nonjunctional sarcolemma is not clear. This family contains alpha and epsilon members.
Changed!
pfam Sarcoglycan_2 386aa 1e-169
in modified transcript
SGCE
rs.SGCE.F3 rs.SGCE.R3 137 164
NCBIGene 36.3 8910
Single exon skipping, size difference: 27
Exclusion in the protein (no frameshift)
Reference transcript: NM_001099401
Changed!
pfam Sarcoglycan_2 389aa 1e-168
in ref transcript
Sarcoglycan alpha/epsilon. Sarcoglycans are a subcomplex of transmembrane proteins which are part of the dystrophin-glycoprotein complex. They are expressed in the skeletal, cardiac and smooth muscle. Although numerous studies have been conducted on the sarcoglycan subcomplex in skeletal and cardiac muscle, the manner of the distribution and localisation of these proteins along the nonjunctional sarcolemma is not clear. This family contains alpha and epsilon members.
Changed!
pfam Sarcoglycan_2 380aa 1e-162
in modified transcript
SGSM1
rs.SGSM1.F1 rs.SGSM1.R1 174 339
NCBIGene 36.3 129049
Single exon skipping, size difference: 165
Exclusion in the protein (no frameshift)
Reference transcript: NM_001039948
smart TBC 145aa 2e-30
in ref transcript
Domain in Tre-2, BUB2p, and Cdc16p. Probable Rab-GAPs. Widespread domain present in Gyp6 and Gyp7, thereby giving rise to the notion that it performs a GTP-activator activity on Rab-like GTPases.
pfam RUN 146aa 5e-21
in ref transcript
RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signalling pathways. The domain is comprises six conserved regions, which in some proteins have considerable insertions between them. The domain core is thought to take up a predominantly alpha fold, with basic amino acids in regions A and D possibly playing a functional role in interactions with Ras GTPases.
COG COG5210 212aa 2e-18
in ref transcript
GTPase-activating protein [General function prediction only].
SGSM2
rs.SGSM2.F1 rs.SGSM2.R1 329 464
NCBIGene 36.3 9905
Single exon skipping, size difference: 135
Exclusion in the protein (no frameshift)
Reference transcript: NM_014853
smart TBC 147aa 1e-31
in ref transcript
Domain in Tre-2, BUB2p, and Cdc16p. Probable Rab-GAPs. Widespread domain present in Gyp6 and Gyp7, thereby giving rise to the notion that it performs a GTP-activator activity on Rab-like GTPases.
pfam RUN 144aa 6e-25
in ref transcript
RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signalling pathways. The domain is comprises six conserved regions, which in some proteins have considerable insertions between them. The domain core is thought to take up a predominantly alpha fold, with basic amino acids in regions A and D possibly playing a functional role in interactions with Ras GTPases.
COG COG5210 212aa 1e-19
in ref transcript
GTPase-activating protein [General function prediction only].
SH2D5
rs.SH2D5.F1 rs.SH2D5.R1 170 299
NCBIGene 36.3 400745
Single exon skipping, size difference: 129
Exclusion of the protein initiation site
Reference transcript: NM_001103161
cd SH2 88aa 2e-09
in ref transcript
Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
Changed!
cd PTB 119aa 2e-07
in ref transcript
Phosphotyrosine-binding (PTB) domain; PTB domains have a PH-like fold and are found in various eukaryotic signaling molecules. They were initially identified based upon their ability to recognize phosphorylated tyrosine residues. In contrast to SH2 domains, which recognize phosphotyrosine and adjacent carboxy-terminal residues, PTB-domain binding specificity is conferred by residues amino-terminal to the phosphotyrosine. The PTB domain of SHC binds to a NPXpY sequence. More recent studies have found that some types of PTB domains such as the neuronal protein X11 and in the cell-fate determinant protein Numb can bind to peptides which are not tyrosine phosphorylated; whereas, other PTB domains can bind motifs lacking tyrosine residues altogether.
Changed!
smart PTB 121aa 3e-09
in ref transcript
Phosphotyrosine-binding domain, phosphotyrosine-interaction (PI) domain. PTB/PI domain structure similar to those of pleckstrin homology (PH) and IRS-1-like PTB domains.
smart SH2 77aa 0.002
in ref transcript
Src homology 2 domains. Src homology 2 domains bind phosphotyrosine-containing polypeptides via 2 surface pockets. Specificity is provided via interaction with residues that are distinct from the phosphotyrosine. Only a single occurrence of a SH2 domain has been found in S. cerevisiae.
SHMT1
rs.SHMT1.F1 rs.SHMT1.R1 215 332
NCBIGene 36.3 6470
Single exon skipping, size difference: 117
Exclusion in the protein (no frameshift)
Reference transcript: NM_004169
Changed!
cd SHMT 427aa 0.0
in ref transcript
Serine-glycine hydroxymethyltransferase (SHMT). This family belongs to pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I). SHMT carries out interconversion of serine and glycine; it catalyzes the transfer of hydroxymethyl group of N5, N10-methylene tetrahydrofolate to glycine resulting in the formation of serine and tetrahydrofolate. Both eukaryotic and prokaryotic SHMT enzymes form tight obligate homodimers; the mammalian enzyme forms a homotetramer comprising four pyridoxal phosphate-bound active sites.
Changed!
pfam SHMT 400aa 0.0
in ref transcript
Serine hydroxymethyltransferase.
Changed!
PTZ PTZ00094 457aa 0.0
in ref transcript
serine hydroxymethyltransferase; Provisional.
Changed!
cd SHMT 388aa 1e-167
in modified transcript
Changed!
pfam SHMT 361aa 1e-178
in modified transcript
Changed!
PTZ PTZ00094 418aa 0.0
in modified transcript
SHPRH
rs.SHPRH.F1 rs.SHPRH.R1 100 447
NCBIGene 36.3 257218
Alternative 3-prime, size difference: 347
Exclusion of the stop codon
Reference transcript: NM_001042683
cd H15 73aa 5e-11
in ref transcript
linker histone 1 and histone 5 domains; the basic subunit of chromatin is the nucleosome, consisting of an octamer of core histones, two full turns of DNA, a linker histone (H1 or H5) and a variable length of linker DNA; H1/H5 are chromatin-associated proteins that bind to the exterior of nucleosomes and dramatically stabilize the highly condensed states of chromatin fibers; stabilization of higher order folding occurs through electrostatic neutralization of the linker DNA segments, through a highly positively charged carboxy- terminal domain known as the AKP helix (Ala, Lys, Pro); thought to be involved in specific protein-protein and protein-DNA interactions and play a role in suppressing core histone tail domain acetylation in the chromatin fiber.
cd HELICc 121aa 1e-08
in ref transcript
Helicase superfamily c-terminal domain; associated with DEXDc-, DEAD-, and DEAH-box proteins, yeast initiation factor 4A, Ski2p, and Hepatitis C virus NS3 helicases; this domain is found in a wide variety of helicases and helicase related proteins; may not be an autonomously folding unit, but an integral part of the helicase; 4 helicase superfamilies at present according to the organization of their signature motifs; all helicases share the ability to unwind nucleic acid duplexes with a distinct directional polarity; they utilize the free energy from nucleoside triphosphate hydrolysis to fuel their translocation along DNA, unwinding the duplex in the process.
cd RING 51aa 1e-04
in ref transcript
RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
pfam SNF2_N 217aa 2e-14
in ref transcript
SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin unwinding (e.g., ISWI) as well as a variety of other proteins with little functional information (e.g., lodestar, ETL1).
pfam Linker_histone 73aa 2e-08
in ref transcript
linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types.
smart HELICc 81aa 7e-06
in ref transcript
helicase superfamily c-terminal domain.
pfam SNF2_N 25aa 1e-04
in ref transcript
smart RING 47aa 0.001
in ref transcript
Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
smart PHD 32aa 0.006
in ref transcript
PHD zinc finger. The plant homeodomain (PHD) finger is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in epigenetics and chromatin-mediated transcriptional regulation. The PHD finger binds two zinc ions using the so-called 'cross-brace' motif and is thus structurally related to the RI NG finger and the FYV E finger. It is not yet known if PHD fingers have a common molecular function. Several reports suggest that it can function as a protein-protein interacton domain and it was recently demonstrated that the PHD finger of p300 can cooperate with the adjacent BR OMO domain in nucleosome binding in vitro. Other reports suggesting that the PHD finger is a ubiquitin ligase have been refuted as these domains were RI NG fingers misidentified as PHD fingers.
Changed!
COG HepA 168aa 1e-16
in ref transcript
Superfamily II DNA/RNA helicases, SNF2 family [Transcription / DNA replication, recombination, and repair].
COG HepA 282aa 3e-13
in ref transcript
COG PEX10 73aa 0.009
in ref transcript
RING-finger-containing E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones].
Changed!
COG HepA 138aa 2e-15
in modified transcript
SIKE
rs.SIKE.F1 rs.SIKE.R1 100 112
NCBIGene 36.3 80143
Alternative 5-prime, size difference: 12
Exclusion in the protein (no frameshift)
Reference transcript: NM_001102396
Changed!
pfam DUF837 176aa 7e-57
in ref transcript
Protein of unknown function (DUF837). This family consists of several eukaryotic proteins of unknown function. One of the family members is a circulating cathodic antigen (CCA) found in Schistosoma mansoni (Blood fluke).
Changed!
pfam DUF837 172aa 8e-58
in modified transcript
SLA
rs.SLA.F1 rs.SLA.R1 105 383
NCBIGene 36.3 6503
Single exon skipping, size difference: 278
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001045557
Changed!
cd SH2 93aa 3e-20
in ref transcript
Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
Changed!
smart SH2 85aa 6e-21
in ref transcript
Src homology 2 domains. Src homology 2 domains bind phosphotyrosine-containing polypeptides via 2 surface pockets. Specificity is provided via interaction with residues that are distinct from the phosphotyrosine. Only a single occurrence of a SH2 domain has been found in S. cerevisiae.
Changed!
smart SH3 38aa 5e-04
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
SLC12A6
rs.SLC12A6.F1 rs.SLC12A6.R1 134 179
NCBIGene 36.3 9990
Single exon skipping, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_133647
TIGR 2a30 1042aa 0.0
in ref transcript
COG PotE 244aa 2e-16
in ref transcript
Amino acid transporters [Amino acid transport and metabolism].
COG PotE 201aa 5e-08
in ref transcript
SLC12A6
rs.SLC12A6.F2 rs.SLC12A6.R2 243 309
NCBIGene 36.3 9990
Alternative 3-prime, size difference: 66
Exclusion of the protein initiation site
Reference transcript: NM_001042496
TIGR 2a30 1042aa 0.0
in ref transcript
COG PotE 244aa 3e-16
in ref transcript
Amino acid transporters [Amino acid transport and metabolism].
COG PotE 201aa 7e-08
in ref transcript
SLC17A3
rs.SLC17A3.F1 rs.SLC17A3.R1 130 364
NCBIGene 36.3 10786
Single exon skipping, size difference: 234
Exclusion in the protein (no frameshift)
Reference transcript: NM_001098486
Changed!
cd MFS 356aa 9e-21
in ref transcript
The Major Facilitator Superfamily (MFS) is a large and diverse group of secondary transporters that includes uniporters, symporters, and antiporters. MFS proteins facilitate the transport across cytoplasmic or internal membranes of a variety of substrates including ions, sugar phosphates, drugs, neurotransmitters, nucleosides, amino acids, and peptides. They do so using the electrochemical potential of the transported substrates. Uniporters transport a single substrate, while symporters and antiporters transport two substrates in the same or in opposite directions, respectively, across membranes. MFS proteins are typically 400 to 600 amino acids in length, and the majority contain 12 transmembrane alpha helices (TMs) connected by hydrophilic loops. The N- and C-terminal halves of these proteins display weak similarity and may be the result of a gene duplication/fusion event. Based on kinetic studies and the structures of a few bacterial superfamily members, GlpT (glycerol-3-phosphate transporter), LacY (lactose permease), and EmrD (multidrug transporter), MFS proteins are thought to function through a single substrate binding site, alternating-access mechanism involving a rocker-switch type of movement. Bacterial members function primarily for nutrient uptake, and as drug-efflux pumps to confer antibiotic resistance. Some MFS proteins have medical significance in humans such as the glucose transporter Glut4, which is impaired in type II diabetes, and glucose-6-phosphate transporter (G6PT), which causes glycogen storage disease when mutated.
Changed!
TIGR 2A0114euk 475aa 1e-102
in ref transcript
Changed!
COG UhpC 228aa 2e-10
in ref transcript
Sugar phosphate permease [Carbohydrate transport and metabolism].
Changed!
cd MFS 272aa 3e-10
in modified transcript
Changed!
TIGR 2A0114euk 313aa 2e-72
in modified transcript
Changed!
TIGR 2A0114euk 50aa 6e-07
in modified transcript
Changed!
COG UhpC 166aa 4e-05
in modified transcript
SLC23A1
rs.SLC23A1.F1 rs.SLC23A1.R1 97 109
NCBIGene 36.3 9963
Alternative 5-prime, size difference: 12
Exclusion in the protein (no frameshift)
Reference transcript: NM_152685
Changed!
pfam Xan_ur_permease 437aa 8e-61
in ref transcript
Permease family. This family includes permeases for diverse substrates such as xanthine, uracil and vitamin C. However many members of this family are functionally uncharacterised and may transport other substrates. Members of this family have ten predicted transmembrane helices.
Changed!
COG UraA 499aa 2e-43
in ref transcript
Xanthine/uracil permeases [Nucleotide transport and metabolism].
Changed!
pfam Xan_ur_permease 433aa 1e-62
in modified transcript
Changed!
COG UraA 495aa 2e-44
in modified transcript
SLC25A45
rs.SLC25A45.F1 rs.SLC25A45.R1 117 172
NCBIGene 36.3 283130
Single exon skipping, size difference: 55
Exclusion of the protein initiation site
Reference transcript: NM_182556
pfam Mito_carr 92aa 5e-19
in ref transcript
Mitochondrial carrier protein.
pfam Mito_carr 92aa 8e-19
in ref transcript
Changed!
pfam Mito_carr 76aa 2e-15
in ref transcript
PTZ PTZ00169 163aa 1e-11
in ref transcript
ADP/ATP transporter on adenylate translocase; Provisional.
Changed!
PTZ PTZ00169 166aa 2e-11
in ref transcript
SLC35C2
rs.SLC35C2.F1 rs.SLC35C2.R1 142 205
NCBIGene 36.3 51006
Alternative 3-prime, size difference: 63
Exclusion in the protein (no frameshift)
Reference transcript: NM_173179
pfam TPT 149aa 5e-16
in ref transcript
Triose-phosphate Transporter family. This family includes transporters with a specificity for triose phosphate.
SLC35C2
rs.SLC35C2.F2 rs.SLC35C2.R2 108 303
NCBIGene 36.3 51006
Alternative 3-prime, size difference: 195
Inclusion in 5'UTR
Reference transcript: NM_015945
pfam TPT 149aa 5e-16
in ref transcript
Triose-phosphate Transporter family. This family includes transporters with a specificity for triose phosphate.
SLC38A1
rs.SLC38A1.F1 rs.SLC38A1.R1 191 530
NCBIGene 36.3 81539
Alternative 5-prime, size difference: 339
Exclusion in 5'UTR
Reference transcript: NM_030674
pfam Aa_trans 386aa 2e-42
in ref transcript
Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembrane domains. MTR is a N system amino acid transporter system protein involved in methyltryptophan resistance. Other members of this family include proline transporters and amino acid permeases.
PTZ PTZ00206 364aa 6e-22
in ref transcript
amino acid transporter; Provisional.
SLC3A2
rs.SLC3A2.F1 rs.SLC3A2.R1 313 403
NCBIGene 36.3 6520
Single exon skipping, size difference: 93
Exclusion in the protein (no frameshift)
Reference transcript: NM_001012661
TIGR trehalose_treC 136aa 4e-16
in ref transcript
Trehalose is a glucose disaccharide that serves in many biological systems as a compatible solute for protection against hyperosmotic and thermal stress. This family describes trehalose-6-phosphate hydrolase, product of the treC (or treA) gene, which is often found together with a trehalose uptake transporter and a trehalose operon repressor.
pfam Alpha-amylase 244aa 8e-14
in ref transcript
Alpha amylase, catalytic domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding between beta strand 3 and alpha helix 3, and a carboxyl-terminal Greek key beta-barrel domain.
TIGR treS_nterm 120aa 3e-08
in ref transcript
Trehalose synthase interconverts maltose and alpha, alpha-trehalose by transglucosylation. This is one of at least three mechanisms for biosynthesis of trehalose, an important and widespread compatible solute. However, it is not driven by phosphate activation of sugars and its physiological role may tend toward trehalose degradation. This view is accentuated by numerous examples of fusion to a probable maltokinase domain. The sequence region described by this model is found both as the whole of a trehalose synthase and as the N-terminal region of a larger fusion protein that includes trehalose synthase activity. Several of these fused trehalose synthases have a domain homologous to proteins with maltokinase activity from Actinoplanes missouriensis and Streptomyces coelicolor (PMID:15378530).
COG AmyA 340aa 3e-18
in ref transcript
Glycosidases [Carbohydrate transport and metabolism].
SLC3A2
rs.SLC3A2.F2 rs.SLC3A2.R2 231 324
NCBIGene 36.3 6520
Single exon skipping, size difference: 93
Exclusion in the protein (no frameshift)
Reference transcript: NM_001012661
TIGR trehalose_treC 136aa 4e-16
in ref transcript
Trehalose is a glucose disaccharide that serves in many biological systems as a compatible solute for protection against hyperosmotic and thermal stress. This family describes trehalose-6-phosphate hydrolase, product of the treC (or treA) gene, which is often found together with a trehalose uptake transporter and a trehalose operon repressor.
pfam Alpha-amylase 244aa 8e-14
in ref transcript
Alpha amylase, catalytic domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding between beta strand 3 and alpha helix 3, and a carboxyl-terminal Greek key beta-barrel domain.
TIGR treS_nterm 120aa 3e-08
in ref transcript
Trehalose synthase interconverts maltose and alpha, alpha-trehalose by transglucosylation. This is one of at least three mechanisms for biosynthesis of trehalose, an important and widespread compatible solute. However, it is not driven by phosphate activation of sugars and its physiological role may tend toward trehalose degradation. This view is accentuated by numerous examples of fusion to a probable maltokinase domain. The sequence region described by this model is found both as the whole of a trehalose synthase and as the N-terminal region of a larger fusion protein that includes trehalose synthase activity. Several of these fused trehalose synthases have a domain homologous to proteins with maltokinase activity from Actinoplanes missouriensis and Streptomyces coelicolor (PMID:15378530).
COG AmyA 340aa 3e-18
in ref transcript
Glycosidases [Carbohydrate transport and metabolism].
SLC47A2
rs.SLC47A2.F1 rs.SLC47A2.R1 99 207
NCBIGene 36.3 146802
Alternative 3-prime, size difference: 108
Exclusion in the protein (no frameshift)
Reference transcript: NM_152908
Changed!
TIGR matE 373aa 6e-35
in ref transcript
The MATE family consists of probable efflux proteins including a functionally characterized multi drug efflux system from Vibrio parahaemolyticus, a putative ethionine resistance protein of Saccharomyces cerevisiae, and the functionally uncharacterized DNA damage-inducible protein F (DinF) of E. coli. These proteins have 12 probable TMS.
Changed!
TIGR matE 337aa 6e-39
in modified transcript
Changed!
COG NorM 435aa 7e-40
in modified transcript
SLC4A3
rs.SLC4A3.F1 rs.SLC4A3.R1 191 251
NCBIGene 36.3 6508
Alternative 3-prime, size difference: 60
Inclusion in 5'UTR
Reference transcript: NM_201574
TIGR ae 910aa 0.0
in ref transcript
They preferentially catalyze anion exchange (antiport) reactions, typically acting as HCO3-:Cl- antiporters, but also transporting a range of other inorganic and organic anions. Additionally, renal Na+:HCO3- cotransporters have been found to be members of the AE family. They catalyze the reabsorption of HCO3- in the renal proximal tubule.
SLC5A10
rs.SLC5A10.F1 rs.SLC5A10.R1 385 433
NCBIGene 36.3 125206
Alternative 3-prime, size difference: 48
Exclusion in the protein (no frameshift)
Reference transcript: NM_152351
Changed!
pfam SSF 446aa 3e-88
in ref transcript
Sodium:solute symporter family.
Changed!
PRK PRK10484 592aa 5e-41
in ref transcript
putative symporter YidK; Provisional.
Changed!
pfam SSF 430aa 3e-90
in modified transcript
Changed!
PRK PRK10484 576aa 3e-42
in modified transcript
SLC7A3
rs.SLC7A3.F1 rs.SLC7A3.R1 123 147
NCBIGene 36.3 84889
Alternative 3-prime, size difference: 24
Inclusion in 5'UTR
Reference transcript: NM_001048164
TIGR 2A0303 583aa 1e-180
in ref transcript
COG PotE 418aa 2e-34
in ref transcript
Amino acid transporters [Amino acid transport and metabolism].
SLC7A6
rs.SLC7A6.F1 rs.SLC7A6.R1 270 357
NCBIGene 36.3 9057
Single exon skipping, size difference: 87
Exclusion in 5'UTR
Reference transcript: NM_001076785
TIGR 2A0308 485aa 1e-147
in ref transcript
COG PotE 456aa 9e-35
in ref transcript
Amino acid transporters [Amino acid transport and metabolism].
SLC9A6
rs.SLC9A6.F1 rs.SLC9A6.R1 154 250
NCBIGene 36.3 10479
Alternative 5-prime, size difference: 96
Exclusion in the protein (no frameshift)
Reference transcript: NM_001042537
Changed!
TIGR b_cpa1 408aa 7e-72
in ref transcript
This Hmm is specific for the eukaryotic members members of this family.
Changed!
COG NhaP 359aa 7e-42
in ref transcript
NhaP-type Na+/H+ and K+/H+ antiporters [Inorganic ion transport and metabolism].
Changed!
TIGR b_cpa1 459aa 1e-72
in modified transcript
Changed!
COG NhaP 361aa 6e-41
in modified transcript
SLFN11
rs.SLFN11.F1 rs.SLFN11.R1 147 255
NCBIGene 36.3 91607
Alternative 5-prime, size difference: 108
Exclusion in 5'UTR
Reference transcript: NM_001104587
pfam AAA_4 101aa 5e-16
in ref transcript
Divergent AAA domain. This family is related to the pfam00004 family, and presumably has the same function (ATP-binding).
COG COG2865 87aa 2e-06
in ref transcript
Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen [Transcription].
SLFN11
rs.SLFN11.F2 rs.SLFN11.R2 100 162
NCBIGene 36.3 91607
Alternative 5-prime, size difference: 62
Exclusion in 5'UTR
Reference transcript: NM_001104587
pfam AAA_4 101aa 5e-16
in ref transcript
Divergent AAA domain. This family is related to the pfam00004 family, and presumably has the same function (ATP-binding).
COG COG2865 87aa 2e-06
in ref transcript
Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen [Transcription].
SLFN11
rs.SLFN11.F3 rs.SLFN11.R3 140 300
NCBIGene 36.3 91607
Single exon skipping, size difference: 160
Exclusion in 5'UTR
Reference transcript: NM_001104587
pfam AAA_4 101aa 5e-16
in ref transcript
Divergent AAA domain. This family is related to the pfam00004 family, and presumably has the same function (ATP-binding).
COG COG2865 87aa 2e-06
in ref transcript
Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen [Transcription].
SLTM
rs.SLTM.F1 rs.SLTM.R1 264 318
NCBIGene 36.3 79811
Alternative 3-prime, size difference: 54
Exclusion in the protein (no frameshift)
Reference transcript: NM_024755
cd RRM 75aa 6e-12
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
smart RRM 71aa 1e-11
in ref transcript
RNA recognition motif.
smart SAP 35aa 8e-06
in ref transcript
Putative DNA-binding (bihelical) motif predicted to be involved in chromosomal organisation.
COG COG0724 112aa 0.001
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
SMAD5
rs.SMAD5.F1 rs.SMAD5.R1 176 260
NCBIGene 36.3 4090
Single exon skipping, size difference: 84
Exclusion in 5'UTR
Reference transcript: NM_001001419
cd MH2 176aa 1e-73
in ref transcript
MH2 domain; C terminal domain of SMAD family proteins, responsible for receptor interaction, transactivation, and homo- and heterooligomerisation; also known as Domain B in dwarfin family proteins.
cd MH1 121aa 4e-34
in ref transcript
MH1 is a small DNA binding domain, binding in an unusal way involving a beta hairpin structure binding to the major groove. MH1 is present in Smad proteins, an important family of proteins involved in TGF-beta signalling and frequent targets of tumorigenic mutations. Also known as Domain A in dwarfin family proteins.
pfam MH2 176aa 1e-80
in ref transcript
MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins such as Smads. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins and provides specificity and selectivity to Smad function and also is critical for mediating interactions in Smad oligomers. Unlike MH1, MH2 does not bind DNA. The well-studied MH2 domain of Smad4 is composed of five alpha helices and three loops enclosing a beta sandwich. Smads are involved in the propagation of TGF-beta signals by direct association with the TGF-beta receptor kinase which phosphorylates the last two Ser of a conserved 'SSXS' motif located at the C-terminus of MH2.
pfam MH1 76aa 2e-35
in ref transcript
MH1 domain. The MH1 (MAD homology 1) domain is found at the amino terminus of MAD related proteins such as Smads. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain shows that a highly conserved 11 residue beta hairpin is used to bind the DNA consensus sequence GNCN in the major groove, shown to be vital for the transcriptional activation of target genes. Not all examples of MH1 can bind to DNA however. Smad2 cannot bind DNA and has a large insertion within the hairpin that presumably abolishes DNA binding. A basic helix (H2) in MH1 with the nuclear localisation signal KKLKK has been shown to be essential for Smad3 nuclear import. Smads also use the MH1 domain to interact with transcription factors such as Jun, TFE3, Sp1, and Runx.
SMAP1
rs.SMAP1.F1 rs.SMAP1.R1 151 232
NCBIGene 36.3 60682
Single exon skipping, size difference: 81
Exclusion in the protein (no frameshift)
Reference transcript: NM_001044305
pfam ArfGap 100aa 8e-37
in ref transcript
Putative GTPase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs.
COG COG5347 91aa 2e-29
in ref transcript
GTPase-activating protein that regulates ARFs (ADP-ribosylation factors), involved in ARF-mediated vesicular transport [Intracellular trafficking and secretion].
SMN1
rs.SMN1.F1 rs.SMN1.R1 118 214
NCBIGene 36.3 6606
Single exon skipping, size difference: 96
Exclusion in the protein (no frameshift)
Reference transcript: NM_000344
cd TUDOR 48aa 2e-08
in ref transcript
Tudor domains are found in many eukaryotic organisms and have been implicated in protein-protein interactions in which methylated protein substrates bind to these domains. For example, the Tudor domain of Survival of Motor Neuron (SMN) binds to symmetrically dimethylated arginines of arginine-glycine (RG) rich sequences found in the C-terminal tails of Sm proteins. The SMN protein is linked to spinal muscular atrophy. Another example is the tandem tudor domains of 53BP1, which bind to histone H4 specifically dimethylated at Lys20 (H4-K20me2). 53BP1 is a key transducer of the DNA damage checkpoint signal.
Changed!
pfam SMN 166aa 3e-40
in ref transcript
Survival motor neuron protein (SMN). This family consists of several eukaryotic survival motor neuron (SMN) proteins. The Survival of Motor Neurons (SMN) protein, the product of the spinal muscular atrophy-determining gene, is part of a large macromolecular complex (SMN complex) that functions in the assembly of spliceosomal small nuclear ribonucleoproteins (snRNPs). The SMN complex functions as a specificity factor essential for the efficient assembly of Sm proteins on U snRNAs and likely protects cells from illicit, and potentially deleterious, non-specific binding of Sm proteins to RNAs.
Changed!
pfam SMN 35aa 2e-11
in ref transcript
Changed!
pfam SMN 229aa 7e-52
in modified transcript
SMN2
rs.SMN2.F1 rs.SMN2.R1 123 177
NCBIGene 36.3 6607
Single exon skipping, size difference: 54
Exclusion of the stop codon
Reference transcript: NM_017411
cd TUDOR 48aa 2e-08
in ref transcript
Tudor domains are found in many eukaryotic organisms and have been implicated in protein-protein interactions in which methylated protein substrates bind to these domains. For example, the Tudor domain of Survival of Motor Neuron (SMN) binds to symmetrically dimethylated arginines of arginine-glycine (RG) rich sequences found in the C-terminal tails of Sm proteins. The SMN protein is linked to spinal muscular atrophy. Another example is the tandem tudor domains of 53BP1, which bind to histone H4 specifically dimethylated at Lys20 (H4-K20me2). 53BP1 is a key transducer of the DNA damage checkpoint signal.
pfam SMN 166aa 3e-40
in ref transcript
Survival motor neuron protein (SMN). This family consists of several eukaryotic survival motor neuron (SMN) proteins. The Survival of Motor Neurons (SMN) protein, the product of the spinal muscular atrophy-determining gene, is part of a large macromolecular complex (SMN complex) that functions in the assembly of spliceosomal small nuclear ribonucleoproteins (snRNPs). The SMN complex functions as a specificity factor essential for the efficient assembly of Sm proteins on U snRNAs and likely protects cells from illicit, and potentially deleterious, non-specific binding of Sm proteins to RNAs.
Changed!
pfam SMN 35aa 2e-11
in ref transcript
Changed!
pfam SMN 27aa 2e-08
in modified transcript
SNHG3-RCC1
rs.SNHG3-RCC1.F1 rs.SNHG3-RCC1.R1 103 246
NCBIGene 36.3 751867
Single exon skipping, size difference: 143
Exclusion in 5'UTR
Reference transcript: NM_001048197
pfam RCC1 50aa 1e-10
in ref transcript
Regulator of chromosome condensation (RCC1).
pfam RCC1 49aa 2e-10
in ref transcript
pfam RCC1 51aa 1e-09
in ref transcript
pfam RCC1 52aa 2e-09
in ref transcript
pfam RCC1 66aa 3e-08
in ref transcript
pfam RCC1 31aa 2e-04
in ref transcript
pfam RCC1 51aa 7e-04
in ref transcript
COG ATS1 350aa 3e-42
in ref transcript
Alpha-tubulin suppressor and related RCC1 domain-containing proteins [Cell division and chromosome partitioning / Cytoskeleton].
SNRK
rs.SNRK.F1 rs.SNRK.R1 338 400
NCBIGene 36.3 54861
Single exon skipping, size difference: 62
Exclusion in 5'UTR
Reference transcript: NM_017719
cd S_TKc 254aa 6e-82
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
smart S_TKc 243aa 3e-84
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
PTZ PTZ00263 258aa 5e-42
in ref transcript
protein kinase A catalytic subunit; Provisional.
SNUPN
rs.SNUPN.F1 rs.SNUPN.R1 201 306
NCBIGene 36.3 10073
Alternative 5-prime, size difference: 105
Exclusion in 5'UTR
Reference transcript: NM_005701
SNX14
rs.SNX14.F1 rs.SNX14.R1 161 293
NCBIGene 36.3 57231
Multiple exon skipping, size difference: 132
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_153816
cd PX_SNX14 123aa 3e-50
in ref transcript
The phosphoinositide binding Phox Homology domain of Sorting Nexin 14. The PX domain is a phosphoinositide (PI) binding module present in many proteins with diverse functions. Sorting nexins (SNXs) make up the largest group among PX domain containing proteins. They are involved in regulating membrane traffic and protein sorting in the endosomal system. The PX domain of SNXs binds PIs and targets the protein to PI-enriched membranes. SNXs differ from each other in PI-binding specificity and affinity, and the presence of other protein-protein interaction domains, which help determine subcellular localization and specific function in the endocytic pathway. SNX14 may be involved in recruiting other proteins to the membrane via protein-protein and protein-ligand interaction. It is expressed in the embryonic nervous system of mice, and is co-expressed in the motoneurons and the anterior pituary with Islet-1. SNX14 shows a similar domain architecture as SNX13, containing an N-terminal PXA domain, a regulator of G protein signaling (RGS) domain, a PX domain, and a C-terminal domain that is conserved in some SNXs.
pfam Nexin_C 106aa 5e-25
in ref transcript
Sorting nexin C terminal. This region is found a the C terminal of proteins belonging to the sorting nexin family. It is found on proteins which also contain pfam00787.
Changed!
smart PXA 156aa 6e-22
in ref transcript
Domain associated with PX domains. unpubl. observations.
pfam PX 105aa 2e-17
in ref transcript
PX domain. PX domains bind to phosphoinositides.
smart RGS 128aa 2e-07
in ref transcript
Regulator of G protein signalling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits.
COG COG5391 140aa 0.002
in ref transcript
Phox homology (PX) domain protein [Intracellular trafficking and secretion / General function prediction only].
Changed!
smart PXA 131aa 2e-12
in modified transcript
SNX21
rs.SNX21.F1 rs.SNX21.R1 99 110
NCBIGene 36.3 90203
Single exon skipping, size difference: 11
Inclusion in the protein causing a frameshift
Reference transcript: NM_033421
Changed!
cd PX_SNX21 112aa 1e-55
in ref transcript
The phosphoinositide binding Phox Homology domain of Sorting Nexin 21. The PX domain is a phosphoinositide (PI) binding module present in many proteins with diverse functions. Sorting nexins (SNXs) make up the largest group among PX domain containing proteins. They are involved in regulating membrane traffic and protein sorting in the endosomal system. The PX domain of SNXs binds PIs and targets the protein to PI-enriched membranes. SNXs differ from each other in PI-binding specificity and affinity, and the presence of other protein-protein interaction domains, which help determine subcellular localization and specific function in the endocytic pathway. Some SNXs are localized in early endosome structures such as clathrin-coated pits, while others are located in late structures of the endocytic pathway. SNX21, also called SNX-L, is distinctly and highly-expressed in fetal liver and may be involved in protein sorting and degradation during embryonic liver development.
Changed!
pfam PX 106aa 8e-12
in ref transcript
PX domain. PX domains bind to phosphoinositides.
Changed!
COG COG5391 112aa 3e-04
in ref transcript
Phox homology (PX) domain protein [Intracellular trafficking and secretion / General function prediction only].
Changed!
cd PX_SNX21 19aa 1e-04
in modified transcript
SOD2
rs.SOD2.F1 rs.SOD2.R1 283 400
NCBIGene 36.3 6648
Single exon skipping, size difference: 117
Exclusion in the protein (no frameshift)
Reference transcript: NM_001024465
Changed!
pfam Sod_Fe_C 106aa 3e-49
in ref transcript
Iron/manganese superoxide dismutases, C-terminal domain. superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to hydrogen peroxide and molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the Mn/Fe-binding family is one. In humans, there is a cytoplasmic Cu/Zn SOD, and a mitochondrial Mn/Fe SOD. C-terminal domain is a mixed alpha/beta fold.
Changed!
pfam Sod_Fe_N 82aa 3e-34
in ref transcript
Iron/manganese superoxide dismutases, alpha-hairpin domain. superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to hydrogen peroxide and molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the Mn/Fe-binding family is one. In humans, there is a cytoplasmic Cu/Zn SOD, and a mitochondrial Mn/Fe SOD. N-terminal domain is a long alpha antiparallel hairpin. A small fragment of YTRE_LEPBI matches well - sequencing error?.
Changed!
COG SodA 200aa 1e-69
in ref transcript
Superoxide dismutase [Inorganic ion transport and metabolism].
Changed!
pfam Sod_Fe_C 103aa 3e-47
in modified transcript
Changed!
pfam Sod_Fe_N 59aa 6e-21
in modified transcript
Changed!
COG SodA 161aa 3e-52
in modified transcript
Inclusion in the protein (no stop codon or frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_001034954
cd SH3 54aa 3e-12
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
cd SH3 52aa 8e-11
in ref transcript
cd SH3 52aa 6e-10
in ref transcript
smart SH3 59aa 4e-13
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
smart SH3 53aa 3e-12
in ref transcript
smart SH3 59aa 7e-11
in ref transcript
pfam Sorb 41aa 4e-10
in ref transcript
Sorbin homologous domain.
SORBS1
rs.SORBS1.F2 rs.SORBS1.R2 106 208
NCBIGene 36.3 10580
Single exon skipping, size difference: 102
Exclusion in the protein (no frameshift)
Reference transcript: NM_001034954
cd SH3 54aa 3e-12
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
cd SH3 52aa 8e-11
in ref transcript
cd SH3 52aa 6e-10
in ref transcript
smart SH3 59aa 4e-13
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
smart SH3 53aa 3e-12
in ref transcript
smart SH3 59aa 7e-11
in ref transcript
pfam Sorb 41aa 4e-10
in ref transcript
Sorbin homologous domain.
SORBS2
rs.SORBS2.F1 rs.SORBS2.R1 156 206
NCBIGene 36.3 8470
Single exon skipping, size difference: 50
Inclusion in 5'UTR
Reference transcript: NM_021069
cd SH3 52aa 8e-14
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
cd SH3 54aa 9e-12
in ref transcript
cd SH3 54aa 3e-11
in ref transcript
pfam Sorb 50aa 9e-21
in ref transcript
Sorbin homologous domain.
smart SH3 53aa 5e-16
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
smart SH3 59aa 2e-12
in ref transcript
smart SH3 59aa 2e-12
in ref transcript
SP8
rs.SP8.F1 rs.SP8.R1 128 224
NCBIGene 36.3 221833
Single exon skipping, size difference: 96
Inclusion in the protein causing a new stop codon
Reference transcript: NM_182700
SPAG11B
rs.SPAG11B.F1 rs.SPAG11B.R1 195 271
NCBIGene 36.3 10407
Single exon skipping, size difference: 76
Inclusion in the protein causing a frameshift
Reference transcript: NM_058207
Changed!
pfam Defensin_beta 36aa 1e-04
in ref transcript
Beta defensin. The beta defensins are antimicrobial peptides implicated in the resistance of epithelial surfaces to microbial colonisation.
SPAG11B
rs.SPAG11B.F2 rs.SPAG11B.R2 117 195
NCBIGene 36.3 10407
Single exon skipping, size difference: 78
Exclusion in the protein (no frameshift)
Reference transcript: NM_058200
pfam Sperm_Ag_HE2 70aa 2e-28
in ref transcript
Sperm antigen HE2. This family consists of several variants of the human and chimpanzee sperm antigen proteins (HE2 and EP2 respectively). The EP2 gene codes for a family of androgen-dependent, epididymis-specific secretory proteins.The EP2 gene uses alternative promoters and differential splicing to produce a family of variant messages. The translated putative protein variants differ significantly from each other. Some of these putative proteins have similarity to beta-defensins, a family of antimicrobial peptides.
SPAG11B
rs.SPAG11B.F3 rs.SPAG11B.R3 180 256
NCBIGene 36.3 10407
Single exon skipping, size difference: 76
Inclusion in the protein causing a frameshift
Reference transcript: NM_058201
Changed!
pfam Sperm_Ag_HE2 70aa 1e-30
in ref transcript
Sperm antigen HE2. This family consists of several variants of the human and chimpanzee sperm antigen proteins (HE2 and EP2 respectively). The EP2 gene codes for a family of androgen-dependent, epididymis-specific secretory proteins.The EP2 gene uses alternative promoters and differential splicing to produce a family of variant messages. The translated putative protein variants differ significantly from each other. Some of these putative proteins have similarity to beta-defensins, a family of antimicrobial peptides.
Changed!
pfam Defensin_beta 36aa 2e-06
in ref transcript
Beta defensin. The beta defensins are antimicrobial peptides implicated in the resistance of epithelial surfaces to microbial colonisation.
Changed!
pfam Sperm_Ag_HE2 74aa 1e-29
in modified transcript
SPAG6
rs.SPAG6.F1 rs.SPAG6.R1 131 277
NCBIGene 36.3 9576
Single exon skipping, size difference: 146
Exclusion in the protein causing a frameshift
Reference transcript: NM_012443
cd ARM 119aa 7e-16
in ref transcript
Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
cd ARM 119aa 4e-14
in ref transcript
cd ARM 119aa 6e-10
in ref transcript
cd ARM 78aa 0.006
in ref transcript
pfam Arm 41aa 3e-05
in ref transcript
Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
pfam Adaptin_N 359aa 0.003
in ref transcript
Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pits and vesicles. The N-terminal region of the various adaptor proteins (APs) is constant by comparison to the C-terminal which is variable within members of the AP-2 family; and it has been proposed that this constant region interacts with another uniform component of the coated vesicles.
COG SRP1 312aa 1e-19
in ref transcript
Karyopherin (importin) alpha [Intracellular trafficking and secretion].
SPG3A
rs.SPG3A.F1 rs.SPG3A.R1 123 138
NCBIGene 36.3 51062
Single exon skipping, size difference: 15
Exclusion in the protein (no frameshift)
Reference transcript: NM_015915
cd GBP 247aa 1e-55
in ref transcript
Guanylate-binding protein (GBP), N-terminal domain. Guanylate-binding proteins (GBPs) define a group of proteins that are synthesized after activation of the cell by interferons. The biochemical properties of GBPs are clearly different from those of Ras-like and heterotrimeric GTP-binding proteins. They bind guanine nucleotides with low affinity (micromolar range), are stable in their absence and have a high turnover GTPase. In addition to binding GDP/GTP, they have the unique ability to bind GMP with equal affinity and hydrolyze GTP not only to GDP, but also to GMP. Furthermore, two unique regions around the base and the phosphate-binding areas, the guanine and the phosphate caps, respectively, give the nucleotide-binding site a unique appearance not found in the canonical GTP-binding proteins. The phosphate cap, which constitutes the region analogous to switch I, completely shields the phosphate-binding site from solvent such that a potential GTPase-activating protein (GAP) cannot approach.
pfam GBP 272aa 1e-101
in ref transcript
Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP.
SPOP
rs.SPOP.F1 rs.SPOP.R1 193 245
NCBIGene 36.3 8405
Single exon skipping, size difference: 52
Exclusion in 5'UTR
Reference transcript: NM_001007230
cd MATH_SPOP 139aa 8e-70
in ref transcript
Speckle-type POZ protein (SPOP) family, MATH domain; composed of proteins with similarity to human SPOP. SPOP was isolated as a novel antigen recognized by serum from a scleroderma patient, whose overexpression in COS cells results in a discrete speckled pattern in the nuclei. It contains an N-terminal MATH domain and a C-terminal BTB (also called POZ) domain. Together with Cul3, SPOP constitutes an ubiquitin E3 ligase which is able to ubiquitinate the PcG protein BMI1, the variant histone macroH2A1 and the death domain-associated protein Daxx. Therefore, SPOP may be involved in the regulation of these proteins and may play a role in transcriptional regulation, apoptosis and X-chromosome inactivation. Cul3 binds to the BTB domain of SPOP whereas Daxx and the macroH2A1 nonhistone region have been shown to bind to the MATH domain. Both MATH and BTB domains are necessary for the nuclear speckled accumulation of SPOP. There are many proteins, mostly uncharacterized, containing both MATH and BTB domains from C. elegans and plants which are excluded from this family.
pfam BTB 108aa 3e-21
in ref transcript
BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of pox virus proteins. The BTB/POZ domain mediates homomeric dimerisation and in some instances heteromeric dimerisation. The structure of the dimerised PLZF BTB/POZ domain has been solved and consists of a tightly intertwined homodimer. The central scaffolding of the protein is made up of a cluster of alpha-helices flanked by short beta-sheets at both the top and bottom of the molecule. POZ domains from several zinc finger proteins have been shown to mediate transcriptional repression and to interact with components of histone deacetylase co-repressor complexes including N-CoR and SMRT. The POZ or BTB domain is also known as BR-C/Ttk or ZiN.
smart MATH 102aa 8e-10
in ref transcript
meprin and TRAF homology.
SPP1
rs.SPP1.F1 rs.SPP1.R1 250 331
NCBIGene 36.3 6696
Single exon skipping, size difference: 81
Exclusion in the protein (no frameshift)
Reference transcript: NM_001040058
Changed!
pfam Osteopontin 314aa 1e-96
in ref transcript
Osteopontin.
Changed!
pfam Osteopontin 287aa 2e-76
in modified transcript
SPP1
rs.SPP1.F2 rs.SPP1.R2 373 415
NCBIGene 36.3 6696
Single exon skipping, size difference: 42
Exclusion in the protein (no frameshift)
Reference transcript: NM_001040058
Changed!
pfam Osteopontin 314aa 1e-96
in ref transcript
Osteopontin.
Changed!
pfam Osteopontin 300aa 9e-89
in modified transcript
SPPL2B
rs.SPPL2B.F1 rs.SPPL2B.R1 149 212
NCBIGene 36.3 56928
Alternative 5-prime, size difference: 63
Exclusion in 3'UTR
Reference transcript: NM_001077238
cd PA_hSPPL_like 120aa 6e-51
in ref transcript
PA_hSPPL_like: Protease-associated domain containing human signal peptide peptidase-like (hSPPL)-like. This group contains various PA domain-containing proteins similar to hSPPL2a and 2b. These SPPLs are GxGD aspartic proteases. SPPL2a is sorted to the late endosomes, SPPL2b to the plasma membrane. In activated dendritic cells, hSPPL2a and 2b catalyze the intramembrane proteolysis of tumor necrosis factor alpha triggering IL-12 production. hSPPL2a and 2b may have a broad substrate spectrum. The significance of the PA domain to these SPPLs has not been ascertained. It may be a protein-protein interaction domain. At peptidase active sites, the PA domain may participate in substrate binding and/or promoting conformational changes, which influence the stability and accessibility of the site to substrate.
pfam PA 81aa 9e-10
in ref transcript
PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family. It has been suggested that this domain forms a lid-like structure that covers the active site in active proteases, and is involved in protein recognition in vacuolar sorting receptors.
smart PSN 58aa 0.003
in ref transcript
Presenilin, signal peptide peptidase, family. Presenilin 1 and presenilin 2 are polytopic membrane proteins, whose genes are mutated in some individuals with Alzheimer's disease. Distant homologues, present in eukaryotes and archaea, also contain conserved aspartic acid residues which are predicted to contribute to catalysis. At least one member of this family has been shown to possess signal peptide peptidase activity.
SPRR3
rs.SPRR3.F1 rs.SPRR3.R1 107 191
NCBIGene 36.3 6707
Single exon skipping, size difference: 84
Exclusion in 5'UTR
Reference transcript: NM_005416
pfam Cornifin 137aa 1e-18
in ref transcript
Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of the SPRR gene family resides in the structure of the central segments of the encoded polypeptides that are built up from tandemly repeated units of either eight (SPRR1 and SPRR3) or nine (SPRR2) amino acids with the general consensus XKXPEPXX where X is any amino acid.
Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_014850
Changed!
cd RhoGAP_srGAP 188aa 6e-99
in ref transcript
RhoGAP_srGAP: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain present in srGAPs. srGAPs are components of the intracellular part of Slit-Robo signalling pathway that is important for axon guidance and cell migration. srGAPs contain an N-terminal FCH domain, a central RhoGAP domain and a C-terminal SH3 domain; this SH3 domain interacts with the intracellular proline-rich-tail of the Roundabout receptor (Robo). This interaction with Robo then activates the rhoGAP domain which in turn inhibits Cdc42 activity. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude.
cd SH3 50aa 2e-09
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
smart RhoGAP 172aa 1e-51
in ref transcript
GTPase-activator protein for Rho-like GTPases. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases. etter domain limits and outliers.
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
Changed!
cd RhoGAP_srGAP 177aa 1e-94
in modified transcript
SSBP3
rs.SSBP3.F1 rs.SSBP3.R1 364 424
NCBIGene 36.3 23648
Single exon skipping, size difference: 60
Exclusion in the protein (no frameshift)
Reference transcript: NM_145716
Changed!
pfam SSDP 144aa 2e-06
in ref transcript
Single-stranded DNA binding protein, SSDP. This is a family of eukaryotic single-stranded DNA binding proteins with specificity to a pyrimidine-rich element found in the promoter region of the alpha2(I) collagen gene.
Changed!
pfam SSDP 138aa 2e-07
in modified transcript
SSX5
rs.SSX5.F1 rs.SSX5.R1 330 453
NCBIGene 36.3 6758
Single exon skipping, size difference: 123
Exclusion in the protein (no frameshift)
Reference transcript: NM_021015
Changed!
smart KRAB 59aa 6e-11
in ref transcript
krueppel associated box.
pfam SSXRD 34aa 5e-08
in ref transcript
SSXRD motif. SSX1 can repress transcription, and this has been attributed to a putative Kruppel associated box (KRAB) repression domain at the N-terminus. However, from the analysis of these deletion constructs further repression activity was found at the C-terminus of SSX1. Which has been called the SSXRD (SSX Repression Domain). The potent repression exerted by full-length SSX1 appears to localise to this region.
Changed!
smart KRAB 60aa 6e-11
in modified transcript
STAG2
rs.STAG2.F1 rs.STAG2.R1 357 422
NCBIGene 36.3 10735
Single exon skipping, size difference: 65
Exclusion in 5'UTR
Reference transcript: NM_001042750
pfam STAG 107aa 4e-44
in ref transcript
STAG domain. STAG domain proteins are subunits of cohesin complex - a protein complex required for sister chromatid cohesion in eukaryotes. The STAG domain is present in Schizosaccharomyces pombe mitotic cohesin Psc3, and the meiosis specific cohesin Rec11. Many organisms express a meiosis-specific STAG protein, for example, mice and humans have a meiosis specific variant called STAG3, although budding yeast does not have a meiosis specific version.
COG IRR1 294aa 2e-18
in ref transcript
Cohesin [Cell division and chromosome partitioning].
STAG2
rs.STAG2.F2 rs.STAG2.R2 208 319
NCBIGene 36.3 10735
Single exon skipping, size difference: 111
Exclusion in the protein (no frameshift)
Reference transcript: NM_001042749
pfam STAG 107aa 4e-44
in ref transcript
STAG domain. STAG domain proteins are subunits of cohesin complex - a protein complex required for sister chromatid cohesion in eukaryotes. The STAG domain is present in Schizosaccharomyces pombe mitotic cohesin Psc3, and the meiosis specific cohesin Rec11. Many organisms express a meiosis-specific STAG protein, for example, mice and humans have a meiosis specific variant called STAG3, although budding yeast does not have a meiosis specific version.
COG IRR1 294aa 2e-18
in ref transcript
Cohesin [Cell division and chromosome partitioning].
STAG3L1
rs.STAG3L1.F1 rs.STAG3L1.R1 108 232
NCBIGene 36.3 54441
Single exon skipping, size difference: 124
Exclusion in the protein causing a frameshift
Reference transcript: NM_018991
Changed!
COG IRR1 114aa 1e-08
in ref transcript
Cohesin [Cell division and chromosome partitioning].
STAT3
rs.STAT3.F1 rs.STAT3.R1 105 127
NCBIGene 36.3 6774
Alternative 3-prime, size difference: 22
Exclusion in 5'UTR
Reference transcript: NM_139276
cd SH2 92aa 4e-06
in ref transcript
Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
pfam STAT_bind 254aa 1e-123
in ref transcript
STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. This family represents the DNA binding domain of STAT, which has an ig-like fold. STAT proteins also include an SH2 domain pfam00017.
pfam STAT_alpha 182aa 4e-63
in ref transcript
STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STAT proteins also include an SH2 domain pfam00017.
pfam STAT_int 115aa 4e-46
in ref transcript
STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STAT proteins also include an SH2 domain pfam00017.
pfam SH2 91aa 4e-09
in ref transcript
SH2 domain.
STAT3
rs.STAT3.F2 rs.STAT3.R2 102 152
NCBIGene 36.3 6774
Alternative 3-prime, size difference: 50
Exclusion in the protein causing a frameshift
Reference transcript: NM_139276
cd SH2 92aa 4e-06
in ref transcript
Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
pfam STAT_bind 254aa 1e-123
in ref transcript
STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. This family represents the DNA binding domain of STAT, which has an ig-like fold. STAT proteins also include an SH2 domain pfam00017.
pfam STAT_alpha 182aa 4e-63
in ref transcript
STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STAT proteins also include an SH2 domain pfam00017.
pfam STAT_int 115aa 4e-46
in ref transcript
STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STAT proteins also include an SH2 domain pfam00017.
pfam SH2 91aa 4e-09
in ref transcript
SH2 domain.
STAU1
rs.STAU1.F1 rs.STAU1.R1 177 300
NCBIGene 36.3 6780
Single exon skipping, size difference: 123
Inclusion in 5'UTR
Reference transcript: NM_017453
cd DSRM 67aa 1e-12
in ref transcript
Double-stranded RNA binding motif. Binding is not sequence specific but is highly specific for double stranded RNA. Found in a variety of proteins including dsRNA dependent protein kinase PKR, RNA helicases, Drosophila staufen protein, E. coli RNase III, RNases H1, and dsRNA dependent adenosine deaminases.
smart DSRM 67aa 3e-14
in ref transcript
Double-stranded RNA binding motif.
smart DSRM 32aa 0.001
in ref transcript
PRK rnc 72aa 1e-10
in ref transcript
ribonuclease III; Reviewed.
STEAP2
rs.STEAP2.F1 rs.STEAP2.R1 303 353
NCBIGene 36.3 261729
Exon skipping and alternative 3-prime or 5-prime, size difference: 50
Inclusion in 5'UTR, Exclusion in 5'UTR
Reference transcript: NM_001040665
pfam F420_oxidored 175aa 3e-37
in ref transcript
NADP oxidoreductase coenzyme F420-dependent.
COG COG2085 186aa 4e-26
in ref transcript
Predicted dinucleotide-binding enzymes [General function prediction only].
COG MgtA 151aa 0.001
in ref transcript
Cation transport ATPase [Inorganic ion transport and metabolism].
STK19
rs.STK19.F1 rs.STK19.R1 100 112
NCBIGene 36.3 8859
Alternative 5-prime, size difference: 12
Exclusion in the protein (no frameshift)
Reference transcript: NM_032454
Changed!
pfam Stk19 240aa 4e-83
in ref transcript
Serine-threonine protein kinase 19. This serine-threonine protein kinase number 19 is expressed from the MHC and predominantly in the nucleus. Protein kinases are involved in signal transduction pathways and play fundamental roles in the regulation of cell functions. This is a novel Ser/Thr protein kinase, that has Mn2+-dependent protein kinase activity that phosphorylates alpha -casein at Ser/Thr residues and histone at Ser residues. It can be covalently modified by the reactive ATP analogue 5'-p-fluorosulfonylbenzoyladenosine in the absence of ATP, and this modification is prevented in the presence of 1 mM ATP, indicating that the kinase domain of is capable of binding ATP.
Changed!
pfam Stk19 236aa 2e-83
in modified transcript
STRN3
rs.STRN3.F1 rs.STRN3.R1 147 399
NCBIGene 36.3 29966
Multiple exon skipping, size difference: 252
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_001083893
cd WD40 322aa 9e-49
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
pfam Striatin 135aa 1e-41
in ref transcript
Striatin family. Striatin is an intracellular protein which has a caveolin-binding motif, a coiled-coil structure, a calmodulin-binding site, and a WD (pfam00400) repeat domain. It acts as a scaffold protein and is involved in signalling pathways.
smart WD40 40aa 2e-06
in ref transcript
WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
smart WD40 40aa 5e-06
in ref transcript
smart WD40 34aa 1e-05
in ref transcript
smart WD40 38aa 2e-04
in ref transcript
COG COG2319 318aa 1e-20
in ref transcript
FOG: WD40 repeat [General function prediction only].
SUMF2
rs.SUMF2.F1 rs.SUMF2.R1 157 202
NCBIGene 36.3 25870
Single exon skipping, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_001042468
Changed!
pfam DUF323 269aa 9e-65
in ref transcript
Domain of unknown function (DUF323). This presumed domain is found in bacterial and eukaryotic proteins. In some cases these proteins also contain a protein kinase domain. The function of this domain is unknown. The domain has also been found in eukaryotic proteins required for post-translational sulphatase modification (SUMF1). These proteins are associated with the rare disorder multiple sulphatase deficiency (MSD).
Changed!
COG COG1262 275aa 1e-36
in ref transcript
Uncharacterized conserved protein [Function unknown].
Changed!
pfam DUF323 254aa 2e-63
in modified transcript
Changed!
COG COG1262 260aa 1e-37
in modified transcript
SUMO2
rs.SUMO2.F1 rs.SUMO2.R1 250 322
NCBIGene 36.3 6613
Single exon skipping, size difference: 72
Exclusion in the protein (no frameshift)
Reference transcript: NM_006937
Changed!
cd Sumo 87aa 6e-31
in ref transcript
Small ubiquitin-related modifier (SUMO) proteins are conjugated to numerous intracellular targets and serve to modulate protein interaction, localization, activity or stability. SUMO (also known as "Smt3" and "sentrin" in other organisms) is linked to several different pathways, including nucleocytoplasmic transport. Attachment of SUMO to targets proteins is stimulated by PIAS (Protein inhibitor of activated STATs) proteins which serve as E3-like ligases.
Changed!
smart UBQ 65aa 4e-08
in ref transcript
Ubiquitin homologues. Ubiquitin-mediated proteolysis is involved in the regulated turnover of proteins required for controlling cell cycle progression.
Changed!
COG SMT3 91aa 8e-18
in ref transcript
Ubiquitin-like protein (sentrin) [Posttranslational modification, protein turnover, chaperones].
Changed!
cd Sumo 63aa 1e-16
in modified transcript
Changed!
COG SMT3 67aa 3e-08
in modified transcript
SYN1
rs.SYN1.F1 rs.SYN1.R1 123 161
NCBIGene 36.3 6853
Alternative 3-prime, size difference: 38
Exclusion in the protein causing a frameshift
Reference transcript: NM_006950
pfam Synapsin_C 203aa 1e-121
in ref transcript
Synapsin, ATP binding domain. Ca dependent ATP binding in this ATP grasp fold. Function unknown.
pfam Synapsin 98aa 7e-52
in ref transcript
Synapsin, N-terminal domain.
pfam Synapsin_N 27aa 2e-10
in ref transcript
Synapsin N-terminal. This highly conserved domain of synapsin proteins has a serine at position 9 or 10 which is a phosphorylation site. The domain appears to be the part of the molecule that binds to calmodulin.
COG RimK 298aa 4e-06
in ref transcript
Glutathione synthase/Ribosomal protein S6 modification enzyme (glutaminyl transferase) [Coenzyme metabolism / Translation, ribosomal structure and biogenesis].
SYNE2
rs.SYNE2.F2 rs.SYNE2.R2 132 174
NCBIGene 36.3 23224
Alternative 3-prime, size difference: 42
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_182914
cd SPEC 214aa 1e-12
in ref transcript
Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here.
cd CH 103aa 5e-12
in ref transcript
Calponin homology domain; actin-binding domain which may be present as a single copy or in tandem repeats (which increases binding affinity). The CH domain is found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).
cd SPEC 215aa 1e-10
in ref transcript
cd CH 103aa 6e-10
in ref transcript
cd SPEC 214aa 4e-09
in ref transcript
cd SPEC 221aa 8e-08
in ref transcript
Changed!
cd SPEC 216aa 1e-07
in ref transcript
cd SPEC 207aa 2e-05
in ref transcript
pfam KASH 60aa 7e-14
in ref transcript
Nuclear envelope localisation domain. The KASH (for Klarsicht/ANC-1/Syne-1 homology) or KLS domain is a highly hydrophobic nuclear envelope localisation domain of approximately 60 amino acids comprising an 20-amino-acid transmembrane region and a 30-35-residue C-terminal region that lies between the inner and the outer nuclear membranes.
pfam CH 99aa 2e-13
in ref transcript
Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is not involved in its actin binding activity. Most proteins have two copies of the CH domain, however some proteins such as calponin have only a single copy.
smart CH 102aa 2e-13
in ref transcript
Calponin homology domain. Actin binding domains present in duplicate at the N-termini of spectrin-like proteins (including dystrophin, alpha-actinin). These domains cross-link actin filaments into bundles and networks. A calponin homology domain is predicted in yeasst Cdc24p.
pfam SMC_N 302aa 1e-06
in ref transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
TIGR SMC_prok_B 305aa 4e-05
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
smart SPEC 102aa 9e-05
in ref transcript
Spectrin repeats.
smart SPEC 104aa 3e-04
in ref transcript
TIGR SMC_prok_A 303aa 3e-04
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
TIGR SMC_prok_A 375aa 4e-04
in ref transcript
pfam SMC_N 281aa 0.002
in ref transcript
COG SAC6 255aa 3e-22
in ref transcript
Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton].
COG Smc 266aa 3e-04
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
Changed!
cd SPEC 230aa 1e-05
in modified transcript
SYTL2
rs.SYTL2.F1 rs.SYTL2.R1 110 158
NCBIGene 36.3 54843
Single exon skipping, size difference: 48
Exclusion in the protein (no frameshift)
Reference transcript: NM_206927
cd C2_1 127aa 9e-17
in ref transcript
Protein kinase C conserved region 2, subgroup 1; C2 Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (amongst others); some PKCs lack calcium dependence. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Two distinct C2 topologies generated by permutation of the sequence with respect to the N- and C-terminal beta strands are seen. In this subgroup, containing synaptotagmins, specific protein kinases C (PKC) subtypes and other proteins, the N-terminal beta strand occupies the position of what is the C-terminal strand in subgroup 2.
cd C2_1 128aa 4e-15
in ref transcript
pfam C2 90aa 2e-10
in ref transcript
C2 domain.
pfam C2 88aa 7e-10
in ref transcript
TAC4
rs.TAC4.F1 rs.TAC4.R1 102 120
NCBIGene 36.3 255061
Alternative 5-prime, size difference: 18
Exclusion in the protein (no frameshift)
Reference transcript: NM_170685
TAC4
rs.TAC4.F2 rs.TAC4.R2 139 172
NCBIGene 36.3 255061
Single exon skipping, size difference: 33
Exclusion in the protein (no frameshift)
Reference transcript: NM_170685
TAC4
rs.TAC4.F3 rs.TAC4.R3 164 224
NCBIGene 36.3 255061
Single exon skipping, size difference: 60
Exclusion in the protein (no frameshift)
Reference transcript: NM_170685
TAX1BP1
rs.TAX1BP1.F1 rs.TAX1BP1.R1 200 326
NCBIGene 36.3 8887
Alternative 5-prime, size difference: 126
Exclusion in the protein (no frameshift)
Reference transcript: NM_006024
pfam CALCOCO1 454aa 3e-51
in ref transcript
Calcium binding and coiled-coil domain (CALCOCO1) like. Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.
TIGR SMC_prok_B 249aa 2e-05
in ref transcript
SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
COG Smc 316aa 2e-10
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
TBCE
rs.TBCE.F1 rs.TBCE.R1 154 202
NCBIGene 36.3 6905
Alternative 5-prime, size difference: 48
Exclusion in 5'UTR
Reference transcript: NM_001079515
cd LRR_RI 159aa 7e-05
in ref transcript
Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
pfam CAP_GLY 67aa 9e-20
in ref transcript
CAP-Gly domain. Cytoskeleton-associated proteins (CAPs) are involved in the organisation of microtubules and transportation of vesicles and organelles along the cytoskeletal network. A conserved motif, CAP-Gly, has been identified in a number of CAPs, including CLIP-170 and dynactins. The crystal structure of Caenorhabditis elegans F53F4.3 protein CAP-Gly domain was recently solved. The domain contains three beta-strands. The most conserved sequence, GKNDG, is located in two consecutive sharp turns on the surface, forming the entrance to a groove.
COG NIP100 67aa 1e-07
in ref transcript
Dynactin complex subunit involved in mitotic spindle partitioning in anaphase B [Cell division and chromosome partitioning].
COG COG4886 217aa 4e-07
in ref transcript
Leucine-rich repeat (LRR) protein [Function unknown].
TBX3
rs.TBX3.F1 rs.TBX3.R1 283 343
NCBIGene 36.3 6926
Single exon skipping, size difference: 60
Exclusion in the protein (no frameshift)
Reference transcript: NM_016569
Changed!
cd TBOX 202aa 1e-85
in ref transcript
T-box DNA binding domain of the T-box family of transcriptional regulators. The T-box family is an ancient group that appears to play a critical role in development in all animal species. These genes were uncovered on the basis of similarity to the DNA binding domain of murine Brachyury (T) gene product, the defining feature of the family. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development and conserved expression patterns, most of the known genes in all species being expressed in mesoderm or mesoderm precursors.
Changed!
pfam T-box 200aa 1e-97
in ref transcript
T-box. The T-box encodes a 180 amino acid domain that binds to DNA. Genes encoding T-box proteins are found in a wide range of animals, but not in other kingdoms such as plants. Family members are all thought to bind to the DNA consensus sequence TCACACCT. they are found exclusively in the nucleus, and perform DNA-binding and transcriptional activation/repression roles. They are generally required for development of the specific tissues they are expressed in, and mutations in T-box genes are implicated in human conditions such as DiGeorge syndrome and X-linked cleft palate, which feature malformations.
Changed!
cd TBOX 182aa 4e-89
in modified transcript
Changed!
pfam T-box 180aa 1e-101
in modified transcript
TCEAL1
rs.TCEAL1.F1 rs.TCEAL1.R1 100 112
NCBIGene 36.3 9338
Alternative 5-prime, size difference: 12
Exclusion in 5'UTR
Reference transcript: NM_004780
pfam TFA 86aa 3e-12
in ref transcript
Transcription elongation factor A, SII-related family. The function of this family is unclear, but two members from Homo sapiesn are described as transcription elongation factor A, SII-like proteins.
TCEAL3
rs.TCEAL3.F1 rs.TCEAL3.R1 99 111
NCBIGene 36.3 85012
Alternative 5-prime, size difference: 12
Exclusion in 5'UTR
Reference transcript: NM_001006933
pfam TFA 133aa 2e-11
in ref transcript
Transcription elongation factor A, SII-related family. The function of this family is unclear, but two members from Homo sapiesn are described as transcription elongation factor A, SII-like proteins.
TCEAL4
rs.TCEAL4.F1 rs.TCEAL4.R1 178 237
NCBIGene 36.3 79921
Alternative 5-prime, size difference: 59
Exclusion in 5'UTR
Reference transcript: NM_024863
pfam TFA 79aa 5e-08
in ref transcript
Transcription elongation factor A, SII-related family. The function of this family is unclear, but two members from Homo sapiesn are described as transcription elongation factor A, SII-like proteins.
TCF12
rs.TCF12.F1 rs.TCF12.R1 90 100
NCBIGene 36.3 6938
Alternative 3-prime, size difference: 10
Inclusion in 5'UTR
Reference transcript: NM_207036
cd HLH 61aa 1e-06
in ref transcript
Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
smart HLH 54aa 7e-10
in ref transcript
helix loop helix domain.
TCF19
rs.TCF19.F1 rs.TCF19.R1 124 370
NCBIGene 36.3 6941
Alternative 3-prime, size difference: 246
Inclusion in 5'UTR
Reference transcript: NM_001077511
cd FHA 98aa 2e-08
in ref transcript
Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain. FHA domains may bind phosphothreonine, phosphoserine and sometimes phosphotyrosine. In eukaryotes, many FHA domain-containing proteins localize to the nucleus, where they participate in establishing or maintaining cell cycle checkpoints, DNA repair, or transcriptional regulation. Members of the FHA family include: Dun1, Rad53, Cds1, Mek1, KAPP(kinase-associated protein phosphatase),and Ki-67 (a human nuclear protein related to cell proliferation).
pfam FHA 72aa 4e-06
in ref transcript
FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif.
pfam PHD 36aa 2e-05
in ref transcript
PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
TCF4
rs.TCF4.F1 rs.TCF4.R1 101 113
NCBIGene 36.3 6925
Alternative 5-prime, size difference: 12
Exclusion in the protein (no frameshift)
Reference transcript: NM_001083962
cd HLH 61aa 2e-07
in ref transcript
Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_001093728
Changed!
pfam Tcp11 433aa 1e-130
in ref transcript
T-complex protein 11. This family consists of several eukaryotic T-complex protein 11 (Tcp11) related sequences. Tcp11 is only expressed in fertile adult mammalian testes and is thought to be important in sperm function and fertility. The family also contains the yeast Sok1 protein which is known to suppress cyclic AMP-dependent protein kinase mutants.
Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_001082538
Changed!
pfam DUF1619 310aa 1e-106
in ref transcript
Protein of unknown function (DUF1619). This is a family of sequences derived from hypothetical eukaryotic proteins. The region in question is approximately 330 residues long and has a cysteine rich amino-terminus.
Changed!
pfam DUF1619 296aa 8e-88
in modified transcript
TCTN1
rs.TCTN1.F2 rs.TCTN1.R2 100 115
NCBIGene 36.3 79600
Alternative 3-prime, size difference: 15
Exclusion in the protein (no frameshift)
Reference transcript: NM_001082538
pfam DUF1619 310aa 1e-106
in ref transcript
Protein of unknown function (DUF1619). This is a family of sequences derived from hypothetical eukaryotic proteins. The region in question is approximately 330 residues long and has a cysteine rich amino-terminus.
TDRKH
rs.TDRKH.F1 rs.TDRKH.R1 143 278
NCBIGene 36.3 11022
Alternative 5-prime and 3-prime, size difference: 135
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_006862
Changed!
cd KH-I 63aa 6e-12
in ref transcript
K homology RNA-binding domain, type I. KH binds single-stranded RNA or DNA. It is found in a wide variety of proteins including ribosomal proteins, transcription factors and post-transcriptional modifiers of mRNA. There are two different KH domains that belong to different protein folds, but they share a single KH motif. The KH motif is folded into a beta alpha alpha beta unit. In addition to the core, type II KH domains (e.g. ribosomal protein S3) include N-terminal extension and type I KH domains (e.g. hnRNP K) contain C-terminal extension.
cd TUDOR 48aa 2e-08
in ref transcript
Tudor domains are found in many eukaryotic organisms and have been implicated in protein-protein interactions in which methylated protein substrates bind to these domains. For example, the Tudor domain of Survival of Motor Neuron (SMN) binds to symmetrically dimethylated arginines of arginine-glycine (RG) rich sequences found in the C-terminal tails of Sm proteins. The SMN protein is linked to spinal muscular atrophy. Another example is the tandem tudor domains of 53BP1, which bind to histone H4 specifically dimethylated at Lys20 (H4-K20me2). 53BP1 is a key transducer of the DNA damage checkpoint signal.
pfam TUDOR 120aa 8e-30
in ref transcript
Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein.
Changed!
pfam KH_1 62aa 4e-13
in ref transcript
KH domain. KH motifs can bind RNA in vitro. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia.
Changed!
PRK PRK11824 65aa 0.001
in ref transcript
Changed!
cd KH-I 59aa 1e-09
in modified transcript
Changed!
pfam KH_1 59aa 6e-11
in modified transcript
Changed!
PRK PRK11824 33aa 0.004
in modified transcript
TDRKH
rs.TDRKH.F2 rs.TDRKH.R2 141 166
NCBIGene 36.3 11022
Alternative 5-prime, size difference: 25
Exclusion in 5'UTR
Reference transcript: NM_006862
cd KH-I 63aa 6e-12
in ref transcript
K homology RNA-binding domain, type I. KH binds single-stranded RNA or DNA. It is found in a wide variety of proteins including ribosomal proteins, transcription factors and post-transcriptional modifiers of mRNA. There are two different KH domains that belong to different protein folds, but they share a single KH motif. The KH motif is folded into a beta alpha alpha beta unit. In addition to the core, type II KH domains (e.g. ribosomal protein S3) include N-terminal extension and type I KH domains (e.g. hnRNP K) contain C-terminal extension.
cd TUDOR 48aa 2e-08
in ref transcript
Tudor domains are found in many eukaryotic organisms and have been implicated in protein-protein interactions in which methylated protein substrates bind to these domains. For example, the Tudor domain of Survival of Motor Neuron (SMN) binds to symmetrically dimethylated arginines of arginine-glycine (RG) rich sequences found in the C-terminal tails of Sm proteins. The SMN protein is linked to spinal muscular atrophy. Another example is the tandem tudor domains of 53BP1, which bind to histone H4 specifically dimethylated at Lys20 (H4-K20me2). 53BP1 is a key transducer of the DNA damage checkpoint signal.
pfam TUDOR 120aa 8e-30
in ref transcript
Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein.
pfam KH_1 62aa 4e-13
in ref transcript
KH domain. KH motifs can bind RNA in vitro. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia.
Telomeric Repeat binding Factor or TTAGGG Repeat binding Factor, central (dimerization) domain Homology; TRFH. Telomeres are protein/DNA complexes that make up the physical ends of eukaryotic linear chromosomes and are essential for chromosome stability, protecting the chromosome ends from degradation and end-to-end fusion. Proteins TRF1, TRF2 and Taz1 bind telomeric DNA and are also involved in recruiting interacting proteins, TIN2, and Rap1, to the telomeres. It has also been demonstrated that PARP1 associates with TRF2 and is capable of poly(ADP-ribosyl)ation of TRF2, which affects binding of TRF2 to telomeric DNA. TRF1, TRF2 and Taz1 proteins contain three functional domains: an N-terminal acidic domain, a central TRF-specific/dimerization domain, and a C-terminal DNA binding domain with a single Myb-like repeat. Homodimerization, a prerequisite to DNA binding, results in the juxtaposition of two Myb DNA binding domains.
cd SANT 47aa 2e-05
in ref transcript
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.
pfam TRF 181aa 1e-78
in ref transcript
Telomere repeat binding factor (TRF). Telomere repeat binding factor (TRF) family proteins are important for the regulation of telomere stability. The two related human TRF proteins hTRF1 and hTRF2 form homodimers and bind directly to telomeric TTAGGG repeats via the myb DNA binding domain pfam00249 at the carboxy terminus. TRF1 is implicated in telomere length regulation and TRF2 in telomere protection. Other telomere complex associated proteins are recruited through their interaction with either TRF1 or TRF2. The fission yeast protein Taz1p (telomere-associated in Schizosaccharomyces pombe) has similarity to both hTRF1 and hTRF2 and may perform the dual functions of TRF1 and TRF2 at fission yeast telomeres. This domain is composed of multiple alpha helices arranged in a solenoid conformation similar to TPR repeats.
pfam Myb_DNA-binding 49aa 1e-07
in ref transcript
Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.
TERT
rs.TERT.F1 rs.TERT.R1 208 244
NCBIGene 36.3 7015
Alternative 3-prime, size difference: 36
Exclusion in the protein (no frameshift)
Reference transcript: NM_198253
cd TERT 112aa 8e-23
in ref transcript
TERT: Telomerase reverse transcriptase (TERT). Telomerase is a ribonucleoprotein (RNP) that synthesizes telomeric DNA repeats. The telomerase RNA subunit provides the template for synthesis of these repeats. The catalytic subunit of RNP is known as telomerase reverse transcriptase (TERT). The reverse transcriptase (RT) domain is located in the C-terminal region of the TERT polypeptide. Single amino acid substitutions in this region lead to telomere shortening and senescence. Telomerase is an enzyme that, in certain cells, maintains the physical ends of chromosomes (telomeres) during replication. In somatic cells, replication of the lagging strand requires the continual presence of an RNA primer approximately 200 nucleotides upstream, which is complementary to the template strand. Since there is a region of DNA less than 200 base pairs from the end of the chromosome where this is not possible, the chromosome is continually shortened. However, a surplus of repetitive DNA at the chromosome ends protects against the erosion of gene-encoding DNA. Telomerase is not normally expressed in somatic cells. It has been suggested that exogenous TERT may extend the lifespan of, or even immortalize, the cell. However, recent studies have shown that telomerase activity can be induced by a number of oncogenes. Conversely, the oncogene c-myc can be activated in human TERT immortalized cells. Sequence comparisons place the telomerase proteins in the RT family but reveal hallmarks that distinguish them from retroviral and retrotransposon relatives.
TH
rs.TH.F1 rs.TH.R1 100 112
NCBIGene 36.3 7054
Alternative 5-prime, size difference: 12
Exclusion in the protein (no frameshift)
Reference transcript: NM_199292
cd eu_TyrOH 298aa 1e-166
in ref transcript
Eukaryotic tyrosine hydroxylase (TyrOH); a member of the biopterin-dependent aromatic amino acid hydroxylase family of non-heme, iron(II)-dependent enzymes that also includes prokaryotic and eukaryotic phenylalanine-4-hydroxylase (PheOH) and eukaryotic tryptophan hydroxylase (TrpOH). TyrOH catalyzes the conversion of tyrosine to L-dihydroxyphenylalanine (L-DOPA), the rate-limiting step in the biosynthesis of the catecholamines dopamine, noradrenaline, and adrenaline.
cd ACT_TH 122aa 6e-30
in ref transcript
ACT domain of the nonheme iron-dependent aromatic amino acid hydroxylase, tyrosine hydroxylases (TH). TH catalyses the hydroxylation of L-Tyr to 3,4-dihydroxyphenylalanine, the rate limiting step in the biosynthesis of catecholamines (dopamine, noradrenaline and adrenaline), functioning as hormones and neurotransmitters. The enzyme is not regulated by its amino acid substrate, but instead by phosphorylation at several serine residues located N-terminal of the ACT domain, and by feedback inhibition by catecholamines at the active site. Members of this CD belong to the superfamily of ACT regulatory domains.
TIGR Tyr_3_monoox 457aa 0.0
in ref transcript
This model describes tyrosine 3-monooxygenase, a member of the family of tetrameric, biopterin-dependent aromatic amino acid hydroxylases found in metazoans. It is closely related to tetrameric phenylalanine-4-hydroxylase and tryptophan 5-monooxygenase, and more distantly related to the monomeric phenylalanine-4-hydroxylase found in some Gram-negative bacteria.
PRK phhA 238aa 5e-61
in ref transcript
phenylalanine 4-monooxygenase; Reviewed.
THADA
rs.THADA.F1 rs.THADA.R1 116 400
NCBIGene 36.3 63892
Alternative 5-prime, size difference: 284
Exclusion in 5'UTR
Reference transcript: NM_001083953
pfam DUF2428 304aa 3e-68
in ref transcript
Putative death-receptor fusion protein (DUF2428). This is a family of proteins conserved from plants to humans. The function is not known. Several members have been annotated as being HEAT repeat-containing proteins while others are designated as death-receptor interacting proteins, but neither of these could be confirmed.
COG COG5543 475aa 5e-37
in ref transcript
Uncharacterized conserved protein [Function unknown].
TIA1
rs.TIA1.F1 rs.TIA1.R1 100 133
NCBIGene 36.3 7072
Single exon skipping, size difference: 33
Exclusion in the protein (no frameshift)
Reference transcript: NM_022173
cd RRM 74aa 3e-19
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
cd RRM 73aa 1e-15
in ref transcript
cd RRM 69aa 6e-14
in ref transcript
Changed!
TIGR PABP-1234 269aa 2e-24
in ref transcript
There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, and of unknown tissue range.
Changed!
COG COG0724 177aa 1e-18
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
Changed!
COG COG0724 175aa 7e-12
in ref transcript
Changed!
TIGR PABP-1234 258aa 2e-27
in modified transcript
Changed!
COG COG0724 200aa 4e-19
in modified transcript
TIRAP
rs.TIRAP.F1 rs.TIRAP.R1 100 116
NCBIGene 36.3 114609
Alternative 5-prime, size difference: 16
Exclusion in 5'UTR
Reference transcript: NM_148910
smart TIR 83aa 3e-07
in ref transcript
Toll - interleukin 1 - resistance.
TJP2
rs.TJP2.F1 rs.TJP2.R1 101 542
NCBIGene 36.3 9414
Multiple exon skipping, size difference: 441
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_004817
cd PDZ_signaling 86aa 9e-15
in ref transcript
PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
cd PDZ_signaling 80aa 1e-09
in ref transcript
cd PDZ_signaling 75aa 1e-07
in ref transcript
smart GuKc 188aa 3e-33
in ref transcript
Guanylate kinase homologues. Active enzymes catalyze ATP-dependent phosphorylation of GMP to GDP. Structure resembles that of adenylate kinase. So-called membrane-associated guanylate kinase homologues (MAGUKs) do not possess guanylate kinase activities; instead at least some possess protein-binding functions.
smart PDZ 90aa 6e-14
in ref transcript
Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
smart PDZ 72aa 1e-10
in ref transcript
smart PDZ 78aa 3e-10
in ref transcript
pfam SH3_2 59aa 0.001
in ref transcript
Variant SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organisation. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel.
TMC domain. These sequences are similar to a region conserved amongst various protein products of the transmembrane channel-like (TMC) gene family, such as Transmembrane channel-like protein 3 and EVIN2 - this region is termed the TMC domain. Mutations in these genes are implicated in a number of human conditions, such as deafness and epidermodysplasia verruciformis. TMC proteins are thought to have important cellular roles, and may be modifiers of ion channels or transporters.
TMEM134
rs.TMEM134.F1 rs.TMEM134.R1 112 157
NCBIGene 36.3 80194
Single exon skipping, size difference: 45
Exclusion in the protein (no frameshift)
Reference transcript: NM_025124
Changed!
pfam DUF872 81aa 7e-19
in ref transcript
Eukaryotic protein of unknown function (DUF872). This family consists of several uncharacterised eukaryotic proteins. The function of this family is unknown.
Changed!
pfam DUF872 38aa 8e-13
in modified transcript
TMEM134
rs.TMEM134.F2 rs.TMEM134.R2 120 147
NCBIGene 36.3 80194
Alternative 3-prime, size difference: 27
Exclusion in the protein (no frameshift)
Reference transcript: NM_025124
pfam DUF872 81aa 7e-19
in ref transcript
Eukaryotic protein of unknown function (DUF872). This family consists of several uncharacterised eukaryotic proteins. The function of this family is unknown.
TMEM176B
rs.TMEM176B.F1 rs.TMEM176B.R1 341 452
NCBIGene 36.3 28959
Single exon skipping, size difference: 111
Exclusion in the protein (no frameshift)
Reference transcript: NM_014020
Changed!
pfam LR8 205aa 2e-75
in ref transcript
LR8 protein. This family consists of several LR8 like proteins from humans, mice and rats. The function of the human LR8 protein is unknown although it is known to be strongly expressed in the lung fibroblasts.
Changed!
pfam LR8 168aa 2e-61
in modified transcript
TMEM177
rs.TMEM177.F1 rs.TMEM177.R1 105 446
NCBIGene 36.3 80775
Alternative 5-prime, size difference: 341
Exclusion in 5'UTR
Reference transcript: NM_001105198
TMEM22
rs.TMEM22.F1 rs.TMEM22.R1 132 397
NCBIGene 36.3 80723
Alternative 5-prime, size difference: 265
Exclusion in 5'UTR
Reference transcript: NM_025246
TIGR 2A78 258aa 1e-08
in ref transcript
COG RhaT 288aa 2e-10
in ref transcript
Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only].
TMEM55B
rs.TMEM55B.F1 rs.TMEM55B.R1 106 127
NCBIGene 36.3 90809
Alternative 5-prime, size difference: 21
Exclusion in the protein (no frameshift)
Reference transcript: NM_001100814
Changed!
pfam Tmemb_55A 277aa 3e-85
in ref transcript
Transmembrane protein 55A. Members of this family catalyse the hydrolysis of the 4-position phosphate of phosphatidylinositol 4,5-bisphosphate, in the reaction: 1-phosphatidyl-myo-inositol 4,5-bisphosphate + H(2)O = 1-phosphatidyl-1D-myo-inositol 5-phosphate + phosphate.
Changed!
pfam Tmemb_55A 270aa 3e-84
in modified transcript
TMEM70
rs.TMEM70.F1 rs.TMEM70.R1 102 116
NCBIGene 36.3 54968
Alternative 5-prime, size difference: 14
Inclusion in the protein causing a new stop codon
Reference transcript: NM_017866
Changed!
pfam DUF1301 135aa 6e-56
in ref transcript
Protein of unknown function (DUF1301). This family contains a number of eukaryotic proteins of unknown function that are approximately 160 residues long.
TMEM80
rs.TMEM80.F1 rs.TMEM80.R1 136 160
NCBIGene 36.3 283232
Alternative 5-prime, size difference: 24
Exclusion in the protein (no frameshift)
Reference transcript: NM_001042463
pfam Transmemb_17 68aa 1e-05
in ref transcript
Predicted membrane protein. This is a 100 amino acid region of a family of proteins conserved from nematodes to humans. It is predicted to be a transmembrane region but its function is not known.
TMEM91
rs.TMEM91.F1 rs.TMEM91.R1 222 376
NCBIGene 36.3 641649
Alternative 3-prime, size difference: 154
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001098821
Changed!
pfam CD225 49aa 1e-08
in ref transcript
Interferon-induced transmembrane protein. This family includes the human leukocyte antigen CD225, which is an interferon inducible transmembrane protein, and is associated with interferon induced cell growth suppression.
Changed!
pfam CD225 29aa 0.003
in modified transcript
TMPO
rs.TMPO.F1 rs.TMPO.R1 150 477
NCBIGene 36.3 7112
Multiple exon skipping, size difference: 327
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_001032283
pfam Thymopoietin 49aa 4e-19
in ref transcript
Thymopoietin protein. Short protein of 49 amino acid isolated from bovine spleen cells. Thymopoietins (TMPOs) are a group of ubiquitously expressed nuclear proteins. They are suggested to play an important role in nuclear envelope organisation and cell cycle control.
pfam LEM 28aa 1e-04
in ref transcript
LEM domain. The LEM domain is 50 residues long and is composed of two parallel alpha helices. This domain is found in inner nuclear membrane proteins. It is called the LEM domain after LAP2, Emerin and Man1.
TMPRSS4
rs.TMPRSS4.F1 rs.TMPRSS4.R1 104 119
NCBIGene 36.3 56649
Alternative 3-prime, size difference: 15
Exclusion in the protein (no frameshift)
Reference transcript: NM_019894
cd Tryp_SPc 228aa 2e-68
in ref transcript
Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.
cd LDLa 35aa 0.002
in ref transcript
Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure.
smart Tryp_SPc 226aa 2e-77
in ref transcript
Trypsin-like serine protease. Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.
Changed!
smart SR 91aa 9e-07
in ref transcript
Scavenger receptor Cys-rich. The sea ucrhin egg peptide speract contains 4 repeats of SR domains that contain 6 conserved cysteines. May bind bacterial antigens in the protein MARCO.
pfam Ldl_recept_a 22aa 0.005
in ref transcript
Low-density lipoprotein receptor domain class A.
COG COG5640 228aa 8e-26
in ref transcript
Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones].
Changed!
smart SR 86aa 4e-07
in modified transcript
TMTC4
rs.TMTC4.F1 rs.TMTC4.R1 143 353
NCBIGene 36.3 84899
Single exon skipping, size difference: 210
Exclusion of the protein initiation site
Reference transcript: NM_032813
cd TPR 100aa 3e-12
in ref transcript
Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
cd TPR 99aa 4e-11
in ref transcript
cd TPR 99aa 2e-07
in ref transcript
cd TPR 85aa 1e-06
in ref transcript
pfam DUF1736 80aa 5e-33
in ref transcript
Domain of unknown function (DUF1736). This domain of unknown function is found in various hypothetical metazoan proteins.
TIGR PEP_TPR_lipo 231aa 1e-12
in ref transcript
This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
COG TadD 202aa 5e-08
in ref transcript
Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking and secretion].
TMUB2
rs.TMUB2.F1 rs.TMUB2.R1 133 220
NCBIGene 36.3 79089
Alternative 3-prime, size difference: 87
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001076674
Changed!
cd UBL 63aa 9e-06
in ref transcript
UBLs function by remodeling the surface of their target proteins, changing their target's half-life, enzymatic activity, protein-protein interactions, subcellular localization or other properties. At least 10 different ubiquitin-like modifications exist in mammals, and attachment of different ubls to a target leads to different biological consequences. Ubl-conjugation cascades are initiated by activating enzymes, which also coordinate the ubls with their downstream pathways.
Changed!
pfam ubiquitin 51aa 6e-08
in ref transcript
Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1, and Parkin. A number of them are thought to carry a distinctive five-residue motif termed the proteasome-interacting motif (PIM), which may have a biologically significant role in protein delivery to proteasomes and recruitment of proteasomes to transcription sites.
TMUB2
rs.TMUB2.F2 rs.TMUB2.R1 186 254
NCBIGene 36.3 79089
Single exon skipping, size difference: 68
Exclusion of the protein initiation site
Reference transcript: NM_001076674
cd UBL 63aa 9e-06
in ref transcript
UBLs function by remodeling the surface of their target proteins, changing their target's half-life, enzymatic activity, protein-protein interactions, subcellular localization or other properties. At least 10 different ubiquitin-like modifications exist in mammals, and attachment of different ubls to a target leads to different biological consequences. Ubl-conjugation cascades are initiated by activating enzymes, which also coordinate the ubls with their downstream pathways.
pfam ubiquitin 51aa 6e-08
in ref transcript
Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1, and Parkin. A number of them are thought to carry a distinctive five-residue motif termed the proteasome-interacting motif (PIM), which may have a biologically significant role in protein delivery to proteasomes and recruitment of proteasomes to transcription sites.
TNRC15
rs.TNRC15.F1 rs.TNRC15.R1 116 186
NCBIGene 36.3 26058
Single exon skipping, size difference: 70
Inclusion in 5'UTR
Reference transcript: NM_001103147
cd GYF 56aa 5e-16
in ref transcript
GYF domain: contains conserved Gly-Tyr-Phe residues; Proline-binding domain in CD2-binding and other proteins. Involved in signaling lymphocyte activity. Also present in other unrelated proteins (mainly unknown) derived from diverse eukaryotic species.
pfam GYF 56aa 4e-18
in ref transcript
GYF domain. The GYF domain is named because of the presence of Gly-Tyr-Phe residues. The GYF domain is a proline-binding domain in CD2-binding protein.
TNRC15
rs.TNRC15.F2 rs.TNRC15.R2 100 118
NCBIGene 36.3 26058
Alternative 3-prime, size difference: 18
Exclusion in the protein (no frameshift)
Reference transcript: NM_001103147
cd GYF 56aa 5e-16
in ref transcript
GYF domain: contains conserved Gly-Tyr-Phe residues; Proline-binding domain in CD2-binding and other proteins. Involved in signaling lymphocyte activity. Also present in other unrelated proteins (mainly unknown) derived from diverse eukaryotic species.
pfam GYF 56aa 4e-18
in ref transcript
GYF domain. The GYF domain is named because of the presence of Gly-Tyr-Phe residues. The GYF domain is a proline-binding domain in CD2-binding protein.
TNRC15
rs.TNRC15.F3 rs.TNRC15.R3 158 224
NCBIGene 36.3 26058
Single exon skipping, size difference: 66
Exclusion in the protein (no frameshift)
Reference transcript: NM_001103147
cd GYF 56aa 5e-16
in ref transcript
GYF domain: contains conserved Gly-Tyr-Phe residues; Proline-binding domain in CD2-binding and other proteins. Involved in signaling lymphocyte activity. Also present in other unrelated proteins (mainly unknown) derived from diverse eukaryotic species.
pfam GYF 56aa 4e-18
in ref transcript
GYF domain. The GYF domain is named because of the presence of Gly-Tyr-Phe residues. The GYF domain is a proline-binding domain in CD2-binding protein.
TNRC15
rs.TNRC15.F4 rs.TNRC15.R4 106 176
NCBIGene 36.3 26058
Single exon skipping, size difference: 70
Exclusion in 5'UTR
Reference transcript: NM_001103147
cd GYF 56aa 5e-16
in ref transcript
GYF domain: contains conserved Gly-Tyr-Phe residues; Proline-binding domain in CD2-binding and other proteins. Involved in signaling lymphocyte activity. Also present in other unrelated proteins (mainly unknown) derived from diverse eukaryotic species.
pfam GYF 56aa 4e-18
in ref transcript
GYF domain. The GYF domain is named because of the presence of Gly-Tyr-Phe residues. The GYF domain is a proline-binding domain in CD2-binding protein.
TOM1L2
rs.TOM1L2.F1 rs.TOM1L2.R1 159 309
NCBIGene 36.3 146691
Single exon skipping, size difference: 150
Exclusion in the protein (no frameshift)
Reference transcript: NM_001082968
Changed!
cd VHS_Tom1 141aa 2e-74
in ref transcript
VHS domain family, Tom1 subfamily; The VHS domain is an essential part of Tom1 (Target of myb1 - retroviral oncogene) protein. The VHS domain has a superhelical structure similar to the structure of the ARM repeats and is present at the very N-termini of proteins. It is a right-handed superhelix of eight alpha helices. The VHS domain has been found in a number of proteins, some of which have been implicated in intracellular trafficking and sorting. The VHS domain of the Tom1 protein is essential for the negative regulation of Interleukin-1 and Tumor Necrosis Factor-induced signaling pathways.
Changed!
smart VHS 136aa 2e-46
in ref transcript
Domain present in VPS-27, Hrs and STAM. Unpublished observations. Domain of unknown function.
pfam GAT 93aa 5e-21
in ref transcript
GAT domain. The GAT domain is responsible for binding of GGA proteins to several members of the ARF family including ARF1 and ARF3. The GAT domain stabilises membrane bound ARF1 in its GTP bound state, by interfering with GAP proteins.
Changed!
cd VHS_Tom1 91aa 2e-39
in modified transcript
Changed!
smart VHS 86aa 1e-21
in modified transcript
TOX2
rs.TOX2.F1 rs.TOX2.R1 272 353
NCBIGene 36.3 84969
Single exon skipping, size difference: 81
Exclusion in the protein (no frameshift)
Reference transcript: NM_001098797
Changed!
cd HMG-box 48aa 9e-10
in ref transcript
High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
Changed!
pfam HMG_box 49aa 4e-07
in ref transcript
HMG (high mobility group) box.
Changed!
COG NHP6B 48aa 3e-04
in ref transcript
Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics].
Changed!
cd HMG-box 43aa 1e-08
in modified transcript
Changed!
pfam HMG_box 44aa 3e-06
in modified transcript
Changed!
COG NHP6B 43aa 0.002
in modified transcript
TOX2
rs.TOX2.F2 rs.TOX2.R2 103 228
NCBIGene 36.3 84969
Single exon skipping, size difference: 125
Exclusion in 5'UTR
Reference transcript: NM_032883
cd HMG-box 48aa 1e-09
in ref transcript
High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
pfam HMG_box 49aa 4e-07
in ref transcript
HMG (high mobility group) box.
COG NHP6B 48aa 5e-04
in ref transcript
Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics].
TP53BP2
rs.TP53BP2.F1 rs.TP53BP2.R1 304 436
NCBIGene 36.3 7159
Single exon skipping, size difference: 132
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001031685
Changed!
cd ANK 118aa 4e-20
in ref transcript
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
Changed!
cd SH3 51aa 2e-09
in ref transcript
Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
Changed!
smart SH3 49aa 2e-10
in ref transcript
Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
Changed!
pfam Ank 32aa 2e-06
in ref transcript
Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
Changed!
pfam Ank 32aa 2e-05
in ref transcript
Changed!
pfam SMC_N 146aa 2e-05
in ref transcript
RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
Changed!
COG Arp 95aa 6e-05
in ref transcript
FOG: Ankyrin repeat [General function prediction only].
Changed!
COG SbcC 144aa 1e-04
in ref transcript
ATPase involved in DNA repair [DNA replication, recombination, and repair].
TP53I11
rs.TP53I11.F1 rs.TP53I11.R1 250 380
NCBIGene 36.3 9537
Single exon skipping, size difference: 130
Exclusion in 5'UTR
Reference transcript: NM_001076787
TPD52
rs.TPD52.F1 rs.TPD52.R1 101 170
NCBIGene 36.3 7163
Multiple exon skipping, size difference: 69
Inclusion in the protein (no stop codon or frameshift), Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_001025252
Changed!
pfam TPD52 159aa 1e-39
in ref transcript
Tumour protein D52 family. The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction and cell proliferation. Two human homologues of hD52, hD53 and hD54, have also been identified, demonstrating the existence of a novel gene/protein family. These proteins have an amino terminal coiled-coil that allows members to form homo- and heterodimers with each other.
Changed!
pfam TPD52 182aa 7e-37
in modified transcript
TPD52L2
rs.TPD52L2.F1 rs.TPD52L2.R1 105 165
NCBIGene 36.3 7165
Single exon skipping, size difference: 60
Exclusion in the protein (no frameshift)
Reference transcript: NM_199360
Changed!
pfam TPD52 163aa 4e-26
in ref transcript
Tumour protein D52 family. The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction and cell proliferation. Two human homologues of hD52, hD53 and hD54, have also been identified, demonstrating the existence of a novel gene/protein family. These proteins have an amino terminal coiled-coil that allows members to form homo- and heterodimers with each other.
Changed!
pfam TPD52 143aa 2e-29
in modified transcript
TPK1
rs.TPK1.F1 rs.TPK1.R1 312 459
NCBIGene 36.3 27010
Single exon skipping, size difference: 147
Exclusion in the protein (no frameshift)
Reference transcript: NM_022445
Changed!
TIGR thi_PPkinase 214aa 8e-40
in ref transcript
This model has been revised. Originally, it described strictly eukaryotic thiamine pyrophosphokinase. However, it is now expanded to include also homologous enzymes, apparently functionally equivalent, from species that rely on thiamine pyrophosphokinase rather than thiamine-monophosphate kinase (TIGR01379) to produce the active TPP cofactor. This includes the thiamine pyrophosphokinase from Bacillus subtilis, previously designated YloS.
Changed!
COG THI80 222aa 2e-27
in ref transcript
Thiamine pyrophosphokinase [Coenzyme metabolism].
Changed!
pfam TPK_catalytic 90aa 2e-27
in modified transcript
Thiamin pyrophosphokinase, catalytic domain. Family of thiamin pyrophosphokinase (EC:2.7.6.2). Thiamin pyrophosphokinase (TPK) catalyses the transfer of a pyrophosphate group from ATP to vitamin B1 (thiamin) to form the coenzyme thiamin pyrophosphate (TPP). Thus, TPK is important for the formation of a coenzyme required for central metabolic functions. The structure of thiamin pyrophosphokinase suggest that the enzyme may operate by a mechanism of pyrophosphoryl transfer similar to those described for pyrophosphokinases functioning in nucleotide biosynthesis.
Changed!
pfam TPK_B1_binding 70aa 6e-21
in modified transcript
Thiamin pyrophosphokinase, vitamin B1 binding domain. Family of thiamin pyrophosphokinase (EC:2.7.6.2). Thiamin pyrophosphokinase (TPK) catalyses the transfer of a pyrophosphate group from ATP to vitamin B1 (thiamin) to form the coenzyme thiamin pyrophosphate (TPP). Thus, TPK is important for the formation of a coenzyme required for central metabolic functions. The structure of thiamin pyrophosphokinase suggest that the enzyme may operate by a mechanism of pyrophosphoryl transfer similar to those described for pyrophosphokinases functioning in nucleotide biosynthesis.
Changed!
COG THI80 173aa 4e-16
in modified transcript
TPTE2
rs.TPTE2.F1 rs.TPTE2.R1 171 291
NCBIGene 36.3 93492
Single exon skipping, size difference: 120
Exclusion in the protein (no frameshift)
Reference transcript: NM_199254
cd PTPc 89aa 8e-05
in ref transcript
Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
pfam PTEN_C2 131aa 4e-30
in ref transcript
C2 domain of PTEN tumour-suppressor protein. This is the C2 domain-like domain, in greek key form, of the PTEN protein, phosphatidyl-inositol triphosphate phosphatase, and it is the C-terminus. This domain may well include a CBR3 loop which means it plays a central role in membrane binding. This domain associates across an extensive interface with the N-terminal phosphatase domain DSPc (pfam00782) suggesting that the C2 domain productively positions the catalytic part of the protein on the membrane 1].
smart PTPc_motif 59aa 2e-04
in ref transcript
Protein tyrosine phosphatase, catalytic domain motif.
Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
Reference transcript: NM_199254
cd PTPc 89aa 8e-05
in ref transcript
Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
pfam PTEN_C2 131aa 4e-30
in ref transcript
C2 domain of PTEN tumour-suppressor protein. This is the C2 domain-like domain, in greek key form, of the PTEN protein, phosphatidyl-inositol triphosphate phosphatase, and it is the C-terminus. This domain may well include a CBR3 loop which means it plays a central role in membrane binding. This domain associates across an extensive interface with the N-terminal phosphatase domain DSPc (pfam00782) suggesting that the C2 domain productively positions the catalytic part of the protein on the membrane 1].
smart PTPc_motif 59aa 2e-04
in ref transcript
Protein tyrosine phosphatase, catalytic domain motif.
Tumor Necrosis Factor Receptor (TNFR)-Associated Factor (TRAF) family, TRAF3 subfamily, TRAF domain; TRAF molecules serve as adapter proteins that link TNFRs and downstream kinase cascades resulting in the activation of transcription factors and the regulation of cell survival, proliferation and stress responses. TRAF3 was first described as a molecule that binds the cytoplasmic tail of CD40. However, it is not required for CD40 signaling. More recently, TRAF3 has been identified as a key regulator of type I interferon (IFN) production and the mammalian innate antiviral immunity. It mediates IFN responses in Toll-like receptor (TLR)-dependent as well as TLR-independent viral recognition pathways. It is also a key element in immunological homeostasis through its regulation of the anti-inflammatory cytokine interleukin-10. TRAF3 contains a RING finger domain, five zinc finger domains, and a TRAF domain. The TRAF domain can be divided into a more divergent N-terminal alpha helical region (TRAF-N), and a highly conserved C-terminal MATH subdomain (TRAF-C) with an eight-stranded beta-sandwich structure. TRAF-N mediates trimerization while TRAF-C interacts with receptors.
pfam MATH 126aa 4e-18
in ref transcript
MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode Caenorhabditis elegans.
pfam zf-TRAF 57aa 1e-11
in ref transcript
TRAF-type zinc finger.
pfam Cast 176aa 2e-05
in ref transcript
RIM-binding protein of the cytomatrix active zone. This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
smart RING 36aa 0.002
in ref transcript
Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
TIGR rad18 203aa 0.005
in ref transcript
This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
COG Smc 161aa 5e-07
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
COG COG5222 59aa 0.005
in ref transcript
Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only].
TRAF3
rs.TRAF3.F2 rs.TRAF3.R2 252 327
NCBIGene 36.3 7187
Single exon skipping, size difference: 75
Exclusion in the protein (no frameshift)
Reference transcript: NM_145725
cd MATH_TRAF3 186aa 1e-106
in ref transcript
Tumor Necrosis Factor Receptor (TNFR)-Associated Factor (TRAF) family, TRAF3 subfamily, TRAF domain; TRAF molecules serve as adapter proteins that link TNFRs and downstream kinase cascades resulting in the activation of transcription factors and the regulation of cell survival, proliferation and stress responses. TRAF3 was first described as a molecule that binds the cytoplasmic tail of CD40. However, it is not required for CD40 signaling. More recently, TRAF3 has been identified as a key regulator of type I interferon (IFN) production and the mammalian innate antiviral immunity. It mediates IFN responses in Toll-like receptor (TLR)-dependent as well as TLR-independent viral recognition pathways. It is also a key element in immunological homeostasis through its regulation of the anti-inflammatory cytokine interleukin-10. TRAF3 contains a RING finger domain, five zinc finger domains, and a TRAF domain. The TRAF domain can be divided into a more divergent N-terminal alpha helical region (TRAF-N), and a highly conserved C-terminal MATH subdomain (TRAF-C) with an eight-stranded beta-sandwich structure. TRAF-N mediates trimerization while TRAF-C interacts with receptors.
pfam MATH 126aa 4e-18
in ref transcript
MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode Caenorhabditis elegans.
pfam zf-TRAF 57aa 1e-11
in ref transcript
TRAF-type zinc finger.
Changed!
pfam Cast 176aa 2e-05
in ref transcript
RIM-binding protein of the cytomatrix active zone. This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
smart RING 36aa 0.002
in ref transcript
Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
Changed!
TIGR rad18 203aa 0.005
in ref transcript
This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
COG Smc 161aa 5e-07
in ref transcript
Chromosome segregation ATPases [Cell division and chromosome partitioning].
COG COG5222 59aa 0.005
in ref transcript
Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only].
Changed!
pfam Cast 155aa 6e-05
in modified transcript
TRAPPC5
rs.TRAPPC5.F1 rs.TRAPPC5.R1 123 146
NCBIGene 36.3 126003
Alternative 5-prime, size difference: 23
Exclusion in 5'UTR
Reference transcript: NM_174894
pfam TRAPP 139aa 5e-40
in ref transcript
Transport protein particle (TRAPP) component. TRAPP plays a key role in the targeting and/or fusion of ER-to-Golgi transport vesicles with their acceptor compartment. TRAPP is a large multimeric protein that contains at least 10 subunits. This family contains many TRAPP family proteins. The Bet3 subunit is one of the better characterised TRAPP proteins and has a dimeric structure with hydrophobic channels. The channel entrances are located on a putative membrane-interacting surface that is distinctively flat, wide and decorated with positively charged residues. Bet3 is proposed to localise TRAPP to the Golgi.
COG COG5128 148aa 5e-29
in ref transcript
Transport protein particle (TRAPP) complex subunit [Intracellular trafficking and secretion].
TRAPPC6B
rs.TRAPPC6B.F1 rs.TRAPPC6B.R1 142 226
NCBIGene 36.3 122553
Single exon skipping, size difference: 84
Exclusion in the protein (no frameshift)
Reference transcript: NM_001079537
Changed!
pfam TRAPP 153aa 4e-48
in ref transcript
Transport protein particle (TRAPP) component. TRAPP plays a key role in the targeting and/or fusion of ER-to-Golgi transport vesicles with their acceptor compartment. TRAPP is a large multimeric protein that contains at least 10 subunits. This family contains many TRAPP family proteins. The Bet3 subunit is one of the better characterised TRAPP proteins and has a dimeric structure with hydrophobic channels. The channel entrances are located on a putative membrane-interacting surface that is distinctively flat, wide and decorated with positively charged residues. Bet3 is proposed to localise TRAPP to the Golgi.
Changed!
COG COG5128 141aa 1e-05
in ref transcript
Transport protein particle (TRAPP) complex subunit [Intracellular trafficking and secretion].
Changed!
pfam TRAPP 125aa 1e-34
in modified transcript
Changed!
COG COG5128 113aa 0.001
in modified transcript
TRDMT1
rs.TRDMT1.F1 rs.TRDMT1.R1 169 241
NCBIGene 36.3 1787
Single exon skipping, size difference: 72
Exclusion in the protein (no frameshift)
Reference transcript: NM_004412
Changed!
cd Cyt_C5_DNA_methylase 176aa 4e-46
in ref transcript
Cytosine-C5 specific DNA methylases; Methyl transfer reactions play an important role in many aspects of biology. Cytosine-specific DNA methylases are found both in prokaryotes and eukaryotes. DNA methylation, or the covalent addition of a methyl group to cytosine within the context of the CpG dinucleotide, has profound effects on the mammalian genome. These effects include transcriptional repression via inhibition of transcription factor binding or the recruitment of methyl-binding proteins and their associated chromatin remodeling factors, X chromosome inactivation, imprinting and the suppression of parasitic DNA sequences. DNA methylation is also essential for proper embryonic development and is an important player in both DNA repair and genome stability.
cd Cyt_C5_DNA_methylase 132aa 4e-14
in ref transcript
Changed!
TIGR dcm 382aa 1e-21
in ref transcript
All proteins in this family for which functions are known are DNA-cytosine methyltransferases. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Changed!
COG Dcm 167aa 2e-33
in ref transcript
Site-specific DNA methylase [DNA replication, recombination, and repair].
PRK PRK10458 47aa 0.002
in ref transcript
DNA cytosine methylase; Provisional.
Changed!
cd Cyt_C5_DNA_methylase 152aa 4e-36
in modified transcript
Changed!
TIGR dcm 358aa 5e-17
in modified transcript
Changed!
COG Dcm 143aa 2e-26
in modified transcript
TRO
rs.TRO.F1 rs.TRO.R1 103 118
NCBIGene 36.3 7216
Alternative 3-prime, size difference: 15
Exclusion in 5'UTR
Reference transcript: NM_001039705
pfam MAGE 170aa 9e-79
in ref transcript
MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.
TRSPAP1
rs.TRSPAP1.F1 rs.TRSPAP1.R1 298 446
NCBIGene 36.3 54952
Single exon skipping, size difference: 148
Inclusion in the protein causing a new stop codon
Reference transcript: NM_017846
Changed!
cd RRM 75aa 1e-09
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
cd RRM 69aa 3e-09
in ref transcript
Changed!
TIGR PABP-1234 182aa 1e-18
in ref transcript
There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, and of unknown tissue range.
Changed!
COG COG0724 97aa 2e-08
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
Changed!
COG COG0724 88aa 6e-07
in ref transcript
Changed!
smart RRM 66aa 1e-10
in modified transcript
RNA recognition motif.
Changed!
COG COG0724 70aa 3e-08
in modified transcript
TSC2
rs.TSC2.F1 rs.TSC2.R1 210 279
NCBIGene 36.3 7249
Single exon skipping, size difference: 69
Exclusion in the protein (no frameshift)
Reference transcript: NM_000548
pfam Rap_GAP 184aa 2e-82
in ref transcript
Rap/ran-GAP.
pfam Tuberin 347aa 8e-53
in ref transcript
Tuberin. Tuberous sclerosis complex (TSC) is an autosomal dominant disorder and is characterised by the presence of hamartomas in many organs, such as brain, skin, heart, lung, and kidney. It is caused by mutation either TSC1 or TSC2 tumour suppressor gene. The TSC2 gene codes for tuberin and interacts with hamartin pfam04388, containing two coiled-coil regions, which have been shown to mediate binding to tuberin. These two proteins function within the same pathway(s) regulating cell cycle, cell growth, adhesion, and vesicular trafficking.
TSPAN4
rs.TSPAN4.F1 rs.TSPAN4.R1 163 243
NCBIGene 36.3 7106
Single exon skipping, size difference: 80
Exclusion of the protein initiation site
Reference transcript: NM_003271
cd NET-5_like_LEL 98aa 1e-40
in ref transcript
Tetraspanin, extracellular domain or large extracellular loop (LEL), NET-5_like family. Tetraspanins are trans-membrane proteins with 4 trans-membrane segments. Both the N- and C-termini lie on the intracellular side of the membrane. This alignment model spans the extracellular domain between the 3rd and 4th trans-membrane segment. Tetraspanins are involved in diverse processes and their various functions may relate to their ability to act as molecular facilitators. Tetraspanins associate laterally with one another and cluster dynamically with numerous parnter domains in membrane microdomains, forming a network of multimolecular complexes, the "tetraspanin web". This sub-family contains proteins similar to human tetraspan NET-5.
Changed!
pfam Tetraspannin 222aa 3e-35
in ref transcript
Tetraspanin family.
Changed!
pfam Tetraspannin 166aa 3e-24
in modified transcript
TTC23
rs.TTC23.F1 rs.TTC23.R1 229 351
NCBIGene 36.3 64927
Single exon skipping, size difference: 122
Exclusion in 5'UTR
Reference transcript: NM_001040655
TTLL3
rs.TTLL3.F1 rs.TTLL3.R1 360 486
NCBIGene 36.3 26140
Single exon skipping, size difference: 129
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_001025930
pfam TTL 292aa 7e-90
in ref transcript
Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This modification cycle involves a specific carboxypeptidase and the activity of the tubulin-tyrosine ligase (TTL). The true physiological function of TTL has so far not been established. Tubulin-tyrosine ligase (TTL) catalyses the ATP-dependent post-translational addition of a tyrosine to the carboxy terminal end of detyrosinated alpha-tubulin. In normally cycling cells, the tyrosinated form of tubulin predominates. However, in breast cancer cells, the detyrosinated form frequently predominates, with a correlation to tumour aggressiveness. On the other hand, 3-nitrotyrosine has been shown to be incorporated, by TTL, into the carboxy terminal end of detyrosinated alpha-tubulin. This reaction is not reversible by the carboxypeptidase enzyme. Cells cultured in 3-nitrotyrosine rich medium showed evidence of altered microtubule structure and function, including altered cell morphology, epithelial barrier dysfunction, and apoptosis.
TTLL3
rs.TTLL3.F2 rs.TTLL3.R2 169 298
NCBIGene 36.3 26140
Single exon skipping, size difference: 129
Inclusion in the protein (no stop codon or frameshift)
Reference transcript: NM_001025930
pfam TTL 292aa 7e-90
in ref transcript
Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This modification cycle involves a specific carboxypeptidase and the activity of the tubulin-tyrosine ligase (TTL). The true physiological function of TTL has so far not been established. Tubulin-tyrosine ligase (TTL) catalyses the ATP-dependent post-translational addition of a tyrosine to the carboxy terminal end of detyrosinated alpha-tubulin. In normally cycling cells, the tyrosinated form of tubulin predominates. However, in breast cancer cells, the detyrosinated form frequently predominates, with a correlation to tumour aggressiveness. On the other hand, 3-nitrotyrosine has been shown to be incorporated, by TTL, into the carboxy terminal end of detyrosinated alpha-tubulin. This reaction is not reversible by the carboxypeptidase enzyme. Cells cultured in 3-nitrotyrosine rich medium showed evidence of altered microtubule structure and function, including altered cell morphology, epithelial barrier dysfunction, and apoptosis.
TTN
rs.TTN.F1 rs.TTN.R1 163 301
NCBIGene 36.3 7273
Single exon skipping, size difference: 138
Exclusion in 5'UTR
Reference transcript: NM_133378
cd S_TKc 256aa 3e-62
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
cd FN3 91aa 3e-16
in ref transcript
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
cd FN3 93aa 1e-15
in ref transcript
cd FN3 87aa 3e-15
in ref transcript
cd FN3 93aa 4e-15
in ref transcript
cd FN3 94aa 7e-15
in ref transcript
cd FN3 89aa 1e-14
in ref transcript
cd FN3 94aa 2e-14
in ref transcript
cd FN3 93aa 3e-14
in ref transcript
cd FN3 90aa 4e-14
in ref transcript
cd FN3 90aa 5e-14
in ref transcript
cd FN3 89aa 6e-14
in ref transcript
cd FN3 90aa 6e-14
in ref transcript
cd FN3 93aa 7e-14
in ref transcript
cd FN3 94aa 1e-13
in ref transcript
cd FN3 94aa 1e-13
in ref transcript
cd FN3 94aa 1e-13
in ref transcript
cd FN3 90aa 2e-13
in ref transcript
cd FN3 86aa 2e-13
in ref transcript
cd FN3 89aa 2e-13
in ref transcript
cd FN3 92aa 2e-13
in ref transcript
cd FN3 92aa 3e-13
in ref transcript
cd FN3 93aa 3e-13
in ref transcript
cd FN3 91aa 4e-13
in ref transcript
cd FN3 94aa 4e-13
in ref transcript
cd FN3 91aa 4e-13
in ref transcript
cd FN3 91aa 4e-13
in ref transcript
cd FN3 94aa 5e-13
in ref transcript
cd FN3 92aa 6e-13
in ref transcript
cd FN3 88aa 6e-13
in ref transcript
cd FN3 94aa 1e-12
in ref transcript
cd FN3 92aa 1e-12
in ref transcript
cd FN3 93aa 1e-12
in ref transcript
cd FN3 88aa 2e-12
in ref transcript
cd FN3 86aa 2e-12
in ref transcript
cd FN3 90aa 2e-12
in ref transcript
cd FN3 84aa 2e-12
in ref transcript
cd FN3 84aa 2e-12
in ref transcript
cd FN3 84aa 2e-12
in ref transcript
cd FN3 90aa 3e-12
in ref transcript
cd FN3 90aa 4e-12
in ref transcript
cd FN3 94aa 4e-12
in ref transcript
cd FN3 84aa 4e-12
in ref transcript
cd FN3 84aa 5e-12
in ref transcript
cd FN3 94aa 5e-12
in ref transcript
cd FN3 84aa 5e-12
in ref transcript
cd FN3 92aa 6e-12
in ref transcript
cd FN3 93aa 6e-12
in ref transcript
cd FN3 90aa 6e-12
in ref transcript
cd FN3 94aa 6e-12
in ref transcript
cd FN3 90aa 7e-12
in ref transcript
cd FN3 92aa 8e-12
in ref transcript
cd FN3 79aa 8e-12
in ref transcript
cd FN3 94aa 1e-11
in ref transcript
cd FN3 92aa 1e-11
in ref transcript
cd FN3 94aa 1e-11
in ref transcript
cd FN3 94aa 2e-11
in ref transcript
cd FN3 93aa 2e-11
in ref transcript
cd FN3 91aa 2e-11
in ref transcript
cd FN3 92aa 3e-11
in ref transcript
cd FN3 93aa 3e-11
in ref transcript
cd FN3 92aa 3e-11
in ref transcript
cd IGcam 87aa 3e-11
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd FN3 82aa 4e-11
in ref transcript
cd FN3 89aa 4e-11
in ref transcript
cd FN3 99aa 5e-11
in ref transcript
cd FN3 89aa 5e-11
in ref transcript
cd FN3 88aa 5e-11
in ref transcript
cd FN3 95aa 5e-11
in ref transcript
cd FN3 82aa 6e-11
in ref transcript
cd FN3 93aa 6e-11
in ref transcript
cd FN3 92aa 7e-11
in ref transcript
cd FN3 89aa 7e-11
in ref transcript
cd FN3 88aa 8e-11
in ref transcript
cd FN3 94aa 1e-10
in ref transcript
cd FN3 94aa 1e-10
in ref transcript
cd FN3 93aa 1e-10
in ref transcript
cd FN3 92aa 2e-10
in ref transcript
cd FN3 89aa 3e-10
in ref transcript
cd FN3 85aa 3e-10
in ref transcript
cd FN3 94aa 4e-10
in ref transcript
cd FN3 80aa 4e-10
in ref transcript
cd FN3 91aa 4e-10
in ref transcript
cd FN3 94aa 4e-10
in ref transcript
cd IGcam 89aa 4e-10
in ref transcript
cd FN3 95aa 5e-10
in ref transcript
cd FN3 94aa 5e-10
in ref transcript
cd FN3 92aa 5e-10
in ref transcript
cd FN3 82aa 6e-10
in ref transcript
cd FN3 94aa 1e-09
in ref transcript
cd FN3 89aa 1e-09
in ref transcript
cd IGcam 86aa 1e-09
in ref transcript
cd FN3 91aa 2e-09
in ref transcript
cd FN3 94aa 2e-09
in ref transcript
cd FN3 92aa 2e-09
in ref transcript
cd FN3 97aa 2e-09
in ref transcript
cd FN3 94aa 2e-09
in ref transcript
cd IGcam 88aa 2e-09
in ref transcript
cd FN3 90aa 3e-09
in ref transcript
cd IGcam 77aa 5e-09
in ref transcript
cd FN3 86aa 1e-08
in ref transcript
cd FN3 94aa 1e-08
in ref transcript
cd FN3 92aa 1e-08
in ref transcript
cd IGcam 87aa 2e-08
in ref transcript
cd IGcam 87aa 6e-08
in ref transcript
cd FN3 92aa 8e-08
in ref transcript
cd FN3 94aa 8e-08
in ref transcript
cd FN3 84aa 8e-08
in ref transcript
cd FN3 87aa 1e-07
in ref transcript
cd IGcam 91aa 1e-07
in ref transcript
cd IGcam 77aa 1e-07
in ref transcript
cd FN3 84aa 3e-07
in ref transcript
cd IGcam 92aa 3e-07
in ref transcript
cd IGcam 82aa 3e-07
in ref transcript
cd IGcam 79aa 3e-07
in ref transcript
cd IGcam 88aa 4e-07
in ref transcript
cd IGcam 80aa 1e-06
in ref transcript
cd IGcam 88aa 1e-06
in ref transcript
cd FN3 94aa 2e-06
in ref transcript
cd IGcam 92aa 2e-06
in ref transcript
cd IGcam 86aa 2e-06
in ref transcript
cd IGcam 77aa 4e-06
in ref transcript
cd IGcam 88aa 9e-06
in ref transcript
cd IGcam 80aa 1e-05
in ref transcript
cd IGcam 93aa 1e-05
in ref transcript
cd IGcam 83aa 2e-05
in ref transcript
cd FN3 94aa 3e-05
in ref transcript
cd IGcam 92aa 4e-05
in ref transcript
cd IGcam 92aa 4e-05
in ref transcript
cd IGcam 79aa 4e-05
in ref transcript
cd IGcam 82aa 4e-05
in ref transcript
cd IGcam 78aa 5e-05
in ref transcript
cd IGcam 91aa 6e-05
in ref transcript
cd IGcam 83aa 6e-05
in ref transcript
cd IGcam 85aa 8e-05
in ref transcript
cd IGcam 86aa 9e-05
in ref transcript
cd IGcam 91aa 1e-04
in ref transcript
cd IG 72aa 1e-04
in ref transcript
Immunoglobulin domain family; members are components of immunoglobulins, neuroglia, cell surface glycoproteins, such as, T-cell receptors, CD2, CD4, CD8, and membrane glycoproteins, such as, butyrophilin and chondroitin sulfate proteoglycan core protein. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd FN3 90aa 2e-04
in ref transcript
cd IGcam 79aa 2e-04
in ref transcript
cd IGcam 81aa 2e-04
in ref transcript
cd IGcam 75aa 3e-04
in ref transcript
cd IGcam 78aa 4e-04
in ref transcript
cd IGcam 74aa 4e-04
in ref transcript
cd IGcam 82aa 6e-04
in ref transcript
cd IGcam 92aa 0.001
in ref transcript
cd IGcam 85aa 0.001
in ref transcript
cd IGcam 72aa 0.001
in ref transcript
cd IGcam 71aa 0.004
in ref transcript
cd IGcam 92aa 0.007
in ref transcript
cd IGcam 84aa 0.008
in ref transcript
cd IGcam 77aa 0.009
in ref transcript
smart S_TKc 244aa 2e-63
in ref transcript
Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
pfam I-set 89aa 2e-23
in ref transcript
Immunoglobulin I-set domain.
pfam I-set 88aa 4e-19
in ref transcript
pfam I-set 90aa 5e-19
in ref transcript
pfam I-set 90aa 1e-18
in ref transcript
pfam I-set 93aa 2e-18
in ref transcript
smart FN3 83aa 8e-17
in ref transcript
Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
pfam I-set 91aa 6e-16
in ref transcript
pfam I-set 92aa 6e-16
in ref transcript
smart FN3 84aa 1e-14
in ref transcript
smart FN3 83aa 1e-14
in ref transcript
pfam I-set 79aa 3e-14
in ref transcript
pfam I-set 81aa 3e-14
in ref transcript
smart FN3 86aa 3e-14
in ref transcript
pfam I-set 92aa 5e-14
in ref transcript
pfam I-set 81aa 8e-14
in ref transcript
smart FN3 83aa 2e-13
in ref transcript
smart FN3 84aa 4e-13
in ref transcript
pfam I-set 90aa 8e-13
in ref transcript
smart FN3 83aa 8e-13
in ref transcript
pfam I-set 76aa 1e-12
in ref transcript
smart FN3 80aa 1e-12
in ref transcript
smart FN3 84aa 1e-12
in ref transcript
smart FN3 81aa 1e-12
in ref transcript
smart FN3 83aa 1e-12
in ref transcript
smart FN3 85aa 1e-12
in ref transcript
smart FN3 83aa 2e-12
in ref transcript
pfam fn3 86aa 2e-12
in ref transcript
Fibronectin type III domain.
smart FN3 84aa 3e-12
in ref transcript
smart FN3 84aa 3e-12
in ref transcript
smart FN3 82aa 3e-12
in ref transcript
smart FN3 82aa 3e-12
in ref transcript
smart FN3 84aa 4e-12
in ref transcript
pfam I-set 87aa 5e-12
in ref transcript
smart FN3 84aa 5e-12
in ref transcript
smart FN3 82aa 6e-12
in ref transcript
smart FN3 82aa 6e-12
in ref transcript
pfam I-set 84aa 7e-12
in ref transcript
pfam I-set 81aa 7e-12
in ref transcript
smart FN3 83aa 7e-12
in ref transcript
pfam I-set 81aa 8e-12
in ref transcript
smart FN3 83aa 8e-12
in ref transcript
smart FN3 85aa 9e-12
in ref transcript
smart FN3 82aa 1e-11
in ref transcript
smart FN3 82aa 1e-11
in ref transcript
smart FN3 82aa 1e-11
in ref transcript
pfam I-set 93aa 2e-11
in ref transcript
smart FN3 83aa 2e-11
in ref transcript
smart FN3 90aa 2e-11
in ref transcript
smart FN3 81aa 2e-11
in ref transcript
smart FN3 84aa 3e-11
in ref transcript
smart FN3 83aa 3e-11
in ref transcript
smart FN3 85aa 3e-11
in ref transcript
smart FN3 84aa 3e-11
in ref transcript
smart FN3 84aa 3e-11
in ref transcript
smart FN3 84aa 3e-11
in ref transcript
smart FN3 84aa 3e-11
in ref transcript
pfam I-set 77aa 4e-11
in ref transcript
smart FN3 84aa 4e-11
in ref transcript
smart FN3 84aa 4e-11
in ref transcript
smart FN3 84aa 4e-11
in ref transcript
smart FN3 84aa 4e-11
in ref transcript
smart FN3 84aa 4e-11
in ref transcript
smart FN3 82aa 4e-11
in ref transcript
smart FN3 84aa 4e-11
in ref transcript
smart FN3 85aa 4e-11
in ref transcript
smart FN3 83aa 4e-11
in ref transcript
smart FN3 81aa 5e-11
in ref transcript
smart FN3 81aa 5e-11
in ref transcript
pfam I-set 81aa 6e-11
in ref transcript
smart FN3 84aa 6e-11
in ref transcript
smart FN3 82aa 6e-11
in ref transcript
smart FN3 84aa 6e-11
in ref transcript
pfam I-set 82aa 7e-11
in ref transcript
smart FN3 81aa 7e-11
in ref transcript
smart FN3 85aa 7e-11
in ref transcript
smart FN3 83aa 7e-11
in ref transcript
smart FN3 84aa 7e-11
in ref transcript
smart FN3 83aa 7e-11
in ref transcript
smart FN3 74aa 9e-11
in ref transcript
smart FN3 84aa 9e-11
in ref transcript
pfam I-set 84aa 1e-10
in ref transcript
pfam I-set 72aa 1e-10
in ref transcript
smart FN3 84aa 1e-10
in ref transcript
smart FN3 81aa 1e-10
in ref transcript
pfam I-set 90aa 2e-10
in ref transcript
smart FN3 84aa 2e-10
in ref transcript
smart FN3 83aa 2e-10
in ref transcript
smart FN3 84aa 2e-10
in ref transcript
smart FN3 84aa 2e-10
in ref transcript
pfam I-set 81aa 4e-10
in ref transcript
smart FN3 79aa 4e-10
in ref transcript
pfam I-set 81aa 5e-10
in ref transcript
smart FN3 84aa 5e-10
in ref transcript
smart FN3 84aa 5e-10
in ref transcript
smart FN3 84aa 6e-10
in ref transcript
smart FN3 83aa 6e-10
in ref transcript
smart FN3 83aa 6e-10
in ref transcript
pfam fn3 80aa 6e-10
in ref transcript
smart FN3 85aa 7e-10
in ref transcript
pfam fn3 86aa 9e-10
in ref transcript
smart FN3 82aa 1e-09
in ref transcript
smart FN3 84aa 1e-09
in ref transcript
smart FN3 85aa 1e-09
in ref transcript
smart FN3 77aa 1e-09
in ref transcript
pfam I-set 78aa 2e-09
in ref transcript
smart FN3 84aa 2e-09
in ref transcript
smart FN3 82aa 2e-09
in ref transcript
smart FN3 81aa 2e-09
in ref transcript
smart FN3 80aa 2e-09
in ref transcript
smart FN3 83aa 2e-09
in ref transcript
smart FN3 84aa 3e-09
in ref transcript
pfam I-set 90aa 4e-09
in ref transcript
smart FN3 85aa 4e-09
in ref transcript
pfam I-set 72aa 5e-09
in ref transcript
smart FN3 81aa 5e-09
in ref transcript
smart FN3 84aa 5e-09
in ref transcript
pfam fn3 87aa 5e-09
in ref transcript
pfam I-set 88aa 6e-09
in ref transcript
smart FN3 76aa 6e-09
in ref transcript
pfam I-set 70aa 7e-09
in ref transcript
smart FN3 84aa 7e-09
in ref transcript
pfam I-set 90aa 8e-09
in ref transcript
pfam I-set 82aa 9e-09
in ref transcript
pfam I-set 93aa 1e-08
in ref transcript
smart FN3 85aa 2e-08
in ref transcript
pfam I-set 81aa 4e-08
in ref transcript
smart FN3 84aa 5e-08
in ref transcript
pfam I-set 77aa 6e-08
in ref transcript
pfam I-set 81aa 7e-08
in ref transcript
smart FN3 72aa 8e-08
in ref transcript
pfam I-set 83aa 9e-08
in ref transcript
pfam I-set 79aa 9e-08
in ref transcript
smart FN3 84aa 9e-08
in ref transcript
smart FN3 82aa 9e-08
in ref transcript
pfam I-set 81aa 1e-07
in ref transcript
pfam I-set 78aa 2e-07
in ref transcript
smart FN3 84aa 2e-07
in ref transcript
smart FN3 70aa 2e-07
in ref transcript
smart FN3 84aa 2e-07
in ref transcript
pfam I-set 60aa 3e-07
in ref transcript
pfam I-set 83aa 3e-07
in ref transcript
smart FN3 80aa 3e-07
in ref transcript
pfam I-set 81aa 5e-07
in ref transcript
smart FN3 85aa 5e-07
in ref transcript
smart FN3 77aa 6e-07
in ref transcript
pfam I-set 81aa 1e-06
in ref transcript
pfam I-set 79aa 1e-06
in ref transcript
smart FN3 84aa 3e-06
in ref transcript
smart FN3 84aa 4e-06
in ref transcript
pfam I-set 81aa 1e-05
in ref transcript
pfam I-set 81aa 1e-05
in ref transcript
smart FN3 77aa 2e-05
in ref transcript
smart FN3 84aa 2e-05
in ref transcript
pfam I-set 73aa 4e-05
in ref transcript
pfam I-set 81aa 5e-05
in ref transcript
pfam I-set 85aa 5e-05
in ref transcript
pfam I-set 78aa 8e-05
in ref transcript
COG SPS1 286aa 2e-28
in ref transcript
Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains. Group I TRX is a small ancient protein that alter the redox state of target proteins via the reversible oxidation of an active site dithiol, present in a CXXC motif, partially exposed at the protein's surface. TRX reduces protein disulfide bonds, resulting in a disulfide bond at its active site. Oxidized TRX is converted to the active form by TRX reductase, using reducing equivalents derived from either NADPH or ferredoxins. By altering their redox state, TRX regulates the functions of at least 30 target proteins, some of which are enzymes and transcription factors. It also plays an important role in the defense against oxidative stress by directly reducing hydrogen peroxide and certain radicals, and by serving as a reductant for peroxiredoxins. At least two major types of functional TRXs have been reported in most organisms; in eukaryotes, they are located in the cytoplasm and the mitochondria. Higher plants contain more types (at least 20 TRX genes have been detected in the genome of Arabidopsis thaliana), two of which (types f amd m) are located in the same compartment, the chloroplast. Also included in the alignment are TRX-like domains which show sequence homology to TRX but do not contain the redox active CXXC motif. Group II proteins, in addition to either a redox active TRX or a TRX-like domain, also contain additional domains, which may or may not possess homology to known proteins.
pfam Thioredoxin 103aa 4e-18
in ref transcript
Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active centre disulfide bond. Some members with only the active site are not separated from the noise.
pfam Glutenin_hmw 360aa 9e-13
in ref transcript
High molecular weight glutenin subunit. Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, glutenin high molecular weight subunits are classified as elastomeric proteins, because the glutenin network can withstand significant deformations without breaking, and return to the original conformation when the stress is removed. Elastomeric proteins differ considerably in amino acid sequence, but they are all polymers whose subunits consist of elastomeric domains, composed of repeated motifs, and non-elastic domains that mediate cross-linking between the subunits. The elastomeric domain motifs are all rich in glycine residues in addition to other hydrophobic residues. High molecular weight glutenin subunits have an extensive central elastomeric domain, flanked by two terminal non-elastic domains that form disulphide cross-links. The central elastomeric domain is characterised by the following three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It possesses overlapping beta-turns within and between the repeated motifs, and assumes a regular helical secondary structure with a diameter of approx. 1.9 nm and a pitch of approx. 1.5 nm.
PTZ PTZ00051 99aa 2e-18
in ref transcript
thioredoxin; Provisional.
PRK PRK07764 177aa 0.002
in ref transcript
DNA polymerase III subunits gamma and tau; Validated.
U2AF2
rs.U2AF2.F1 rs.U2AF2.R1 104 116
NCBIGene 36.3 11338
Alternative 5-prime, size difference: 12
Exclusion in the protein (no frameshift)
Reference transcript: NM_007279
cd RRM 75aa 1e-12
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
cd RRM 79aa 4e-08
in ref transcript
cd RRM 58aa 2e-05
in ref transcript
Changed!
TIGR U2AF_lg 412aa 1e-160
in ref transcript
Members of this subfamily are found in plants, metazoa and fungi.
COG COG0724 82aa 2e-09
in ref transcript
RNA-binding proteins (RRM domain) [General function prediction only].
COG COG0724 189aa 8e-06
in ref transcript
Changed!
TIGR U2AF_lg 408aa 1e-161
in modified transcript
UBE2C
rs.UBE2C.F1 rs.UBE2C.R1 218 305
NCBIGene 36.3 11065
Single exon skipping, size difference: 87
Exclusion in the protein (no frameshift)
Reference transcript: NM_007019
Changed!
cd UBCc 138aa 6e-45
in ref transcript
Ubiquitin-conjugating enzyme E2, catalytic (UBCc) domain. This is part of the ubiquitin-mediated protein degradation pathway in which a thiol-ester linkage forms between a conserved cysteine and the C-terminus of ubiquitin and complexes with ubiquitin protein ligase enzymes, E3. This pathway regulates many fundamental cellular processes. There are also other E2s which form thiol-ester linkages without the use of E3s as well as several UBC homologs (TSG101, Mms2, Croc-1 and similar proteins) which lack the active site cysteine essential for ubiquitination and appear to function in DNA repair pathways which were omitted from the scope of this CD.
Changed!
pfam UQ_con 137aa 5e-53
in ref transcript
Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks this active site cysteine.
Changed!
COG COG5078 140aa 3e-43
in ref transcript
Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
Changed!
cd UBCc 98aa 1e-30
in modified transcript
Changed!
pfam UQ_con 98aa 3e-37
in modified transcript
Changed!
COG COG5078 100aa 3e-32
in modified transcript
UBE4B
rs.UBE4B.F1 rs.UBE4B.R1 102 489
NCBIGene 36.3 10277
Single exon skipping, size difference: 387
Exclusion in the protein (no frameshift)
Reference transcript: NM_001105562
pfam Ufd2P_core 623aa 0.0
in ref transcript
Ubiquitin elongating factor core. This is the most conserved part of the core region of Ufd2P ubiquitin elongating factor or E4, running from helix alpha-11 to alpha-38. It consists of 31 helices of variable length connected by loops of variable size forming a compact unit; the helical packing pattern of the compact unit consists of five structural repeats that resemble tandem Armadillo (ARM) repeats. This domain is involved in ubiquitination as it binds Cdc48p and escorts ubiquitinated proteins from Cdc48p to the proteasome for degradation. The core is structurally similar to the nuclear transporter protein importin-alpha. The core is associated with the U-box at the C-terminus, pfam04564, which has ligase activity.
pfam U-box 74aa 2e-27
in ref transcript
U-box domain. This domain is related to the Ring finger pfam00097 but lacks the zinc binding residues.
COG UFD2 804aa 1e-107
in ref transcript
Ubiquitin fusion degradation protein 2 [Posttranslational modification, protein turnover, chaperones].
UBL5
rs.UBL5.F1 rs.UBL5.R1 109 189
NCBIGene 36.3 59286
Alternative 5-prime, size difference: 80
Exclusion in 5'UTR
Reference transcript: NM_024292
cd Ubl5 73aa 3e-34
in ref transcript
UBL5 (also known as HUB1) is a ubiquitin-like modifier that is both widely expressed and highly phylogenetically conserved. At the C-terminal end of the ubiquitin-like fold of UBL5 is a di-tyrosine motif followed by a single variable residue instead of the characteristic di-glycine found in all other ubiquitin-like modifiers. ULB5 interacts with a cyclin-like kinase called CLK4 but not with other cyclin-like kinase family members.
pfam ubiquitin 61aa 0.004
in ref transcript
Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1, and Parkin. A number of them are thought to carry a distinctive five-residue motif termed the proteasome-interacting motif (PIM), which may have a biologically significant role in protein delivery to proteasomes and recruitment of proteasomes to transcription sites.
UBTF
rs.UBTF.F1 rs.UBTF.R1 326 437
NCBIGene 36.3 7343
Single exon skipping, size difference: 111
Exclusion in the protein (no frameshift)
Reference transcript: NM_014233
Changed!
cd HMGB-UBF_HMG-box 66aa 1e-06
in ref transcript
HMGB-UBF_HMG-box, class II and III members of the HMG-box superfamily of DNA-binding proteins. These proteins bind the minor groove of DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
cd HMGB-UBF_HMG-box 60aa 4e-05
in ref transcript
cd HMGB-UBF_HMG-box 63aa 2e-04
in ref transcript
cd HMG-box 44aa 2e-04
in ref transcript
High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
Changed!
smart HMG 68aa 9e-08
in ref transcript
high mobility group.
smart HMG 67aa 2e-06
in ref transcript
smart HMG 62aa 4e-06
in ref transcript
pfam HMG_box 44aa 4e-05
in ref transcript
HMG (high mobility group) box.
pfam HMG_box 65aa 2e-04
in ref transcript
smart HMG 65aa 3e-04
in ref transcript
COG NHP6B 103aa 0.003
in ref transcript
Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics].
UBXD5
rs.UBXD5.F1 rs.UBXD5.R1 133 493
NCBIGene 36.3 91544
Multiple exon skipping, size difference: 360
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_183008
pfam SEP 72aa 6e-19
in ref transcript
SEP domain. The SEP domain is named after Saccharomyces cerevisiae Shp1, Drosophila melanogaster eyes closed gene (eyc), and vertebrate p47. In p47, the SEP domain has been shown to bind to and inhibit the cysteine protease cathepsin L. Most SEP domains are succeeded closely by a UBX domain.
Changed!
PRK bchH 138aa 0.007
in ref transcript
magnesium chelatase subunit H; Provisional.
UEVLD
rs.UEVLD.F1 rs.UEVLD.R1 193 317
NCBIGene 36.3 55293
Single exon skipping, size difference: 124
Exclusion in the protein causing a frameshift
Reference transcript: NM_001040697
Changed!
cd LDH_1 289aa 1e-117
in ref transcript
A subgroup of L-lactate dehydrogenases. L-lactate dehydrogenases (LDH) are tetrameric enzymes catalyzing the last step of glycolysis in which pyruvate is converted to L-lactate. This subgroup is composed of eukaryotic LDHs. Vertebrate LDHs are non-allosteric. This is in contrast to some bacterial LDHs that are activated by an allosteric effector such as fructose-1,6-bisphosphate. LDHs are part of the NAD(P)-binding Rossmann fold superfamily, which includes a wide variety of protein families including the NAD(P)-binding domains of alcohol dehydrogenases, tyrosine-dependent oxidoreductases, glyceraldehyde-3-phosphate dehydrogenases, formate/glycerate dehydrogenases, siroheme synthases, 6-phosphogluconate dehydrogenases, aminoacid dehydrogenases, repressor rex, and NAD-binding potassium channel domains, among others.
Changed!
TIGR L-LDH-NAD 276aa 3e-52
in ref transcript
This model represents the NAD-dependent L-lactate dehydrogenases from bacteria and eukaryotes. This enzyme function as as the final step in anaerobic glycolysis. Although lactate dehydrogenases have in some cases been mistaken for malate dehydrogenases due to the similarity of these two substrates and the apparent ease with which evolution can toggle these activities, critical residues have been identified which can discriminate between the two activities. At the time of the creation of this model no hits above the trusted cutoff contained critical residues typical of malate dehydrogenases.
pfam UEV 120aa 4e-52
in ref transcript
UEV domain. This family includes the eukaryotic tumour susceptibility gene 101 protein (TSG101). Altered transcripts of this gene have been detected in sporadic breast cancers and many other human malignancies. However, the involvement of this gene in neoplastic transformation and tumorigenesis is still elusive. TSG101 is required for normal cell function of embryonic and adult tissues but that this gene is not a tumour suppressor for sporadic forms of breast cancer. This family is related to the ubiquitin conjugating enzymes.
Changed!
COG Mdh 288aa 1e-44
in ref transcript
Malate/lactate dehydrogenases [Energy production and conversion].
Changed!
cd LDH_1 182aa 1e-75
in modified transcript
Changed!
TIGR L-LDH-NAD 173aa 2e-31
in modified transcript
Changed!
COG Mdh 176aa 5e-29
in modified transcript
UMOD
rs.UMOD.F1 rs.UMOD.R1 107 170
NCBIGene 36.3 7369
Alternative 3-prime, size difference: 63
Inclusion in 5'UTR
Reference transcript: NM_001008389
cd EGF_CA 42aa 2e-04
in ref transcript
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
cd EGF_CA 36aa 4e-04
in ref transcript
smart ZP 250aa 1e-55
in ref transcript
Zona pellucida (ZP) domain. ZP proteins are responsible for sperm-adhesion fo the zona pellucida. ZP domains are also present in multidomain transmembrane proteins such as glycoprotein GP2, uromodulin and TGF-beta receptor type III (betaglycan).
pfam EGF_CA 39aa 1e-06
in ref transcript
Calcium binding EGF domain.
smart EGF_CA 42aa 1e-05
in ref transcript
Calcium-binding EGF-like domain.
UPF3B
rs.UPF3B.F1 rs.UPF3B.R1 176 215
NCBIGene 36.3 65109
Single exon skipping, size difference: 39
Exclusion in the protein (no frameshift)
Reference transcript: NM_080632
pfam Smg4_UPF3 163aa 1e-34
in ref transcript
Smg-4/UPF3 family. This family contains proteins that are involved in nonsense mediated mRNA decay. A process that is triggered by premature stop codons in mRNA. The family includes Smg-4 and UPF3.
URG4
rs.URG4.F1 rs.URG4.R1 100 127
NCBIGene 36.3 55665
Single exon skipping, size difference: 27
Exclusion in the protein (no frameshift)
Reference transcript: NM_001077663
COG COG2433 120aa 4e-05
in ref transcript
Uncharacterized conserved protein [Function unknown].
USP5
rs.USP5.F1 rs.USP5.R1 180 249
NCBIGene 36.3 8078
Alternative 5-prime, size difference: 69
Exclusion in the protein (no frameshift)
Reference transcript: NM_001098536
cd Peptidase_C19B 272aa 6e-91
in ref transcript
A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
cd Peptidase_C19B 58aa 8e-25
in ref transcript
cd UBA 34aa 1e-07
in ref transcript
Ubiquitin Associated domain. The UBA domain is a commonly occurring sequence motif in some members of the ubiquitination pathway, UV excision repair proteins, and certain protein kinases. Although its specific role is so far unknown, it has been suggested that UBA domains are involved in conferring protein target specificity. The domain, a compact three helix bundle, has a conserved GFP-loop and the proline is thought to be critical for binding. The UBA domain is distinct from the conserved three helical domain seen in the N-terminus of EF-TS and eukaryotic NAC proteins.
cd UBA 39aa 6e-06
in ref transcript
pfam UCH 290aa 1e-44
in ref transcript
Ubiquitin carboxyl-terminal hydrolase.
pfam zf-UBP 76aa 3e-20
in ref transcript
Zn-finger in ubiquitin-hydrolases and other protein.
pfam UCH 70aa 2e-15
in ref transcript
pfam UBA 35aa 5e-08
in ref transcript
UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to ubiquitin and the ubiquitination pathway. The structure of the UBA domain consists of a compact three helix bundle. This domain is found at the N terminus of EF-TS hence the name TS-N. The structure of EF-TS is known and this domain is implicated in its interaction with EF-TU. The domain has been found in non EF-TS proteins such as alpha-NAC and MJ0280.
pfam UBA 36aa 3e-05
in ref transcript
Changed!
COG UBP14 838aa 1e-101
in ref transcript
Isopeptidase T [Posttranslational modification, protein turnover, chaperones].
Changed!
COG UBP14 815aa 1e-105
in modified transcript
VCAM1
rs.VCAM1.F1 rs.VCAM1.R1 110 386
NCBIGene 36.3 7412
Single exon skipping, size difference: 276
Exclusion in the protein (no frameshift)
Reference transcript: NM_001078
cd IGcam 76aa 2e-08
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
cd IGcam 76aa 1e-07
in ref transcript
cd IGcam 86aa 1e-05
in ref transcript
Changed!
cd IGcam 86aa 4e-05
in ref transcript
cd IGcam 81aa 0.003
in ref transcript
pfam C2-set 89aa 7e-15
in ref transcript
Immunoglobulin C2-set domain.
pfam C2-set 89aa 6e-12
in ref transcript
smart IGc2 61aa 6e-12
in ref transcript
Immunoglobulin C-2 Type.
Changed!
pfam I-set 87aa 2e-11
in ref transcript
Immunoglobulin I-set domain.
pfam I-set 84aa 3e-11
in ref transcript
smart IG_like 79aa 3e-09
in ref transcript
Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
smart IG_like 78aa 9e-06
in ref transcript
VDR
rs.VDR.F1 rs.VDR.R1 118 240
NCBIGene 36.3 7421
Single exon skipping, size difference: 122
Exclusion in 5'UTR
Reference transcript: NM_001017535
cd NR_LBD_VDR 204aa 1e-117
in ref transcript
The ligand binding domain of vitamin D receptors, a member of the nuclear receptor superfamily. The ligand binding domain of vitamin D receptors (VDR): VDR is a member of the nuclear receptor (NR) superfamily that functions as classical endocrine receptors. VDR controls a wide range of biological activities including calcium metabolism, cell proliferation and differentiation, and immunomodulation. VDR is a high affinity receptor for the biologically most active Vitamin D metabolite, 1alpha,25-dihydroxyvitamin D3 (1alpha,25(OH)2D3). The binding of the ligand to the receptor induces a conformational change of the ligand binding domain (LBD) with consequent dissociation of corepressors. Upon ligand binding, VDR forms heterodimer with the retinoid X receptor (RXR) that binds to vitamin D response elements (VDREs), recruits coactivators. This leads to the expression of a large number of genes. Approximately 200 human genes are considered to be primary targets of VDR and even more genes are regulated indirectly. Like other members of the nuclear receptor (NR) superfamily of ligand-activated transcription factors, VDR has a central well conserved DNA binding domain (DBD), a variable N-terminal domain, a flexible hinge and a C-terminal ligand binding domain (LBD).
cd NR_LBD_VDR 36aa 8e-14
in ref transcript
pfam Hormone_recep 187aa 5e-34
in ref transcript
Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors.
pfam zf-C4 76aa 1e-33
in ref transcript
Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other.
VEGFA
rs.VEGFA.F1 rs.VEGFA.R1 106 141
NCBIGene 36.3 7422
Alternative 5-prime, size difference: 35
Exclusion in the protein causing a frameshift
Reference transcript: NM_001025366
cd PDGF 84aa 2e-24
in ref transcript
Platelet-derived and vascular endothelial growth factors (PDGF, VEGF) family domain; PDGF is a potent activator for cells of mesenchymal origin; PDGF-A and PDGF-B form AA and BB homodimers and an AB heterodimer; VEGF is a potent mitogen in embryonic and somatic angiogenesis with a unique specificity for vascular endothelial cells; VEGF forms homodimers and exists in 4 different isoforms; overall, the VEGF monomer resembles that of PDGF, but its N-terminal segment is helical rather than extended; the cysteine knot motif is a common feature of this domain.
smart PDGF 83aa 2e-33
in ref transcript
Platelet-derived and vascular endothelial growth factors (PDGF, VEGF) family. Platelet-derived growth factor is a potent activator for cells of mesenchymal origin. PDGF-A and PDGF-B form AA and BB homodimers and an AB heterodimer. Members of the VEGF family are homologues of PDGF.
VEGFA
rs.VEGFA.F2 rs.VEGFA.R2 338 404
NCBIGene 36.3 7422
Alternative 3-prime, size difference: 66
Exclusion of the stop codon
Reference transcript: NM_001025366
cd PDGF 84aa 2e-24
in ref transcript
Platelet-derived and vascular endothelial growth factors (PDGF, VEGF) family domain; PDGF is a potent activator for cells of mesenchymal origin; PDGF-A and PDGF-B form AA and BB homodimers and an AB heterodimer; VEGF is a potent mitogen in embryonic and somatic angiogenesis with a unique specificity for vascular endothelial cells; VEGF forms homodimers and exists in 4 different isoforms; overall, the VEGF monomer resembles that of PDGF, but its N-terminal segment is helical rather than extended; the cysteine knot motif is a common feature of this domain.
smart PDGF 83aa 2e-33
in ref transcript
Platelet-derived and vascular endothelial growth factors (PDGF, VEGF) family. Platelet-derived growth factor is a potent activator for cells of mesenchymal origin. PDGF-A and PDGF-B form AA and BB homodimers and an AB heterodimer. Members of the VEGF family are homologues of PDGF.
VEGFA
rs.VEGFA.F3 rs.VEGFA.R3 127 178
NCBIGene 36.3 7422
Alternative 5-prime, size difference: 51
Exclusion in the protein (no frameshift)
Reference transcript: NM_001025366
cd PDGF 84aa 2e-24
in ref transcript
Platelet-derived and vascular endothelial growth factors (PDGF, VEGF) family domain; PDGF is a potent activator for cells of mesenchymal origin; PDGF-A and PDGF-B form AA and BB homodimers and an AB heterodimer; VEGF is a potent mitogen in embryonic and somatic angiogenesis with a unique specificity for vascular endothelial cells; VEGF forms homodimers and exists in 4 different isoforms; overall, the VEGF monomer resembles that of PDGF, but its N-terminal segment is helical rather than extended; the cysteine knot motif is a common feature of this domain.
smart PDGF 83aa 2e-33
in ref transcript
Platelet-derived and vascular endothelial growth factors (PDGF, VEGF) family. Platelet-derived growth factor is a potent activator for cells of mesenchymal origin. PDGF-A and PDGF-B form AA and BB homodimers and an AB heterodimer. Members of the VEGF family are homologues of PDGF.
VHL
rs.VHL.F1 rs.VHL.R1 304 427
NCBIGene 36.3 7428
Single exon skipping, size difference: 123
Exclusion in the protein (no frameshift)
Reference transcript: NM_000551
Changed!
pfam VHL 156aa 2e-81
in ref transcript
von Hippel-Lindau disease tumour suppressor protein. VHL forms a ternary complex with the elonginB and elonginC proteins. This complex binds Cul2, which then is involved in regulation of vascular endothelial growth factor mRNA.
Changed!
pfam VHL 115aa 1e-49
in modified transcript
Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_017890
COG MRS6 199aa 7e-17
in ref transcript
Vacuolar protein sorting-associated protein [Intracellular trafficking and secretion].
COG MRS6 83aa 8e-11
in ref transcript
VPS16
rs.VPS16.F1 rs.VPS16.R1 104 536
NCBIGene 36.3 64601
Multiple exon skipping, size difference: 432
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift), Exclusion in the protein causing a frameshift
Reference transcript: NM_022575
Changed!
pfam Vps16_N 417aa 1e-167
in ref transcript
Vps16, N-terminal region. This protein forms part of the Class C vacuolar protein sorting (Vps) complex. Vps16 is essential for vacuolar protein sorting, which is essential for viability in plants, but not yeast. The Class C Vps complex is required for SNARE-mediated membrane fusion at the lysosome-like yeast vacuole. It is thought to play essential roles in membrane docking and fusion at the Golgi-to-endosome and endosome-to-vacuole stages of transport. The role of VPS16 in this complex is not known.
pfam Vps16_C 319aa 1e-115
in ref transcript
Vps16, C-terminal region. This protein forms part of the Class C vacuolar protein sorting (Vps) complex. Vps16 is essential for vacuolar protein sorting, which is essential for viability in plants, but not yeast. The Class C Vps complex is required for SNARE-mediated membrane fusion at the lysosome-like yeast vacuole. It is thought to play essential roles in membrane docking and fusion at the Golgi-to-endosome and endosome-to-vacuole stages of transport. The role of VPS16 in this complex is not known.
Changed!
pfam Vps16_N 298aa 1e-108
in modified transcript
VPS26A
rs.VPS26A.F1 rs.VPS26A.R1 133 276
NCBIGene 36.3 9559
Single exon skipping, size difference: 143
Exclusion in the protein causing a frameshift
Reference transcript: NM_004896
Changed!
pfam Vps26 276aa 1e-135
in ref transcript
Vacuolar protein sorting-associated protein 26. Vacuolar protein sorting-associated protein (Vps) 26 is one of around 50 proteins involved in protein trafficking. In particular, Vps26 assembles into a retromer complex with at least four other proteins Vps5, Vps17, Vps29 and Vps35. This family also contains Down syndrome critical region 3/A.
Changed!
pfam Vps26 236aa 1e-110
in modified transcript
VPS29
rs.VPS29.F1 rs.VPS29.R1 100 112
NCBIGene 36.3 51699
Single exon skipping, size difference: 12
Exclusion in the protein (no frameshift)
Reference transcript: NM_057180
Changed!
TIGR yfcE 139aa 5e-23
in ref transcript
Members of this largely uncharacterized family share a motif approximating DXH(X25)GDXXD(X25)GNHD as found in several phosphoesterases, including the nucleases SbcD and Mre11, and a family of uncharacterized archaeal putative phosphoesterases described by TIGR00024. In this family, the His residue in GNHD portion of the motif is not conserved. The member MJ0936, one of two from Methanococcus jannaschii, was shown (PMID:15128743) to act on model phosphodiesterase substrates; a divalent cation was required.
Changed!
COG COG0622 173aa 3e-24
in ref transcript
Predicted phosphoesterase [General function prediction only].
Changed!
TIGR yfcE 140aa 5e-24
in modified transcript
Changed!
COG COG0622 174aa 4e-25
in modified transcript
VSIG4
rs.VSIG4.F1 rs.VSIG4.R1 102 384
NCBIGene 36.3 11326
Single exon skipping, size difference: 282
Exclusion in the protein (no frameshift)
Reference transcript: NM_007268
Changed!
cd IGcam 70aa 1e-06
in ref transcript
Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
Changed!
pfam I-set 76aa 1e-06
in ref transcript
Immunoglobulin I-set domain.
pfam V-set 115aa 6e-05
in ref transcript
Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
WDR1
rs.WDR1.F1 rs.WDR1.R1 100 520
NCBIGene 36.3 9948
Multiple exon skipping, size difference: 420
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_017491
Changed!
cd WD40 340aa 3e-43
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Changed!
cd WD40 291aa 5e-25
in ref transcript
Changed!
smart WD40 40aa 4e-06
in ref transcript
WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
pfam WD40 32aa 5e-05
in ref transcript
WD domain, G-beta repeat.
Changed!
smart WD40 34aa 1e-04
in ref transcript
smart WD40 42aa 3e-04
in ref transcript
Changed!
pfam eIF2A 97aa 0.004
in ref transcript
Eukaryotic translation initiation factor eIF2A. This is a family of eukaryotic translation initiation factors.
Changed!
COG COG2319 427aa 1e-22
in ref transcript
FOG: WD40 repeat [General function prediction only].
Changed!
COG COG2319 223aa 1e-10
in ref transcript
Changed!
cd WD40 378aa 2e-31
in modified transcript
Changed!
cd WD40 78aa 1e-05
in modified transcript
Changed!
pfam WD40 39aa 1e-05
in modified transcript
Changed!
pfam WD40 33aa 2e-04
in modified transcript
Changed!
pfam eIF2A 89aa 0.006
in modified transcript
Changed!
COG COG2319 420aa 1e-21
in modified transcript
WDR16
rs.WDR16.F1 rs.WDR16.R1 198 402
NCBIGene 36.3 146845
Exon skipping and alternative 3-prime or 5-prime, size difference: 204
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_145054
cd WD40 285aa 9e-34
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Changed!
cd WD40 343aa 3e-16
in ref transcript
smart WD40 40aa 0.005
in ref transcript
WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
smart WD40 42aa 0.008
in ref transcript
COG COG2319 286aa 9e-21
in ref transcript
FOG: WD40 repeat [General function prediction only].
Changed!
COG COG2319 353aa 3e-14
in ref transcript
Changed!
cd WD40 343aa 5e-14
in modified transcript
Changed!
COG COG2319 315aa 8e-11
in modified transcript
WDR62
rs.WDR62.F1 rs.WDR62.R1 99 114
NCBIGene 36.3 284403
Alternative 3-prime, size difference: 15
Exclusion in the protein (no frameshift)
Reference transcript: NM_001083961
cd WD40 253aa 2e-25
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
cd WD40 138aa 8e-12
in ref transcript
cd WD40 200aa 2e-10
in ref transcript
cd WD40 294aa 1e-09
in ref transcript
smart WD40 40aa 0.001
in ref transcript
WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
COG COG2319 307aa 3e-18
in ref transcript
FOG: WD40 repeat [General function prediction only].
COG COG2319 350aa 5e-11
in ref transcript
WHSC1
rs.WHSC1.F1 rs.WHSC1.R1 172 352
NCBIGene 36.3 7468
Single exon skipping, size difference: 180
Exclusion in 5'UTR
Reference transcript: NM_133330
cd MSH6_like 120aa 2e-43
in ref transcript
The PWWP domain is present in MSH6, a mismatch repair protein homologous to bacterial MutS. The PWWP domain of histone-lysine N-methyltransferase, also known as Nuclear SET domain-containing protein 3, is also included. Mutations in MSH6 have been linked to increased cancer susceptibility, particularly in hereditary nonpolyposis colorectal cancer in humans. The role of the PWWP domain in MSH6 is not clear; MSH6 orthologs found in S. cerevisiae, Caenorhabditis elegans and Arabidopsis thaliana lack the PWWP domain. Histone methyltransferases (HMTases) induce the posttranslational methylation of lysine residues in histones and play a role in apoptosis. In the HMTase Whistle, the PWWP domain is necessary for HMTase activity. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
cd WHSC1_related 95aa 2e-40
in ref transcript
The PWWP domain was first identified in the WHSC1 (Wolf-Hirschhorn syndrome candidate 1) protein, a protein implicated in Wolf-Hirschhorn syndrome (WHS). When translocated, WHSC1 plays a role in lymphoid multiple myeloma (MM) disease, also known as plasmacytoma. WHCS1 proteins typically contain two copies of the PWWP domain. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
cd HMG-box 50aa 2e-06
in ref transcript
High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
pfam SET 126aa 1e-36
in ref transcript
SET domain. SET domains are protein lysine methyltransferase enzymes. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains have been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as SET-N and SET-C. SET-C forms an unusual and conserved knot-like structure of probably functional importance. Additionally to SET-N and SET-C, an insert region (SET-I) and flanking regions of high structural variability form part of the overall structure.
smart PWWP 63aa 3e-18
in ref transcript
domain with conserved PWWP motif. conservation of Pro-Trp-Trp-Pro residues.
smart AWS 51aa 2e-10
in ref transcript
associated with SET domains. subdomain of PRESET.
smart PWWP 63aa 2e-09
in ref transcript
smart HMG 63aa 1e-06
in ref transcript
high mobility group.
pfam PHD 43aa 2e-06
in ref transcript
PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
pfam PHD 45aa 0.004
in ref transcript
COG COG2940 245aa 5e-19
in ref transcript
Proteins containing SET domain [General function prediction only].
COG NHP6B 66aa 0.003
in ref transcript
Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics].
WISP1
rs.WISP1.F1 rs.WISP1.R1 125 386
NCBIGene 36.3 8840
Single exon skipping, size difference: 261
Exclusion in the protein (no frameshift)
Reference transcript: NM_003882
Changed!
pfam VWC 58aa 5e-04
in ref transcript
von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094.
pfam TSP_1 40aa 5e-04
in ref transcript
Thrombospondin type 1 domain.
WSB1
rs.WSB1.F1 rs.WSB1.R1 106 544
NCBIGene 36.3 26118
Multiple exon skipping, size difference: 438
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_015626
Changed!
cd WD40 244aa 9e-44
in ref transcript
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
cd SOCS_WSB1_SWIP1 40aa 7e-17
in ref transcript
SOCS (suppressors of cytokine signaling) box of WSB1/SWiP1-like proteins. This subfamily contains WSB-1 (SOCS-box-containing WD-40 protein), part of an E3 ubiquitin ligase for the thyroid-hormone-activating type 2 iodothyronine deiodinase (D2) and SWiP-1 (SOCS box and WD-repeats in Protein), a WD40-containing protein that is expressed in embryonic structures of chickens and regulated by Sonic Hedgehog (Shh). The general function of the SOCS box is the recruitment of the ubiquitin-transferase system. The SOCS box interacts with Elongins B and C, Cullin-5 or Cullin-2, Rbx-1, and E2. Therefore, SOCS-box-containing proteins probably function as E3 ubiquitin ligases and mediate the degradation of proteins associated through their N-terminal regions.
smart SOCS 43aa 2e-12
in ref transcript
suppressors of cytokine signalling. suppressors of cytokine signalling.
pfam WD40 36aa 7e-06
in ref transcript
WD domain, G-beta repeat.
Changed!
smart WD40 41aa 6e-05
in ref transcript
WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
smart WD40 40aa 7e-05
in ref transcript
pfam eIF2A 77aa 0.003
in ref transcript
Eukaryotic translation initiation factor eIF2A. This is a family of eukaryotic translation initiation factors.
Changed!
COG COG2319 345aa 2e-30
in ref transcript
FOG: WD40 repeat [General function prediction only].
Changed!
cd WD40 227aa 4e-38
in modified transcript
Changed!
pfam WD40 40aa 8e-05
in modified transcript
Changed!
COG COG2319 230aa 8e-26
in modified transcript
WT1
rs.WT1.F1 rs.WT1.R1 344 395
NCBIGene 36.3 7490
Single exon skipping, size difference: 51
Exclusion in the protein (no frameshift)
Reference transcript: NM_024426
Changed!
pfam WT1 321aa 1e-143
in ref transcript
Wilm's tumour protein.
COG COG5048 129aa 2e-05
in ref transcript
FOG: Zn-finger [General function prediction only].
Changed!
pfam WT1 304aa 1e-132
in modified transcript
XAGE1D
rs.XAGE1D.F1 rs.XAGE1D.R1 111 431
NCBIGene 36.3 9503
Alternative 5-prime, size difference: 320
Exclusion of the protein initiation site
Reference transcript: NM_133431
Changed!
pfam GAGE 27aa 0.003
in modified transcript
GAGE protein. This family consists of several GAGE and XAGE proteins which are found exclusively in humans. The function of this family is unknown although they have been implicated in human cancers.
XCR1
rs.XCR1.F1 rs.XCR1.R1 181 303
NCBIGene 36.3 2829
Single exon skipping, size difference: 122
Exclusion in 5'UTR
Reference transcript: NM_005283
pfam 7tm_1 234aa 2e-26
in ref transcript
7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
XPO7
rs.XPO7.F1 rs.XPO7.R1 191 218
NCBIGene 36.3 23039
Alternative 5-prime, size difference: 27
Exclusion in the protein (no frameshift)
Reference transcript: NM_001100161
pfam IBN_N 67aa 6e-08
in ref transcript
Importin-beta N-terminal domain.
Changed!
COG CRM1 286aa 2e-05
in ref transcript
Importin beta-related nuclear transport receptor [Nuclear structure / Intracellular trafficking and secretion].
Changed!
COG CRM1 277aa 1e-05
in modified transcript
XRCC3
rs.XRCC3.F1 rs.XRCC3.R1 103 140
NCBIGene 36.3 7517
Alternative 3-prime, size difference: 37
Inclusion in 5'UTR
Reference transcript: NM_005432
cd Rad51_DMC1_radA 257aa 2e-64
in ref transcript
Rad51_DMC1_radA,B. This group of recombinases includes the eukaryotic proteins RAD51, RAD55/57 and the meiosis-specific protein DMC1, and the archaeal proteins radA and radB. They are closely related to the bacterial RecA group. Rad51 proteins catalyze a similiar recombination reaction as RecA, using ATP-dependent DNA binding activity and a DNA-dependent ATPase. However, this reaction is less efficient and requires accessory proteins such as RAD55/57.
TIGR recomb_radA 258aa 2e-26
in ref transcript
This family consists exclusively of archaeal RadA protein, a homolog of bacterial RecA (TIGR02012), eukaryotic RAD51 (TIGR02239), and archaeal RadB (TIGR02237). This protein is involved in DNA repair and recombination. The member from Pyrococcus horikoshii contains an intein.
PRK radA 275aa 3e-27
in ref transcript
DNA repair and recombination protein RadA; Validated.
XRN1
rs.XRN1.F1 rs.XRN1.R1 187 226
NCBIGene 36.3 54464
Single exon skipping, size difference: 39
Exclusion in the protein (no frameshift)
Reference transcript: NM_019001
pfam XRN_N 228aa 1e-113
in ref transcript
XRN 5'-3' exonuclease N-terminus. This family aligns residues towards the N-terminus of several proteins with multiple functions. The members of this family all appear to possess 5'-3' exonuclease activity EC:3.1.11.-. Thus, the aligned region may be necessary for 5'->3' exonuclease function. The family also contains several Xrn1 and Xrn2 proteins. The 5'-3' exoribonucleases Xrn1p and Xrn2p/Rat1p function in the degradation and processing of several classes of RNA in Saccharomyces cerevisiae. Xrn1p is the main enzyme catalysing cytoplasmic mRNA degradation in multiple decay pathways, whereas Xrn2p/Rat1p functions in the processing of rRNAs and small nucleolar RNAs (snoRNAs) in the nucleus.
COG XRN1 396aa 1e-119
in ref transcript
5'-3' exonuclease [DNA replication, recombination, and repair / Cell division and chromosome partitioning / Translation].
COG XRN1 304aa 6e-66
in ref transcript
YIPF5
rs.YIPF5.F1 rs.YIPF5.R1 133 384
NCBIGene 36.3 81555
Alternative 5-prime, size difference: 251
Exclusion in 5'UTR
Reference transcript: NM_001024947
pfam Yip1 145aa 1e-32
in ref transcript
Yip1 domain. The Yip1 integral membrane domain contains four transmembrane alpha helices. The domain is characterised by the motifs DLYGP and GY. The Yip1 protein is a golgi protein involved in vesicular transport that interacts with GTPases.
COG YIP1 172aa 2e-36
in ref transcript
Rab GTPase interacting factor, Golgi membrane protein [Intracellular trafficking and secretion].
ZBTB40
rs.ZBTB40.F1 rs.ZBTB40.R1 110 398
NCBIGene 36.3 9923
Single exon skipping, size difference: 288
Exclusion in 5'UTR
Reference transcript: NM_001083621
pfam BTB 95aa 5e-17
in ref transcript
BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of pox virus proteins. The BTB/POZ domain mediates homomeric dimerisation and in some instances heteromeric dimerisation. The structure of the dimerised PLZF BTB/POZ domain has been solved and consists of a tightly intertwined homodimer. The central scaffolding of the protein is made up of a cluster of alpha-helices flanked by short beta-sheets at both the top and bottom of the molecule. POZ domains from several zinc finger proteins have been shown to mediate transcriptional repression and to interact with components of histone deacetylase co-repressor complexes including N-CoR and SMRT. The POZ or BTB domain is also known as BR-C/Ttk or ZiN.
FOG: Zn-finger [General function prediction only].
ZFAND5
rs.ZFAND5.F1 rs.ZFAND5.R1 263 400
NCBIGene 36.3 7763
Single exon skipping, size difference: 137
Exclusion in 5'UTR
Reference transcript: NM_001102420
pfam zf-AN1 40aa 7e-13
in ref transcript
AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis. The following pattern describes the zinc finger. C-X2-C-X(9-12)-C-X(1-2)-C-X4-C-X2-H-X5-H-X-C Where X can be any amino acid, and numbers in brackets indicate the number of residues.
smart ZnF_A20 25aa 0.001
in ref transcript
A20-like zinc fingers. A20- (an inhibitor of cell death)-like zinc fingers. The zinc finger mediates self-association in A20. These fingers also mediate IL-1-induced NF-kappaB activation.
ZFYVE16
rs.ZFYVE16.F1 rs.ZFYVE16.R1 238 366
NCBIGene 36.3 9765
Single exon skipping, size difference: 128
Exclusion in 5'UTR
Reference transcript: NM_001105251
cd FYVE 54aa 4e-17
in ref transcript
FYVE domain; Zinc-binding domain; targets proteins to membrane lipids via interaction with phosphatidylinositol-3-phosphate, PI3P; present in Fab1, YOTB, Vac1, and EEA1;.
pfam FYVE 65aa 2e-24
in ref transcript
FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positions. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. We have included members which do not conserve these histidine residues but are clearly related.
PTZ PTZ00303 93aa 0.003
in ref transcript
phosphatidylinositol kinase; Provisional.
ZFYVE27
rs.ZFYVE27.F1 rs.ZFYVE27.R1 117 138
NCBIGene 36.3 118813
Single exon skipping, size difference: 21
Exclusion in the protein (no frameshift)
Reference transcript: NM_001002261
cd FYVE 59aa 3e-07
in ref transcript
FYVE domain; Zinc-binding domain; targets proteins to membrane lipids via interaction with phosphatidylinositol-3-phosphate, PI3P; present in Fab1, YOTB, Vac1, and EEA1;.
smart FYVE 63aa 5e-09
in ref transcript
Protein present in Fab1, YOTB, Vac1, and EEA1. The FYVE zinc finger is named after four proteins where it was first found: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn2+ ions. The FYVE finger has eight potential zinc coordinating cysteine positions. The FYVE finger is structurally related to the PH D finger and the RI NG finger. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. The FYVE finger functions in the membrane recruitment of cytosolic proteins by binding to phosphatidylinositol 3-phosphate (PI3P), which is prominent on endosomes. The R+HHC+XCG motif is critical for PI3P binding.
ZFYVE27
rs.ZFYVE27.F2 rs.ZFYVE27.R2 107 122
NCBIGene 36.3 118813
Alternative 3-prime, size difference: 15
Exclusion in the protein (no frameshift)
Reference transcript: NM_001002261
cd FYVE 59aa 3e-07
in ref transcript
FYVE domain; Zinc-binding domain; targets proteins to membrane lipids via interaction with phosphatidylinositol-3-phosphate, PI3P; present in Fab1, YOTB, Vac1, and EEA1;.
smart FYVE 63aa 5e-09
in ref transcript
Protein present in Fab1, YOTB, Vac1, and EEA1. The FYVE zinc finger is named after four proteins where it was first found: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn2+ ions. The FYVE finger has eight potential zinc coordinating cysteine positions. The FYVE finger is structurally related to the PH D finger and the RI NG finger. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. The FYVE finger functions in the membrane recruitment of cytosolic proteins by binding to phosphatidylinositol 3-phosphate (PI3P), which is prominent on endosomes. The R+HHC+XCG motif is critical for PI3P binding.
ZGPAT
rs.ZGPAT.F1 rs.ZGPAT.R1 172 232
NCBIGene 36.3 84619
Alternative 3-prime, size difference: 60
Exclusion in the protein (no frameshift)
Reference transcript: NM_032527
cd TUDOR 31aa 0.005
in ref transcript
Tudor domains are found in many eukaryotic organisms and have been implicated in protein-protein interactions in which methylated protein substrates bind to these domains. For example, the Tudor domain of Survival of Motor Neuron (SMN) binds to symmetrically dimethylated arginines of arginine-glycine (RG) rich sequences found in the C-terminal tails of Sm proteins. The SMN protein is linked to spinal muscular atrophy. Another example is the tandem tudor domains of 53BP1, which bind to histone H4 specifically dimethylated at Lys20 (H4-K20me2). 53BP1 is a key transducer of the DNA damage checkpoint signal.
pfam G-patch 42aa 2e-08
in ref transcript
G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines.
smart ZnF_C3H1 22aa 6e-04
in ref transcript
zinc finger.
smart TUDOR 36aa 0.006
in ref transcript
Tudor domain. Domain of unknown function present in several RNA-binding proteins. 10 copies in the Drosophila Tudor protein. Initial proposal that the survival motor neuron gene product contain a Tudor domain are corroborated by more recent database search techniques such as PSI-BLAST (unpublished).
ZMYM5
rs.ZMYM5.F1 rs.ZMYM5.R1 150 436
NCBIGene 36.3 9205
Single exon skipping, size difference: 286
Exclusion in the protein causing a frameshift
Reference transcript: NM_001039650
Changed!
pfam zf-FCS 36aa 2e-06
in ref transcript
MYM-type Zinc finger with FCS sequence motif. MYM-type zinc fingers were identified in MYM family proteins. Human zinc finger protein 261 is involved in a chromosomal translocation and may be responsible for X-linked retardation in XQ13.1. Human zinc finger protein 198 is also involved in disease. In myeloproliferative disorders it is fused to FGF receptor 1; in atypical myeloproliferative disorders it is rearranged. Members of the family generally are involved in development. This Zn-finger domain functions as a transcriptional trans-activator of late vaccinia viral genes, and orthologues are also found in all nucleocytoplasmic large DNA viruses, NCLDV. This domain is also found fused to the C termini of recombinases from certain prokaryotic transposons.
ZNF133
rs.ZNF133.F1 rs.ZNF133.R1 138 550
NCBIGene 36.3 7692
Exon skipping and alternative 3-prime or 5-prime, size difference: 412
Inclusion in 5'UTR, Exclusion in 5'UTR, Exclusion in 5'UTR, Exclusion in 5'UTR
Reference transcript: NM_003434
smart KRAB 60aa 6e-28
in ref transcript
krueppel associated box.
COG COG5048 410aa 2e-11
in ref transcript
FOG: Zn-finger [General function prediction only].
ZNF142
rs.ZNF142.F1 rs.ZNF142.R1 284 406
NCBIGene 36.3 7701
Single exon skipping, size difference: 122
Inclusion in the protein causing a new stop codon
Reference transcript: NM_001105537
ZNF142
rs.ZNF142.F2 rs.ZNF142.R2 252 347
NCBIGene 36.3 7701
Alternative 3-prime, size difference: 95
Inclusion in 5'UTR
Reference transcript: NM_001105537
ZNF146
rs.ZNF146.F1 rs.ZNF146.R1 111 185
NCBIGene 36.3 7705
Single exon skipping, size difference: 74
Exclusion in 5'UTR
Reference transcript: NM_007145
COG COG5048 254aa 5e-04
in ref transcript
FOG: Zn-finger [General function prediction only].
ZNF155
rs.ZNF155.F1 rs.ZNF155.R1 193 270
NCBIGene 36.3 7711
Alternative 5-prime, size difference: 77
Exclusion in 5'UTR
Reference transcript: NM_003445
pfam KRAB 41aa 4e-19
in ref transcript
KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B. The A box plays an important role in repression by binding to corepressors, while the B box is thought to enhance this repression brought about by the A box. KRAB-containing proteins are thought to have critical functions in cell proliferation and differentiation, apoptosis and neoplastic transformation.
COG COG5048 285aa 5e-06
in ref transcript
FOG: Zn-finger [General function prediction only].
ZNF187
rs.ZNF187.F1 rs.ZNF187.R1 102 506
NCBIGene 36.3 7741
Alternative 5-prime, size difference: 404
Inclusion in the protein causing a new stop codon
Reference transcript: NM_152736
Changed!
COG COG5048 154aa 9e-04
in ref transcript
FOG: Zn-finger [General function prediction only].
Changed!
smart SCAN 27aa 1e-07
in modified transcript
leucine rich region.
ZNF205
rs.ZNF205.F1 rs.ZNF205.R1 181 249
NCBIGene 36.3 7755
Alternative 3-prime, size difference: 68
Inclusion in 5'UTR
Reference transcript: NM_001042428
pfam KRAB 39aa 4e-13
in ref transcript
KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B. The A box plays an important role in repression by binding to corepressors, while the B box is thought to enhance this repression brought about by the A box. KRAB-containing proteins are thought to have critical functions in cell proliferation and differentiation, apoptosis and neoplastic transformation.
pfam zf-C2H2 23aa 0.004
in ref transcript
Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.
pfam zf-C2H2 23aa 0.006
in ref transcript
COG COG5048 162aa 6e-08
in ref transcript
FOG: Zn-finger [General function prediction only].
COG COG5048 67aa 4e-04
in ref transcript
ZNF226
rs.ZNF226.F1 rs.ZNF226.R1 98 267
NCBIGene 36.3 7769
Alternative 3-prime, size difference: 169
Inclusion in 5'UTR
Reference transcript: NM_001032373
smart KRAB 60aa 7e-19
in ref transcript
krueppel associated box.
COG COG5048 407aa 4e-13
in ref transcript
FOG: Zn-finger [General function prediction only].
ZNF228
rs.ZNF228.F1 rs.ZNF228.R1 170 219
NCBIGene 36.3 7771
Single exon skipping, size difference: 49
Exclusion of the protein initiation site
Reference transcript: NM_001083335
smart KRAB 61aa 3e-20
in ref transcript
krueppel associated box.
COG COG5048 393aa 3e-05
in ref transcript
FOG: Zn-finger [General function prediction only].
COG COG5048 388aa 3e-04
in ref transcript
ZNF239
rs.ZNF239.F1 rs.ZNF239.R1 199 240
NCBIGene 36.3 8187
Single exon skipping, size difference: 41
Exclusion in 5'UTR
Reference transcript: NM_001099282
COG COG5048 201aa 0.003
in ref transcript
FOG: Zn-finger [General function prediction only].
ZNF273
rs.ZNF273.F1 rs.ZNF273.R1 185 274
NCBIGene 36.3 10793
Single exon skipping, size difference: 89
Inclusion in the protein causing a frameshift
Reference transcript: NM_021148
Changed!
smart KRAB 61aa 1e-26
in ref transcript
krueppel associated box.
Changed!
COG COG5048 369aa 9e-06
in ref transcript
FOG: Zn-finger [General function prediction only].
ZNF280D
rs.ZNF280D.F1 rs.ZNF280D.R1 406 475
NCBIGene 36.3 54816
Single exon skipping, size difference: 69
Exclusion of the protein initiation site
Reference transcript: NM_017661
ZNF295
rs.ZNF295.F1 rs.ZNF295.R1 108 173
NCBIGene 36.3 49854
Single exon skipping, size difference: 65
Exclusion in 5'UTR
Reference transcript: NM_001098402
pfam BTB 101aa 1e-21
in ref transcript
BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of pox virus proteins. The BTB/POZ domain mediates homomeric dimerisation and in some instances heteromeric dimerisation. The structure of the dimerised PLZF BTB/POZ domain has been solved and consists of a tightly intertwined homodimer. The central scaffolding of the protein is made up of a cluster of alpha-helices flanked by short beta-sheets at both the top and bottom of the molecule. POZ domains from several zinc finger proteins have been shown to mediate transcriptional repression and to interact with components of histone deacetylase co-repressor complexes including N-CoR and SMRT. The POZ or BTB domain is also known as BR-C/Ttk or ZiN.
ZNF30
rs.ZNF30.F1 rs.ZNF30.R1 238 304
NCBIGene 36.3 90075
Alternative 5-prime, size difference: 66
Exclusion in 5'UTR
Reference transcript: NM_001099438
smart KRAB 62aa 3e-23
in ref transcript
krueppel associated box.
COG COG5048 366aa 2e-10
in ref transcript
FOG: Zn-finger [General function prediction only].
COG COG5048 217aa 5e-05
in ref transcript
ZNF331
rs.ZNF331.F1 rs.ZNF331.R1 113 143
NCBIGene 36.3 55422
Single exon skipping, size difference: 30
Exclusion in 5'UTR
Reference transcript: NM_001079907
pfam KRAB 41aa 7e-21
in ref transcript
KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B. The A box plays an important role in repression by binding to corepressors, while the B box is thought to enhance this repression brought about by the A box. KRAB-containing proteins are thought to have critical functions in cell proliferation and differentiation, apoptosis and neoplastic transformation.
COG COG5048 315aa 1e-05
in ref transcript
FOG: Zn-finger [General function prediction only].
ZNF334
rs.ZNF334.F1 rs.ZNF334.R1 163 315
NCBIGene 36.3 55713
Single exon skipping, size difference: 152
Inclusion in the protein causing a new stop codon
Reference transcript: NM_018102
Changed!
smart KRAB 61aa 2e-29
in ref transcript
krueppel associated box.
Changed!
COG COG5048 441aa 2e-04
in ref transcript
FOG: Zn-finger [General function prediction only].
ZNF419
rs.ZNF419.F2 rs.ZNF419.R2 256 355
NCBIGene 36.3 79744
Exon skipping and alternative 3-prime or 5-prime, size difference: 99
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_001098491
Changed!
smart KRAB 61aa 8e-22
in ref transcript
krueppel associated box.
COG COG5048 156aa 1e-06
in ref transcript
FOG: Zn-finger [General function prediction only].
COG COG5048 331aa 2e-04
in ref transcript
Changed!
pfam KRAB 41aa 1e-18
in modified transcript
KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B. The A box plays an important role in repression by binding to corepressors, while the B box is thought to enhance this repression brought about by the A box. KRAB-containing proteins are thought to have critical functions in cell proliferation and differentiation, apoptosis and neoplastic transformation.
ZNF468
rs.ZNF468.F1 rs.ZNF468.R1 126 244
NCBIGene 36.3 90333
Single exon skipping, size difference: 118
Inclusion in the protein causing a frameshift
Reference transcript: NM_001008801
Changed!
pfam KRAB 41aa 2e-21
in ref transcript
KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B. The A box plays an important role in repression by binding to corepressors, while the B box is thought to enhance this repression brought about by the A box. KRAB-containing proteins are thought to have critical functions in cell proliferation and differentiation, apoptosis and neoplastic transformation.
Changed!
COG COG5048 196aa 4e-05
in ref transcript
FOG: Zn-finger [General function prediction only].
Changed!
smart KRAB 42aa 9e-20
in modified transcript
krueppel associated box.
ZNF493
rs.ZNF493.F1 rs.ZNF493.R1 117 340
NCBIGene 36.3 284443
Multiple exon skipping, size difference: 223
Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
Reference transcript: NM_001076678
Changed!
smart KRAB 61aa 2e-26
in ref transcript
krueppel associated box.
Changed!
COG COG5048 415aa 8e-12
in ref transcript
FOG: Zn-finger [General function prediction only].
Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.
COG COG5048 286aa 3e-05
in ref transcript
FOG: Zn-finger [General function prediction only].
ZNF557
rs.ZNF557.F2 rs.ZNF557.R2 119 140
NCBIGene 36.3 79230
Alternative 3-prime, size difference: 21
Exclusion in the protein (no frameshift)
Reference transcript: NM_001044387
smart KRAB 58aa 4e-28
in ref transcript
krueppel associated box.
pfam zf-C2H2 23aa 0.006
in ref transcript
Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.
COG COG5048 286aa 3e-05
in ref transcript
FOG: Zn-finger [General function prediction only].
ZNF586
rs.ZNF586.F1 rs.ZNF586.R1 181 308
NCBIGene 36.3 54807
Single exon skipping, size difference: 127
Exclusion in the protein causing a frameshift
Reference transcript: NM_017652
Changed!
pfam KRAB 41aa 3e-16
in ref transcript
KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B. The A box plays an important role in repression by binding to corepressors, while the B box is thought to enhance this repression brought about by the A box. KRAB-containing proteins are thought to have critical functions in cell proliferation and differentiation, apoptosis and neoplastic transformation.
Changed!
COG COG5048 154aa 2e-04
in ref transcript
FOG: Zn-finger [General function prediction only].
ZNF596
rs.ZNF596.F1 rs.ZNF596.R1 217 287
NCBIGene 36.3 169270
Alternative 5-prime, size difference: 70
Exclusion in 5'UTR
Reference transcript: NM_001042416
smart KRAB 59aa 2e-20
in ref transcript
krueppel associated box.
COG COG5048 142aa 2e-05
in ref transcript
FOG: Zn-finger [General function prediction only].
COG COG5048 398aa 8e-05
in ref transcript
ZNF613
rs.ZNF613.F1 rs.ZNF613.R1 330 438
NCBIGene 36.3 79898
Single exon skipping, size difference: 208
Exclusion of the protein initiation site
Reference transcript: NM_001031721
Changed!
smart KRAB 60aa 3e-27
in ref transcript
krueppel associated box.
COG COG5048 339aa 6e-04
in ref transcript
FOG: Zn-finger [General function prediction only].
Changed!
smart KRAB 31aa 5e-09
in modified transcript
Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only].
ZNF638
rs.ZNF638.F1 rs.ZNF638.R1 90 100
NCBIGene 36.3 27332
Alternative 5-prime, size difference: 10
Exclusion in 5'UTR
Reference transcript: NM_014497
cd RRM 70aa 3e-04
in ref transcript
RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
smart ZnF_U1 33aa 5e-05
in ref transcript
U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
TIGR hnRNP-L_PTB 68aa 7e-04
in ref transcript
Included in this family of heterogeneous ribonucleoproteins are PTB (polypyrimidine tract binding protein ) and hnRNP-L. These proteins contain four RNA recognition motifs (rrm: pfam00067).
TIGR hnRNP-L_PTB 67aa 0.003
in ref transcript
ZNF655
rs.ZNF655.F1 rs.ZNF655.R1 182 395
NCBIGene 36.3 79027
Multiple exon skipping, size difference: 213
Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
Reference transcript: NM_024061
Changed!
pfam KRAB 41aa 4e-19
in ref transcript
KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B. The A box plays an important role in repression by binding to corepressors, while the B box is thought to enhance this repression brought about by the A box. KRAB-containing proteins are thought to have critical functions in cell proliferation and differentiation, apoptosis and neoplastic transformation.
ZNF706
rs.ZNF706.F1 rs.ZNF706.R1 246 361
NCBIGene 36.3 51123
Single exon skipping, size difference: 115
Exclusion in 5'UTR
Reference transcript: NM_001042510
pfam DUF1909 33aa 9e-06
in ref transcript
Domain of unknown function (DUF1909). This domain is found in a set of hypothetical eukaryotic proteins.
ZNF707
rs.ZNF707.F1 rs.ZNF707.R1 169 243
NCBIGene 36.3 286075
Single exon skipping, size difference: 74
Exclusion in 5'UTR
Reference transcript: NM_173831
smart KRAB 61aa 6e-26
in ref transcript
krueppel associated box.
COG COG5048 162aa 1e-07
in ref transcript
FOG: Zn-finger [General function prediction only].
ZNF74
rs.ZNF74.F1 rs.ZNF74.R1 263 349
NCBIGene 36.3 7625
Single exon skipping, size difference: 86
Exclusion in the protein causing a frameshift
Reference transcript: NM_003426
Changed!
smart KRAB 61aa 1e-27
in ref transcript
krueppel associated box.
Changed!
COG COG5048 157aa 2e-05
in ref transcript
FOG: Zn-finger [General function prediction only].
ZNF807
rs.ZNF807.F1 rs.ZNF807.R1 235 289
NCBIGene 36.3 643719
Alternative 3-prime, size difference: 54
Inclusion in the protein causing a new stop codon
Reference transcript: XM_001718828
ZNF83
rs.ZNF83.F1 rs.ZNF83.R1 316 406
NCBIGene 36.3 55769
Single exon skipping, size difference: 90
Exclusion in 5'UTR
Reference transcript: NM_001105549
COG COG5048 391aa 8e-07
in ref transcript
FOG: Zn-finger [General function prediction only].
ZNF92
rs.ZNF92.F1 rs.ZNF92.R1 183 310
NCBIGene 36.3 168374
Single exon skipping, size difference: 127
Exclusion in the protein causing a frameshift
Reference transcript: NM_152626
Changed!
smart KRAB 61aa 2e-27
in ref transcript
krueppel associated box.
Changed!
COG COG5048 381aa 3e-08
in ref transcript
FOG: Zn-finger [General function prediction only].
ZWILCH
rs.ZWILCH.F1 rs.ZWILCH.R1 306 401
NCBIGene 36.3 55055
Alternative 3-prime, size difference: 95
Inclusion in the protein causing a new stop codon
Reference transcript: NM_017975
Changed!
pfam DUF2352 556aa 0.0
in ref transcript
Uncharacterized conserved protein (DUF2352). Members of this family of uncharacterised proteins have no known function.